期刊论文详细信息
BMC Bioinformatics
An integrative computational approach to effectively guide experimental identification of regulatory elements in promoters
Igor V Deyneko1  Siegfried Weiss1  Sara Leschner1 
[1] Molecular Immunology, Helmholtz Centre for Infection Research, Inhoffenstr. 7, 38124, Braunschweig, Germany
关键词: Transcriptional regulation;    Cis-regulatory modules;    DNA motifs;   
Others  :  1088168
DOI  :  10.1186/1471-2105-13-202
 received in 2012-01-23, accepted in 2012-08-01,  发布年份 2012
PDF
【 摘 要 】

Background

Transcriptional activity of genes depends on many factors like DNA motifs, conformational characteristics of DNA, melting etc. and there are computational approaches for their identification. However, in real applications, the number of predicted, for example, DNA motifs may be considerably large. In cases when various computational programs are applied, systematic experimental knock out of each of the potential elements obviously becomes nonproductive. Hence, one needs an approach that is able to integrate many heterogeneous computational methods and upon that suggest selected regulatory elements for experimental verification.

Results

Here, we present an integrative bioinformatic approach aimed at the discovery of regulatory modules that can be effectively verified experimentally. It is based on combinatorial analysis of known and novel binding motifs, as well as of any other known features of promoters. The goal of this method is the identification of a collection of modules that are specific for an established dataset and at the same time are optimal for experimental verification. The method is particularly effective on small datasets, where most statistical approaches fail. We apply it to promoters that drive tumor-specific gene expression in tumor-colonizing Gram-negative bacteria. The method successfully identified a number of potential modules, which required only a few experiments to be verified. The resulting minimal functional bacterial promoter exhibited high specificity of expression in cancerous tissue.

Conclusions

Experimental analysis of promoter structures guided by bioinformatics has proved to be efficient. The developed computational method is able to include heterogeneous features of promoters and suggest combinatorial modules for experimental testing. Expansibility and robustness of the methodology implemented in the approach ensures good results for a wide range of problems.

【 授权许可】

   
2012 Deyneko et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150117082435177.pdf 473KB PDF download
Figure 2. 83KB Image download
Figure 1. 47KB Image download
【 图 表 】

Figure 1.

Figure 2.

【 参考文献 】
  • [1]Sandve GK, Drablos F: A survey of motif discovery methods in an integrated framework. Biol Direct 2006, 1:11. BioMed Central Full Text
  • [2]Van Loo P, Marynen P: Computational methods for the detection of cis-regulatory modules. Brief Bioinform 2009, 10:509-524.
  • [3]Klepper K, Sandve GK, Abul O, Johansen J, Drablos F: Assessment of composite motif discovery methods. BMC Bioinformatics 2008, 9:123. BioMed Central Full Text
  • [4]Su J, Teichmann SA, Down TA: Assessing computational methods of cis-regulatory module prediction. PLoS Comput Biol 2010, 6:e1001020.
  • [5]Tompa M, Li N, Bailey TL, Church GM, De Moor B, Eskin E, Favorov AV, Frith MC, Fu Y, Kent WJ, et al.: Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol 2005, 23:137-144.
  • [6]Leschner S, Deyneko IV, Lienenklaus S, Wolf K, Bloecker H, Bumann D, Loessner H, Weiss S: Identification of tumor-specific Salmonella Typhimurium promoters and their regulatory logic. Nucleic Acids Res 2012, 40:2984-2994.
  • [7]Olekhnovich IN, Kadner RJ: Crucial roles of both flanking sequences in silencing of the hilA promoter in Salmonella enterica. J Mol Biol 2006, 357:373-386.
  • [8]Leschner S, Weiss S: Salmonella-allies in the fight against cancer. J Mol Med 2010, 88:763-773.
  • [9]Robison K, McGuire AM, Church GM: A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome. J Mol Biol 1998, 284:241-254.
  • [10]Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, et al.: TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 2006, 34:D108-110.
  • [11]Portales-Casamar E, Thongjuea S, Kwon AT, Arenillas D, Zhao X, Valen E, Yusuf D, Lenhard B, Wasserman WW, Sandelin A: JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles. Nucleic Acids Res 2010, 38:D105-110.
  • [12]Bailey TL, Elkan C: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 1994, 2:28-36.
  • [13]Smith AD, Sumazin P, Zhang MQ: Identifying tissue-selective transcription factor binding sites in vertebrate promoters. Proc Natl Acad Sci U S A 2005, 102:1560-1565.
  • [14]Mason MJ, Plath K, Zhou Q: Identification of context-dependent motifs by contrasting ChIP binding data. Bioinformatics 2010, 26:2826-2832.
  • [15]Liu XS, Brutlag DL, Liu JS: An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments. Nat Biotechnol 2002, 20:835-839.
  • [16]Wang H, Benham CJ: Superhelical destabilization in regulatory regions of stress response genes. PLoS Comput Biol 2008, 4:e17.
  • [17]Gordon L, Chervonenkis AY, Gammerman AJ, Shahmuradov IA, Solovyev VV: Sequence alignment kernel for recognition of promoter regions. Bioinformatics 2003, 19:1964-1971.
  • [18]Zomer AL, Buist G, Larsen R, Kok J, Kuipers OP: Time-resolved determination of the CcpA regulon of Lactococcus lactis subsp. cremoris MG1363. J Bacteriol 2007, 189:1366-1381.
  • [19]Kel A, Konovalova T, Waleev T, Cheremushkin E, Kel-Margoulis O, Wingender E: Composite Module Analyst: a fitness-based tool for identification of transcription factor binding site combinations. Bioinformatics 2006, 22:1190-1197.
  • [20]Green J, Trageser M, Six S, Unden G, Guest JR: Characterization of the FNR protein of Escherichia coli, an iron-binding transcriptional regulator. Proc Biol Sci 1991, 244:137-144.
  • [21]Yang Y, Hwang CK, D’Souza UM, Lee SH, Junn E, Mouradian MM: Three-amino acid extension loop homeodomain proteins Meis2 and TGIF differentially regulate transcription. J Biol Chem 2000, 275:20734-20741.
  • [22]Pennetier C, Dominguez-Ramirez L, Plumbridge J: Different regions of Mlc and NagC, homologous transcriptional repressors controlling expression of the glucose and N-acetylglucosamine phosphotransferase systems in Escherichia coli, are required for inducer signal recognition. Mol Microbiol 2008, 67:364-377.
  • [23]Melhuish TA, Gallo CM, Wotton D: TGIF2 interacts with histone deacetylase 1 and represses transcription. J Biol Chem 2001, 276:32109-32114.
  • [24]Kolchanov NA, Merkulova TI, Ignatieva EV, Ananko EA, Oshchepkov DY, Levitsky VG, Vasiliev GV, Klimova NV, Merkulov VM, Charles Hodgman T: Combined experimental and computational approaches to study the regulatory elements in eukaryotic genes. Brief Bioinform 2007, 8:266-274.
  • [25]Aerts S, Van Loo P, Thijs G, Moreau Y, De Moor B: Computational detection of cis-regulatory modules. Bioinformatics 2003, 19 Suppl 2:ii5-14.
  • [26]Sharan R, Ovcharenko I, Ben-Hur A, Karp RM: CREME: a framework for identifying cis-regulatory modules in human-mouse conserved segments. Bioinformatics 2003, 19(Suppl 1):i283-291.
  • [27]Pape UJ, Rahmann S, Vingron M: Natural similarity measures between position frequency matrices with an application to clustering. Bioinformatics 2008, 24:350-357.
  • [28]Deyneko IV, Kalybaeva YM, Kel AE, Blöcker H: Human-chimpanzee promoter comparisons: property-conserved evolution? Genomics 2010, 96:129-133.
  • [29]Kel AE, Gossling E, Reuter I, Cheremushkin E, Kel-Margoulis OV, Wingender E: MATCH: A tool for searching transcription factor binding sites in DNA sequences. Nucleic Acids Res 2003, 31:3576-3579.
  • [30]Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 2000, 16:276-277.
  • [31]Bumann D, Valdivia RH: Identification of host-induced pathogen genes by differential fluorescence induction reporter systems. Nat Protoc 2007, 2:770-777.
  • [32]Bumann D: Examination of Salmonella gene expression in an infected mammalian host using the green fluorescent protein and two-colour flow cytometry. Mol Microbiol 2002, 43:1269-1283.
  文献评价指标  
  下载次数:15次 浏览次数:5次