期刊论文详细信息
BMC Genomics
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
Methodology Article
Yu Kang1  Anne Fennell2  Qin Ma2  Qi Liu3  Guanghui Wang4  Guojun Li4  Chuan Zhou4  Bingqiang Liu4  Hanyuan Zhang5 
[1] CAS Key Laboratory of Genome Sciences and information, Beijing Institute of Genomics of CAS, 100101, Beijing, People’s Republic of China;Department of Agronomy, Horticulture, and Plant Science, South Dakota State University, 57007, Brookings, SD, USA;BioSNTR, Brookings, SD, USA;Department of Bioinformatics, School of Life Sciences and Technology, Tongji University, Shanghai, China;School of Mathematics, Shandong University, 250100, Jinan, China;Systems Biology and Biomedical Informatics (SBBI) Laboratory University of Nebraska-Lincoln, 68588-0115, Lincoln, NE, USA;
关键词: Cis;    Phylogenetic footprinting;    Prokaryotic genomes;    Comparative genomics;   
DOI  :  10.1186/s12864-016-2982-x
 received in 2016-04-05, accepted in 2016-07-29,  发布年份 2016
来源: Springer
PDF
【 摘 要 】

BackgroundPhylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction.ResultsHere we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP3). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP3 consistently outperformed other popular motif finding tools. We have integrated MP3 into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes.ConclusionThe performance evaluation indicated that MP3 is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular.

【 授权许可】

CC BY   
© The Author(s). 2016

【 预 览 】
附件列表
Files Size Format View
RO202311090059811ZK.pdf 2024KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  文献评价指标  
  下载次数:2次 浏览次数:0次