期刊论文详细信息
BMC Genomics
Combining in silico prediction and ribosome profiling in a genome-wide search for novel putatively coding sORFs
Research Article
Walter Luyten1  Gerben Menschaert2  Jeroen Crappé2  Geert Trooskens2  Wim Van Criekinge2  Eisuke Hayakawa3  Geert Baggerman4 
[1] Department of Pharmaceutical and Pharmacological Sciences, Faculty of Medicine, KU Leuven, 3000, Leuven, Belgium;Lab of Bioinformatics and Computational Genomics (BioBix), Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, 9000, Ghent, Belgium;Research Group of Functional Genomics and Proteomics, KU Leuven, 3000, Leuven, Belgium;VITO Nv, 2400 Mol, Belgium – CFP, Center For Proteomics, 2020, Antwerpen, Belgium;
关键词: Micropeptide;    Small open reading frame;    Mus musculus;    Genome-wide;    Ribosome profiling;    LincRNA;    sORF;    ncRNA;    Bioactive peptide;   
DOI  :  10.1186/1471-2164-14-648
 received in 2013-01-23, accepted in 2013-09-13,  发布年份 2013
来源: Springer
PDF
【 摘 要 】

BackgroundIt was long assumed that proteins are at least 100 amino acids (AAs) long. Moreover, the detection of short translation products (e.g. coded from small Open Reading Frames, sORFs) is very difficult as the short length makes it hard to distinguish true coding ORFs from ORFs occurring by chance. Nevertheless, over the past few years many such non-canonical genes (with ORFs < 100 AAs) have been discovered in different organisms like Arabidopsis thaliana, Saccharomyces cerevisiae, and Drosophila melanogaster. Thanks to advances in sequencing, bioinformatics and computing power, it is now possible to scan the genome in unprecedented scrutiny, for example in a search of this type of small ORFs.ResultsUsing bioinformatics methods, we performed a systematic search for putatively functional sORFs in the Mus musculus genome. A genome-wide scan detected all sORFs which were subsequently analyzed for their coding potential, based on evolutionary conservation at the AA level, and ranked using a Support Vector Machine (SVM) learning model. The ranked sORFs are finally overlapped with ribosome profiling data, hinting to sORF translation. All candidates are visually inspected using an in-house developed genome browser. In this way dozens of highly conserved sORFs, targeted by ribosomes were identified in the mouse genome, putatively encoding micropeptides.ConclusionOur combined genome-wide approach leads to the prediction of a comprehensive but manageable set of putatively coding sORFs, a very important first step towards the identification of a new class of bioactive peptides, called micropeptides.

【 授权许可】

CC BY   
© Crappé et al.; licensee BioMed Central Ltd. 2013

【 预 览 】
附件列表
Files Size Format View
RO202311091449189ZK.pdf 1210KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  • [51]
  文献评价指标  
  下载次数:5次 浏览次数:2次