Journal article details
BMC Bioinformatics
Computational discovery and annotation of conserved small open reading frames in fungal genomes
Affiliations:
0000 0004 1937 1557, grid.412113.4, Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia;
0000 0004 1937 1557, grid.412113.4, Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia;
grid.452569.9, Malaysia Genome Institute, Ministry of Science, Technology & Innovation, Jalan Bangi, 43000, Kajang, Selangor, Malaysia
Keywords: Small open reading frames; sORFs; smORF; Conserved; Fungal
DOI: 10.1186/s12859-018-2550-2
Source: publisher
【 Abstract 】

Background: Small open reading frames (smORFs/sORFs) that encode short protein sequences are often overlooked during standard gene prediction, which leaves many sORFs undiscovered or misannotated. For many genomes, a second round of sORF-targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding 80 amino acid residues or fewer in 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes.

Results: A first set of sORFs was identified from existing annotations that met the criterion of at most 80 residues. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or fewer in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.

Conclusions: Numerous open reading frames that could potentially encode polypeptides of 80 amino acid residues or fewer are overlooked during standard gene prediction and annotation. Our results show that additional targeted reannotation of genomes can complement standard genome annotation to identify sORFs. Because experimental validation is scarce and has its own limitations, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
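
The length criterion described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline, which the abstract does not specify. It assumes the standard genetic code, ATG as the only start codon, and a simple six-frame scan; the names `find_sorfs`, `max_codons` and `min_codons` are hypothetical. The conservation analysis across genomes is not covered here.

```python
# Illustrative six-frame scan for small ORFs (not the paper's actual method).
# Assumptions: standard stop codons, ATG start only, plain A/C/G/T input.
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_sorfs(seq, max_codons=80, min_codons=2):
    """Return (strand, start, end, orf_sequence) for each small ORF.

    An ORF runs from the first ATG in a frame to the next in-frame stop.
    Its length counts coding codons only (the stop codon is excluded),
    so max_codons=80 corresponds to a peptide of at most 80 residues.
    Coordinates are 0-based, end-exclusive, and relative to the strand
    that was scanned (for "-", the reverse-complemented sequence).
    """
    seq = seq.upper()
    results = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if min_codons <= n_codons <= max_codons:
                        results.append((strand, start, i + 3, s[start:i + 3]))
                    start = None
    return results
```

Calling `find_sorfs` on a genomic region returns only candidates at or below the 80-codon ceiling; longer ORFs are left to standard gene prediction. A real workflow would also need to handle ambiguous bases, alternative start codons, and overlap with annotated genes.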

【 License 】

CC BY   
