BMC Genomics
Transcriptome reconstruction and annotation of cynomolgus and African green monkey
Raul Rabadan [2], Gustavo Palacios [3], Travis Warren [4], Guo-Yun Yu [1], Elyse Nagle [1], Oliver Elliott [2], Jeffrey Kugelman [1], Hossein Khiabanian [2], Albert Lee [2]
[1] Genomics Division, the U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA
[2] Department of Systems Biology, Columbia University College of Physicians and Surgeons, New York, NY 10032, USA
[3] National Center for Biodefense and Infectious Disease, George Mason University, Manassas, VA 20110, USA
[4] Molecular and Translational Sciences Divisions, the U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA
Keywords: Database; Annotation; Genomics; Transcriptome; RNA-seq; Chlorocebus aethiops; African green monkey; Macaca fascicularis; Cynomolgus macaque
DOI: 10.1186/1471-2164-15-846
Received 2014-04-17, accepted 2014-09-25, published 2014
【 Abstract 】
Background
Non-human primates (NHPs) and humans share major biological mechanisms, functions, and responses due to their close evolutionary relationship and, as such, provide ideal animal models to study human diseases. RNA expression in NHPs provides specific signatures that are informative of disease mechanisms and therapeutic modes of action. Unlike the human transcriptome, the transcriptomes of major NHP animal models are yet to be comprehensively annotated.
Results
Using deep RNA sequencing of seven tissue samples, we characterize the transcriptomes of two commonly used NHP animal models: cynomolgus macaque (Macaca fascicularis) and African green monkey (Chlorocebus aethiops). We present the Multi-Species Annotation (MSA) pipeline, which leverages well-annotated primate species and annotates 99.8% of reconstructed transcripts. We elucidate tissue-specific expression profiles and report 13 experimentally validated novel transcripts in these NHP animal models.
Conclusion
We report comprehensively annotated transcriptomes of two non-human primates, which we have made publicly available on a customized UCSC Genome Browser interface. The MSA pipeline is also freely available.
【 License 】
2014 Lee et al.; licensee BioMed Central Ltd.