GigaScience
De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
Robert B Norgren [1], Jacob D Madison [1], Mnirnal D Maudhoo [1]
[1] Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha 68198, Nebraska, USA
Keywords: Assembly; mRNA-seq; Transcriptome; Chimpanzee; Pan troglodytes
Others: 1172972; DOI: 10.1186/s13742-015-0061-x
Received 2014-12-19, accepted 2015-04-13, published 2015
【 Abstract 】
Background
Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, “Clint”, to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.
Findings
RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.
Conclusions
We have provided a high-quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represents a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.
【 License 】
2015 Maudhoo et al.; licensee BioMed Central.