Genome Biology
Web Apollo: a web-based genomic annotation editing platform
Suzanna E Lewis [4], Christine G Elsik [2], Ian H Holmes [1], Lincoln Stein [3], Robert M Buels [1], Chris P Childers [5], Monica C Munoz-Torres [4], Justin T Reese [5], Gregg A Helt [4], Eduardo Lee [4]
[1] Department of Bioengineering, University of California Berkeley, Berkeley, CA 94720, USA
[2] Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
[3] Department of Molecular Genetics, University of Toronto, 172 St. George Street Toronto, Ontario, Canada M5R 0A3
[4] Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
[5] Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
Keywords: EDITOR; COLLABORATIVE; GENOME
Others: 864020; DOI: 10.1186/gb-2013-14-8-r93
Received 2013-05-10, accepted 2013-08-30, published 2013
【 Abstract 】
Web Apollo is the first instantaneous, collaborative genomic annotation editor available on the web. A natural consequence of current advances in sequencing technology is that more and more researchers are sequencing new genomes. These researchers require tools to describe the functional features of their newly sequenced genomes. With Web Apollo, researchers can use any of the common browsers (for example, Chrome or Firefox) to jointly analyze and precisely describe the features of a genome in real time, whether they are in the same room or working from opposite sides of the world.
【 License 】
2013 Lee et al.; licensee BioMed Central Ltd.