期刊论文详细信息
BMC Genomics
Open-access synthetic spike-in mRNA-seq data for cancer gene fusions
John D Carpten1  David W Craig1  Timothy K McDaniel2  Shukmei Wong1  Valerie Montel2  Nancy E Kim2  Winnie S Liang1  Han-Yu Chuang2  Christophe Legendre1  Stephanie JK Pond2  Waibhav D Tembe1 
[1] Translational Genomics Research Institute (TGen), 445 N 5th Street, SUITE 600, Phoenix, AZ 85004, USA;Illumina, Inc, San Diego, CA, USA
关键词: Cancer genomics;    Gene fusions;    RNA-seq;   
Others  :  1139494
DOI  :  10.1186/1471-2164-15-824
 received in 2014-04-21, accepted in 2014-09-24,  发布年份 2014
PDF
【 摘 要 】

Background

Oncogenic fusion genes underlie the mechanism of several common cancers. Next-generation sequencing based RNA-seq analyses have revealed an increasing number of recurrent fusions in a variety of cancers. However, absence of a publicly available gene-fusion focused RNA-seq data impedes comparative assessment and collaborative development of novel gene fusions detection algorithms. We have generated nine synthetic poly-adenylated RNA transcripts that correspond to previously reported oncogenic gene fusions. These synthetic RNAs were spiked at known molarity over a wide range into total RNA prior to construction of next-generation sequencing mRNA libraries to generate RNA-seq data.

Results

Leveraging a priori knowledge about replicates and molarity of each synthetic fusion transcript, we demonstrate utility of this dataset to compare multiple gene fusion algorithms’ detection ability. In general, more fusions are detected at higher molarity, indicating that our constructs performed as expected. However, systematic detection differences are observed based on molarity or algorithm-specific characteristics. Fusion-sequence specific detection differences indicate that for applications where specific sequences are being investigated, additional constructs may be added to provide quantitative data that is specific for the sequence of interest.

Conclusions

To our knowledge, this is the first publicly available synthetic RNA-seq data that specifically leverages known cancer gene-fusions. The proposed method of designing multiple gene-fusion constructs over a wide range of molarity allows granular performance analyses of multiple fusion-detection algorithms. The community can leverage and augment this publicly available data to further collaborative development of analytical tools and performance assessment frameworks for gene fusions from next-generation sequencing data.

【 授权许可】

   
2014 Tembe et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150321135825569.pdf 3265KB PDF download
Figure 9. 110KB Image download
Figure 8. 93KB Image download
Figure 7. 51KB Image download
Figure 6. 92KB Image download
Figure 5. 105KB Image download
Figure 4. 56KB Image download
Figure 3. 87KB Image download
Figure 2. 62KB Image download
Figure 1. 74KB Image download
【 图 表 】

Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Figure 9.

【 参考文献 】
  • [1]Villanueva MT: Genetics: gene fusion power. Nat Rev Clin Oncol 2012, 9:188.
  • [2]Goldman JM, Melo JV: Chronic myeloid leukemia–advances in biology and new approaches to treatment. N Engl J Med 2003, 349:1451-1464.
  • [3]Saglio G, Morotti A, Mattioli G, Messa E, Giugliano E, Volpe G, Rege-Cambrin G, Cilloni D: Rational approaches to the design of therapeutics targeting molecular markers: the case of chronic myelogenous leukemia. Ann N Y Acad Sci 2004, 1028:423-431.
  • [4]Kakizuka A, Miller W, Umesono K, Warrel R, Franekl S, Murty V, Dmitrovsky E, Evans R: Chromosomal translocation t(15;17) in human acute promyelocytic leukemia fuses RAR alpha with a novel putative transcription factor, PML. Cell 1991, 66:663-674.
  • [5]Huang ME, Ye YC, Chen SR, Cai JR, Lu JX, Zhoa L, Gu LJ, Wang ZY: Use of all-trans retinoic acid in the treatment of acute promyelocytic leukemia. Blood 1988, 72:567-572.
  • [6]Castaigne S, Chomienne C, Daniel M, Ballerini P, Berger R, Fenaux P, Degos L: All-trans retinoic acid as a differentiation therapy for acute promyelocytic leukemia: I. Clinical results. Blood 1990, 76:1704-1709.
  • [7]Gerber DE, Minna JD: ALK inhibition for non-small cell lung cancer: from discovery to therapy in record time. Cancer Cell 2010, 18:548-551.
  • [8]Ou SH, Bazhenova L, Camidge DR, Solomon BJ, Herman J, Kain T, Bang YJ, Kwak EL, Shaw AT, Salgia R, Maki RG, Clark JW, Wilner KD, Iafrate AJ: Rapid and dramatic radiographic and clinical response to an ALK inhibitor (crizotinib, PF02341066) in an ALK translocation-positive patient with non-small cell lung cancer. J Thorac Oncol 2010, 5:2044-2046.
  • [9]Kwak EL, Bang YJ, Camidge DR, Shaw AT, Solomon B, Maki RG, Ou SH, Dezube BJ, Jänne PA, Costa DB, Varella-Garcia M, Kim WH, Lynch TJ, Fidias P, Stubbs H, Engelman JA, Sequist LV, Tan W, Gandhi L, Mino-Kenudson M, Wei GC, Shreeve SM, Ratain MJ, Settleman J, Christensen JG, Haber DA, Wilner K, Salgia R, Shapiro GI, Clark JW, et al.: Anaplastic lymphoma kinase inhibition in non-small-cell lung cancer. N Engl J Med 2010, 363:1693-1703.
  • [10]Pleasance ED, Cheetham RK, Stephens PJ, McBride DJ, Humphray SJ, Greenman CD, Varela I, Lin ML, Ordóñez GR, Bignell GR, Ye K, Alipaz J, Bauer MJ, Beare D, Butler A, Carter RJ, Chen L, Cox AJ, Edkins S, Kokko-Gonzales PI, Gormley NA, Grocock RJ, Haudenschild CD, Hims MM, James T, Jia M, Kingsbury Z, Leroy C, Marshall J, Menzies A, et al.: A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 2010, 463:191-196.
  • [11]Iyer MK, Chinnaiyan AM, Maher CA: ChimeraScan: a tool for identifying chimeric transcription in sequencing data. Bioinformatics 2011, 27:2903-2904.
  • [12]Kim D, Salzberg SL: TopHat-fusion: an algorithm for discovery of novel fusion transcripts. Genome Biol 2011, 12:R72. BioMed Central Full Text
  • [13]Asmann YW, Hossain A, Necela BM, Middha S, Kalari KR, Sun Z, Chai HS, Williamson DW, Radisky D, Schroth GP, Kocher JP, Perez EA, Thompson EA: A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines. Nucleic Acids Res 2011, 39:e100.
  • [14]Carrara M, Beccuti M, Lazzarato F, Cavallo F, Cordero F, Donatelli S, Calogero RA: State-of-the-art fusion-finder algorithms sensitivity and specificity. Biomed Res Int 2013, 2013:340620.
  • [15]Wang Q, Xia J, Jia P, Pao W, Zhao Z: Application of next generation sequencing to human gene fusion detection: computational tools, features and perspectives. Brief Bioinform 2013, 14:506-519.
  • [16]Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods 2012, 9:357-359.
  • [17]Li H, Durbin R: Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26:589-595.
  • [18]Wu TD, Nacu S: Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics 2010, 26:873-881.
  文献评价指标  
  下载次数:46次 浏览次数:9次