| BMC Systems Biology | |
| New network topology approaches reveal differential correlation patterns in breast cancer | |
| Jan Budczies1  Carsten Denkert1  Balazs Györffy2  Frederick Klauschen1  Michael Bockmayr1  | |
| [1] Institute for Pathology, Charité University Hospital Berlin, Charitéplatz 1, 10117 Berlin, Germany;Joint Research Laboratory of the Hungarian Academy of Sciences and the Semmelweis University, Semmelweis University 1st Dept of Pediatrics, 1083 Budapest, Hungary | |
| 关键词: Differential co-expression; Molecular subtypes; Breast cancer; Microarray data; Differential correlation; | |
| Others : 1142347 DOI : 10.1186/1752-0509-7-78 |
|
| received in 2013-04-15, accepted in 2013-08-06, 发布年份 2013 | |
PDF
|
|
【 摘 要 】
Background
Analysis of genome-wide data is often carried out using standard methods such as differential expression analysis, clustering analysis and heatmaps. Beyond that, differential correlation analysis was suggested to identify changes in the correlation patterns between disease states. The detection of differential correlation is a demanding task, as the number of entries in the gene-by-gene correlation matrix is large. Currently, there is no gold standard for the detection of differential correlation and statistical validation.
Results
We developed two untargeted algorithms (DCloc and DCglob) that identify differential correlation patterns by comparing the local or global topology of correlation networks. Construction of networks from correlation structures requires fixing of a correlation threshold. Instead of a single cutoff, the algorithms systematically investigate a series of correlation thresholds and permit to detect different kinds of correlation changes at the same level of significance: strong changes of a few genes and moderate changes of many genes. Comparing the correlation structure of 208 ER- breast carcinomas and 208 ER+ breast carcinomas, DCloc detected 770 differentially correlated genes with a FDR of 12.8%, while DCglob detected 630 differentially correlated genes with a FDR of 12.1%. In two-fold cross-validation, the reproducibility of the list of the top 5% differentially correlated genes in 140 ER- tumors and in 140 ER+ tumors was 49% for DCloc and 33% for DCglob.
Conclusions
We developed two correlation network topology based algorithms for the detection of differential correlations in different disease states. Clusters of differentially correlated genes could be interpreted biologically and included the marker genes hydroxyprostaglandin dehydrogenase (PGDH) and acyl-CoA synthetase medium chain 1 (ACSM1) of invasive apocrine carcinomas that were differentially correlated, but not differentially expressed. Using random subsampling and cross-validation, DCloc and DCglob were shown to identify specific and reproducible lists of differentially correlated genes.
【 授权许可】
2013 Bockmayr et al.; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150328035143596.pdf | 3718KB | ||
| Figure 5. | 68KB | Image | |
| Figure 4. | 101KB | Image | |
| Figure 3. | 225KB | Image | |
| Figure 2. | 26KB | Image | |
| Figure 1. | 109KB | Image |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
【 参考文献 】
- [1]Allison D, Cui X, Page G, Sabripour M: Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet 2006, 7:55-65.
- [2]Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Nat Acad Sci 1998, 95(25):14863-14868.
- [3]Sørlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, Hastie T, Eisen MB, van de Rijn M, Jeffrey SS, Thorsen T, Quist H, Matese JC, Brown PO, Botstein D, Lønning PE, Børresen-Dale AL: Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Nat Acad Sci 2001, 98(19):10869-10874.
- [4]Li KC: Genome-wide coexpression dynamics: Theory and application. Proc Nat Acad Sci 2002, 99(26):16875-16880.
- [5]Lai Y, Wu B, Chen L, Zhao H: A statistical method for identifying differential gene-gene co-expression patterns. Bioinformatics 2004, 20(17):3146-3155.
- [6]Kostka D, Spang R: Finding disease specific alterations in the co-expression of genes. Bioinformatics 2004, 20(Suppl 1):i194-i199.
- [7]de la Fuente A: From ‘differential expression’ to ‘differential networking’ – identification of dysfunctional regulatory networks in diseases. Trends Genet 2010, 26(7):326-333.
- [8]Tesson B, Breitling R, Jansen R: DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules. BMC Bioinformatics 2010, 11:497. BioMed Central Full Text
- [9]Choi Y, Kendziorski C: Statistical methods for gene set co-expression analysis. Bioinformatics 2009, 25(21):2780-2786.
- [10]Southworth L, Owen A, Kim S: Aging mice show a decreasing correlation of gene expression within genetic modules. PLoS Genet 2009, 5(12):e1000776.
- [11]Taylor I, Linding R, Warde-Farley D, Liu Y, Pesquita C: Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat Biotechnol 2009, 27:199-204.
- [12]Teschendorff A, Severini S: Increased entropy of signal transduction in the cancer metastasis phenotype. BMC Syst Biol 2010, 4:104. BioMed Central Full Text
- [13]Ihmels J, Bergmann S, Berman J, Barkai N: Comparative gene expression analysis by differential clustering approach: application to the Candida albicans transcription program. PLoS Genet 2005, 1:e39.
- [14]Watson M: CoXpress: differential co-expression in gene expression data. BMC Bioinformatics 2006, 7:509-509. BioMed Central Full Text
- [15]Choi J, Yu U, Yoo O, Kim S: Differential coexpression analysis using microarray data and its application to human cancer. Bioinformatics (Oxford, England) 2005, 21(24):4348-4355.
- [16]Voy B, Schar J, Perkins A, Saxton A, Borate B, Chesler E, Branstetter L, Langston M: Extracting gene networks for low-dose radiation using graph theoretical algorithms. PloS Comput Biol 2006, 2(7):e89.
- [17]Zhang B, Li H, Riggins RB, Zhan M, Xuan J, Zhang Z, Hoffman EP, Clarke R, Wang Y: Differential dependency network analysis to identify condition-specific topological changes in biological networks. Bioinformatics 2009, 25(4):526-532.
- [18]Altay G, Asim M, Markowetz F, Neal D: Differential C3NET reveals disease networks of direct physical interactions. BMC Bioinformatics 2011, 12:296. BioMed Central Full Text
- [19]Amar D, Safer H, Shamir R: Dissection of regulatory networks that are altered in disease via differential co-expression. PLoS Comput Biol 2013, 9(3):e1002955.
- [20]Gruvberger S, Ringnér M, Chen Y, Panavally S, Saal LH, Borg A, Fernö M, Peterson C, Meltzer PS: Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns. Cancer Res 2001, 61(16):5979-5984.
- [21]van’t Veer L, Dai H, van de Vijver M, He Y, Hart A, Mao M, Peterse H, van der Kooy K, Marton M, Witteveen A, Schreiber G, Kerkhoven R, Roberts C, Linsley P, Bernards R, Friend S: Gene expression profiling predicts clinical outcome of breast cancer. Nature 2002, 415(6871):530-536.
- [22]Budczies J, Weichert W, Noske A, Müller B, Weller C, Wittenberger T, Hofmann H, Dietel M, Denkert C, Gekeler V: Genome-wide gene expression profiling of formalin-fixed paraffin-embedded breast cancer core biopsies using microarrays. J Histochem Cytochem 2011, 59(2):146-157.
- [23]Freudenberg J, Sivaganesan S, Wagner M, Medvedovic M: A semi-parametric Bayesian model for unsupervised differential co-expression analysis. BMC Bioinformatics 2010, 11:234. [http://www.biomedcentral.com/1471-2105/11/234 webcite] BioMed Central Full Text
- [24]Tegge AN, Caldwell CW, Xu D: Pathway correlation profile of gene-gene co-expression for identifying pathway perturbation. PLoS ONE 2012, 7(12):e52127.
- [25]R Development Core Team: R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2011. [http://www.R-project.org/ webcite] [ISBN 3-900051-07-0]
- [26]Steiger J: Test for comparing elements of a correlation matrix. Psychol Bull 1980, 87:245-251.
- [27]Storey JD, Tibshirani R: Statistical significance for genomewide studies. Proc Nat Acad Sci 2003, 100(16):9440-9445.
- [28]Edgar R, Domrachev M, Lash AE: Gene expression omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 2002, 30:207-210.
- [29]Györffy B, Lanczky A, Eklund AC, Denkert C, Budczies J, Li Q, Szallasi Z: An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat 2010, 123(3):725-731.
- [30]Gautier L, Cope L, Bolstad BM, Irizarry RA: Affy—analysis of affymetrix GeneChip data at the probe level. Bioinformatics 2004, 20(3):307-315.
- [31]Müller BM, Kronenwett R, Hennig G, Euting H, Weber K, Bohmann K, Weichert W, Altmann G, Roth C, Winzer KJ, Kristiansen G, Petry C, Dietel M, Denkert C: Quantitative determination of estrogen receptor, progesterone receptor, and HER2 mRNA in formalin-fixed paraffin-embedded tissue–a new option for predictive biomarker assessment in breast cancer. Diagn Mol Pathol 2011, 20:1-10.
- [32]Csardi G, Nepusz T: The igraph software package for complex network research. InterJournal Complex Systems 2006,, 1695.
- [33]Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T: Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 2011, 27(3):431-432.
- [34]Huang DW, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID Bioinformatics resources. Nat Protoc 2009, 4(1):44-57.
- [35]Huang DW, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009, 37(1):1-13.
- [36]Bauer K, Parise C, Caggiano V: Use of ER/PR/HER2 subtypes in conjunction with the 2007 St Gallen consensus statement for early breast cancer. BMC Cancer 2010, 10:228. BioMed Central Full Text
- [37]Celis JE, Gromov P, Cabezon T, Moreira JMA, Friis E, Jirstrom K, Llombart-Bosch A, Timmermans-Wielenga V, Rank F, Gromova I: 15-Prostaglandin dehydrogenase expression alone or in combination with ACSM1 defines a subgroup of the apocrine molecular subtype of breast carcinoma. Mol Cell Proteomics 2008, 7(10):1795-1809.
- [38]Celis JE, Cabezon T, Moreira JM, Gromov P, Gromova I, Timmermans-Wielenga V, Iwase T, Akiyama F, Honma N, Rank F: Molecular characterization of apocrine carcinoma of the breast: Validation of an apocrine protein signature in a well-defined cohort. Mol Oncol 2009, 3(3):220-237.
- [39]Staaf J, Jonsson G, Ringner M, Vallon-Christersson J, Grabau D, Arason A, Gunnarsson H, et al.: High-resolution genomic and expression analyses of copy number alterations in HER2-amplified breast cancer. Breast Cancer Res 2010, 12(3):R25. BioMed Central Full Text
- [40]Doane A, Danso M, Lal P, Donaton M, Zhang L, Hudis C, Gerald W: An estrogen receptor-negative breast cancer subset characterized by a hormonally regulated transcriptional program and response to androgen. Oncogene 2006, 25(28):3994-4008.
- [41]Ray PS, Wang J, Qu Y, Sim MS, Shamonki J, Bagaria SP, Ye X, Liu B, Elashoff D, Hoon DS, Walter MA, Martens JW, Richardson AL, Giuliano AE, Cui X: FOXC1 is a potential prognostic biomarker with functional significance in basal-like breast cancer. Cancer Res 2010, 70(10):3870-3876.
- [42]Loibl S, Müller B, von Minckwitz G, Schwabe M, Roller M, Darb-Esfahani S, Ataseven B, du Bois A, Fissler-Eckhoff A, Gerber B, Kulmer U, Alles J, Mehta K, Denkert C: Androgen receptor expression in primary breast cancer and its predictive and prognostic value in patients treated with neoadjuvant chemotherapy. Breast Cancer Res Treat 2011, 130(2):477-87.
- [43]Smith I, Procter M, Gelber R, Guillaume S, Feyereislova A, Dowsett M, Goldhirsch A, Untch M, Mariani G, Baselga J: 2-year follow-up of trastuzumab after adjuvant chemotherapy in HER2-positive breast cancer: a randomised controlled trial. Lancet 2007, 369(9555):29-36.
PDF