Journal of Translational Medicine | |
Empirical study using network of semantically related associations in bridging the knowledge gap | |
Ramin Zand2  Mohammed Yeasin1  Vida Abedi3  | |
[1] College of Arts and Sciences, Bioinformatics Program, Memphis University, Memphis 38152, TN, USA;Department of Neurology, University of Tennessee Health Science Center, Memphis 38163, TN, USA;The Center for Modeling Immunity to Entering Pathogens, Nutritional Immunology and Molecular Medicine Laboratory, Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg 24060, VA, USA | |
关键词: Semantic associations; Network of association; Latent semantic analysis (LSA); Multi-gram dictionary; Medical subject headings (MeSH); PubMed; Ontology mapping; Literature mining; Hypothesis generation; Knowledge discovery; | |
Others : 1147066 DOI : 10.1186/s12967-014-0324-9 |
|
received in 2014-08-22, accepted in 2014-11-11, 发布年份 2014 | |
【 摘 要 】
Background
The data overload has created a new set of challenges in finding meaningful and relevant information with minimal cognitive effort. However designing robust and scalable knowledge discovery systems remains a challenge. Recent innovations in the (biological) literature mining tools have opened new avenues to understand the confluence of various diseases, genes, risk factors as well as biological processes in bridging the gaps between the massive amounts of scientific data and harvesting useful knowledge.
Methods
In this paper, we highlight some of the findings using a text analytics tool, called ARIANA - Adaptive Robust and Integrative Analysis for finding Novel Associations.
Results
Empirical study using ARIANA reveals knowledge discovery instances that illustrate the efficacy of such tool. For example, ARIANA can capture the connection between the drug hexamethonium and pulmonary inflammation and fibrosis that caused the tragic death of a healthy volunteer in a 2001 John Hopkins asthma study, even though the abstract of the study was not part of the semantic model.
Conclusion
An integrated system, such as ARIANA, could assist the human expert in exploratory literature search by bringing forward hidden associations, promoting data reuse and knowledge discovery as well as stimulating interdisciplinary projects by connecting information across the disciplines.
【 授权许可】
2014 Abedi et al.; licensee BioMed Central Ltd.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20150403195458835.pdf | 1457KB | download | |
Figure 4. | 56KB | Image | download |
Figure 3. | 38KB | Image | download |
Figure 2. | 28KB | Image | download |
Figure 1. | 20KB | Image | download |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
【 参考文献 】
- [1][http://www.ncbi.nlm.nih.gov/pubmed] webcite PubMed. []
- [2]Rzhetsky A, Seringhaus M, Gerstein M: Seeking a new biology through text mining. Cell 2008, 134:9-13.
- [3]Wei C-H, Harris BR, Li D, Berardini TZ, Huala E, Kao H-Y, Lu Z: Accelerating literature curation with text-mining tools: a case study of using PubTator to curate genes in PubMed abstracts. Database (Oxford) 2012, 2012:bas041.
- [4]Wang H, Ding Y, Tang J, Dong X, He B, Qiu J, Wild DJ: Finding complex biological relationships in recent PubMed articles using Bio-LDA. PLoS One 2011, 6:e17243.
- [5]Abedi V, Zand R, Yeasin M, Faisal FE: An automated framework for hypotheses generation using literature. BioData Min 2012, 5:13. BioMed Central Full Text
- [6]Lu Z: PubMed and beyond: a survey of web tools for searching biomedical literature. Database (Oxford) 2011, 2011:baq036.
- [7]Chen H, Martin B, Daimon CM, Maudsley S: Effective use of latent semantic indexing and computational linguistics in biological and biomedical applications. Front Physiol 2013, 4:8.
- [8]Landauer TK, Laham D, Derr M: From paragraph to graph: latent semantic analysis for information visualization. PNAS 2004, 101:5214-5219.
- [9]Abedi V, Yeasin M, Zand R: ARIANA: adaptive robust and integrative analysis for finding novel associations. In 2014 Int Conf Adv Big Data Anal. CSREA Press, Las Vegas, NV; 2014.
- [10][http://www.ncbi.nlm.nih.gov/mesh] webcite Medical Subject Headings. .
- [11][http://omim.org/] webcite Online Mendelian Inheritance in Man. .
- [12]Yeasin M, Malempati H, Homayouni R, Sorower M: A systematic study on latent semantic analysis model parameters for mining biomedical literature. BMC Bioinformatics 2009, 10(Suppl 7):A6. BioMed Central Full Text
- [13][http:/ / www.hopkinsmedicine.org/ press/ 2001/ july/ report_of_internal_investigation.ht m] webcite Internal Investigative Committee Membership: Report of Internal Investigation into the Death of a Volunteer Research Subject. 2001, .
- [14]Robillard R, Riopelle JL, Adamkiewicz L, Tremblay G, Genest J: Pulmonary complications during treatment with hexamethonium. Can Med Assoc J 1955, 72:448-451.
- [15]Wollmer MA, Papassotiropoulos A, Streffer JR, Grimaldi LME, Kapaki E, Salani G, Paraskevas GP, Maddalena A, de Quervain D, Bieber C, Umbricht D, Lemke U, Bosshardt S, Degonda N, Henke K, Hegi T, Jung HH, Pasch T, Hock C, Nitsch RM: Genetic polymorphisms and cerebrospinal fluid levels of tissue inhibitor of metalloproteinases 1 in sporadic alzheimer’s disease. Psychiatr Genet 2002, 12:155-160.
- [16]Ridnour LA, Dhanapal S, Hoos M, Wilson J, Lee J, Cheng RYS, Brueggemann EE, Hines HB, Wilcock DM, Vitek MP, Wink DA, Colton CA: Nitric oxide-mediated regulation of β-amyloid clearance via alterations of MMP-9/TIMP-1. J Neurochem 2012, 123:736-749.
- [17]Brinckerhoff CE, Matrisian LM: Matrix metalloproteinases: a tail of a frog that became a prince. Nat Rev Mol Cell Biol 2002, 3:207-214.
- [18]Davidson JM: Biochemistry and turnover of lung interstitium. Eur Respir J Off J Eur Soc Clin Respir Physiol 1990, 3:1048-1063.
- [19]Elkington PT, Ugarte-Gil CA, Friedland JS: Matrix metalloproteinases in tuberculosis. Eur Respir J Off J Eur Soc Clin Respir Physiol 2011, 38:456-464.
- [20]Thuong NTT, Dunstan SJ, Chau TTH, Thorsson V, Simmons CP, Quyen NTH, Thwaites GE, Lan NTN, Hibberd M, Teo YY, Seielstad M, Aderem A, Farrar JJ, Hawn TR: Identification of tuberculosis susceptibility genes with human macrophage gene expression profiles. PLoS Pathog 2008, 4(12):e1000229.
- [21]Mehra S, Pahar B, Dutta NK, Conerly CN, Philippi-Falkenstein K, Alvarez X, Kaushal D: Transcriptional reprogramming in nonhuman primate (Rhesus Macaque) tuberculosis granulomas. PLoS One 2010, 5(8):e122666.
- [22]Russell DG, VanderVen BC, Lee W, Abramovitch RB, Kim M, Homolka S, Niemann S, Rohde KH: Mycobacterium tuberculosis wears what it eats. Cell Host Microbe 2010, 8:68-76.
- [23]Berry MPR, Graham CM, McNab FW, Xu Z, Bloch SAA, Oni T, Wilkinson KA, Banchereau R, Skinner J, Wilkinson RJ, Quinn C, Blankenship D, Dhawan R, Cush JJ, Mejias A, Ramilo O, Kon OM, Pascual V, Banchereau J, Chaussabel D, O’Garra A: An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature 2010, 466:973-977.
- [24]Van der Sar AM, Spaink HP, Zakrzewska A, Bitter W, Meijer AH: Specificity of the zebrafish host transcriptome response to acute and chronic mycobacterial infection and the role of innate and adaptive immune components. Mol Immunol 2009, 46:2317-2332.
- [25]Yong VW, Krekoski CA, Forsyth PA, Bell R, Edwards DR: Matrix metalloproteinases and diseases of the CNS. Trends Neurosci 1998, 21:75-80.
- [26]Yan P, Hu X, Song H, Yin K, Bateman RJ, Cirrito JR, Xiao Q, Hsu FF, Turk JW, Xu J, Hsu CY, Holtzman DM, Lee J-M: Matrix metalloproteinase-9 degrades amyloid-beta fibrils in vitro and compact plaques in situ. J Biol Chem 2006, 281:24566-24574.
- [27]Rusu C, Dumitrescu B: Stagewise K-SVD to design efficient dictionaries for sparse representations. IEEE Signal Process Lett 2012, 19:631-634.
- [28]Yaguang D, Guofeng Z, Chenyang C, Jian Z, Liang T: A parallel implementation of singular value decomposition based on map-reduce and PARPACK. In Proc 2011 Int Conf Comput Sci Netw Technol. Volume 2. IEEE, Harbin, China; 2011:739-741.
- [29]Liang Z, Li W, Li Y: A parallel probabilistic latent semantic analysis method on MapReduce platform. In 2013 IEEE Int Conf Inf Autom. IEEE, Yinchuan, China; 2013:1017-1022.