BMC Medical Genomics
Surveillance for the prevention of chronic diseases through information association
Alessandra Alaniz Macedo [2], Maria da Graça Pimentel [1], Evandro Seron Ruiz [2], José Augusto Baranauskas [2], Juliana Tarossi Pollettini [2]
[1] Department of Computer Science - ICMC - University of São Paulo, São Carlos-SP, Brazil
[2] Department of Computer Science and Mathematics - FFCLRP - University of São Paulo (USP), Ribeirão Preto-SP, Brazil
Keywords: Ontology; Medical records and scientific papers; Retrieval and application of biomedical knowledge and information; Biomedical informatics
DOI: 10.1186/1755-8794-7-7
Received 2013-05-27; accepted 2014-01-16; published 2014
【 Abstract 】

Background

Research in genomic medicine suggests that exposure to risk factors early in life may induce the development of chronic diseases in adulthood, since the early presence of such risk factors can influence gene expression. The large number of scientific papers published in this area makes it difficult for healthcare professionals to keep up with individual results and to establish associations among them. We therefore aim to build a computational system that alerts health professionals to human development problems such as cardiovascular disease, obesity and type 2 diabetes.

Methods

We built a computational system called the Chronic Illness Surveillance System (CISS), which retrieves scientific studies that establish associations (conceptual relationships) between chronic diseases (cardiovascular diseases, diabetes and obesity) and the risk factors described in clinical records. To evaluate our approach, we submitted ten queries to CISS and to three other search engines (Google™, Google Scholar™ and PubMed®). The queries were composed of terms and expressions from a list of risk factors provided by specialists.
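
As a rough illustration of how queries could be composed from a specialist-provided list of risk factors, the sketch below pairs each risk-factor term with the three chronic diseases named above. It is not the CISS implementation: the function name `build_queries`, the Boolean `AND`/`OR` syntax, and the example risk-factor terms are assumptions made only for this example.

```python
from typing import List


def build_queries(risk_factors: List[str], diseases: List[str]) -> List[str]:
    """Return one Boolean query per risk factor (hypothetical helper).

    Each query asks for documents that mention the risk factor together
    with at least one of the diseases. Terms are quoted so multi-word
    expressions are treated as phrases by engines that support quoting.
    """
    disease_clause = " OR ".join(f'"{d}"' for d in diseases)
    return [f'"{rf}" AND ({disease_clause})' for rf in risk_factors]


# Diseases named in the abstract.
DISEASES = ["cardiovascular disease", "diabetes", "obesity"]

# Placeholder risk factors for illustration only; the specialists' actual
# list is not given in the abstract.
EXAMPLE_RISK_FACTORS = ["low birth weight", "maternal smoking"]

if __name__ == "__main__":
    for query in build_queries(EXAMPLE_RISK_FACTORS, DISEASES):
        print(query)
```

The same query strings could then be submitted unchanged to each engine under comparison, so that differences in the retrieved documents reflect the engines rather than the wording of the query.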

Results

CISS retrieved more closely related (+) and somewhat related (+/-) documents, and fewer unrelated (-) and almost unrelated (-/+) documents, than the three other systems. Friedman's test with the post-hoc Holm procedure (95% confidence), comparing our system (control) against the three other engines, indicates that our system performed best in three of the categories: (+), (-) and (+/-). This is an important result, since these are the most relevant categories for our users.
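
To make the statistical comparison more concrete, the following is a minimal sketch of a Friedman test followed by Holm's step-down procedure against a control. It assumes one common formulation: systems are ranked within each query, the Friedman chi-square is computed from the mean ranks without tie correction, and each system is compared with the control through a normal approximation. The abstract does not state the exact variant used, and the counts in `EXAMPLE_COUNTS` are invented placeholders, not results from the study.

```python
import math
from typing import Dict, List


def average_ranks(rows: List[List[float]]) -> List[float]:
    """Mean rank of each column; rank 1 = largest value, ties share the average rank."""
    k = len(rows[0])
    totals = [0.0] * k
    for row in rows:
        order = sorted(range(k), key=lambda j: -row[j])
        i = 0
        while i < k:
            j = i
            while j + 1 < k and row[order[j + 1]] == row[order[i]]:
                j += 1
            rank = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
            for t in range(i, j + 1):
                totals[order[t]] += rank
            i = j + 1
    return [t / len(rows) for t in totals]


def friedman_statistic(mean_ranks: List[float], n: int) -> float:
    """Friedman chi-square (no tie correction) for k systems over n queries."""
    k = len(mean_ranks)
    return 12 * n / (k * (k + 1)) * (sum(r * r for r in mean_ranks) - k * (k + 1) ** 2 / 4)


def holm_vs_control(mean_ranks: List[float], n: int,
                    control: int = 0, alpha: float = 0.05) -> Dict[int, bool]:
    """For each non-control system, report whether Holm's procedure rejects equality with the control."""
    k = len(mean_ranks)
    std_err = math.sqrt(k * (k + 1) / (6 * n))
    p_values = []
    for j in range(k):
        if j == control:
            continue
        z = (mean_ranks[j] - mean_ranks[control]) / std_err
        p_values.append((math.erfc(abs(z) / math.sqrt(2)), j))  # two-sided normal p-value
    p_values.sort()
    m = len(p_values)
    rejected: Dict[int, bool] = {}
    still_rejecting = True
    for i, (p, j) in enumerate(p_values):
        # Holm: the i-th smallest p-value is tested at alpha / (m - i);
        # after the first failure no further hypothesis is rejected.
        still_rejecting = still_rejecting and p <= alpha / (m - i)
        rejected[j] = still_rejecting
    return rejected


# Invented placeholder counts of closely related (+) documents per query.
# Column 0 is the control system; columns 1-3 stand for three other engines.
EXAMPLE_COUNTS = [[9, 7, 5, 3]] * 10

if __name__ == "__main__":
    ranks = average_ranks(EXAMPLE_COUNTS)
    print("mean ranks:", ranks)
    print("Friedman chi-square:", friedman_statistic(ranks, len(EXAMPLE_COUNTS)))
    print("Holm rejections vs control:", holm_vs_control(ranks, len(EXAMPLE_COUNTS)))
```

With `alpha = 0.05` this corresponds to the 95% confidence level mentioned above. For categories where a smaller count is better, such as unrelated (-) documents, the counts would need to be negated or the ranking direction reversed before calling `average_ranks`.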

Conclusion

Our system should be able to assist researchers and health professionals in finding relationships between potential risk factors and chronic diseases in scientific papers.

【 License 】

   
2014 Pollettini et al.; licensee BioMed Central Ltd.

【 Figures and Tables 】

Figure 1.

Figure 2.

Figure 3.
