BMC Medical Informatics and Decision Making | |
Non-redundant association rules between diseases and medications: an automated method for knowledge base construction | |
Research Article | |
Gabriel Nisand1  Hassina Lefèvre1  Nicolas Jay2  Erik A Sauleau3  Nicolas Meyer3  François Séverac3  | |
[1] Groupe Méthode en Recherche Clinique, Service de Santé Publique, Hôpitaux Universitaires de Strasbourg, Strasbourg, France;LORIA – Equipe Orpailleurs, Université de Nancy, Nancy, France;SPI-EAO, Faculté de Médecine, Université de Nancy, Nancy, France;Laboratoire de Biostatistique et d’Informatique Médicale, Faculté de Médecine, Hôpitaux Universitaires de Strasbourg, Strasbourg, France;Groupe Méthode en Recherche Clinique, Service de Santé Publique, Hôpitaux Universitaires de Strasbourg, Strasbourg, France; | |
关键词: Data mining; Association rules mining; Natural language processing; Knowledge base; | |
DOI : 10.1186/s12911-015-0151-9 | |
received in 2014-05-27, accepted in 2015-03-26, 发布年份 2015 | |
来源: Springer | |
【 摘 要 】
BackgroundThe widespread use of electronic health records (EHRs) has generated massive clinical data storage. Association rules mining is a feasible technique to convert this large amount of data into usable knowledge for clinical decision making, research or billing. We present a data driven method to create a knowledge base linking medications to pathological conditions through their therapeutic indications from elements within the EHRs.MethodsAssociation rules were created from the data of patients hospitalised between May 2012 and May 2013 in the department of Cardiology at the University Hospital of Strasbourg. Medications were extracted from the medication list, and the pathological conditions were extracted from the discharge summaries using a natural language processing tool. Association rules were generated along with different interestingness measures: chi square, lift, conviction, dependency, novelty and satisfaction. All medication-disease pairs were compared to the Summary of Product Characteristics, which is the gold standard. A score based on the other interestingness measures was created to filter the best rules, and the indices were calculated for the different interestingness measures.ResultsAfter the evaluation against the gold standard, a list of accurate association rules was successfully retrieved. Dependency represents the best recall (0.76). Our score exhibited higher exactness (0.84) and precision (0.27) than all of the others interestingness measures. Further reductions in noise produced by this method must be performed to improve the classification precision.ConclusionsAssociation rules mining using the unstructured elements of the EHR is a feasible technique to identify clinically accurate associations between medications and pathological conditions.
【 授权许可】
CC BY
© Séverac et al.; licensee BioMed Central. 2015
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311097521090ZK.pdf | 632KB | download | |
12864_2016_3440_Article_IEq67.gif | 1KB | Image | download |
【 图 表 】
12864_2016_3440_Article_IEq67.gif
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]