期刊论文详细信息
BMC Medical Informatics and Decision Making
Combination of conditional random field with a rule based method in the extraction of PICO elements
Samir Chabou1  Michal Iglewski1 
[1] Computer Science and Engineering Department, Université du Québec en Outaouais;
关键词: PICO;    MLMs;    CRF;    RBMs;    Information extraction;    NLP;   
DOI  :  10.1186/s12911-018-0699-2
来源: DOAJ
【 摘 要 】

Abstract Background Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult as the volume of medical information expands and the health semantics is complex to capture it from unstructured information. The combination of the machine learning methods (MLMs) with rule based methods (RBMs) could facilitate and improve the PICO extraction. This paper studies the PICO elements extraction methods. The goal is to combine the MLMs with the RBMs to extract PICO elements in medical papers to facilitate answering clinical questions formulated with the PICO framework. Methods First, we analyze the aspects of the MLM model that influence the quality of the PICO elements extraction. Secondly, we combine the MLM approach with the RBMs in order to improve the PICO elements retrieval process. To conduct our experiments, we use a corpus of 1000 abstracts. Results We obtain an F-score of 80% for P element, 64% for the I element and 92% for the O element. Given the nature of the used training corpus where P and I elements represent respectively only 6.5 and 5.8% of total sentences, the results are competitive with previously published ones. Conclusions Our study of the PICO element extraction shows that the task is very challenging. The MLMs tend to have an acceptable precision rate but they have a low recall rate when the corpus is not representative. The RBMs backed up the MLMs to increase the recall rate and consequently the combination of the two methods gave better results.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次