期刊论文详细信息
Quantitative Imaging in Medicine and Surgery
Model-based clinical note entity recognition for rheumatoid arthritis using bidirectional encoder representation from transformers
article
Meiting Li1  Feifei Liu2  Jia’an Zhu2  Ran Zhang1  Yi Qin1  Dongping Gao1 
[1] Institute of Medical Information , Chinese Academy of Medical Sciences;Department of Ultrasound , Peking University People's Hospital
关键词: Named entity recognition;    rheumatoid arthritis (RA);    artificial intelligence;    bidirectional encoder representation from transformers (BERT);    clinical notes;   
DOI  :  10.21037/qims-21-90
学科分类:外科医学
来源: AME Publications
PDF
【 摘 要 】

Background: Rheumatoid arthritis (RA) is a disease of the immune system with a high rate of disability and there are a large amount of valuable disease diagnosis and treatment information in the clinical note of the electronic medical record. Artificial intelligence methods can be used to mine useful information in clinical notes effectively. This study aimed to develop an effective method to identify and classify medical entities in the clinical notes relating to RA and use the entity identification results in subsequent studies. Methods: In this paper, we introduced the bidirectional encoder representation from transformers (BERT) pre-training model to enhance the semantic representation of word vectors. The generated word vectors were then inputted into the model, which is composed of traditional bidirectional long short-term memory neural networks and conditional random field machine learning algorithms for the named entity recognition of clinical notes to improve the model's effectiveness. The BERT method takes the combination of token embeddings, segment embeddings, and position embeddings as the model input and fine-tunes the model during training. Results: Compared with the traditional Word2vec word vector model, the performance of the BERT pre-training model to obtain a word vector as model input was significantly improved. The best F1-score of the named entity recognition task after training using many rheumatoid arthritis clinical notes was 0.936. Conclusions: This paper confirms the effectiveness of using an advanced artificial intelligence method to carry out named entity recognition tasks on a corpus of a large number of clinical notes; this application is promising in the medical setting. Moreover, the extraction of results in this study provides a lot of basic data for subsequent tasks, including relation extraction, medical knowledge graph construction, and disease reasoning.

【 授权许可】

All Rights reserved   

【 预 览 】
附件列表
Files Size Format View
RO202303290000123ZK.pdf 909KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:0次