期刊论文详细信息
Biomedical Informatics Insights
Using Ensemble Models to Classify the Sentiment Expressed in Suicide Notes:
James A.McCart1 
关键词: sentiment analysis;    machine learning;    text analysis;    i2b2 competition;   
DOI  :  10.4137/BII.S8931
学科分类:医学(综合)
来源: Sage Journals
PDF
【 摘 要 】

In 2007, suicide was the tenth leading cause of death in the U.S. Given the significance of this problem, suicide was the focus of the 2011 Informatics for Integrating Biology and the Bedside (i2b2) Natural Language Processing (NLP) shared task competition (track two). Specifically, the challenge concentrated on sentiment analysis, predicting the presence or absence of 15 emotions (labels) simultaneously in a collection of suicide notes spanning over 70 years. Our team explored multiple approaches combining regular expression-based rules, statistical text mining (STM), and an approach that applies weights to text while accounting for multiple labels. Our best submission used an ensemble of both rules and STM models to achieve a micro-averaged F1 score of 0.5023, slightly above the mean from the 26 teams that competed (0.4875).

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201901217700720ZK.pdf 559KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:2次