Journal article details
BMC Medical Informatics and Decision Making
The effect of word sense disambiguation accuracy on literature based discovery
Research
Judita Preiss [1], Mark Stevenson [1]
[1] Advanced Computing Research Center, Department of Computer Science, The University of Sheffield, 211 Portobello, S1 4DP, Sheffield, UK;
Keywords: Data mining; Text processing; Literature based discovery; Word sense disambiguation
DOI: 10.1186/s12911-016-0296-1
Source: Springer
【 Abstract 】

Background: The volume of research published in the biomedical domain has increasingly led researchers to focus on specific areas of interest, with the result that connections between findings are missed. Literature based discovery (LBD) attempts to address this problem by searching for previously unnoticed connections between published information (also known as “hidden knowledge”). A common approach is to identify hidden knowledge via shared linking terms. However, biomedical documents are highly ambiguous, which can lead LBD systems to over-generate hidden knowledge by hypothesising connections through different meanings of linking terms. Word Sense Disambiguation (WSD) aims to resolve ambiguities in text by identifying the meaning of ambiguous terms. This study explores the effect of WSD accuracy on LBD performance.

Methods: An existing LBD system is employed and four approaches to WSD of biomedical documents are integrated with it. The accuracy of each WSD approach is determined by comparing its output against a standard benchmark. Evaluation of the LBD output is carried out using a timeslicing approach, where hidden knowledge is generated from articles published prior to a certain cutoff date and a gold standard is extracted from publications after the cutoff date.

Results: WSD accuracy varies depending on the approach used. The connection between the performance of the LBD and WSD systems is analysed to reveal a correlation between WSD accuracy and LBD performance.

Conclusion: This study reveals that LBD performance is sensitive to WSD accuracy. It is therefore concluded that WSD has the potential to improve the output of LBD systems by reducing the amount of spurious hidden knowledge that is generated. It is also suggested that further improvements in WSD accuracy have the potential to improve LBD accuracy.
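
The linking-term approach and the timeslicing evaluation described in the abstract can be illustrated with a toy sketch. This is not the LBD system or any of the WSD approaches used in the study: the function names, the term lists, and the sense tags such as `cold#illness` are hypothetical and exist only to show how an ambiguous linking term can produce spurious hidden knowledge, and how precision and recall might be computed against post-cutoff co-occurrences.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrences(docs):
    """Map each term to the set of terms it shares at least one document with."""
    links = defaultdict(set)
    for terms in docs:
        for a, b in combinations(sorted(set(terms)), 2):
            links[a].add(b)
            links[b].add(a)
    return dict(links)


def hidden_knowledge(docs):
    """Pairs (a, c) that never co-occur but share at least one linking term."""
    links = cooccurrences(docs)
    candidates = set()
    for neighbours in links.values():
        for a, c in combinations(sorted(neighbours), 2):
            if c not in links[a]:
                candidates.add((a, c))
    return candidates


def timeslice_eval(pre_docs, post_docs):
    """Score pre-cutoff hypotheses against pairs first seen together after the cutoff."""
    predicted = hidden_knowledge(pre_docs)
    pre_links = cooccurrences(pre_docs)
    post_links = cooccurrences(post_docs)
    gold = {
        (a, c)
        for a, linked in post_links.items()
        for c in linked
        if a < c and c not in pre_links.get(a, set())
    }
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical pre-cutoff documents, each reduced to a list of terms.
ambiguous = [
    ["raynaud", "blood_viscosity"],
    ["blood_viscosity", "fish_oil"],
    ["cold", "rhinovirus"],   # "cold" meaning the illness
    ["cold", "frostbite"],    # "cold" meaning low temperature
]
# The same documents after an assumed, perfectly accurate WSD step.
disambiguated = [
    ["raynaud", "blood_viscosity"],
    ["blood_viscosity", "fish_oil"],
    ["cold#illness", "rhinovirus"],
    ["cold#temperature", "frostbite"],
]
# Hypothetical post-cutoff document supplying the gold standard.
post = [["raynaud", "fish_oil"]]

print(timeslice_eval(ambiguous, post))      # (0.5, 1.0)
print(timeslice_eval(disambiguated, post))  # (1.0, 1.0)
```

In this toy data the undisambiguated term "cold" links "frostbite" to "rhinovirus", a spurious hypothesis that halves precision; separating the two senses removes it. Real systems work with far larger vocabularies and imperfect WSD, so the effect is a matter of degree rather than the clean jump shown here.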

【 License 】

CC BY   
© Preiss and Stevenson. 2016
