期刊论文详细信息
Gates Open Research
Clinical Trial Risk Tool: software application using natural language processing to identify the risk of trial uninformativeness
article
Thomas A Wood1  Douglas McNair2 
[1]Fast Data Science Ltd
[2]Integrated Development, Bill & Melinda Gates Foundation
关键词: Clinical trial protocol;    risk;    natural language processing;    uninformativeness;   
DOI  :  10.12688/gatesopenres.14416.1
学科分类:电子与电气工程
来源: American Journal Of Pharmtech Research
PDF
【 摘 要 】
Background: A large proportion of clinical trials end without delivering results that are useful for clinical, policy, or research decisions. This problem is called “uninformativeness”. Some high-risk indicators of uninformativeness can be identified at the stage of drafting the protocol, however the necessary information can be hard to find in unstructured text documents. Methods: We have developed a browser-based tool which uses natural language processing to identify and quantify the risk of uninformativeness. The tool reads and parses the text of trial protocols and identifies key features of the trial design, which are fed into a risk model. The application runs in a browser and features a graphical user interface that allows a user to drag and drop the PDF of the trial protocol and visualize the risk indicators and their locations in the text. The user can correct inaccuracies in the tool’s parsing of the text. The tool outputs a PDF report listing the key features extracted. The tool is focused HIV and tuberculosis trials but could be extended to more pathologies in future. Results: On a manually tagged dataset of 300 protocols, the tool was able to identify the condition of a trial with 100% area under curve (AUC), presence or absence of statistical analysis plan with 87% AUC, presence or absence of effect estimate with 95% AUC, number of subjects with 69% accuracy, and simulation with 98% AUC. On a dataset of 11,925 protocols downloaded from ClinicalTrials.gov, the tool was able to identify trial phase with 75% accuracy, number of arms with 58% accuracy, and the countries of investigation with 87% AUC. Conclusion: We have developed and validated a natural language processing tool for identifying and quantifying risks of uninformativeness in clinical trial protocols.
【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307110001040ZK.pdf 2371KB PDF download
  文献评价指标  
  下载次数:0次 浏览次数:0次