Journal article details
BMC Medical Informatics and Decision Making
A tree based approach for multi-class classification of surgical procedures using structured and unstructured data
Suzan Arslanturk [1], Tannaz Khaleghi [2], Alper Murat [2]
[1] Department of Computer Science, Wayne State University, Detroit, MI, USA
[2] Department of Industrial and Systems Engineering, Wayne State University, Detroit, MI, USA
Keywords: Current procedure terminology (CPT) code; Machine learning; Ensemble learning; Importance weight; Random Forest; Multi-class classification; Surgery code
DOI: 10.1186/s12911-021-01665-w
Source: Springer
【 Abstract 】

Background: In surgical departments, CPT code assignment has been a complicated manual effort that requires significant domain knowledge and experience. While several studies use CPTs to make predictions in surgical services, the literature on predicting CPTs in surgical and other services using text features is very sparse. This study improves the prediction of CPTs by means of informative features and a novel re-prioritization algorithm.

Methods: The input data used in this study comprise both structured and unstructured data. The ground truth labels (CPTs) are obtained from medical coding databases using relative value units, which indicate the major operational procedures in each surgery case. In the modeling process, we first use a Random Forest multi-class classification model to predict the CPT codes. Second, we extract key information such as label probabilities, feature importance measures, and medical term frequency. These factors are then used in a novel algorithm that reorders the alternative CPT codes in the list of potential candidates based on the calculated weights.

Results: To evaluate the performance of both phases, prediction and complementary improvement, we report the accuracy scores of multi-class CPT prediction tasks for datasets of 5 key surgery case specialties. The Random Forest model performs the classification task with 74–76% accuracy when predicting the primary CPT (accuracy@1) versus the CPT set (accuracy@2) with respect to two filtering conditions on CPT codes. The complementary algorithm improves the results of the initial step by 8% on average. Furthermore, the incorporated text features enhance the quality of the output by 20–35%. The model outperforms the state-of-the-art neural network model with respect to accuracy, precision and recall.

Conclusions: We have established a robust framework based on a decision tree predictive model. We predict the surgical codes more accurately and robustly than the state-of-the-art deep neural structures, which can help immensely in both surgery billing and scheduling in such units.
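
The abstract reports accuracy@1 and accuracy@2 without defining them. The following is a minimal sketch under one common reading, namely that a case counts as correct at k when the true CPT code appears among the k candidates with the highest predicted probability. The function names, the labels `CPT_A`, `CPT_B`, `CPT_C`, and the probabilities are hypothetical placeholders and are not taken from the paper; the authors' own definition of the "CPT set" may differ.

```python
from typing import Dict, List, Sequence


def top_k_labels(probabilities: Dict[str, float], k: int) -> List[str]:
    """Return the k labels with the highest predicted probability."""
    ranked = sorted(probabilities.items(), key=lambda item: item[1], reverse=True)
    return [label for label, _ in ranked[:k]]


def accuracy_at_k(
    true_labels: Sequence[str],
    predicted_probabilities: Sequence[Dict[str, float]],
    k: int,
) -> float:
    """Fraction of cases whose true label is among the top-k ranked candidates.

    Assumed definition: the abstract does not spell out accuracy@k.
    """
    if len(true_labels) != len(predicted_probabilities):
        raise ValueError("true_labels and predicted_probabilities differ in length")
    if not true_labels:
        raise ValueError("at least one case is required")
    hits = sum(
        1
        for truth, probs in zip(true_labels, predicted_probabilities)
        if truth in top_k_labels(probs, k)
    )
    return hits / len(true_labels)


# Hypothetical per-case class probabilities, e.g. as a classifier might emit.
cases = [
    {"CPT_A": 0.60, "CPT_B": 0.30, "CPT_C": 0.10},
    {"CPT_A": 0.50, "CPT_B": 0.40, "CPT_C": 0.10},
    {"CPT_A": 0.20, "CPT_B": 0.30, "CPT_C": 0.50},
    {"CPT_A": 0.45, "CPT_B": 0.15, "CPT_C": 0.40},
]
truth = ["CPT_A", "CPT_B", "CPT_B", "CPT_B"]

print(accuracy_at_k(truth, cases, k=1))  # 0.25
print(accuracy_at_k(truth, cases, k=2))  # 0.75
```

Under this reading, a re-prioritization step like the one described in Methods can only raise accuracy@1 by moving the true code from a lower rank to the top of the candidate list, which is why the two metrics are worth reporting side by side.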

【 License 】

CC BY   
