Symmetry | |
Prognosis Essay Scoring and Article Relevancy Using Multi-Text Features and Machine Learning | |
Arif Mehmood1  Gyu Sang Choi1  Byung-Won On2  Ingyu Lee3  | |
[1] Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38542, Korea;Department of Statistics and Computer Science, Kunsan National University, Gunsan 54150, Korea;Sorrel College of Business, Troy University, Troy, AL 36082, USA; | |
关键词: data mining; text mining; computer-assisted assessment; article relevancy; prediction model; | |
DOI : 10.3390/sym9010011 | |
来源: DOAJ |
【 摘 要 】
This study develops a model for essay scoring and article relevancy. Essay scoring is a costly process when we consider the time spent by an evaluator. It may lead to inequalities of the effort by various evaluators to apply the same evaluation criteria. Bibliometric research uses the evaluation criteria to find relevancy of articles instead. Researchers mostly face relevancy issues while searching articles. Therefore, they classify the articles manually. However, manual classification is burdensome due to time needed for evaluation. The proposed model performs automatic essay evaluation using multi-text features and ensemble machine learning. The proposed method is implemented in two data sets: a Kaggle short answer data set for essay scoring that includes four ranges of disciplines (Science, Biology, English, and English language Arts), and a bibliometric data set having IoT (Internet of Things) and non-IoT classes. The efficacy of the model is measured against the Tandalla and AutoP approach using Cohen’s kappa. The model achieves kappa values of 0.80 and 0.83 for the first and second data sets, respectively. Kappa values show that the proposed model has better performance than those of earlier approaches.
【 授权许可】
Unknown