期刊论文详细信息
Education Sciences
Predictive Models for Imbalanced Data: A School Dropout Perspective
ThiagoM. Barros1  PlácidoA. Souza Neto1  LuizAffonso Guedes2  Ivanovitch Silva2 
[1] Federal Institute of Rio Grande do Norte (IFRN), 1559 Tirol Natal, Brazil;Federal University of Rio Grande do Norte (UFRN), 59078-970 Natal, Brazil;
关键词: dropout rates;    accuracy paradox;    imbalanced learning;    downsample;    g-mean predict;    mlp;    decision tree;    balanced bagging;    uar;    smote;    adasyn;   
DOI  :  10.3390/educsci9040275
来源: DOAJ
【 摘 要 】

Predicting school dropout rates is an important issue for the smooth execution of an educational system. This problem is solved by classifying students into two classes using educational activities related statistical datasets. One of the classes must identify the students who have the tendency to persist. The other class must identify the students who have the tendency to dropout. This problem often encounters a phenomenon that masks out the obtained results. This study delves into this phenomenon and provides a reliable educational data mining technique that accurately predicts the dropout rates. In particular, the three data classifying techniques, namely, decision tree, neural networks and Balanced Bagging, are used. The performances of these classifies are tested with and without the use of a downsample, SMOTE and ADASYN data balancing. It is found that among other parameters geometric mean and UAR provides reliable results while predicting the dropout rates using Balanced Bagging classifying techniques.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次