期刊论文详细信息
Sensors
Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
Guangyue Zhou1  Mingwen Shao1  Kewen Li1  Jiannan Zhai2  Fulai Li3 
[1] College of Computer and Communication Engineering, China University of Petroleum, Qingdao 266580, Shandong, China;Institute for Sensing and Embedded Network Systems Engineering, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USA;School of Geosciences, China University of Petroleum, Qingdao 266580, Shandong, China;
关键词: Adaptive Boosting;    imbalanced data;    Area Under Curve;    Particle Swarm Optimization;   
DOI  :  10.3390/s19061476
来源: DOAJ
【 摘 要 】

The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified samples rather than samples of minority classes. To better process imbalanced data, this paper introduces the indicator Area Under Curve (AUC) which can reflect the comprehensive performance of the model, and proposes an improved AdaBoost algorithm based on AUC (AdaBoost-A) which improves the error calculation performance of the AdaBoost algorithm by comprehensively considering the effects of misclassification probability and AUC. To prevent redundant or useless weak classifiers the traditional AdaBoost algorithm generated from consuming too much system resources, this paper proposes an ensemble algorithm, PSOPD-AdaBoost-A, which can re-initialize parameters to avoid falling into local optimum, and optimize the coefficients of AdaBoost weak classifiers. Experiment results show that the proposed algorithm is effective for processing imbalanced data, especially the data with relatively high imbalances.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次