2019 The 5th International Conference on Electrical Engineering, Control and Robotics | |
A New Improved Boosting for Imbalanced Data Classification | |
无线电电子学;计算机科学 | |
Zhang, Zongtang^1 ; Qiu, Jiaxing^1 ; Dai, Weiguo^1 | |
Navy Submarine Academy, Qingdao, China^1 | |
关键词: Boosting algorithm; Class imbalance; Cost-sensitive; Imbalanced data; Statistical characteristics; | |
Others : https://iopscience.iop.org/article/10.1088/1757-899X/533/1/012047/pdf DOI : 10.1088/1757-899X/533/1/012047 |
|
学科分类:计算机科学(综合) | |
来源: IOP | |
【 摘 要 】
As one of the most important component of artificial intelligence, machine learning is getting more and more attention. AdaBoost, a classic machine learning algorithm, is widely used. However, when faced with imbalanced data classification, AdaBoost's recognition rate of minority samples is low due to ignoring class imbalance. In many cases, minority samples are of high value. For this shortage, combining the theory of margin and cost-sensitive idea, a new Boosting algorithm called CMBoost is proposed based on cost-sensitive margin statistical characteristics, which is firstly through optimizing margin statistical characteristics to improve formal algorithm and then extended by cost-sensitive. Experimental results on the UCI dataset show that the CMBoost algorithm is superior to AdaBoost for imbalanced data classification problem.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
A New Improved Boosting for Imbalanced Data Classification | 570KB | download |