期刊论文详细信息
PATTERN RECOGNITION 卷:102
Radial-Based Undersampling for imbalanced data classification
Article
Koziarski, Michal1 
[1] AGH Univ Sci & Technol, Dept Elect, Al Mickiewicza 30, PL-30059 Krakow, Poland
关键词: Machine learning;    Classification;    Imbalanced data;    Undersampling;    Radial basis functions;   
DOI  :  10.1016/j.patcog.2020.107262
来源: Elsevier
PDF
【 摘 要 】

Data imbalance remains one of the most widespread problems affecting contemporary machine learning. The negative effect data imbalance can have on the traditional learning algorithms is most severe in combination with other dataset difficulty factors, such as small disjuncts, presence of outliers and insufficient number of training observations. Aforementioned difficulty factors can also limit the applicability of some of the methods of dealing with data imbalance, in particular the neighborhood-based oversampling algorithms based on SMOTE. Radial-Based Oversampling (RBO) was previously proposed to mitigate some of the limitations of the neighborhood-based methods. In this paper we examine the possibility of utilizing the concept of mutual class potential, used to guide the oversampling process in RBO, in the undersampling procedure. Conducted computational complexity analysis indicates a significantly reduced time complexity of the proposed Radial-Based Undersampling algorithm, and the results of the performed experimental study indicate its usefulness, especially on difficult datasets. (C) 2020 The Author. Published by Elsevier Ltd.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_patcog_2020_107262.pdf 2279KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:0次