Author Affiliation: USIC&T, Guru Gobind Singh Indraprastha University, Sector 16, Dwarka, New Delhi, India
Publication: Procedia Computer Science
Year/Volume: 2024, Vol. 235
Pages: 250-263
Keywords: class imbalance problem; oversampling; undersampling; SMOTE; metaheuristic algorithms; whale optimization algorithm
Abstract: The problem of class imbalance has become a predominant area of research recently. The Synthetic Minority Oversampling Technique (SMOTE) is a popular and widely adopted oversampling technique that effectively addresses the challenge of class imbalance. However, its performance depends on a critical parameter, the number of nearest neighbors k_neighbors, which is often chosen arbitrarily by users and may therefore not yield optimal results. Furthermore, the varying imbalance ratios across datasets add complexity to the task of parameter selection in SMOTE. To address this issue, this paper proposes a hybrid rebalancing technique called Whale Optimization Algorithm based SMOTE (WOA-SMOTE), which combines a metaheuristic technique, WOA, with SMOTE. The algorithm exploits the strengths of WOA in finding the optimal value of k_neighbors for SMOTE, which is crucial for generating synthetic samples that represent the underlying sample distribution more faithfully, thereby improving the performance of classifiers on imbalanced datasets. The study evaluates WOA-SMOTE alongside 6 benchmark sampling techniques on 10 real-world imbalanced datasets from the KEEL repository. These datasets span different domains, with imbalance ratios (IR) ranging from 1.25 to 15.46. Four different classifiers are used, and the evaluation is based on three performance measures: AUC, g-mean, and F1 score. The experimental results show WOA-SMOTE's superior performance over SMOTE on the majority of the datasets. Notably, WOA-SMOTE outperforms existing techniques in terms of F1 score with the SVM and XGBoost classifiers on 8 out of 10 datasets, and its performance remains strong on 6 of these datasets with the random forest and logistic regression classifiers.
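To illustrate the tuning problem the abstract describes, the following is a minimal Python sketch: it searches for the k_neighbors value of SMOTE that maximizes a classifier's cross-validated F1 score. It assumes scikit-learn and imbalanced-learn, and it replaces the paper's WOA search with a plain exhaustive loop for brevity; this is a simplified stand-in for illustration, not the authors' implementation.

    # Sketch: pick the SMOTE k_neighbors that maximizes cross-validated F1.
    # The paper searches this space with the Whale Optimization Algorithm;
    # the exhaustive loop below is a simplifying assumption for illustration.
    import numpy as np
    from imblearn.over_sampling import SMOTE
    from imblearn.pipeline import Pipeline
    from sklearn.datasets import make_classification
    from sklearn.model_selection import cross_val_score
    from sklearn.svm import SVC

    # Synthetic imbalanced dataset (~9:1 ratio), standing in for a KEEL dataset.
    X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)

    best_k, best_f1 = None, -np.inf
    for k in range(1, 11):  # candidate k_neighbors values
        # The imblearn Pipeline applies SMOTE only to the training folds,
        # so synthetic samples never leak into the validation folds.
        pipe = Pipeline([
            ("smote", SMOTE(k_neighbors=k, random_state=0)),
            ("clf", SVC()),
        ])
        f1 = cross_val_score(pipe, X, y, scoring="f1", cv=5).mean()
        if f1 > best_f1:
            best_k, best_f1 = k, f1

    print(f"best k_neighbors = {best_k}, CV F1 = {best_f1:.3f}")

In WOA-SMOTE the exhaustive loop would be replaced by the whale optimization algorithm, presumably using a classifier score of this kind as the fitness function while exploring the k_neighbors space more economically.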