Preliminary Investigation of Balanced Stratified Reduction (BSR) for Imbalanced Datasets

Zakaria, M.H. and Jaafar, J. and Abdulkadir, S.J. (2023) Preliminary Investigation of Balanced Stratified Reduction (BSR) for Imbalanced Datasets. In: UNSPECIFIED.

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

The prevalence of imbalanced datasets in machine learning poses significant challenges, often leading to models with suboptimal performance, especially in recognizing underrepresented classes. This paper introduces a novel technique, Balanced Stratified Reduction (BSR), aiming to address these challenges by optimizing the sampling process. BSR utilizes horizontal stratification uniquely, drawing inspiration from the quintessence of boxplots. Preliminary results, based on two medical datasets - PIMA Indian Diabetes and Haberman's Survival, showcase BSR's potential in not only managing the class imbalance but also in retaining crucial information from the majority class. The paper outlines the foundational principles of BSR, the motivation behind its inception, a detailed experimental setup for its broader application, and preliminary findings. While the initial outcomes are promising, comprehensive evaluation and application across diverse datasets remain a focal point for future work. BSR's approach, grounded in its methodological rigor, holds promise for more effective handling of imbalanced datasets in real-world scenarios. © 2023 IEEE.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: cited By 0; Conference of 5th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2023 ; Conference Date: 12 September 2023 Through 14 September 2023; Conference Code:193996
Uncontrolled Keywords: reductions; Balanced stratified reduction; Class imbalance; Data preprocessing; Evaluation metrics; Haberman survival; Imbalanced dataset; Machine-learning; PIMA indian diabetes; Quartile-based stratification; Sampling technique, Machine learning
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 04 Jun 2024 14:11
Last Modified: 04 Jun 2024 14:11
URI: https://khub.utp.edu.my/scholars/id/eprint/19037

Actions (login required)

View Item
View Item