An adaptive rank aggregation-based ensemble multi-filter feature selection method in software defect prediction

Balogun, A.O. and Basri, S. and Capretz, L.F. and Mahamad, S. and Imam, A.A. and Almomani, M.A. and Adeyemo, V.E. and Kumar, G. (2021) An adaptive rank aggregation-based ensemble multi-filter feature selection method in software defect prediction. Entropy, 23 (10). ISSN 10994300

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

Feature selection is known to be an applicable solution to address the problem of high dimensionality in software defect prediction (SDP). However, choosing an appropriate filter feature selection (FFS) method that will generate and guarantee optimal features in SDP is an open research issue, known as the filter rank selection problem. As a solution, the combination of multiple filter methods can alleviate the filter rank selection problem. In this study, a novel adaptive rank aggregation-based ensemble multi-filter feature selection (AREMFFS) method is proposed to resolve high dimensionality and filter rank selection problems in SDP. Specifically, the proposed AREMFFS method is based on assessing and combining the strengths of individual FFS methods by aggregating multiple rank lists in the generation and subsequent selection of top-ranked features to be used in the SDP process. The efficacy of the proposed AREMFFS method is evaluated with decision tree (DT) and naïve Bayes (NB) models on defect datasets from different repositories with diverse defect granularities. Findings from the experimental results indicated the superiority of AREMFFS over other baseline FFS methods that were evaluated, existing rank aggregation based multi-filter FS methods, and variants of AREMFFS as developed in this study. That is, the proposed AREMFFS method not only had a superior effect on prediction performances of SDP models but also outperformed baseline FS methods and existing rank aggregation based multi-filter FS methods. Therefore, this study recommends the combination of multiple FFS methods to utilize the strength of respective FFS methods and take advantage of filter�filter relationships in selecting optimal features for SDP processes. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.

Item Type: Article
Additional Information: cited By 9
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 10 Nov 2023 03:28
Last Modified: 10 Nov 2023 03:28
URI: https://khub.utp.edu.my/scholars/id/eprint/14404

Actions (login required)

View Item
View Item