Hishamuddin, M.N.F. and Hassan, M.F. and Mokhtar, A.A. (2020) Improving Classification Accuracy of Random Forest Algorithm Using Unsupervised Discretization with Fuzzy Partition and Fuzzy Set Intervals. In: UNSPECIFIED.
Full text not available from this repository.Abstract
It is known that certain classification algorithm requires continuous data to be discretized for it to produce better classification accuracy. Hence, many works have explored the pairing of classification algorithm and discretization techniques, yet tree-based classifier especially Classification and Regression Trees (CART) still have an issue with classification accuracy regardless of different pairing with existing discretization techniques. The role of fuzzy partition and fuzzy sets interval are not something new in data discretization but none yet to explore the pairing of fuzzy discretization with tree-based algorithm. This paper will be discussing on an approach of using fuzzy based discretization and a member of tree-based algorithm known as Random Forest, a better version of CART. In this study, continuous data are identified from a dataset and discretized through the fuzzy discretization. Then, 10-fold cross validation is done on the transformed dataset and seven well-known classifiers are used including the proposed approach. Based on the results, better classification accuracy is achieved when fuzzy discretization is paired with Random Forest algorithm compared to CART. On top of that, with the present of fuzzy discretization technique, an increased in the classification accuracy has been obtained compared to other classification algorithms. © 2020 ACM.
Item Type: | Conference or Workshop Item (UNSPECIFIED) |
---|---|
Additional Information: | cited By 1; Conference of 9th International Conference on Software and Computer Applications, ICSCA 2020 ; Conference Date: 18 February 2020 Through 21 February 2020; Conference Code:164254 |
Uncontrolled Keywords: | Application programs; Decision trees; Fuzzy sets; Random forests, 10-fold cross-validation; Classification accuracy; Classification algorithm; Classification and regression tree; Data discretization; Random forest algorithm; Tree-based algorithms; Unsupervised discretization, Classification (of information) |
Depositing User: | Mr Ahmad Suhairi UTP |
Date Deposited: | 10 Nov 2023 03:28 |
Last Modified: | 10 Nov 2023 03:28 |
URI: | https://khub.utp.edu.my/scholars/id/eprint/13460 |