eprintid: 12647 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/01/26/47 datestamp: 2023-11-10 03:27:12 lastmod: 2023-11-10 03:27:12 status_changed: 2023-11-10 01:49:11 type: conference_item metadata_visibility: show creators_name: Hishamuddin, M.N.F. creators_name: Hassan, M.F. creators_name: Tran, D.C. creators_name: Mokhtar, A.A. title: Improving Classification Accuracy of Scikit-learn Classifiers with Discrete Fuzzy Interval Values ispublished: pub keywords: Computer software; Data mining; Decision trees; Fuzzy logic; Intelligent computing; Knowledge representation; Machine learning, 10-fold cross-validation; Classification accuracy; Continuous data; Data discretization; Discrete intervals; Discretization process; Fuzzy membership; Linguistic variable, Classification (of information) note: cited By 6; Conference of 2020 International Conference on Computational Intelligence, ICCI 2020 ; Conference Date: 8 October 2020 Through 9 October 2020; Conference Code:164916 abstract: Understanding machine learning (ML) algorithm from scratch is time consuming. Thus, many software and library packages such as Weka and Scikit-Learn have been introduced to help researchers run simulation on several amounts of well-known classifiers. In ML, different classifiers have different performance and this depends on factor such as type of data used as input for the classification phase. Thus, it is necessary to perform data discretization when dealing with continuous data for classifiers that perform better with discrete data. However, in data mining, depending solely on discretization is not enough as real-world data can be large, imprecise and noisy. In addition, knowledge representation is necessary to help researchers to understand better about the data during the discretization process. Thus, the objective of this study is to observe the effect of fuzzy elements inside the discretization phase on the classification accuracy of Scikit-learn classifiers. In this study, fuzzy logic has been proposed to assist the existing discretization technique through fuzzy membership graph, linguistic variables and discrete interval values. All classifiers in Scikit-learn packages were used during the classification phase through 10-fold cross validation. The simulation results showed that the presence of fuzzy in assisting the discretization process slightly improved the classification accuracy of ensemble type classifiers such as Random Forest and Naive Bayes while slightly degrading the performance of other classifiers. © 2020 IEEE. date: 2020 publisher: Institute of Electrical and Electronics Engineers Inc. official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85097524594&doi=10.1109%2fICCI51257.2020.9247696&partnerID=40&md5=80d1e8e4c3f9a70d31ab39ce1de3c105 id_number: 10.1109/ICCI51257.2020.9247696 full_text_status: none publication: 2020 International Conference on Computational Intelligence, ICCI 2020 pagerange: 163-166 refereed: TRUE isbn: 9781728154473 citation: Hishamuddin, M.N.F. and Hassan, M.F. and Tran, D.C. and Mokhtar, A.A. (2020) Improving Classification Accuracy of Scikit-learn Classifiers with Discrete Fuzzy Interval Values. In: UNSPECIFIED.