eprintid: 16404 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/01/64/04 datestamp: 2023-12-19 03:22:56 lastmod: 2023-12-19 03:22:56 status_changed: 2023-12-19 03:06:11 type: article metadata_visibility: show creators_name: Yang, L. creators_name: Aghaabbasi, M. creators_name: Ali, M. creators_name: Jan, A. creators_name: Bouallegue, B. creators_name: Javed, M.F. creators_name: Salem, N.M. title: Comparative Analysis of the Optimized KNN, SVM, and Ensemble DT Models Using Bayesian Optimization for Predicting Pedestrian Fatalities: An Advance towards Realizing the Sustainable Safety of Pedestrians ispublished: pub keywords: machine learning, Australia note: cited By 6 abstract: Over the past three decades, more than 8000 pedestrians have been killed in Australia due to vehicular crashes. There is a general assumption that pedestrians are often the most vulnerable to crashes. Sustainable transportation goals are at odds with the high risk of pedestrian fatalities and injuries in car crashes. It is imperative that the reasons for pedestrian injuries be identified if we are to improve the safety of this group of road users who are particularly susceptible. These results were obtained mostly through the use of well-established statistical approaches. A lack of flexibility in managing outliers, incomplete, or inconsistent data, as well as rigid pre-assumptions, have been criticized in these models. This study employed three well-known machine learning models to predict road-crash-related pedestrian fatalities (RCPF). These models included support vector machines (SVM), ensemble decision trees (EDT), and k-nearest neighbors (KNN). These models were hybridized with a Bayesian optimization (BO) algorithm to find the optimum values of their hyperparameters, which are extremely important to accurately predict the RCPF. The findings of this study show that all the three models� performance was improved using the BO. The KNN model had the highest improvement in accuracy (+11) after the BO was applied to it. However, the ultimate accuracy of the SVM and EDT models was higher than that of the KNN model. This study establishes the framework for employing optimized machine learning techniques to reduce pedestrian fatalities in traffic accidents. © 2022 by the authors. date: 2022 publisher: MDPI official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85137936362&doi=10.3390%2fsu141710467&partnerID=40&md5=220fff526a4aefda4ece03b98d7d9a0d id_number: 10.3390/su141710467 full_text_status: none publication: Sustainability (Switzerland) volume: 14 number: 17 refereed: TRUE issn: 20711050 citation: Yang, L. and Aghaabbasi, M. and Ali, M. and Jan, A. and Bouallegue, B. and Javed, M.F. and Salem, N.M. (2022) Comparative Analysis of the Optimized KNN, SVM, and Ensemble DT Models Using Bayesian Optimization for Predicting Pedestrian Fatalities: An Advance towards Realizing the Sustainable Safety of Pedestrians. Sustainability (Switzerland), 14 (17). ISSN 20711050