eprintid: 14653 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/01/46/53 datestamp: 2023-11-10 03:29:14 lastmod: 2023-11-10 03:29:14 status_changed: 2023-11-10 01:57:28 type: article metadata_visibility: show creators_name: AlThuwaynee, O.F. creators_name: Kim, S.-W. creators_name: Najemaden, M.A. creators_name: Aydda, A. creators_name: Balogun, A.-L. creators_name: Fayyadh, M.M. creators_name: Park, H.-J. title: Demystifying uncertainty in PM10 susceptibility mapping using variable drop-off in extreme-gradient boosting (XGB) and random forest (RF) algorithms ispublished: pub keywords: accuracy assessment; algorithm; gradient analysis; Landsat; machine learning; mapping method; particulate matter; satellite imagery; uncertainty analysis, algorithm; human; machine learning; particulate matter; uncertainty, Algorithms; Humans; Machine Learning; Particulate Matter; Uncertainty note: cited By 16 abstract: This study investigates uncertainty in machine learning that can occur when there is significant variance in the prediction importance level of the independent variables, especially when the ROC fails to reflect the unbalanced effect of prediction variables. A variable drop-off loop function, based on the concept of early termination for reduction of model capacity, regularization, and generalization control, was tested. A susceptibility index for airborne particulate matter of less than 10 μm diameter (PM10) was modeled using monthly maximum values and spectral bands and indices from Landsat 8 imagery, and Open Street Maps were used to prepare a range of independent variables. Probability and classification index maps were prepared using extreme-gradient boosting (XGBOOST) and random forest (RF) algorithms. These were assessed against utility criteria such as a confusion matrix of overall accuracy, quantity of variables, processing delay, degree of overfitting, importance distribution, and area under the receiver operating characteristic curve (ROC). Graphical abstract: Figure not available: see fulltext. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature. date: 2021 publisher: Springer Science and Business Media Deutschland GmbH official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85104056850&doi=10.1007%2fs11356-021-13255-4&partnerID=40&md5=00a37e73e28bc9ef27fc2fbf0733d1dc id_number: 10.1007/s11356-021-13255-4 full_text_status: none publication: Environmental Science and Pollution Research volume: 28 number: 32 pagerange: 43544-43566 refereed: TRUE issn: 09441344 citation: AlThuwaynee, O.F. and Kim, S.-W. and Najemaden, M.A. and Aydda, A. and Balogun, A.-L. and Fayyadh, M.M. and Park, H.-J. (2021) Demystifying uncertainty in PM10 susceptibility mapping using variable drop-off in extreme-gradient boosting (XGB) and random forest (RF) algorithms. Environmental Science and Pollution Research, 28 (32). pp. 43544-43566. ISSN 09441344