eprintid: 18889 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/01/88/89 datestamp: 2024-06-04 14:11:19 lastmod: 2024-06-04 14:11:19 status_changed: 2024-06-04 14:04:22 type: article metadata_visibility: show creators_name: Imai, B. creators_name: Nasir, Q. creators_name: Maulud, A.S. creators_name: Nawaz, M. creators_name: Nasir, R. creators_name: Suleman, H. title: Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets ispublished: pub keywords: Alkanolamines; Carbon dioxide; Ethanolamines; Large dataset; Phase equilibria; Statistics, Chemical equilibriums; Data outlier identification; Data outliers; Discrepant data; Network-based; Neural-networks; Outlier identification; Phase equilibrium data; Prediction errors; Standard deviation method, Neural networks note: cited By 0 abstract: Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation studies developed using such experimental datasets. Hence, it is important that the data outliers are identified and later corrected or removed before developing a model or simulation. This study proposes a modified approach to identifying data outliers present in the phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems using an artificial neural network and data outlier identification methods. Firstly, the suggested approach correlates the experimental phase equilibrium data (2152 data points) of CO2 and H2S-loaded monoethanolamine, diethanolamine, and N-methyldiethanolamine solutions by developing an artificial neural network. Following this, the data outliers are identified by applying a modified IQR method and compared graphically to 2.5 standard deviation method. The identified data outliers can then be truncated or winsorised for developing reliable and accurate models/simulations. The modified IQR method coupled with a neural network (based on the normalised data values) can robustly identify data outliers within a large experimental dataset. The proposed approach is superior to the previous data outlier identification techniques that used 2.5 standard deviations method, as it alleviates the need for a human decision in determining the congruence of experimental values. The results also indicate that the developed method can be reliably extended to other/larger non-linear experimental datasets having similar correlative complexity. © 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature. date: 2023 publisher: Springer Science and Business Media Deutschland GmbH official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139545410&doi=10.1007%2fs00521-022-07904-z&partnerID=40&md5=97469bda6b87592a38118dbb42645fc5 id_number: 10.1007/s00521-022-07904-z full_text_status: none publication: Neural Computing and Applications volume: 35 number: 4 pagerange: 3395-3412 refereed: TRUE issn: 09410643 citation: Imai, B. and Nasir, Q. and Maulud, A.S. and Nawaz, M. and Nasir, R. and Suleman, H. (2023) Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets. Neural Computing and Applications, 35 (4). pp. 3395-3412. ISSN 09410643