eprintid: 13476
rev_number: 2
eprint_status: archive
userid: 1
dir: disk0/00/01/34/76
datestamp: 2023-11-10 03:28:02
lastmod: 2023-11-10 03:28:02
status_changed: 2023-11-10 01:51:16
type: article
metadata_visibility: show
creators_name: Alam, M.K.
creators_name: Aziz, A.A.
creators_name: Latif, S.A.
creators_name: Awang, A.
title: Error-aware data clustering for in-network data reduction in wireless sensor networks
ispublished: pub
keywords: Anomaly detection; Cluster analysis; Random errors; Reduction; Sensor nodes; Statistics; Wireless sensor networks, Environmental Monitoring; In networks; K-means; K-medoids; Partitional clustering; Time series clustering, K-means clustering
note: cited By 18
abstract: A wireless sensor network (WSN) deploys hundreds or thousands of nodes that may introduce large-scale data over time. Dealing with such an amount of collected data is a real challenge for energy-constraint sensor nodes. Therefore, numerous research works have been carried out to design efficient data clustering techniques in WSNs to eliminate the amount of redundant data before transmitting them to the sink while preserving their fundamental properties. This paper develops a new error-aware data clustering (EDC) technique at the cluster-heads (CHs) for in-network data reduction. The proposed EDC consists of three adaptive modules that allow users to choose the module that suits their requirements and the quality of the data. The histogram-based data clustering (HDC) module groups temporal correlated data into clusters and eliminates correlated data from each cluster. Recursive outlier detection and smoothing (RODS) with HDC module provides error-aware data clustering, which detects random outliers using temporal correlation of data to maintain data reduction errors within a predefined threshold. Verification of RODS (V-RODS) with HDC module detects not only random outliers but also frequent outliers simultaneously based on both the temporal and spatial correlations of the data. The simulation results show that the proposed EDC is computationally cheap, able to reduce a significant amount of redundant data with minimum error, and provides efficient error-aware data clustering solutions for remote monitoring environmental applications. © 2020 by the authors. Licensee MDPI, Basel, Switzerland.
date: 2020
publisher: MDPI AG
official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85079666769&doi=10.3390%2fs20041011&partnerID=40&md5=199cc1c705dd357bec97c5816003e805
id_number: 10.3390/s20041011
full_text_status: none
publication: Sensors (Switzerland)
volume: 20
number: 4
refereed: TRUE
issn: 14248220
citation: Alam, M.K. and Aziz, A.A. and Latif, S.A. and Awang, A. (2020) Error-aware data clustering for in-network data reduction in wireless sensor networks. Sensors (Switzerland), 20 (4). ISSN 14248220