TY - JOUR EP - 4758 SN - 22414487 PB - Dr D. Pylarinos SP - 4755 TI - Performance Analysis of Duplicate Record Detection Techniques N1 - cited By 2 AV - none VL - 9 UR - https://www.scopus.com/inward/record.uri?eid=2-s2.0-85086180308&doi=10.48084%2fetasr.3036&partnerID=40&md5=87e9377bb709a8c3ad401f89067e31b5 JF - Engineering, Technology and Applied Science Research A1 - Adil, S.H. A1 - Ali, S.S.A. A1 - Raza, K. A1 - Ebrahim, M. Y1 - 2019/// ID - scholars11226 N2 - In this paper, a comprehensive performance analysis of duplicate data detection techniques for relational databases has been performed. The research focuses on traditional SQL based and modern bloom filter techniques to find and eliminate records which already exist in the database while performing bulk insertion operation (i.e. bulk insertion involved in the loading phase of the Extract, Transform, and Load (ETL) process and data synchronization in multisite database synchronization). The comprehensive performance analysis was performed on several data sizes using SQL, bloom filter, and parallel bloom filter. The results show that the parallel bloom filter is highly suitable for duplicate detection in the database. © 2019, Dr D. Pylarinos. All rights reserved. IS - 5 ER -