eprintid: 11137 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/01/11/37 datestamp: 2023-11-10 03:25:40 lastmod: 2023-11-10 03:25:40 status_changed: 2023-11-10 01:14:33 type: conference_item metadata_visibility: show creators_name: Khan, F.U. creators_name: Aziz, I.B.A. title: Reducing high variability in medical image collection by a novel cluster based synthetic oversampling technique ispublished: pub keywords: Big data; Medical imaging, Clustering; Clustering methods; Concept detection; Data sampling; Intra-class variation; Membership degrees; Oversampling technique; Visual concept detections, Data acquisition note: cited By 1; Conference of 2019 IEEE Conference on Big Data and Analytics, ICBDA 2019 ; Conference Date: 19 November 2019 Through 21 November 2019; Conference Code:157670 abstract: In general, there are two open challenges for domain specific visual concept detection. First is the high intra-class variations and second is to collect large collection of sample training data covering the wide variety. In this research we present a novel medical image sampling approach to handle these two challenges. For huge intra-class variations present in the data collection we propose a unique clustering method to group similar data samples. We propose to measure similarity on the basis of membership degree between candidate groups. At the same time we handled the issue of large data collection over wide variety. We synthetically re-sampled the data on the basis of membership score. The membership score helped to make use of minor groups which has small sample size. Experiments show that our proposed method can achieve promising results and outperforms existing approaches particularly for medical image concept detection. © 2019 IEEE. date: 2019 publisher: Institute of Electrical and Electronics Engineers Inc. official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85080867499&doi=10.1109%2fICBDA47563.2019.8987171&partnerID=40&md5=9db3bed50c6f2fcb65821d9fcc009ee2 id_number: 10.1109/ICBDA47563.2019.8987171 full_text_status: none publication: 2019 IEEE Conference on Big Data and Analytics, ICBDA 2019 pagerange: 45-50 refereed: TRUE isbn: 9781728133089 citation: Khan, F.U. and Aziz, I.B.A. (2019) Reducing high variability in medical image collection by a novel cluster based synthetic oversampling technique. In: UNSPECIFIED.