%0 Journal Article %@ 20763417 %A Usmani, U.A. %A Watada, J. %A Jaafar, J. %A Aziz, I.A. %A Roy, A. %D 2021 %F scholars:14362 %I MDPI %J Applied Sciences (Switzerland) %N 20 %R 10.3390/app11209367 %T A reinforcement learning algorithm for automated detection of skin lesions %U https://khub.utp.edu.my/scholars/14362/ %V 11 %X Skin cancers are increasing at an alarming rate, and detection in the early stages is essential for advanced treatment. The current segmentation methods have limited labeling ability to the ground truth images due to the numerous noisy expert annotations present in the datasets. The precise boundary segmentation is essential to correctly locate and diagnose the various skin lesions. In this work, the lesion segmentation method is proposed as a Markov decision process. It is solved by training an agent to segment the region using a deep reinforcement-learning algorithm. Our method is similar to the delineation of a region of interest by the physicians. The agent follows a set of serial actions for the region delineation, and the action space is defined as a set of continuous action parameters. The segmentation model learns in continuous action space using the deep deterministic policy gradient algorithm. The proposed method enables continuous improvement in performance as we proceed from coarse segmentation results to finer results. Finally, our proposed model is evaluated on the International Skin Imaging Collaboration (ISIC) 2017 image dataset, Human against Machine (HAM10000), and PH2 dataset. On the ISIC 2017 dataset, the algorithm achieves an accuracy of 96.33 for the naevus cases, 95.39 for the melanoma cases, and 94.27 for the seborrheic keratosis cases. The other metrics are evaluated on these datasets and rank higher when compared with the current state-of-the-art lesion segmentation algorithms. © 2021 by the authors. Licensee MDPI, Basel, Switzerland. %Z cited By 10