eprintid: 1766 rev_number: 2 eprint_status: archive userid: 1 dir: disk0/00/00/17/66 datestamp: 2023-11-09 15:49:56 lastmod: 2023-11-09 15:49:56 status_changed: 2023-11-09 15:41:18 type: conference_item metadata_visibility: show creators_name: Hairuman, I.F.Bt. creators_name: Foong, O.-M. title: OCR signage recognition with skew & slant correction for visually impaired people ispublished: pub keywords: Canny edge detection; Computer vision system; Hough Transformation; MicroSoft; Recognition models; Recognition rates; slant correction; Speech applications; Speech synthesizer; Template-matching algorithms; Text file; Visually impaired; Visually impaired people; Way finding, Edge detection; Ferry boats; Intelligent systems; Optical character recognition; Shearing; Template matching, Algorithms note: cited By 11; Conference of 2011 11th International Conference on Hybrid Intelligent Systems, HIS 2011 ; Conference Date: 5 December 2011 Through 8 December 2011; Conference Code:88378 abstract: It is a challenge for visually impaired people (VIPs) to navigate independently whenever they attempt to find their way in unfamiliar buildings searching for amenities (i.e. exits, ladies/gents toilets) even with a walking stick or a guide dog. Camera-based computer vision systems have the potential to assist VIPs in independent navigation or way finding in unfamiliar places. To leverage on previous research of Signage Recognition Framework which could only recognize public signage with slanted angle less than30°, an improved OCR signage recognition model with skew and slant correction in public signage is presented. The proposed OCR method consists of Canny edge detection algorithm, Hough Transformation and Shearing Transformation were used to detect and correct skewed and slanted images. The proposed model would capture a public signage image, compare the image in the database using template matching algorithm and convert to machine readable text in a text file. The text will then be processed by Microsoft Speech Application Program Interface (SAPI) speech synthesizer and translated to voice as output. Experiments were conducted on 5 blind folded subjects to test the performance of the model. The proposed OCR recognition model has achieved satisfactory recognition rate of 82.7. © 2011 IEEE. date: 2011 official_url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84856723727&doi=10.1109%2fHIS.2011.6122123&partnerID=40&md5=5c7199da3b192d8c7f254a61980a192e id_number: 10.1109/HIS.2011.6122123 full_text_status: none publication: Proceedings of the 2011 11th International Conference on Hybrid Intelligent Systems, HIS 2011 place_of_pub: Malacca pagerange: 306-310 refereed: TRUE isbn: 9781457721502 citation: Hairuman, I.F.Bt. and Foong, O.-M. (2011) OCR signage recognition with skew & slant correction for visually impaired people. In: UNSPECIFIED.