Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/578173
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSaad Mohmad Saad Ismail (UKM)
dc.contributor.authorSiti Norul Huda Sheikh Abdullah (UKM)
dc.contributor.authorFariza Fauzi (UKM)
dc.date.accessioned2023-11-06T02:58:55Z-
dc.date.available2023-11-06T02:58:55Z-
dc.date.issued2019-10
dc.identifier.issn0128-7680
dc.identifier.otherukmvital:113358
dc.identifier.urihttps://ptsldigital.ukm.my/jspui/handle/123456789/578173-
dc.descriptionDetection and identification of text in natural scene images pose major challenges: image quality varies as scenes are taken under different conditions (lighting, angle and resolution) and the contained text entities can be in any form (size, style and orientation). In this paper, a robust approach is proposed to localize, extract and recognize scene texts of different sizes, fonts and orientations from images of varying quality. The proposed method consists of the following steps: preprocessing and enhancement of input image using the National Television System Committee (NTSC) color mapping and the contrast enhancement via mean histogram stretching; candidate text regions detection using hybrid adaptive segmentation and fuzzy c-means clustering techniques; a two-stage text extraction from the candidate text regions to filter out false text regions include local character filtering according to a rule-based approach using shape and statistical features and text region filtering via stroke width transform (SWT); and finally, text recognition using Tesseract OCR engine. The proposed method was evaluated using two benchmark datasets: ICDAR2013 and KAIST image datasets. The proposed method effectively dealt with complex scene images containing texts of various font sizes, colors, and orientation; and outperformed state-of-the-art methods, achieving >80% in both precision and recall measures.
dc.language.isoen
dc.publisherUniversiti Putra Malaysia Press
dc.relation.haspartPertanika Journals
dc.relation.urihttp://www.pertanika.upm.edu.my/regular_issues.php?jtype=2
dc.rightsUKM
dc.subjectAdaptive binarization
dc.subjectFuzzy C-means
dc.subjectImage enhancement
dc.subjectStatistical and geometrical features
dc.subjectText detection
dc.subjectText extraction
dc.titleDetection and recognition via adaptive binarization and fuzzy clustering
dc.typeJournal Article
dc.format.volume27
dc.format.pages1759-1781
dc.format.issue4
Appears in Collections:Journal Content Pages/ Kandungan Halaman Jurnal

Files in This Item:
File Description SizeFormat 
ukmvital_113358+Source01+Source010.PDF1.91 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.