Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/578053
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSaad Mohmad Saad Ismail (Universiti Kebangsaan Malaysia)
dc.contributor.authorSiti Norul Huda Sheikh Abdullah (Universiti Kebangsaan Malaysia)
dc.contributor.authorFariza Fauzi (Universiti Kebangsaan Malaysia)
dc.date.accessioned2023-11-06T02:58:09Z-
dc.date.available2023-11-06T02:58:09Z-
dc.date.issued2019-10
dc.identifier.issn0128-7680
dc.identifier.otherukmvital:129817
dc.identifier.urihttps://ptsldigital.ukm.my/jspui/handle/123456789/578053-
dc.descriptionDetection and identification of text in natural scene images pose major challenges: image quality varies as scenes are taken under different conditions (lighting, angle and resolution) and the contained text entities can be in any form (size, style and orientation). In this paper, a robust approach is proposed to localize, extract and recognize scene texts of different sizes, fonts and orientations from images of varying quality. The proposed method consists of the following steps: preprocessing and enhancement of input image using the National Television System Committee (NTSC) color mapping and the contrast enhancement via mean histogram stretching; candidate text regions detection using hybrid adaptive segmentation and fuzzy c-means clustering techniques; a two-stage text extraction from the candidate text regions to filter out false text regions include local character filtering according to a rule-based approach using shape and statistical features and text region filtering via stroke width transform (SWT); and finally, text recognition using Tesseract OCR engine. The proposed method was evaluated using two benchmark datasets: ICDAR2013 and KAIST image datasets. The proposed method effectively dealt with complex scene images containing texts of various font sizes, colors, and orientation; and outperformed state-of-the-art methods, achieving >80% in both precision and recall measures.
dc.language.isoen
dc.publisherUniversiti Putra Malaysia Press
dc.relation.haspartPertanika Journal of Sciences & Technology
dc.relation.urihttp://www.pertanika.upm.edu.my/pjst/browse/archives?journal=JST-27-4-10
dc.rights(c) Universiti Putra Malaysia Press
dc.subjectAdaptive binarization
dc.subjectFuzzy C-means
dc.subjectImage enhancement
dc.subjectStatistical and geometrical features
dc.subjectText detection; text extraction
dc.titleDetection and recognition via adaptive binarization and fuzzy clustering
dc.typeJournal Article
dc.format.volume27
dc.format.pages1759-1781
dc.format.issue4
Appears in Collections:Journal Content Pages/ Kandungan Halaman Jurnal

Files in This Item:
File Description SizeFormat 
ukmvital_129817+Source01+Source010.PDF1.95 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.