Detection and recognition via adaptive binarization and fuzzy clustering

Saad Mohmad Saad Ismail (UKM); Siti Norul Huda Sheikh Abdullah (UKM); Fariza Fauzi (UKM)

Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/578173

Full metadata record

DC Field	Value	Language
dc.contributor.author	Saad Mohmad Saad Ismail (UKM)
dc.contributor.author	Siti Norul Huda Sheikh Abdullah (UKM)
dc.contributor.author	Fariza Fauzi (UKM)
dc.date.accessioned	2023-11-06T02:58:55Z	-
dc.date.available	2023-11-06T02:58:55Z	-
dc.date.issued	2019-10
dc.identifier.issn	0128-7680
dc.identifier.other	ukmvital:113358
dc.identifier.uri	https://ptsldigital.ukm.my/jspui/handle/123456789/578173	-
dc.description	Detection and identification of text in natural scene images pose major challenges: image quality varies as scenes are taken under different conditions (lighting, angle and resolution) and the contained text entities can be in any form (size, style and orientation). In this paper, a robust approach is proposed to localize, extract and recognize scene texts of different sizes, fonts and orientations from images of varying quality. The proposed method consists of the following steps: preprocessing and enhancement of input image using the National Television System Committee (NTSC) color mapping and the contrast enhancement via mean histogram stretching; candidate text regions detection using hybrid adaptive segmentation and fuzzy c-means clustering techniques; a two-stage text extraction from the candidate text regions to filter out false text regions include local character filtering according to a rule-based approach using shape and statistical features and text region filtering via stroke width transform (SWT); and finally, text recognition using Tesseract OCR engine. The proposed method was evaluated using two benchmark datasets: ICDAR2013 and KAIST image datasets. The proposed method effectively dealt with complex scene images containing texts of various font sizes, colors, and orientation; and outperformed state-of-the-art methods, achieving >80% in both precision and recall measures.
dc.language.iso	en
dc.publisher	Universiti Putra Malaysia Press
dc.relation.haspart	Pertanika Journals
dc.relation.uri	http://www.pertanika.upm.edu.my/regular_issues.php?jtype=2
dc.rights	UKM
dc.subject	Adaptive binarization
dc.subject	Fuzzy C-means
dc.subject	Image enhancement
dc.subject	Statistical and geometrical features
dc.subject	Text detection
dc.subject	Text extraction
dc.title	Detection and recognition via adaptive binarization and fuzzy clustering
dc.type	Journal Article
dc.format.volume	27
dc.format.pages	1759-1781
dc.format.issue	4
Appears in Collections:	Journal Content Pages/ Kandungan Halaman Jurnal

Files in This Item:

File	Description	Size	Format
ukmvital_113358+Source01+Source010.PDF		1.91 MB	Adobe PDF	View/Open

Show simple item record Recommend this item