Unigram language identification using adaptive neural network

Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/394961

Full metadata record

DC Field	Value	Language
dc.contributor.author	Ali Selamat	-
dc.contributor.author	Choon-Ching Ng	-
dc.date.accessioned	2023-06-15T07:52:52Z	-
dc.date.available	2023-06-15T07:52:52Z	-
dc.identifier.other	ukmvital:122457	-
dc.identifier.uri	https://ptsldigital.ukm.my/jspui/handle/123456789/394961	-
dc.description.abstract	In general, a web document may contain several scripts, and each script can be used to write different languages. Determining the languages of a document is required in order to apply many search and information retrieval techniques effectively. In this work, we propose a hybrid-gram feature selection method that integrates unigrams and bigrams. The method uses local statistical information within a document to determine its language. In our experiments, hybrid-grams outperformed both unigrams and bigrams in identifying languages written in Cyrillic and Indic scripts.	-
dc.language.iso	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE), Piscataway, US	-
dc.subject	Unigram language	-
dc.subject	Neural network	-
dc.title	Unigram language identification using adaptive neural network	-
dc.type	Seminar Papers	-
dc.format.pages	5	-
dc.identifier.callno	T58.5.C634 2008 kat sem j.2	-
dc.contributor.conferencename	International Symposium on Information Technology	-
dc.coverage.conferencelocation	Kuala Lumpur Convention Centre	-
dc.date.conferencedate	26/08/2008	-
Appears in Collections:	Seminar Papers/ Proceedings / Kertas Kerja Seminar/ Prosiding

Files in This Item:

There are no files associated with this item.