Please use this identifier to cite or link to this item:
https://ptsldigital.ukm.my/jspui/handle/123456789/394961
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ali Selamat | - |
dc.contributor.author | Choon-Ching Ng | - |
dc.date.accessioned | 2023-06-15T07:52:52Z | - |
dc.date.available | 2023-06-15T07:52:52Z | - |
dc.identifier.other | ukmvital:122457 | - |
dc.identifier.uri | https://ptsldigital.ukm.my/jspui/handle/123456789/394961 | - |
dc.description.abstract | In general, a web document page may contain several script forms. Each script can be used for constructing different languages. Determining the languages of the document is the required to effectively be able to apply many search and information retrieval techniques. In this work, we propose hybrid-grams feature selection methods by integrating unigram and bigrams. The method makes use of local statistical information or data within a document to determine the language. From the experiments, we have noticed that hybrid-grams are outperformed than unigram and bigrams in Cyrillic and Indic script language identifications. | - |
dc.language.iso | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE),Piscataway, US | - |
dc.subject | Unigram language | - |
dc.subject | Neutral network | - |
dc.title | Unigram language identification using adaptive neutral network | - |
dc.type | Seminar Papers | - |
dc.format.pages | 5 | - |
dc.identifier.callno | T58.5.C634 2008 kat sem j.2 | - |
dc.contributor.conferencename | International Symposium on Information Technology | - |
dc.coverage.conferencelocation | Kuala Lumpur Convention Centre | - |
dc.date.conferencedate | 26/08/2008 | - |
Appears in Collections: | Seminar Papers/ Proceedings / Kertas Kerja Seminar/ Prosiding |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.