Please use this identifier to cite or link to this item:
https://ptsldigital.ukm.my/jspui/handle/123456789/394963
Title: | Measuring the representativeness of index terms in literary texts: an experiment on the Quran |
Authors: | Hayati Abd Rahman; Shahrul Azman Noah; Hector Jimenez-Salazar |
Conference Name: | International Symposium on Information Technology |
Keywords: | Index terms; Literary texts; Quran; Document classification |
Conference Date: | 26/08/2008 |
Conference Location: | Kuala Lumpur Convention Centre |
Abstract: | A concept hierarchy is a hierarchically organized collection of domain concepts. It is particularly useful in applications such as information retrieval, document browsing and document classification. One of the important tasks in constructing a concept hierarchy is identifying suitable terms while keeping the domain vocabulary at an appropriate size. One way of achieving such a size is term reduction. The aim of this paper is to examine how effectively term selection methods reduce the size of the vocabulary. An experiment was conducted on the Quran, which is assumed to be a literary text. The experiment compares the entropy method, the transition point method and a hybrid of the transition point and entropy methods with the Vector Space Model (VSM). Results indicate that the transition point method is more effective than the others at reducing the size of the vocabulary while preserving the important terms that exist in the literary documents. |
Pages: | 5 |
Call Number: | T58.5.C634 2008 kat sem j.2 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE), Piscataway, US |
Appears in Collections: | Seminar Papers/ Proceedings / Kertas Kerja Seminar/ Prosiding |