Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476185
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorMohd. Juzaiddin Ab Aziz, Prof. Dr.
dc.contributor.authorHamed Hamdoon Ali Al-Balushi (P65643 )
dc.date.accessioned2023-10-06T09:14:27Z-
dc.date.available2023-10-06T09:14:27Z-
dc.date.issued2014-06-09
dc.identifier.otherukmvital:75235
dc.identifier.urihttps://ptsldigital.ukm.my/jspui/handle/123456789/476185-
dc.descriptionArabic noun compound extraction has become a challenging issue in the field of NLP. Several approaches have been proposed in terms of extracting Arabic noun compounds. Some of them have used linguistic-based approach, statistical methods and the rest have used a hybrid between them. However, there is still a significant demand for improving nested Arabic noun compound extraction in terms of the accuracy. This research proposes a hybrid method of linguistic-based approach and statistical method in order to enhance the extraction of nested Arabic noun compound. The dataset has been collected from online Arabic newspaper archive from Aljazeara.net and Almotamar.net. Several pre-processing steps have been carried out on the data including transformation, normalization, stemming and POS tagging. After that, an n-gram is used to generate bi-gram, tri-gram, 4-gram, and 5-gram candidates of noun compound. Then three association measures which are NC-value, PMI and LLR have been used in order to rank the candidates. The evaluation has been performed using the n-best method with a human annotation (manual selection by expertise). NC-value has outperformed PMI and LLR in terms of extracting nested noun compounds.,Master of Information Technology
dc.language.isoeng
dc.publisherUKM, Bangi
dc.relationFaculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat
dc.rightsUKM
dc.subjectHybrid method
dc.subjectUniversiti Kebangsaan Malaysia -- Dissertations
dc.subjectDissertations, Academic -- Malaysia
dc.titleA hybrid method of linguistic approach and statistical method for nested noun compound extraction
dc.typetheses
dc.format.pages84
dc.identifier.callnoP98.A434 2014 3 tesis
dc.identifier.barcode001232
dc.identifier.barcode005656(2021)(PL2)
Appears in Collections:Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat

Files in This Item:
File Description SizeFormat 
ukmvital_75235+Source01+Source010.PDF
  Restricted Access
1.7 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.