Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476269
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorNazlia Omar, Prof. Madya Dr.-
dc.contributor.authorAhmed Ali Mohammed Al-Saffar (P72233)-
dc.date.accessioned2023-10-06T09:15:34Z-
dc.date.available2023-10-06T09:15:34Z-
dc.date.issued2015-07-14-
dc.identifier.otherukmvital:80507-
dc.identifier.urihttps://ptsldigital.ukm.my/jspui/handle/123456789/476269-
dc.descriptionSentiment analysis or opinion mining is a field that analyses people's opinions towards entities such as products and organizations. Although many studies that focus on sentiment analysis have been conducted, there remains a limited amount of studies that focus on sentiment analysis in the Malay language. Malay sentiment analysis is difficult due to the nature of Malay language which consists of many synonyms and also due to the limitation of available language resources in Malay. This research aims to propose a model for Malay sentiment classification based on selecting the best probabilistic classifier. This study proposes a new set of features, in conjunction with previously proposed features, to capture sentiment features in documents. A Malay sentiment lexicon is used for this purpose. The features values are fed to train three machine learning based classifiers, namely Support Vector Machine (SVM), Naive Bayes (NB), and K-Nearest Neighbour (KNN). A voting algorithm is used to integrate the overall strength of the three classifiers. For the experiments, 2000 reviews written in the Malay language in the movie domain are used. The results show that the combined classifiers, used in conjunction with the lexicon-based method, obtained an accuracy of 93.50% and outperformed the baseline. This shows that the proposed model is viable for the Malay sentiment classification tasks.,Master of Information Technology-
dc.language.isoeng-
dc.publisherUKM, Bangi-
dc.relationFaculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat-
dc.rightsUKM-
dc.subjectSemantic computing-
dc.subjectComputational linguistics-
dc.subjectMalay language -- Data processing-
dc.subjectUniversiti Kebangsaan Malaysia -- Dissertations-
dc.subjectDissertations, Academic -- Malaysia-
dc.titleMalay sentiment classification based on machine learning and lexicon based approach-
dc.typetheses-
dc.format.pages92-
dc.identifier.callnoQA76.5913.S236 2015 3 tesis-
dc.identifier.barcode002073 (2016)-
Appears in Collections:Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat

Files in This Item:
File Description SizeFormat 
ukmvital_80507+SOURCE1+SOURCE1.0.PDF
  Restricted Access
364.31 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.