Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476298
Title: A hybrid of machine learning techniques and lexicon-based approach for Arabic opinion question answering
Authors: Al-Ismaily Khalid Khalifa Juma (P63366)
Supervisor: Prof. Dr. Nazlia Omar
Keywords: Arabic Opinion QA
Difficulties
Lexicon-based approach
Hybrid method of machine learning techniques
Semantic computing.
Issue Date: 2-Jun-2014
Description: Opinion Question Answering (Opinion QA) is the task of enabling users to explore others' opinions toward a particular service or product in order to make decisions. Arabic Opinion QA is more challenging than Opinion QA in other languages because Arabic has a complex morphology and many dialects. In addition, few research efforts and resources focus on Opinion QA in Arabic. This study aims to address the difficulties of Arabic Opinion QA by proposing a hybrid method that combines machine learning techniques with a lexicon-based approach. The machine learning techniques used in this study consist of three classifiers: Naive Bayes (NB), Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). The proposed method contains pre-processing phases such as transformation, normalization and tokenization, and exploits auxiliary information (a thesaurus). The lexicon-based approach is executed by replacing some words with their synonyms using the domain dictionary. A classifier then classifies the opinions by positive or negative sentiment polarity. The proposed method has been evaluated using the common information retrieval metrics, i.e. Precision, Recall and F-measure. The experimental results have demonstrated that NB outperforms SVM and KNN by achieving 91% accuracy.

Opinion question answering is the task of enabling users to explore other people's opinions about a service or product when making decisions. Its difficulty arises from the fact that it combines two challenging natural language processing tasks, namely sentiment analysis and question answering, in contrast to traditional question answering applications, which look for factual information in response to a question. Opinion question answering is harder because it seeks users' sentiment-bearing opinions about a specific target. Apart from that, little significant effort has gone into studying opinion question answering in Arabic. There are several reasons why Arabic question answering is rather challenging. Arabic has a complex morphology compared with other languages, and it has many dialects. This leads to a further difficulty: most writers express questions and opinions in local dialects rather than in Standard Arabic. This study aims to address the difficulties of opinion question answering in Arabic by proposing a combination of a lexicon-based approach and classification using Naive Bayes. The proposed method contains pre-processing phases such as transformation, normalization and tokenization, and exploits thesaurus information. The experimental results showed an accuracy of 91%. The findings show encouraging results in the field of opinion question answering.
Degree level: Master
Pages: 81
Call Number: QA76.5913 .I837 2014 3
Publisher: UKM, Bangi
URI: https://ptsldigital.ukm.my/jspui/handle/123456789/476298
Appears in Collections: Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat
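
The pipeline outlined in the Description (normalization and tokenization, synonym replacement from a domain dictionary, polarity classification, then Precision/Recall/F-measure) can be illustrated with a minimal sketch. This is not the thesis implementation: the record does not give the actual normalization rules, dictionary, features or data. The `SYNONYMS` table, the function and class names, and the six-sentence toy training set are all hypothetical, and the classifier is a plain multinomial Naive Bayes with add-one smoothing written against the Python standard library only.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical domain dictionary (not from the thesis): maps a word to one
# canonical synonym so that near-synonyms share a single feature.
SYNONYMS = {"رائع": "ممتاز", "جميل": "ممتاز", "رديء": "سيئ"}

# Short-vowel diacritics (U+064B..U+0652) and tatweel (U+0640).
NOISE = re.compile("[\u064B-\u0652\u0640]")

def normalize(text):
    """Assumed normalization: strip diacritics/tatweel, unify alef and yaa forms."""
    text = NOISE.sub("", text)
    text = re.sub("[أإآ]", "ا", text)
    return text.replace("ى", "ي")

def tokenize(text):
    """Split into runs of Arabic letters."""
    return re.findall("[\u0621-\u064A]+", text)

def preprocess(text):
    """Normalize, tokenize, then apply the lexicon-based synonym replacement."""
    return [SYNONYMS.get(tok, tok) for tok in tokenize(normalize(text))]

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())

        def log_score(label):
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.class_counts[label] / total_docs)
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)
            return score

        return max(self.class_counts, key=log_score)

def precision_recall_f1(gold, predicted, positive="pos"):
    """Precision, recall and F-measure for the chosen positive class."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Toy training data, invented for illustration only.
TRAIN = [
    ("الفندق رائع", "pos"), ("الطعام ممتاز", "pos"), ("المكان جميل", "pos"),
    ("الفندق سيئ", "neg"), ("الطعام رديء", "neg"), ("المكان سيئ", "neg"),
]
model = NaiveBayes().fit([preprocess(t) for t, _ in TRAIN], [y for _, y in TRAIN])
```

Because synonym replacement runs before training and before prediction, a word that appears only once in the toy data (such as "جميل") is counted under its canonical form, which is the effect the lexicon-based step is meant to have on sparse vocabulary. Swapping in SVM or KNN, as the study does for comparison, would only replace the `NaiveBayes` class; the preprocessing and evaluation functions would stay the same.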

Files in This Item:
File: ukmvital_81482+SOURCE1+SOURCE1.0.PDF
Description: Restricted Access
Size: 1.39 MB
Format: Adobe PDF

