Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476271
Title: Trend detection in the Arabic social media using voting combination
Authors: Ali Sabah Abdulameer (P72235)
Supervisor: Saidah Saad, Dr.
Keywords: Data mining
Information retrieval
Machine learning
Online social networks
Universiti Kebangsaan Malaysia -- Dissertations
Dissertations, Academic -- Malaysia
Issue Date: 1-Aug-2015
Description: The amount of information have been increasing tremendously especially with the use of the social media applications such Twitter, Facebook and YouTube. Twitter is a common social application which enables users to share their current thoughts and actions, comment on breaking news and engage in discussions. Trends are typically driven by emerging events, breaking news and general topics that attract the attention of a large fraction of Twitter users. Trend detection is thus of high value to news reporters and analysts, as they might point to fast-evolving news stories. Trend detection of events is important to companies, governments, national security agencies, journalists to develop strategies to remedy them. Researcher have been trying to detect trends by using machine learning techniques such as clustering based method in major languages such as English, German, and French while Arabic language is still in its infancy. The Arab world has been contributing to a huge amount of data due to big events that have been occurring in the Middle East. The aim of this research to is detect trends in the arabic social media. However, this research need to overcome several issues such as processing of the Arabic user generated content and lack of resources. In order to solve these issues, this research presents a voting combination clustering approach which is divided into six phases which are dataset collection from Twitter, text pre-processing, spam filtering by using Naive Bayes, feature selection based on Term Frequency Inverse Document Frequency and Entropy, statistical analyses and evaluation. Three statistical approaches are used which are Co-occurence, K-means and voting combination clustering. The analysis was performed in order to classify the trends into three categories which are Arabic nationality events, personal events and other events. The experimental results showed that the voting combination clustering achieved 93%, 87% and 90% for precision, recall and f-measure in trend detection respectively.,Master of Computer Science
Pages: 81
Call Number: QA76.9.D343A236 2015 3 tesis
Publisher: UKM, Bangi
Appears in Collections:Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat

Files in This Item:
File Description SizeFormat 
ukmvital_80509+SOURCE1+SOURCE1.0.PDF
  Restricted Access
290.8 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.