Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476413
Title: Solving clustering problem in data mining using a modifired tabu search algorithm
Authors: Kharrousheh Adnan Naim (P53698)
Supervisor: Salwani Abdullah, Prof. Dr.
Keywords: Data mining
Issue Date: 30-Jun-2011
Description: Clustering problem in data mining is a process that organizes a data set into a number of clusters such that the data in each cluster share a high degree of similarity while being very dissimilar to data from other clusters. This problem is defined as NP-hard optimization problems. Over last decades, a numerous of approaches have proposed to solve this problem, such as simulated annealing, genetic algorithm, ant colony and artificial bee colony. This thesis considers a modified tabu search algorithm as one of the methods to solve the clustering problem. The objective of this thesis is to present the modified tabu search algorithm that can be used to find the minimum sum-of-squares clustering problem. The heuristic algorithm that has been applied in this thesis is divided into two phases. The first phase is a constructive phase, which is responsible to generate an initial solution using K-means approach. Whilst, the second phase is an improvement phase, aims to enhance the solution quality that is obtained from the first phase. A modified tabu search algorithm is proposed as an improvement method, where two tabu lists are introduced. The first tabu list is used to keep the neighborhood structures, and the second tabu list is used to keep the moves. These two tabu lists are employed to prevent a cycle (get stuck with the same solution) through the search process. The proposed of modified tabu search algorithm was experimented and tested on five benchmark datasets that are available at the University of California at Irvine (UCI) Machine Learning Repository, which cover low, medium and high dimensions. Computational results show that the proposed approach was effective in finding the minimum distance between points and their centers in each cluster. Based on the comparative experiment results, we observed that the modified tabu search method outperforms the other two techniques i.e. k-means and a standard tabu search alone.,Master/ Sarjana
Pages: 73
Call Number: QA76.9.D343K482 2011
Publisher: UKM, Bangi
Appears in Collections:Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat

Files in This Item:
File Description SizeFormat 
ukmvital_85082+SOURCE1+SOURCE1.0.PDF
  Restricted Access
916.67 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.