Pseudo-relevance feedback for English translated Qur'anic text retrieval

Yasir Hadi Farhan

Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476441

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Shahrul Azman Mohd Noah, Prof. Dr.
dc.contributor.author	Yasir Hadi Farhan
dc.contributor.other	P74154	-
dc.date.accessioned	2023-10-06T09:18:31Z	-
dc.date.available	2023-10-06T09:18:31Z	-
dc.date.issued	2016-12-27
dc.identifier.other	ukmvital:85959
dc.identifier.other	P74154	-
dc.identifier.uri	https://ptsldigital.ukm.my/jspui/handle/123456789/476441	-
dc.description	One of the biggest issues affecting the performance of Information Retrieval (IR) systems is the difficulty users face in defining exactly what their information needs are, since the information sought may be a gap in their own knowledge. This issue is more problematic for classical and literary documents such as the al-Quran. One approach to overcoming it is pseudo-relevance feedback, which assumes that a small number of top-ranked documents in the initial retrieval results are relevant. It selects related terms from these documents to improve the query representation through query expansion. Among the issues in the Quranic text are the ambiguity and complexity of the text. Because of these issues, users need to reformulate and refine their queries to match their information needs. Pseudo-relevance feedback can help relieve these issues. The classic Rocchio algorithm has been widely used to support query reformulation in pseudo-relevance feedback. In this research, a modified Rocchio algorithm was proposed that takes term selection and query importance into account. It combines the term frequency and inverse document frequency (TF-IDF) weights with the Rocchio algorithm's weights to generate a new query. It also uses the frequency of terms to choose suitable expansion words. The proposed algorithm was evaluated against the probabilistic IR model implemented in the Lucene toolkit and against the WordNet query expansion approach. The experiments considered relevance feedback for two iterations only. The evaluation used a Quranic dataset previously used by other researchers. Twelve queries were considered during the evaluation. The results of the experiments showed that the proposed method exhibits significant improvement in recall and precision.
The average precision with pseudo-relevance feedback was 8.3% for the first iteration and 11.3% for the second iteration, whereas the average precision was 3.3% for Lucene and 2.7% for WordNet query expansion. These results show that the proposed method improves retrieval performance.
dc.description	"Certification of Master's/Doctoral Thesis" is not available
dc.language.iso	eng
dc.publisher	UKM, Bangi
dc.relation	Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat
dc.subject	Information Retrieval
dc.subject	English translation
dc.subject	Qur'anic text
dc.subject	Pseudo-relevance feedback
dc.subject	Dissertations, Academic -- Malaysia
dc.title	Pseudo-relevance feedback for English translated Qur'anic text retrieval
dc.type	theses
dc.rights.holder	UKM	-
dc.format.pages	103
dc.identifier.barcode	002650(2017)
Appears in Collections:	Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat
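
The abstract above describes query expansion through pseudo-relevance feedback with Rocchio-style reweighting over TF-IDF vectors. The record does not give the thesis's modified formula, so the following is only a minimal sketch of the classic Rocchio update applied to pseudo-relevance feedback, not the author's method. Everything in it is an assumption made for illustration: the function names, the stop-word list, the smoothed IDF variant `log(N/df) + 1`, the parameter values `alpha=1.0` and `beta=0.75`, and the toy sentences, which are placeholders rather than text from the Quranic dataset.

```python
import math
from collections import Counter

# Hypothetical stop-word list, just large enough for the toy corpus below.
STOPWORDS = {"and", "the", "for", "of", "in", "toward"}

def tokenize(text):
    """Lowercase, split on whitespace, drop non-alphabetic tokens and stop words."""
    return [t for t in text.lower().split() if t.isalpha() and t not in STOPWORDS]

def tfidf_vectors(docs):
    """Return (list of sparse TF-IDF dicts, IDF dict) for a list of strings."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}  # assumed smoothed IDF
    vectors = [{t: c * idf[t] for t, c in Counter(toks).items()} for toks in tokenized]
    return vectors, idf

def query_vector(query, idf):
    """TF-IDF vector for the query; terms unseen in the corpus are dropped."""
    counts = Counter(tokenize(query))
    return {t: c * idf[t] for t, c in counts.items() if t in idf}

def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def rank(query_vec, doc_vecs):
    """Document indices sorted by descending cosine score (ties by index)."""
    scored = [(cosine(query_vec, d), i) for i, d in enumerate(doc_vecs)]
    return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]

def rocchio_prf(query_vec, doc_vecs, ranking, k=2, alpha=1.0, beta=0.75, n_terms=2):
    """Classic Rocchio update, treating the top-k ranked documents as relevant.

    There is no negative (gamma) term because pseudo-relevance feedback
    has no judged non-relevant set.
    """
    top = [doc_vecs[i] for i in ranking[:k]]
    centroid = Counter()
    for vec in top:
        for t, w in vec.items():
            centroid[t] += w / len(top)
    # Expansion candidates: heaviest centroid terms not already in the query.
    candidates = sorted(
        (t for t in centroid if t not in query_vec),
        key=lambda t: (-centroid[t], t),
    )[:n_terms]
    new_query = {t: alpha * w for t, w in query_vec.items()}
    for t in list(query_vec) + candidates:
        new_query[t] = new_query.get(t, 0.0) + beta * centroid.get(t, 0.0)
    return new_query

# Toy corpus (placeholder sentences, not the thesis dataset).
docs = [
    "mercy and forgiveness for the believers",
    "mercy and compassion toward the orphans",
    "rules of trade and fair measure",
    "patience in hardship brings reward",
]
doc_vecs, idf = tfidf_vectors(docs)
query_vec = query_vector("mercy", idf)
ranking = rank(query_vec, doc_vecs)                  # initial retrieval
expanded = rocchio_prf(query_vec, doc_vecs, ranking)  # one feedback iteration
reranked = rank(expanded, doc_vecs)                  # retrieval with expanded query
```

A second feedback iteration, as in the experiments the abstract reports, would call `rocchio_prf` again with `expanded` and `reranked` in place of `query_vec` and `ranking`.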

Files in This Item:

File	Description	Size	Format
ukmvital_85959+SOURCE1+SOURCE1.0.PDF (Restricted Access)		341.06 kB	Adobe PDF
