Please use this identifier to cite or link to this item: https://ptsldigital.ukm.my/jspui/handle/123456789/476632
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorNazlia Omar, Assoc. Prof. Dr.
dc.contributor.authorMona Mohamed Ali Ahmed Asharef (P49070)
dc.date.accessioned2023-10-06T09:22:50Z-
dc.date.available2023-10-06T09:22:50Z-
dc.date.issued2011-08-01
dc.identifier.otherukmvital:122214
dc.identifier.urihttps://ptsldigital.ukm.my/jspui/handle/123456789/476632-
dc.descriptionNamed entity recognition (NER) systems aim to determine and classify automatically the proper nouns, such as names of people, names of locations, dates and time indications, and events in texts. NER systems play a significant role in many areas of Natural Language Processing (NLP) such as question answering systems, text summarization, text classification, information retrieval, information extraction systems and machine translation. Many researchers have attempt to solve this problem in a variety of languages, but only few limited researches have focused on NER for Arabic language. All previous Arabic named entity recognition systems have been built to extract the named entities from several text types. The goal of this research is to design and develop Arabic named entity recognition system in the crime domain. We identify five types of named entities; people, location, organization, date and time. We used rule-based approach to identify the names of entities in crime documents. We produced our corpus by choosing crime documents from Arabic news papers and tagged them by using an automatic tagger (HVMM POS) and we used this corpus as an input for our system. The rules are constructed based on part of speech (POS) tags of the input texts and word lists which consists of indicative verbs and important keywords. In addition, the Gazetteer, which is a dictionary of names, is also used in the development of the rules. The accuracy of the result of our system is 90% which it indicates that the rule based is effective method and the performance of the achieved system is satisfactory.,Master of Information Technology,Certification of Master's / Doctoral Thesis" is not available"
dc.language.isoeng
dc.publisherUKM, Bangi
dc.relationFaculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat
dc.rightsUKM
dc.subjectNatural language processing (Computer science)
dc.subjectArabic language -- Data processing
dc.subjectUniversiti Kebangsaan Malaysia -- Dissertations
dc.subjectDissertations, Academic -- Malaysia
dc.titleRule based Arabic named entity recognition for crime documents
dc.typetheses
dc.format.pages110
dc.identifier.callnoQA76.9.N38A834 2012 3 tesis
dc.identifier.barcode005608 (2021)
Appears in Collections:Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat

Files in This Item:
File Description SizeFormat 
ukmvital_122214+SOURCE1+SOURCE1.0.PDF
  Restricted Access
13.89 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.