Please use this identifier to cite or link to this item:
https://ptsldigital.ukm.my/jspui/handle/123456789/476576
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Nazlia Omar, Dr. | - |
dc.contributor.author | Hashim Baydaa (P78912) | - |
dc.date.accessioned | 2023-10-06T09:21:25Z | - |
dc.date.available | 2023-10-06T09:21:25Z | - |
dc.date.issued | 2016-10-12 | - |
dc.identifier.other | ukmvital:121283 | - |
dc.identifier.uri | https://ptsldigital.ukm.my/jspui/handle/123456789/476576 | - |
dc.description | Biomedical Named Entity Recognition (BNER) is the task of identifying biomedical instances such as genes, proteins, diseases, chemical compounds and others. Several approaches have been proposed for BNER specifically supervised machine learning techniques. Most of these techniques demonstrated reasonable performance. However, there is still a gap that lies on the multi-word BNEs such as the chemical compound 'Tri-acetyl-glucal-galactono-lactone' in which the classifier could not recognize these instances due to the complex characters used to separate the words. Complex characters could be punctuation (e.g. – *_^) or numeric characters. Therefore, this study aims to propose a Back-propagation Neural Network (BPNN) for identifying BNEs. BPNN has the ability to encode the characters which facilitate the identification of multi-word BNEs. For this purpose, this study developed multiple features for the encoding task including digits, special characters, affixes and capitalization. Experiments have been conducted using two benchmark datasets including SCAI and GENIA. SCAI is a corpus that contains chemical compounds, whereas GENIA is a corpus that contains multiple biomedical instances such as genes, proteins, DNA and RNA. Using 80% training and 20% testing, BPNN has shown 90% f-measure for the SCAI corpus and 82% f-measure for the GENIA corpus. Such results emphasize an enhancement of f-measure when compared to other related work. The enhancement obtained by BPNN referred to the multi-word BNEs that have been identified which usually are being incorrectly classified in other approaches. This can imply the effectiveness of BPNN in terms of classifying BNEs,“Certification of Master’s / Doctoral Thesis” is not available,Master of Computer Science | - |
dc.language.iso | eng | - |
dc.publisher | UKM, Bangi | - |
dc.relation | Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat | - |
dc.rights | UKM | - |
dc.subject | Universiti Kebangsaan Malaysia -- Dissertations | - |
dc.subject | Dissertations, Academic -- Malaysia | - |
dc.subject | Pattern recognition systems | - |
dc.subject | Natural language processing (Computer science) | - |
dc.subject | Medicine -- Data processing | - |
dc.title | Named entity recognition for multi-word biomedical using back propagation neural network approach | - |
dc.type | theses | - |
dc.format.pages | 92 | - |
dc.identifier.callno | TK7882.P3H368 2016 3 tesis | - |
dc.identifier.barcode | 003123 (2016) | - |
Appears in Collections: | Faculty of Information Science and Technology / Fakulti Teknologi dan Sains Maklumat |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ukmvital_121283+SOURCE1+SOURCE1.0.PDF Restricted Access | 1.05 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.