Hybrid Method for Tagging Arabic Text
Abstract
Many natural language expressions are ambiguous, and interpreting them requires other sources of information. For example, whether the word ﺗﻌﺎون should be read as a noun or as a verb depends on contextual cues. This study proposes a hybrid method for tagging Arabic words that combines a rule-based approach with machine learning. The method first applies rules (which consider post-position, word endings and patterns) and then corrects anomalies with memory-based learning (MBL). Memory-based learning is an efficient way to integrate various sources of information and to handle exceptional data in natural language processing tasks. In the second step, the exceptional cases of the rules are checked, and more information is made available to the learner for treating them. To evaluate the proposed method, a number of experiments were run, in part to examine the importance of the various sources of information in learning.
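The two-stage idea in the abstract (rules first, then a memory-based learner that treats the exceptions) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the three toy rules, the feature set, the five training examples, and the choice of a 1-nearest-neighbour classifier with an overlap distance as the memory-based learner. The rule's proposed tag is passed to the learner as one extra feature, which is one possible reading of "more information is made available to the learner".

```python
from collections import Counter

# Hypothetical toy rules for illustration only (not the paper's rule set).
PREPOSITIONS = {"في", "من", "على", "إلى"}   # a following word is usually a noun
VERB_PARTICLES = {"لم", "لن", "قد"}          # a following word is usually a verb


def rule_tag(prev_word, word):
    """Return the tag proposed by the first rule that fires, or "NONE"."""
    if prev_word in PREPOSITIONS:
        return "NOUN"
    if prev_word in VERB_PARTICLES:
        return "VERB"
    if word.startswith("ي"):  # imperfective prefix heuristic; has exceptions
        return "VERB"
    return "NONE"


def features(prev_word, word):
    """Feature vector: rule proposal, previous word, first/last letter, word."""
    return (rule_tag(prev_word, word), prev_word, word[:1], word[-1:], word)


def overlap_distance(a, b):
    """Number of feature positions whose values differ."""
    return sum(x != y for x, y in zip(a, b))


class MemoryBasedTagger:
    """Stores every training instance and tags by nearest neighbours."""

    def __init__(self, k=1):
        self.k = k
        self.memory = []  # list of (feature vector, gold tag)

    def train(self, examples):
        for prev_word, word, tag in examples:
            self.memory.append((features(prev_word, word), tag))

    def tag(self, prev_word, word):
        query = features(prev_word, word)
        if not self.memory:  # nothing stored yet: fall back to the rule
            return query[0]
        nearest = sorted(
            self.memory, key=lambda item: overlap_distance(query, item[0])
        )[: self.k]
        return Counter(tag for _, tag in nearest).most_common(1)[0][0]


# Hypothetical training data; the fourth item is an exception to the
# prefix rule (the noun "day" starts with the same letter as many verbs).
tagger = MemoryBasedTagger(k=1)
tagger.train([
    ("في", "تعاون", "NOUN"),
    ("قد", "تعاون", "VERB"),
    ("لم", "يكتب", "VERB"),
    ("كل", "يوم", "NOUN"),
    ("هو", "يلعب", "VERB"),
])

print(rule_tag("هذا", "يوم"), tagger.tag("هذا", "يوم"))
```

In this sketch the prefix rule alone proposes a verb for the last query, while the stored exception is the nearest instance in memory and yields a noun. The same ambiguous word from the abstract receives different tags depending on whether a preposition or a verbal particle precedes it.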
DOI: https://doi.org/10.3844/jcssp.2006.245.248
Copyright: © 2006 Yamina Tlili-Guiassa. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- Arabic language
- rule-based
- exceptions
- memory-based learning
- tagging