Improving Term Extraction Using Particle Swarm Optimization Techniques
Abstract
Problem statement: Term extraction is one of the layers in the ontology development process which has the task to extract all the terms contained in the input document automatically. The purpose of this process is to generate list of terms that are relevant to the domain of the input document. In the literature there are many approaches, techniques and algorithms used for term extraction where each of approaches, techniques and algorithms has the objective to improve the precision of the extracted terms. Approach: We proposed a new approach using particle swarm optimization techniques in order to improve the precision of term extraction results. We choose five features to represent the term score. Results: The approach had been applied to the domain of Islamic documents. We compare our term extraction method with TFIDF, Weirdness, GlossaryExtraction and TermExtractor. Conclusion: The experimental results showed that our proposed approach achieves better precision than those four algorithms.
DOI: https://doi.org/10.3844/jcssp.2010.323.329
Copyright: © 2010 Mohammad Syafrullah and Naomie Salim. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,639 Views
- 2,491 Downloads
- 2 Citations
Download
Keywords
- Term extraction
- particle swarm optimization
- feature selection
- text mining,