Services on Demand
Journal
Article
Indicators
- Cited by SciELO
- Access statistics
Related links
- Similars in SciELO
Share
RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação
Print version ISSN 1646-9895
Abstract
QUINTEIRO-GONZALEZ, Jose María et al. Clasificación de textos en lenguaje natural usando la Wikipedia. RISTI [online]. 2011, n.8, pp.39-52. ISSN 1646-9895.
Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers.
Keywords : Text Categorization; Wikipedia; tf-idf; Machine Learning; Natural Language Processing.