SciELO - Scientific Electronic Library Online

 
 issue8SCODA para el Desarrollo de Sistemas MultiagenteAnálise de opiniões expressas nas redes sociais author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

Print version ISSN 1646-9895

Abstract

QUINTEIRO-GONZALEZ, Jose María et al. Clasificación de textos en lenguaje natural usando la Wikipedia. RISTI [online]. 2011, n.8, pp.39-52. ISSN 1646-9895.

Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers.

Keywords : Text Categorization; Wikipedia; tf-idf; Machine Learning; Natural Language Processing.

        · abstract in Spanish     · text in Spanish     · Spanish ( pdf )