8SCODA para el Desarrollo de Sistemas Multiagente 
Home Page  

  • SciELO

  • SciELO


RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

 ISSN 1646-9895

QUINTEIRO-GONZALEZ, Jose María et al. Clasificación de textos en lenguaje natural usando la Wikipedia. []. , 8, pp.39-52. ISSN 1646-9895.

Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers.

: Text Categorization; Wikipedia; tf-idf; Machine Learning; Natural Language Processing.

        ·     ·     · ( pdf )