RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação
Print version ISSN 1646-9895
Abstract
QUINTEIRO-GONZALEZ, Jose María et al. Clasificación de textos en lenguaje natural usando la Wikipedia. RISTI [online]. 2011, n.8, pp.39-52. ISSN 1646-9895.
Automatic text classifiers are needed in environments where the volume of data is too large for human classification to be practical. The classifier proposed in our study uses Wikipedia to generate the corpus that defines each category. The text to be classified is then analyzed syntactically using natural language processing software. The proposed classifier is highly accurate and outperforms classifiers trained with machine learning.
Keywords: Text Categorization; Wikipedia; tf-idf; Machine Learning; Natural Language Processing.
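
The abstract names tf-idf and a per-category corpus but gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each category is represented by a single block of text (here two tiny hand-written stand-ins, `CATEGORY_TEXTS`, in place of a Wikipedia-derived corpus), weights terms by tf-idf, and assigns a text to the category with the highest cosine similarity. The syntactic analysis step mentioned in the abstract is replaced by a plain regular-expression tokenizer. All function and variable names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_tfidf(category_texts):
    """Return per-category tf-idf vectors and the idf table.

    Each category's text is treated as one document. This is an assumption
    standing in for a corpus generated from Wikipedia.
    """
    counts = {cat: Counter(tokenize(txt)) for cat, txt in category_texts.items()}
    n_docs = len(counts)
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())
    # Smoothed idf so that terms shared by every category keep a small weight.
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = {}
    for cat, term_counts in counts.items():
        total = sum(term_counts.values())
        vectors[cat] = {t: (n / total) * idf[t] for t, n in term_counts.items()}
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(text, vectors, idf):
    """Return the best-matching category, or None if no known term occurs."""
    term_counts = Counter(t for t in tokenize(text) if t in idf)
    total = sum(term_counts.values())
    if total == 0:
        return None
    query = {t: (n / total) * idf[t] for t, n in term_counts.items()}
    return max(vectors, key=lambda cat: cosine(query, vectors[cat]))


# Hypothetical toy corpora; a real system would use far larger texts.
CATEGORY_TEXTS = {
    "sports": "football match team goal player league stadium coach",
    "computing": "computer software algorithm program processor memory network",
}
VECTORS, IDF = build_tfidf(CATEGORY_TEXTS)

print(classify("The team scored a goal in the stadium", VECTORS, IDF))  # sports
```

With corpora this small the result depends entirely on exact word overlap; the sketch says nothing about the accuracy reported in the abstract.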