34La Semántica de las Imágenes y el Análisis de su ContenidoHeurísticas para Data Augmentation en NLP: Aplicación a Revisiones de Artículos Científicos 
Home Page  

  • SciELO

  • SciELO


RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

 ISSN 1646-9895

CARRASCAL, Ana Isabel Oviedo; COTTE, David Sanguino; ARANGO, Natalia Andrea Restrepo    VELEZ, Andrés Felipe Patiño. Knowledge Discovery in Medical Records through Text Mining. []. , 34, pp.29-43. ISSN 1646-9895.  https://doi.org/10.17013/risti.34.29-43.

The clinical institutions generate a large amount of unstructured databoth in the registration of procedures in free text by medical staff, and by the images and videos generated by diagnostic aids. This paper proposes a process of knowledge discovery in the unstructured text of the medical records of the trauma area of the San Vicente Foundation Hospital through text mining. Text preparation techniques were applied such as elimination of non-relevant words, substitution of terms, elimination of accents and derivation of words. Regarding mining processes, supervised and unsupervised learning techniques were applied such as decision trees, logistic regression, nearest k-neighbors, hierarchical clustering and association rules. The result obtained is the conformation of a model of the most relevant words in the clinical records of the Hospital in the area of traumatology.

: Text mining; health data mining; natural language processing.

        ·     ·     · ( pdf )

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License