<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1646-9895</journal-id>
<journal-title><![CDATA[RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação]]></journal-title>
<abbrev-journal-title><![CDATA[RISTI]]></abbrev-journal-title>
<issn>1646-9895</issn>
<publisher>
<publisher-name><![CDATA[AISTI - Associação Ibérica de Sistemas e Tecnologias de Informação]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1646-98952023000300084</article-id>
<article-id pub-id-type="doi">10.17013/risti.51.84-98</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Análisis comparativo de Técnicas de Machine Learning para la predicción de casos de deserción universitaria]]></article-title>
<article-title xml:lang="en"><![CDATA[Comparative analysis of Machine Learning Techniques for the prediction of cases of university dropout]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Tito]]></surname>
<given-names><![CDATA[Anthony Edwin Aco]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Condori]]></surname>
<given-names><![CDATA[Bryan Orlando Hancco]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Vera]]></surname>
<given-names><![CDATA[Yasiel Pérez]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Universidad Nacional de San Agustín de Arequipa  ]]></institution>
<addr-line><![CDATA[Santa Catalina Arequipa]]></addr-line>
<country>Peru</country>
</aff>
<pub-date pub-type="pub">
<day>30</day>
<month>09</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="epub">
<day>30</day>
<month>09</month>
<year>2023</year>
</pub-date>
<numero>51</numero>
<fpage>84</fpage>
<lpage>98</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://scielo.pt/scielo.php?script=sci_arttext&amp;pid=S1646-98952023000300084&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://scielo.pt/scielo.php?script=sci_abstract&amp;pid=S1646-98952023000300084&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://scielo.pt/scielo.php?script=sci_pdf&amp;pid=S1646-98952023000300084&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[Resumen La deserción universitaria afecta negativamente a muchos estudiantes, este suceso puede estar relacionado con problemas personales, cuestiones económicas, entre otros. Ante tal situación surge la importancia de desarrollar una forma de predecir estos casos, para esto se propuso el uso de técnicas de Machine Learning, las utilizadas fueron Regresión Logística, Naive Bayes, Red Neuronal Perceptrón Multicapa, Árbol de Decisión, Support Vector Machine y Random Forest; se seleccionó un Dataset, que pasó por una limpieza de datos, se corrigieron los datos faltantes y los valores atípicos; luego se eliminaron los registros cuya variable de salida era Matriculado, centrándose en los tipos Abandono y Graduado. Cada modelo fue entrenado y probado mediante validación cruzada con pliegues, finalmente, se compararon en función de métricas de precisión, exactitud y exhaustividad, donde se concluyó que la Regresión Logística es la técnica que mejores resultados proporciona para predecir la deserción universitaria en el dataset considerado.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract University dropout has a detrimental impact on numerous students; this phenomenon may be associated with personal issues, economic constraints, and other factors. Given this situation, the importance of developing a predictive model for such cases arises. To achieve this, Machine Learning techniques were proposed and employed, including Logistic Regression, Naive Bayes, Multilayer Perceptron Neural Network, Decision Tree, Support Vector Machine, and Random Forest. A dataset was selected and underwent data cleaning, addressing missing values and outliers. Subsequently, records with the 'Enrolled' outcome variable were removed, focusing solely on 'Dropout' and 'Graduate' categories. Each model was trained and tested using cross-validation with folds. Ultimately, they were compared based on accuracy, precision, and recall metrics, leading to the conclusion that Logistic Regression is the technique that yields the best results for predicting university dropout in the considered dataset.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[Análisis comparativo]]></kwd>
<kwd lng="es"><![CDATA[Deserción Universitaria]]></kwd>
<kwd lng="es"><![CDATA[Machine Learning]]></kwd>
<kwd lng="es"><![CDATA[Predicción]]></kwd>
<kwd lng="es"><![CDATA[Regresión Logística]]></kwd>
<kwd lng="en"><![CDATA[Comparative analysis]]></kwd>
<kwd lng="en"><![CDATA[University dropout]]></kwd>
<kwd lng="en"><![CDATA[Logistic Regression]]></kwd>
<kwd lng="en"><![CDATA[Machine Learning]]></kwd>
<kwd lng="en"><![CDATA[Prediction]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ayala-Yaguara]]></surname>
<given-names><![CDATA[H. Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Valenzuela-Sabogal]]></surname>
<given-names><![CDATA[G. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Espinosa-García]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Obtención de un modelo de minería de datos aplicado a la deserción universitaria del programa de Ingeniería de Sistemas de la Universidad de Cundinamarca]]></article-title>
<source><![CDATA[Revista Ontare]]></source>
<year>2020</year>
<volume>7</volume>
<page-range>134-50</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bedregal Alpaca]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Aruquipa Velazco]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Cornejo Aparicio]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Técnicas de data mining para extraer perfiles comportamiento académico y predecir la deserción universitaria]]></article-title>
<source><![CDATA[RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao]]></source>
<year>2020</year>
<numero>E30</numero>
<issue>E30</issue>
<page-range>592-604</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Borja-Robalino]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Monleon-Getino]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Monleón-Getino]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rodellar]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Estandarización de métricas de rendimiento para clasificadores Machine y Deep Learning]]></article-title>
<source><![CDATA[RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao]]></source>
<year>2020</year>
<numero>E30</numero>
<issue>E30</issue>
<page-range>184-96</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carmona]]></surname>
<given-names><![CDATA[E. J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Tutorial sobre Máquinas de Vectores Soporte (SVM)]]></source>
<year>2016</year>
</nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Contreras]]></surname>
<given-names><![CDATA[L. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Fuentes]]></surname>
<given-names><![CDATA[H. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Rodríguez]]></surname>
<given-names><![CDATA[J. I.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Predicción del rendimiento académico como indicador de éxito/fracaso de los estudiantes de ingeniería, mediante aprendizaje automático]]></article-title>
<source><![CDATA[Formación Universitaria]]></source>
<year>2020</year>
<volume>13</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>233-46</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dimitoglou]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Adams]]></surname>
<given-names><![CDATA[J. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Jim]]></surname>
<given-names><![CDATA[C. M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Comparison of the C4.5 and a Naive Bayes Classifier for the Prediction of Lung Cancer Survivability Index Terms-Data mining, mining methods and algorithms, text mining]]></article-title>
<source><![CDATA[Journal of Computing]]></source>
<year>2012</year>
<volume>4</volume>
<numero>8</numero>
<issue>8</issue>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fawagreh]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Gaber]]></surname>
<given-names><![CDATA[M. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Elyan]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Random forests: from early developments to recent advancements]]></article-title>
<source><![CDATA[Systems Science &amp; Control Engineering]]></source>
<year>2014</year>
<volume>2</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>602-9</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ferreyra]]></surname>
<given-names><![CDATA[M. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Avitabile]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Botero Álvarez]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Haimovich Paz]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Urzúa]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[At a Crossroads: Higher Education in Latin America and the Caribbean]]></source>
<year>2017</year>
<publisher-name><![CDATA[World Bank]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lening]]></surname>
<given-names><![CDATA[C. G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Non-completion in Postsecondary education: Why are so many students not finishing their courses? Centre for the Study of Science and Innovation Policy]]></source>
<year>2022</year>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lizares Castillo]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Comparación de modelos de clasificación: regresión logística y árboles de clasificación para evaluar el rendimiento académico]]></source>
<year>2017</year>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[López Martínez]]></surname>
<given-names><![CDATA[J. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Méndez Aguirre]]></surname>
<given-names><![CDATA[Ó. A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Técnicas de Machine learning para la predicción de desempeño académico en el desarrollo del espacio proyectivo del pensamiento espacial]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lovón Cueva]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Cisneros Terrones]]></surname>
<given-names><![CDATA[S. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Repercusiones de las clases virtuales en los estudiantes universitarios en el contexto de la cuarentena por COVID-19: El caso de la PUCP]]></article-title>
<source><![CDATA[Propósitos y Representaciones]]></source>
<year>2020</year>
<volume>8</volume>
<numero>SPE3</numero>
<issue>SPE3</issue>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Martins]]></surname>
<given-names><![CDATA[M. V]]></given-names>
</name>
<name>
<surname><![CDATA[Tolledo]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Machado]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Baptista]]></surname>
<given-names><![CDATA[L. M. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Realinho]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Early Prediction of student&#8217;s Performance in Higher Education: A Case Study]]></article-title>
<source><![CDATA[Trends and Applications in Information Systems and Technologies]]></source>
<year>2021</year>
<page-range>166-75</page-range><publisher-name><![CDATA[Springer International Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mirna]]></surname>
<given-names><![CDATA[E. M. G.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Factores condicionantes de la deserción universitaria]]></article-title>
<source><![CDATA[Ciencia Latina Revista Científica Multidisciplinar]]></source>
<year>2021</year>
<volume>5</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>5316-28</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Parrino]]></surname>
<given-names><![CDATA[M. C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Factores intervinientes en el Fenómeno de la Deserción Universitaria]]></article-title>
<source><![CDATA[Revista Argentina de Educación Superior]]></source>
<year>2014</year>
<volume>8</volume>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ramchoun]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Amine]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Idrissi]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Ghanou]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Ettaouil]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Multilayer Perceptron: Architecture Optimization and Training]]></article-title>
<source><![CDATA[International Journal of Interactive Multimedia and Artificial Intelligence]]></source>
<year>2016</year>
<volume>4</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>26</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ruiz Echeverry]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Consideración de deserción universitaria en estudiantes de Comunicación Social. Un estudio de caso]]></article-title>
<source><![CDATA[Revista Nexus Comunicación]]></source>
<year>2020</year>
<page-range>1-25</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Solís]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Moreira]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[González]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Fernández]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Hernández]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Perspectives to Predict Dropout in University Students with Machine Learning]]></article-title>
<source><![CDATA[IEEE International Work Conference on Bioinspired Intelligence (IWOBI)]]></source>
<year>2018</year>
<page-range>1-6</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tocto]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Huamaní]]></surname>
<given-names><![CDATA[G. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Zuloaga]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Aplicación aprendizaje automático en la gestión universitaria: Modelo de clasificación de la deserción de los estudiantes en Ingeniería en el Perú]]></article-title>
<source><![CDATA[Proceedings of the LACCEI International Multi-Conference for Engineering, Education and Technology]]></source>
<year>2023</year>
</nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Upasani]]></surname>
<given-names><![CDATA[D. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Virendra]]></surname>
<given-names><![CDATA[V. S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Machine Learning with Python]]></source>
<year>2020</year>
<publisher-name><![CDATA[IPH]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Valentim]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Machado]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Baptista]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Martins]]></surname>
<given-names><![CDATA[M. V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Predict students&#8217; dropout and academic success]]></source>
<year>2021</year>
<publisher-name><![CDATA[Zenodo]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Viale Tudela]]></surname>
<given-names><![CDATA[H. E.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Una Aproximación Teórica A La Deserción Estudiantil Universitaria]]></article-title>
<source><![CDATA[Revista Digital de Investigación En Docencia Universitaria]]></source>
<year>2014</year>
<volume>8</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>59-76</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
