SciELO - Scientific Electronic Library Online

 
 número121Predileção de riachos para o monitoramento da qualidade da água: um serviço ecossistêmico de provisão na bacia hidrográfica do rio Itajaí-Mirim (Brasil).Teletrabalho em tempo de pandemia: das vantagens às incertezas nos quotidianos das famílias residentes na Área Metropolitana de Lisboa Norte, Portugal índice de autoresíndice de assuntosPesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Finisterra - Revista Portuguesa de Geografia

versão impressa ISSN 0430-5027

Resumo

GIOIA, Thamy Barbara; BARROS, Juliana Ramalho  e  SILVA, Renato Rodrigues da. Socioeconomic factors and machine learning algorithms applied to neglected diseases risk prediction. Case study in the municipalities of the Goiás State and Federal District, Brazil. Finisterra [online]. 2022, n.121, pp.109-123.  Epub 31-Dez-2022. ISSN 0430-5027.  https://doi.org/10.18055/finis28635.

Analyzing the relation between socioeconomic variables and neglected tropical diseases can help managers in the conception of public policies to reduce cases. The objective of this study was to evaluate, based on machine learning algorithms, which socioeconomic variables are more important for the risk classification of three neglected diseases: leprosy, cutaneous leishmaniasis, and dengue. Three algorithms based on decision trees were evaluated: Random Forest (RF), XGBoost, and C5.0. As a study area, the municipalities of the state of Goiás and of the Federal District - Brazil, were delimited. For the dengue risk classes, both the RF algorithm and the XGBoost showed accuracy values above 0.6. Both emphasizing the low-income conditions, literacy, and race as the most important predictive variables. In the leprosy risk classes case, the three algorithms presented accuracy results above 0.6, indicating the variables water supply, literacy, race, and housing as important. For the cutaneous leishmaniasis risk classes, the algorithms showed an accuracy lower than 0.4, making the evaluation of possible predictive variables to the model unfeasible. The three evaluated algorithms revealed approximate predictive performance; however, the RF was slightly higher. The most important socioeconomic variables for dengue and leprosy risk classes prediction were similar.

Palavras-chave : Neglected tropical diseases; social determinants; XGBoost; Random Forest; C5.0.

        · resumo em Português | Francês | Espanhol     · texto em Inglês     · Inglês ( pdf )