Services on Demand
Journal
Article
Indicators
- Cited by SciELO
- Access statistics
Related links
- Similars in SciELO
Share
RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação
Print version ISSN 1646-9895
Abstract
GUIMARAES, André José Ribeiro; MENDES JUNIOR, Ricardo and FREITAS, Maria do Carmo Duarte. Requirements for Data Science: analyzing job postings with text mining. RISTI [online]. 2022, n.46, pp.54-70. Epub June 30, 2022. ISSN 1646-9895. https://doi.org/10.17013/risti.46.54-70.
This research identifies in job postings the requirements for data scientists in Brazil. To analyze these documents, it adopts text mining methods of analysis: n-gram, topic modeling, and clustering. The findings point to a concentration of job opportunities in São Paulo while demonstrating that the remote modality is the second most offered. Additionally, it highlights that salaries in Brazil are below the average of other countries, even if organizations look for experienced professionals with an elevated level of education. About the requirements, there is a predominance of technical skills such as machine learning, statistical models, python, and database, among others. The results also demonstrate that n-gram and clustering are more suitable for text mining techniques than topic modeling.
Keywords : Data scientist; Text mining; Requirements for data scientist; Competencies.