SciELO - Scientific Electronic Library Online

 

RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

Print version ISSN 1646-9895

Abstract

COCON, Felipe et al. Web scraping: Use of data extraction platforms applied to a website about professions in Mexico. RISTI [online]. 2023, n.52, pp.61-73. Epub 31-Dec-2023. ISSN 1646-9895. https://doi.org/10.17013/risti.52.61-73.

This article provides a thorough review of the main web scraping tools available on the market and compares their features and functionality. One tool is then selected to demonstrate its use in obtaining data on the percentage of graduates in various careers in Mexico, as well as the distribution by gender and salary in several states of the country. The main objective of the article is to illustrate how data can be collected using a data extraction tool. Additionally, the importance of accessing reliable data sources is highlighted, and a detailed description of the data extraction process using the WebHarvy tool is provided. Ultimately, the article highlights the importance of web scraping as a powerful technique for collecting valuable data from the web effectively, responsibly, and in keeping with professional ethics.

Keywords: Education; extraction; employment; scraping; professions.
