SciELO - Scientific Electronic Library Online

 


RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

Print version ISSN 1646-9895

Abstract

COCON, Felipe et al. Web scraping: Use of data extraction platforms applied to a website about professions in Mexico. RISTI [online]. 2023, n.52, pp.61-73. Epub 31-Dec-2023. ISSN 1646-9895. https://doi.org/10.17013/risti.52.61-73.

This article provides a thorough review of the main web scraping tools available on the market and compares their features and functionalities. One tool is selected to demonstrate its use in obtaining data on the percentages of graduates in various careers in Mexico, as well as the distribution by gender and salaries in several states of the country. The main objective of the article is to illustrate how data can be collected using a data extraction tool. Additionally, the importance of accessing reliable data sources is highlighted, and a detailed description of the data extraction process using the WebHarvy tool is provided. Ultimately, the article highlights the importance of web scraping as a powerful technique for collecting valuable data from the web effectively, responsibly, and in line with professional ethics.

Keywords: Education; extraction; employment; scraping; professions.
