RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

 ISSN 1646-9895

COCON, Felipe et al. Web scraping: Use of data extraction platforms applied to a website about professions in Mexico. RISTI, 52, pp. 61-73. 31--2023. ISSN 1646-9895. https://doi.org/10.17013/risti.52.61-73.

This article provides a thorough review of the main web scraping tools available on the market and compares their features and functionalities. One tool is selected to demonstrate its use in obtaining data on the percentages of graduates in various careers in Mexico, as well as the gender distribution and salaries in several states of the country. The main objective of the article is to illustrate how data can be collected using a data extraction tool. Additionally, the importance of accessing reliable data sources is highlighted, and the data extraction process using the WebHarvy tool is described in detail. Ultimately, the article highlights the importance of web scraping as a powerful, professional, and ethical technique for collecting valuable data from the web effectively and responsibly.

Keywords: Education; extraction; employment; scraping; professions.
