SciELO - Scientific Electronic Library Online

 
 issue52The relationship between technological innovation and performance in accommodation facilities in the context of the Covid-19 pandemicLatin American perspectives on the use of ICT in university students author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


RISTI - Revista Ibérica de Sistemas e Tecnologias de Informação

Print version ISSN 1646-9895

Abstract

COCON, Felipe et al. Web scraping: Use of data extraction platforms applied to a website about professions in Mexico. RISTI [online]. 2023, n.52, pp.61-73.  Epub Dec 31, 2023. ISSN 1646-9895.  https://doi.org/10.17013/risti.52.61-73.

This article provides a thorough review of the main web scraping tools available on the market and it is comparing their features and functionalities. A specific tool is selected to demonstrate it is use in obtaining data on percentages of graduates in various careers in Mexico, as well as the distribution related to gender and salaries in several states of the country. The main objective of the article is to illustrate how data can be collected using a data extraction tool. Additionally, the importance of accessing reliable data sources is highlighted and a detailed description of the data extraction process using the WebHarvy tool is provided. Ultimately, it is highlighting the importance of web scraping as a powerful technique and professional ethical to collect valuable data from the web to effectively and responsibly.

Keywords : Education; extraction; employment; scraping; professions.

        · abstract in Spanish     · text in Spanish     · Spanish ( pdf )