Strategies to access web-enabled urban spatial data for socioeconomic research using R functions

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Vallone, Andrés; Chasco Yrigoyen, María del Coro; Sánchez, Beatriz
الموضوع:
Spain; Spatial microdata; URL parsing; Web scraping; Economía
نوع التسجيلة:
article in journal/newspaper
اللغة:
English

معلومة اضافية
- Contributors:
  UAM. Departamento de Economía Aplicada
- بيانات النشر:
  Springer
- الموضوع:
  2023
- Collection:
  Universidad Autónoma de Madrid (UAM): Biblos-e Archivo
- نبذة مختصرة :
  This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s10109-019-00309-y ; Since the introduction of the World Wide Web in the 1990s, available information for research purposes has increased exponentially, leading to a significant proliferation of research based on web-enabled data. Nowadays the use of internet-enabled databases, obtained by either primary data online surveys or secondary official and non-official registers, is common. However, information disposal varies depending on data category and country and specifically, the collection of microdata at low geographical level for urban analysis can be a challenge. The most common difficulties when working with secondary web-enabled data can be grouped into two categories: accessibility and availability problems. Accessibility problems are present when the data publication in the servers blocks or delays the download process, which becomes a tedious reiterative task that can produce errors in the construction of big databases. Availability problems usually arise when official agencies restrict access to the information for statistical confidentiality reasons. In order to overcome some of these problems, this paper presents different strategies based on URL parsing, PDF text extraction, and web scraping. A set of functions, which are available under a GPL-2 license, were built in an R package to specifically extract and organize databases at the municipality level (NUTS 5) in Spain for population, unemployment, vehicle fleet, and firm characteristics ; This work was supported by Spanish Ministry of Economics and Competitiveness (ECO2015-65758-P) and the Regional Government of Extremadura (Spain)
- File Description:
  application/pdf
- ISSN:
  1435-5930
- Relation:
  Journal of Geographical Systems; https://doi.org/10.1007/s10109-019-00309-y; Gobierno de España. ECO2015-65758-P; Journal of Geographical Systems 22 (2020): 217 - 239; http://hdl.handle.net/10486/709503; 217; 239; 22
- الرقم المعرف:
  10.1007/s10109-019-00309-y
- Rights:
  © Springer-Verlag GmbH Germany, part of Springer Nature 2019 ; openAccess
- الرقم المعرف:
  edsbas.FEB63392

تعليقات

No Comments.

Strategies to access web-enabled urban spatial data for socioeconomic research using R functions

اتصل بنا

اتبع