This course material is designed for the Summer Institute in Computational Social Science (SICSS) Oxford.
The material introduces the use of geographical information to connect and analyze different spatial data sources. This introduction is limited to the fundamentals of using geographical information in R. The field has develops very fast over since few years, and R now provides a rich set of packages for various spatial data operations. For a more in-depth introduction into spatial data analysis in R, have a look into the materials references below.
Instead of using exemplary data shipped with the packages, the course relies on real world data sources from the London Datastore.
The amount of spatial (‘readymade’) data sources is steadily increasing. For instance, the UK open data portal provides many indicators on a spatially agrregated level: https://data.gov.uk/. Moreover, geographical information is increasingly available for traditional (‘custommade’) data sources, such as survey data. Using geographic information allows us to:
Link information from different (unrelated) data sources
Incorporate a spatial dimension into the analysis
We can combine different sources of social science data, but we can also enrich existing social science data with information on aggregated demographics or other contextual information.
By now, R provides a lot of functionalities for GIS applications and spatial econometrics, and further extensions. There are lots of packages providing a huge variety of spatial functionalities and methods (see e.g. Bivand, Millo, and Piras 2021). Important packages for fundamental spatial operations are:
Spatial data workhorses: sf (Pebesma 2018) and stars (Pebesma 2021)
Visualization: mapview (Appelhans et al. 2021) and tmap (Tennekes 2018)
Spatial weights and other relations: spdep (Bivand and Wong 2018)
Spatial interpolation and kriging: gstat (Gräler, Pebesma, and Heuvelink 2016)
Spatial regression models: spatialreg (Bivand and Piras 2015) and sphet (Bivand and Piras 2015)
Readings
Great up-to-date introduction to spatial R: Lovelace, Nowosad, and Muenchow (2019), updated version available online
Another great introduction, but not up-to-date: Bivand, Pebesma, and Gómez-Rubio (2013). However, Edzer Pebesma and Roger Bivand are working on a new book on Spatial Data Science
Comprehensive introduction to spatial econometrics: LeSage and Pace (2009)
Article-length introductions to spatial econometrics: Halleck Vega and Elhorst (2015), LeSage (2014), and Rüttenauer (2019)
Course materials