Geodata & Spatial Regression

Author

Tobias Rüttenauer

Published

February 6, 2025

Introduction

This course material is designed for the 3-days GESIS workshop on geodata and spatial regression analysis.

In recent years, more and more spatial data has become available, providing the possibility to combine otherwise unrelated data, such as social, economic, and environmental data. This also opens up the possibility of analyzing spatial patterns and processes (e.g., spillover effects or diffusion).

Many social science research questions are spatially dependent such as voting outcomes, housing prices, labour markets, protest behavior, or migration decisions. Observing an event in one region or neighborhood increases the likelihood that we observe similar processes in proximate areas. As Tobler’s first law of geography puts it: “Everything is related to everything else, but near things are more related than distant things”. This dependence can stem from spatial contagion, spatial spillovers, or common confounders. Therefore, basic assumptions of standard regression models are violated when analyzing spatial data. However, more imoprtantly, spatial processes are interesting for their own sake. Spatial regression models can detect spatial dependence and explicitly model spatial relations, identifying spatial clustering, spillovers or diffusion processes.

The main objective of the course is the theoretical understanding and practical application of spatial regression models. This course will first give an overview on how to perform common spatial operations using spatial information, such as aggregating spatial units, calculating distances, merging spatial data as well as visualizing them. The course will further focus on the analysis of geographic data and the application of spatial regression techniques to model and analyze spatial processes, and furthermore, the course addresses several methods for defining spatial relationships, detecting and diagnosing spatial dependence and autocorrelation. Finally, we will discuss various spatial regression techniques to model processes, clarify the assumptions of these models, and show how they differ in their applications and interpretations.

The field has developed very quickly over the past few years, and R now provides a rich set of packages for various spatial data operations. For a more in-depth introduction into spatial data analysis in R, have a look into the materials references below.

The material introduces the use of geographical information to connect and analyze different spatial data sources very briefly. This introduction is limited to the fundamentals of using geographical information in R. Stefan Jünger & Anne-Kathrin Stroppe have provided a comprehensive GESIS workshop on geospatial techniques in R. The focus of this workshop will be on techniques for spatial data analysis, such as spatial regression models.

Schedule

Day 1 Working with Spatial Data
10:00 - 11:30 Refresher on R as GIS
Coffee break
11:45 - 13:00 Spatial Data Manipulation & Visualization
Lunch break
14:00 - 15:30 Defining Spatial Relationships (W)
Coffee break
15:45 - 17:15 Detecting Spatial Dependence
Day 2 Spatial Regression Models I
10:00 - 11:30 Spatial Regression Models: Theory
Coffee break
11:45 - 13:00 Estimating Spatial Regression Models
Lunch break
14:00 - 15:30 Interpreting Results: Spatial Impacts
Coffee break
15:45 - 17:15 Lab Exercises in R
Day 3 Spatial Regression Models II
10:00 - 11:30 Comparing and Selecting Models
Coffee break
11:45 - 13:00 Spatiotemporal models
Lunch break
14:00 - 15:30 Other Models: GWR & Binary Outcomes
Coffee break
15:45 - 17:15 Lab Exercises in R

Some useful packages

By now, R provides a lot of functionalities for GIS applications and spatial econometrics, and further extensions. There are lots of packages providing a huge variety of spatial functionalities and methods (see e.g. R. Bivand, Millo, and Piras 2021). Important packages for fundamental spatial operations are:

Further Readings

  • Great up-to-date introduction to spatial R: Lovelace, Nowosad, and Muenchow (2019), updated version available online

  • Great open-science book on Spatial Data Science Pebesma and Bivand (2023)

  • Comprehensive introduction to spatial econometrics: LeSage and Pace (2009)

  • Relative intuitive introduction to spatial econometrics: Ward and Gleditsch (2008)

  • Article-length introductions to spatial econometrics: Elhorst (2012), Halleck Vega and Elhorst (2015), LeSage (2014), and Rüttenauer (2022)

Course materials

Appelhans, Tim, Florian Detsch, Chritoph Reudenbach, and Stefan Woellauer. 2021. “Mapview: Interactive Viewing of Spatial Data in R.”
Bivand, Roger S., and Colin Rudel. 2018. “Rgeos: Interface to Geometry Engine - Open Source (’GEOS’).”
Bivand, Roger, Giovanni Millo, and Gianfranco Piras. 2021. “A Review of Software for Spatial Econometrics in R.” Mathematics 9 (11): 1276. https://doi.org/10.3390/math9111276.
Bivand, Roger, and Gianfranco Piras. 2015. “Comparing Implementations of Estimation Methods for Spatial Econometrics.” Journal of Statistical Software 63 (18): 1–36. https://doi.org/10.18637/jss.v063.i18.
Elhorst, J. Paul. 2012. “Dynamic Spatial Panels: Models, Methods, and Inferences.” Journal of Geographical Systems 14 (1): 5–28. https://doi.org/10.1007/s10109-011-0158-4.
Gräler, Benedikt, Edzer Pebesma, and Gerard Heuvelink. 2016. “Spatio-Temporal Interpolation Using Gstat.” The R Journal 8 (1): 204–18.
Halleck Vega, Solmaria, and J. Paul Elhorst. 2015. “The SLX Model.” Journal of Regional Science 55 (3): 339–63. https://doi.org/10.1111/jors.12188.
LeSage, James P. 2014. “What Regional Scientists Need to Know about Spatial Econometrics.” The Review of Regional Studies 44 (1): 13–32. https://doi.org/https://dx.doi.org/10.2139/ssrn.2420725.
LeSage, James P., and R. Kelley Pace. 2009. Introduction to Spatial Econometrics. Statistics, Textbooks and Monographs. Boca Raton: CRC Press.
Lovelace, Robin, Jakub Nowosad, and Jannes Muenchow. 2019. Geocomputation with R. 1st ed. Chapman & Hall/CRC the R Series. Boca Raton: Chapman & Hall/CRC.
Pebesma, Edzer. 2018. “Simple Features for R: Standardized Support for Spatial Vector Data.” The R Journal 10 (1): 439. https://doi.org/10.32614/RJ-2018-009.
Pebesma, Edzer, and Roger Bivand. 2023. Spatial Data Science: With Applications in R. First. Boca Raton: Chapman and Hall/CRC. https://doi.org/10.1201/9780429459016.
Rüttenauer, Tobias. 2022. “Spatial Regression Models: A Systematic Comparison of Different Model Specifications Using Monte Carlo Experiments.” Sociological Methods & Research 51 (2): 728–59. https://doi.org/10.1177/0049124119882467.
Tennekes, Martijn. 2018. “Tmap : Thematic Maps in R.” Journal of Statistical Software 84 (6). https://doi.org/10.18637/jss.v084.i06.
Ward, Michael Don, and Kristian Skrede Gleditsch. 2008. Spatial Regression Models. Vol. 155. Quantitative Applications in the Social Sciences. Thousand Oaks: Sage.