Spatial Data Analysis

Author

Tobias Rüttenauer

Published

April 4, 2024

Introduction

This course materials are designed for a 1-day workshop on spatial data analysis. Rüttenauer (2024) provides a handbook chapter accompanying these workshop materials.

In recent years, more and more spatial data has become available, providing the possibility to combine otherwise unrelated data, such as social, economic, and environmental data. This also opens up the possibility of analysing spatial patterns and processes (e.g., spillover effects or diffusion).

Many social science research questions are spatially dependent such as voting outcomes, housing prices, labour markets, protest behaviour, or migration decisions. Observing an event in one region or neighbourhood increases the likelihood that we observe similar processes in proximate areas. As Tobler’s first law of geography puts it: “Everything is related to everything else, but near things are more related than distant things”. This dependence can stem from spatial contagion, spatial spillovers, or common confounders. Therefore, basic assumptions of standard regression models are violated when analysing spatial data. However, more importantly, spatial processes are interesting for their own sake. Spatial regression models can detect spatial dependence and explicitly model spatial relations, identifying spatial clustering, spillovers or diffusion processes.

The main objective of the course is the theoretical understanding and practical application of spatial regression models. This course will first give an overview on how to perform common spatial operations using spatial information, such as aggregating spatial units, calculating distances, merging spatial data as well as visualizing them. The course will further focus on the analysis of geographic data and the application of spatial regression techniques to model and analyze spatial processes, and furthermore, the course addresses several methods for defining spatial relationships, detecting and diagnosing spatial dependence and autocorrelation. Finally, we will discuss various spatial regression techniques to model processes, clarify the assumptions of these models, and show how they differ in their applications and interpretations.

The field has developed very quickly over the past few years, and R now provides a rich set of packages for various spatial data operations. For a more in-depth introduction into spatial data analysis in R, have a look into the materials references below.

The material introduces the use of geographical information to connect and analyze different spatial data sources very briefly. This introduction is limited to the fundamentals of using geographical information in R. Stefan Jünger & Anne-Kathrin Stroppe have provided a comprehensive GESIS workshop on geospatial techniques in R. The focus of this workshop will be on techniques for spatial data analysis, such as spatial regression models.

Schedule

Time Session
09:00 – 09:30 Refresher on R for spatial data
09:30 – 11:00 Spatial Relationships (W) and Spatial Dependence
11:15 – 12:00 Practical exercise
Lunch break
13:00 – 14:30 Spatial Regression Models (SLX, Error, lagged DV)
14:45 – 16:00 Interpreting Results: Spatial Impacts
16:15 – 18:00 Practical exercise

Some useful packages

By now, R provides a lot of functionalities for GIS applications and spatial econometrics, and further extensions. There are lots of packages providing a huge variety of spatial functionalities and methods (see e.g. R. Bivand, Millo, and Piras 2021). Important packages for fundamental spatial operations are:

Further Readings

  • Great up-to-date introduction to spatial R: Lovelace, Nowosad, and Muenchow (2019), updated version available online

  • Great open-science book on Spatial Data Science Pebesma and Bivand (2023)

  • Comprehensive introduction to spatial econometrics: LeSage and Pace (2009)

  • Relative intuitive introduction to spatial econometrics: Ward and Gleditsch (2008)

  • Article-length introductions to spatial econometrics: Elhorst (2012), Halleck Vega and Elhorst (2015), LeSage (2014), Rüttenauer (2024), and Rüttenauer (2022)

Course materials