Acquire knowledge of geospatial analytics by consulting online resources
For the creative brief, you will work with geospatial data, which can be defined as ‘…time-based data that is related to a specific location on the Earth's surface. It can provide insights into relationships between variables and reveal patterns and trends' (Source). For example, the weather station you will build in Week 2, will collect, and subsequently transmit data from a particular location to a database.
What is geospatial analytics?
Geospatial analytics is used to add timing and location to traditional types of data and to build data visualizations. These visualizations can include maps, graphs, statistics and cartograms that show historical changes and current shifts. This additional context allows for a more complete picture of events. Insights that might be overlooked in a massive spreadsheet are revealed in easy-to-recognize visual patterns and images. This can make predictions faster, easier and more accurate (Source).
There are two main types of geospatial data: vector and raster. As regards the creative brief, you will encounter both data representations. For example, the Klimaateffectatlas contains data in GeoPackage (i.e., vector) and TIFF (i.e., raster) format.
Vector data comprises of individual points stored as coordinate pairs that indicate a physical location in the world. These points can be joined, in a particular order, to form lines or joined into closed areas to form polygons (Source)
Raster data provides a representation of the world as a surface divided up into a regular grid array, or cells, where each of these cells has an associated value (Source).
Figure 1. Raster vs. vector data.
Raster data can have a single band, where each cell/pixel has a single value, or they can have multiple bands, as is typically the case for satellite images:
Figure 1. Raster vs. vector data.
Most Python and R packages only support vector data (e.g., geopandas). One way to use raster data is by transforming it into a vector representation. We advise you to use QGIS instead of Python/R when using large data files. For a tutorial, see the article below:
Coordinate Reference Systems (CRS) are systems that ‘… uniquely reference spatial information in space as a set of coordinates (x, y, z) and/or latitude and longitude and height, based on a geodetic horizontal and vertical datum' (Source).
How to select a correct CRS for your data?
When choosing a CRS, you should be mindful of two things:
If your data is on a local level (such as a city or district), use a local projection/coordinate system. For example, in the Netherlands, we commonly use the projected coordinate system EPSG:28992, also known as Rijksdriehoeksstelsel (Source). For more information on measuring units, scope, etc., see the website EPSG.io
Figure 2. Rijksdriehoeksstelsel.
If you are performing any spatial calculations (e.g., distance), your data should be in a projected coordinate system and not a geographic coordinate system (Source).
Useful third party data sources:
There are many online resources available on the topic of geospatial analysis. Please, check the following resources:
Python:
R:
QGIS:
Figure 3. QGIS interface.
Theory: