Finding a feasible control group is central to many empirical methods in order to obtain reliable results as described above. This section describes an applied strategy for identifying comparable cities that can be used as a control group in further analytical steps.
Since randomized control trials are not feasible for studying the impact of Nature-based Solutions (NbS), we need to develop a strategy for identifying a good comparison group that serves as a proxy for the NbS-treated city. Since perfect comparability is not feasible in real-world examples, we rely on an applied matching strategy where we aim to identify a comparison city for each city included in the project. We focus on CiPeLs, which are the project cities including e.g. Munich (Germany) and Leuven (Belgium), as well as cities that have already implemented NbS measures. These cities and the corresponding information on the NbS strategies adopted have been collected by the experts of this project.1
We focus on socio-economic variables such as GDP, but mainly on urban infrastructure variables. The reason is that the variables should be independent of potential outcome variables related to the NbS strategy.
Table 1: Description of constructed measures of comparability
Measure | Description |
---|---|
Population size | Absolute number of people per city |
City size | Absolute city size measured in square kilometer |
Relative local GDP | Local GDP per capita on the city-level adjusted by the average real GDP per capita of the European Union (EU-28 in 2015) |
Relative number of education opportunities | Number of education buildings (e.g., schools, universities) relative to the absolute number of people |
Relative number of tourist attractions | Number of tourist attractions (e.g., galleries) relative to the absolute number of people |
Relative number of access points to public transport | Number of access points to the public transport network (e.g., subway entrances, bus stops) relative to the absolute number of people |
Relative length of the street network | Length of the street network (measured in kilometer) relative to the city size |
Relative length of the railroad network | Relative length of the railroad network Length of the railroad network (measured in kilometer) relative to the city size |
Relative area covered by water bodies | Area covered by water bodies (e.g., rivers, lakes) relative to the city size |
Relative area covered by green space | Area covered by green space (e.g., parks, gardens) relative to the city size |
Most of these measures are constructed in relative terms (e.g., relative to the city's population or size) to make the variables comparable across space.
After constructing these measures, we define a matching algorithm to assign a comparison city (or multiple cities in the case of multiple cities with similar characteristics) to each of the target cities. Since it is unlikely that there will be a perfect match, i.e. the measure of city A is exactly equal to the measure of city B, we define a matching threshold (degree of similarity) of 20%. Thus, if City A's measure falls within ±20% of City B's measure, we consider both cities to be similar for that particular characteristic. The 20% match threshold can be debated, as any other number could work. A lower number would ensure that two cities have a higher degree of similarity, while a higher number would make them less comparable. Beyond these theoretical considerations, it is important to note that from an applied perspective, the number of potential comparison candidates decreases with a lower matching threshold, which could lead to finding no matching city at all. For reliable empirical results, both arguments need to be considered. On the one hand, the comparison city should be a good proxy for the target city, i.e., a good match, which argues for a lower matching threshold, but on the other hand, the sample size must be sufficient, i.e., which argues for a higher matching threshold, to apply different empirical methods. In addition to the matching threshold, we also constrain the algorithm to exclude the scenario of self-matching. Thus, a city cannot be matched to itself as a comparison group. Although one could argue that a city itself would be the best possible comparison group, this also raises the problem of spillovers, as discussed above, which would confound the results.
Figure 2: Education opportunities Augsburg (Germany) and Barcelona (Spain) - 2020 (Source: Own Figure)
Figure 4: Public transportation network Leipzig (Germany) and Vienna (Austria) - 2020 (Source: Own Figure)
We would select the matching city/cities based on the majority rule, i.e., the highest number of similar characteristics. Munich (Germany) matches with two cities, Prague (Czech Republic) and Augsburg (Germany), in five out of ten measures (marked in bold). In a next step, the matched cities (control sites) could be grouped into a synthetic control group, which would be (potentially) even closer to Munich (Germany) than the individual cities.
Table 2: Matching outcome for Munich (Germany)
Munich | Prague | Augsburg | |
---|---|---|---|
Population size |
1,517,793
|
1,316,551
|
333,059
|
City size (km2) |
310.7
|
497.7
|
146.8
|
Relative local GDP |
106.5
|
135.9
|
102.5
|
Relative number of education opportunities (per Tsd. people) |
0.9
|
0.7
|
0.8
|
Relative number of access points to public transport (per Tsd. people) |
2.1
|
2.8
|
2.0
|
Relative number of tourist attractions (per Tsd. people)
|
0.3
|
0.3
|
0.3
|
Relative length of the street network
|
2.8
|
2.6
|
1.6
|
Relative length of the railroad network
|
4.2
|
2.4
|
2.2
|
Relative area covered by water bodies (km2)
|
2.4
|
2.8
|
2.3
|
Relative area covered by green space (km2)
|
36.4
|
37.7
|
1.8
|
Number of total matches
|
5 | 5 |
-
European Environmental Agency (EEA) (2017). France Administrative Boundaries. [Data set]. [Accessed: July 2023].
-
Eurostat (2021). European Grid Data Based on Eurostat-GISCO Database and GEOSTAT. [Data set]. [Accessed July 2023].
-
Eurostat (2023). Real GDP Per Capita. Data code: SDG_08_10. [Data set]. [Accessed: August 2023].
-
Kummu, M., Taka, M., & Guillaume (2018). Data Descriptor: Gridded Global Datasets for Gross Domestic Product and Human Development Index over 1990–2015. [Data set].
-
OSM Data for 2020.
This repository and the corresponding output have been part of the project: Activation of NATURE-based solutions for a JUST low carbon transition funded by European Commission, Horizon 2020. All rights reserved the author of the code Dr. Patrick Thiel.
Footnotes
-
The complete list includes 34 cities which are: Antwerp, Augsburg, Balatonfuzfo, Barcelona, Bari, Basel, Belgrade, Bolzano, Bratislava, Brussels, Budapest, Chania, Eindhoven, Genova, Gyor, Gzira, Hamburg, Krakow, Leipzig, Leuven, Malmo, Meran, Montpellier, Munich, Nicosia, Paris, Poznan, Prague, Szeged, Szombathely, Tampere, Utrecht, Valladolid, Vienna. ↩