The SOSAA Trajectories Dataset includes the settings, inputs, and outputs of several trajectory runs of the SOSAA model.
The SOSAA model is a chemistry transport model that has been actively developed in the Multi-Scale Modelling Group at the University of Helsinki since 2011 1. SOSAA was initially developed to run in stationary mode, in which it simulates the atmospheric processes near a measurement station. However, recent developments have focused on implementing a Lagrangian trajectory mode, in which emissions are picked up along the current mean meteorological trajectory that arrives at the station several days later.
This repository includes the SOSAA trajectories dataset. All runs in this dataset were performed with SOSAA@10618aa. A snapshot of this now-outdated version can be accessed at doi:10.5281/zenodo.7867026. Unfortunately, the SOSAA model is not yet generally publicly available. However, access to the complete SOSAA source code is provided upon request -- please contact Michael Boy (michael.boy@helsinki.fi), Putian Zhou (putian.zhou@helsinki.fi), or Petri Clusius (petri.clusius@helsinki.fi) for more information.
The first section of this README summarises the layout and variables of the dataset. Next, the second section explores the six trajectories that are included. The third section provides an overview of the SOSAA input perturbation runs that the dataset contains. Finally, the fourth section gives a list of selected input and output variables in the dataset.
The dataset is split into an extensive folder structure that contains the SOSAA configuration files and the input and output NetCDF files 2 for each of the trajectories. Each trajectory is identified by the time at which it arrives at the SMEAR II measurement station at Hyytiälä, Finland 3. Both the top-level inputs/
and outputs/
folders are split into baseline/
and perturbation/
directories. The former contains the baseline SOSAA runs that are used in this chapter. The perturbation/
directory is further split by perturbation group and kind, which are explained in the third section. These input directories are then split into HYDE_BASE_Y2018/OUTPUT_bwd_YYYYMMDD
folders based on the arrival date of the trajectories. Each of these folders contains an EMISSIONS_0422/
and a METEO/
directory. For example, a trajectory that arrives on 21.05.2018 at 14:00 UTC has the following four input files:
- aerosol emissions input:
EMISSIONS_0422/20180521_7daybwd_Hyde_traj_AER_10_L3.nc
- anthropogenic emissions input:
EMISSIONS_0422/20180521_7daybwd_Hyde_traj_ANT_10_L3.nc
- biogenic emissions input:
EMISSIONS_0422/20180521_7daybwd_Hyde_traj_BIO_10_L3.nc
- meteorological conditions input:
METEO/METEO_20180521_R10.nc
Note that instead of the arrival hour 14, the input file names use the number of hours until midnight, 10 in this case. In contrast to the input directories, the output folders are directly split by the full arrival time and use the natural time format instead:
- SOSAA output:
20180521_T14/output.nc
Each input and output file stores variables that are indexed by time first and often by height layer second. Note that SOSAA can use arbitrary time and height resolutions. Please refer to the fourth section for a complete list of the input variables.
This dataset includes six example trajectories, chosen from a large set of 480 SOSAA trajectory runs that cover the time period from 09.05.2018 at 00:00 UTC until 28.05.2018 at 23:00 UTC, with one trajectory arriving at every full hour. The six trajectories were chosen to cover different scenarios. The baseline and perturbation runs of these six trajectories and their temporal neighbours were then performed on the CSC Puhti supercomputer.
The figures below show the paths that each of the six trajectories takes over the
- The first trajectory starts in Sweden and travels clockwise over Russia and the Gulf of Finland before arriving in Hyytiälä on 14.05.2018 at 10:00 UTC.
- The second trajectory starts in Russia and travels counter-clockwise over the Baltics, then across to Sweden, and finally across the Gulf of Bothnia to Finland, where it arrives on 15.05.2018 at 19:00 UTC.
- The third trajectory starts in Estonia and travels clockwise across the Gulf of Finland to the west of Finland before turning towards Russia, from where it heads to its arrival at Hyytiälä on 17.05.2018 at 00:00 UTC.
- The fourth trajectory originates in eastern Canada, from where it crosses the Atlantic, over the southern coast of Iceland, before arriving in Norway, from where its path leads north-eastwards across Sweden and the Gulf of Bothnia to Finland, where it arrives on 19.05.2018 at 04:00 UTC.
- The fifth trajectory starts north of Scotland, from where it travels eastwards to Norway. From Norway, it follows a clockwise bow-shape over southern Sweden and Denmark, before travelling north-eastwards towards Hyytiälä, where it arrives on 21.05.2018 at 15:00 UTC.
- The sixth and final trajectory starts north-west of Scotland, from where it first travels southward over England, then sharply turns and travels northwards along the Norwegian coast before making yet another sharp turn to cross the Gulf of Bothnia from northern Sweden to southern Finland, where it arrives on 23.05.2018 at 13:00 UTC.
Maps for the six example trajectories. Each map shows the time-coloured path of the trajectory that arrives at the listed time in Hyytiälä, as well as the paths of the prior and next trajectories.
We have classified most input variables as belonging to one of the following eight perturbation groups (see also the fourth section):
-
ant: anthropogenic emissions, excluding
$\text{CO}$ ,$\text{NO}_{x}$ ,$\text{NH}_3$ ,$\text{CH}_4$ , and$\text{SO}_2$ -
bio: biogenic emissions, excluding
$\text{CO}$ ,$\text{CH}_4$ ,$\text{CH}_2\text{Br}_2$ ,$\text{CH}_3\text{I}$ ,$\text{CHBr}_3$ ,$\text{DMS}$ , and terpene emissions -
aer: aerosol emissions with diameters between
$3\text{nm}$ and$1000\text{nm}$ -
mtp: biogenic monoterpene emissions, including
$\alpha$ -pinene,$\beta$ -pinene, and others - sqt: biogenic sesquiterpene emissions
-
$\text{SO}_2$ : anthropogenic$\text{SO}_2$ emissions -
$\text{NO}_{x}$ : anthropogenic$\text{NO}_{x}$ emissions -
$T$ : air temperature
We assign each group a small increase, small decrease, large increase, and large decrease operation. For the emissions, these are the multiplicative factors
The following sections list the name, semantics, and units for each input and output variable in the SOSAA trajectories dataset that we use. The perturbation group for the third section is also given in brackets after each variable's name. In each section, the first one or two variables describe the time and height layer indexing that is used by the following features, which all note how they are indexed.
-
time
: time until the arrival at Hyytiälä in$\text{s}$ -
lev
: height level in$\text{m}$ -
t
(temperature): temperature in$\text{K}$ , indexed bytime
andlev
-
q
(ungrouped): specific humidity in$\frac{\text{kg}}{\text{kg}}$ , indexed bytime
andlev
-
ssr
(ungrouped): surface net solar radiation in$\frac{\text{W}}{\text{m}^2}$ , indexed bytime
-
lsm
(ungrouped): land sea mask$(0-1)$ , indexed bytime
-
blh
(ungrouped): atmospheric boundary layer height in$\text{m}$ , indexed bytime
-
layer
: elevation from the ground in$\text{m}$ -
time
: time until the arrival at Hyytiälä in$\text{s}$ -
3-10nm
(aerosols):$3-10\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
10-20nm
(aerosols):$10-20\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
20-30nm
(aerosols):$20-30\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
30-50nm
(aerosols):$30-50\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
50-70nm
(aerosols):$50-70\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
70-100nm
(aerosols):$70-100\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
100-200nm
(aerosols):$100-200\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
200-400nm
(aerosols):$200-400\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
400-1000nm
(aerosols):$400-1000\text{nm}$ diameter particle emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
layer
: elevation from the ground in$\text{m}$ -
time
: time until the arrival at Hyytiälä in$\text{s}$ -
co
(ungrouped):$\text{CO}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
nox
($\text{NO}_{x}$ ):$\text{NO}_{x}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
co2
(anthropogenic):$\text{CO}_{2}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
nh3
(ungrouped):$\text{NH}_{3}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
ch4
(ungrouped):$\text{CH}_{4}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
so2
($\text{SO}_{2}$ ):$\text{SO}_{2}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
nmvoc
(anthropogenic): non-methane VOC emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
alcohols
(anthropogenic): alcohol emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
ethane
(anthropogenic): ethane emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
propane
(anthropogenic): propane emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
butanes
(anthropogenic): emissions of butanes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
pentanes
(anthropogenic): emissions of pentanes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
hexanes
(anthropogenic): emissions of hexanes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
ethene
(anthropogenic): ethene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
propene
(anthropogenic): propene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
acetylene
(anthropogenic): acetylene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
isoprene
(anthropogenic): isoprene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
monoterpenes
(anthropogenic): emissions of monoterpenes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
other-alkenes-and-alkynes
(anthropogenic): emissions of other alkenes and alkynes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
benzene
(anthropogenic): benzene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
toluene
(anthropogenic): toluene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
xylene
(anthropogenic): xylene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
trimethylbenzene
(anthropogenic): trimethylbenzene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
other-aromatics
(anthropogenic): emissions of other aromatics in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
esters
(anthropogenic): emissions of esters in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
ethers
(anthropogenic): emissions of ethers in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
formaldehyde
(anthropogenic): formaldehyde emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
other-aldehydes
(anthropogenic): emissions of other aldehydes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
total-ketones
(anthropogenic): total ketones emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
total-acids
(anthropogenic): total acids emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
other-VOCs
(anthropogenic): emissions of other VOCs in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bylayer
andtime
-
time
: time until the arrival at Hyytiälä in$\text{s}$ -
acetaldehyde
(biogenic): acetaldehyde emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
acetone
(biogenic): acetone emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
butanes-and-higher-alkanes
(biogenic): emissions of butanes and higher alkanes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
butenes-and-higher-alkenes
(biogenic): emissions of butenes and higher alkenes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
CH4
(ungrouped):$\text{CH}_{4}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
CO
(ungrouped):$\text{CO}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
ethane
(biogenic): ethane emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
ethanol
(biogenic): ethanol emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
ethene
(biogenic): ethene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
formaldehyde
(biogenic): formaldehyde emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
hydrogen-cyanide
(biogenic): hydrogen-cyanide emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
isoprene
(biogenic): isoprene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
MBO
(biogenic): 2-Methyl-3-buten-2-ol (MBO) emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
methanol
(biogenic): methanol emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
methyl-bromide
(biogenic): methyl-bromide emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
methyl-chloride
(biogenic): methyl-chloride emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
methyl-iodide
(biogenic): methyl-iodide emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
other-aldehydes
(biogenic): emissions of other aldehydes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
other-ketones
(biogenic): emissions of other ketones in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
other-monoterpenes
(monoterpenes): emissions of other monoterpenes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
pinene-a
(monoterpenes):$\alpha$ -pinene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
pinene-b
(monoterpenes):$\beta$ -pinene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
propane
(biogenic): propane emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
propene
(biogenic): propene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
sesquiterpenes
(sesquiterpenes): emissions of sesquiterpenes in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
toluene
(biogenic): toluene emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
CH2Br2
(ungrouped): $\text{CH}{2}\text{Br}{2}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
CH3I
(ungrouped):$\text{CH}_{3}\text{I}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
CHBr3
(ungrouped):$\text{CHBr}_{3}$ emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
DMS
(ungrouped): dimethylsulfide (DMS) emissions in$\frac{\text{kg}}{\text{m}^2 \text{s}}$ , indexed bytime
-
time
: time since the beginning of the month in$\text{s}$ -
dp_dry_fs
: dry radius in$\text{m}$ of aerosol particles in each size bin -
lev
: height above the ground in$\text{m}$ -
nconc_par
: particle number concentration in$\frac{1}{\text{m}^3}$ , indexed bytime
,dp_dry_fs
, andlev
Licensed under the CC0 1.0 license (LICENSE or https://creativecommons.org/publicdomain/zero/1.0/).
Please refer to the CITATION.cff file and refer to https://citation-file-format.github.io to extract the citation in a format of your choice.
This dataset was created as part of Juniper Tyree's Masters Thesis "Prudent Response Surface Models" for the M.Sc. Theoretical and Computational Methods programme at the University of Helsinki.
Footnotes
-
M. Boy et al. SOSA – a new model to simulate the concentrations of organic vapours and sulphuric acid inside the ABL – Part 1: Model description and initial evaluation. Atmospheric Chemistry and Physics. 2011;11 (1): 43–51. Available from: doi:10.5194/acp-11-43-2011. ↩
-
R. Rew et al. Unidata NetCDF. 1989. Available from: doi:10.5065/D6H70CW6. ↩
-
P. Hari et al. Station for Measuring Ecosystem-Atmosphere Relations: SMEAR. Physical and Physiological Forest Ecology. Edited by P. Hari, K. Heliövaara and L. Kulmala. Dordrecht: Springer Netherlands, 2013, 471–487. Available from: doi:10.1007/978-94-007-5603-8_9. ↩