These sets of experiments involve the transition between MGDrivE1 and MGDrivE2, so there is an additional toggle in the data analysis routines to accommodate both types of datasets.
Setup the factorial experiment in the Rscript and run the following command:
./STP_simAndPreProcess.sh srv 265 SPA v1 True
This should auto-run the factorial and place the results in the RAID5 drive. After running the sims, it should startup the preProcess followed by the preTraces and preGrids (these two are optional).
- Run the STP_geoCluster.py script with the desired level of aggregation for analysis
- Run STP_preProcess.sh for ECO, HLT, WLD, TRS PREPROCESS
- Run STP_preTraces.sh for ECO, HLT, WLD, TRS preTraces and preGrids
- Run STP_preVideo.sh for desired videos (probably HLT)
- Run STP_pstProcess.sh for HLT, WLD, TRS post process
Folders and files follow this naming convention:
E_rer_ren_rsg_fic_gsv
rer
: Release ratio (x1e8)ren
: Number of weekly releases (x1)rsg
: In and out of frame resistance generation (x1e8)fic
: Trans-gene fitness cost (x1e8)gsv
: Genetic standing variation (x1e8)
For the breakdown of the AOI sets, look at the gene definitions STP.
Exported metrics (MOI) for the drive are:
- WOP: Total sum of time below the threshold
- TTI: First break below the threshold
- TTO: Last break below the threshold
- RAP: Fraction of the population with the genes at given points of time
- MNX: Minimum and maximum of genes in the population
Summary statistic files follow this naming convention:
AOI_MOI_QNT_qnt.csv
Where the main AOI was HLT (presence of mosquitoes) and the outputs (labels) are:
- TTI, TTO, WOP: Fraction's threshold for the metric to be true
- RAP: Fraction of present genotypes at given points (days) of the simulation
- MNX: Min/Max and days at which these are achieved
To put all the data through the ML classification pipeline, run:
chmod +x STP_clsPipeline.sh
./STP_clsPipeline.sh
which launches the following scripts in order:
Summary datasets are named as follows:
D_SEX_AOI_MOI_QNT_qnt.csv
: Experiment set for each of sexes combinations in the releases.Full_AOI_MOI_QNT_qnt
: Collated dataset with sexes variable added as column.CLN_AOI_MOI_QNT_qnt
: Collated dataset with sexes variables one-hot encoded.
For the AOI definitions, look at the .