Coco provides an open source ILP solution for variations of the Reserve Selection problem including connectivity functionality.
Note: this is a pre-release version and not a full version yet
We recommend using a conda environment (https://conda.io) with Python version 3.10.6 and install the following packages:
- Gurobi 9.5.2: a Gurobi license is needed to run Coco (https://www.gurobi.com/). Inside the conda environment run:
python -m pip install gurobipy
- Seaborn 0.12.0:
conda install -c anaconda seaborn
- Python 3.10.6
- Pandas 1.4.4
- Matplotlib 3.5.3
- NetworkX 2.8.6
- Seaborn 0.12.0
To run the scripts in the scripts folder, geopandas is needed. These scripts work only on the data provided in this repository.
- Geopandas 0.11.1:
conda install --channel conda-forge geopandas
Go to the src
folder:
cd src
Currently, Coco has three RSP variants installed:
- RSP with Connectivity Features (RSP-CF)
- RSP with Cost Connectivity (RSP-CC)
- RSP with Cost Budget and Connectivity (RSP-CBC).
To get help on how to run a specific variant, type:
python coco.py --help
To get help with the parameters of a specific variant RSP-VARIANT
, type:
python coco.py RSP-VARIANT --help
To run Coco enter:
python coco.py RSP-VARIANT [global params] [variant params] [gurobi params]
Below we give a detailed explanation of each parameter
The following parameters are available for all RSP variants:
Here we explain the parameters in Coco for input and output files and folders. For more information on the input files needed and output files created, see the documentation in the doc
folder.
To specify the input folder (required):
--input INPUT_FOLDER
This can either be a relative or absolute path. This folder should contain all files needed for the basic RSP.
To specify the output folder (required):
--output OUTPUT_FOLDER
This can either be a relative or absolute path.
The data to be used in the connectivity metric can be provided in different formats. For all formats either a relative path or an absolute path to the file can be given. For an explanation of the exact format of the files, see the documentation in the doc
folder.
At least one of --con-edgelist
, feature-edgelist
, or con-matrix
is required.
If only one connectivity dataset should be considered, one connectivity edgelist can be provided
--con-edgelist FILE
In case multiple connectivity dataset should be considered, e.g., one for each feature, one feature-edgelist can be provided containing all datasets for each feature. Note that the features in the edgelist do not have to be the same as in the feature.csv
--feature-edgelist FILE
Alternatively, one connectivity matrix can be specified
--con-matrix FILE
In case the selected metric, e.g., equivalent connectivity, requires attribute data for each planning unit, this should be provided using (optional):
--pu-data FILE
We currently have the following metrics installed: indegree, outdegree, betweenness centrality and equivalent connectivity. For more information on each metric, see the documentation in the doc
folder. We are expanding the number of metrics Coco has installed, for requests feel free to contact me.
To specify the metric(s) used (required):
--metric {bc, indegree, outdegree, ec}
Coco can run multiple metrics at the same time. To do this repeat the parameter for each metric you want to run, e.g., if we want to run betweenness centrality and equivalent connectivity:
--metric bc -metric ec
To exclude planning units with metric values under/above a specified threshold value from the connectivity optimization, set (optional):
--metric-min VALUE
or
--metric-max VALUE
To exclude planning units with metric values under/above a value depending on the median or mean value of all metric values (optional):
--metric-min-type {median, mean}
or
--metric-max-type {median, mean}
Note that the values below/above the selected type will be dropped for all metrics
If BC is the metric and the connectivity data is a complete graph, connectivity values from the input data below the mean
or median
can be dropped as follows:
--complete-graph {median, mean}
This is needed for BC, since by definition the BC for vertices in a complete graph is 0.
The following parameters are only available for the RSP-CF variant. It is required to set exactly one of the following:
Set the target for the connectivity feature to be reached to a proportion of the total metric value:
--metric-prop VALUE
Set the target of the connectivity featerure to be reached to a specific value:
--metric-target VALUE
Note that these settings are applied to all considered metrics. In case of multiple metrics, metric-prop
should be used since each metric can have very different values.
The following parameters are only available for the RSP-CC variant. Both parameters are used to balance the weight between the cost and the metric values while optimizing.
To set the weight of the metric values (required):
--metric-weight
To set the weight of the cost (optional):
--cost-weight
Balancing this correctly requires some knowledge on the data and metric values. During optimization the metric values are min max normalized and thus can have values [0,1] on each vertex or edge depening on the metric. Setting the cost too low compared to the metric values will result in a negative objective value, which should be prevented at all times. Setting the metric values too low (especially in case of very small values) result in a longer runtime to find the optimal result.
The following parameters are only available for the RSP-CBC variant. It is required to set either one of the values. If a fixed cost is wanted, setting -max-cost
is sufficient, since the optimizer will automatically maximize the connectivity values by taking as many planning units as possible, i.e., maximize the cost.
To set the max cost use:
--max-cost COST
To set the min cost use:
--min-cost COST
These settings are related to the ILP solver, Gurobi. In specific circumstances it might be helpful to adjust the standard settings of Gurobi. All these parameters are optional. For more information on these parameters visit the Gurobi documentation.
To set the minimal gap to the optimal solution that the solver has to find set:
--gap VALUE
The solver will stop when it find a solution within VALUE * 100% of the optimal value. It is not untypical for the ILP solver to find a solution close to optimal relatively quickly and then needing a lot of time to find the last percentages. In these cases, setting a gap will decrease the runtime significantly, without too much influence on the found solution (depending on the gap set).
Instead of setting a gap it is also possible to specify a time limit for the solver. This can be done by giving the max time in seconds after which the solver will report the best solution found:
--time-limit VALUE
Keep in mind that the total runtime of Coco will be longer than the time limit set since this only applies to time spent in the solver.
To set the number of threads used by Gurobi:
--gurobi-threads
To set the memory used by Gurobi:
--gurobi-mem