This repo attempts to reproduce the results of the paper Multi-Agent Reinforcement Learning with Epistemic Priors by Walker et al. (2023). It is based almost entirely on the Off-Policy Multi-Agent Reinforcement Learning (MARL) Algorithms repository, with changes for epistemic learning as described in the paper.
For original usage, please see original_README.md.
Otherwise, for QMIX training with epistemic priors:
- locally: make train-ep
- on the DTU HPC: make train-ep-hpc

For standard QMIX training:
- locally: make train
- on the DTU HPC: make train-hpc

To modify training parameters, please see train_mpe_qmix_ep.sh and train_mpe_qmix.sh.
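For orientation, the training scripts in the upstream off-policy codebase follow roughly the pattern below. The variable names, script path, and flag values here are illustrative assumptions, not the actual contents of train_mpe_qmix_ep.sh; check that script for what this repo really sets.

```bash
# Illustrative sketch only -- see train_mpe_qmix_ep.sh for the real values.
env="MPE"
scenario="simple_spread"   # one of the cooperative MPE scenarios listed below
num_agents=3
algo="qmix"
exp="ep_debug"             # placeholder experiment name
seed=1

# Script path is relative to the upstream scripts directory and may differ here.
CUDA_VISIBLE_DEVICES=0 python3 train/train_mpe.py \
  --env_name ${env} --algorithm_name ${algo} --experiment_name ${exp} \
  --scenario_name ${scenario} --num_agents ${num_agents} --seed ${seed}
```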
"Playing"/visualization of random MPE Spread scenario:
- with priors:
make play-ep
, edit the MODEL_DIR var in play_mpe_qmix_ep.sh to point to the model you want to play. - NOTE: remove the vglrun command from make play-ep in Makefile if you are not on a compatible system
Pre-trained models included in the repo:
- trained epistemic model (perfect sensing, no priors) in offpolicy/models/epistemic_planner
- models trained with varying fields-of-view + epistemic priors in offpolicy/models/qmix_ep/*
- models trained with varying fields-of-view, no epistemic priors in offpolicy/models/qmix/*
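For example, to play one of the models above, MODEL_DIR in play_mpe_qmix_ep.sh would be pointed at the corresponding directory. The exact assignment syntax in the script may differ; the run directory name below is a placeholder.

```bash
# Point MODEL_DIR at the model you want to visualize; <run_dir> is a
# placeholder for an actual directory under offpolicy/models/qmix_ep/.
MODEL_DIR="offpolicy/models/qmix_ep/<run_dir>"
```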
Installation:
- conda create -n qmix python=3.6 (OR, if you don't have space in your home dir and do have a scratch dir, see the 'DTU HPC more info' section below for how to run the conda create command)
- conda activate qmix
- which python3 # double check points to python bin in conda env
- module load cuda/10.1 # you must run these cuda module load commands before installing torch, otherwise pip will report that the version was not found!
- module load cudnn/v7.6.5.32-prod-cuda-10.1
- python3 -m pip install torch==1.5.1+cu101 torchvision==0.6.1+cu101 -f https://download.pytorch.org/whl/torch_stable.html
- python3 -m pip install -r requirements.txt
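As a quick sanity check (not part of the original instructions, just a suggestion), you can confirm that the CUDA-enabled torch build was installed:

```bash
# Should print 1.5.1+cu101 and, on a node with a visible GPU, True.
python3 -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```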
# install offpolicy package
cd marl_ep
python3 -m pip install -e .
# install this package first
python3 -m pip install seaborn
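Assuming the editable install exposes the package under the name offpolicy (inferred from the model paths above, not stated explicitly in the repo docs), a quick import check can confirm the install:

```bash
# The module name 'offpolicy' is an assumption based on the paths above.
python3 -c "import offpolicy; print(offpolicy.__file__)"
```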
There are 3 Cooperative scenarios in MPE:
- simple_spread
- simple_speaker_listener (the 'Comm' scenario in the paper)
- simple_reference
Paper with our results: https://docs.google.com/document/d/1pBBmoLTj_JPWiCSFYzfHj646bb8uUCh8lMetJxnE68c/edit
DTU HPC job submission:
- edit the email in jobscript.sh to be your own (else: spam me)
- make queue to submit a job to the queue
- make stat to monitor job status
- see the wandb output, e.g. at https://wandb.ai/elles/MPE/runs/z4277c1c?workspace=user-ellesummer
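DTU's HPC uses LSF, so jobscript.sh presumably contains #BSUB directives along the lines of the sketch below. The job name, queue, resources, and final command here are placeholders; the jobscript.sh in this repo is authoritative.

```bash
#!/bin/sh
### Illustrative LSF jobscript sketch -- the real jobscript.sh may differ.
#BSUB -J qmix_ep                  # job name (placeholder)
#BSUB -q gpuv100                  # queue (placeholder)
#BSUB -n 4                        # cores
#BSUB -W 24:00                    # wall-clock limit
#BSUB -R "rusage[mem=8GB]"
#BSUB -u your_email@dtu.dk        # <-- the email line to edit
#BSUB -N                          # notify by email when the job ends
#BSUB -o qmix_ep_%J.out
#BSUB -e qmix_ep_%J.err

bash train_mpe_qmix_ep.sh         # placeholder; the repo's actual command may differ
```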
DTU HPC more info: https://skaftenicki.github.io/dtu_mlops/s10_extra/high_performance_clusters/
If running into python binary issues with conda in your scratch space (aka when using --prefix to point to scratch), make sure to:
- $ conda config --set always_copy True
- $ conda config --show | grep always_copy   # should show: always_copy: True
- $ conda create --prefix=/off-policy/env python=3.6
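Once created with a prefix, the env is activated by its path rather than by name (using the same prefix path as in the create command above):

```bash
# Activate a prefix-based conda env by its full path.
conda activate /off-policy/env
```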