Improving the Performance of Backward Chained Behavior Trees using Reinforcement Learning

An experimentation environment for Reinforcement Learning in Backward Chained Behavior Trees.

For more details, see the related paper on arXiv:2112.13744.

How to set up

Run the commands in the Installation section to install the Python libraries required for the examples in this project.

NB! This has only been tested on Python 3.7 and Project Malmo 0.37.0.
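A minimal sketch of how one might check the interpreter against the tested version before installing anything. The function name `is_tested_python` and the constant `TESTED_PYTHON` are illustrative and not part of this repository.

```python
import sys

# Version this README reports as tested (major, minor).
TESTED_PYTHON = (3, 7)


def is_tested_python(version_info=sys.version_info):
    """Return True when the major.minor version matches the tested one."""
    return tuple(version_info[:2]) == TESTED_PYTHON


if not is_tested_python():
    # Only a warning: other versions may work, they are simply untested.
    print("Warning: running Python %d.%d, only 3.7 has been tested."
          % tuple(sys.version_info[:2]))
```

Running this inside the virtual environment you plan to use shows whether that environment matches the tested setup.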

Installation

All of the pip requirements:

pip install --upgrade pip
pip install -r requirements.txt

Malmo 0.37.0:

See Bootstrapping on how to run Malmo from the pip wheel.

Alternatively, install Malmo locally: https://github.com/microsoft/malmo/releases

Torch

If using CPU:

A regular pip install is sufficient.

If using GPU:

pip3 install torch==1.10.2+cu113 torchvision==0.11.3+cu113 torchaudio===0.10.2+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

CUDA 11.1, if using a GPU: https://developer.nvidia.com/cuda-11.1.0-download-archive
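To confirm which device Torch will actually use after installation, a short check along these lines may help. It relies only on `torch.cuda.is_available()`. The helper name `describe_torch_device` is hypothetical and not part of this repository.

```python
def describe_torch_device():
    """Return 'cuda', 'cpu', or 'torch-missing' depending on what is usable."""
    try:
        import torch  # imported lazily so the check also works before installation
    except ImportError:
        return "torch-missing"
    # True only when a CUDA-enabled build finds a usable GPU and driver.
    return "cuda" if torch.cuda.is_available() else "cpu"


print(describe_torch_device())
```

If this prints `cpu` after a GPU installation, the likely causes are a CPU-only wheel or a driver that does not support the CUDA version of the installed wheel.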

Bootstrapping

If the Malmo wheel installed successfully, you can run bootstrap_malmo() to automatically download the files Malmo needs.

NB! Sometimes the necessary libraries are not picked up automatically by Python, which seems to be caused by an error in how the wheel is set up. In that case, add the library folder to your Python path.
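One way to add a folder to the Python path from inside a script is sketched below. The helper `add_library_dir` and the folder name `malmo_libs` are placeholders; substitute the folder that actually holds the Malmo libraries on your machine. Setting the `PYTHONPATH` environment variable before starting Python has the same effect.

```python
import os
import sys


def add_library_dir(path):
    """Prepend path to sys.path if it is an existing directory.

    Returns True when the directory is on sys.path afterwards.
    """
    path = os.path.abspath(path)
    if not os.path.isdir(path):
        return False
    if path not in sys.path:
        sys.path.insert(0, path)
    return True


# Hypothetical location: replace with the real library folder.
add_library_dir(os.path.join(os.getcwd(), "malmo_libs"))
```

Call this before the first Malmo import so that the interpreter searches the added folder.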

Once bootstrapping is complete, you can start Malmo with run_malmo().

How to run

Once you have done a manual installation or bootstrapped, you should be able to run Malmo locally.
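Before starting a mission, it can be useful to verify that a Malmo client is actually listening. The sketch below assumes the client is on the local machine and uses port 10000, which is Malmo's usual default; adjust both if your setup differs. The helper `is_port_open` is illustrative and not part of this repository.

```python
import socket


def is_port_open(host="127.0.0.1", port=10000, timeout=1.0):
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# Assumed default: a local Malmo client on port 10000.
print("Malmo reachable:", is_port_open())
```

A `False` result usually means the client has not finished starting or was launched on a different port.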

Once Malmo is running, you can start one of the test mission configurations in main.py to train an agent.

Alternatively, you can run one of the evaluations on one of the included results from our own experiments. Archives are available under GitHub Releases for this repository.

How to view the Stable-Baselines integrated tensorboard

Logging is already set up. Running the training examples writes TensorBoard files to a separate directory, which can be viewed with the following command.

tensorboard --logdir <tensorboard directory>
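If TensorBoard shows no data, a quick way to check whether a directory contains any event files is sketched below. It relies on TensorBoard event files having `tfevents` in their file names. The helper `find_event_files` and the directory name `tensorboard_logs` are placeholders, not names used by this repository.

```python
import os


def find_event_files(logdir):
    """Return sorted paths of TensorBoard event files found under logdir."""
    matches = []
    for root, _dirs, files in os.walk(logdir):
        for name in files:
            if "tfevents" in name:
                matches.append(os.path.join(root, name))
    return sorted(matches)


# Hypothetical directory name: use the one your training run wrote to.
for path in find_event_files("tensorboard_logs"):
    print(path)
```

An empty result means either that training has not written any logs yet or that the directory passed to `--logdir` is not the one the run used.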
