This project deals with the problem of Bayesian multitask inverse reinforcement learning (Bayesian MT-IRL) for Partially Observable Markov Decision Processes (POMDPs): consider an agent whose policy is parametrized by a vector w, whose prior distribution is a mixture of multivariate Gaussians, where the parameters of the Gaussian components
are sampled from a Normal-Inverse-Wishart hyperprior. The goal is to infer the posterior distribution of these parameters
from the trajectories observed in the MDP.
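As a point of reference, here is a minimal, illustrative sketch of how such a hierarchical prior generates the policy parameters w. The hyperparameter values and function names are assumptions made for illustration; this is not the code from the core folder.

    # Illustrative sketch of the generative model described above:
    # a Normal-Inverse-Wishart hyperprior generates the mean/covariance of each
    # Gaussian mixture component, and each task's policy parameter w is then
    # drawn from one of those components.
    import numpy as np
    from scipy.stats import invwishart

    def sample_mixture_components(n_components, dim, rng):
        # NIW hyperparameters (assumed values, for illustration only)
        mu0, kappa0, nu0, Psi0 = np.zeros(dim), 1.0, dim + 2, np.eye(dim)
        components = []
        for _ in range(n_components):
            Sigma = invwishart.rvs(df=nu0, scale=Psi0, random_state=rng)
            mu = rng.multivariate_normal(mu0, Sigma / kappa0)
            components.append((mu, Sigma))
        return components

    def sample_policy_parameter(components, rng):
        # Pick a mixture component (uniform weights here) and draw w from it.
        mu, Sigma = components[rng.integers(len(components))]
        return rng.multivariate_normal(mu, Sigma)

    rng = np.random.default_rng(0)
    components = sample_mixture_components(n_components=3, dim=2, rng=rng)
    w = sample_policy_parameter(components, rng)

Inference then works in the opposite direction: given trajectories generated by policies with unknown w's, recover the posterior over the w's and the mixture parameters.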
Relevant articles:
- Lazaric et al., Bayesian multitask reinforcement learning
- Dimitrakakis et al., Bayesian multitask inverse reinforcement learning
- Choi and Kim, Nonparametric Bayesian inverse reinforcement learning for multiple reward functions
This code requires the sacred library (https://github.com/IDSIA/sacred).
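It can typically be installed from PyPI (assuming the standard package name):

    pip install sacred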
There are 4 main folders: core, experiments, envs and test.
- core: contains the main algorithms used for the inference.
- test: unit tests for the functions in core.
- envs: definitions of the RL environments used for our experiments.
- experiments: contains one .py file per experiment, and a logs folder to store the results.
This project uses the sacred library, which allows the user to tune experiment parameters from the command line.
To run the tests, run the unittest module on the desired test script from the root folder. Example:
python -m unittest test.test_bp
To run an experiment, simply run the desired Python script from the root folder:
python -m experiments.exp_chain
You can tune some hyperparameters, such as the number of MDPs to infer or the length of the trajectories, by setting them in the CLI. For example, for a trajectory length of 10 and 20 MDPs, simply run
python -m experiments.exp_chain with T=10 M=20
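Under the hood, sacred picks up these with key=value overrides through the experiment's config function. Below is a minimal, hypothetical sketch of how such a script could be structured; the names and default values are illustrative, not the actual contents of experiments/exp_chain.py.

    # Hypothetical sketch of a sacred experiment script; the real
    # experiments/exp_chain.py may be organised differently.
    from sacred import Experiment

    ex = Experiment('chain')

    @ex.config
    def config():
        T = 50  # trajectory length, overridable with "with T=10"
        M = 5   # number of MDPs, overridable with "with M=20"

    @ex.automain
    def main(T, M):
        # run the inference with the chosen hyperparameters
        print(f'Running exp_chain with T={T} and M={M}')

Any variable defined in the @ex.config function becomes a parameter that can be overridden from the command line in this way.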