Improves on the SeqGAN idea by adding more reinforcement learning and GAN techniques, such as (a few of these are sketched below):
- a replay buffer
- Consensus Optimization
- count-based exploration bonus
- Proximal Policy Optimization (this was not found to help, but the implementation is available in this commit)
- advantage normalization
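
For concreteness, here is a minimal sketch of a few of these pieces. The names (`ReplayBuffer`, `normalize_advantages`, `count_bonus`) and the hyperparameter defaults are illustrative assumptions, not this repo's actual API:

```python
import random
from collections import deque

import torch


class ReplayBuffer:
    """Fixed-size pool of past generator samples to train the discriminator on.

    What gets stored (sequences, rewards, etc.) and the capacity are
    illustrative choices, not necessarily what this repo does.
    """

    def __init__(self, capacity=10_000):
        self.buffer = deque(maxlen=capacity)

    def push(self, sequence):
        self.buffer.append(sequence)

    def sample(self, batch_size):
        return random.sample(self.buffer, min(batch_size, len(self.buffer)))


def normalize_advantages(advantages, eps=1e-8):
    """Advantage normalization: rescale a batch of advantages (a torch
    tensor) to zero mean and unit standard deviation before the policy
    gradient update."""
    return (advantages - advantages.mean()) / (advantages.std() + eps)


def count_bonus(counts, beta=0.1):
    """Count-based exploration bonus: add beta / sqrt(N(s)) to the reward,
    so rarely-visited states earn a larger bonus."""
    return beta / torch.sqrt(counts.float())
```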
If you wish to enable Consensus Optimization (via the `--grad-reg` option), you'll need to patch PyTorch to allow forcing the use of a twice-differentiable RNN.
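
For context, the regularizer behind `--grad-reg` is Consensus Optimization (Mescheder et al., 2017), which penalizes the squared norm of both players' gradients. A rough sketch of the idea follows; the function name and argument layout are assumptions, not the exact code in this repo:

```python
import torch


def consensus_regularizer(d_loss, g_loss, d_params, g_params):
    """Consensus Optimization penalty: L = 0.5 * ||grad||^2 over the joint
    gradient field of discriminator and generator.

    create_graph=True keeps a graph through the gradients themselves so the
    penalty can be backpropagated. That second differentiation is the double
    backward that stock cuDNN RNN kernels reject, hence the patch above.
    """
    d_grads = torch.autograd.grad(d_loss, list(d_params), create_graph=True)
    g_grads = torch.autograd.grad(g_loss, list(g_params), create_graph=True)
    return 0.5 * sum(g.pow(2).sum() for g in d_grads + g_grads)
```

The returned penalty would be added to each player's loss before its optimizer step.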
`python3 main.py` will run the project with the default options. Output will be written to the `run/` directory.
The `em` tool makes it really easy to twiddle hyperparameters by tracking changes to code (no need to make everything an option!).
Just run `em run -g 0 exp_name` with your desired options and you'll find a reproducible snapshot in `experiments/<exp_name>`!
If you want to resume from a snapshot (perhaps with different options), use `em resume -g 0 exp_name ...`
You can also fork an experiment and its changes using `em fork`, but the quick and dirty solution is to run `bash scripts/make_links.sh` :)