Lunar Lander

Surprise your colleagues working on DQN by learning to land on moon in under 4 minutes using only 32 weights.

Usage

Run land.py to land on Moon!

Run sga_train.py to train a new agent.

See notes if having problems with PyBox2D.

Agents

Gradient Monte Carlo, Semi Gradient and Episodic Semi Gradient agents can be found in agent.py.

Gradient Monte Carlo (GMC)

A GMC [1] agent implementation for solving Lunar Landing task.

Gradient Monte Carlo algorithm works on the basis of SGD, sampling states and rewards from an environment using provided policy and updating a differentiable value-approximation function at the end of sequence.

Weight update is performed separately for each state-action tuple, rather than accumulating the gradient over all sequence.

Notes for Windows Users

If having problems installing PyBox2D using pip, download the wheel from here and install the package using pip install [wheel filename].

References

"Sutton, R. S., Barto, A. G. (20181019). Reinforcement Learning: An Introduction."

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
challenger_figures		challenger_figures
figures		figures
img		img
weights		weights
README.md		README.md
agent.py		agent.py
create_from_trained.py		create_from_trained.py
land.py		land.py
requirements.txt		requirements.txt
sga_train.py		sga_train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lunar Lander

Usage

Agents

Gradient Monte Carlo (GMC)

Notes for Windows Users

References

About

Releases

Packages

Languages

sukruc/lunar-lander-gmc

Folders and files

Latest commit

History

Repository files navigation

Lunar Lander

Usage

Agents

Gradient Monte Carlo (GMC)

Notes for Windows Users

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages