Reinforcement Learning: An Introduction

R. S. Sutton and A. G. Barto

This repository contains a Python implementation of the concepts described in the book Reinforcement Learning: An Introduction, by Sutton and Barto. For each chapter you will find a .py file that contains the main implementation, and a .ipynb notebook for quickly viewing the figures on github.com.

The repository is still a work in progress. I will try to move through the book in order; see the roadmap of immediate actions below.

Please feel free to raise issues to ask questions or to flag flaws and mistakes in the implementation.
If you find this useful, I would be grateful if you starred ⭐ it :)

Available problems

Chapter 1: Introduction

Tic tac toe
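
For orientation, the tic-tac-toe example in Chapter 1 of the book learns state values with the update V(S_t) ← V(S_t) + α[V(S_{t+1}) − V(S_t)]. The snippet below is a minimal sketch of that update only. The names `td_update`, `values`, and `step_size`, and the string encoding of board states, are illustrative assumptions and are not taken from this repository's code.

```python
from collections import defaultdict


def td_update(values, state, next_state, step_size=0.1):
    """Move the value of `state` a fraction of the way toward the value of `next_state`."""
    values[state] += step_size * (values[next_state] - values[state])
    return values[state]


# Hypothetical state encoding: unseen states start at 0.5, a won state is worth 1.0.
values = defaultdict(lambda: 0.5)
values["won"] = 1.0
new_value = td_update(values, state="one-move-from-win", next_state="won")
```

With these assumed initial values, the state just before the win moves from 0.5 a tenth of the way toward 1.0.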

Chapter 2: Multi-armed Bandits

Scheduled

Chapter 3: Finite Markov Decision Processes

Scheduled

Chapter 4: Dynamic Programming (figures)

Chapter 5: Monte Carlo Methods (figures)

References

[1] R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, 2018.
[2] Original Code, 2nd Edition. http://incompleteideas.net/book/code/code2nd.html

