Q-Learning

Q-learning is an off-policy reinforcement learning algorithm that learns the value of each action in each state, so that the agent can pick the best action for its current state. It is considered off-policy because the Q-learning update learns from actions taken outside the current policy, such as random exploratory actions. The policy that generates the experience therefore does not have to be the policy being learned.
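
To make the off-policy idea concrete, here is a minimal tabular sketch. It is not the contents of sample_scratch.py: the five-state chain environment, the function names (`step`, `train_q_table`, `greedy_policy`) and the hyperparameters are all assumptions chosen for illustration. The agent acts uniformly at random, yet the update bootstraps from the best next action, so the learned table still describes the greedy policy.

```python
import random

N_STATES = 5      # hypothetical chain: states 0..4, state 4 is the terminal goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption for this sketch).

    Returns (next_state, reward, done). Moving left from state 0 stays put.
    """
    if action == 1:
        next_state = state + 1
    else:
        next_state = max(state - 1, 0)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train_q_table(episodes=500, alpha=0.5, gamma=0.9, max_steps=200, seed=0):
    """Run tabular Q-learning with a purely random behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            # Behaviour policy: uniformly random, independent of q.
            action = rng.choice(ACTIONS)
            next_state, reward, done = step(state, action)
            # Off-policy target: value of the best next action,
            # regardless of which action will actually be taken next.
            best_next = 0.0 if done else max(q[next_state])
            td_target = reward + gamma * best_next
            q[state][action] += alpha * (td_target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Best action per non-terminal state according to the learned table."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train_q_table()
    print("Greedy action per state (1 = right):", greedy_policy(q_table))
```

In practice an epsilon-greedy behaviour policy is the more common choice. The fully random one is used here only because it makes the gap between the acting policy and the learned policy easy to see.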

Code

python sample_scratch.py

Useful Resources: