Reinforcement learning: Dynamic Programming, Monte Carlo, Temporal Difference, Deep Q-Learning, Experience Replay, and Twin Delayed DDPG (TD3).
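
To illustrate one of the topics listed, here is a minimal sketch of an experience replay buffer, the kind typically paired with Deep Q-Learning and TD3. The class name `ReplayBuffer`, its methods, and the transition layout are assumptions made for illustration and are not taken from this repository's code.

```python
import random
from collections import deque


class ReplayBuffer:
    """Hypothetical fixed-size FIFO store of (state, action, reward, next_state, done)."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest transition when full.
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        # Store one transition as a plain tuple.
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement reduces the temporal
        # correlation between consecutive transitions.
        batch = random.sample(list(self.buffer), batch_size)
        # Transpose the list of transitions into one tuple per field.
        states, actions, rewards, next_states, dones = zip(*batch)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.buffer)
```

A training loop would usually call `push` after every environment step and call `sample` only once `len(buffer)` is at least the batch size.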

More to come: Dreamer, etc.