Skip to content

Latest commit

 

History

History
13 lines (10 loc) · 285 Bytes

README.md

File metadata and controls

13 lines (10 loc) · 285 Bytes

zuma-flow

Ai learns to play zuma

baseline evaluation with random policy, 10 episodes: 81.0

todo:

[ ] add verbose mode to zumaEnvirnment

work plan:

[ ] learn dqn
[ ] implement dqn with 1D action space :(
[ ] research other multi-dimentional action space solutions :)