Refactor poc methods and games #1

zhjngli · 2024-03-07T01:45:08Z

The current simple Q and MonteCarlo learners use inheritance to implement the game specific nuances that it's training. I'm not the biggest fan of inheritance and being able to compose the games along with the learners instead feels a lot cleaner. This branch is an attempt to do that. However, the result is a bit clumsy, so just leaving it in a branch for now.

I think it'd help to write out each use case. The current learners are the simple Q learner, Monte Carlo, and Alpha Zero. Each has slightly different usages of the games that it trains on, so it's tough to generalize everything, or at least, it would require more thought and planning.

zhjngli added 3 commits March 6, 2024 15:22

refactor simple q learner to compose with game class

d9924f7

refactor digit party to new q learner structure

29f79e8

refactor random walk to new q learner structure

b1d83a1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor poc methods and games #1

Refactor poc methods and games #1

zhjngli commented Mar 7, 2024

Refactor poc methods and games #1

Are you sure you want to change the base?

Refactor poc methods and games #1

Conversation

zhjngli commented Mar 7, 2024