This project will be home to my reinforcement learning experiments using DL4J. So far there is a toy test case for DQN in a gridworld.
- Dual DQN
- One-step TD actor-critic
// Create a DQN solver (hidden layers have ReLU activation and output layer has softmax activation)
DQN dqn = new DQN.DQNBuilder()
.hiddenLayers(new int[] {30, 30, 30})
// Try to solve the gridworld using DQN (10000 epochs, max 200 steps per epoch)
GridWorld(), 10000, 200);
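For anyone unfamiliar with the "epochs, max steps per epoch" structure, here is a rough, self-contained sketch of that training loop. It is not this project's DQN code and uses no DL4J calls: it swaps in plain tabular Q-learning on a made-up 5-cell corridor so it can run on its own. The class name `CorridorQLearning`, the `train` method, the corridor environment and all hyperparameters are assumptions chosen purely for illustration.

```java
import java.util.Random;

// Hypothetical illustration only: tabular Q-learning on a 5-cell corridor.
// This is NOT the project's DQN class and not a DL4J API.
class CorridorQLearning {
    static final int CELLS = 5;              // states 0..4, state 4 is the goal
    static final int LEFT = 0, RIGHT = 1;

    // Runs `epochs` episodes of at most `maxSteps` steps each and returns the Q-table.
    static double[][] train(int epochs, int maxSteps, long seed) {
        Random rng = new Random(seed);
        double alpha = 0.5, gamma = 0.9, epsilon = 0.1;   // assumed hyperparameters
        double[][] q = new double[CELLS][2];

        for (int epoch = 0; epoch < epochs; epoch++) {
            int state = 0;                   // every epoch starts in the leftmost cell
            for (int step = 0; step < maxSteps; step++) {
                // Epsilon-greedy action selection; ties go to RIGHT.
                int action = rng.nextDouble() < epsilon
                        ? rng.nextInt(2)
                        : (q[state][RIGHT] >= q[state][LEFT] ? RIGHT : LEFT);

                // Moving left from cell 0 leaves the agent in cell 0.
                int next = action == RIGHT ? state + 1 : Math.max(0, state - 1);
                boolean done = next == CELLS - 1;
                double reward = done ? 1.0 : 0.0;

                // One-step TD target: r + gamma * max_a' Q(s', a'), no bootstrap at the goal.
                double target = reward
                        + (done ? 0.0 : gamma * Math.max(q[next][LEFT], q[next][RIGHT]));
                q[state][action] += alpha * (target - q[state][action]);

                state = next;
                if (done) break;             // epoch ends early once the goal is reached
            }
        }
        return q;
    }

    public static void main(String[] args) {
        double[][] q = train(500, 50, 42L);
        for (int s = 0; s < CELLS - 1; s++) {
            System.out.println("state " + s + ": prefers "
                    + (q[s][RIGHT] > q[s][LEFT] ? "RIGHT" : "LEFT"));
        }
    }
}
```

A DQN follows the same outer loop but replaces the Q-table with a neural network that maps a state to action values, which is where DL4J comes in.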
Thanks to for the minesweeper env.