Mini-project completed as part of the CS-456 Artificial Neural Networks course at EPFL.
The goal was to teach an agent to play the Lunar Lander game from OpenAI Gym using the REINFORCE algorithm. We investigated how different baselines affect the training of the agent.
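
To make the role of a baseline concrete, here is a minimal sketch of the per-step weights that REINFORCE applies to the gradient of log pi(a_t | s_t). It is an illustration only, not the notebook's code: the function names `discounted_returns` and `reinforce_weights`, the discount factor, and the constant mean-return baseline are all assumptions chosen for the example.

```python
from typing import List, Optional


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_weights(
    rewards: List[float],
    gamma: float = 0.99,
    baseline: Optional[List[float]] = None,
) -> List[float]:
    """Return the weights (G_t - b_t) that scale grad log pi(a_t | s_t).

    With baseline=None this is plain REINFORCE (b_t = 0).
    Any baseline that does not depend on the action keeps the gradient
    estimate unbiased and can reduce its variance.
    """
    returns = discounted_returns(rewards, gamma)
    if baseline is None:
        baseline = [0.0] * len(returns)
    return [g - b for g, b in zip(returns, baseline)]


# Hypothetical three-step episode, with gamma chosen for easy arithmetic.
episode_rewards = [1.0, 0.0, 2.0]
returns = discounted_returns(episode_rewards, gamma=0.5)

# Assumed baseline for illustration: the mean return of the episode.
mean_baseline = [sum(returns) / len(returns)] * len(returns)
weights = reinforce_weights(episode_rewards, gamma=0.5, baseline=mean_baseline)
```

In a full training loop, each weight would multiply the log-probability of the action taken at that step, and the negated sum would serve as the loss. A learned state-value estimate could replace the constant baseline used here.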
The notebook can be opened in Google Colab, where the gym imports are taken care of.