Developed various model-based and model-free Intelligent and Naive algorithms for the beam balance environment in OpenAI Gym.
deep-reinforcement-learning
epsilon-greedy-exploration
boltzman-policy-reward
variational-pid-controller
-
Updated
Mar 29, 2021 - Jupyter Notebook