A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
-
Updated
Feb 10, 2019 - Python
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Master Thesis: Limit order placement with Reinforcement Learning
💥💥 This is a easy installable extension for OpenAi Gym Environment. This simulates SpaceX Falcon landing.
Training a vision-based agent with the Deep Q Learning Network (DQN) in Atari's Breakout environment, implementation in Tensorflow.
Distributed RL implementations
A collection of Reinforcement Learning algorithms.
A TensorFlow distributed implementation of DeepMind's PathNet architecture for RL training with OpenAI Gym agents. Modified from jaesik817's implementation.
Gym is a toolkit for developing and comparing reinforcement learning algorithms.
Implementation of Deep Q Learning algorithms in pytorch
💥💥 This is a easy installable extension for OpenAi Gym Environment. This simulates SpaceX Falcon landing.
teaching an agent to play the Lunar Lander game from OpenAI Gym with REINFORCE algorithm
Basic Q learning algorithm to solve reaching task on 2D robot arm environment
Add a description, image, and links to the openai-gym-agents topic page so that developers can more easily learn about it.
To associate your repository with the openai-gym-agents topic, visit your repo's landing page and select "manage topics."