My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.
reinforcement-learning
tensorflow
lstm
dqn
rl
rnd
a3c
per
ddqn
distributed-tensorflow
ppo
dppo
random-network-distillation
dueling-ddqn
n-step
rnd-ppo
n-step-target
n-step-return
Updated Mar 24, 2023 - Jupyter Notebook