pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.
-
Updated
Jul 4, 2024 - Python
pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.
Official implementation for "On the Reuse Bias in Off-Policy Reinforcement Learning" (IJCAI 2023)
Add a description, image, and links to the off-policy-reinforcement-learning topic page so that developers can more easily learn about it.
To associate your repository with the off-policy-reinforcement-learning topic, visit your repo's landing page and select "manage topics."