Critic Network Error and Poor Rewards in mujoco_sac.py #1166
Unanswered
payamparvizi
asked this question in
Q&A
Replies: 1 comment
-
Hi, sorry for the late answer. It's better to open an issue instead of a discussion, I rarely look at those ;) I'm going to look into it |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I am trying to run the code in
tianshou/examples/mujoco/mujoco_sac.py
. However, I got an error in the critic networks (net_c1
andnet_c2
). The error message indicates that settingconcat=True
causes a size mismatch.I commented that part and ran the code, but I did not get good rewards from the Ant-v4 environment as expected. To enable
concat=True
, I customized the network, but the results were still not sufficient. The rewards I get are always below 0, no matter how long I run it.Is it possible that the hyperparameters are incorrect?
Please let me know how I can solve this problem.
Regards
Beta Was this translation helpful? Give feedback.
All reactions