Shouldn't Input of Critic be hidden state of RNN? #5

ychen306 · 2018-11-15T05:08:46Z

Hi Faraz,
I am studying the paper and your implementation is very helpful! I have a question though. It seems that the critic network in the paper takes in history -- which in this case is hidden state of the actor's LSTM, I presume -- rather than the observed state of the environment.

https://github.com/fshamshirdar/pytorch-rdpg/blob/master/rdpg.py#L139-L141

HassamSheikh · 2019-02-08T01:06:24Z

I was looking at exactly the same thing. Got your answer?

zhihanyang2022 · 2021-04-21T09:24:57Z

I think this is a valid concern. Making state information available to the critic makes this implementation incorrect.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shouldn't Input of Critic be hidden state of RNN? #5

Shouldn't Input of Critic be hidden state of RNN? #5

ychen306 commented Nov 15, 2018 •

edited

Loading

HassamSheikh commented Feb 8, 2019

zhihanyang2022 commented Apr 21, 2021

Shouldn't Input of Critic be hidden state of RNN? #5

Shouldn't Input of Critic be hidden state of RNN? #5

Comments

ychen306 commented Nov 15, 2018 • edited Loading

HassamSheikh commented Feb 8, 2019

zhihanyang2022 commented Apr 21, 2021

ychen306 commented Nov 15, 2018 •

edited

Loading