Releases: yycho0108/CartPole_tf

Stable working version

03 May 09:19

First stable-ish working version of DDRQN with an engineered reward,
but with a state penalty (no velocity information is given).

Auto-generated parameters:

  • TAU : 0.001
  • N_TRACE : 8
  • N_X : 4
  • DESCRIPTION :
  • HS : [32, 64]
  • LR_DECAY_STEPS : 10000000
  • N_BATCH : 32
  • EPS_MIN : 0.05
  • EPS_ANNEAL : 2000000
  • N_TEST : 200
  • N_PRE : 100000
  • N_MEM : 10000
  • LR_MAX : 0.0001
  • N_H : 64
  • LR_MIN : 1e-05
  • U_FREQ : 64
  • EPS_DECAY : 0.999998502135
  • EPS_INIT : 1.0
  • N_A : 2
  • GAMMA : 0.99
  • N_EPOCH : inf
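
The listed EPS_DECAY value appears consistent with a per-step multiplicative decay that takes epsilon from EPS_INIT to EPS_MIN over EPS_ANNEAL steps. The sketch below reproduces that number from the three other parameters. Whether the repository computes it exactly this way is an assumption, and the helper `epsilon_at` is a hypothetical name used only for illustration.

```python
# Values copied from the auto-generated parameter list above.
EPS_INIT = 1.0
EPS_MIN = 0.05
EPS_ANNEAL = 2000000

# Assumed relation: the per-step factor that carries EPS_INIT down to
# EPS_MIN in exactly EPS_ANNEAL multiplicative steps.
eps_decay = (EPS_MIN / EPS_INIT) ** (1.0 / EPS_ANNEAL)


def epsilon_at(step):
    """Hypothetical helper: exploration rate after `step` decay steps,
    clamped so it never drops below EPS_MIN."""
    return max(EPS_MIN, EPS_INIT * eps_decay ** step)


if __name__ == "__main__":
    print("derived EPS_DECAY: %.12f" % eps_decay)
    for step in (0, EPS_ANNEAL // 2, EPS_ANNEAL):
        print(step, epsilon_at(step))
```

Under this assumption, epsilon reaches EPS_MIN after EPS_ANNEAL steps and stays there for the rest of training.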