Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)
reinforcement-learning flax optimal-control value-iteration continuous-control jax hamilton-jacobi-bellman hamilton-jacobi continuous-value-iteration
Updated Feb 1, 2022 - Python