- JADS and TU/e
- www.linkedin.com/in/danil-provodin
Pinned
- batch-bandits (Public): Implementation of popular bandit algorithms in batch environments.
Jupyter Notebook 7
- cmdp (Public): [ICML 2024] Code for the paper "Efficient Exploration in Average-Reward Constrained RL: Achieving Near-Optimal Regret With Posterior Sampling"
Python
- rethinking_lupi (Public): Experiments on knowledge transfer techniques in the LUPI paradigm
Jupyter Notebook 1