
Feat/add impala #138

Merged
merged 5 commits into main from feat/add_impala
Mar 9, 2025
Conversation

EdanToledo
Copy link
Owner

What?

Add an IMPALA Sebulba system.

Why?

Useful to have a Sebulba system that is not PPO.

@EdanToledo EdanToledo merged commit e33a7fa into main Mar 9, 2025
3 checks passed
@EdanToledo EdanToledo deleted the feat/add_impala branch March 9, 2025 20:35
@roger-creus
Copy link

Are there any performance benchmarks with the IMPALA implementation? :)

@EdanToledo
Copy link
Owner Author

Hi, unfortunately not. I don't have the time to do large-scale performance benchmarks. I wanted to do an Atari-5 run, which I still might do, but comparing it against the original IMPALA implementation, which is recurrent, might be a little unfair.

@roger-creus
Copy link

I understand. I was just wondering if you have seen it train, at least.

@EdanToledo
Copy link
Owner Author

Oh, yes, I have seen it learn to solve all the simple EnvPool environments, although I will say it's worse than PPO in terms of sample efficiency and performance (which makes sense, since it only updates once on the data). It also seemed to be more finicky with hyperparameters.
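For context on the "updates once on the data" point: IMPALA consumes each trajectory in a single gradient step and corrects for the actor/learner policy lag with V-trace importance weighting, whereas PPO re-uses each batch for multiple epochs. Below is a minimal NumPy sketch of the V-trace target computation from Espeholt et al. (2018); the function name and array layout are illustrative, not taken from this repository.

```python
import numpy as np

def vtrace_targets(behaviour_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Compute V-trace value targets for one trajectory of length T.

    behaviour_logp / target_logp: log-probs of the taken actions under the
    actor (behaviour) and learner (target) policies, shape [T].
    values: learner value estimates V(x_t), shape [T].
    bootstrap_value: V(x_T) used to bootstrap after the last step.
    """
    # Importance ratios pi(a|x) / mu(a|x), clipped as in the paper.
    rhos = np.exp(target_logp - behaviour_logp)
    clipped_rhos = np.minimum(rho_bar, rhos)   # weights the TD errors
    cs = np.minimum(c_bar, rhos)               # "trace" coefficients

    # One-step TD errors, bootstrapping with V(x_{t+1}).
    values_tp1 = np.concatenate([values[1:], [bootstrap_value]])
    deltas = clipped_rhos * (rewards + gamma * values_tp1 - values)

    # Backward recursion: v_s = V(x_s) + delta_s + gamma * c_s * (v_{s+1} - V(x_{s+1})).
    vs = np.zeros_like(values)
    acc = 0.0
    for t in reversed(range(len(rewards))):
        acc = deltas[t] + gamma * cs[t] * acc
        vs[t] = values[t] + acc
    return vs
```

In the fully on-policy case (ratios all equal to 1) the targets reduce to ordinary n-step returns; the clipping only kicks in when the actors' policy has drifted from the learner's, which is exactly the situation a Sebulba-style decoupled actor/learner setup creates.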
