Use Hydra for run configuration? #188
Replies: 1 comment
-
|
So just to make sure we're on the same page: The current state:
Adding Hydra in would mean:
The main issue imo
parser = HfArgumentParser(GRPOConfig) which gives you CLI overrides without adding dependencies
BUT i do get the appeal because I like SWE OOP principles so the modularity is nice, plus:
A better approach imho would be to leverage what's already there example: from verifiers import GRPOConfig def get_config(): examples/grpo/train_wordle.pyfrom configs.wordle_base import get_config parser = argparse.ArgumentParser() add key overridesargs, unknown = parser.parse_known_args() OR a potentially better approach is to match the TOML approach of prime-rl (thoughts? @willccbb) i.e. configs/wordle.toml[training] Anyways nothing hard and true, just some thoughts. but i like the ideas fs |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
hydra is a great library for managing complex applications.
I think it could help with running experiments, e.g. by providing reasonable defaults and a CLI to override learning rates, KL regularization weight, distributed setup, etc.
This would be a major change, so starting a discussion here before working on it.
Beta Was this translation helpful? Give feedback.
All reactions