Check restarting/handling of pending config when resuming a run #30
Comments
This happens when the process is force-killed during the evaluation of a config, and is reproducible with a single process. To reproduce:
Alternatively, you can skip steps 1-5 and manually delete a |
There have been some developments here:
However, what should happen if you Fixing this is non-trivial, although there's some patchwork to make this less bad. a) When a I'll implement the |
I did an implementation in #129 which should be robust to common occurrences such as CTRL+C as well as SLURM, which sends process signals. The only exception is |
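To illustrate the general idea of reacting to process signals, here is a minimal sketch. It is not the implementation from #129: the marker file name `.pending`, the function `evaluate_with_cleanup`, and the one-line `result.yaml` content are all hypothetical. It only covers signals that Python can trap (here `SIGINT` and `SIGTERM`) and must run in the main thread.

```python
import signal
from pathlib import Path
from typing import Callable

PENDING_MARKER = ".pending"  # hypothetical marker file, not the project's real layout


class EvaluationInterrupted(Exception):
    """Raised inside the worker when a trappable signal arrives."""


def _raise_interrupted(signum, frame):
    raise EvaluationInterrupted(f"received signal {signum}")


def evaluate_with_cleanup(config_dir: Path, evaluate: Callable[[], float]) -> str:
    """Evaluate one config and release its pending marker even if interrupted."""
    marker = config_dir / PENDING_MARKER
    marker.touch()  # claim the config

    # Temporarily route CTRL+C (SIGINT) and SIGTERM to an exception.
    handled = (signal.SIGINT, signal.SIGTERM)
    previous = {sig: signal.signal(sig, _raise_interrupted) for sig in handled}
    try:
        loss = evaluate()
        (config_dir / "result.yaml").write_text(f"loss: {loss}\n")
        status = "done"
    except EvaluationInterrupted:
        # No result was written; the config can be picked up again later.
        status = "released"
    finally:
        marker.unlink(missing_ok=True)  # never leave the config pending
        for sig, handler in previous.items():
            signal.signal(sig, handler)  # restore the original handlers
    return status
```

The `finally` block is the important part: whichever way the evaluation ends, the config is no longer marked as pending, so a later run can decide whether to re-evaluate it.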
For potential reproducibility of the observed issue:
- (`max_evaluations_total`) evaluations distributed across 4 workers
- `16` `21` was generated while config ID `16` was not re-evaluated or completed and remains `pending` forever

Some more observations:
- With `max_evaluations_total=20`, we should have config IDs from 1-20, each of them having its own `result.yaml`
- `config_16` does not have a `result.yaml`, whereas `config_21` does
- With `max_evaluations_total=21`, it now satisfies that extra config required by sampling a new config, `config_22`

Should a new worker re-evaluate pending configs as a priority?

Also, with this issue or under this scenario, the generated config IDs range from `[1, n+1]` if `max_evaluations_total=n`.
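One way the "pending first" question could be answered is sketched below. It assumes only the layout mentioned above (`config_<id>` directories that contain a `result.yaml` once finished); the function names `unfinished_config_ids` and `next_action` are hypothetical and are not part of the project's API. The sketch also cannot tell a config that is still being evaluated by a live worker from one that was abandoned, so a real implementation would need some locking or liveness check on top.

```python
from pathlib import Path
from typing import List, Optional, Tuple


def _config_ids(results_dir: Path) -> List[int]:
    """IDs of all config_<id> directories found under results_dir."""
    ids = []
    for entry in results_dir.glob("config_*"):
        suffix = entry.name.split("_", 1)[1]
        if entry.is_dir() and suffix.isdigit():
            ids.append(int(suffix))
    return sorted(ids)


def unfinished_config_ids(results_dir: Path) -> List[int]:
    """IDs whose directory exists but has no result.yaml yet."""
    return [
        i for i in _config_ids(results_dir)
        if not (results_dir / f"config_{i}" / "result.yaml").exists()
    ]


def next_action(results_dir: Path, max_evaluations_total: int) -> Tuple[str, Optional[int]]:
    """Prefer unfinished configs over sampling a new one."""
    unfinished = unfinished_config_ids(results_dir)
    if unfinished:
        return ("re-evaluate", unfinished[0])
    existing = _config_ids(results_dir)
    if len(existing) >= max_evaluations_total:
        return ("stop", None)
    return ("sample", max(existing, default=0) + 1)
```

With the scenario from this comment (configs 1 to 21 on disk, `config_16` lacking `result.yaml`), `next_action(results_dir, 20)` would return `("re-evaluate", 16)` rather than letting the ID range grow to `n+1`.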