-
Notifications
You must be signed in to change notification settings - Fork 29
Open
Description
Hi @Yangsenqiao
I have some questions about the code.
-
How many hours are required to train models using bash scripts/run_efficient_gpt4o_judge.sh?Are you using 32 A100 GPUs with 80GB memory each for this training?
-
The code saves checkpoints every 5 steps. How should we select the checkpoint for final evaluation to reproduce the results in Table 2 of the paper? Do we choose the checkpoints with the highest validation accuracy reward?
-
For the visionthink results presented in the paper, were they obtained using gpt4o-as-judge or qwen-as-judge?
Looking forward to your reply. Thank you!
Metadata
Metadata
Assignees
Labels
No labels