To chip in: this problem also occurs for SFT benchmarks, and it shadows issue #1482 in the newer pjnl container release (tested on pjnl-20241126). It breaks for phi-3 as well. We don't have NeMo deps in the SFT script, so the failure is probably unrelated to NeMo.
🐛 Bug
For Llama-3-8B, Mistral-7B-v0.1 and Phi-3-mini-4k-instruct we get the following error:
To Reproduce
Please use:
1 GPU (H100)
Image "INTERNAL_IMAGE:pjnl-20241125"
Training script:
Expected behavior
Environment
As in the pjnl image
cc @tfogal