-
Notifications
You must be signed in to change notification settings - Fork 93
Open
Description
Hi expert
I can use the sigle machine train,but how to do distributed region training use this script or there is some good sample to help me understand
Using 2 GPUs
ZERO_BAND_LOG_LEVEL=DEBUG ./scripts/simulate_multi_node_diloco.sh 2 1 src/zeroband/train.py @configs/debug/diloco.toml
Thanks
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels