All the code for the hands-on exercies can be found in this repository.
Table of Contents
To request an account on Zaratan, please join slack at the link above, and fill this Google form.
We have pre-built the dependencies required for this tutorial on Zaratan. This will be activated automatically when you run the bash scripts.
The training dataset i.e. MNIST has also
been downloaded in /scratch/zt1/project/isc/shared/data/MNIST
.
cd session_1_basics/
sbatch --reservation=isc2024 run.sh
MIXED_PRECISION=true sbatch --reservation=isc2024 run.sh
CHECKPOINT_ACTIVATIONS=true sbatch --reservation=isc2024 run.sh
cd session_2_data_parallelism
sbatch --reservation=isc2024 run_ddp.sh
sbatch --reservation=isc2024 run_deepspeed.sh
cd session_3_intra_layer_parallelism
sbatch --reservation=isc2024 run.sh
cd session_4_inter_layer_parallelism
sbatch --reservation=isc2024 run.sh
HYBRID_PARR=true sbatch --reservation=isc2024 run.sh