Would be nice to compare footprint with and without zero stage 1. It is currently not implemented https://github.com/jfc4050/dlcalc/blob/8cec28f81017b2142feb0657255a1f4b87e546b6/dlcalc/training_3d.py#L226