[litgpt benchmark] enable force_recompute_fp8_weight_in_bwd
when torchao.float8
is used with FSDP2
#1994
Job | Run time |
---|---|
13s | |
13s |