Plan for extra-large model support (OPT 175B, Bloom 176B) #46

ccmaymay · 2022-08-04T14:02:14Z

No description provided.

ccmaymay · 2022-08-04T14:10:06Z

@rekriz11 @ruyimarone What would it take to support inference on the largest OPT model (175B, 8 x 80GB A100)? Do you know whether that's possible with the public metaseq software? And do all the GPUs still need to be on the same host, or can they be distributed across hosts?

ccmaymay · 2022-08-09T16:22:11Z

See #47, #48

ccmaymay added the question label Aug 4, 2022

ccmaymay changed the title ~~Look up supporting multiple machines (large OPT, BLOOM)~~ Plan for extra-large model support (OPT 175B, Bloom 176B) Aug 4, 2022

ccmaymay self-assigned this Aug 9, 2022

ccmaymay closed this as completed Aug 9, 2022
