Will this work for 6GB Vram ? 3050 #17

vihangasa14 · 2024-12-20T07:29:20Z

Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modules in 32-bit, you need to set llm_int8_enable_fp32_cpu_offload=True and pass a custom device_map to from_pretrained. Check https://huggingface.co/docs/transformers/main/en/main_classes/quantization#offload-between-cpu-and-gpu for more details.

The text was updated successfully, but these errors were encountered:

vihangasa14 · 2024-12-20T07:31:49Z

JoyCaptionAlpha2Online
not enough values to unpack (expected 2, got 0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Will this work for 6GB Vram ? 3050 #17

Will this work for 6GB Vram ? 3050 #17

vihangasa14 commented Dec 20, 2024

vihangasa14 commented Dec 20, 2024

Will this work for 6GB Vram ? 3050 #17

Will this work for 6GB Vram ? 3050 #17

Comments

vihangasa14 commented Dec 20, 2024

vihangasa14 commented Dec 20, 2024