[core] support device type device_maps to work with offloading. by sayakpaul · Pull Request #12811 · huggingface/diffusers

sayakpaul · 2025-12-09T05:45:23Z

What does this PR do?

This PR allows users to pass a device_map="cpu" while initializing a pipeline and then enable model CPU offloading.

This is beneficial when users want to initialize the models on CPU (think of low VRAM environments) and then call enable_model_cpu_offload(). Quantized models initialize directly on a supported accelerator. This can lead to OOMs.

Below provides a diff that this PR introduces:

import torch
from diffusers import Flux2Pipeline, AutoModel
from transformers import Mistral3ForConditionalGeneration

repo_id = "diffusers/FLUX.2-dev-bnb-4bit" # quantized text-encoder and DiT. VAE still in bf16
device = "cuda:0"
torch_dtype = torch.bfloat16

- text_encoder = Mistral3ForConditionalGeneration.from_pretrained(
-     repo_id, subfolder="text_encoder", torch_dtype=torch.bfloat16, device_map="cpu"
- )
- dit = AutoModel.from_pretrained(
-     repo_id, subfolder="transformer", torch_dtype=torch.bfloat16, device_map="cpu"
- )
- pipe = Flux2Pipeline.from_pretrained(
-     repo_id, text_encoder=text_encoder, transformer=dit, torch_dtype=torch_dtype
- )
- pipe.enable_model_cpu_offload()
+ pipe = Flux2Pipeline.from_pretrained(repo_id, torch_dtype=torch.bfloat16, device_map="cpu")
+ pipe.enable_model_cpu_offload()

prompt = "Realistic macro photograph of a hermit crab using a soda can as its shell, partially emerging from the can, captured with sharp detail and natural colors, on a sunlit beach with soft shadows and a shallow depth of field, with blurred ocean waves in the background. The can has the text `BFL + Diffusers` on it and it has a color gradient that start with #FF5733 at the top and transitions to #33FF57 at the bottom."
image = pipe(
    prompt=prompt,
    generator=torch.Generator(device=device).manual_seed(42),
    num_inference_steps=50,
    guidance_scale=4,
).images[0]

image.save("flux2_output.png")

cc: @asomoza @apolinario

HuggingFaceDocBuilderDev · 2025-12-09T05:53:35Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2026-01-19T04:56:41Z

@yiyixuxu @DN6 a gentle ping.

DN6

LGTM 👍🏽

sayakpaul · 2026-02-16T06:51:44Z

tests/pipelines/test_pipelines_common.py

                    f"Component '{name}' has dtype {component.dtype} but expected {expected_dtype}",
                )

-    @require_torch_accelerator


So that this runs on the CPUs too as it's supported.

sayakpaul · 2026-02-16T06:52:04Z

tests/models/testing_utils/quantization.py

+    @pytest.mark.parametrize(
+        "config_name",
+        list(BitsAndBytesConfigMixin.BNB_CONFIGS.keys()),
+        ids=list(BitsAndBytesConfigMixin.BNB_CONFIGS.keys()),
+    )


This test is specified to bitsandbytes for now.

support device type device_maps to work with offloading.

83ec2fb

sayakpaul requested review from DN6 and yiyixuxu December 9, 2025 06:43

sayakpaul added 5 commits December 11, 2025 14:47

Merge branch 'main' into device-map-direct

6f5eb0a

Merge branch 'main' into device-map-direct

c61e455

Merge branch 'main' into device-map-direct

3b334de

Merge branch 'main' into device-map-direct

b28d6d4

Merge branch 'main' into device-map-direct

fe4c0be

sayakpaul added 3 commits January 23, 2026 12:34

Merge branch 'main' into device-map-direct

0a58f56

Merge branch 'main' into device-map-direct

59ac2f3

Merge branch 'main' into device-map-direct

661d2b1

DN6 approved these changes Feb 16, 2026

View reviewed changes

sayakpaul added the roadmap Add to current release roadmap label Feb 16, 2026

github-project-automation bot added this to Diffusers Roadmap 0.37 Feb 16, 2026

github-project-automation bot moved this to In Progress in Diffusers Roadmap 0.37 Feb 16, 2026

sayakpaul added 2 commits February 16, 2026 11:58

add tests.

2baa95d

fix tests

2605129

sayakpaul commented Feb 16, 2026

View reviewed changes

sayakpaul added 7 commits February 16, 2026 12:38

skip tests where it's not supported.

1987022

empty

ba272ed

up

3fb47a4

Merge branch 'main' into device-map-direct

abb68a5

up

7049808

Merge branch 'main' into device-map-direct

9f78817

fix allegro.

052b70e

sayakpaul merged commit 35086ac into main Feb 16, 2026
32 of 33 checks passed

github-project-automation bot moved this from In Progress to Done in Diffusers Roadmap 0.37 Feb 16, 2026

sayakpaul deleted the device-map-direct branch February 16, 2026 11:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] support device type device_maps to work with offloading.#12811

[core] support device type device_maps to work with offloading.#12811
sayakpaul merged 18 commits intomainfrom
device-map-direct

sayakpaul commented Dec 9, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 9, 2025

Uh oh!

sayakpaul commented Jan 19, 2026

Uh oh!

DN6 left a comment

Uh oh!

sayakpaul Feb 16, 2026

Uh oh!

sayakpaul Feb 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sayakpaul commented Dec 9, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Dec 9, 2025

Uh oh!

sayakpaul commented Jan 19, 2026

Uh oh!

DN6 left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants