HF LLaVa support #1174

Open · 2 tasks
riccardofelluga opened this issue Sep 19, 2024 · 6 comments
Assignee: riccardofelluga
Labels: hf-transformers, program-coverage (Requests for model and program coverage)

@riccardofelluga (Collaborator) commented Sep 19, 2024

🚀 Model / language coverage

The idea is to support the LLaVa model from HF. This issue is mainly for tracking its status.

Blocking issues:

Minimal Repro

First, install the transformers library with pip install transformers, then run this script:

import torch
import thunder
from transformers import LlavaForConditionalGeneration

model = LlavaForConditionalGeneration.from_pretrained(
    "llava-hf/llava-1.5-7b-hf",
    torch_dtype=torch.bfloat16,
)
model.to("cuda")

# Dummy inputs: a 22-token prompt and one 336x336 image (the resolution
# LLaVA 1.5 expects).
input_ids = torch.randint(1, 32000, (1, 22), device="cuda")
attention_mask = torch.ones((1, 22), dtype=torch.int64, device="cuda")
pixel_values = torch.randn((1, 3, 336, 336), device="cuda")
labels = torch.randint(-100, 32000, (1, 22), device="cuda")

# Set the BOS token and a fake <image> placeholder token (id 32000).
input_ids[0, 0] = 1
input_ids[0, 5] = 32000

model = thunder.jit(model, executors=thunder.get_default_executors())

out = model(input_ids=input_ids, attention_mask=attention_mask, pixel_values=pixel_values, labels=labels)
riccardofelluga added the program-coverage and hf-transformers labels on Sep 19, 2024
riccardofelluga self-assigned this on Sep 19, 2024
@t-vi (Collaborator) commented Sep 19, 2024

left_padding = not torch.sum(input_ids[:, -1] == torch.tensor(self.pad_token_id))

Note that this looks pretty bad from a "data-dependent control flow" perspective and has, indeed, been changed in transformers four months ago.
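
The problem for tracers is that not torch.sum(...) converts a tensor into a Python bool, which requires the tensor's runtime value. A minimal sketch of the failure mode (my illustration, not from the issue; assumes a recent PyTorch with torch.compile):

import torch

def f(input_ids, pad_token_id):
    # `not tensor` calls Tensor.__bool__, so the branch below depends on
    # runtime data -- a tracer cannot resolve it symbolically.
    left_padding = not torch.sum(input_ids[:, -1] == pad_token_id)
    if left_padding:
        return input_ids.flip(-1)
    return input_ids

ids = torch.randint(1, 100, (1, 8))
f(ids, 0)  # eager mode is fine

# With fullgraph=True, torch.compile should raise on the data-dependent
# bool conversion instead of silently splitting the graph.
compiled = torch.compile(f, fullgraph=True)
compiled(ids, 0)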

@riccardofelluga (Collaborator, Author) commented:

> @t-vi: Note that this looks pretty bad from a "data-dependent control flow" perspective and has, indeed, been changed in transformers four months ago.

Indeed, it does look kinda bad :(
What do you mean by "it has been changed"? The line still seems to be there in the file:

https://github.com/huggingface/transformers/blob/4d8908df272c0a9db2e5fbcc8aaed73cdf75442a/src/transformers/models/llava/modeling_llava.py#L284

@t-vi (Collaborator) commented Sep 19, 2024

Right, I'm stupid. They changed it for modeling_llava_next.py, not modeling_llava.py. :(

@riccardofelluga (Collaborator, Author) commented:
Updated the description with the relevant blocking issues.

@csarofeen (Collaborator) commented:
@kshitij12345 does the splitter correctly route these ops to the inductor path?

@kshitij12345 (Collaborator) commented Sep 30, 2024

thunderFX side-steps the data-dependent ops and works on the above snippet.

import torch
import thunder
from transformers import LlavaForConditionalGeneration

model = LlavaForConditionalGeneration.from_pretrained(
    "llava-hf/llava-1.5-7b-hf",
    torch_dtype=torch.bfloat16,
)
model.to("cuda")

input_ids = torch.randint(1, 100, (1, 22), device="cuda")
attention_mask = torch.rand((1, 22), device="cuda") > 0.5
pixel_values = torch.randn((1, 3, 336, 336), device="cuda", dtype=torch.bfloat16, requires_grad=True)
labels = torch.randint(0, 100, (1, 22), device="cuda")

# Set the BOS token and a fake <image> placeholder token (id 32000).
input_ids[0, 0] = 1
input_ids[0, 5] = 32000

# These paths were also tried:
# model = thunder.jit(model, executors=thunder.get_default_executors())
# model = torch.compile(model)

# thunderFX: use Thunder as a torch.compile backend.
import thunder.dynamo

backend = thunder.dynamo.ThunderCompiler(executors=thunder.get_default_executors())
model = torch.compile(model, backend=backend)

out = model(input_ids=input_ids, attention_mask=attention_mask, pixel_values=pixel_values, labels=labels)
print(out.loss)  # Loss is detached from the graph.

However, I see that out.loss is detached from the computation graph, so we can't call backward on it. This is because of a bug in the splitter: it doesn't correctly handle regions under torch.no_grad. I will file a separate issue for this and look into fixing it. (EDIT: issue filed at #1219)
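
For reference, a quick way to observe the symptom (a hypothetical check, not part of the original report):

# A loss attached to the autograd graph would have requires_grad=True
# and a non-None grad_fn; here both indicate detachment.
print(out.loss.requires_grad)  # False
print(out.loss.grad_fn)        # None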

Development: no branches or pull requests · 4 participants