Incompatibility with HF Model Qwen2-1.5B - Tensor Indexing Error (1-D vs 2-D) #1135
Labels
huggingface
For supporting HF models
mixology
Issues that the mixology team has surfaced
program-coverage
Requests for model and program coverage
triage review
🚀 Model / language coverage
I encountered an issue while attempting to use
thunder.jit
with models outside of the lit-gpt universe, specifically the Hugging Face model Qwen2-1.5B-Instruct. The following error is thrown:The shape of the tensor in question is actually (1, 1024), which could potentially be handled with squeeze().
Pitch
Supporting this case could enable compatibility with Qwen2-1.5B-Instruct and possibly with other models from Qwen family.
Alternatives / Potential work-arounds
Adding the following code to the function
_advanced_indexing
inthunder/clang/__init__.py
resolves the issue temporarily:However, the same issue re-emerges in the prims.take_meta function:
Minimal Repro
results in
The text was updated successfully, but these errors were encountered: