vllm-project / vllm-gaudi Public

Pull requests: vllm-project/vllm-gaudi

Labels 12 Milestones 0

62 Open 549 Closed

Pull requests list

[DRAFT]enable spec decode for Unified Attention

#619 opened Nov 21, 2025 by xuechendi • Draft

Fix transformers version mismatch causing editable install failure

#618 opened Nov 21, 2025 by Saiteja-Garlapati

lora, fix for PR28545

#617 opened Nov 21, 2025 by iboiko-habana

disabled interleaved sliding window llama4

#616 opened Nov 21, 2025 by Luca-Calabria

Updates to validated models list documentation

Improvements or additions to documentation

skip-gaudi-tests

#614 opened Nov 21, 2025 by PatrykWo

Allow building vllm-plugin docker for ubuntu with upstream torch

#613 opened Nov 21, 2025 by mmuszynskihabana

Spec decode: support of more than one num speculative tokens

#609 opened Nov 21, 2025 by jerrychenhf

Add VLLM_DEFRAG env to enable defrag without UA

#608 opened Nov 21, 2025 by xwu-intel

Add the missing step to the Quick Start guide documentation

Improvements or additions to documentation

skip-gaudi-tests

#599 opened Nov 20, 2025 by mhelf-intel

fix assert failure on hpu_paged_attn

#598 opened Nov 20, 2025 by Luca-Calabria

Sleep mode support

#584 opened Nov 18, 2025 by Kacper-Pietkun • Draft

[DOCKER update] update docker to 1.23, transformers to 4.56.0

#580 opened Nov 17, 2025 by xuechendi • Draft

Add support of FP32 softmax to unified attention

#577 opened Nov 17, 2025 by afierka-intel • Draft

Cherry-pick release docker cmdline fixes, WA and long context support

#576 opened Nov 17, 2025 by nngokhale

Implementing softmax_fa2 in partial_attn shared and causal

#566 opened Nov 13, 2025 by ksmusz

Docs: Missing content from Habana docs documentation

Improvements or additions to documentation

skip-gaudi-tests

#562 opened Nov 13, 2025 by mhelf-intel

Add a plugin for variable support in Markdown documentation

Improvements or additions to documentation

skip-gaudi-tests

#554 opened Nov 12, 2025 by mhelf-intel

fix loading fp8 static quantized model for compressored_tensors format.

#552 opened Nov 11, 2025 by lkk12014402

Prepare Unified Attention biases on HPU + add NumPy memory pooling

#550 opened Nov 7, 2025 by kzawora-intel

Michalkuligowski patch 7

#542 opened Nov 6, 2025 by michalkuligowski • Draft

[SW-228042] Add support for dynamic vLLM kv-cache quantization

#538 opened Nov 6, 2025 by dudilester

[Attention Metadata Overhaul 2/N] Move metadata processing outside HPUModelAdapter, prepare biases on CPU

#530 opened Nov 5, 2025 by kzawora-intel

[Attention Metadata Overhaul 1/N] Extract metadata update to HPUAttentionMetadataProcessor

#526 opened Nov 5, 2025 by kzawora-intel

enable lmcache

#521 opened Nov 5, 2025 by hsubramony

reduce graph recompilations in input embeddings for Gemma3

#519 opened Nov 4, 2025 by skaulintel • Draft
