Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 7
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 87

Issues list

[Bug]: vllm 0.8.4 start with using ray, and ray's dashboard fails to start (label: bug)
#16779 opened Apr 17, 2025 by ying2025
1 task done
[Bug]: Could't deploy c4ai-command-a-03-2025 with VLLM docker (label: bug)
#16777 opened Apr 17, 2025 by mru4913
1 task done
[Bug]: Invalid Mistral ChatCompletionRequest Body Exception (label: bug)
#16774 opened Apr 17, 2025 by JasmondL
1 task done
[Bug]: vllm stopped at vLLM is using nccl==2.21.5 (label: bug)
#16772 opened Apr 17, 2025 by WanianXO
1 task done
[Bug]: vllm-v0.7.3 V0 engine TP=16 serve DeepSeek-R1 Crash while inference (label: bug)
#16766 opened Apr 17, 2025 by handsome-chips
1 task done
[Bug]: qwen2.5-vl inference truncated (label: bug)
#16763 opened Apr 17, 2025 by vivian-chen010
1 task done
[Bug]: InternVL3-78B OOM on 4 A100 40G in 0.8.4 (label: bug)
#16749 opened Apr 17, 2025 by hanggun
1 task done
[Feature]: AMD Ryzen AI NPU support (label: feature request)
#16742 opened Apr 16, 2025 by InspiringCode
1 task done
[Bug]: Vllm serve's results is not equal to offline inference. (label: bug)
#16718 opened Apr 16, 2025 by tzjtatata
1 task done
[Bug]: vllm 0.8.3 v1 engine CUDA Graph Capturing time is too long (label: bug)
#16716 opened Apr 16, 2025 by sjtu-zwh
1 task done
[Usage]: "How can I register Logits Processors in the args?" (label: usage)
#16709 opened Apr 16, 2025 by hjlee1995
1 task done