-
Notifications
You must be signed in to change notification settings - Fork 678
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[OPs] ep_moe_expert_dispatch.cu dispatch num_experts_per_rank 5
#5890
opened Jan 5, 2026 by
yuanlehome
Loading…
5 tasks
[Cherry-Pick][OPs] ep_moe_expert_dispatch.cu dispatch num_experts_per_rank 5
#5889
opened Jan 5, 2026 by
yuanlehome
Loading…
5 tasks
[Speculative Decoding]Support multi-step mtp with cudagraph
#5886
opened Jan 5, 2026 by
freeliuzc
Loading…
5 tasks
[Cherry-Pick] [BugFix] fix mtp cache attaching for pd disaggregation (#5884)
#5885
opened Jan 5, 2026 by
liyonghua0910
Loading…
5 tasks
[BugFix] fix mtp cache attaching for pd disaggregation
#5884
opened Jan 5, 2026 by
liyonghua0910
Loading…
5 tasks
[Feature] add golang router
contributor
External developers
#5882
opened Jan 5, 2026 by
mouxinqq
Loading…
5 tasks
[BugFix][Cherry-Pick] Cp fix eb5 prefix cache(#5879)
#5881
opened Jan 5, 2026 by
kevincheng2
Loading…
5 tasks
[Optimization] Accelerate Qwen3 QK RMSNorm via Fused Triton Kernel
#5880
opened Jan 5, 2026 by
Sunny-bot1
Loading…
5 tasks done
[XPU] move xpu_attn_backend.py to FastDeploy/fastdeploy/model_executor/layers/backends/xpu
#5878
opened Jan 5, 2026 by
zccjjj
Loading…
5 tasks
[Metax] optimize flash attention backend
contributor
External developers
#5876
opened Jan 5, 2026 by
neilzhuu
Loading…
5 tasks
[KVCache] launch cache transfer processes only if hierarchical cache or kv cache storage is enabled
#5871
opened Jan 5, 2026 by
liyonghua0910
Loading…
5 tasks
[XPU]xpu support ep4tp1 in pd disaggregation
#5860
opened Jan 4, 2026 by
ddchenhao66
Loading…
5 tasks
[Cherry-Pick] [KVCache] launch cache transfer processes only if hierarchical cache or kv cache storage is enabled (#5871)
#5859
opened Jan 4, 2026 by
liyonghua0910
Loading…
5 tasks
[Intel HPU] enable MoE EP for hpu
contributor
External developers
#5855
opened Jan 4, 2026 by
yanfeich
Loading…
2 tasks
[Optim] When the token is small, use a gemm with a smaller N
#5853
opened Jan 4, 2026 by
yangjianfengo1
Loading…
5 tasks
[Cherry-Pick][Feature] support rl_tp_degree(#5850)
#5851
opened Dec 31, 2025 by
lizhenyun01
Loading…
5 tasks
[Graph Optimization] Wrap
m_grouped_gemm_fp8_fp8_bf16_nt_contiguous as custom pyop
#5847
opened Dec 31, 2025 by
DrRyanHuang
Loading…
5 tasks done
[Feature] get_output_kv_signal blocking read mode & send_first_token
#5836
opened Dec 30, 2025 by
ST-XX
Loading…
5 tasks done
[Cherry-Pick][Optimization]Decode attention support(#5767)
#5833
opened Dec 30, 2025 by
lizhenyun01
Loading…
5 tasks
[BugFix] Fix redundant prompt_logprobs in the second chunk of streaming response when return_token_ids is enabled for v1/completions and fix trace file name
#5829
opened Dec 30, 2025 by
qwes5s5
Loading…
5 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.