-
-
Notifications
You must be signed in to change notification settings - Fork 16.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[fix] add verify_quantization on intel platform
intel-gpu
Related to Intel GPU
#42727
opened May 15, 2026 by
Alex-ai-future
•
Draft
4 tasks
[ZenCPU] Add zencpu Platform Runtime Logging and Docs
cpu
Related to CPU backends
documentation
Improvements or additions to documentation
#42726
opened May 15, 2026 by
amd-lalithnc
Contributor
Loading…
1 of 4 tasks
[XPU] fix weight scale shape
intel-gpu
Related to Intel GPU
#42725
opened May 15, 2026 by
zufangzhu
Contributor
Loading…
[Bugfix] All pyNCCL copy-only operation to use int8 instead of fp8
bug
Something isn't working
#42724
opened May 15, 2026 by
mickaelseznec
Contributor
Loading…
4 tasks
fix: validate xxhash prefix cache dependency
v1
#42723
opened May 15, 2026 by
he-yufeng
Contributor
Loading…
fix: add --api-key support and authentication warning to gRPC server
frontend
#42721
opened May 15, 2026 by
ChristinaSaikoy
Loading…
4 tasks
[Bugfix] Fix IndexError on empty slice of FlatLogprobs
bug
Something isn't working
#42719
opened May 15, 2026 by
Dev-X25874
Loading…
Bump the minor-update group across 1 directory with 143 updates
ci/build
dependencies
Pull requests that update a dependency file
nvidia
rocm
Related to AMD ROCm
#42717
opened May 15, 2026 by
dependabot
Bot
Loading…
Fix Weight loading for Qwen3.5-MTP and Qwen3-VL using runai_streamer
qwen
Related to Qwen models
#42716
opened May 15, 2026 by
weizhoublue
Loading…
Fix : crash in DeepSeek V4 _forward_rocm due to stale ffn_norm reference after norm-gate fusion
deepseek
Related to DeepSeek models
rocm
Related to AMD ROCm
#42711
opened May 15, 2026 by
weizhoublue
Loading…
[MRV2][XPU] add Model Runner V2 log
intel-gpu
Related to Intel GPU
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#42710
opened May 15, 2026 by
zhenwei-intel
Contributor
Loading…
4 tasks
[Bugfix] Ensure embeding model compilation on CPU
bug
Something isn't working
cpu
Related to CPU backends
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#42709
opened May 15, 2026 by
bigPYJ1151
Member
Loading…
4 tasks
[CPU] Add fused GDN support for AMX CPU platform
cpu
Related to CPU backends
ready
ONLY add when PR is ready to merge/full CI is needed
#42707
opened May 15, 2026 by
bigPYJ1151
Member
Loading…
4 tasks
[Bugfix] Unwrap VLM wrappers for EPLB on Model Runner V2
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#42706
opened May 15, 2026 by
JasonKeyiL
Contributor
Loading…
[Bugfix] dflash-qwen3.5-acceptance-rate lower than baseline
bug
Something isn't working
qwen
Related to Qwen models
v1
#42704
opened May 15, 2026 by
xiaohajiayou
Contributor
•
Draft
4 tasks
[Examples] Add NixlConnector support to disagg_proxy_demo
documentation
Improvements or additions to documentation
kv-connector
#42703
opened May 15, 2026 by
mihirn
Loading…
[WIP][Verify] VLLM_BATCH_INVARIANT=1 fixes test_async_scheduling rank flip
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#42702
opened May 15, 2026 by
haosdent
Contributor
Loading…
Revert "[Model Runner v2] Oracle for model runner v2 - qwen3 dense model by default [1/N]" (#39337)
nvidia
qwen
Related to Qwen models
v1
#42698
opened May 15, 2026 by
vllm-agent
•
Draft
Revert "[RFC] Replace shared-memory routed experts with ModelRunnerOutput transfer and HTTP support" (#39568)
frontend
v1
#42697
opened May 15, 2026 by
vllm-agent
•
Draft
[KVConnector][Mooncake] Wire reset_cache cascade end-to-end
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#42694
opened May 15, 2026 by
aoshen02
Collaborator
Loading…
[Bugfix] DFlash FP8 KV-Cache
bug
Something isn't working
dflash
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
v1
#42692
opened May 15, 2026 by
benchislett
Collaborator
Loading…
[Bugfix] Fix reasoning dropped on streaming boundary deltas
bug
Something isn't working
#42691
opened May 15, 2026 by
sfeng33
Collaborator
Loading…
[KV Connector] Support disk offloading in MooncakeStoreConnector
documentation
Improvements or additions to documentation
kv-connector
v1
#42689
opened May 14, 2026 by
zhewenl
Collaborator
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.