Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[fix] add verify_quantization on intel platform intel-gpu Related to Intel GPU
#42727 opened May 15, 2026 by Alex-ai-future Draft
4 tasks
[ZenCPU] Add zencpu Platform Runtime Logging and Docs cpu Related to CPU backends documentation Improvements or additions to documentation
#42726 opened May 15, 2026 by amd-lalithnc Contributor Loading…
1 of 4 tasks
[XPU] fix weight scale shape intel-gpu Related to Intel GPU
#42725 opened May 15, 2026 by zufangzhu Contributor Loading…
[Bugfix] All pyNCCL copy-only operation to use int8 instead of fp8 bug Something isn't working
#42724 opened May 15, 2026 by mickaelseznec Contributor Loading…
4 tasks
fix: validate xxhash prefix cache dependency v1
#42723 opened May 15, 2026 by he-yufeng Contributor Loading…
[Bugfix][Scheduler] Account for empty spec decode output bug Something isn't working v1
#42722 opened May 15, 2026 by tuukkjs Contributor Draft
[Bugfix] Fix IndexError on empty slice of FlatLogprobs bug Something isn't working
#42719 opened May 15, 2026 by Dev-X25874 Loading…
Bump the minor-update group across 1 directory with 143 updates ci/build dependencies Pull requests that update a dependency file nvidia rocm Related to AMD ROCm
#42717 opened May 15, 2026 by dependabot Bot Loading…
Fix Weight loading for Qwen3.5-MTP and Qwen3-VL using runai_streamer qwen Related to Qwen models
#42716 opened May 15, 2026 by weizhoublue Loading…
Fix : crash in DeepSeek V4 _forward_rocm due to stale ffn_norm reference after norm-gate fusion deepseek Related to DeepSeek models rocm Related to AMD ROCm
#42711 opened May 15, 2026 by weizhoublue Loading…
[MRV2][XPU] add Model Runner V2 log intel-gpu Related to Intel GPU ready ONLY add when PR is ready to merge/full CI is needed v1
#42710 opened May 15, 2026 by zhenwei-intel Contributor Loading…
4 tasks
[Bugfix] Ensure embeding model compilation on CPU bug Something isn't working cpu Related to CPU backends ready ONLY add when PR is ready to merge/full CI is needed v1
#42709 opened May 15, 2026 by bigPYJ1151 Member Loading…
4 tasks
[CPU] Add fused GDN support for AMX CPU platform cpu Related to CPU backends ready ONLY add when PR is ready to merge/full CI is needed
#42707 opened May 15, 2026 by bigPYJ1151 Member Loading…
4 tasks
[Bugfix] Unwrap VLM wrappers for EPLB on Model Runner V2 bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1
#42706 opened May 15, 2026 by JasonKeyiL Contributor Loading…
[Bugfix] dflash-qwen3.5-acceptance-rate lower than baseline bug Something isn't working qwen Related to Qwen models v1
#42704 opened May 15, 2026 by xiaohajiayou Contributor Draft
4 tasks
[Examples] Add NixlConnector support to disagg_proxy_demo documentation Improvements or additions to documentation kv-connector
#42703 opened May 15, 2026 by mihirn Loading…
[WIP][Verify] VLLM_BATCH_INVARIANT=1 fixes test_async_scheduling rank flip ready ONLY add when PR is ready to merge/full CI is needed v1
#42702 opened May 15, 2026 by haosdent Contributor Loading…
[Bugfix] Replace deprecated Qwen2VLImageProcessorFast with Qwen2VLImageProcessor bug Something isn't working qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#42700 opened May 15, 2026 by abinggo Contributor Loading…
[KVConnector][Mooncake] Wire reset_cache cascade end-to-end kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#42694 opened May 15, 2026 by aoshen02 Collaborator Loading…
[Bugfix] DFlash FP8 KV-Cache bug Something isn't working dflash qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding v1
#42692 opened May 15, 2026 by benchislett Collaborator Loading…
[Bugfix] Fix reasoning dropped on streaming boundary deltas bug Something isn't working
#42691 opened May 15, 2026 by sfeng33 Collaborator Loading…
[KV Connector] Support disk offloading in MooncakeStoreConnector documentation Improvements or additions to documentation kv-connector v1
#42689 opened May 14, 2026 by zhewenl Collaborator Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.