Pull requests: vllm-project/vllm
[Misc] Add VLLM_DISTRIBUTED_INIT_METHOD_OVERRIDE env var
#27162
opened Oct 19, 2025 by
WoosukKwon
[BugFix] Fix lazy imports involving outlines_core
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
v1
#27158
opened Oct 19, 2025 by
22quinn
5 tasks
Add auto max model len for available memory with --max-model-len -1
codex
documentation
Improvements or additions to documentation
v1
#27155
opened Oct 18, 2025 by
mgoin
[Chore] Separate out hashing utilities from vllm.utils
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27151
opened Oct 18, 2025 by
dongbo910220
[torch.compile] Enable silu_mul_fp8_quant fusion without custom ops enabled
#27146
opened Oct 18, 2025 by
ZJY0516
5 tasks
[Model][3/N] Improve all pooling task | Support chunked prefill with ALL pooling
frontend
v1
#27145
opened Oct 18, 2025 by
noooop
5 tasks
[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache.
ci/build
v1
#27144
opened Oct 18, 2025 by
sighingnow
[NIXL] use Host buffer to support TP_ratio > 1 for XPU
kv-connector
#27140
opened Oct 18, 2025 by
xuechendi
5 tasks
[Fix][Spec Decode] Fix llama4 draft loading with different quantization
llama
Related to Llama models
speculative-decoding
#27136
opened Oct 18, 2025 by
linzebing
3 of 5 tasks
[Bugfix] Fix incorrect kv cache metrics in grafana.json
documentation
Improvements or additions to documentation
#27133
opened Oct 17, 2025 by
fangpings
5 tasks
Early exit for MoE LoRA kernels
ci/build
deepseek
Related to DeepSeek models
gpt-oss
Related to GPT-OSS models
needs-rebase
qwen
Related to Qwen models
[BugFix] bugfix for Flash Attention MLA with full cuda graph IMA following pr-25490
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27128
opened Oct 17, 2025 by
Daisy-Ma-coder
make flash_attn ViT upgrade opt-in
ci/build
ci-failure
Issue about an unexpected test failure in CI
qwen
Related to Qwen models
rocm
Related to AMD ROCm
#27124
opened Oct 17, 2025 by
bradleyhd
[Bugfix] Fix allocation & free logic of SingleWriterShmRingBuffer
#27117
opened Oct 17, 2025 by
imkero
5 tasks
Add missing opentelemetry dependency to base docker image
ci/build
#27109
opened Oct 17, 2025 by
Aymendje
3 of 5 tasks
[CI] Fix mypy for vllm/v1/core and vllm/v1/engine
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27108
opened Oct 17, 2025 by
yewentao256