Pull requests: vllm-project/vllm

Pull requests list

Notice for deprecation of AutoAWQ (labels: documentation)
#26820 opened Oct 14, 2025 by HDCharles
4 tasks done
[CI Failure] Fix tests with missing TinyLlama-1.1B-Chat-v1.0-FP8-e2e (labels: llama, ready)
#26816 opened Oct 14, 2025 by mgoin
5 tasks
[Bugfix] Fix qwen3-omni audio truncation issue (labels: qwen)
#26815 opened Oct 14, 2025 by Isotr0py
1 of 5 tasks
[Core] Use envs.__getattr__ for all Unify to environment variable access (labels: multi-modality, v1)
#26810 opened Oct 14, 2025 by Jialin
3 of 5 tasks
[Docs] update README.md to display logo correctly and fix links (labels: documentation)
#26809 opened Oct 14, 2025 by ddalgrande
3 of 5 tasks
[Feature] GatedDeltaNet Automatic Prefix Caching (labels: qwen, v1)
#26807 opened Oct 14, 2025 by simondanielsson Draft
1 of 11 tasks
Fix seed reproducibility issue by adding output.copy_(out)
#26805 opened Oct 14, 2025 by XuanofXXX
3 of 5 tasks
[Metrics] Refactor LoRA state tracking (labels: ready, v1)
#26801 opened Oct 14, 2025 by markmc
[Model] add kosmos2_5 for vllm (labels: new-model)
#26800 opened Oct 14, 2025 by yugeeklab
3 of 5 tasks
[V1][performance] add multi step (labels: v1)
#26796 opened Oct 14, 2025 by chengda-wu
5 tasks
[Doc] ruff format remaining Python examples (labels: documentation)
#26795 opened Oct 14, 2025 by DarkLight1337
5 tasks
make fp4 scaled_mm works for 5090 gpu (labels: ci/build)
#26793 opened Oct 14, 2025 by XiaobingSuper
3 of 5 tasks
llama4_vision_rope: add HIP override to accept (q, k) and avoid (positions, q, k) mismatch (labels: llama)
#26790 opened Oct 14, 2025 by hl475
5 tasks
[bugfix] remove unused parameters to reduce unnecessary vram usage (labels: ready)
#26789 opened Oct 14, 2025 by ReinForce-II
3 of 5 tasks
[Feature] default --extra-body param to disable thinking in vllm bench serve (labels: frontend, performance, ready)
#26784 opened Oct 14, 2025 by lengrongfu
5 tasks
[Fix] Avoid UserWarning when creating tensors from base64 embeddings (labels: documentation)
#26782 opened Oct 14, 2025 by mmangkad
5 tasks
[CI/Build][Bugfix] fix qutlass cmake error when set QUTLASS_SRC_DIR (labels: bug, ci/build, ready)
#26773 opened Oct 14, 2025 by izhuhaoran
5 tasks