Pull requests: vllm-project/vllm
[Hardware][PPC64LE] Enable V1 for ppc64le
v1
#20554
opened Jul 7, 2025 by
Akashcodes732
4 tasks
Replace --expand-tools-even-if-tool-choice-none with --exclude-tools-when-tool-choice-none for v0.10.0
documentation
Improvements or additions to documentation
frontend
tool-calling
#20544
opened Jul 7, 2025 by
okdshin
[Model] Support VLMs with transformers backend
ci/build
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
#20543
opened Jul 7, 2025 by
zucchini-nlp
[Test] Remove docker build and docker clean from test.
ci/build
tpu
Related to Google TPUs
#20542
opened Jul 7, 2025 by
QiliangCui
3 tasks done
[CI/Build] Ensure compatibility with Transformers v4.53
ci/build
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#20541
opened Jul 7, 2025 by
Isotr0py
1 of 4 tasks
[Misc] Set the minimum openai version
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#20539
opened Jul 7, 2025 by
jeejeelee
4 tasks
DO NOT MERGE - debug
needs-rebase
performance
Performance-related issues
#20535
opened Jul 7, 2025 by
robertgshaw2-redhat
Draft
Refactor: Remove numpy dependency from LoggingStatLogger
v1
#20529
opened Jul 6, 2025 by
skyloevil
[Benchmark] Parameterization of streaming loading of multimodal datasets
performance
Performance-related issues
#20528
opened Jul 6, 2025 by
Potabk
3 tasks done
[Third Party] Add a hook to the GPU Model Runner in Worker KV Connector start_load_kv()
v1
#20524
opened Jul 6, 2025 by
sammshen
[Benchmarks] Add memory tracking to serving benchmark
ci/build
performance
Performance-related issues
#20519
opened Jul 6, 2025 by
sfeng33
[Bugfix] fix the block.prev_block reference - release problem
#20512
opened Jul 5, 2025 by
CLFutureX
Add reproducible prefix-cache block hashing using SHA-256 + CBOR
ci/build
v1
#20511
opened Jul 5, 2025 by
vMaroon
adds optional reasoning content field to ConversationMessage
frontend
#20505
opened Jul 4, 2025 by
arpitg1991
4 tasks
feat: Add streaming support for Mistral v11 tool format
frontend
tool-calling
#20503
opened Jul 4, 2025 by
sjuxax
[Benchmark] Add expert parallelism for tuning with benchmark_moe.py
performance
Performance-related issues
#20501
opened Jul 4, 2025 by
Chen-zexi
3 of 4 tasks
[V1] [Doc] Automated choice of attention block size for hybrid models in V1
documentation
Improvements or additions to documentation
v1
#20499
opened Jul 4, 2025 by
tdoublep
Environment variable to use uniform random topk ids for performance experiments
#20498
opened Jul 4, 2025 by
tlrmchlsmth
Draft
[Bugfix] Prevent IndexError for cached requests when pipeline parallelism is disabled
v1
#20486
opened Jul 4, 2025 by
panpan0000
1 of 4 tasks
[Model] Add Ling implementation
new-model
Requests to new models
#20482
opened Jul 4, 2025 by
ant-yy
3 tasks done