Conversation

hmellor (Member) commented Aug 27, 2025

The docstrings for config class attributes are only needed when we're generating --help text or documentation.

This saves 235 ms on my machine by completely eliminating the cost of extracting the docstrings at start-up.
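The gating idea is simple: decide once, up front, whether help text will be needed, and only run the docstring extraction in that case. As a minimal sketch (the detection logic below is an illustrative assumption, not the exact code in this PR):

import sys

# Illustrative sketch: detect help mode once at import time so that
# attribute-docstring extraction can be skipped on a normal start-up.
NEEDS_HELP = any(arg in ("--help", "-h") for arg in sys.argv)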


$ vllm serve --help
INFO 08-27 10:36:28 [__init__.py:241] Automatically detected platform cuda.
INFO 08-27 10:36:30 [arg_utils.py:165] Computed kwargs for FrontendArgs in 0.0038s (cumulative: 0.0038s)
INFO 08-27 10:36:30 [arg_utils.py:165] Computed kwargs for ModelConfig in 0.0383s (cumulative: 0.0421s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for LoadConfig in 0.0263s (cumulative: 0.0684s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for DecodingConfig in 0.0302s (cumulative: 0.0985s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for ParallelConfig in 0.0061s (cumulative: 0.1046s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for CacheConfig in 0.0032s (cumulative: 0.1078s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for MultiModalConfig in 0.0284s (cumulative: 0.1362s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for LoRAConfig in 0.0283s (cumulative: 0.1645s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for ObservabilityConfig in 0.0302s (cumulative: 0.1947s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for SchedulerConfig in 0.0034s (cumulative: 0.1980s)
INFO 08-27 10:36:31 [arg_utils.py:165] Computed kwargs for VllmConfig in 0.0370s (cumulative: 0.2350s)
...
$ vllm serve 
INFO 08-27 10:36:57 [__init__.py:241] Automatically detected platform cuda.
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for FrontendArgs in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for ModelConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for LoadConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for DecodingConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for ParallelConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for CacheConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for MultiModalConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for LoRAConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for ObservabilityConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for SchedulerConfig in 0.0000s (cumulative: 0.0000s)
INFO 08-27 10:37:00 [arg_utils.py:165] Computed kwargs for VllmConfig in 0.0000s (cumulative: 0.0000s)
...

gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request aims to optimize startup time by avoiding the extraction of docstrings when not generating help text. While the performance improvement is valuable, the current implementation introduces a critical caching bug by making a cached function's behavior dependent on a global variable (sys.argv). My review provides a detailed explanation of this bug and a robust solution to fix it, ensuring both performance and correctness.
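To make the concern concrete, here is a minimal, self-contained illustration (not vLLM code) of why an lru_cache-wrapped function must not read mutable global state: the first call bakes the global's value into the cache, and later calls return the stale result. Passing the flag as an argument instead makes it part of the cache key:

import functools

NEEDS_HELP = False  # mutable global the cached function depends on

@functools.lru_cache
def broken() -> str:
    return "docs" if NEEDS_HELP else "no docs"

broken()            # caches "no docs"
NEEDS_HELP = True
print(broken())     # still prints "no docs" -- stale cached result

@functools.lru_cache
def fixed(needs_help: bool) -> str:
    return "docs" if needs_help else "no docs"

print(fixed(True))  # prints "docs"; the flag is part of the cache key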

Signed-off-by: Harry Mellor <[email protected]>
hmellor mentioned this pull request Aug 27, 2025
mgoin (Member) commented Aug 27, 2025

Thank you!

mgoin enabled auto-merge (squash) August 27, 2025 09:06
github-actions bot added the ready label Aug 27, 2025
ZJY0516 (Contributor) commented Aug 27, 2025

Hi @hmellor, how did you measure the time like this?

INFO 08-27 10:36:30 [arg_utils.py:165] Computed kwargs for FrontendArgs in 0.0038s (cumulative: 0.0038s)

hmellor (Member, Author) commented Aug 27, 2025

I added some temporary logs that I didn't commit; the code was:

import functools
import time

cumulative_time = 0.0

@functools.lru_cache(maxsize=30)
def _compute_kwargs(cls: ConfigType, needs_help: bool) -> dict[str, Any]:
    global cumulative_time
    start = time.perf_counter()
    # Save time by only getting attr docs if we're generating help text
    cls_docs = get_attr_docs(cls) if needs_help else {}
    duration = time.perf_counter() - start
    cumulative_time += duration
    logger.info(f"Computed kwargs for {cls.__name__} in {duration:.4f}s "
                f"(cumulative: {cumulative_time:.4f}s)")
    ...
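(needs_help is deliberately a parameter rather than a module-level read inside the function, so it forms part of the lru_cache key; reading a global from inside the cached function is exactly the staleness pitfall flagged in the review above.)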

hmellor disabled auto-merge August 27, 2025 09:39
hmellor (Member, Author) commented Aug 27, 2025

The RTD docs build currently isn't triggering get_attr_docs. Just fixing that, then I'll re-enable auto-merge.

hmellor (Member, Author) commented Aug 27, 2025

Docs look good again

hmellor enabled auto-merge (squash) August 27, 2025 12:29
ProExpertProg (Collaborator) commented

Perhaps a follow-up, but could we also extract this at build time? As it stands we still pay this cost during vllm --help, when we could avoid it altogether.

hmellor (Member, Author) commented Aug 27, 2025

Yeah, this could be a build-time constant. I'm not sure how it would work though (I've never tried it before, but I'm sure it can be done).
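One possible shape for that follow-up, purely as a sketch (the import paths and file name below are assumptions, nothing agreed in this thread): run a small script at package build time that serializes the extracted docs, then have the --help path load the JSON instead of re-parsing source at runtime.

# build_attr_docs.py -- hypothetical build-time step
import json

# Assumed import locations; get_attr_docs is the helper discussed above.
from vllm.config import ModelConfig, get_attr_docs

CONFIG_CLASSES = [ModelConfig]  # in practice, every config class

docs = {cls.__qualname__: get_attr_docs(cls) for cls in CONFIG_CLASSES}
with open("attr_docs.json", "w") as f:
    json.dump(docs, f)

At runtime, vllm --help would json.load that file, and a normal vllm serve start-up would never touch it.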

hmellor merged commit 513c1fe into vllm-project:main Aug 27, 2025
40 checks passed
hmellor deleted the start-up-speed branch August 27, 2025 13:55
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025
xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Sep 3, 2025
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025