[tests] Improve speed and reliability of test_transcription_api_correctness #23854

russellb · 2025-08-28T18:56:36Z

Improve the performance of this test by creating the tokenizer once
instead of hundreds of times. Previously, each creation happened while
holding a semaphore, which serialized the work.

The previous code would also frequently get rate limited by
HuggingFace from requesting https://huggingface.co/openai/whisper-large-v3/resolve/main/tokenizer_config.json
too many times. This would sometimes cause the test to fail.
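
A minimal sketch of the two patterns described above, not the actual test code. `load_tokenizer` is a hypothetical stand-in for the real tokenizer load (which would hit the network), and `process_slow`, `process_fast`, and `run` are illustrative names only. It counts how many times the tokenizer is built in each pattern:

```python
import asyncio

load_calls = 0  # counts how often the (stand-in) tokenizer is built


def load_tokenizer():
    """Hypothetical stand-in for an expensive, network-backed tokenizer load."""
    global load_calls
    load_calls += 1
    return str.split  # trivial "tokenizer": split on whitespace


async def process_slow(sem, text):
    # Pattern described as the problem: build the tokenizer per sample,
    # while holding the semaphore.
    async with sem:
        tokenizer = load_tokenizer()
        await asyncio.sleep(0)  # placeholder for the transcription request
        return len(tokenizer(text))


async def process_fast(sem, tokenizer, text):
    # Pattern described as the fix: reuse one tokenizer supplied by the caller.
    async with sem:
        await asyncio.sleep(0)  # placeholder for the transcription request
        return len(tokenizer(text))


async def run(samples, reuse, concurrency=4):
    global load_calls
    load_calls = 0
    sem = asyncio.Semaphore(concurrency)
    if reuse:
        tokenizer = load_tokenizer()  # created once, outside the loop
        tasks = [process_fast(sem, tokenizer, s) for s in samples]
    else:
        tasks = [process_slow(sem, s) for s in samples]
    counts = await asyncio.gather(*tasks)
    return counts, load_calls


samples = ["hello world"] * 100
slow_counts, slow_loads = asyncio.run(run(samples, reuse=False))
fast_counts, fast_loads = asyncio.run(run(samples, reuse=True))
```

Both variants produce the same token counts; only the number of tokenizer loads differs, which is where the repeated requests for `tokenizer_config.json` would come from in the real test.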

On my laptop, here is the time difference:

Before:

5m3.389s

After:

2m5.471s

This is a piece split out from #21088.

Signed-off-by: Russell Bryant [email protected]


gemini-code-assist

Code Review

This pull request provides a significant and well-executed optimization for the test_transcription_api_correctness test. By initializing the tokenizer once outside the main processing loop, it correctly addresses a performance bottleneck and a source of test flakiness caused by repeated network requests to HuggingFace. The code changes are clear, logical, and effectively improve both the speed and reliability of the test suite. No issues were found in the implementation.

DarkLight1337

Thanks!


russellb requested review from DarkLight1337, aarnphm, robertgshaw2-redhat and simon-mo as code owners August 28, 2025 18:56

gemini-code-assist bot reviewed Aug 28, 2025


russellb mentioned this pull request Aug 28, 2025

[v1] Add Whisper model support (encoder-decoder) #21088

Merged

3 tasks

DarkLight1337 approved these changes Aug 29, 2025


DarkLight1337 enabled auto-merge (squash) August 29, 2025 02:26

DarkLight1337 added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Aug 29, 2025

DarkLight1337 merged commit c8b3b29 into vllm-project:main Aug 29, 2025
18 of 21 checks passed

eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025

[tests] Improve speed and reliability of test_transcription_api_corre…

be138a7

…ctness (vllm-project#23854) Signed-off-by: Russell Bryant <[email protected]>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[tests] Improve speed and reliability of test_transcription_api_corre…

c40733d

…ctness (vllm-project#23854) Signed-off-by: Russell Bryant <[email protected]>
