Use exponential_backoff_retry for completion call #8023

TomeHirata · 2025-03-27T09:01:09Z

Some users reported that their programs were stuck due to Rate limit errors. This PR configures exponential backoff retry for LiteLLM completion call since LiteLLM uses constant backoff even for rate limits (ref), which is ineffective.

One tradeoff here is that we will start using exponential backoff for other types of exceptions (e.g. internal server error) after this change. LiteLLM has a smart logic for async completion that it switches to exponential backoff only for RateLimitError (ref), but this does not exist for sync completion. Therefore, another solution is that we file a PR to LiteLLM side to implement the logic for sync completion to use exponential backoff only to RateLimitError.

TomeHirata requested review from chenmoneygithub and okhat March 27, 2025 09:01

use exponential_backoff_retry for completion call

7b41bc8

TomeHirata force-pushed the feat/exponential_backoff_retry branch from 04624d5 to 7858751 Compare March 31, 2025 07:50

add test for exponential_backoff_retry

70ac8ae

TomeHirata force-pushed the feat/exponential_backoff_retry branch from 7858751 to 70ac8ae Compare March 31, 2025 07:56

okhat merged commit 7a877d1 into stanfordnlp:main Mar 31, 2025
4 checks passed

TomeHirata mentioned this pull request Apr 9, 2025

Rate Limiting #2048

Open

renovate bot mentioned this pull request Sep 12, 2025

Update all non-major dependencies autoblocksai/autoblocks-examples#242

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use exponential_backoff_retry for completion call #8023

Use exponential_backoff_retry for completion call #8023

Uh oh!

TomeHirata commented Mar 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Use exponential_backoff_retry for completion call #8023

Use exponential_backoff_retry for completion call #8023

Uh oh!

Conversation

TomeHirata commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

TomeHirata commented Mar 27, 2025 •

edited

Loading