You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
You can set `retry_wait_time` and `max_retry_period` to handle rate limit error. And you can set `request_timeout` to handle timeout error. They can all be specified in `llm_config` for an agent, which will be used in the [`create`](/docs/reference/oai/completion#create) function for LLM inference.
106
+
107
+
-`retry_wait_time` (int): the time interval to wait (in seconds) before retrying a failed request.
108
+
-`max_retry_period` (int): the total timeout (in seconds) allowed for retrying failed requests.
109
+
-`request_timeout` (int): the timeout (in seconds) sent with a single request.
110
+
111
+
Please refer to the [documentation](/docs/Use-Cases/enhanced_inference#runtime-error) for more info.
Copy file name to clipboardExpand all lines: website/docs/Use-Cases/enhanced_inference.md
+5-1Lines changed: 5 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -123,7 +123,11 @@ API call results are cached locally and reused when the same request is issued.
123
123
124
124
### Runtime error
125
125
126
-
It is easy to hit error when calling OpenAI APIs, due to connection, rate limit, or timeout. Some of the errors are transient. `autogen.Completion.create` deals with the transient errors and retries automatically. Initial request timeout, retry timeout and retry time interval can be configured via `request_timeout`, `retry_timeout` and `autogen.Completion.retry_time`.
126
+
It is easy to hit error when calling OpenAI APIs, due to connection, rate limit, or timeout. Some of the errors are transient. `autogen.Completion.create` deals with the transient errors and retries automatically. Request timeout, max retry period and retry wait time can be configured via `request_timeout`, `max_retry_period` and `retry_wait_time`.
127
+
128
+
-`request_timeout` (int): the timeout (in seconds) sent with a single request.
129
+
-`max_retry_period` (int): the total time (in seconds) allowed for retrying failed requests.
130
+
-`retry_wait_time` (int): the time interval to wait (in seconds) before retrying a failed request.
127
131
128
132
Moreover, one can pass a list of configurations of different models/endpoints to mitigate the rate limits. For example,
0 commit comments