-
-
Notifications
You must be signed in to change notification settings - Fork 406
Error on chat openai with litellm api for self hosted llm #1308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@Christhian16 Thanks for reporting this issue. From the error logs you provided, it doesn't seem like this is an issue with Jupyter AI. The error logs suggest that this is an issue with the LiteLLM server that you are connecting to. Specifically, To remedy this, I recommend looking through the LiteLLM docs to see if Hope this helps! Let me know if you have other questions. |
@dlqqq Thank you for your quick response.
10.Note that using the jupyter_ai_magics like below work for me: See the full error belowTraceback (most recent call last): |
hello @dlqqq It only works with "meta-llama/Llama-3.1-70B" and litellm for the api. See below: Thank you in advance for your response. Kind regards |
@Christhian16 @dlqqq Let me look into it this week. I am working on another PR and should be able to rotate to this one after that. |
@srdas I'd like to reiterate my needs regarding this bug. |
@Christhian16 Can you briefly explain how you have deployed your self-hosted |
@srdas
Thank you in advance for the help on this |
Description
I'm currently using self hosted llm through litellm api.
When I use the Openai library with "openai.OpenAI.completions.create" on a notebook it worked but when I try to configure the jupyterlab-ai chat extension [filled in api_url, api_key, model_id] I get the following error:
"openai.InternalServerError: Error code: 500 - {'error': {'message': 'litellm.APIError: APIError: Hosted_vllmException - Internal Server Error\nReceived Model Group=google/gemma2-27b\nAvailable Model Group Fallbacks=None', 'type': None, 'param': None, 'code': '500'}}"
Reproduce
%%ai openai-chat-custom:google/gemma2-27b
Write a short poem
Expected behavior
I was expecting the chat to return an answer without an error
Context
The text was updated successfully, but these errors were encountered: