feat: pass an image as part of the evaluation #88
Comments
Interested in this kind of multimodal support. Any update on progress?
Does this encompass adding support for running llava models, or should that be a separate feature request? I noticed that llama-cpp-python already includes the llava support from llama.cpp, so setting up the bindings shouldn't be too hard.
I haven't started working on this yet, but it is planned as part of the roadmap. I'll work on this once
Makes sense. Hopefully llama.cpp finalizes that API.
Any update on this? Would love to leverage multimodal models! Love the library so far :)
An official API for this is in active development on
Beautiful. Looking forward to it, this library is insane.
Any updates on this feature? It seems there was some experimental support for Gemma 3 vision last week: ggml-org/llama.cpp#12344
@wisng I'm waiting for an official stable API for this, which is still in the works.
Perhaps we can't call it
I've started poking around with the
When llama.cpp's support for this will be stable. Hopefully, there will be an official API for this after ggml-org/llama.cpp#11292 is implemented.