List of supported models (and sticky)? #14601
Replies: 3 comments
-
Not quite. Some seem to be, some not. Not format-wise (GGUF is there), but function-wise: tokenization doesn't work. Qwen3 Embedding and Qwen3 Reranking models are not supported yet. Also, the GGUFs provided by Qwen are missing important metadata and will not work until that is fixed and support is added.
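If you want to check what metadata a particular GGUF actually carries (for example, whether the tokenizer keys are present), one quick way is to read it with the `gguf` Python package that ships with llama.cpp. A minimal sketch, assuming the package is installed; the file name is a placeholder:

```python
# Minimal sketch (not an official llama.cpp tool): list the metadata keys
# stored in a GGUF file using the `gguf` Python package.
from gguf import GGUFReader

reader = GGUFReader("Qwen3-Embedding-0.6B-f16.gguf")  # hypothetical local file

# Print every metadata key the converter wrote into the file.
for name in reader.fields:
    print(name)

# Spot-check keys that architecture dispatch and tokenization depend on;
# if these are missing, llama.cpp cannot load or tokenize with this file.
for key in ("general.architecture", "tokenizer.ggml.model", "tokenizer.ggml.tokens"):
    print(key, "->", "present" if key in reader.fields else "MISSING")
```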
-
https://github.com/ggml-org/llama.cpp/blob/master/src/llama-arch.h will show you the supported architectures. mtmd is still being worked towards, so VLMs won't be immediately obvious there. For finding embeddings (or tensor names), you can use the llama-eval-callback example to dump the tensors. To generate a GGUF from HF, you would use convert_hf_to_gguf.py to produce .gguf files from their HF implementation.
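For the HF-to-GGUF step, the converter is driven from the command line; here is a minimal sketch of wrapping it from Python, assuming a local llama.cpp checkout and an already-downloaded HF model directory (both paths are placeholders):

```python
# Minimal sketch: run llama.cpp's convert_hf_to_gguf.py on a local HF model
# directory. It only succeeds for architectures the converter already knows
# about (i.e. those registered in the repo, see llama-arch.h).
import subprocess

hf_model_dir = "/models/Qwen3-Embedding-0.6B"        # hypothetical HF snapshot
out_file = "/models/Qwen3-Embedding-0.6B-f16.gguf"   # hypothetical output path

subprocess.run(
    [
        "python", "llama.cpp/convert_hf_to_gguf.py",
        hf_model_dir,
        "--outfile", out_file,
        "--outtype", "f16",
    ],
    check=True,  # raise if the converter rejects the architecture or metadata
)
```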
-
Where can I find the list of supported models?
Embedding: ?
Rerank: ?
Generate: ?
Thx