Stars
Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models wi…
Model swapping for llama.cpp (or any local OpenAPI compatible server)