Ollama API Proxy for LLM Providers

This application serves as a proxy that implements the Ollama API interface but forwards requests to different LLM providers like Anthropic's Claude and Perplexity AI. This allows IDE plugins that support Ollama to work with these alternative LLM providers.

Looking at you, JetBrains.

Features

  • Implements Ollama's API endpoints:
    • /api/chat - For chat completions
    • /api/tags - Lists available models
    • /api/show - Shows model details
    • / - Health check endpoint
  • Supports multiple LLM providers:
    • Perplexity AI (Llama models)
    • Anthropic (Claude models)
  • Configurable server settings
  • Easy provider switching via configuration

Installation

  1. Clone the repository
  2. Install Rust if you haven't already (https://rustup.rs/)
  3. Create a Config.toml file in the project root

Configuration

Create a Config.toml file with the following structure:

# Provider configuration
provider_type = "perplexity"  # or "anthropic"
perplexity_api_key = "your-perplexity-key"
anthropic_api_key = "your-anthropic-key"

# Server configuration
[server]
host = "127.0.0.1"
port = 11434

Available Models

Perplexity AI

  • llama-3.1-sonar-small-128k-online (8B parameters)
  • llama-3.1-sonar-large-128k-online (70B parameters)
  • llama-3.1-sonar-huge-128k-online (405B parameters)

Anthropic

  • claude-3-5-sonnet-20241022
  • claude-3-5-haiku-20241022
  • claude-3-opus-20240229

Usage

  1. Start the server:

    cargo run

  2. Configure your IDE's Ollama plugin to use the proxy URL:

    http://localhost:11434  # or your configured host:port
    

API Endpoints

GET /

Returns "Ollama is running" to indicate the server is up.

GET /api/tags

Lists all available models for the configured provider.
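The response follows Ollama's tags format. An illustrative example is shown below; the field values are made up, and the proxy may omit or stub some fields:

{
  "models": [
    {
      "name": "claude-3-5-sonnet-20241022",
      "modified_at": "2024-10-22T00:00:00Z",
      "size": 0
    }
  ]
}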

POST /api/chat

Handles chat completions. Example request:

{
  "model": "llama2",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Hello!"
    }
  ],
  "options": {
    "temperature": 0.7,
    "top_p": 0.9
  }
}
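
Responses come back in Ollama's chat format. An illustrative (not actual) reply might look like:

{
  "model": "llama2",
  "created_at": "2024-01-01T00:00:00Z",
  "message": {
    "role": "assistant",
    "content": "Hello! How can I help you today?"
  },
  "done": true
}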

POST /api/show

Shows details about a specific model. Example request:

{
  "name": "llama2"
}
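
The response mirrors Ollama's show format. A minimal illustrative example follows; the field contents are placeholders, not actual output:

{
  "modelfile": "",
  "parameters": "",
  "template": "",
  "details": {
    "family": "llama",
    "parameter_size": "8B"
  }
}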

Development

The application is built with:

  • Rust
  • Axum web framework
  • Tokio async runtime
  • Reqwest as the HTTP client

The architecture follows a trait-based approach for provider implementations, making it easy to add new LLM providers.
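
As an illustration only, a provider trait in this style might look roughly like the sketch below; the names and signatures are hypothetical and not taken from the actual source.

use async_trait::async_trait;

/// Hypothetical message type passed through the proxy.
pub struct ChatMessage {
    pub role: String,
    pub content: String,
}

/// Hypothetical provider abstraction; each upstream API (Perplexity,
/// Anthropic, ...) would implement this trait.
#[async_trait]
pub trait LlmProvider: Send + Sync {
    /// Model names to expose through /api/tags.
    fn models(&self) -> Vec<String>;

    /// Forward a chat request upstream and map the reply back into an
    /// Ollama-style assistant message.
    async fn chat(
        &self,
        model: &str,
        messages: &[ChatMessage],
    ) -> Result<String, Box<dyn std::error::Error + Send + Sync>>;
}

Adding a new provider would then amount to implementing such a trait for the new API and selecting it via provider_type in Config.toml.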

Error Handling

The application includes error handling for the following cases (a rough sketch of an error type follows the list):

  • API communication errors
  • Request parsing errors
  • Configuration errors
  • Invalid model selections
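
Purely as an illustration (the real error types may be organized differently), these cases could map onto an error enum along these lines:

use std::fmt;

/// Hypothetical error type covering the cases listed above.
#[derive(Debug)]
pub enum ProxyError {
    /// Upstream provider returned an error or was unreachable.
    Api(String),
    /// The incoming request body could not be parsed.
    RequestParse(String),
    /// Config.toml is missing or invalid.
    Config(String),
    /// The requested model is not offered by the configured provider.
    InvalidModel(String),
}

impl fmt::Display for ProxyError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            ProxyError::Api(msg) => write!(f, "API communication error: {msg}"),
            ProxyError::RequestParse(msg) => write!(f, "request parsing error: {msg}"),
            ProxyError::Config(msg) => write!(f, "configuration error: {msg}"),
            ProxyError::InvalidModel(model) => write!(f, "invalid model selection: {model}"),
        }
    }
}

impl std::error::Error for ProxyError {}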

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.
