Mcp Llm Gateway

STDIOregistry active

Summary

This is a proxy layer that sits between Claude and any OpenAI-compatible LLM API. You point it at a downstream provider via DOWNSTREAM_URL, set a default model, and it exposes two MCP tools: list_models() to see what's available and complete() to send prompts with configurable temperature and max tokens. It also surfaces resources for the model list and current config. Reach for this when you want Claude to route completion requests to a different LLM provider without changing your client setup, or when you need to query multiple models through a single gateway. It handles API key passthrough and model discovery from endpoints like models.dev, so you can swap providers by changing environment variables.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

MCP LLM Gateway

MCP-compatible LLM gateway that proxies completion requests to downstream OpenAI-compatible providers.

mcp-name: io.github.daedalus/mcp-llm-gateway

Install

pip install mcp-llm-gateway

Usage

Configuration

Set the following environment variables:

DOWNSTREAM_URL: Base URL for the OpenAI-compatible downstream API (required)
DEFAULT_MODEL: Default model to use for completions (required)
MODEL_LIST_URL: URL to fetch available models from (optional, defaults to models.dev)
API_KEY: Optional API key for downstream (passthrough)
TIMEOUT: Request timeout in seconds (optional, default: 60)

MCP Server

Run the MCP server with stdio transport:

mcp-llm-gateway

MCP Tools

The server exposes the following tools:

list_models(): List all available models from the remote endpoint
complete(prompt, model, max_tokens, temperature): Send a completion request to the downstream LLM provider

MCP Resources

models://list: Returns the list of available models
config://info: Returns current gateway configuration

Development

git clone https://github.com/daedalus/mcp-llm-gateway.git
cd mcp-llm-gateway
pip install -e ".[test]"

# run tests
pytest

# format
ruff format src/ tests/

# lint
ruff check src/ tests/

# type check
mypy src/

API

core.models

Model: Dataclass representing an available LLM model
CompletionRequest: Dataclass for completion request payloads
GatewayConfig: Dataclass for gateway configuration

adapters.http

HTTPAdapter: HTTP client for downstream API communication
ModelListAdapter: Adapter for fetching model list from remote endpoints

services.gateway

ModelService: Service for managing model discovery and caching
CompletionService: Service for handling completion requests
ConfigService: Service for managing gateway configuration

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

MCP LLM Gateway

MCP-compatible LLM gateway that proxies completion requests to downstream OpenAI-compatible providers.

mcp-name: io.github.daedalus/mcp-llm-gateway

Install

pip install mcp-llm-gateway

Usage

Configuration

Set the following environment variables:

DOWNSTREAM_URL: Base URL for the OpenAI-compatible downstream API (required)
DEFAULT_MODEL: Default model to use for completions (required)
MODEL_LIST_URL: URL to fetch available models from (optional, defaults to models.dev)
API_KEY: Optional API key for downstream (passthrough)
TIMEOUT: Request timeout in seconds (optional, default: 60)

MCP Server

Run the MCP server with stdio transport:

mcp-llm-gateway

MCP Tools

The server exposes the following tools:

list_models(): List all available models from the remote endpoint
complete(prompt, model, max_tokens, temperature): Send a completion request to the downstream LLM provider

MCP Resources

models://list: Returns the list of available models
config://info: Returns current gateway configuration

Development

git clone https://github.com/daedalus/mcp-llm-gateway.git
cd mcp-llm-gateway
pip install -e ".[test]"

# run tests
pytest

# format
ruff format src/ tests/

# lint
ruff check src/ tests/

# type check
mypy src/

API

core.models

Model: Dataclass representing an available LLM model
CompletionRequest: Dataclass for completion request payloads
GatewayConfig: Dataclass for gateway configuration

adapters.http

HTTPAdapter: HTTP client for downstream API communication
ModelListAdapter: Adapter for fetching model list from remote endpoints

services.gateway

ModelService: Service for managing model discovery and caching
CompletionService: Service for handling completion requests
ConfigService: Service for managing gateway configuration

Mcp Llm Gateway

MCP LLM Gateway

Install

Usage

Configuration

MCP Server

MCP Tools

MCP Resources

Development

API

core.models

adapters.http

services.gateway

Mcp Llm Gateway

MCP LLM Gateway

Install

Usage

Configuration

MCP Server

MCP Tools

MCP Resources

Development

API

core.models

adapters.http

services.gateway

Related AI & LLM Tools MCP Servers

Related AI & LLM Tools MCP Servers