Before you fire off another expensive Claude API call, this server asks whether a local model could handle it instead. It exposes a single tool, check_local_viability, that analyzes your task and returns LOCAL or CLOUD with reasoning, cost savings estimates, and recommended local models like llama3.2 or mistral-7b. If you mark data as CONFIDENTIAL, it forces local routing regardless of complexity. Works over stdio or HTTP, supports LangChain and OpenAI Agents SDK integrations. Free tier gives you 20 routing decisions per month, paid tiers unlock bundles. Think of it as a gatekeeper that saves you money by catching tasks that don't need Sonnet's firepower.
claude mcp add --transport stdio ojaskord-local-model-suitability-mcp uvx local-model-suitability-mcp