Exposes live LLM pricing comparisons through five tools: get_pricing for real-time GNK/USD rates and per-million-token costs, compare_providers to benchmark against OpenAI/Anthropic/DeepSeek, calculate_savings to estimate monthly and annual cost reductions based on current spend, get_available_models for listing inference options, and get_signup_link for referral-enabled registration. Pricing data refreshes every 10 minutes from Gonka's API, with competitor rates pulled daily from LiteLLM's database. Useful when evaluating inference cost optimization or building cost-aware agent workflows. Works over streamable HTTP, so you can call it from Claude Desktop, the Python MCP SDK, fastmcp CLI, or raw curl with session headers.
claude mcp add --transport http bystray-gonka-mcp-server https://mcp.gogonka.com/mcp