This is a cost-first model routing strategy that keeps you on local Ollama models (qwen3-coder, gpt-oss) unless infrastructure actually fails. It defines five tiers, from free local models up to paid Claude Opus, and the philosophy is explicit: you climb the ladder only on 429s or timeouts, never because a task looks hard or you want better output. Tier 1 uses Cerebras, free at up to 2100 TPS; tier 3 brings in Hermes 405B for deep reasoning; tier 4 is break-glass only, with spending alerts at $5 daily. If you keep burning through API budgets, or want to stay on local models by default with automatic failover only when services are down, this is the routing logic for that. It includes health checks and a spend guard that flags you when OpenRouter usage crosses thresholds.
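The escalation rule described above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: `InfraError`, `Tier`, `route`, and `spend_alert` are hypothetical names, each tier's `call` is a stand-in for a real client, and treating "crosses the threshold" as greater-than-or-equal is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


class InfraError(Exception):
    """Hypothetical marker for an HTTP 429 or a timeout, the only escalation triggers."""


@dataclass
class Tier:
    level: int                    # position on the ladder (0 = local)
    name: str                     # label only, e.g. "local-ollama"
    call: Callable[[str], str]    # hypothetical client: prompt -> completion
    break_glass: bool = False     # True for the paid last-resort tier


def route(prompt: str, ladder: List[Tier],
          allow_break_glass: bool = False) -> Tuple[int, str]:
    """Try tiers in order and climb only when a tier raises InfraError.

    Any other exception propagates unchanged: task difficulty or weak
    output is never a reason to move up a tier.
    """
    last_error: Optional[InfraError] = None
    for tier in ladder:
        if tier.break_glass and not allow_break_glass:
            continue  # paid tier stays locked unless explicitly opened
        try:
            return tier.level, tier.call(prompt)
        except InfraError as exc:
            last_error = exc  # infrastructure failure: try the next tier
    raise RuntimeError("all permitted tiers failed") from last_error


def spend_alert(daily_spend_usd: float, threshold_usd: float = 5.0) -> bool:
    """Flag when daily spend reaches the alert threshold ($5 in the description)."""
    return daily_spend_usd >= threshold_usd
```

With a ladder whose local tier raises `InfraError` and whose next tier answers, `route` returns the second tier's level and output; the break-glass tier is skipped unless `allow_break_glass=True` is passed explicitly.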
npx -y skills add open-gitagent/opengap --skill compute-ladder --agent claude-code
Installs into .claude/skills of the current project.