Cost-aware-llm-pipeline gives developers building LLM-powered applications a set of patterns for reducing API costs while maintaining output quality: model routing by task complexity, immutable budget tracking, targeted retry logic, and prompt caching. It addresses the problem of controlling spend across many LLM API calls on workloads of varying complexity, without sacrificing performance on demanding tasks. The skill is designed for teams that process items in batches or make a high volume of API calls and need to stay within a defined budget.
npx skills add https://github.com/affaan-m/everything-claude-code --skill cost-aware-llm-pipeline
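
To make the patterns named in the description concrete, here is a minimal Python sketch of three of them: routing by complexity, immutable budget tracking, and targeted retries. It is an illustration under stated assumptions, not the skill's actual code. The names `Budget`, `TransientError`, `pick_model`, and `call_with_retry`, the tier labels `small-model` and `large-model`, and the character-count threshold are all hypothetical. Prompt caching is left out because it depends on the provider's API.

```python
from dataclasses import dataclass, replace
import time


@dataclass(frozen=True)
class Budget:
    """Immutable spend tracker: recording a cost returns a new Budget."""
    limit_usd: float
    spent_usd: float = 0.0

    def record(self, cost_usd: float) -> "Budget":
        # The original instance is never mutated.
        return replace(self, spent_usd=self.spent_usd + cost_usd)

    @property
    def exhausted(self) -> bool:
        return self.spent_usd >= self.limit_usd


class TransientError(Exception):
    """Hypothetical stand-in for rate-limit or timeout failures."""


def pick_model(text: str, threshold_chars: int = 2000) -> str:
    # Assumed heuristic: input length as a rough proxy for task complexity.
    return "large-model" if len(text) >= threshold_chars else "small-model"


def call_with_retry(fn, max_attempts: int = 3, base_delay: float = 1.0):
    """Retry only on TransientError; any other exception propagates at once."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except TransientError:
            if attempt == max_attempts - 1:
                raise
            # Exponential backoff between attempts.
            time.sleep(base_delay * 2 ** attempt)
```

A pipeline built this way would check `budget.exhausted` before each call, choose a tier with `pick_model`, wrap the request in `call_with_retry`, and rebind `budget = budget.record(cost)` afterwards. Retrying only transient failures avoids paying again for requests that would fail the same way on every attempt.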