When your agentic workflow is burning tokens or taking too long to respond, this walks you through a systematic performance audit. It measures baseline metrics like p50/p95 latency and cost per request, then guides you through specific optimizations: token reduction, model cascading to cheaper models for simple tasks, caching strategies, and parallelization. The structure is methodical, maybe overly so, but that's the point. It forces you to measure before and after each change and verify quality didn't regress with golden tests. The trade-off documentation is smart because speed improvements always cost something, even if it's just added complexity. Best used when you've got a working workflow that's too expensive to run at scale.
npx skills add https://github.com/sharpdeveye/maestro --skill accelerate