This walks you through systematic problem solving when things break in production. It's built around the 5 Whys technique and includes a structured RCA process with templates for incident reports. You get concrete examples like tracing a 503 error back through database connection pools to missing indexes to inadequate load testing environments. The skill covers when to do an RCA (production incidents, recurring problems, performance issues) and includes follow-up guides for prevention. Honestly most valuable for teams that keep fighting the same fires instead of fixing the underlying issues. The templates help you actually document what went wrong instead of just scrambling to restore service and moving on.
npx skills add https://github.com/aj-geddes/useful-ai-prompts --skill root-cause-analysis