A practical workflow for setting up monitoring infrastructure or fixing what's already there. The decision-tree approach is smart: it routes you to metrics design if you're starting fresh, to troubleshooting if something's broken, or to the specific sections on alerts, dashboards, and SLOs if you're iterating. There's also a dedicated section on Datadog cost optimization, which hints at the kind of shops that contributed to it: teams already running Datadog and watching the bill. It's comprehensive enough to serve as your observability playbook without being prescriptive about tools, and the tool comparison reference suggests it can help with vendor decisions too. Good for platform teams that need structured guidance instead of scattered tribal knowledge.
npx skills add https://github.com/ahmedasmar/devops-claude-skills --skill monitoring-observability