This one's built for when you need real production observability, not just a quick dashboard. It covers the full stack: Prometheus and Grafana for metrics and dashboards, distributed tracing with Jaeger and OpenTelemetry, the ELK stack for logs, and SLI/SLO management with error budgets. The skill shines when you're designing monitoring from scratch or fixing noisy alerting that pages people for nothing. It also handles chaos engineering and cost optimization, which matters once your observability bill starts competing with your actual infrastructure costs. Good fit if you're operating at scale and need guidance that knows the difference between metrics, logs, and traces, and when to use each.
npx skills add https://github.com/sickn33/antigravity-awesome-skills --skill observability-engineer
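For readers new to the error budgets mentioned in the description, here is a minimal sketch of the arithmetic behind them. It is not taken from the skill itself: the function name `error_budget`, its parameters, and the request counts are hypothetical and chosen only for illustration.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarize error-budget consumption for a request-based availability SLO.

    Hypothetical helper for illustration only; not part of the skill.
    slo_target is a fraction, e.g. 0.999 for a "three nines" SLO.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")

    # The budget is the share of requests that may fail without breaching the SLO.
    allowed_failures = (1.0 - slo_target) * total_requests
    consumed_fraction = failed_requests / allowed_failures

    return {
        "allowed_failures": allowed_failures,
        "consumed_fraction": consumed_fraction,
        # Negative when the budget is overspent.
        "remaining_fraction": 1.0 - consumed_fraction,
        # The measured SLI: fraction of requests that succeeded.
        "sli": 1.0 - failed_requests / total_requests,
    }


if __name__ == "__main__":
    # Assumed numbers: 1,000,000 requests in the window, 250 of them failed.
    budget = error_budget(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
    for key, value in budget.items():
        print(f"{key}: {value:.4f}")
```

With a 99.9% target over a million requests, about 1,000 failures are allowed, so 250 failures use roughly a quarter of the budget. A common design choice is to alert on how fast the budget is being spent rather than on individual errors, which is one way to cut down on pages that wake people up for nothing.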