This skill is a foundation for production monitoring with Prometheus and Grafana, covering basic scrape configs, Kubernetes service discovery, and PromQL queries. It includes practical patterns for the four pillars of observability (metrics, logs, traces, events), concrete examples of recording rules and alerting setups, and guidance on the label cardinality pitfalls that cause problems at scale. The SLO section is especially useful: it walks through error budgets and reliability targets with real numbers. If you're moving beyond basic uptime checks or migrating from legacy monitoring tools, the Kubernetes autodiscovery examples and relabeling configs alone can save you hours of digging through documentation.
npx skills add https://github.com/manutej/luxor-claude-marketplace --skill observability-monitoring
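
To give a sense of the error-budget arithmetic the SLO section deals with, here is a minimal sketch. It is not taken from the skill itself; the function names `error_budget_minutes` and `budget_remaining`, the 30-day default window, and the request counts are all assumptions chosen for illustration.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (hypothetical example value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Fraction of the error budget still unspent for a request-based SLO.

    A negative result means the budget has been overspent.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Illustrative numbers only: a 99.9% target over a 30-day window.
    print(f"downtime budget: {error_budget_minutes(0.999):.1f} minutes")
    print(f"budget remaining: {budget_remaining(0.999, 1_000_000, 250):.0%}")
```

Under these assumptions, a 99.9% target over 30 days allows roughly 43.2 minutes of downtime, and 250 failures out of one million requests leaves about three quarters of the budget unspent.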