This is a structured checklist that stops you from running A/B tests before you've actually thought them through. It enforces hard gates at hypothesis definition, sample size calculation, and metric selection, refusing to proceed until you've locked in a primary metric and calculated whether you have enough traffic to detect your minimum effect size. The discipline here is real: no peeking at results early, no changing success criteria mid-test, and guardrail metrics that can veto a win if they tank. Use it when you need to run statistically valid experiments and want something that will actively push back on sloppy test design. It's opinionated about rigor over speed, which is either exactly what you need or going to feel heavy handed depending on your team's maturity.
npx skills add https://github.com/sickn33/antigravity-awesome-skills --skill ab-test-setup