This is meta-tooling for building and refining skills themselves. It walks you through the full cycle: drafting a skill from a rough idea, generating test prompts, running quantitative evals with variance analysis, reviewing results both qualitatively and quantitatively, then iterating on whatever breaks. It can also optimize skill descriptions to improve triggering accuracy, which matters because Claude tends to undertrigger skills. The approach adapts to the user's technical level and is flexible about process: if you want formal benchmarking, it will set that up; if you just want to vibe and iterate, it does that too. Essentially, it is a skill workshop that handles the tedious parts of the create, test, measure, revise loop.
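To make "quantitative evals with variance analysis" concrete, here is a minimal sketch of comparing repeated eval runs before and after a revision. This is not the skill's own implementation: the function name `summarize_runs`, the pass-rate numbers, and the "improvement must exceed the run-to-run noise" rule are all assumptions made up for illustration.

```python
from statistics import mean, stdev


def summarize_runs(pass_rates):
    """Summarize repeated eval runs: mean, sample standard deviation, and range."""
    if len(pass_rates) < 2:
        raise ValueError("need at least two runs to estimate variance")
    return {
        "mean": mean(pass_rates),
        "stdev": stdev(pass_rates),
        "spread": max(pass_rates) - min(pass_rates),
    }


# Hypothetical pass rates from four runs of the same test prompts
# (illustrative numbers only, not real results).
baseline = summarize_runs([0.50, 0.75, 0.50, 0.75])
revised = summarize_runs([0.75, 1.00, 0.75, 1.00])

# Assumed heuristic: count a revision as an improvement only when the gain
# in mean pass rate is larger than the noisier of the two run sets.
improvement = revised["mean"] - baseline["mean"]
is_meaningful = improvement > max(baseline["stdev"], revised["stdev"])

print(f"baseline mean={baseline['mean']:.3f} stdev={baseline['stdev']:.3f}")
print(f"revised  mean={revised['mean']:.3f} stdev={revised['stdev']:.3f}")
print(f"improvement={improvement:.3f} meaningful={is_meaningful}")
```

The reason to look at spread as well as the mean is that a single run can make a revision look better or worse than it is; repeating the run shows how much of the difference is noise.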
npx -y skills add syahiidkamil/software-engineer-ai-agent-atlas --skill skill-creator --agent claude-code
Installs into .claude/skills of the current project.