This skill gives Claude the knowledge to work with Stable Baselines3, the PyTorch reinforcement learning (RL) library. It covers training agents with PPO, SAC, DQN, and other algorithms, building custom Gym environments and validating them, setting up vectorized environments for parallel training, and implementing callbacks for monitoring. It includes working code patterns for common workflows such as saving models, evaluating policies, and recording videos. Use it when you need to prototype RL experiments, train agents on custom tasks, or integrate reinforcement learning into a project without digging through the docs. The guidance on custom environment constraints and vectorized environment pitfalls is especially useful, since both are common sources of errors.
npx skills add https://github.com/davila7/claude-code-templates --skill stable-baselines3