If you're running PyTorch training across multiple GPUs or machines and don't want to rewrite your entire codebase, this skill wraps HuggingFace's Accelerate library. The pitch is simple: add four lines of code to convert a single-GPU script into something that works with distributed setups, mixed precision, and different hardware backends. It's part of the HuggingFace ecosystem, so it integrates cleanly if you're already using their transformers or datasets libraries. The skill comes from a template collection with decent traction (27.7K stars), though it passed two security audits and got a warning from Snyk. Honestly, Accelerate shines when you want distributed training without the boilerplate of raw PyTorch DDP or DeepSpeed configuration.
npx skills add https://github.com/davila7/claude-code-templates --skill huggingface-accelerate