If you're training models with reinforcement learning from human feedback, this skill wraps OpenRLHF, a Ray-based framework that uses vLLM for inference acceleration during training. It's built for distributed setups and comes with Docker container instructions that handle the slightly messy dependency conflicts (uninstalling xgboost, transformer_engine, flash_attn). The installation is straightforward if you're already working in containerized environments with NVIDIA GPUs. Worth noting this has over 27k GitHub stars, so it's a well-established project in the RLHF space. You'd reach for this when you need to scale up reinforcement learning training beyond what fits on a single machine and want inference that doesn't bottleneck your iteration speed.
npx skills add https://github.com/davila7/claude-code-templates --skill openrlhf-training