This is a security wrapper around AIDefence that checks user input for prompt injection attacks, jailbreak attempts, and other adversarial patterns before you process it. You get a quick boolean safety check, detailed threat analysis with confidence scores, and the ability to train the system on new attack patterns. If you're building anything that accepts untrusted input from users or APIs, this gives you a structured way to catch injection attempts and policy violations before they hit your prompts. The stats endpoint is a nice touch for tracking detection rates over time.
npx skills add https://github.com/ruvnet/ruflo --skill safety-scan