This gets Kokoro TTS running for local voice synthesis without cloud dependencies. It's built around streaming output to minimize latency, which matters if you're building something like a voice assistant that needs to feel responsive. The implementation prioritizes security with content filtering for sensitive data like passwords and API keys, plus proper cleanup of generated audio files. Includes test patterns for validating audio output quality, sample rates, and handling edge cases. The performance optimizations are practical: model caching to avoid cold starts, chunked audio processing to save memory, and async generation so you don't block your event loop. If you need offline TTS with decent guardrails, this covers the essentials.
npx skills add https://github.com/martinholovsky/claude-skills-generator --skill text-to-speech