This is the TTS skill you'd actually use during development because it's stupidly fast. 90ms to first sound versus the usual 4-8 second wait means you can get audio feedback without breaking your flow. The tradeoff is simple: you get 8 built-in voices instead of custom cloning, and no emotion tags. First run takes a few seconds while the daemon loads a 100MB model into memory, then it stays warm for an hour. The Rust CLI streams from a Python daemon running pocket-tts, and it'll auto-manage the daemon lifecycle. If you need Morgan Freeman or laugh tags, use the speak skill instead. If you just need Claude to talk back without the wait, this is it.
npx skills add https://github.com/emzod/speak-turbo --skill speakturbo-tts