If you've ever tried to turn a raw transcript into readable prose, you know the pain: filler words everywhere, wrong homophones from speech recognition, and walls of text with no breathing room. This skill tackles that by cleaning up transcripts while keeping the speaker's actual words and style intact. It handles both solo talks and multi-person conversations, fixes common errors like 的/得/地 confusion, and breaks things into paragraphs based on semantic shifts rather than arbitrary sentence counts. The approach is more editor than summarizer, which means you get polished text that still sounds like the person talking, not a corporate rewrite. Works with SRT, VTT, or plain text files.
npx skills add https://github.com/rookie-ricardo/erduo-skills --skill transcript-polisher