This skill handles video transcription and understanding across nine providers with automatic fallback. Give it a YouTube URL or a local file and it picks the best available provider based on which API keys you have configured, preferring full video analysis (Gemini, Vertex AI, OpenRouter) over audio-only transcription. The smart part is the fallback chain: with no API keys configured, it drops down to FFmpeg with scene detection or local Whisper. Each provider supports multiple models, output is normalized to the same JSON shape across all of them, and it handles the annoying bits like macOS path issues and format conversion. Useful when you need to extract information from video content without manually choosing and configuring a service.
npx skills add https://github.com/jrusso1020/video-understand-skills --skill video-understand
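
To make the fallback chain concrete, here is a minimal Python sketch of key-based provider selection. It is not the skill's actual code: the environment variable names, the provider identifiers, the `pick_provider` helper, and the ordering of FFmpeg ahead of Whisper are all assumptions made for illustration, and only the providers named in the description appear.

```python
from dataclasses import dataclass
from typing import Collection, Mapping, Optional


@dataclass(frozen=True)
class Provider:
    name: str
    env_var: Optional[str]  # None means a local tool that needs no API key
    full_video: bool        # True for full video analysis, False for fallbacks


# Hypothetical priority order: full-video providers first, local fallbacks last.
# The environment variable names below are assumed, not taken from the skill.
PROVIDERS = [
    Provider("gemini", "GEMINI_API_KEY", True),
    Provider("vertex", "GOOGLE_APPLICATION_CREDENTIALS", True),
    Provider("openrouter", "OPENROUTER_API_KEY", True),
    Provider("ffmpeg", None, False),
    Provider("whisper", None, False),
]


def pick_provider(env: Mapping[str, str],
                  local_tools: Collection[str] = ()) -> Provider:
    """Return the first usable provider in priority order.

    A keyed provider is usable when its variable is set and non-empty in `env`;
    a local provider is usable when its name is listed in `local_tools`.
    """
    for provider in PROVIDERS:
        if provider.env_var is not None:
            if env.get(provider.env_var):
                return provider
        elif provider.name in local_tools:
            return provider
    raise LookupError("no video provider is available")
```

In practice you would pass `os.environ` as `env` and build `local_tools` from whatever is installed on the machine. With only `OPENROUTER_API_KEY` set, this sketch returns the `openrouter` entry; with no keys and FFmpeg installed, it falls through to `ffmpeg`.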