This orchestrates Nano Banana and Kling AI to generate talking head videos that don't scream "AI generated." The workflow is thorough: generate a base image with intentional imperfections like pores and mixed lighting, chunk your script into 55-60 syllable segments for natural pacing, then generate 10-second video clips with choreographed micro-movements. The documentation is refreshingly honest about the hands problem (AI models can't do realistic hand movement, so crop them out or keep them static). It's a multi-step process that requires post-production, but if you need UGC-style spokesperson videos and care about them looking real, this gives you the specific prompting strategies and pacing math to pull it off.
npx skills add https://github.com/dennisonbertram/claude-media-skills --skill realistic-ugc-video