A comprehensive wrapper around MiniMax's multimodal API that handles text chat, image generation, video synthesis, speech, and music creation from the command line. The standout feature is how well it plays with agents: proper non-interactive flags, JSON output, async task handling for long-running video jobs, and stdin/stdout streaming where it matters. The music generation is surprisingly detailed, letting you specify not just genre but BPM, key signature, vocal style, and song structure. If you're building workflows that need to orchestrate multiple media types through a single interface, this consolidates what would otherwise be a pile of separate API calls into one consistent CLI.
npx -y skills add minimax-ai/skills --skill minimax-multimodal-toolkit --agent claude-codeInstalls into .claude/skills of the current project.
Select a file.
juliusbrussee/caveman
mattpocock/skills
shadcn/improve
obra/superpowers
forrestchang/andrej-karpathy-skills
vercel-labs/skills