Plugs Claude directly into Google's Gemini image generation API, specifically the Nano Banana and Nano Banana Pro models. You get five tools over stdio: list available models, generate images from text prompts, edit or compose existing images with instructions, batch generate consistent image sets from a master prompt, and run multi-turn refinement sessions via the Interactions API. Images land on disk by default with paths returned, or you can pull them inline as base64. Requires a Gemini API key and runs through the generativelanguage.googleapis.com v1beta endpoint. Reach for this when you need Claude to generate or iterate on images without leaving the MCP context.
GEMINI_API_KEY*secretYour Google Gemini API key (aistudio.google.com/apikey)
GEMINI_IMAGE_MODELOverride the default image model (default: gemini-3-pro-image)
GEMINI_OUTPUT_DIRDefault directory for generated images (default: current working directory)
GEMINI_INPUT_DIRDirectory to resolve bare input-image filenames against
MCP server for Google Gemini image generation and editing. Exposes four tools to Claude over stdio: list available models, generate images from text prompts, edit or compose images with text instructions, and generate a consistent set of images from a master prompt. Images are written to disk by default (path returned) or returned inline as base64. Built on the Gemini v1beta API (generativelanguage.googleapis.com) using the Nano Banana / Nano Banana Pro model family.
Developed and maintained by AI (Claude Code).
| Variable | Required | Description |
|---|---|---|
GEMINI_API_KEY | Yes | Your Google Gemini API key (aistudio.google.com/apikey) |
GEMINI_IMAGE_MODEL | No | Override the default image model (default: gemini-3-pro-image) |
GEMINI_OUTPUT_DIR | No | Default directory for generated images (default: current working directory) |
GEMINI_INPUT_DIR | No | Directory to resolve bare input-image filenames against (so images: ["foo.jpg"] works) |
| Tool | Description |
|---|---|
gemini_list_models | List available Gemini image models and the current default |
gemini_generate_image | Generate image(s) from a text prompt |
gemini_edit_image | Edit or compose one or more input images with a text instruction |
gemini_generate_set | Generate a master image plus N consistent images referencing it |
gemini_interact | Multi-turn generation/editing via the Interactions API (Beta); returns an interaction_id to refine the same image conversationally |
{
"mcpServers": {
"gemini": {
"command": "npx",
"args": ["-y", "@chrischall/gemini-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
See SKILL.md for full usage documentation.
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent