Gemini Computer Use

1.2k installs937 stars

Summary

This is a Playwright-based agent loop that wires up Gemini 2.5's Computer Use model to actually drive a browser. You give it a goal and a starting URL, it takes screenshots, asks the model what to do next, executes the actions, and repeats until the task is done or you hit the turn limit. The standout piece is the safety confirmation flow: risky actions get flagged and require human approval before execution. It defaults to Chromium but you can point it at Chrome, Edge, or Brave. The operational advice is solid: run this in a sandbox, use the exclude flag to block actions you don't trust, and keep the viewport at 1440x900. It's a clean reference implementation if you want to build browser automation on top of Gemini's multimodal function calling.

Install to Claude Code

npx -y skills add am-will/codex-skills --skill gemini-computer-use --agent claude-code

Installs into .claude/skills of the current project.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Files

SKILL.md

Select a file.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

CodeScene MCP Server

Your agent targets a perfect 10 Code Health score. Deterministic. Every commit.

Try For Free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

View on GitHub

Gemini Computer Use

Install to Claude Code

Gemini Computer Use

Install to Claude Code

Recommended

Recommended