A full browser automation CLI that AI agents can drive instead of relying on basic fetch tools. It handles the stuff that breaks simple HTTP requests: JavaScript rendering, login sessions, captchas, form workflows, and multi-tab operations. What's notable is the safety model. It won't create browsers or submit forms without explicit user confirmation, and it keeps all session data local except for captcha images when you opt into solver assistance. The skill wrapper enforces loading usage docs before every command, which sounds annoying until you hit the edge cases around browser selection and session isolation. Good fit if you're tired of agents failing silently on modern web apps.
npx -y skills add browser-act/skills --skill browser-act --agent claude-code
Installs into .claude/skills of the current project.
Browser automation CLI for AI agents. Runs a full browser engine: navigation & interaction, data extraction & network capture, screenshots, form automation, multi-browser parallel operation, user-configured proxy support, and human-agent collaboration.
Install the CLI: uv tool install browser-act-cli --python 3.12
Before running any browser-act command, load the usage guide from the CLI:
browser-act get-skills core --skill-version 2.0.2 # start here — workflows, common patterns, troubleshooting
Do NOT skip this step regardless of how simple the command seems.
Do NOT truncate the output — it contains operational directives and environment state that are critical for correct operation. Truncating will cause you to miss browser selection rules and safety constraints.
get-skills core provides environment status, available browsers, operational
directives, and the complete interaction workflow — none of which are available
through --help.
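The "load the guide first, never truncate it" rule can be wrapped in a small shell helper. This is a minimal sketch, not part of browser-act itself: the function name load_browser_act_guide is hypothetical, the default version is the 2.0.2 quoted above, and the 127 return code simply follows the usual shell convention for a missing command.

```shell
# Hypothetical helper (not shipped with browser-act): print the full usage
# guide before any other browser-act command is run.
load_browser_act_guide() {
  # Skill version to request; defaults to the one quoted in this document.
  local version="${1:-2.0.2}"

  # Fail early with an install hint if the CLI is not on PATH.
  if ! command -v browser-act >/dev/null 2>&1; then
    echo "browser-act not found; install with: uv tool install browser-act-cli --python 3.12" >&2
    return 127
  fi

  # Deliberately no head, tail, or other filters here: the guide is meant
  # to be read in full.
  browser-act get-skills core --skill-version "$version"
}
```

Calling load_browser_act_guide with no arguments requests version 2.0.2; passing a different version string as the first argument overrides it. Keeping the output unfiltered is the point of the wrapper, so any logging should copy the output rather than cut it.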