A comprehensive desktop automation wrapper around PyAutoGUI that gives Claude control over mouse, keyboard, screenshots, and image recognition through a unified CLI interface. You get pixel-perfect color detection, region-based screenshot capture, and the ability to wait for specific images to appear or disappear on screen. The structure is clean with separate modules for mouse, keyboard, screen, and dialog operations, all returning JSON for easy parsing. Built-in failsafe triggers if you slam the mouse to the top-left corner, which is handy when things go sideways. Best for automating repetitive UI tasks, testing desktop apps, or building workflows that need to interact with applications without APIs. Supports Windows-specific features like active window detection if you have pywin32 installed.
npx skills add https://github.com/steelan9199/wechat-publisher --skill pyautogui-automation