Gives Claude control over your mouse, keyboard, and screen through PyAutoGUI. You can automate clicks, type text, execute hotkeys, take screenshots, and even use OCR to locate text on screen. Works across Windows, macOS, and Linux. The app control commands let you open and focus applications by name. Most useful for repetitive GUI tasks like form filling, testing interfaces, or scraping data from applications without APIs. The image matching feature is handy but needs exact visual matches, so adjust confidence levels if things don't line up. Remember PyAutoGUI's fail-safe: mouse to corner kills everything.
npx skills add https://github.com/patrickporto/desktop-agent --skill desktop-control