Gives Claude direct control over your desktop apps through Orca's accessibility API. You can list running apps, read their UI state, click buttons, fill fields, press keys, and scroll, all through a CLI that returns JSON plus screenshots. Designed for the snapshot-act-snapshot loop: grab the current state, pick an element by index, perform one action, then check what happened before deciding the next move. Works best with native apps that expose rich accessibility trees. Electron and web-view-heavy apps often need the screenshot more than the tree. The action model prefers semantic operations like set-value over raw typing, which matters because keyboard input can fail silently when focus isn't where you think it is. Refreshing state between actions isn't optional since element indexes go stale after almost any UI change.
npx skills add https://github.com/stablyai/orca --skill computer-use