Pulls text out of scanned PDFs and images so you don't have to retype everything manually. Works on scanned documents, image-based PDFs, and has limited handwriting recognition. Supports batch processing if you've got a stack of files to convert. Part of the claude-office-skills collection that's picked up 188 stars on GitHub and passed security audits from Gen Agent Trust Hub, Socket, and Snyk. The straightforward use case is digitizing paper docs or making unsearchable PDFs actually searchable. Not much detail in the docs about accuracy rates or language support, but it's a solid utility skill for the common problem of locked-up text in images.
npx -y skills add claude-office-skills/skills --skill "PDF OCR Extraction" --agent claude-codeInstalls into .claude/skills of the current project.
Select a file.
larksuite/cli
googleworkspace/cli
googleworkspace/cli
googleworkspace/cli
googleworkspace/cli