This is a quadruple-engine OCR tool that pulls text from scanned PDFs and images. You get RapidOCR and PaddleOCR for local processing, RapidDoc for enhanced layout analysis with table and formula recognition, and SiliconFlow's API when you need cloud-based accuracy. The local engines run offline and free, while SiliconFlow taps into DeepSeek's OCR model for tougher jobs. It handles Chinese and English, converts PDF pages to images automatically, and falls back intelligently when one engine struggles. Good for processing contracts, scanned books, or batches of documents where you want flexibility between speed and precision without rewriting code.
npx skills add https://github.com/yejinlei/pdf-ocr-skill --skill pdf-ocr