Gives Claude surgical access to PDFs through eight specialized tools built on PyMuPDF. The workflow is efficient: call pdf_info to get page count and TOC, use pdf_search for hybrid BM25+semantic search with paragraph excerpts, then pdf_read_pages for targeted reads or pdf_render_pages to feed diagrams to vision models. Includes OCR via Tesseract for scanned documents, structured table and image extraction, and SQLite caching that persists text, embeddings, and rendered pages across server restarts. Reach for this when you need to interrogate large documents without dumping entire PDFs into context. HTTPS-only URL fetching with SSRF protection blocks local network access.
claude mcp add --transport stdio jztan-pdf-mcp uvx pdf-mcp