This covers all the PDF operations you'd expect: reading and extracting text with pdfplumber, merging and splitting with pypdf, creating new documents with reportlab, and OCR for scanned files. The guide is practical, with working code snippets for common tasks like extracting tables to Excel, adding watermarks, and password protection. It includes both Python libraries and command-line tools like qpdf and pdftotext. One thing worth noting: there's a specific warning about not using Unicode subscript characters in reportlab because they render as black boxes, use the XML markup tags instead. The form-filling functionality references a separate forms.md file not included here.
npx skills add https://github.com/k-dense-ai/scientific-agent-skills --skill pdf