This is purpose-built for scraping documentation portals that normal scrapers choke on: JavaScript-heavy sites, login-gated content, paginated help centers, or anything with complex navigation. It uses Firecrawl's browser automation to actually navigate the site like a human would, following sidebars and pagination controls while extracting clean markdown or structured JSON. The workflow interviews you briefly for the portal URL and auth requirements, then crawls methodically while preserving code blocks and tables. Honest take: if you're trying to ingest Notion docs, enterprise knowledge bases, or modern SaaS documentation into a RAG system, this beats fighting with Puppeteer yourself. Outputs include metadata tracking, failed page reports, and a clean JSON shape ready for vector storage.
npx skills add https://github.com/firecrawl/firecrawl-workflows --skill firecrawl-knowledge-ingest