This extracts everything you need from a webpage for migration work: downloads all images locally (converting WebP, AVIF, and SVG to PNG along the way), pulls metadata like Open Graph and JSON-LD, and gives you cleaned HTML with images already mapped to local paths. It runs headless Chromium through Playwright, scrolls the page to catch lazy-loaded images, and fixes common DOM issues like background images and picture elements. You get a metadata.json with sanitized document paths, a full-page screenshot for reference, and all assets ready to go. Designed as the first step in the page-import workflow, feeding into structure identification and HTML generation. Honest take: solid automation for the tedious parts of content migration, though you'll still need to verify the output.
npx skills add https://github.com/adobe/skills --skill scrape-webpage