zshot/cliDownload

--crawl-url-is-sitemap

ValueFORMAT
LicensePRO

Treat the target URL as a manifest of URLs and capture each one into a single WARC. FORMAT is auto, xml, or text, and defaults to auto: the format is inferred from the target’s extension, then its Content-Type. Plain-text URL lists require an explicit text. A target that already resolves to a sitemap under WARC output is treated as a manifest without this flag. Follow nested <sitemapindex> levels with --crawl-sitemap-max-depth; filter the listed URLs with --crawl-deny-url and --crawl-allow-url.

Example

zshot -t warc -f site.warc.gz --crawl-url-is-sitemap xml https://example.com/sitemap.xml