Browsertrix12 February 2026·4 minsai-manual Web external Web Crawler Headless Warc Web ArchivingA cloud-native, headless browser-based web crawler that creates high-fidelity WARC archives, capturing JavaScript-rendered content that traditional crawlers miss.
datalad-crawler12 February 2026·3 minsai-partial Code Artifacts native-datalad Web CON Datalad Crawler Web Pipeline ArchivalA DataLad extension that provides pipeline-based web crawling for systematically tracking and archiving web resources into DataLad datasets.