🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
The Internet Archive (archive.org) is a nonprofit digital library that also operates the Wayback Machine.
Fetch known URLs from AlienVault's Open Threat Exchange, the Wayback Machine, and Common Crawl.
An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.
Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari
A collection of special paths linked to common sensitive APIs, DevOps internals, framework configs, known misconfigurations, juicy APIs, etc. It can be used as part of web content discovery to scan passively for high-quality endpoints and quick wins.
Serverless replay of web archives directly in the browser
Wayback Machine API interface & a command-line tool
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Download an entire website from the Wayback Machine.
Ashok is an OSINT recon tool, a.k.a. a 😍 Swiss Army knife.
Browse emulated browsers connected to old web sites in your browser!
File downloader for archive.org ⬇️
Secret and/or credential patterns used for gf.
A lightweight tool for scraping current and historic Google Analytics data
Wayback Machine OSINT Framework
Archived tweets from the Wayback Machine
Automate downloading archived deleted Tweets.
Extracts URLs from OSINT Archives for Security Insights
Extract web archive data using Wayback Machine and Common Crawl