#

wayback-machine

Internet Archive is a website for a digital collection run by the archive.org group, also responsible for the Wayback Machine software.

Here are 325 public repositories matching this topic...

ArchiveBox

ArchiveBox / ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Updated Jun 15, 2026
Python

lc / gau

Fetch known URLs from AlienVault's Open Threat Exchange, the Wayback Machine, and Common Crawl.

security wayback-machine hacktoberfest alienvault gau

Updated Mar 20, 2026
Go

wabarc / wayback

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

Updated Jun 13, 2026
Go

web-archives

dessant / web-archives

Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari

chrome-extension safari-extension google yandex cache archive firefox-extension browser-extension wayback-machine

Updated Jun 15, 2026
JavaScript

ayoubfathi / leaky-paths

A collection of special paths linked to common sensitive APIs, devops internals, frameworks conf, known misconfigurations, juicy APIs ..etc. It could be used as a part of web content discovery, to scan passively for high-quality endpoints and quick-wins.

Updated Apr 3, 2026

webrecorder / replayweb.page

Serverless replay of web archives directly in the browser

service-worker warc web-archiving wayback-machine web-archive replay-web-page web-replay wacz

Updated Jun 12, 2026
TypeScript

waybackpy

akamhy / waybackpy

Wayback Machine API interface & a command-line tool

osint internet-archive web-archiving wayback-machine webarchiving cdx-api internet-archiving savepagenow archive-webpage archive-webpages wayback-machine-api wayback-machine-python

Updated Feb 26, 2024
Python

sangaline / wayback-machine-scraper

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

python web-scraping command-line-tool wayback-machine wayback-archiver archive-dot-org

Updated Feb 23, 2024
Python

StrawberryMaster / wayback-machine-downloader

Download an entire website from the Wayback Machine.

ruby scraper osint internet-archive wayback-machine wayback webarchive archive-org waybackmachine osint-tool wayback-machine-downloader osint-tools wayback-downloader archive-downloader

Updated Jun 11, 2026
Ruby

powerexploit / Ashok

Ashok is a OSINT Recon Tool , a.k.a 😍 Swiss Army knife .

github dns osint penetration-testing wayback-machine hacking-tool reconnaissance googledork subnet-lookup geoip-lookup http-headers banner-grabbing cmsdetecter recon-tools linkextractor subdomain-finder nmap-scanning githubrecon

Updated Jan 25, 2022
Python

travisbrown / cancel-culture

Tools for fighting abuse on Twitter

twitter-api wayback-machine

Updated Dec 17, 2025
Rust

oldweb-today

oldweb-today / oldweb-today

Browse emulated browsers connected to old web sites in your browser!

emulator web emulation wayback-machine web-archives webrecorder oldweb-today

Updated Oct 31, 2024
JavaScript

wimpysworld / ia-get

File downloader for archive.org ⬇️

rust downloader internet-archive wayback-machine hacktoberfest download-manager

Updated Jun 15, 2026
Rust

dwisiswant0 / gf-secrets

Secret and/or credential patterns used for gf.

crawler infosec bugbounty wayback-machine wayback alienvault-otx gf trufflehog gitleaks secrets-detection waybackurl gau open-threat-exchange trufflehog3

Updated Feb 10, 2023
Shell

bellingcat / wayback-google-analytics

A lightweight tool for scraping current and historic Google Analytics data

python scraper command-line google-analytics wayback-machine open-source-research

Updated Aug 21, 2024
Python

mhmdiaa / chronos

Wayback Machine OSINT Framework

security mapping wordlist penetration-testing infosec pentesting recon wayback-machine wordlist-generator security-tools web-application-security reconnaissance wordlists penetration-testing-tools

Updated Jul 28, 2024
Go

claromes / waybacktweets

Archived tweets from the Wayback Machine

twitter tweets internet-archive wayback-machine x research-tools

Updated May 26, 2025
Python

twayback

humandecoded / twayback

Automate downloading archived deleted Tweets.

python downloader osint twitter help-wanted wayback-machine wayback osint-resources osinttool deleted-tweets pythontools osint-python needs-maintainer waybackmachine osint-tool pythontool waybackurl osint-tools deletedtweets

Updated Jul 7, 2023
Python

urx

hahwul / urx

Extracts URLs from OSINT Archives for Security Insights

url security osint wayback-machine urx osint-tool

Updated Jun 12, 2026
Rust

karust / gogetcrawl

Extract web archive data using Wayback Machine and Common Crawl

golang crawler concurrency wayback-machine webarchive commoncrawl

Updated Nov 4, 2024
Go

Followers: 29 followers
Website: github.com/topics/internet-archive