🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
-
Updated
Apr 7, 2021 - Shell
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
Submit URLs listed inside a file to website archival services
Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
upload stuff to the Internet Archive using a shell script
Download and archive RSS feeds to Wayback Machine. Save a list of archived feed in locad db.
Navigator for Web Archive
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Home of the official docker image for ArchiveBox
Wayback Machine API interface & a command-line tool
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Home of the official apt/deb package for Ubuntu/Debian-based systems.
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
Add a description, image, and links to the internet-archiving topic page so that developers can more easily learn about it.
To associate your repository with the internet-archiving topic, visit your repo's landing page and select "manage topics."