The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres
23
Public repositories
29,199
Total stars
489
Followers
ArchiveBox is an organization on GitHub that serves as a self-hosted internet archiving solution, maintained by @pirate. It features a wide range of repositories primarily developed in Python, JavaScript, TypeScript, and other languages, with notable projects like ArchiveBox and archivebox-browser-extension that focus on web archiving and preserving browsing history.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
🖥️ Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
Home of the official docker image for ArchiveBox
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
[FREE] A service to help export your pocket bookmarks, tags, saved article text, and more...
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
📢 Fast multi-language Event Bus library for Python/TS/Golang/Rust with support for advanced concurrency control features, nested event tracking, type enforcement, bridges to other backends, and more...
🧩 Plugins and extractors that ArchiveBox + abx-dl use: chrome, ytdlp, wget, singlefile, readability, forum-dl, gallery-dl, papers-dl, and more...
A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.
GitHub user contribution dashboards · serves precomputed stats at githubusers.archivebox.io/<login>
Extension to collect all open browser tabs for a given domain into a new window (with suspender support).
🛠️ Development-only monorepo config for archivebox + abx-dl + abx-plugins + abx-pkg + abxbus.
ArchiveBox builds a variety of tools focused on web archiving. Key repositories include ArchiveBox, which enables users to save and manage web content, and archivebox-browser-extension, which helps preserve browsing history.
ArchiveBox primarily uses Python, JavaScript, TypeScript, HTML, CSS, and Shell for its development. This diverse set of languages supports various functionalities across its repositories.
Yes, all of ArchiveBox's repositories are public on GitHub. This transparency allows users and contributors to access, use, and contribute to the projects focused on internet archiving.
Monitor ArchiveBox with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account