The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres
Public repositories: 23
Total stars: 29,199
Followers: 489
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
🖥️ Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
Home of the official docker image for ArchiveBox
JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.
[FREE] A service to help export your Pocket bookmarks, tags, saved article text, and more...
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
📢 Fast multi-language Event Bus library for Python/TS/Golang/Rust with support for advanced concurrency control features, nested event tracking, type enforcement, bridges to other backends, and more...
🧩 Plugins and extractors that ArchiveBox + abx-dl use: chrome, ytdlp, wget, singlefile, readability, forum-dl, gallery-dl, papers-dl, and more...
A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.
GitHub user contribution dashboards · serves precomputed stats at githubusers.archivebox.io/<login>
Extension to collect all open browser tabs for a given domain into a new window (with suspender support).
🛠️ Development-only monorepo config for archivebox + abx-dl + abx-plugins + abx-pkg + abxbus.