RepoGuard
Updated 1 h ago
ArchiveBox

Organization

Public GitHub footprint of ArchiveBox

@ArchiveBox
View profile on GitHub

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

23

Public repositories

29,199

Total stars

489

Followers

ArchiveBox is an organization on GitHub that serves as a self-hosted internet archiving solution, maintained by @pirate. It features a wide range of repositories primarily developed in Python, JavaScript, TypeScript, and other languages, with notable projects like ArchiveBox and archivebox-browser-extension that focus on web archiving and preserving browsing history.

Top languages

Python 7JavaScript 3TypeScript 2HTML 2CSS 1Shell 1Rust 1

Public repositories

ArchiveBox

27,690

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
Updated Jun 13, 2026

archivebox-browser-extension

453

🖥️ Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

TypeScript
Updated Jun 12, 2026

good-karma-kit

394

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

Unknown Language
Updated Jun 9, 2026

electron-archivebox

184

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

JavaScript
Updated May 30, 2026

abx-dl

124

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

Python
Updated Jun 13, 2026

docker-archivebox

58

Home of the official docker image for ArchiveBox

Unknown Language
Updated Jun 3, 2026

readability-extractor

43

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

JavaScript
Updated Apr 11, 2026

pocket-exporter

34

[FREE] A service to help export your pocket bookmarks, tags, saved article text, and more...

TypeScript
Updated Jun 3, 2026

archivebox-proxy

32

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Python
Updated Jun 2, 2026

homebrew-archivebox

28

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Python
Updated Jun 12, 2026

abxpkg

28

📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

Python
Updated Jun 11, 2026

DigestBox

21

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

HTML
Updated Jun 3, 2026

abx-spec-behaviors

20

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

JavaScript
Updated Jun 3, 2026

docs

17

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

CSS
Updated May 24, 2026

debian-archivebox

16

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Shell
Updated Jun 9, 2026

internet-archiving-talk

15

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Unknown Language
Updated Mar 19, 2025

pip-archivebox

12

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

Unknown Language
Updated Dec 9, 2025

abxbus

11

📢 Fast multi-language Event Bus library for Python/TS/Golang/Rust with support for advanced concurrency control features, nested event tracking, type enforcement, bridges to other backends, and more...

Rust
Updated Jun 10, 2026

abx-plugins

8

🧩 Plugins and extractors that ArchiveBox + abx-dl use: chrome, ytdlp, wget, singlefile, readability, forum-dl, gallery-dl, papers-dl, and more...

Python
Updated Jun 11, 2026

community

6

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

Unknown Language
Updated Nov 10, 2025

githubusers

3

GitHub user contribution dashboards · serves precomputed stats at githubusers.archivebox.io/<login>

HTML
Updated Jun 10, 2026

squasher-browser-extension

1

Extension to collect all open browser tabs for a given domain into a new window (with suspender support).

Unknown Language
Updated Jun 3, 2026

monorepo

1

🛠️ Development-only monorepo config for archivebox + abx-dl + abx-plugins + abx-pkg + abxbus.

Python
Updated Jun 1, 2026

Frequently asked questions

What does ArchiveBox build on GitHub?

ArchiveBox builds a variety of tools focused on web archiving. Key repositories include ArchiveBox, which enables users to save and manage web content, and archivebox-browser-extension, which helps preserve browsing history.

Which programming languages does ArchiveBox use?

ArchiveBox primarily uses Python, JavaScript, TypeScript, HTML, CSS, and Shell for its development. This diverse set of languages supports various functionalities across its repositories.

Are ArchiveBox's repositories public?

Yes, all of ArchiveBox's repositories are public on GitHub. This transparency allows users and contributors to access, use, and contribute to the projects focused on internet archiving.

Is this exposure intended?

Monitor ArchiveBox with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account