An open API service providing repository metadata for many open source software ecosystems.

GitHub / ArchiveBox 16 Repositories

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Donate: https://github.com/sponsors/ArchiveBox

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Language: Python - Size: 10.9 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 23,786 - Forks: 1,260

ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

Language: JavaScript - Size: 935 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 311 - Forks: 30

ArchiveBox/abx-spec-behaviors

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

Language: JavaScript - Size: 785 KB - Last synced at: about 4 hours ago - Pushed at: 2 months ago - Stars: 18 - Forks: 0

ArchiveBox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

Size: 86.9 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 362 - Forks: 12

ArchiveBox/abx-pkg

📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

Language: Python - Size: 563 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 0

ArchiveBox/abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

Language: JavaScript - Size: 177 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 73 - Forks: 4

ArchiveBox/docs

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

Language: CSS - Size: 7.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 5

ArchiveBox/docker-archivebox

Home of the official docker image for ArchiveBox

Size: 93.8 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 48 - Forks: 12

ArchiveBox/readability-extractor

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

Language: JavaScript - Size: 93.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 13

ArchiveBox/electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

Language: JavaScript - Size: 156 KB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 174 - Forks: 15

ArchiveBox/squasher-browser-extension Fork of pirate/squasher-browser-extension

Extension to collect all open browser tabs for a given domain into a new window (with suspender support).

Size: 89.8 KB - Last synced at: about 4 hours ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ArchiveBox/archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Language: Python - Size: 12.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 10 - Forks: 0

ArchiveBox/pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

Size: 15.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 2

ArchiveBox/DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

Language: HTML - Size: 1.75 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

ArchiveBox/community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

ArchiveBox/internet-archiving-talk Fork of pirate/internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Size: 27.6 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 1

ArchiveBox/debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Language: Python - Size: 3.34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 5

ArchiveBox/homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Language: Ruby - Size: 61.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 3

ArchiveBox/archivebox-plugin

Template of an archivebox extractor, fork this to contribute a new extractor!

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0