Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: internet-archiving

ArchiveBox/docs

Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.

Language: CSS - Size: 6.94 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 11 - Forks: 3

akamhy/waybackpy

Wayback Machine API interface & a command-line tool

Language: Python - Size: 575 KB - Last synced: 3 days ago - Pushed: 3 months ago - Stars: 435 - Forks: 33

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Language: Python - Size: 7.73 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 19,808 - Forks: 1,077

Own-Data-Privateer/pwebarc

A suite of tools for mirroring and hoarding web pages you visit for later offline viewing: your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data. It follows the "archive everything now, figure out what to do with it later" philosophy.

Language: Python - Size: 637 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 22 - Forks: 0

ArchiveBox/archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Language: Python - Size: 8.79 KB - Last synced: 10 days ago - Pushed: 4 months ago - Stars: 7 - Forks: 0

mikwielgus/forum-dl

Scrape posts and threads from forums, news aggregators, and mail archives; export to JSONL, mailbox, or WARC

Language: Python - Size: 391 KB - Last synced: 8 days ago - Pushed: 8 months ago - Stars: 60 - Forks: 1

ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

Language: TypeScript - Size: 114 KB - Last synced: 10 days ago - Pushed: about 1 month ago - Stars: 159 - Forks: 13

pirate/wikipedia-mirror

🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump

Language: Shell - Size: 10.5 MB - Last synced: 9 days ago - Pushed: about 3 years ago - Stars: 330 - Forks: 27

ArchiveBox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

Size: 64.5 KB - Last synced: 10 days ago - Pushed: 12 months ago - Stars: 295 - Forks: 8

ArchiveBox/pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

Size: 15.4 MB - Last synced: 10 days ago - Pushed: 17 days ago - Stars: 13 - Forks: 2

ArchiveBox/docker-archivebox

Home of the official docker image for ArchiveBox

Language: Dockerfile - Size: 70.3 KB - Last synced: 10 days ago - Pushed: 3 months ago - Stars: 41 - Forks: 12

ArchiveBox/electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

Language: JavaScript - Size: 156 KB - Last synced: 10 days ago - Pushed: about 1 year ago - Stars: 173 - Forks: 15

pirate/internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Language: JavaScript - Size: 27.6 MB - Last synced: 9 days ago - Pushed: over 3 years ago - Stars: 47 - Forks: 5

ArchiveBox/readability-extractor

JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

Language: JavaScript - Size: 93.8 KB - Last synced: 10 days ago - Pushed: about 1 month ago - Stars: 32 - Forks: 13

ArchiveBox/DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

Language: HTML - Size: 1.75 MB - Last synced: 10 days ago - Pushed: 3 months ago - Stars: 11 - Forks: 0

ArchiveBox/debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Language: Python - Size: 3.34 MB - Last synced: 10 days ago - Pushed: about 1 month ago - Stars: 17 - Forks: 5

ArchiveBox/homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Language: Ruby - Size: 61.8 MB - Last synced: 10 days ago - Pushed: 3 months ago - Stars: 24 - Forks: 3

itsliamdowd/WaybackBrowserMacOS

Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻

Language: Swift - Size: 32.2 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 8 - Forks: 1

vegetableman/vandal

Navigator for Web Archive

Language: JavaScript - Size: 128 MB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 149 - Forks: 5

itsliamdowd/WaybackBrowserWindows

Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻

Language: Python - Size: 106 KB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 3 - Forks: 0

Fooftilly/RSS_archiver

Download and archive RSS feeds to the Wayback Machine. Save a list of archived feeds in a local DB.

Language: Python - Size: 30.3 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

Quoorex/archive-file-urls

Submit URLs listed inside a file to website archival services

Language: Python - Size: 17.6 KB - Last synced: 28 days ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

gabldotink/sharkive.old 📦

Upload stuff to the Internet Archive using a shell script

Language: Shell - Size: 104 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

httpreserve/conventoarchiver

Repository for collecting scripts to help capture MyConvento newsroom press releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.

Language: Python - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0