An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-archiving

aidatorajiro/misc

mysterious box of various codes

Language: Ruby - Size: 1.63 MB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

webrecorder/browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

Language: TypeScript - Size: 15.1 MB - Last synced at: about 16 hours ago - Pushed at: about 16 hours ago - Stars: 271 - Forks: 50

ArchiveBox/ArchiveBox

πŸ—ƒ Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Language: Python - Size: 10.9 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 23,832 - Forks: 1,258

webrecorder/replayweb.page

Serverless replay of web archives directly in the browser

Language: TypeScript - Size: 87.5 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 790 - Forks: 69

webrecorder/browsertrix-crawler

Run a high-fidelity browser-based web archiving crawler in a single Docker container

Language: TypeScript - Size: 52.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 775 - Forks: 101

webrecorder/archiveweb.page

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

Language: TypeScript - Size: 52.9 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 997 - Forks: 70

harvard-lil/perma

Indelible links

Language: JavaScript - Size: 67.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 468 - Forks: 76

programminghistorian/ph-submissions

The repository and website hosting the peer review process for new Programming Historian lessons

Language: HTML - Size: 1.22 GB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 144 - Forks: 115

Ray-D-Song/web-archive

Free web archiving and sharing service based on Cloudflare. θ·‘εœ¨ Cloudflare δΈŠηš„ε…θ΄Ήη½‘ι‘΅ε½’ζ‘£ε’Œεˆ†δΊ«ε·₯具。

Language: TypeScript - Size: 10.3 MB - Last synced at: 4 days ago - Pushed at: 14 days ago - Stars: 824 - Forks: 289

Rhizome-Conifer/conifer

Collect and revisit web pages.

Language: Python - Size: 25.7 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,501 - Forks: 122

helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Language: Scala - Size: 1.22 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 150 - Forks: 19

webrecorder/pywb

Core Python Web Archiving Toolkit for replay and recording of web archives

Language: JavaScript - Size: 31.7 MB - Last synced at: 8 days ago - Pushed at: 12 days ago - Stars: 1,499 - Forks: 228

bellingcat/auto-archiver-api

API to manage users/sheets/URLs and call the auto-archiver in dedicated workers.

Language: Python - Size: 1.16 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 1

webrecorder/warcio

Streaming WARC/ARC library for fast web archive IO

Language: Python - Size: 293 KB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 412 - Forks: 62

ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

Language: JavaScript - Size: 935 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 311 - Forks: 30

oduwsdl/ipwb

InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS

Language: Python - Size: 6.26 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 632 - Forks: 41

machawk1/wail

:whale2: Web Archiving Integration Layer: One-Click User Instigated Preservation

Language: Roff - Size: 1.04 GB - Last synced at: about 1 hour ago - Pushed at: 2 months ago - Stars: 374 - Forks: 37

nla/pandas4

Web archive workflow system

Language: Java - Size: 2.39 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 3 - Forks: 2

nla/nla-pywb

pywb config overlay for the Australian Web Archive

Language: HTML - Size: 367 KB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 2 - Forks: 0

bellingcat/auto-archiver

Automatically archive links to videos, images, and social media content from Google Sheets (and more).

Language: Python - Size: 12.1 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 691 - Forks: 75

Florents-Tselai/WarcDB

WarcDB: Web crawl data as SQLite databases.

Language: Python - Size: 51.7 MB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 398 - Forks: 11

project-polymorph/platform-home

homepage and platform for chinese trans digital archive

Language: TypeScript - Size: 75.9 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 3 - Forks: 2

internetarchive/fatcat

Perpetual Access To The Scholarly Record

Language: Python - Size: 8.44 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 120 - Forks: 18

maxcountryman/warc-parquet

πŸ—„οΈ A simple CLI for converting WARC to Parquet.

Language: Rust - Size: 106 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 110 - Forks: 0

N0taN3rd/node-warc

Parse And Create Web ARChive (WARC) files with node.js

Language: JavaScript - Size: 7.99 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 98 - Forks: 22

Own-Data-Privateer/hoardy-web

Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.

Language: Python - Size: 2.65 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 76 - Forks: 7

gildas-lormeau/single-file-cli

CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

Language: JavaScript - Size: 5.16 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 769 - Forks: 76

gildas-lormeau/mhtml-to-html

Convert MHTML to HTML

Language: JavaScript - Size: 461 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 32 - Forks: 3

akamhy/waybackpy

Wayback Machine API interface & a command-line tool

Language: Python - Size: 575 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 519 - Forks: 35

ArchiveBox/docs

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

Language: CSS - Size: 7.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 5

oduwsdl/archivenow

A Tool To Push Web Resources Into Web Archives

Language: Python - Size: 20.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 420 - Forks: 41

machawk1/warcreate

Chrome extension to "Create WARC files from any webpage"

Language: JavaScript - Size: 2.23 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 220 - Forks: 14

rahiel/archiveror

Archiveror will help you preserve the webpages you love. πŸ’Ύ

Language: JavaScript - Size: 168 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 443 - Forks: 43

oduwsdl/MemGator

A Memento Aggregator CLI and Server in Go

Language: Go - Size: 15 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 62 - Forks: 11

anjackson/sliver

A tool for collection archival slivers of the web and web archives

Language: Python - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 13 - Forks: 1

webrecorder/webrecorder-player πŸ“¦

Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)

Language: JavaScript - Size: 6 MB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 446 - Forks: 42

cocrawler/cdx_toolkit

A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine

Language: Python - Size: 209 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 169 - Forks: 31

caltechlibrary/eprints2archives

Send records from an EPrints server to the Internet Archive and other web archives

Language: Python - Size: 504 KB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

Oliverwebdev/WebArchiver

A powerful desktop application to download, archive, and manage web pages locally with full resource support, built with Python and PyQt6.

Language: Python - Size: 159 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

PKHarsimran/website-downloader

Website-downloader is a powerful and versatile Python script designed to download entire websites along with all their assets. This tool allows you to create a local copy of a website, including HTML pages, images, CSS, JavaScript files, and other resources. It is ideal for web archiving, offline browsing, and web development.

Language: Python - Size: 46.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 29 - Forks: 5

nla/bamboo

Web archive collection manager

Language: Java - Size: 2.61 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 8 - Forks: 4

TarekJor/bookmark-archiver Fork of ArchiveBox/ArchiveBox

πŸ—„ Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

Language: Python - Size: 2.65 MB - Last synced at: about 2 months ago - Pushed at: almost 7 years ago - Stars: 36 - Forks: 2

project-polymorph/news-website

δΈ­ζ–‡θ·¨ζ€§εˆ«η›Έε…³ζ–°ι—»ε­˜ζ‘£η«™η‚Ή

Size: 306 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 3

N0taN3rd/wail Fork of machawk1/wail

:whale2: One-Click User Instigated Preservation

Language: JavaScript - Size: 421 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 126 - Forks: 9

q-m/replayweb.page-docker

Docker image for ReplayWeb.page

Language: Dockerfile - Size: 2.93 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

cvyl/cf-static-archive-worker

A serverless website archiving solution built with Cloudflare Workers. This tool crawls and archives static websites, storing all assets (HTML, CSS, JS, images, etc.) in Cloudflare R2 storage.

Language: TypeScript - Size: 32.2 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

oduwsdl/oduwsdl.github.io

ODU Web Science and Digital Libraries Research Group (WS-DL) home page.

Language: HTML - Size: 47.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 37

xarantolus/Collect

A server to collect & archive websites that also supports video downloads

Language: TypeScript - Size: 2.07 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 86 - Forks: 12

oduwsdl/warrick

Recover lost websites from the Web Infrastructure

Language: HTML - Size: 2.66 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 88 - Forks: 10

webrecorder/cdxj-indexer

CDXJ Indexing of WARC/ARCs

Language: Python - Size: 78.1 KB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 25 - Forks: 13

zytedata/web-snap

Create "perfect" snapshots of web pages

Language: JavaScript - Size: 790 KB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 32 - Forks: 4

webrecorder/dat-share

A prototype server to swarm multiple DATs for Webrecorder

Language: JavaScript - Size: 238 KB - Last synced at: 3 days ago - Pushed at: about 6 years ago - Stars: 14 - Forks: 4

nla/outbackcdx

Web archive index server based on RocksDB

Language: Java - Size: 861 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 34 - Forks: 20

mrrfv/webArchive

Crawls websites and saves found URLs to a file.

Language: JavaScript - Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

gwu-libraries/sfm-ui

Social Feed Manager user interface application.

Language: Python - Size: 44.6 MB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 155 - Forks: 25

pirate/internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Language: JavaScript - Size: 27.6 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 52 - Forks: 5

rybesh/capture-urls

Archive a list of URLs using the Wayback Machine

Language: Python - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

knot126/WebWar

Really hacky proof of concept http archival using mitmproxy

Language: Python - Size: 10.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

internetarchive/scrapy-warcio

Support for writing WARC files with Scrapy

Language: Python - Size: 31.3 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 21 - Forks: 6

dbeley/archiveboxmatic

ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.

Language: Python - Size: 57.6 KB - Last synced at: 15 days ago - Pushed at: about 4 years ago - Stars: 14 - Forks: 3

httpreserve/linkstat

CLI implementation of httpreserve that can test links and retrieve internet archive replacements

Language: Go - Size: 47.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 10 - Forks: 0

pakelcomedy/SiteMirror

Python tool for advanced web scraping and site mirroring. It downloads entire websites, including HTML, CSS, JS, images, and other assets, while preserving site structure and updating links for offline use. Ideal for developers needing detailed and customizable website backups.

Language: Python - Size: 12.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

nla/pywb Fork of webrecorder/pywb

Core Python Web Archiving Toolkit for replay and recording of web archives

Language: JavaScript - Size: 22.7 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 0

nla/httrack2warc

Converts HTTrack crawls to WARC files

Language: Java - Size: 158 KB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 30 - Forks: 6

meequrox/flb-archiver

Flareboard web archiver in C using libcurl

Language: C - Size: 107 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nla/heritrix3 Fork of internetarchive/heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Language: Java - Size: 10.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

webis-de/scriptor

Plug-and-play reproducible web analysis.

Language: JavaScript - Size: 1.61 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 5 - Forks: 2

ArchiveBox/electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

Language: JavaScript - Size: 156 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 174 - Forks: 15

nla/chronicrawl πŸ“¦

Experimental continouous web crawler for web archiving

Language: Java - Size: 329 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 0

TarekJor/PixivUtil2 Fork of Nandaka/PixivUtil2

Download images from Pixiv and more!

Language: Python - Size: 11.4 MB - Last synced at: 10 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

N0taN3rd/memgatorBulkDownload

Language: Python - Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

oduwsdl/offtopic-goldstandard-data

Data for testing the Offtopic detection software

Language: Python - Size: 274 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Rhizome-Conifer/conifer-deploy

Conifer setup and deployment via Ansible

Language: Shell - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 12 - Forks: 7

helgeho/Tempas2ArchiveSpark

ArchiveSpark DataSpec to analyze the Internet Archive's Web archive through temporal search results returned by Tempas (v2)

Language: Scala - Size: 23.4 KB - Last synced at: 30 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

helgeho/WarcPartitioner

Partition (W)ARC Files by MIME Type and Year

Language: Java - Size: 8.79 KB - Last synced at: 30 days ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 1

helgeho/HadoopConcatGz

A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz

Language: Java - Size: 51.8 KB - Last synced at: 30 days ago - Pushed at: over 7 years ago - Stars: 9 - Forks: 3

nla/chropro πŸ“¦

Chrome debugging protocol client for Java

Language: Java - Size: 115 KB - Last synced at: 1 day ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 2

ArchiveBox/archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Language: Python - Size: 12.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 10 - Forks: 0

ArchiveBox/pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

Size: 15.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 2

sul-dlss-deprecated/openwayback Fork of iipc/openwayback πŸ“¦

(used on swap vm 6/2020) Stanford's fork of iipc/openwayback, which is used on our "swap" (Stanford Web Archiving Portal) machines. (See also sul-dlss/swap which is intended as a replacement)

Language: Java - Size: 29.1 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

internetarchive/pdf_trio Fork of tralfamadude/pdf_trio

A PDF classifier ensemble with REST API service

Language: Python - Size: 15.5 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 1

yuzhoumo/piazzabox

Piazza course archiver and viewer

Language: Python - Size: 2.46 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

internetarchive/sandcrawler

Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki

Language: HTML - Size: 2.55 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 2

ArchiveBox/DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

Language: HTML - Size: 1.75 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

ArchiveBox/debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Language: Python - Size: 3.34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 5

nla/warcquet

Language: Java - Size: 44.9 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nla/pandora-labs

Australian web archive tools and experiments

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ArchiveBox/homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Language: Ruby - Size: 61.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 3

nla/wombat Fork of webrecorder/wombat

Wombat.js client-side rewriting library

Language: JavaScript - Size: 1.87 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

wdhdev/web-archiver πŸ“¦

Easily scrape, download and preview websites.

Language: EJS - Size: 664 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ArchivingToolsForWBM/AdvancedInternetArchiving

Makes saving pages in bulk to the wayback machine much easier

Language: HTML - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

LouayMagdy/webarchive-commons-py

Python Implementation for iipc/webarchive-commons

Language: Python - Size: 300 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pebnn/AutoInternetArchive

AutoInternetArchive is a very simple program designed to automatically archive webpages to The wayback machine with hourly intervals. AutoInternetArchive was designed to be run though a console window and left open for days or even months

Language: Python - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

shawnmjones/OffTopic-Detection Fork of yasmina85/OffTopic-Detection

This system evaluates a series of mementos (archived web pages) to determine which are off topic. The series can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.

Language: Python - Size: 712 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

mkrzmr/mkrzmr.github.io

Michael Kurzmeier, 4th year Phd Digital Humanities @Maynooth University

Size: 1.39 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

nla/jwebrenderer

Simple web service to render pages with headless chrome

Language: Java - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

nla/outbackproxy

HTTP/S proxy server which replays content from a web archive

Language: Java - Size: 26.4 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

nla/heritrixctl πŸ“¦

Heritrix runner and API client for Java

Language: Java - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

nla/butterflynet πŸ“¦

Streamline single-document web archiving tool

Language: Java - Size: 163 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ukwa/ukwa-manage

Shepherding our web archives from crawl to access.

Language: Jupyter Notebook - Size: 122 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 5