An open API service providing repository metadata for many open source software ecosystems.

Topic: "web-archive"

DO-SAY-GO/dn

💾 dn - offline full-text search and archiving for your Chromium-based browser.

Language: JavaScript - Size: 11 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 3,833 - Forks: 148

Ray-D-Song/web-archive

Free web archiving and sharing service based on Cloudflare. 基于 Cloudflare 的免费网页归档和分享工具。

Language: TypeScript - Size: 10.3 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 819 - Forks: 287

webrecorder/replayweb.page

Serverless replay of web archives directly in the browser

Language: TypeScript - Size: 84.3 MB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 787 - Forks: 68

webrecorder/browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

Language: TypeScript - Size: 15.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 263 - Forks: 49

devanshbatham/ArchiveFuzz

Hunt down the secrets from the WebArchives for Fun and Profit

Language: Python - Size: 111 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 167 - Forks: 39

Own-Data-Privateer/hoardy-web

Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.

Language: Python - Size: 2.65 MB - Last synced at: 3 days ago - Pushed at: 23 days ago - Stars: 76 - Forks: 7

internetarchive/cdx-summary

Summarize web archive capture index (CDX) files.

Language: Python - Size: 227 KB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 65 - Forks: 13

TarekJor/bookmark-archiver Fork of ArchiveBox/ArchiveBox

🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

Language: Python - Size: 2.65 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 36 - Forks: 2

webis-de/archive-query-log

📜 The Archive Query Log.

Language: Jupyter Notebook - Size: 52.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 28 - Forks: 0

ShaunLWM/ark

🚢 A self-hosted, personal archival application

Language: JavaScript - Size: 757 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 0

antiufo/Shaman.Dokan.Warc

Mounts WARC files on Windows

Language: C# - Size: 241 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 17 - Forks: 1

YGGverse/YGGo 📦

YGGo! Distributed Web Search Engine

Language: PHP - Size: 4.03 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 3

anjackson/sliver

A tool for collection archival slivers of the web and web archives

Language: Python - Size: 61.5 KB - Last synced at: 18 days ago - Pushed at: 2 months ago - Stars: 13 - Forks: 1

oduwsdl/MementoMap

A Tool to Summarize Web Archive Holdings

Language: Python - Size: 184 KB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 1

swve/gitstorykit

Build rich git projects history discovery apps with ease, used by Gitstory

Language: TypeScript - Size: 1.07 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 0

oritwoen/omnichron

Unified TypeScript interface for multiple web archive platforms.

Language: TypeScript - Size: 709 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6 - Forks: 0

ysdn-info/ysdn.info

An archive of the York/Sheridan Design Program

Language: HTML - Size: 6.41 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

minch-dev/DownTheMoon Fork of downthemall/downthemall-legacy

A continuation of legacy XUL version of DownThemAll! ✔️preserves web.archive.org timestamps, ✔️advanced filters for remote directory tree mirroring, ✔️UI is tweaked for better UX

Language: JavaScript - Size: 13 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

bottomless-archive-project/java-warc Fork of laxika/java-warc

Read Web ARChive (WARC) files in Java.

Language: Java - Size: 185 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

thiagolopes/alexandria

Backup and save websites

Language: Python - Size: 75.2 KB - Last synced at: about 11 hours ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

q-m/replayweb.page-docker

Docker image for ReplayWeb.page

Language: Dockerfile - Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

ArtificialOSS/WebCrawl

Crawls the web to generate a huge dataset for training

Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ibnesayeed/utils

Miscellaneous utility scripts

Language: Python - Size: 20.5 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

india-ultimate/the-huddle

A mirror of The Huddle magazine

Language: Python - Size: 4.89 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

laxika/java-warc Fork of Mixnode/mixnode-warcreader-java 📦

Read Web ARChive (WARC) files in Java.

Language: Java - Size: 130 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

grey-land/warc-browser

a cli toolkit for working with web archives

Language: Go - Size: 469 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

sergio11/retrospect

Retrospect 🔍 is a cybersecurity tool that analyzes historical web snapshots 🕒 from the Wayback Machine, uncovering vulnerabilities 🛡️, sensitive data leaks 🔓, and security misconfigurations 🛠️. It empowers security pros to predict and mitigate threats ⚠️ before they become exploitable.

Language: Python - Size: 901 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

wayback-if-down/wayback-if-down.github.io

Redirect to a live website or an archived version if it's down.

Language: HTML - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

wdhdev/web-archiver 📦

Easily scrape, download and preview websites.

Language: EJS - Size: 664 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

paulmelnikow/wabac

A versioned cache backed by cloud storage

Language: JavaScript - Size: 428 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AndreMor8/wubbzy-sites

Wubbzy archived sites

Language: HTML - Size: 364 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

extua/wacksy

An experimental library for reading and writing WACZ files

Language: Rust - Size: 130 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

michaelvcolianna/mattsfacial.com

A pointless site that means a lot for inexplicable reasons.

Language: JavaScript - Size: 422 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

meadowingc/waybacker

Periodically crawl a set of websites and ensure that all of their pages are archived on the Wayback Machine. Mirror of https://codeberg.org/meadowingc/waybacker

Language: Go - Size: 9.77 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

jskherman/web-clips

An archive site of some webpages on the Internet created with the help of the SingleFile extension.

Language: CSS - Size: 53.5 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

s5-dev/archiver

Tool to archive websites and other content available on the Internet on the content-addressed S5 Network

Language: Dart - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

jskherman/SingleFile-Archives Fork of gildas-lormeau/SingleFile-Archives

Pages saved with the SingleFile browser extension.

Language: HTML - Size: 78.7 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rodfer0x80/lit_archive

a cool and easy way to maintain a web archive

Language: Python - Size: 277 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

publicdocs-platform/archiver-extension

Save current url to web.archive.org. Not affiliated with the Internet Archive. Chrome extension

Language: JavaScript - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1