Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / internetarchive - 234 repositories

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

internetarchive/iiif

The official Internet Archive IIIF service

Language: JavaScript - Size: 83 MB - Last synced: about 21 hours ago - Pushed: 1 day ago - Stars: 20 - Forks: 4

internetarchive/iaux-item-userlists

Add/remove item to userlists on Details page

Language: TypeScript - Size: 1.05 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 0 - Forks: 0

internetarchive/Zeno

State-of-the-art web crawler 🔱

Language: Go - Size: 650 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 29 - Forks: 1

internetarchive/brozzler

brozzler - distributed browser-based web crawler

Language: Python - Size: 4.1 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 629 - Forks: 94

internetarchive/wayback-diff

React components to render differences between captures at the Wayback Machine

Language: JavaScript - Size: 13.6 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 30 - Forks: 8

internetarchive/iaux-modal-manager

A Modal Manager WebComponent

Language: TypeScript - Size: 1.4 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 1

internetarchive/ads-common

Common components and utilities for the Archiving & Data Services (ADS) team at the Internet Archive

Language: TypeScript - Size: 257 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 2 - Forks: 0

internetarchive/iare

An interactive IARI JSON viewer

Language: JavaScript - Size: 12.6 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 5 - Forks: 4

internetarchive/archive-hocr-tools

Efficient hOCR tooling

Language: Python - Size: 241 KB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 32 - Forks: 8

internetarchive/openlibrary-client

Python Client Library for the Archive.org OpenLibrary API

Language: Python - Size: 471 KB - Last synced: 13 days ago - Pushed: 27 days ago - Stars: 340 - Forks: 90

internetarchive/iaux-analytics-manager

Language: TypeScript - Size: 164 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 1

internetarchive/iaux-styles

Language: TypeScript - Size: 304 KB - Last synced: 10 days ago - Pushed: 16 days ago - Stars: 0 - Forks: 0

internetarchive/hind

Hashistack-IN-Docker (single container with nomad + consul + caddy)

Language: Shell - Size: 2 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 46 - Forks: 4

internetarchive/wayback (fork of iipc/openwayback)

IA's public Wayback Machine (moved from SourceForge)

Language: Java - Size: 12.9 MB - Last synced: 12 days ago - Pushed: about 2 months ago - Stars: 708 - Forks: 124

internetarchive/archiveorg-e2e-playwright

Language: TypeScript - Size: 134 KB - Last synced: 14 days ago - Pushed: 15 days ago - Stars: 2 - Forks: 2

internetarchive/wayback-machine-webextension

A web browser extension for Chrome, Firefox, Edge, and Safari 14.

Language: JavaScript - Size: 33.8 MB - Last synced: 10 days ago - Pushed: 15 days ago - Stars: 590 - Forks: 203

internetarchive/openlibrary

One webpage for every book ever published!

Language: Python - Size: 85.8 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 4,824 - Forks: 1,190

internetarchive/internetarchivebot

Language: PHP - Size: 8.15 MB - Last synced: 14 days ago - Pushed: 19 days ago - Stars: 107 - Forks: 34

internetarchive/openlibrary-api

API documentation for https://github.com/internetarchive/openlibrary

Language: HTML - Size: 36.4 MB - Last synced: 14 days ago - Pushed: 16 days ago - Stars: 3 - Forks: 2

internetarchive/heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Language: Java - Size: 10.5 MB - Last synced: 12 days ago - Pushed: 16 days ago - Stars: 2,675 - Forks: 753

internetarchive/bookreader

The Internet Archive BookReader

Language: JavaScript - Size: 45.5 MB - Last synced: 10 days ago - Pushed: 16 days ago - Stars: 928 - Forks: 408

internetarchive/cicd

build & test using github registry; deploy to nomad clusters

Size: 63.5 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 10 - Forks: 0

internetarchive/warctools

Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)

Language: Python - Size: 278 KB - Last synced: 12 days ago - Pushed: over 3 years ago - Stars: 141 - Forks: 25

internetarchive/web_collection_search

An API wrapper to the Elasticsearch index of web archival collections and a web UI to explore those indexes.

Language: Python - Size: 81.1 KB - Last synced: 14 days ago - Pushed: 5 months ago - Stars: 7 - Forks: 5

internetarchive/archive-pdf-tools

Fast PDF generation and compression. Deals with millions of pages daily.

Language: Python - Size: 25.8 MB - Last synced: 14 days ago - Pushed: 5 months ago - Stars: 79 - Forks: 13

internetarchive/newsum

Daily TV News Summary using GPT

Language: Python - Size: 181 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 19 - Forks: 3

internetarchive/iaux-shared-resize-observer

An efficient ResizeObserver to be shared amongst many components

Language: TypeScript - Size: 808 KB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0

internetarchive/dweb-mirror

Offline Internet Archive project

Language: JavaScript - Size: 1.74 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 243 - Forks: 25

internetarchive/openlibrary-bots

A repository of cleanup bots implementing the openlibrary-client

Language: Python - Size: 509 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 57 - Forks: 47

internetarchive/fatcat

Perpetual Access To The Scholarly Record

Language: Python - Size: 8.4 MB - Last synced: 14 days ago - Pushed: 6 months ago - Stars: 109 - Forks: 19

internetarchive/sandcrawler

Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki

Language: HTML - Size: 2.55 MB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 23 - Forks: 2

internetarchive/Sparkling

Internet Archive's Sparkling Data Processing Library

Language: Scala - Size: 557 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 10 - Forks: 2

internetarchive/strainer

Heritrix frontier files manipulation tool.

Language: Go - Size: 38.1 KB - Last synced: 14 days ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 0

internetarchive/warcprox

WARC writing MITM HTTP/S proxy

Language: Python - Size: 1.5 MB - Last synced: 14 days ago - Pushed: 6 months ago - Stars: 360 - Forks: 55

internetarchive/draintasker

a tool for continuously ingesting w/arc files into the archive

Language: Python - Size: 960 KB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 9 - Forks: 7

internetarchive/trendmachine

A mathematical model to calculate a normalized score that quantifies the temporal resilience of a web page as time-series data, based on the historical observations of the page in web archives.

Language: Python - Size: 23.4 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 1

internetarchive/map-of-the-web

Language: Python - Size: 3.29 MB - Last synced: 12 days ago - Pushed: over 5 years ago - Stars: 4 - Forks: 2

internetarchive/surt (fork of rajbot/surt)

Sort-friendly URI Reordering Transform (SURT) python module

Language: Python - Size: 120 KB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 38 - Forks: 16

internetarchive/wayback-radial-tree

Language: JavaScript - Size: 2.12 MB - Last synced: 10 days ago - Pushed: 17 days ago - Stars: 7 - Forks: 8

internetarchive/arklet

ARK minter, binder, resolver

Language: Python - Size: 150 KB - Last synced: 14 days ago - Pushed: 9 months ago - Stars: 18 - Forks: 8

internetarchive/arch

Web application for distributed compute analysis of Archive-It web archive collections.

Language: Scala - Size: 54.5 MB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 13 - Forks: 4

internetarchive/crawling-for-nomore404

Language: Python - Size: 11 MB - Last synced: 13 days ago - Pushed: about 1 month ago - Stars: 23 - Forks: 17

internetarchive/fatcat-scholar

search interface for scholarly works

Language: Python - Size: 5.28 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 75 - Forks: 14

internetarchive/dweb-transports

Language: JavaScript - Size: 16.3 MB - Last synced: 10 days ago - Pushed: 9 months ago - Stars: 24 - Forks: 16

internetarchive/iaux-music-player

IA music player

Language: TypeScript - Size: 586 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 2 - Forks: 0

internetarchive/dweb-transport

Internet Archive Decentralized Web Common API

Size: 10.3 MB - Last synced: 14 days ago - Pushed: about 4 years ago - Stars: 37 - Forks: 10

internetarchive/wikibase-patcher

Python library for interacting with the Wikibase REST API

Language: Python - Size: 18.6 KB - Last synced: 14 days ago - Pushed: 7 months ago - Stars: 5 - Forks: 1

internetarchive/wiki-references-db

Data models and scripts to build a database of references (broadly defined) appearing on Wikipedia and other wikis

Language: Python - Size: 17.6 KB - Last synced: 14 days ago - Pushed: 11 months ago - Stars: 2 - Forks: 0

internetarchive/iabot-deploy-helpers

Scripts used to help with InternetArchiveBot deployment on Wikipedia

Language: Python - Size: 1000 Bytes - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

internetarchive/iaux-democracys-library

A web component that highlights Democracy's Library

Language: TypeScript - Size: 580 KB - Last synced: 10 days ago - Pushed: 10 months ago - Stars: 2 - Forks: 2

internetarchive/iaux-collection-browser

Language: TypeScript - Size: 10.3 MB - Last synced: 14 days ago - Pushed: 24 days ago - Stars: 4 - Forks: 1

internetarchive/annotate-client (fork of hypothesis/client)

The Hypothesis web-based annotation client.

Language: HTML - Size: 31.5 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

internetarchive/isodos

Go module to interact with Internet Archive's Isodos API

Language: Go - Size: 50.8 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 0

internetarchive/iaux-sharing-options

Sharing options for Internet Archive items

Language: JavaScript - Size: 481 KB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

internetarchive/iaux-book-search-results

Book search results pane for ia-menu-slider

Language: JavaScript - Size: 166 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

internetarchive/openlibrary-librarians

Coordination between the OpenLibrary.org Librarian community

Size: 3.91 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 16 - Forks: 3

internetarchive/pdf_trio (fork of tralfamadude/pdf_trio)

A PDF classifier ensemble with REST API service

Language: Python - Size: 15.5 MB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 22 - Forks: 1

internetarchive/dweb-archivecontroller

Language: JavaScript - Size: 1.95 MB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 6 - Forks: 2

internetarchive/wayback-discover-diff (fork of ftsalamp/wayback-discover-diff)

A Python 3.6+ application that calculates and returns simhash values for Internet Archive's snapshots

Language: Python - Size: 217 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 4

internetarchive/rulesengine-client

Python client package for the playback rules engine

Language: Python - Size: 87.9 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 2

internetarchive/dweb-archive

Language: JavaScript - Size: 26.9 MB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 54 - Forks: 16

internetarchive/internet-archive-voice-apps

Voice Apps (Actions on Google, Alexa Skill) of Internet Archive. Just say: "Ok Google, Ask Internet Archive to Play Jazz" or "Alexa, Ask Internet Archive to play Instrumental Music"

Language: JavaScript - Size: 4.4 MB - Last synced: 12 days ago - Pushed: 17 days ago - Stars: 45 - Forks: 45

internetarchive/wayback-machine-safari

Language: JavaScript - Size: 3.7 MB - Last synced: 10 days ago - Pushed: over 6 years ago - Stars: 5 - Forks: 6

internetarchive/trough

Trough: Big data, small databases.

Language: Python - Size: 738 KB - Last synced: 14 days ago - Pushed: 11 months ago - Stars: 36 - Forks: 7

internetarchive/doublethink

rethinkdb python library

Language: Python - Size: 108 KB - Last synced: 14 days ago - Pushed: 7 months ago - Stars: 11 - Forks: 5

internetarchive/CDX-Writer (fork of rajbot/CDX-Writer)

Python script to create CDX index files of WARC data

Language: Arc - Size: 5.59 MB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 20 - Forks: 12

internetarchive/warc

Python library for reading and writing warc files

Language: Python - Size: 202 KB - Last synced: 9 days ago - Pushed: about 2 years ago - Stars: 232 - Forks: 114

internetarchive/infogami (fork of infogami/infogami)

Language: Python - Size: 2.59 MB - Last synced: 13 days ago - Pushed: 23 days ago - Stars: 40 - Forks: 26

internetarchive/bookserver

Archive.org OPDS Bookserver - A standard for digital book distribution

Language: Python - Size: 289 KB - Last synced: 13 days ago - Pushed: over 5 years ago - Stars: 113 - Forks: 19

internetarchive/cdx-summary

Summarize web archive capture index (CDX) files.

Language: Python - Size: 227 KB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 43 - Forks: 7

internetarchive/epub3 (fork of deborahgu/abbyy-to-epub3)

Internet Archive utility which converts abbyy to epub3

Language: Python - Size: 17 MB - Last synced: 13 days ago - Pushed: over 4 years ago - Stars: 3 - Forks: 2

internetarchive/ArchiveSpark (fork of helgeho/ArchiveSpark)

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Language: Scala - Size: 1.15 MB - Last synced: 14 days ago - Pushed: 23 days ago - Stars: 6 - Forks: 1

internetarchive/certstream-go (fork of pathtofile/certstream-go)

Go library for connecting to CertStream

Language: Go - Size: 26.4 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

internetarchive/polyfill-service (fork of polyfillpolyfill/polyfill-service)

Automatic polyfill service.

Size: 31.5 MB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

internetarchive/iaux-collection-name-cache

Language: TypeScript - Size: 446 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

internetarchive/jpg241

Author & serve a single progressive JPEG image that can be served at two different qualities: "Two for One" :)

Language: TypeScript - Size: 229 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 1 - Forks: 0

internetarchive/dyno

Language: JavaScript - Size: 238 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 3 - Forks: 1

internetarchive/tocky

[WIP] Extract structured table of contents data from digitized books

Language: Python - Size: 95.7 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 1

internetarchive/archive-ocr-tools

Language: Python - Size: 26.4 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1

internetarchive/ia

A JS interface to archive.org

Language: JavaScript - Size: 172 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 7 - Forks: 2

internetarchive/umbra

A queue-controlled browser automation tool for improving web crawl quality

Language: Python - Size: 243 KB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 58 - Forks: 25

internetarchive/iaux-typescript-wc-template

IAUX Typescript WebComponent Template

Language: TypeScript - Size: 1.18 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 7 - Forks: 4

internetarchive/iaux-search-service

Language: TypeScript - Size: 1.13 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 4 - Forks: 2

internetarchive/public-domain-day-film-contest

Internet Archive Public Domain Day Film Contest 2024 Entries

Language: HTML - Size: 5.86 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

internetarchive/testy

Language: Dockerfile - Size: 11.7 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

internetarchive/build-nocache

GitHub Action to build docker image, like "build" sister repo/action, just doesn't use "cache-to" and "cache-from"

Size: 14.6 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

internetarchive/esbuild_es5

minify JS/TS files using `esbuild` and `swc` down to ES5 (uses `deno`)

Language: TypeScript - Size: 81.1 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 5 - Forks: 0

internetarchive/wayback-machine-ios

Wayback Machine application for iOS

Language: Swift - Size: 1.81 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 0 - Forks: 1

internetarchive/wayback-machine-android

Language: Kotlin - Size: 495 KB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 10 - Forks: 10

internetarchive/emularity-config

archive.org software emulation

Language: Dockerfile - Size: 671 KB - Last synced: 14 days ago - Pushed: 18 days ago - Stars: 2 - Forks: 0

internetarchive/emularity-engine

archive.org software emulation

Language: JavaScript - Size: 4.66 GB - Last synced: 14 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0

internetarchive/emularity-bios

archive.org software emulation

Language: Dockerfile - Size: 82.2 MB - Last synced: 14 days ago - Pushed: 18 days ago - Stars: 0 - Forks: 0

internetarchive/iaux

Monorepo for Archive.org UX development and prototyping.

Language: JavaScript - Size: 34.5 MB - Last synced: about 22 hours ago - Pushed: 1 day ago - Stars: 63 - Forks: 85

internetarchive/wayback-machine-firefox

Reduce annoying 404 pages by automatically checking for an archived copy in the Wayback Machine. Learn more about this Test Pilot experiment at https://testpilot.firefox.com/

Language: JavaScript - Size: 4.17 MB - Last synced: 13 days ago - Pushed: over 5 years ago - Stars: 52 - Forks: 17

internetarchive/iari

Import workflows for the Wikipedia Citations Database

Language: Python - Size: 6.55 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 12 - Forks: 9

internetarchive/xfetch

Cache stampede test harness. Code accompanies the presentation made at RedisConf 2017, 30 May to 1 June, 2017, in San Francisco.

Language: PHP - Size: 42 KB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 19 - Forks: 2

internetarchive/read_api_extras

Demo code for the Open Library Read API

Size: 94.7 KB - Last synced: 12 days ago - Pushed: over 12 years ago - Stars: 7 - Forks: 9

internetarchive/AspectMock (fork of Codeception/AspectMock)

The most powerful and flexible mocking framework for PHPUnit / Codeception.

Size: 504 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

internetarchive/parser-reflection (fork of goaop/parser-reflection)

Parser Reflection API - Provides source code analysis without loading classes into the PHP memory

Size: 311 KB - Last synced: 14 days ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

internetarchive/iacopilot

Summarize and ask questions about items in the Internet Archive

Language: Python - Size: 32.2 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 13 - Forks: 5
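
Each entry above ends with a metadata line of the form "Key: value - Key: value". For readers who want to work with this listing programmatically, the following is a minimal sketch of how such a line might be split into fields. The function names `parse_metadata_line` and `to_int` are hypothetical helpers introduced here for illustration; they are not part of the Ecosyste.ms API, and the sketch assumes field values never contain the " - " separator.

```python
from typing import Dict


def parse_metadata_line(line: str) -> Dict[str, str]:
    """Split a 'Key: value - Key: value' listing line into a dict.

    Hypothetical helper: assumes ' - ' only appears between fields.
    """
    fields: Dict[str, str] = {}
    for part in line.split(" - "):
        key, sep, value = part.partition(": ")
        if sep:  # skip fragments that have no 'Key: value' shape
            fields[key.strip()] = value.strip()
    return fields


def to_int(text: str) -> int:
    """Convert a count such as '4,824' to an integer."""
    return int(text.replace(",", ""))


# Metadata line copied from the internetarchive/openlibrary entry.
example = (
    "Language: Python - Size: 85.8 MB - Last synced: 13 days ago - "
    "Pushed: 13 days ago - Stars: 4,824 - Forks: 1,190"
)
record = parse_metadata_line(example)
stars = to_int(record["Stars"])
forks = to_int(record["Forks"])

# Some entries (for example internetarchive/cicd) list no language,
# so look that field up with .get() rather than indexing.
no_language = parse_metadata_line(
    "Size: 63.5 KB - Last synced: 14 days ago - "
    "Pushed: about 1 month ago - Stars: 10 - Forks: 0"
)
language = no_language.get("Language")
```

Relative times such as "about 21 hours ago" are left as plain strings here, since converting them to timestamps would require knowing when the page was generated.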