Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / internetarchive: 234 repositories
The Internet Archive is "the library of the Internet", and a big supporter of Free Software.
internetarchive/iiif
The official Internet Archive IIIF service
Language: JavaScript - Size: 83 MB - Last synced: about 21 hours ago - Pushed: 1 day ago - Stars: 20 - Forks: 4
internetarchive/iaux-item-userlists
Add/remove item to userlists on Details page
Language: TypeScript - Size: 1.05 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 0 - Forks: 0
internetarchive/Zeno
State-of-the-art web crawler 🔱
Language: Go - Size: 650 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 29 - Forks: 1
internetarchive/brozzler
brozzler - distributed browser-based web crawler
Language: Python - Size: 4.1 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 629 - Forks: 94
internetarchive/wayback-diff
React components to render differences between captures at the Wayback Machine
Language: JavaScript - Size: 13.6 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 30 - Forks: 8
internetarchive/iaux-modal-manager
A Modal Manager WebComponent
Language: TypeScript - Size: 1.4 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 1
internetarchive/ads-common
Common components and utilities for the Archiving & Data Services (ADS) team at the Internet Archive
Language: TypeScript - Size: 257 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 2 - Forks: 0
internetarchive/iare
An interactive IARI JSON viewer
Language: JavaScript - Size: 12.6 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 5 - Forks: 4
internetarchive/archive-hocr-tools
Efficient hOCR tooling
Language: Python - Size: 241 KB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 32 - Forks: 8
internetarchive/openlibrary-client
Python Client Library for the Archive.org OpenLibrary API
Language: Python - Size: 471 KB - Last synced: 13 days ago - Pushed: 27 days ago - Stars: 340 - Forks: 90
internetarchive/iaux-analytics-manager
Language: TypeScript - Size: 164 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 1
internetarchive/iaux-styles
Language: TypeScript - Size: 304 KB - Last synced: 10 days ago - Pushed: 16 days ago - Stars: 0 - Forks: 0
internetarchive/hind
Hashistack-IN-Docker (single container with nomad + consul + caddy)
Language: Shell - Size: 2 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 46 - Forks: 4
internetarchive/wayback (fork of iipc/openwayback)
IA's public Wayback Machine (moved from SourceForge)
Language: Java - Size: 12.9 MB - Last synced: 12 days ago - Pushed: about 2 months ago - Stars: 708 - Forks: 124
internetarchive/archiveorg-e2e-playwright
Language: TypeScript - Size: 134 KB - Last synced: 14 days ago - Pushed: 15 days ago - Stars: 2 - Forks: 2
internetarchive/wayback-machine-webextension
A web browser extension for Chrome, Firefox, Edge, and Safari 14.
Language: JavaScript - Size: 33.8 MB - Last synced: 10 days ago - Pushed: 15 days ago - Stars: 590 - Forks: 203
internetarchive/openlibrary
One webpage for every book ever published!
Language: Python - Size: 85.8 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 4,824 - Forks: 1,190
internetarchive/internetarchivebot
Language: PHP - Size: 8.15 MB - Last synced: 14 days ago - Pushed: 19 days ago - Stars: 107 - Forks: 34
internetarchive/openlibrary-api
API documentation for https://github.com/internetarchive/openlibrary
Language: HTML - Size: 36.4 MB - Last synced: 14 days ago - Pushed: 16 days ago - Stars: 3 - Forks: 2
internetarchive/heritrix3
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Language: Java - Size: 10.5 MB - Last synced: 12 days ago - Pushed: 16 days ago - Stars: 2,675 - Forks: 753
internetarchive/bookreader
The Internet Archive BookReader
Language: JavaScript - Size: 45.5 MB - Last synced: 10 days ago - Pushed: 16 days ago - Stars: 928 - Forks: 408
internetarchive/cicd
build & test using github registry; deploy to nomad clusters
Size: 63.5 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 10 - Forks: 0
internetarchive/warctools
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
Language: Python - Size: 278 KB - Last synced: 12 days ago - Pushed: over 3 years ago - Stars: 141 - Forks: 25
internetarchive/web_collection_search
An API wrapper to the Elasticsearch index of web archival collections and a web UI to explore those indexes.
Language: Python - Size: 81.1 KB - Last synced: 14 days ago - Pushed: 5 months ago - Stars: 7 - Forks: 5
internetarchive/archive-pdf-tools
Fast PDF generation and compression. Deals with millions of pages daily.
Language: Python - Size: 25.8 MB - Last synced: 14 days ago - Pushed: 5 months ago - Stars: 79 - Forks: 13
internetarchive/newsum
Daily TV News Summary using GPT
Language: Python - Size: 181 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 19 - Forks: 3
internetarchive/iaux-shared-resize-observer
An efficient ResizeObserver to be shared amongst many components
Language: TypeScript - Size: 808 KB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0
internetarchive/dweb-mirror
Offline Internet Archive project
Language: JavaScript - Size: 1.74 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 243 - Forks: 25
internetarchive/openlibrary-bots
A repository of cleanup bots implementing the openlibrary-client
Language: Python - Size: 509 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 57 - Forks: 47
internetarchive/fatcat
Perpetual Access To The Scholarly Record
Language: Python - Size: 8.4 MB - Last synced: 14 days ago - Pushed: 6 months ago - Stars: 109 - Forks: 19
internetarchive/sandcrawler
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
Language: HTML - Size: 2.55 MB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 23 - Forks: 2
internetarchive/Sparkling
Internet Archive's Sparkling Data Processing Library
Language: Scala - Size: 557 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 10 - Forks: 2
internetarchive/strainer
Heritrix frontier files manipulation tool.
Language: Go - Size: 38.1 KB - Last synced: 14 days ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 0
internetarchive/warcprox
WARC writing MITM HTTP/S proxy
Language: Python - Size: 1.5 MB - Last synced: 14 days ago - Pushed: 6 months ago - Stars: 360 - Forks: 55
internetarchive/draintasker
a tool for continuously ingesting w/arc files into the archive
Language: Python - Size: 960 KB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 9 - Forks: 7
internetarchive/trendmachine
A mathematical model that calculates a normalized score quantifying the temporal resilience of a web page as time-series data, based on historical observations of the page in web archives.
Language: Python - Size: 23.4 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 1
internetarchive/map-of-the-web
Language: Python - Size: 3.29 MB - Last synced: 12 days ago - Pushed: over 5 years ago - Stars: 4 - Forks: 2
internetarchive/surt (fork of rajbot/surt)
Sort-friendly URI Reordering Transform (SURT) python module
Language: Python - Size: 120 KB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 38 - Forks: 16
internetarchive/wayback-radial-tree
Language: JavaScript - Size: 2.12 MB - Last synced: 10 days ago - Pushed: 17 days ago - Stars: 7 - Forks: 8
internetarchive/arklet
ARK minter, binder, resolver
Language: Python - Size: 150 KB - Last synced: 14 days ago - Pushed: 9 months ago - Stars: 18 - Forks: 8
internetarchive/arch
Web application for distributed compute analysis of Archive-It web archive collections.
Language: Scala - Size: 54.5 MB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 13 - Forks: 4
internetarchive/crawling-for-nomore404
Language: Python - Size: 11 MB - Last synced: 13 days ago - Pushed: about 1 month ago - Stars: 23 - Forks: 17
internetarchive/fatcat-scholar
search interface for scholarly works
Language: Python - Size: 5.28 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 75 - Forks: 14
internetarchive/dweb-transports
Language: JavaScript - Size: 16.3 MB - Last synced: 10 days ago - Pushed: 9 months ago - Stars: 24 - Forks: 16
internetarchive/iaux-music-player
IA music player
Language: TypeScript - Size: 586 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 2 - Forks: 0
internetarchive/dweb-transport
Internet Archive Decentralized Web Common API
Size: 10.3 MB - Last synced: 14 days ago - Pushed: about 4 years ago - Stars: 37 - Forks: 10
internetarchive/wikibase-patcher
Python library for interacting with the Wikibase REST API
Language: Python - Size: 18.6 KB - Last synced: 14 days ago - Pushed: 7 months ago - Stars: 5 - Forks: 1
internetarchive/wiki-references-db
Data models and scripts to build a database of references (broadly defined) appearing on Wikipedia and other wikis
Language: Python - Size: 17.6 KB - Last synced: 14 days ago - Pushed: 11 months ago - Stars: 2 - Forks: 0
internetarchive/iabot-deploy-helpers
Scripts used to help with InternetArchiveBot deployment on Wikipedia
Language: Python - Size: 1000 Bytes - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
internetarchive/iaux-democracys-library
A web component that highlights Democracy's Library
Language: TypeScript - Size: 580 KB - Last synced: 10 days ago - Pushed: 10 months ago - Stars: 2 - Forks: 2
internetarchive/iaux-collection-browser
Language: TypeScript - Size: 10.3 MB - Last synced: 14 days ago - Pushed: 24 days ago - Stars: 4 - Forks: 1
internetarchive/annotate-client (fork of hypothesis/client)
The Hypothesis web-based annotation client.
Language: HTML - Size: 31.5 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
internetarchive/isodos
Go module to interact with Internet Archive's Isodos API
Language: Go - Size: 50.8 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 0
internetarchive/iaux-sharing-options
Sharing options for Internet Archive items
Language: JavaScript - Size: 481 KB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1
internetarchive/iaux-book-search-results
Book search results pane for ia-menu-slider
Language: JavaScript - Size: 166 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
internetarchive/openlibrary-librarians
Coordination between the OpenLibrary.org Librarian community
Size: 3.91 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 16 - Forks: 3
internetarchive/pdf_trio (fork of tralfamadude/pdf_trio)
A PDF classifier ensemble with REST API service
Language: Python - Size: 15.5 MB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 22 - Forks: 1
internetarchive/dweb-archivecontroller
Language: JavaScript - Size: 1.95 MB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 6 - Forks: 2
internetarchive/wayback-discover-diff (fork of ftsalamp/wayback-discover-diff)
A Python 3.6+ application that calculates and returns simhash values for Internet Archive's snapshots
Language: Python - Size: 217 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 4
internetarchive/rulesengine-client
Python client package for the playback rules engine
Language: Python - Size: 87.9 KB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 2
internetarchive/dweb-archive
Language: JavaScript - Size: 26.9 MB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 54 - Forks: 16
internetarchive/internet-archive-voice-apps
Voice Apps (Actions on Google, Alexa Skill) of Internet Archive. Just say: "Ok Google, Ask Internet Archive to Play Jazz" or "Alexa, Ask Internet Archive to play Instrumental Music"
Language: JavaScript - Size: 4.4 MB - Last synced: 12 days ago - Pushed: 17 days ago - Stars: 45 - Forks: 45
internetarchive/wayback-machine-safari
Language: JavaScript - Size: 3.7 MB - Last synced: 10 days ago - Pushed: over 6 years ago - Stars: 5 - Forks: 6
internetarchive/trough
Trough: Big data, small databases.
Language: Python - Size: 738 KB - Last synced: 14 days ago - Pushed: 11 months ago - Stars: 36 - Forks: 7
internetarchive/doublethink
rethinkdb python library
Language: Python - Size: 108 KB - Last synced: 14 days ago - Pushed: 7 months ago - Stars: 11 - Forks: 5
internetarchive/CDX-Writer (fork of rajbot/CDX-Writer)
Python script to create CDX index files of WARC data
Language: Arc - Size: 5.59 MB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 20 - Forks: 12
internetarchive/warc
Python library for reading and writing warc files
Language: Python - Size: 202 KB - Last synced: 9 days ago - Pushed: about 2 years ago - Stars: 232 - Forks: 114
internetarchive/infogami (fork of infogami/infogami)
Language: Python - Size: 2.59 MB - Last synced: 13 days ago - Pushed: 23 days ago - Stars: 40 - Forks: 26
internetarchive/bookserver
Archive.org OPDS Bookserver - A standard for digital book distribution
Language: Python - Size: 289 KB - Last synced: 13 days ago - Pushed: over 5 years ago - Stars: 113 - Forks: 19
internetarchive/cdx-summary
Summarize web archive capture index (CDX) files.
Language: Python - Size: 227 KB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 43 - Forks: 7
internetarchive/epub3 (fork of deborahgu/abbyy-to-epub3)
Internet Archive utility which converts abbyy to epub3
Language: Python - Size: 17 MB - Last synced: 13 days ago - Pushed: over 4 years ago - Stars: 3 - Forks: 2
internetarchive/ArchiveSpark (fork of helgeho/ArchiveSpark)
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Language: Scala - Size: 1.15 MB - Last synced: 14 days ago - Pushed: 23 days ago - Stars: 6 - Forks: 1
internetarchive/certstream-go (fork of pathtofile/certstream-go)
Go library for connecting to CertStream
Language: Go - Size: 26.4 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
internetarchive/polyfill-service (fork of polyfillpolyfill/polyfill-service)
Automatic polyfill service.
Size: 31.5 MB - Last synced: 14 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
internetarchive/iaux-collection-name-cache
Language: TypeScript - Size: 446 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
internetarchive/jpg241
Author & serve a single progressive JPEG image that can be served at two different qualities: "Two for One" :)
Language: TypeScript - Size: 229 KB - Last synced: 14 days ago - Pushed: about 1 month ago - Stars: 1 - Forks: 0
internetarchive/dyno
Language: JavaScript - Size: 238 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 3 - Forks: 1
internetarchive/tocky
[WIP] Extract structured table of contents data from digitized books
Language: Python - Size: 95.7 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 1
internetarchive/archive-ocr-tools
Language: Python - Size: 26.4 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1
internetarchive/ia
A JS interface to archive.org
Language: JavaScript - Size: 172 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 7 - Forks: 2
internetarchive/umbra
A queue-controlled browser automation tool for improving web crawl quality
Language: Python - Size: 243 KB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 58 - Forks: 25
internetarchive/iaux-typescript-wc-template
IAUX Typescript WebComponent Template
Language: TypeScript - Size: 1.18 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 7 - Forks: 4
internetarchive/iaux-search-service
Language: TypeScript - Size: 1.13 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 4 - Forks: 2
internetarchive/public-domain-day-film-contest
Internet Archive Public Domain Day Film Contest 2024 Entries
Language: HTML - Size: 5.86 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
internetarchive/testy
Language: Dockerfile - Size: 11.7 KB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
internetarchive/build-nocache
GitHub Action to build a Docker image, like the "build" sister repo/action, except that it doesn't use "cache-to" and "cache-from"
Size: 14.6 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
internetarchive/esbuild_es5
minify JS/TS files using `esbuild` and `swc` down to ES5 (uses `deno`)
Language: TypeScript - Size: 81.1 KB - Last synced: 14 days ago - Pushed: 3 months ago - Stars: 5 - Forks: 0
internetarchive/wayback-machine-ios
Wayback Machine application for iOS
Language: Swift - Size: 1.81 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 0 - Forks: 1
internetarchive/wayback-machine-android
Language: Kotlin - Size: 495 KB - Last synced: 12 days ago - Pushed: 8 months ago - Stars: 10 - Forks: 10
internetarchive/emularity-config
archive.org software emulation
Language: Dockerfile - Size: 671 KB - Last synced: 14 days ago - Pushed: 18 days ago - Stars: 2 - Forks: 0
internetarchive/emularity-engine
archive.org software emulation
Language: JavaScript - Size: 4.66 GB - Last synced: 14 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0
internetarchive/emularity-bios
archive.org software emulation
Language: Dockerfile - Size: 82.2 MB - Last synced: 14 days ago - Pushed: 18 days ago - Stars: 0 - Forks: 0
internetarchive/iaux
Monorepo for Archive.org UX development and prototyping.
Language: JavaScript - Size: 34.5 MB - Last synced: about 22 hours ago - Pushed: 1 day ago - Stars: 63 - Forks: 85
internetarchive/wayback-machine-firefox
Reduce annoying 404 pages by automatically checking for an archived copy in the Wayback Machine. Learn more about this Test Pilot experiment at https://testpilot.firefox.com/
Language: JavaScript - Size: 4.17 MB - Last synced: 13 days ago - Pushed: over 5 years ago - Stars: 52 - Forks: 17
internetarchive/iari
Import workflows for the Wikipedia Citations Database
Language: Python - Size: 6.55 MB - Last synced: 14 days ago - Pushed: about 2 months ago - Stars: 12 - Forks: 9
internetarchive/xfetch
Cache stampede test harness. Code accompanies the presentation made at RedisConf 2017, 30 May to 1 June, 2017, in San Francisco.
Language: PHP - Size: 42 KB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 19 - Forks: 2
internetarchive/read_api_extras
Demo code for the Open Library Read API
Size: 94.7 KB - Last synced: 12 days ago - Pushed: over 12 years ago - Stars: 7 - Forks: 9
internetarchive/AspectMock (fork of Codeception/AspectMock)
The most powerful and flexible mocking framework for PHPUnit / Codeception.
Size: 504 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
internetarchive/parser-reflection (fork of goaop/parser-reflection)
Parser Reflection API - Provides source code analysis without loading classes into the PHP memory
Size: 311 KB - Last synced: 14 days ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
internetarchive/iacopilot
Summarize and ask questions about items in the Internet Archive
Language: Python - Size: 32.2 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 13 - Forks: 5