Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: archivespark

helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Language: Scala - Size: 1.15 MB - Last synced: 28 days ago - Pushed: 2 months ago - Stars: 141 - Forks: 19

helgeho/ArchiveSpark2Triples

Convert web archives to RDF triples with ArchiveSpark

Language: Jupyter Notebook - Size: 40 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 1 - Forks: 1

helgeho/Tempas2ArchiveSpark

ArchiveSpark DataSpec to analyze the Internet Archive's Web archive through temporal search results returned by Tempas (v2)

Language: Scala - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0