Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: archivespark
helgeho/ArchiveSpark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Language: Scala - Size: 1.15 MB - Last synced: 28 days ago - Pushed: 2 months ago - Stars: 141 - Forks: 19
helgeho/ArchiveSpark2Triples
Convert web archives to RDF triples with ArchiveSpark
Language: Jupyter Notebook - Size: 40 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 1 - Forks: 1
helgeho/Tempas2ArchiveSpark
ArchiveSpark DataSpec to analyze the Internet Archive's Web archive through temporal search results returned by Tempas (v2)
Language: Scala - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0