An open API service providing repository metadata for many open source software ecosystems.

GitHub / commoncrawl / nutch

Common Crawl fork of Apache Nutch

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/commoncrawl%2Fnutch
PURL: pkg:github/commoncrawl/nutch

Fork of Aloisius/nutch
Stars: 34
Forks: 2
Open issues: 6

License: apache-2.0
Language: Java
Size: 132 MB
Dependencies parsed at: Pending

Created at: almost 11 years ago
Updated at: about 1 month ago
Pushed at: about 1 month ago
Last synced at: about 1 month ago

Topics: big-data, commoncrawl, hadoop, java, web-crawler

    Loading...