GitHub / commoncrawl / nutch
Common Crawl fork of Apache Nutch
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/commoncrawl%2Fnutch
PURL: pkg:github/commoncrawl/nutch
Fork of Aloisius/nutch
Stars: 34
Forks: 2
Open issues: 6
License: Apache-2.0
Language: Java
Size: 132 MB
Dependencies parsed at: Pending
Created at: almost 11 years ago
Updated at: about 1 month ago
Pushed at: about 1 month ago
Last synced at: about 1 month ago
Topics: big-data, commoncrawl, hadoop, java, web-crawler