Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / scrapinghub / aduana
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scrapinghub%2Faduana
Stars: 53
Forks: 8
Open Issues: 11
License: bsd-3-clause
Language: C
Repo Size: 11.4 MB
Dependencies:
36
Created: about 9 years ago
Updated: about 1 month ago
Last pushed: about 2 months ago
Last synced: about 1 month ago
Topics: data-science
Files
Dependencies
- breathe *
- beautifulsoup4 ==4.3.2
- frontera *
- scrapy *
- xxhash *
- aduana *
- beautifulsoup4 ==4.3.2
- frontera *
- langdetect *
- lmdb *
- marisa_trie *
- networkx >=1.10
- nltk *
- numpy *
- requests *
- scipy *
- scrapy *
- sklearn *
- xxhash *