Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / DavidNemeskey / zim_to_corpus
Scripts to extract (mostly) Wikipedia pages from .zim archives.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DavidNemeskey%2Fzim_to_corpus
Stars: 0
Forks: 3
Open Issues: 2
License: mit
Language: Python
Repo Size: 139 KB
Dependencies:
10
Created: over 4 years ago
Updated: over 2 years ago
Last pushed: almost 2 years ago
Last synced: about 1 year ago
Files
Loading...
Readme
Loading...
Dependencies
requirements.txt
pypi
- spacy *
setup.py
pypi
- For *
- Progress *
- beautifulsoup4 *
- lxml ==4.6.5
- multiprocessing-logging *
- particularly *
- regex *
- requests *
- tqdm *