Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / johnbumgarner / newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/johnbumgarner%2Fnewshound
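The JSON API URL above percent-encodes the slash in the repository's full name (`johnbumgarner%2Fnewshound`). A minimal sketch of how such a URL could be assembled with the Python standard library follows. The helper names `repository_api_url` and `fetch_repository` are hypothetical, and the shape of the JSON response is not shown on this page, so the fetch helper simply returns whatever the endpoint decodes to.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the JSON API URL listed on this page.
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repository_api_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL (hypothetical helper).

    safe="" makes quote() percent-encode "/" as "%2F", matching the
    URL format shown on this page.
    """
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def fetch_repository(host: str, full_name: str):
    """Fetch and decode the JSON document (hypothetical helper).

    The response fields are an unknown here; inspect the result
    before relying on any particular key.
    """
    with urlopen(repository_api_url(host, full_name), timeout=30) as response:
        return json.load(response)


url = repository_api_url("GitHub", "johnbumgarner/newshound")
print(url)
```

Calling `fetch_repository("GitHub", "johnbumgarner/newshound")` performs a network request, so it is left out of the snippet's top-level code.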
Stars: 29
Forks: 3
Open Issues: 1
License: None
Language:
Repo Size: 28.3 KB
Dependencies: 21
Created: over 2 years ago
Updated: 5 months ago
Last pushed: about 1 year ago
Last synced: 1 day ago
Commit Stats
Commits: 8
Authors: 1
Mean commits per author: 8.0
Development Distribution Score: 0.0
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/johnbumgarner/newshound
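The commit figures above can be reproduced with a short calculation. This sketch assumes the commonly used definition of the Development Distribution Score, DDS = 1 - (commits by the most active author / total commits); the page itself does not state the formula. The function name `commit_stats` and the author label `author_1` are hypothetical placeholders.

```python
def commit_stats(commits_by_author: dict) -> dict:
    """Summarise commit counts per author (hypothetical helper).

    Assumes DDS = 1 - (top author's commits / total commits).
    """
    total = sum(commits_by_author.values())
    authors = len(commits_by_author)
    top = max(commits_by_author.values())
    return {
        "commits": total,
        "authors": authors,
        "mean_commits_per_author": total / authors,
        "development_distribution_score": 1 - top / total,
    }


# One author with 8 commits, as listed on this page.
stats = commit_stats({"author_1": 8})
print(stats)
```

With a single author the top author holds every commit, so the score is 0.0 under this definition; a score closer to 1 would indicate work spread across many contributors.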
Topics: article-extracting, article-extractor, data-extraction, data-mining, data-science, datascience, news, news-aggregator, news-crawler, newspaper-crawler, python-newspaper, python3, text-mining, web-scraping, webscraping
Funding links: https://www.buymeacoffee.com/johnbumgarner
Dependencies
- backoff >=1.11.1
- beautifulsoup4 >=4.10.0
- certifi >=2021.10.8
- charset_normalizer >=2.0.9
- deckar01-ratelimit >=3.0.2
- dicttoxml >=1.7.4
- filelock >=3.4.0
- htmlmin >=0.1.12
- idna >=3.3
- langdetect >=1.0.9
- lxml >=4.6.4
- numpy >=1.21.5
- pandas >=1.3.5
- python-dateutil >=2.8.2
- pytz >=2021.3
- requests >=2.26.0
- requests-file >=1.5.1
- six >=1.16.0
- soupsieve >=2.31
- tldextract >=3.1.2
- urllib3 =1.25.11