Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: news-crawler
lewisdonovan/google-news-scraper
Lightweight scraper for Google News
Language: JavaScript - Size: 285 KB - Last synced: 3 days ago - Pushed: 7 days ago - Stars: 183 - Forks: 54
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
Language: Python - Size: 2.98 MB - Last synced: 17 days ago - Pushed: 5 months ago - Stars: 1,937 - Forks: 403
karimhabush/TheguardianScrapper
A Scrapy webscraper that can scrape and store articles of theguardian.com
Language: Python - Size: 17.6 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 2
lumyjuwon/KoreaNewsCrawler
A korean news crawler built to ingest large amounts of news data.
Language: Python - Size: 1.38 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 217 - Forks: 104
aufamiri/berita-crawler
a web crawler to take all the latest indonesian news from many sources
Language: Python - Size: 84 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
sunight1999/news-crawler
Naver and Daum news web crawler via JSoup + Selenium.
Language: Java - Size: 9.61 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
adbar/trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Language: Python - Size: 23.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2,688 - Forks: 205
flairNLP/fundus
A very simple news crawler with a funny name
Language: Python - Size: 14.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 38 - Forks: 5
divkakwani/webcorpus
Generate large textual corpora for almost any language by crawling the web
Language: Python - Size: 44.9 MB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 7 - Forks: 8
AndyTheFactory/article-extraction-dataset
Article title, authors, date and body extraction dataset.
Language: HTML - Size: 31.9 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0
thinh-vu/vnnews
A Python package that helps capture news updates from top Vietnamese news sites
Language: Jupyter Notebook - Size: 90.8 KB - Last synced: 18 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1
johnbumgarner/newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
Size: 28.3 KB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 29 - Forks: 3
sakshamssr/GNews-API
A Fast and lightweight Python API that search for articles on Google News and returns a JSON response.
Language: Python - Size: 8.18 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 2 - Forks: 0
arian-askari/persian_news_websites_crawler
Crawler (Scraper) for several well-known persian news for scraping public data
Language: Python - Size: 23.4 KB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
codyle50/review-pundits
Language: JavaScript - Size: 29.6 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
LuChang-CS/news-crawler
A news crawler for BBC News, Reuters and New York Times.
Language: Python - Size: 46.9 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 93 - Forks: 39
AmaanHaider/News-crawler
Language: JavaScript - Size: 3.71 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
andymithamclarke/Pundits-Review
11/09/2020 - Complete directory for Pundits Review web application. https://www.punditsreview.com/
Language: JavaScript - Size: 29.6 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0
stardust95/NewsFeeds
Newsfeeds website using nodejs as server and mongo as storage backends, including a simple recommendation system. 基于Node.js的新闻聚合网站, 支持基于用户行为推荐新闻.
Language: HTML - Size: 45.7 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 15 - Forks: 7
SecondDim/crawler-news
Use python scrapy build crawler for real-time Taiwan NEWS website.
Language: Python - Size: 146 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 13 - Forks: 4
atulyakumar97/news-sentiment-analysis
The spider crawls moneycontrol.com and economictimes.com to fetch news of input companies and also scores and classifies the companies to raise an early warning signal
Size: 112 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 16 - Forks: 3
nploi/news_crawler 📦
News crawler là một công cụ giúp bạn có thể crawl dữ liệu của một trang tin tức.
Language: Python - Size: 198 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 13 - Forks: 5
tunahanoguz/news-crawler
News crawler project written in Python.
Language: Python - Size: 97.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
PolunLin/Crawler
Language: Jupyter Notebook - Size: 63.5 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
siristechnology/news-crawler
Config based news crawler using Google Puppeteer
Language: JavaScript - Size: 215 KB - Last synced: 23 days ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2
MoritzGoeckel/NodeJsNewsCrawler
📰 Search engine for news in NodesJS
Language: JavaScript - Size: 515 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 7 - Forks: 4
santhoshse7en/Alcoholics-Anonymous
Research Project to analyse the knowledge about Alcoholics Anonymous in public
Language: Jupyter Notebook - Size: 79.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 2 - Forks: 1