Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: news-crawler

lewisdonovan/google-news-scraper

Lightweight scraper for Google News

Language: JavaScript - Size: 285 KB - Last synced: 3 days ago - Pushed: 7 days ago - Stars: 183 - Forks: 54

fhamborg/news-please

news-please - an integrated web crawler and information extractor for news that just works

Language: Python - Size: 2.98 MB - Last synced: 17 days ago - Pushed: 5 months ago - Stars: 1,937 - Forks: 403

karimhabush/TheguardianScrapper

A Scrapy webscraper that can scrape and store articles of theguardian.com

Language: Python - Size: 17.6 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 2

lumyjuwon/KoreaNewsCrawler

A korean news crawler built to ingest large amounts of news data.

Language: Python - Size: 1.38 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 217 - Forks: 104

aufamiri/berita-crawler

a web crawler to take all the latest indonesian news from many sources

Language: Python - Size: 84 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

sunight1999/news-crawler

Naver and Daum news web crawler via JSoup + Selenium.

Language: Java - Size: 9.61 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

adbar/trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Language: Python - Size: 23.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2,688 - Forks: 205

flairNLP/fundus

A very simple news crawler with a funny name

Language: Python - Size: 14.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 38 - Forks: 5

divkakwani/webcorpus

Generate large textual corpora for almost any language by crawling the web

Language: Python - Size: 44.9 MB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 7 - Forks: 8

AndyTheFactory/article-extraction-dataset

Article title, authors, date and body extraction dataset.

Language: HTML - Size: 31.9 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

thinh-vu/vnnews

A Python package that helps capture news updates from top Vietnamese news sites

Language: Jupyter Notebook - Size: 90.8 KB - Last synced: 18 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1

johnbumgarner/newshound

This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.

Size: 28.3 KB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 29 - Forks: 3

sakshamssr/GNews-API

A Fast and lightweight Python API that search for articles on Google News and returns a JSON response.

Language: Python - Size: 8.18 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 2 - Forks: 0

arian-askari/persian_news_websites_crawler

Crawler (Scraper) for several well-known persian news for scraping public data

Language: Python - Size: 23.4 KB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

codyle50/review-pundits

Language: JavaScript - Size: 29.6 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

LuChang-CS/news-crawler

A news crawler for BBC News, Reuters and New York Times.

Language: Python - Size: 46.9 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 93 - Forks: 39

AmaanHaider/News-crawler

Language: JavaScript - Size: 3.71 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

andymithamclarke/Pundits-Review

11/09/2020 - Complete directory for Pundits Review web application. https://www.punditsreview.com/

Language: JavaScript - Size: 29.6 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

stardust95/NewsFeeds

Newsfeeds website using nodejs as server and mongo as storage backends, including a simple recommendation system. 基于Node.js的新闻聚合网站, 支持基于用户行为推荐新闻.

Language: HTML - Size: 45.7 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 15 - Forks: 7

SecondDim/crawler-news

Use python scrapy build crawler for real-time Taiwan NEWS website.

Language: Python - Size: 146 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 13 - Forks: 4

atulyakumar97/news-sentiment-analysis

The spider crawls moneycontrol.com and economictimes.com to fetch news of input companies and also scores and classifies the companies to raise an early warning signal

Size: 112 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 16 - Forks: 3

nploi/news_crawler 📦

News crawler là một công cụ giúp bạn có thể crawl dữ liệu của một trang tin tức.

Language: Python - Size: 198 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 13 - Forks: 5

tunahanoguz/news-crawler

News crawler project written in Python.

Language: Python - Size: 97.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

PolunLin/Crawler

Language: Jupyter Notebook - Size: 63.5 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

siristechnology/news-crawler

Config based news crawler using Google Puppeteer

Language: JavaScript - Size: 215 KB - Last synced: 23 days ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2

MoritzGoeckel/NodeJsNewsCrawler

📰 Search engine for news in NodesJS

Language: JavaScript - Size: 515 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 7 - Forks: 4

santhoshse7en/Alcoholics-Anonymous

Research Project to analyse the knowledge about Alcoholics Anonymous in public

Language: Jupyter Notebook - Size: 79.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 2 - Forks: 1