An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: scraper-engine

metatube-community/metatube-sdk-go

MetaTube SDK & API Server in Golang

Language: Go - Size: 9.76 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 704 - Forks: 144

fredwu/crawler

A high performance web crawler / scraper in Elixir.

Language: Elixir - Size: 385 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 959 - Forks: 90

santhoshse7en/news-fetch

A Python package that helps scrape news details from any news website

Language: Python - Size: 59.6 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 214 - Forks: 110

notFaad/msl-engine

A powerful web scraping engine that uses a custom Domain Specific Language (DSL) to define scraping pipelines.

Language: Rust - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Machine-Learning-Labs/GathererImageGatherer (fork of devonmurphy/GathererImageGatherer)

Builds a database of Magic: The Gathering card images with card name, set, and perceptual hash of artwork.

Language: Python - Size: 18.7 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

okey-dokie/eitaa-extractor

Eitaa Extractor is a Go web scraper that efficiently extracts posts from Eitaa channels, saving details to JSON files. 🚀🛠️

Language: Go - Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

BaseMax/eitaa-extractor

Eitaa Extractor is a Go-based web scraper designed to extract posts from Eitaa channels. It retrieves post details such as text, images, videos, timestamps, and metadata (e.g., forwarded or reply information) and saves them to a JSON file. The project includes Docker and Docker Compose configurations for easy deployment.

Language: Go - Size: 15.6 KB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

warifp/Shopee-Scrape

Shopee Scrape is a tool for collecting the data you need, such as photos, prices, names, store locations, and more.

Language: PHP - Size: 559 KB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 96 - Forks: 27

TufayelLUS/LinkedIn-Scraper

A LinkedIn scraper that scrapes up to 1k LinkedIn profiles (due to a LinkedIn limit) from company profile links and saves their e-mail addresses if available. (Actively maintained; if anything doesn't work, open an issue in the repo.)

Language: Python - Size: 12.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 235 - Forks: 64

fozbek/scrawler

Simple, schema-based scraping tool

Language: PHP - Size: 76.2 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 0

Pieter79/LinkedIn-Scraper-Windows-Mac-and-Linux

LinkedIn scraper for Windows, Mac, and Linux. Speed (on my home PC) is about 1 million URLs per hour.

Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

scrape-do/node-client

Scrape.do's official HTTP client for Node.js

Language: TypeScript - Size: 129 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 6 - Forks: 0

flulemon/sneakpeek

Sneakpeek is a framework that helps to quickly and conveniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis.

Language: Python - Size: 19.7 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 37 - Forks: 0

RahulSDevloper/GoSearch-Search-Engine-Scraper

Get search results from Google, Bing, DuckDuckGo, etc. easily using GoSearch

Language: Go - Size: 221 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

fernandod1/ProductHunt-scraper

Scraper script for the well-known website Producthunt.com. Scrapes all offers and saves them to an Excel spreadsheet file.

Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 9

1x-eng/chart_data_extractor

This web service helps scrape data out of charts presented on any given website. (At the moment, I only support scraping from Highcharts and amCharts. Other libraries, maybe next time.)

Language: Python - Size: 79.1 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 10 - Forks: 9

TufayelLUS/LinkedIn-CV-Downloader

A Python-based GUI automation tool for bulk-downloading LinkedIn CVs / LinkedIn resumes from a list of profile links

Language: Python - Size: 33.6 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

labteral/bluebird 📦

Unofficial Python client for Twitter

Language: Python - Size: 112 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 43 - Forks: 14

obetomuniz/tatooine

A powerful scraper for JavaScript Developers.

Language: TypeScript - Size: 1.76 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 86 - Forks: 3

philipperemy/japanese-street-addresses-scraper

Scraper for Japanese street addresses (住所).

Language: Python - Size: 7.02 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 2

serping/express-scraper

Language: HTML - Size: 1.25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

MuhammadAhmed-0/search-scraper

This Python-based tool scrapes Google search results and presents the top 10 results along with their URLs. Useful for SEO optimization and content writing.

Language: Python - Size: 20.5 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 3

MrSentex/0day.today-API

Unofficial API for 0day.today database | Supported languages: Python and PHP

Language: PHP - Size: 17 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 13 - Forks: 8

Vitcesar/plugin.video.anitube

Add-on for AniTube, a well-known site that offered a huge library of anime to watch online for free.

Language: Python - Size: 17 MB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 3

breakpointninja/spiderjuice

Language: Python - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

DaveSimoes/Web-Scraping

The goal of this project is to provide a basic structure for web scraping HTML pages and collecting specific data. The main script (main.py) initializes a WebScraper object and calls the scrape() method to collect data from a specific URL.

Language: Python - Size: 7.81 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

vinayakmp007/Car-folio-scraper

Scrapes car details from carfolio.com to an XML file

Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

kingzbauer/scraperlang

A DSL aimed at making writing web scrapers/crawlers a breeze

Language: Go - Size: 1.77 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

lukluk/scraper-engine

Async scraper framework based on Node.js

Language: JavaScript - Size: 2.85 MB - Last synced at: about 2 months ago - Pushed at: over 9 years ago - Stars: 11 - Forks: 7

animeshkundu/pyscrape

Lightweight headless web scraper in Python with JavaScript rendering and an HTTP API

Language: Python - Size: 9.77 KB - Last synced at: 16 days ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 0

fazxid/shopee-product-scraper

Scrape data & images from the Shopee marketplace

Language: JavaScript - Size: 17.6 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

techieforfun/web-scraper

🔬 Scraper for the web

Language: PHP - Size: 261 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

dasileker/scraper_project

This is a capstone project for the Microverse Ruby program: a scraper that searches for any torrent name you enter.

Language: Ruby - Size: 22.1 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

RBEGamer/OP1.FUN-SCRAPER

Scrapes all packages from the Op1.fun site

Language: Python - Size: 32.2 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

vreddi/DotaScraper

🚽 Scraper for Dota

Language: C# - Size: 133 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

seralexger/filmaffinity-scraper

Unofficial class for scraping and collecting info from FilmAffinity

Language: Python - Size: 14.6 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 2

soulomoon/HotelScraper

A very slow hotel scraper for Airbnb and Booking, using Selenium and beautifulsoup4

Language: Python - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0