An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: scraping-framework

zhuyingda/webster

a reliable high-level web crawling & scraping framework for Node.js.

Language: JavaScript - Size: 181 KB - Last synced at: about 5 hours ago - Pushed at: 5 months ago - Stars: 546 - Forks: 53

omkarcloud/botasaurus

The All in One Framework to Build Undefeatable Scrapers

Language: Python - Size: 63 MB - Last synced at: about 10 hours ago - Pushed at: 4 days ago - Stars: 2,094 - Forks: 189

omkarcloud/botasaurus-starter

🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖

Language: TypeScript - Size: 402 KB - Last synced at: 9 days ago - Pushed at: 17 days ago - Stars: 25 - Forks: 9

pim97/scrappey-wrapper-python

An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)

Language: Python - Size: 107 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 20 - Forks: 0

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

Language: Makefile - Size: 473 KB - Last synced at: 25 days ago - Pushed at: 7 months ago - Stars: 7,060 - Forks: 811

alephdata/memorious

Lightweight web scraping toolkit for documents and structured data.

Language: Python - Size: 1.39 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 312 - Forks: 62

diprog/python-tls-client-async Fork of FlorianREGAZ/Python-Tls-Client

Async fork of Python-TLS-Client with modern asyncio support, updated dependencies, and fixes for issues in the original abandoned library. Includes enhanced compatibility, stability improvements, and ongoing maintenance for Python 3.9–3.13.

Language: Python - Size: 286 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 11 - Forks: 2

flulemon/sneakpeek

Sneakpeek is a framework that helps to quickly and conveniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis.

Language: Python - Size: 19.7 MB - Last synced at: 25 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 0

kennethreitz/requests-html Fork of psf/requests-html

Pythonic HTML Parsing for Humans™

Language: Python - Size: 2.7 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 320 - Forks: 42

rajanrx/php-scrape

A simple, easy to use, scalable scraping framework written in PHP

Language: PHP - Size: 8.8 MB - Last synced at: 18 days ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 4

omkarcloud/selenium-2captcha-recaptcha-solver-demo

🚀 FINAL CODE FOR TUTORIAL ON HOW TO SOLVE CAPTCHA IN SELENIUM USING 2CAPTCHA 🤖

Language: Python - Size: 5.86 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

LearnCodingEasy/Web_Scraping

Web Scraping

Language: Vue - Size: 165 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

omkarcloud/omkar-temp-mail

🚀 OMKAR TEMP MAIL HELPS YOU USE TEMPORARY EMAILS. 🤖

Language: Python - Size: 15.6 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 4

peterbencze/serritor

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

Language: Java - Size: 969 KB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 32 - Forks: 14

mr-mudgal/Amazon-Scrapper

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

Language: Python - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

crawlbase/proxycrawl-php 📦

ProxyCrawl PHP library for scraping and crawling websites

Language: PHP - Size: 34.2 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 5

datagrind-io/amazon-products

Scrape Amazon products

Language: Python - Size: 3.91 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

omkarcloud/dentalkart-scraper

🚀 SCRAPE 1000'S OF PRODUCTS FROM DENTALKART 🤖

Language: Python - Size: 908 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

omkarcloud/web-scraping-template

🚀 THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. 🤖

Language: Python - Size: 104 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 3

umihico/minigun-requests

Web scraping API to outsource tons of GET & xpath to cloud computing

Language: Python - Size: 392 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

Ashis678/ProxyCrawl

Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 0

JimmyLaurent/node-crawling-framework

✨ NodeJs crawling & scraping framework heavily inspired by Scrapy

Language: JavaScript - Size: 227 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 1

DemonMartin/scrappey-wrapper

An API wrapper for Scrappey.com written in Node.js (cloudflare bypass & solver)

Language: JavaScript - Size: 61.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

tor4z/crabs

Scraping framework for Python

Language: Python - Size: 77.1 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

lauramatilda/newsScraping

M.A. Thesis work, news scraping framework/pipeline using python, beautifulsoup, newspaper3k, flask and mongodb with a custom api.

Language: Python - Size: 34.2 KB - Last synced at: 9 days ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

Related Keywords
scraping-framework 25 scraping 18 crawling 13 scraper 12 web-scraping 12 crawler 11 crawling-framework 7 scraping-python 7 selenium 6 web-scraper 6 webscraping 6 web-crawling 6 scraping-websites 6 scraping-tool 6 web-crawler 5 beautifulsoup 4 crawling-python 4 scrapers 3 cloudflare-bypass 3 crawling-tool 3 node-crawler 3 spider 3 python 3 python3 3 headless 2 web-scraping-python 2 python-crawler 2 vue 2 proxycrawl 2 headless-chrome 2 scraping-api 2 automation 2 data-extraction 2 nodejs-framework 2 cloudflare-anti-bot 2 anti-bot 2 captcha-solver 2 captcha-bypass 2 web-scraping-solution 2 captcha 2 mongodb 2 crawlers 2 python-scraper 2 scraping-service 2 scraping-library 2 proxycrawl-api 2 temp-mail 1 tempmail 1 temporary-email 1 crawl 1 data-mining 1 dynamic-webpages 1 dynamic-website 1 extract-data 1 mail-api 1 free-mail 1 disposable-email-addresses 1 disposable-email 1 10minutemail 1 10minute 1 web 1 vuejs 1 vscode 1 vite 1 scraping-web 1 scraping-data 1 django-rest-framework 1 django-framework 1 django 1 flask 1 scrapy-crawler 1 website-scraping-tool 1 website-data-extraction 1 web-data-extraction 1 turnstile-solver 1 scraping-solution 1 data-scraping-tool 1 cloudflare-solver 1 api-scraping 1 scrapy 1 middleware 1 elasticsearch 1 dentalkart-scraping 1 dentalkart-scraper 1 dentalkart-product-scraper 1 dentalkart 1 datagrind 1 amazon-scraping 1 web-scrapping 1 web-scraping-software 1 csv-export 1 csv 1 amazon 1 webspider 1 selenium-crawler 1 java 1 information-retrieval 1 information-extraction 1 framework 1 captcha-recaptcha 1