An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: scraping-framework

zhuyingda/webster

a reliable high-level web crawling & scraping framework for Node.js.

Language: JavaScript - Size: 181 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 527 - Forks: 57

omkarcloud/botasaurus

The All in One Framework to Build Undefeatable Scrapers

Language: Python - Size: 62.9 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 1,946 - Forks: 165

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

Language: Makefile - Size: 473 KB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 7,016 - Forks: 809

pim97/scrappey-wrapper-python

An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)

Language: Python - Size: 98.6 KB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 20 - Forks: 0

flulemon/sneakpeek

Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis

Language: Python - Size: 19.7 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 0

kennethreitz/requests-html Fork of psf/requests-html

Pythonic HTML Parsing for Humansβ„’

Language: Python - Size: 2.7 MB - Last synced at: 24 days ago - Pushed at: 12 months ago - Stars: 320 - Forks: 42

alephdata/memorious

Lightweight web scraping toolkit for documents and structured data.

Language: Python - Size: 1.39 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 310 - Forks: 62

omkarcloud/botasaurus-starter

πŸš€ OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK πŸ€–

Language: TypeScript - Size: 397 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 8

omkarcloud/selenium-2captcha-recaptcha-solver-demo

πŸš€ FINAL CODE FOR TUTORIAL ON HOW TO SOLVE CAPTCHA IN SELENIUM USING 2CAPTCHA πŸ€–

Language: Python - Size: 5.86 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 2

LearnCodingEasy/Web_Scraping

Web Scraping

Language: Vue - Size: 165 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

omkarcloud/omkar-temp-mail

πŸš€ OMKAR TEMP MAIL HELPS YOU USE TEMPORARY EMAILS. πŸ€–

Language: Python - Size: 15.6 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 4

peterbencze/serritor

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

Language: Java - Size: 969 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 32 - Forks: 15

mr-mudgal/Amazon-Scrapper

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

crawlbase/proxycrawl-php πŸ“¦

ProxyCrawl PHP library for scraping and crawling websites

Language: PHP - Size: 34.2 KB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 5

datagrind-io/amazon-products

Scrape Amazon products

Language: Python - Size: 3.91 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

omkarcloud/dentalkart-scraper

πŸš€ SCRAPE 1000'S OF PRODUCTS FROM DENTALKART πŸ€–

Language: Python - Size: 908 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

omkarcloud/web-scraping-template

πŸš€ THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. πŸ€–

Language: Python - Size: 104 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 3

umihico/minigun-requests

Web scraping API to outsource tons of GET & xpath to cloud computing

Language: Python - Size: 392 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

Ashis678/ProxyCrawl

Size: 23.4 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 44 - Forks: 0

rajanrx/php-scrape

A simple, easy to use, scalable scraping framework written in PHP

Language: PHP - Size: 8.8 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 4

JimmyLaurent/node-crawling-framework

✨ NodeJs crawling & scraping framework heavily inspired by Scrapy

Language: JavaScript - Size: 227 KB - Last synced at: 11 days ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 1

DemonMartin/scrappey-wrapper

An API wrapper for Scrappey.com written in Node.js (cloudflare bypass & solver)

Language: JavaScript - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

tor4z/crabs

Scraping framework for Python

Language: Python - Size: 77.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lauramatilda/newsScraping

M.A. Thesis work, news scraping framework/pipeline using python, beautifulsoup, newspaper3k, flask and mongodb with a custom api.

Language: Python - Size: 34.2 KB - Last synced at: 6 days ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

Related Keywords
scraping-framework 24 scraping 17 crawling 13 scraper 12 crawler 11 web-scraping 11 scraping-python 7 crawling-framework 7 scraping-tool 6 webscraping 6 web-scraper 6 web-crawling 6 scraping-websites 6 selenium 6 web-crawler 5 beautifulsoup 4 crawling-python 4 cloudflare-bypass 3 crawling-tool 3 scrapers 3 python3 3 node-crawler 3 spider 3 vue 2 python 2 crawlers 2 web-scraping-solution 2 scraping-service 2 scraping-library 2 headless 2 proxycrawl 2 python-crawler 2 proxycrawl-api 2 captcha 2 captcha-solver 2 data-extraction 2 cloudflare-anti-bot 2 captcha-bypass 2 web-scraping-python 2 python-scraper 2 headless-chrome 2 mongodb 2 scraping-api 2 nodejs-framework 2 vuejs 1 web 1 10minute 1 10minutemail 1 information-extraction 1 framework 1 disposable-email 1 extract-data 1 disposable-email-addresses 1 free-mail 1 mail-api 1 temp-mail 1 tempmail 1 temporary-email 1 automation 1 crawl 1 data-mining 1 dynamic-webpages 1 dynamic-website 1 flask 1 scrapy-crawler 1 website-scraping-tool 1 website-data-extraction 1 web-data-extraction 1 turnstile-solver 1 scraping-solution 1 data-scraping-tool 1 cloudflare-solver 1 api-scraping 1 scrapy 1 middleware 1 elasticsearch 1 scrape 1 php 1 dentalkart-scraping 1 dentalkart-scraper 1 dentalkart-product-scraper 1 dentalkart 1 datagrind 1 amazon-scraping 1 web-scrapping 1 web-scraping-software 1 csv-export 1 csv 1 amazon 1 webspider 1 selenium-crawler 1 java 1 information-retrieval 1 perimetex 1 incapsula 1 datadome 1 anti-bot-api 1 akamai 1 captcha-recaptcha 1 undetected-chromedriver 1