GitHub topics: scrapy
casangi/casadocs
Common Astronomy Software Applications Documentation
Language: Python - Size: 47.4 MB - Last synced at: 33 minutes ago - Pushed at: about 1 hour ago - Stars: 12 - Forks: 10

Data-Wrangling-and-Visualisation/JobHack
Understanding job market
Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: about 5 hours ago - Pushed at: about 6 hours ago - Stars: 0 - Forks: 0

doforce/github-trending
GitHub trending repositories and developers APIs for real time, powered by crawlers | 通过爬虫获取 GitHub 热门项目和开发者的实时 API
Language: Python - Size: 102 KB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 61 - Forks: 24

dev-chenxing/learning-notes
📝 Programming Guide and Learning Notes, web scraping, frontend development, handheld gaming | 爬虫,网页前端,游戏机配置. Built with Jekyll
Language: SCSS - Size: 9.22 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

iaoongin/GachaClock
卡池倒计时。支持查看崩坏 星穹铁道,绝区零,鸣潮卡池信息
Language: TypeScript - Size: 46 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

rmax/scrapy-redis
Redis-based components for Scrapy.
Language: Python - Size: 228 KB - Last synced at: about 5 hours ago - Pushed at: 10 months ago - Stars: 5,590 - Forks: 1,586

navs-svan/nba-dashboard
PowerBI Dashboard using scraped box score data of the NBA Season 2024-25
Language: Python - Size: 7.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

xuexingdong/databox
A databox setup with scrapy
Language: JavaScript - Size: 11.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3 - Forks: 0

rheaacharya77/foodmandu-scraper
Foodmandu Scraper extracts restaurant information such as restaurant URLs, images,names,addresses,and cuisines.
Language: Python - Size: 1.05 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

sazzadhossainmilon/got-scraping-client
got-scraping-client is a lightweight and efficient tool for web scraping tasks using the popular Got HTTP client. It simplifies data extraction from web pages by providing a straightforward API and built-in support for handling common challenges like pagination and rate limiting.
Language: TypeScript - Size: 24.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Solihatun1/AI-Cursor-Scraping-Assistant
A powerful tool that leverages Cursor AI and MCP (Model Context Protocol) to easily generate web scrapers for various types of websites.
Language: Python - Size: 16.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

elacuesta/scrapy-pyppeteer 📦
Pyppeteer integration for Scrapy
Language: Python - Size: 219 KB - Last synced at: about 6 hours ago - Pushed at: about 4 years ago - Stars: 58 - Forks: 13

Harishwarrior/movie_scrap_backend
Python Scrapy script to scrap magnet links and titles from Tamilrockers site.
Language: Python - Size: 40 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

fuunshi/ShareSansarDataScrape
Daily auto scrapping of Share price form Share Sansar
Language: Python - Size: 5.65 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

ihandmine/aioscpy
An asyncio + aiolibs crawler imitate scrapy framework
Language: Python - Size: 1.69 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 125 - Forks: 10

City-Bureau/city-scrapers-det
City Scrapers project for Detroit
Language: Python - Size: 1.06 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4 - Forks: 5

City-Bureau/city-scrapers-cle
City Scrapers project for Cleveland
Language: Python - Size: 2.11 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16 - Forks: 15

City-Bureau/city-scrapers-akr
City Scrapers project for Akron
Language: Python - Size: 2.96 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 1

honzajavorek/czap
Scraping czap.cz data so you can filter available psychotherapists by any criteria you wish
Language: Python - Size: 2.65 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

RaccoonOnion/job-scraping
A job post processing pipeline built with Scrapy (scrapping), MongDB (storage) and Redis (Deduplication). Containerized and easy to run & deploy!
Language: Python - Size: 190 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

vieyahn2017/pypy
python trial collections
Language: Python - Size: 13.9 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 1

alltheplaces/alltheplaces
A set of spiders and scrapers to extract location information from places that post their location on the internet.
Language: Python - Size: 32.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 695 - Forks: 227

proxymesh/scrapy-proxy-headers
Handle custom proxy headers when making HTTPS requests through proxies in scrapy
Language: Python - Size: 8.79 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

crawlab-team/crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Language: Go - Size: 23.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11,717 - Forks: 1,829

aleehamza25/AutoJobTracker
JobTrackr: Automated Job Scraping & Reporting Tool
Language: Python - Size: 10.7 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

NguyenDa18/Portland-Jail-Data-Crawler
Scraper used for recording changes to Portland jail database
Language: Jupyter Notebook - Size: 37.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 0

salimt/Transfermarkt-ETL-and-LIVE-Scores
asyncIO, Github Actions, GCP, dbt, Terraform, Docker
Language: Python - Size: 105 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

schmwong/APAC-McDelivery-Menu-Logger
Automatically scrapes McDelivery menu data and records it for future visualisation projects
Language: Python - Size: 36.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 2

Erkmik/best-python-html-parsers
The top Python HTML parsers for web scraping, including Beautiful Soup, lxml, PyQuery, Scrapy, and more.
Size: 6.84 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

pi-2r/devoxxfr2025-tock-studio-IA-Gen
Projet issu du codelab Devoxx France 2025 “À la recherche du RAG perdu” : atelier de 3h pour apprendre à créer un chatbot IA Générative autonome, local et sans Internet, basé uniquement sur des frameworks open source
Language: Python - Size: 54.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 1

GregoryHue/duck-or-cat
Duck or Cat is a binary classification model. It classifies pictures of ducks and cats.
Language: Jupyter Notebook - Size: 210 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

Rogendo/Web-Scraping
This repository contains automation dockerized and undockerized scripts that scrape various websites.
Language: Jupyter Notebook - Size: 6.23 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

hericlibong/worldPressPhotoGalery
web application designed for photojournalism enthusiasts with Scrapy and Django
Language: Python - Size: 49.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

irineulucas/SentimenTA
SentimenTA is a sentiment analysis tool designed to analyze and interpret emotions and opinions from textual data. It utilizes natural language processing techniques to provide insights into the sentiment behind the text.
Size: 1000 Bytes - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

mtmarco87/betparser_crawler
BetParser Crawler: A Python tool for extracting and standardizing betting odds from websites, using Scrapy, Selenium, machine learning, and Firebase for real-time updates.
Language: Python - Size: 36.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

TikHub/TikHub-API-Python-SDK
High-performance asynchronous Douyin(抖音) TikTok Xiaohongshu(小红书) Kuaishou(快手) Weibo(微博) Instagram YouTube(油管) Twitter(X) Captcha Solver(验证码解决器) Temp Mail(临时邮箱) API(接口).
Language: Python - Size: 2.05 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 445 - Forks: 53

zehengl/scrapy-indeed-company-reviews
A scrapy app to crawl company reviews from Indeed
Language: Python - Size: 63.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 1

zehengl/scrapy-darksky-api
A scrapy app to crawl weather data from Dark Sky Api
Language: Python - Size: 6.84 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

LuckyZXL2016/Movie_Recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Language: Java - Size: 55.1 MB - Last synced at: 7 days ago - Pushed at: about 6 years ago - Stars: 2,915 - Forks: 1,050

DonBenn/parser_scrapy
Language: Python - Size: 37.1 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

VictorClvtt/mercado_livre_web_scraping_etl
Language: Python - Size: 33.2 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

perfectbullet/zj_spider
zj爬虫
Language: Python - Size: 1.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

george-gca/ai_papers_scrapper
Download papers pdfs and other info from main AI conferences
Language: Python - Size: 164 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 21 - Forks: 2

wkunzhi/Python3-Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Language: Python - Size: 41.6 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 3,167 - Forks: 1,024

eracle/linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Language: Python - Size: 681 KB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 868 - Forks: 136

drogbadvc/crawlit
This project is a web crawler based on Scrapy, visualization 2D, PageRank
Language: Python - Size: 1.6 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 10

Time1ess/ProxyPool
A ProxyPool based on Scrapy and Redis(基于Scrapy和Redis的代理池)
Language: Python - Size: 1.48 MB - Last synced at: about 18 hours ago - Pushed at: almost 8 years ago - Stars: 20 - Forks: 9

istresearch/scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Language: Python - Size: 28 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1,200 - Forks: 322

andros21/pgrank
pgrank - cpp app for computing pagerank
Language: C++ - Size: 675 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

jonbakerfish/TweetScraper
TweetScraper is a simple crawler/spider for Twitter Search without using API
Language: Python - Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: about 4 years ago - Stars: 1,035 - Forks: 313

rajputpriyankaa/scrapy
Web scraping with Scrapy
Language: Python - Size: 18.6 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

18520339/web-scraping-with-scrapy
Python web scraping with Scrapy
Language: Python - Size: 479 KB - Last synced at: about 7 hours ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

hellock/icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Language: Python - Size: 282 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 879 - Forks: 178

EasyPi/docker-scrapyd
🕷️ Scrapyd is an application for deploying and running Scrapy spiders.
Language: Dockerfile - Size: 57.6 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 83 - Forks: 22

buildwithtract/planning-applications
Scrape planning applications from local planning authorities in the UK
Language: Python - Size: 3.02 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 9 - Forks: 5

alanchn31/Data-Engineering-Projects
Personal Data Engineering Projects
Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 921 - Forks: 203

q-m/scrapyd-k8s
Scrapyd on container infrastructure
Language: Python - Size: 101 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 14 - Forks: 8

ConlinH/aio-scrapy
Implement scrapy with asyncio
Language: Python - Size: 496 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 63 - Forks: 10

pixelomo/GPTNews-template
Scrape news and GPT generates translated articles
Language: Python - Size: 108 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

DropsDevopsOrg/ECommerceCrawlers
实战🐍多种网站、电商数据爬虫🕷。包含🕸:淘宝商品、微信公众号、大众点评、企查查、招聘网站、闲鱼、阿里任务、博客园、微博、百度贴吧、豆瓣电影、包图网、全景网、豆瓣音乐、某省药监局、搜狐新闻、机器学习文本采集、fofa资产采集、汽车之家、国家统计局、百度关键词收录数、蜘蛛泛目录、今日头条、豆瓣影评、携程、小米应用商店、安居客、途家民宿❤️❤️❤️。微信爬虫展示项目:
Language: Python - Size: 7.58 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 4,985 - Forks: 1,378

tb0hdan/domains
World’s single largest Internet domains dataset
Language: HTML - Size: 1.68 GB - Last synced at: 8 days ago - Pushed at: 23 days ago - Stars: 761 - Forks: 122

lining0806/PythonSpiderNotes
Python入门网络爬虫之精华版
Language: Python - Size: 7 MB - Last synced at: 11 days ago - Pushed at: almost 4 years ago - Stars: 7,134 - Forks: 2,175

desiquant/news_scraper
Scrapes indian market news data
Language: Python - Size: 194 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 5 - Forks: 3

Ntrashh/smallder
Small Simple Lightweight Spider Framework
Language: Python - Size: 129 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 8 - Forks: 0

ispras/scrapy-puppeteer
Library that helps use puppeteer in scrapy.
Language: Python - Size: 342 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 52 - Forks: 4

my8100/logparser
A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
Language: Python - Size: 172 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 91 - Forks: 25

gridaco/figma-archives
Figma Files Scraper for Research & Studies
Language: Python - Size: 103 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

scrapy-plugins/scrapy-playwright
🎭 Playwright integration for Scrapy
Language: Python - Size: 982 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 1,144 - Forks: 129

bytebuff/JSpider
JSpider会每周更新至少一个网站的JS解密方式,欢迎 Star,交流微信:13298307816
Language: JavaScript - Size: 576 KB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 1,090 - Forks: 240

chinesehuazhou/ScrapyProject
Scrapy项目(mysql+mongodb豆瓣top250电影)
Language: Python - Size: 62.5 KB - Last synced at: 11 days ago - Pushed at: almost 8 years ago - Stars: 22 - Forks: 7

SpiderClub/haipproxy
:sparkling_heart: High available distributed ip proxy pool, powerd by Scrapy and Redis
Language: Python - Size: 1.16 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 5,471 - Forks: 914

Boris-code/feapder
🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单,功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬虫解决不同场景的需求。且支持断点续爬、监控报警、浏览器渲染、海量数据去重等功能。更有功能强大的爬虫管理系统feaplat为其提供方便的部署及调度
Language: Python - Size: 1.48 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 3,202 - Forks: 507

nghuyong/WeiboSpider
持续维护的新浪微博采集工具🚀🚀🚀
Language: Python - Size: 15.6 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 3,796 - Forks: 839

rl1987/trickster.dev
Language: HTML - Size: 481 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 5

juancarlospaco/faster-than-requests
Faster requests on Python 3
Language: Nim - Size: 20.3 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 1,115 - Forks: 91

DormyMo/SpiderKeeper
admin ui for scrapy/open source scrapinghub
Language: Python - Size: 3.62 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 2,763 - Forks: 505

gabriel-nds/WebScrapingGloboEsporte
Web scraping tool built using Scrapy and Selenium to extract sports news articles from ge.globo.com
Language: Python - Size: 38.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

chyroc/WechatSogou
基于搜狗微信搜索的微信公众号爬虫接口
Language: Python - Size: 4.12 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 6,033 - Forks: 1,715

gaalcaras/mailingListScraper
A python web scraper for public email lists.
Language: Python - Size: 140 KB - Last synced at: 3 days ago - Pushed at: almost 7 years ago - Stars: 33 - Forks: 13

ispras/scrapy-puppeteer-service
A special service that runs puppeteer instances.
Language: JavaScript - Size: 355 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 16 - Forks: 5

Xcrap-Cloud/transformer
Language: TypeScript - Size: 138 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

Lan-ce-lot/pythorch-text-classification
对豆瓣影评进行文本分类情感分析,利用爬虫豆瓣爬取评论,进行数据清洗,分词,采用BERT、CNN、LSTM等模型进行训练,采用tensorboardX可视化训练过程,自然语言处理项目\A project for text classification, based on torch 1.7.1
Language: Python - Size: 1.58 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 145 - Forks: 9

joaopauloaramuni/python
Repo Python
Language: Python - Size: 150 MB - Last synced at: 11 days ago - Pushed at: 13 days ago - Stars: 44 - Forks: 1

Xcrap-Cloud/got-scraping-client
Language: TypeScript - Size: 53.7 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

windrises/dialogue.moe
Language: Python - Size: 1.02 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 324 - Forks: 8

librauee/Reptile
🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
Language: Python - Size: 7.08 MB - Last synced at: 13 days ago - Pushed at: about 4 years ago - Stars: 1,652 - Forks: 516

my8100/scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. Docs 文档 :point_right:
Language: Python - Size: 3.05 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 3,260 - Forks: 578

mouday/spider-admin-pro
spider-admin-pro 一个集爬虫Scrapy+Scrapyd爬虫项目查看 和 爬虫任务定时调度的可视化管理工具,SpiderAdmin的升级版
Language: Python - Size: 2.82 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 587 - Forks: 84

kezhenxu94/house-renting 📦
Possibly the best practice of Scrapy 🕷 and renting a house 🏡
Language: Python - Size: 1.43 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 795 - Forks: 144

eliasdabbas/advertools
advertools - online marketing productivity and analysis tools
Language: Python - Size: 23 MB - Last synced at: 12 days ago - Pushed at: 22 days ago - Stars: 1,213 - Forks: 226

scrapy-plugins/scrapy-splash
Scrapy+Splash for JavaScript integration
Language: Python - Size: 331 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 3,195 - Forks: 457

QianyanTech/Image-Downloader
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
Language: Python - Size: 24.6 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 2,281 - Forks: 577

City-Bureau/city-scrapers
Scrape, standardize and share public meetings from local government websites
Language: HTML - Size: 11.2 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 350 - Forks: 312

marc7666/PRAC1-TCVD-Web-scraping
PRAC1 of the subject "Data typology and life cycle" of the MSc in Data Science at Universitat oberta de Catalunya
Size: 240 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 1

moyada/stealer
抖音、快手、火山、皮皮虾,视频去水印程序
Language: Python - Size: 3.89 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 1,033 - Forks: 286

TeamHG-Memex/scrapy-rotating-proxies
use multiple proxies with Scrapy
Language: Python - Size: 44.9 KB - Last synced at: 14 days ago - Pushed at: almost 3 years ago - Stars: 756 - Forks: 161

bradchow/department_store_brands
Climb out of all the brands and floors inside the official websites of major department stores in Taiwan
Language: Python - Size: 2.69 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

scrapy/itemadapter
Common interface for data container classes
Language: Python - Size: 270 KB - Last synced at: 13 days ago - Pushed at: 28 days ago - Stars: 66 - Forks: 12

open-news-brasil/open-news
📰 Coleta automatizada das últimas notícias das cidades brasileiras
Language: Python - Size: 2.54 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 16 - Forks: 2

m-niemiec/captcha_solving_service
Captcha Solving Service is a mock-up SaaS that allows users to send their captcha images and receive solutions in simple text format. Project is divided into 4 parts. Scraping datasets for machine learning models, GUI for renaming collected images, captcha solving OCR and captcha solving API.
Language: Python - Size: 61.9 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 3
