GitHub topics: scrapyd
EasyPi/docker-scrapyd
🕷️ Scrapyd is an application for deploying and running Scrapy spiders.
Language: Dockerfile - Size: 57.6 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 83 - Forks: 22

q-m/scrapyd-k8s
Scrapyd on container infrastructure
Language: Python - Size: 101 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 8

ConlinH/aio-scrapy
Implement scrapy with asyncio
Language: Python - Size: 496 KB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 63 - Forks: 10

my8100/logparser
A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
Language: Python - Size: 172 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 91 - Forks: 25

DormyMo/SpiderKeeper
admin ui for scrapy/open source scrapinghub
Language: Python - Size: 3.62 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 2,763 - Forks: 505

my8100/scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. Docs 文档 :point_right:
Language: Python - Size: 3.05 MB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 3,260 - Forks: 578

mouday/spider-admin-pro
spider-admin-pro 一个集爬虫Scrapy+Scrapyd爬虫项目查看 和 爬虫任务定时调度的可视化管理工具,SpiderAdmin的升级版
Language: Python - Size: 2.82 MB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 587 - Forks: 84

kezhenxu94/house-renting 📦
Possibly the best practice of Scrapy 🕷 and renting a house 🏡
Language: Python - Size: 1.43 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 795 - Forks: 144

my8100/files
Docs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
Size: 16.7 MB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 420 - Forks: 71

bitmakerla/estela
estela, an elastic web scraping cluster 🕸
Language: TypeScript - Size: 4.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 15

crawlab-team/crawlab-lite
Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台
Language: Vue - Size: 2.36 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 224 - Forks: 75

ScrapeOps/scrapeops-scrapy-sdk
Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.
Language: Python - Size: 89.8 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 36 - Forks: 10

jxltom/scrapymon
Simple Web UI for Scrapy spider management via Scrapyd
Language: Python - Size: 2.61 MB - Last synced at: 4 days ago - Pushed at: almost 7 years ago - Stars: 51 - Forks: 11

datawizard1337/ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Language: Python - Size: 14.7 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 88 - Forks: 25

Gerapy/Gerapy
Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
Language: Python - Size: 36.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3,350 - Forks: 644

jxltom/scrapyd-heroku
Wrapper for running Scrapyd in Heroku or locally as a service
Language: Python - Size: 44.9 KB - Last synced at: 4 days ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 36

my8100/scrapyd-cluster-on-heroku
Set up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO :point_right:
Language: Python - Size: 236 KB - Last synced at: 15 days ago - Pushed at: about 5 years ago - Stars: 122 - Forks: 88

abebus/spider-info-webservice
Language: Python - Size: 64.5 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

zubedev/scrapydoo
Scrapy dappy doo crawler for proxy sites
Language: Python - Size: 379 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Tiago-Lira/scrapyd-mongodb
Library designed to replace the SQLite backend by a MongoDB backend on Scrapy queue management
Language: Python - Size: 12.7 KB - Last synced at: about 22 hours ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 9

Dainius-P/scrapyd-dash
Scrapyd Dashboard
Language: CSS - Size: 2.79 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 1

liangWenPeng/scrapy-admin
A django admin site for scrapy
Language: Python - Size: 779 KB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 12

slymit/scrapyq
Scrapyd queue management using Redis.
Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Dog-Egg/scrapy-ui
A web service that manages and schedules Scrapyd. 🕷️
Language: TypeScript - Size: 32.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

harrisbaird/dockerfiles
Various Dockerfiles.
Language: Shell - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 9 - Forks: 0

koneb71/SpiderManager Fork of DormyMo/SpiderKeeper
admin ui for scrapy/open source scrapinghub
Language: Python - Size: 3.64 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

casual-silva/NewsCrawl
狠心开源企业级舆情新闻爬虫项目:支持任意数量爬虫一键运行、爬虫定时任务、爬虫批量删除;爬虫一键部署;爬虫监控可视化; 配置集群爬虫分配策略;👉 现成的docker一键部署文档已为大家踩坑
Language: Python - Size: 15.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 402 - Forks: 119

slymit/scrapyduler
Scrapyd module that schedules scrapy spiders by time.
Language: Python - Size: 7.81 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

speakol-ads/scrapyd-redis Fork of Tiago-Lira/scrapyd-mongodb
Library designed to replace the SQLite backend by a redis backend on Scrapy queue management
Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

WangPei0316/scrapy-zhihu-user
知乎用户爬虫,使用scrapy_redis,scrapyd,gerapy等
Language: Python - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 10 - Forks: 5

baabaaox/ScrapyDouban
豆瓣电影/豆瓣读书 Scarpy 爬虫
Language: Python - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 636 - Forks: 195

aaldaber/Distributed-Multi-User-Scrapy-System-with-a-Web-UI
Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner
Language: Python - Size: 4.97 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 108 - Forks: 46

AzizNadirov/scrapy-monit
scrapy-monit: web app for monitoring, scheduling and managing scrapyd instances.
Language: Python - Size: 246 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ermissa/scrapyd-django-mongodb-setup
Setup project to run Scrapy + Django and save parsed data to MongoDB.
Language: Python - Size: 12.7 KB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

inTheRye/docker-scrapyd
docker platform sample for scrapyd
Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

ammirsm/data-grabber-cnn-twitter
Basic setup to get data from twitter and CNN with a keyword.
Language: JavaScript - Size: 689 KB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

f213/scrapyc 📦
CLI and client library for scrapyd. Done right.
Language: Python - Size: 39.1 KB - Last synced at: about 22 hours ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

dharrisbaird/dailyteedeals_scrapers 📦
A collection of spiders for extracting designs from daily tee websites, including: ShirtPunch, Teefury, Yetee, Qwertee and 40+ other sites.
Language: Python - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

INNOVINATI/scrapretty
A pretty and serverless dashboard for your Scrapyd instances
Language: Vue - Size: 559 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

fliot/ScrapyKeeper Fork of DormyMo/SpiderKeeper
admin ui for scrapy/open source scrapinghub
Language: Python - Size: 4.18 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 56 - Forks: 20

mpszumowski/djangocrawler
start a new (random) life | World Bank data scraper on django rest app
Language: Python - Size: 161 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 3

heidudu/tophub
后端基于Python的Flask和Scrapy,前端基于React,redux,采用docker部署的资讯收集站
Language: Python - Size: 1.25 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 6

fuji44/iocage-plugin-scrapyd
It is an iocage-plugin made to easily use Scrapyd with FreeBSD, TrueNAS, FreeNAS.
Language: Shell - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Benknightdark/ScrapydDashboard
Scrapyd 爬蟲儀表板
Language: TypeScript - Size: 594 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

napoler/scrapyd_docker
scrapyd_docker
Language: Dockerfile - Size: 101 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

my8100/scrapyd-cluster-on-heroku-scrapyd-app
How to set up Scrapyd cluster on Heroku
Language: Python - Size: 26.4 KB - Last synced at: 11 days ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 29

xxl4tomxu98/scrapy_web_crawler
Integration of django and scrapy package for web scraping
Language: Python - Size: 21.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ToonoW/SpiderManager
爬虫管理平台
Language: Python - Size: 62.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 27 - Forks: 10

civilcoder55/imdb-scrapy-app
simple imdb movie scraper, built with python scrapy
Language: Python - Size: 160 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

rnovec/scrapy-template
Scrapy project template
Language: Python - Size: 3.49 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

quicksandznzn/scrapyd_docker
Language: Dockerfile - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

perrornet/SpiderMan
SpiderMan Based on Scrapy, scrapyd, scrapy-API, tornado spider distributed management framework.
Language: Python - Size: 2.51 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 6

idrodriguez/scrapyd-auth
Scrapy daemon running in an authenticated way through nginx
Language: Dockerfile - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rnovec/scrapyd-api
A Node.js wrapper for working with the Scrapyd API
Language: JavaScript - Size: 1.1 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

crawlaio/scrapyd-heroku
在 heroku 上搭建 scrapyd 集群教程
Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 3

shivamacs/scrapjango
A web crawling utility that gets you the links at one place.
Language: Python - Size: 19.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

netflame/docker-scrapyd
Dockerized Scrapyd based on Alpine
Language: Dockerfile - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

my8100/scrapyd-cluster-on-heroku-scrapydweb-app-git
How to set up Scrapyd cluster on Heroku
Language: Python - Size: 36.1 KB - Last synced at: 22 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 21

silenceliang/ptt-crawler-scrapyRedis
ptt-crawler with scrapy-redis framework in python
Language: Python - Size: 5.89 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ChristianYeah/scrapyd-egg-checksum
Extension of scrapyd to get egg's md5 checksum for distributed scrapyd
Language: Python - Size: 5.86 KB - Last synced at: 13 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

my8100/scrapyd-cluster-on-heroku-scrapydweb-app
How to set up Scrapyd cluster on Heroku
Language: Python - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 3

BruceDone/python-scrapyd-api Fork of djm/python-scrapyd-api
A Python wrapper for working with Scrapyd's API.
Language: Python - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

usernamehcx/usernamehcx.github.io
my blog website
Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

perrornet/async_scrapyd_api
异步scrapyd api实现
Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

mushoffa/docker-scrapyd
Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
