GitHub topics: pyquery
Erkmik/best-python-html-parsers
The top Python HTML parsers for web scraping, including Beautiful Soup, lxml, PyQuery, Scrapy, and more.
Size: 6.84 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

psf/requests-html
Pythonic HTML Parsing for Humans™
Language: Python - Size: 2.84 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 13,808 - Forks: 988

shengqiangzhang/examples-of-web-crawlers
Some interesting Python crawler examples that are friendly to beginners, mainly scraping Taobao, Tmall, WeChat, WeChat Read, Douban, QQ, and other sites.
Language: Python - Size: 233 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 14,151 - Forks: 3,831

lleans/lyricfind-scrapper
Simple API scraper for LyricFind 🎹
Language: Python - Size: 42 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 7 - Forks: 1

foggyspace/MSpider
Crawler template for personal use
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

oxylabs/parse-html-pyquery
Learn to parse HTML using PyQuery, a Python library for web scraping and manipulating HTML.
Language: Python - Size: 19.5 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

ArkaitzUlibarri/ewrc-results
Scraping of ewrc-results.com
Language: Python - Size: 140 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 2

christabor/job_scraper
[NOT MAINTAINED] Job/Occupation data scraping tool that uses scrapy and pyquery
Language: Python - Size: 290 KB - Last synced at: 24 days ago - Pushed at: almost 8 years ago - Stars: 10 - Forks: 2

shivammathur/TwitterScraper
Twitter Scraper - Scrape tweets for a user or a #hashtag.
Language: Python - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 5

b09780978/crawler
My crawler docker file
Language: Dockerfile - Size: 102 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

flyyang/medical-news
Spider for Chinese medical websites with WeChat notifications enabled
Language: Python - Size: 90.8 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 29 - Forks: 10

terokarvinen/hoto
Extract HTML tags and metadata, optionally rename files. Supports MAFF as used by WebScrapbook.
Language: Python - Size: 39.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

samuraitruong/py-scrapper
A Python application to scrape and clone static websites
Language: Python - Size: 26.4 KB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

samuraitruong/fifa-ranking-data
Simple Python script to collect FIFA ranking data and publish it as JSON and CSV
Language: Python - Size: 10.9 MB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

gyje/tieba_qiandao
Ultra-fast Baidu Tieba sign-in: 200 forums in 1.6 seconds, simple and direct
Language: Python - Size: 16.6 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 1

ryyos/CNN-scraping.PY
CNN Website Scraping Using Python
Language: Python - Size: 29.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

qiaofei32/yunCrawler
Intelligent cloud crawler demo
Language: HTML - Size: 7.11 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 32 - Forks: 17

yodhcn/dlsite-doujin-renamer
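DLsite doujin work renaming tool: scrapes information such as "title" and "circle" from dlsite.com by RJ|VJ|BJ number, batch-renames folders according to a custom template, and sets each folder's cover image to the work's cover.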
Language: Python - Size: 518 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 206 - Forks: 17

wallacesilva/webscraper-alternativa-empregos
Fetches job listings from the Alternativa Empregos site (http://empregos.alternativa.co.jp/)
Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

nelttjen/kakao_parser
Freelance order. Goal: download selected chapters from Kakao automatically
Language: Python - Size: 5.63 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

zhangshyue/scrapy-webscrapper-for-bbcnews
A web scraper that can scrape BBC News using Scrapy and MongoDB
Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0

Rohith-2/url_classification_dl
URL Feature extraction and Engineering aided with Classification via Neural Networks
Language: Python - Size: 30.9 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 10

joeyprettyman/python-scripts
Python scripts I've written in the past.
Language: Python - Size: 2.12 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

hellowac/pyquery-doc-zh
Chinese translation of the documentation for the Python package pyquery
Language: HTML - Size: 1.33 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

nelttjen/telega_manga_bot
First freelance order. Goal: a Telegram notifier for when a new product is released on either of two sites
Language: Python - Size: 5.67 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nelttjen/fetch_sites_bot
Freelance order. Goal: get all products that have more items than the customer's site
Language: Python - Size: 13.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

dateolive/python-crawler
Crawler learning repository, suitable for people with no prior experience and friendly to beginners
Language: Python - Size: 3.38 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 39 - Forks: 13

myounus96/stock-market-prediction-using-sentiment-analysis
Stock market predictions using sentiment analysis, a deep learning project (data and news based on the Pakistan Stock Exchange and Dawn News)
Language: Jupyter Notebook - Size: 4.19 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 20

Aly-Reda/Scraper_School
Complete guide to scraping almost any website
Language: Python - Size: 53.7 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

koty/hokuto_program_scraper
Scrapes the program of Hokuto Culture Hall.
Language: Python - Size: 18.6 KB - Last synced at: about 12 hours ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

RodrigoRosmaninho/TimeTableParser-ua
Python tool for scraping any timetable page on the University of Aveiro's academic portal (PACO) and exporting it to different formats
Language: Python - Size: 253 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 1

PrestonYU/WebCrawling-with-Python-Marathon-Challenge
Cupoy AI Learning Hub - 1st Python Web Crawling Challenge
Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Live-Lyrics/amalgama-pq
Amalgama lyrics scraping
Language: HTML - Size: 127 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

yyhsong/iPySpider
Python web crawling and information extraction
Language: Python - Size: 172 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

luanxiangming/spider
Selenium with ChromeDriver/PhantomJS; Crawler with BeautifulSoup4, Scrapy, PyQuery
Language: Python - Size: 970 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 5

ShahzaibWaseem/FreeSpringerBooks
Downloader for Free Books from Springer COVID-19 Package
Language: Python - Size: 3.56 GB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

miozune/AtCoderUsersAPI_DB
The scraping part of AtCoderUsersAPI
Language: Python - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

seaung/NySpiders
A set of spiders
Language: Python - Size: 39.1 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

jsxz/crawl-tutorial
Python 3 crawling tutorial
Language: Jupyter Notebook - Size: 84 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

xkdcc/Robots
My spiderbot
Language: Python - Size: 36.1 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

hgz6536/spiderman
:beetle::cherry_blossom: Little spider
Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 5

ckcks12/debian-package-manager
fucking army shit the fucking army twice
Language: Python - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

lishulongVI/whisky
About Cloud Baidu Resource cloud_group
Language: Python - Size: 194 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

zxzxzxygithub/pythonscarch
Based on Python 2.7
Language: HTML - Size: 89.8 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0
