An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-scraping-python

frarlo/garfield_bluesky_bot

Simple Python Bluesky bot to post random Garfield comics every four hours.

Language: Python - Size: 31.3 KB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 0 - Forks: 0

seleniumbase/SeleniumBase

Python APIs for web automation, testing, and bypassing bot-detection.

Language: Python - Size: 13.3 MB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 11,297 - Forks: 1,381

rushi-analytics/Selenium-Mini-Project

A mini project using Selenium to scrape product data from Croma and visualize insights with Power BI.

Language: Jupyter Notebook - Size: 159 KB - Last synced at: about 13 hours ago - Pushed at: about 14 hours ago - Stars: 1 - Forks: 0

GaelGil/web-scraper

A web scraper I created using selenium. Its intended to scrape items from several pages. I am using it to scrape books from goodreads.

Language: Python - Size: 1.27 GB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

Mindful-AI-Assistants/SP2024-Election-Analysis

📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 86.6 MB - Last synced at: about 21 hours ago - Pushed at: 2 days ago - Stars: 7 - Forks: 3

a-endari/Learning_German

A comprehensive toolkit for learning German that combines automated translation, audio pronunciation, and flashcard generation. This project streamlines the process of creating study materials by extracting definitions, examples, and audio from online sources and formatting them into structured markdown notes and Anki flashcards.

Language: Python - Size: 1.89 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Language: Python - Size: 27.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 57,332 - Forks: 10,935

pdoup/PyScraperX

Resilient, powerful asynchronous web scraping framework in Python with a real-time UI for monitoring, scheduling, and managing concurrent JSON scraping tasks

Language: Python - Size: 296 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Karthick-840/AH_Recommendation_with_LLM

Very Small Description: Python scripts to scrape, clean, and merge airline crash data from Wikipedia and other web sources for practice and potential Kaggle publication.

Language: Python - Size: 2.73 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

Language: Python - Size: 4.93 MB - Last synced at: 3 days ago - Pushed at: 18 days ago - Stars: 546 - Forks: 131

WhiteeRabbit/dork-seeker

Simple Automatizated Google dorker script written in python

Language: Python - Size: 16.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

Bernso/NovelReaderWeb

Website made in python that scrapes lightnovelpub.vip for the novel inputted and will create a page for each of the chapters inside of those novels, also it includes features such as text to speech, text opacity and a font selector. All of these settings will save on your device.

Language: Python - Size: 311 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 1

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Language: Python - Size: 1.97 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5,451 - Forks: 301

omkarcloud/botasaurus

The All in One Framework to Build Undefeatable Scrapers

Language: Python - Size: 63 MB - Last synced at: 4 days ago - Pushed at: 20 days ago - Stars: 2,018 - Forks: 179

0x676e67/rnet

A blazing-fast Python HTTP Client with TLS fingerprint

Language: Rust - Size: 1.22 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 539 - Forks: 49

thewebscraping/tls-requests

TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.

Language: Python - Size: 3.7 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 68 - Forks: 5

tinyfish-io/agentql

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.

Language: Python - Size: 895 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 850 - Forks: 115

CPalmer3200/Destiny_Scraping_Tools

Web scraping tools designed to assemble automated daily/monthly literature reviews

Language: Python - Size: 8.43 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

oxylabs/Python-Web-Scraping-Tutorial

In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.

Language: Python - Size: 111 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 284 - Forks: 31

00ryanwelzel/minionProfitsCalculator

Accesses an online database for current item prices in hypixel skyblock, then calculates the most profitable minions.

Language: Python - Size: 4.88 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

irfanalidv/trustpilot_scraper

A Python library for scraping Trustpilot reviews.

Language: Python - Size: 56.6 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 4

Powerostad/JobVision-Crawler

Simple Async Crawler for JobVision JobPosts

Language: Python - Size: 144 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

Majmal66/Apple-Watch-Price-Analysis

Scraping & Comparing Apple Watch Prices from Amazon & Noon using Python.

Language: Jupyter Notebook - Size: 376 KB - Last synced at: 10 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

sarperavci/kick-unofficial-api

🛡️ Unofficial Kick.com API wrapper with automatic bypass protection.

Language: Python - Size: 15.6 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 7 - Forks: 3

Dagvadorj1120/python-hltv-scraper

A straightforward web scraper for HLTV.org that uses AsyncCamoufox and BeautifulSoup. This project offers a reliable way to gather data on matches, teams, and players. 🐍💻

Language: Python - Size: 70.3 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

oxylabs/how-to-scrape-amazon-product-data

The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.

Size: 2.42 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 29 - Forks: 1

ClassicalClemi/python-hltv-scraper

A simple and open-source HLTV.org web scraper built with Camoufox and BeautifulSoup, written entirely in Python.

Language: Python - Size: 41 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

Gauff/BelgianElectricCarMarketAnalyser

Python tool for analyzing the belgian second hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.

Language: Python - Size: 1.47 MB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

oxylabs/curl-with-python

Master cURL in Python by using the PycURL library. Learn to send GET and POST requests, custom HTTP headers, and how to fix common problems.

Language: Python - Size: 26.4 KB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 1

sycstitch/truecar-webscraper-data-analysis

Web scraper that collects and analyzes car listings from TrueCar. School assignment turned market analysis tool with data cleaning & visualization for car shopping research.

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

shawnCaza/compodio

Putting the podcast in community radio

Language: Python - Size: 186 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 2

zumatt/msa

Multi Search Aggregator is a python script to perform systematic literature review on multiple platforms

Language: Python - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

tarqhilmarsiregar/fashion-scraping-etl

Implementasi ETL pipeline sederhana untuk web scraping data fashion, meliputi ekstraksi, pembersihan, transformasi, dan penyimpanan ke format CSV, Database postgreSQL, serta Google Sheets sebagai dasar insight data

Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

DishaAggarwal31/Job-Market-Data-Analysis

An interactive job market analytics dashboard built with Python, Matplotlib, and ipywidgets. Explore job trends by industry, location, and experience with dynamic filters and visual insights.

Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Decodo/Python-scraper-tutorial

A short introduction to scraping with Python with given steps and an example scraper script.

Language: Python - Size: 106 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 5

samshad/Data_Scrape_Auto_Tinder

Data Scrape & Auto‑Swipe for Tinder – Python scripts that authenticate with Tinder’s unofficial API, save profile metadata to CSV, and auto‑like/pass based on simple filters. For educational use only, automation violates Tinder’s ToS.

Language: Python - Size: 33.2 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

subhanalii/instagram-scraper

A Python automation tool that logs into Instagram, searches profiles via Bing, scrapes public data like bio, followers, and emails, and saves the results. Demo included. Full script available on request.

Language: Python - Size: 4.03 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Yahia-kilany/Oscar-Nominations-Database

Oscars Database Project is a comprehensive system designed to store, manage, and query detailed data about the Academy Awards (Oscars). This project includes both terminal-based and web-based applications to interact with the data, which covers Oscar-related information from the 10th to the 96th iteration.

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Blisse1/caprendizaje-web-scraping

Script en Python que automatiza la extracción de empresas con vacantes para aprendices en 'Caprendizaje' (plataforma del SENA)

Language: Python - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap

Roadmap for Data Science circle associated with CAT Reloaded.

Size: 83 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 26 - Forks: 1

AsmrCodeZ-YT/WebScrappers

Welcome to this repository! 🎉 Here, you will find a collection of 10 free scrapers for extracting data from various websites. This project aims to help developers, researchers, and web scraping enthusiasts.

Language: Jupyter Notebook - Size: 109 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

mohammed-Alhusini/movie-info-agent

Scrapes VOX Cinemas to show live movie listings with a Gradio interface

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

JoyalMPaul/Coursicle-Ratings

Web Scraping Application using Coursicle to organize Professors ratings

Language: Python - Size: 6.08 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

herrerovir/Web-scraping-chemical-producers

End to end data analysis project on the largest chemical companies in the world using Python.

Language: Jupyter Notebook - Size: 2.47 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

DataCrawl-AI/datacrawl

A simple and easy to use web crawler for Python

Language: Python - Size: 2.16 MB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 62 - Forks: 11

2003HARSH/SkillHorizon

Skill Horizon is an AI-based tool that uses real job data and course reviews to identify skill gaps and recommend personalized courses based on user queries and real learner feedback.

Language: Python - Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

amehnd/Data_mining_R_n_Python

A learning project dedicated to data mining using R and Python. This repository contains scripts for web scraping, data retrieval, and data preparation, with the goal of creating datasets for future machine learning model training. The project is designed to help develop skills in data handling, processing, and structuring

Language: R - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

davidteather/everything-web-scraping

Learn everything web scraping with David Teather Codes on YouTube

Language: HTML - Size: 7.6 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 378 - Forks: 76

ghxstling/pc-part-hunter

Language: TypeScript - Size: 129 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Karthick-840/Crawl4ai-RAG-with-Local-LLM

A tool for scraping web documentation using Crawl4AI, converting it to Markdown, and preparing it for integration with local LLMs (like Ollama) to enhance their knowledge for learning and "vibe coding" workflows.

Language: Python - Size: 33.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

omkarcloud/gitpod-selenium

Run Python Selenium in GitPod

Language: Dockerfile - Size: 4.88 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 7

farukalamai/yelp-scraper-scrapy-python

Yelp Restaurant data scraping using python, scrapy spider

Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

ANONYMOUSx46/Advanced-Web-Scrapping-Tool

A web-scrapping-tool I built to automate the process with advanced techniques, ready to use in your Kali Linux Terminal!

Language: Python - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Jesjsssi/Web-Scrapper

Website Scraper Using Python

Language: HTML - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lombardo-luca/LePrAn

Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.

Language: Python - Size: 91.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

hari7261/Webinsight-Automation

The project processes these tasks asynchronously in the background, allowing users to check the status of their analyses and download the results (both HTML content and screenshots).

Language: HTML - Size: 2.78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

toricodesthings/Discord-Bot-Statistify

Spotify Web API wrapped to a Discord Bot with ability to Scrape for Monthly Listener & Track Playcount (Web Application version coming soon)

Language: Python - Size: 171 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

WLXie-Tony/Movie_Review_Analysis

A comprehensive pipeline for scraping, structuring, and analyzing IMDb movie reviews. This repository includes automated web scraping scripts, structured datasets, and advanced large language model (LLM)-based sentiment analysis to extract insights from user reviews.

Language: Python - Size: 120 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

chiemekaifemegbulem/Useful_tools

Advanced Web Scraping

Language: Python - Size: 252 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

omkarcloud/gitpod-botasaurus

Run Botasaurus in GitPod

Language: Dockerfile - Size: 7.81 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

NickTurenne/UFC_Data_Scraper

A Web Scraper that extracts fighter information from matchups for the next upcoming UFC event and graphs the data.

Language: Python - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

oxylabs/asynchronous-web-scraping-python

A comparison of asynchronous and synchronous web scraping methods with practical examples.

Language: Python - Size: 8.34 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 0

oxylabs/web-scraping-google-sheets

Guide to Using Google Sheets for Basic Web Scraping

Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 97 - Forks: 2

Joao-Pedro-P-Holanda/gh-education-offers-scrapper

Simple python script for extracting all offers from the student pack on Github Education

Language: Python - Size: 11.7 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

novianggita/Web-Scraping

Traveloka web scraping using python (selenium and bs4) and look for insights using SQL. This data analysis aims to determine the extent of the availability of adequate accommodation information in Lombok Island.

Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jakbin/pcdt-scraper

A PyChromeDevTools based WebScraper and selenium like syntax.

Language: Python - Size: 6.84 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kvcops/Deep-Research-using-Gemini-api

AI-powered deep research tool leveraging web scraping for cost-effective, comprehensive analysis. Open-source and API-cost free!

Language: HTML - Size: 394 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 12 - Forks: 2

RomanGW/lukki

Completely free code for a webcrawling bot.

Language: Python - Size: 19.5 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ingluiserge/Deep-Research-using-Gemini-api

AI-powered deep research tool leveraging web scraping for cost-effective, comprehensive analysis. Open-source and API-cost free!

Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

vishwajeetdabholkar/eGet-Crawler-for-ai

Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.

Language: Python - Size: 248 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 41 - Forks: 15

paulajr/machine_learning

Undergraduate Economy projects

Language: Jupyter Notebook - Size: 2.17 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

alisa-yar/Source-Code-Viewer

Online Source Code Viewer (get HTML source code from URL)

Language: Python - Size: 455 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

itskovacs/songkick-concerts

🎵 Python Songkick concerts crawler. No API usage. Telegram notifications.

Language: Python - Size: 158 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Atia-Farha/HTML-Fetcher-Script

A Python script that allows users to fetch and optionally save the HTML content from a specified URL using `requests` library.

Language: Python - Size: 83 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

gayanukabulegoda/Web-Scraping-Starter-Kit

Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.

Language: Python - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 0

hhuayuan/spider-course

《深入了解Python爬虫攻防》课程课件及相关代码:大部分爬虫教程都是教一些基础或者是直接找一些案例讲解,已经入门但未熟练的人难以找到适合的课程及练习网站;只教人爬不教原理,以至于部分人学完还是知其然不知其所以然,无法灵活应用;而且很多课程掺杂了大量Python基础语法等内容充集数、知识点不连贯或者避重就轻等。 本课程以横向教学为主,介绍爬虫实际工作中用到的技术、思路及工具,并且以边开发网页边爬取的方式逐步深入爬虫与反爬虫的攻防知识,知己知彼。

Language: HTML - Size: 9.57 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

oxylabs/python-cache-tutorial

A guide to caching web scraping scripts in Python.

Language: Python - Size: 417 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

oxylabs/parse-html-pyquery

Learn to parse HTML using PyQuery, a Python library for web scraping and manipulating HTML.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

oxylabs/web-scraping-selenium-python

Web Scraping with Python Selenium: Tutorial for Beginners

Language: Python - Size: 11.7 KB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

TufayelLUS/suedtirol.info-scraper-using-python-with-dataset

A python based web scraper implementation of suedtirol.info accommodation lead scraper based on given region

Language: Python - Size: 776 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

namdharayush/Flipkart-Products-Scraper-Automatic-Bot-with-Selenium-Python

Flipkart Products Scraper - Automatic Bot with Selenium and Python

Language: Python - Size: 365 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

MarionChaff/windguru-scraper

Python script that scrapes weather forecast from Windguru using Selenium

Language: Python - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

itachi1621/G2A_Scraper

Python script scrapes product information from G2A, extracts pricing, ratings, and seller names, creates an HTML table using ChatGPT, and sends email notifications to recipients specified in the configuration file.

Language: Python - Size: 30.3 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LakshayD02/Web_Scraping_Python

This repository contains a Python program that scrapes product information (names, prices, ratings, etc.) from an e-commerce website and stores the data in a CSV file. A useful tool for data collection and analysis! 📊

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

SciFrozen-Git/website-scraper

A powerful and easy-to-use tool built with Scrapy and Node.js that allows you to scrape and download the entire source code and assets of any website. Perfect for developers, researchers, and web enthusiasts who need offline access to websites or want to analyze their structure.

Language: Python - Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

BreadyBred/codewars-rank-fetcher

This Python GUI application simplifies the process of fetching and storing your Codewars ranking data across various categories. It provides a user-friendly interface for configuration and displays retrieved ranks in a clear format.

Size: 30.6 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

FilipCieciuch/Web-scraping-1

A Python project for web scraping data on therapeutic centers in Poland and visualizing their geographic distribution using Tableau. Includes an interactive dashboard and CSV export for further analysis.

Language: Python - Size: 4.17 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

GoncaloMark/CobWeb-lnx

CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

Language: Python - Size: 7.75 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 2

Human505-oatmeal/100-Days-Of-Python-Projects

🐍 100 Days Of Python Projects 🐍

Language: Python - Size: 11.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jsun-dot/DramaNite

A command-line tool designed to seamlessly stream Asian TV shows and movies by aggregating content directly from KissAsian.

Language: Python - Size: 63.5 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nicopujia/old_projects

Old projects that I made when I was learning to program

Language: Python - Size: 51.2 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mr-mudgal/Amazon-Scrapper

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

Language: Python - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Code-Quang/Linkedin-Scraping

I scraped the specific company follower's url, name, education and so on.

Language: Python - Size: 5.59 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

madhurimarawat/CSVTU-GPT

This repository hosts the CSVTU GPT app, a Streamlit-based interactive application designed to provide efficient access to subject-specific academic information and resources. It supports functionalities like fuzzy matching, exact word matching, and syllabus search capabilities, enabling users to query data conveniently.

Language: Jupyter Notebook - Size: 9.38 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

lithoeme/souper

gather data from any site.

Language: Python - Size: 3.91 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

PythonicXwarraich/TCDB-Scraper

A web scraper to extract data from TCDb. The spider collects details such as player names, team names, card images, total cards, and release dates for baseball card sets, almost (6500) data..

Language: Python - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

BhattJayD/PassBreachFinder

A Python script that checks whether a password has been compromised using the Have I Been Pwned service. The script automates the process of querying the website and retrieving the results for the given password, leveraging Selenium and a headless Firefox browser. It’s a simple tool for testing password security and checking for data breaches.

Language: Python - Size: 1.99 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

BadrAnalyst/Scraping-Data-from-a-Real-Website

Web-scraped data on the largest U.S. companies by revenue, capturing rank, name, industry, revenue (in USD billions), employees, and headquarters location. Data is structured into a CSV dataset, ready for analysis and insights into major corporate players.

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SathvikNayak123/sentiment-anyalysis

Sentiment Analysis using DistlBERT Transformer from HuggingFace. Also integrated Airflow for end-to-end pipeline

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

ffatahillah7/Web-Scraping-Multiple-Page-Using-Python

Extract data and content from an online store

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Related Keywords
web-scraping-python 226 python 120 web-scraping 117 web-scraper 40 python3 31 beautifulsoup 25 selenium 23 scraping 21 webscraping 20 beautifulsoup4 17 scraper 16 automation 15 crawler 14 web-scraping-api 14 data-science 12 selenium-python 12 data-extraction 12 python-scraper 11 selenium-webdriver 11 python-web-scraper 11 web-scrapping 11 data-analysis 10 requests 10 machine-learning 10 scraping-python 9 web-crawler-python 9 flask 8 pandas 8 python-script 8 web-crawler 8 data-visualization 8 scraping-websites 7 scrapy 7 web-scraping-software 7 web-scraping-tutorials 7 web 7 data-mining 6 webscraper 6 python-web-crawler 6 crawling 6 web-crawling 5 hacktoberfest 5 jupyter-notebook 5 captcha-solving 5 flask-application 5 web-scraping-project 5 cloudflare-bypass 5 data-scraping 4 python-web-scraping 4 spider 4 visualization 4 sql 4 data 4 crawling-python 4 web-scrapper 4 scrapy-crawler 4 playwright 4 telegram-bot 4 data-analysis-python 4 ai 4 css 3 discord-bot 3 csv 3 amazon-captcha-solving 3 amazon-captcha-solver 3 web-scraping-nodejs 3 bypass-cloudflare 3 json 3 requests-library-python 3 matplotlib 3 web-scraping-solution 3 funcaptcha-amazon-captcha-solver 3 javascript 3 django 3 amazon 3 funcaptcha-twitter 3 website 3 data-cleaning 3 puppeteer 3 scraper-python 3 json-database-python 3 captcha-solver 3 data-collection 3 docker 3 chrome 3 captcha-recognition 3 llm 3 sentiment-analysis 3 web-scrapers 3 react 3 webdriver-manager 3 github-python 3 scrapy-spider 3 finance 3 bing-search 3 html 3 captcha 3 playwright-python 3 fastapi 3 agent 2