An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-scraping-python

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for 40+ popular domains

Language: Python - Size: 6.89 MB - Last synced at: about 4 hours ago - Pushed at: 3 days ago - Stars: 771 - Forks: 172

0x676e67/rnet

An ergonomic Python HTTP Client with TLS fingerprint

Language: Rust - Size: 4.03 MB - Last synced at: about 2 hours ago - Pushed at: 1 day ago - Stars: 1,052 - Forks: 80

seleniumbase/SeleniumBase

Python APIs for web automation, testing, and bypassing bot-detection with ease.

Language: Python - Size: 13.9 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 11,930 - Forks: 1,455

happytaoer/web-craft

A Python-based modular web scraping framework focused on efficient single URL crawling, supporting asynchronous processing, API services, and highly customizable spider modules.

Language: Python - Size: 311 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 16 - Forks: 1

6SUPER6SONIC6/Diffly

Game price comparison platform across multiple regions

Language: HTML - Size: 162 KB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

AsmrCodeZ-YT/WebScrappers

Welcome to this repository! 🎉 Here, you will find a collection of 10 free scrapers for extracting data from various websites. This project aims to help developers, researchers, and web scraping enthusiasts.

Language: Jupyter Notebook - Size: 25.1 MB - Last synced at: about 2 hours ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

ParallaxAPIs/parallaxapis-sdk-py

Language: Python - Size: 36.2 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 13 - Forks: 0

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Language: Python - Size: 27.6 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 59,022 - Forks: 11,162

thewebscraping/tls-requests

TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.

Language: Python - Size: 3.71 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 111 - Forks: 9

omkarcloud/botasaurus

The All in One Framework to Build Undefeatable Scrapers

Language: Python - Size: 86.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,302 - Forks: 281

CPalmer3200/Destiny_Scraping_Tools

Web scraping tools designed to assemble automated daily/monthly literature reviews

Language: Python - Size: 8.36 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Language: Python - Size: 4.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 8,193 - Forks: 466

menaceXnadin/Nepse_Ticker-Discord-Bot

A Discord bot providing real-time updates from the Nepal Stock Exchange (NEPSE) with commands for tracking stocks, indices, and market performance.

Language: Python - Size: 87.9 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

irfanalidv/trustpilot_scraper

A Python library for scraping Trustpilot reviews.

Language: Python - Size: 56.6 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 10

oxylabs/oxylabs-ai-studio-py

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio python SDK for intelligent web data gathering.

Language: Python - Size: 2.36 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2,061 - Forks: 13

haslanin/strava-cz-python

🍽️ Simplify interactions with Strava.cz using Python, featuring login, menu retrieval, and meal ordering through a straightforward API.

Language: Python - Size: 1.33 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

kurtnettle/crackmes-extractor

A Python CLI tool to scrape, extract, and compile challenge data from crackmes into a structured JSON format.

Language: Python - Size: 1.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

oxylabs/oxylabs-ai-studio-js

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio JS SDK for intelligent web data gathering.

Language: TypeScript - Size: 1.4 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 13 - Forks: 0

Mr67009/Web-Automation-Bot

Language: TypeScript - Size: 1.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

louaih/rmp_scraper

A Python-based tool that leverages OpenAI's API for RMP review analysis, Google Search API for professor discovery, and Selenium for web scraping RateMyProfessors data. The tool provides comprehensive course selection insights by analyzing teaching quality, difficulty ratings, and student feedback.

Language: Python - Size: 80.1 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

z1shivam/peapix-scraper

A simple web scraper that scrapes the Peapix website, collects data on Bing wallpapers, and downloads them automatically.

Language: Python - Size: 26.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

jsem-nerad/strava-cz-python

High-level API for interacting with the Strava.cz web application, written in Python

Language: Python - Size: 122 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

itsalivafaei/Job-Agent

Agentic AI for job boards with CI/CD, database integration, and parallel processing

Language: HTML - Size: 605 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

yousefkotp/local-leads-finder

Local Leads Finder helps you uncover nearby business prospects in minutes: enter a keyword and city, watch real-time progress, and download clean lead lists ready for outreach. Perfect for agencies, freelancers, and growth teams who need consistent, enriched local data without the heavy work.

Language: Python - Size: 7.83 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

tbharathraj205/web-scraping

A Python-based web scraping tool that searches DuckDuckGo, collects webpage links, extracts content, and uses OpenAI GPT models to summarize each page. The scraper runs headlessly with Selenium, parses content using BeautifulSoup, and stores results in a structured JSON file. Designed for research, SEO analysis, automated content monitoring, and AI

Language: Python - Size: 18.6 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

tinyfish-io/agentql

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.

Language: Python - Size: 869 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 988 - Forks: 127

davidteather/everything-web-scraping

Learn everything web scraping with David Teather Codes on YouTube

Language: HTML - Size: 7.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 431 - Forks: 86

WaveInCode/InfoLens

A Python and CustomTkinter-based dashboard that provides personalized news, live stock market updates, and weather information. Initially made for my school exhibition.

Language: Python - Size: 1.35 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

r0hankrishnan/racket-semantic-search

(WIP) Using semantic search to find the right tennis racket from Tennis Warehouse.

Language: Jupyter Notebook - Size: 3.69 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sarperavci/kick-unofficial-api

🛡️ Unofficial Kick.com API wrapper with automatic bypass protection.

Language: Python - Size: 15.6 KB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 10 - Forks: 4

Mharfe23/fantasy-writer-ai

AI-powered writing platform for fantasy storytellers. Generate images, audio narration, and summaries inside a creative writing environment.

Language: TypeScript - Size: 27.9 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Sumdiboii/web-crawler-openalex-semantic-research-papers

Full-stack FastAPI + React app to search, filter, and analyze papers from OpenAlex & Semantic Scholar. Features charts, bookmarks, CSV export, and advanced filters for streamlined academic research.

Language: JavaScript - Size: 1.8 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

ElizabethHaberstroh/Oracular-Spectacular

An end-to-end data analysis project, from web scraping to data modeling to visualization, presenting insights into Kermit Lynch's wine portfolio.

Language: Jupyter Notebook - Size: 82 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

shamspias/web-scraper

A Web Scraper that automatically extracts data from websites, processes the information, and stores it in a structured format

Language: Vue - Size: 78.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Sumdiboii/web-crawler-openalex-semantic-research-papers-public

Full-stack FastAPI + React app to search, filter, and analyze papers from OpenAlex & Semantic Scholar. Features charts, bookmarks, CSV export, and advanced filters for streamlined academic research.

Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

Haimonmon/snippy

A book-scraping bot that can give you book data, but be cautious, as using it may result in your IP being banned.

Language: Python - Size: 429 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

RNFS/google_trends_scraper

Google Trends Scraper

Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

frarlo/garfield_bluesky_bot

Simple Python Bluesky bot to post random Garfield comics every four hours.

Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

vlagehj/5chsita_mpcrwrl

Community response crawler for MapleStory, Nexon

Language: Python - Size: 81.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

vladislavpyatnitskiy/datapy

Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

lombardo-luca/LePrAn

Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.

Language: Python - Size: 102 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 6 - Forks: 1

oxylabs/asynchronous-web-scraping-python

A comparison of asynchronous and synchronous web scraping methods with practical examples.

Language: Python - Size: 8.34 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 7 - Forks: 0

oxylabs/parse-html-pyquery

Learn to parse HTML using PyQuery, a Python library for web scraping and manipulating HTML.

Language: Python - Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

oxylabs/web-scraping-selenium-python

Web Scraping with Python Selenium: Tutorial for Beginners

Language: Python - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

oxylabs/web-scraping-google-sheets

Guide to Using Google Sheets for Basic Web Scraping

Size: 30.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 83 - Forks: 3

oxylabs/how-to-scrape-amazon-product-data

The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.

Size: 2.43 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 103 - Forks: 3

GoncaloMark/CobWeb-lnx

CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

Language: Python - Size: 7.75 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 37 - Forks: 2

oxylabs/Python-Web-Scraping-Tutorial

In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex ones.

Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 294 - Forks: 32

vishwajeetdabholkar/eGet-Crawler-for-ai

Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.

Language: Python - Size: 292 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 46 - Forks: 17

DataCrawl-AI/datacrawl

A simple and easy to use web crawler for Python

Language: Python - Size: 2.16 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 11

GaelGil/web-scraper

A web scraper I created using Selenium. It's intended to scrape items from several pages. I am using it to scrape books from Goodreads.

Language: Python - Size: 1.27 GB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

ScraperHub/goodfirms-scraper

Goodfirms.com Search Listing and Company Page Scraper. To handle JS rendering and CAPTCHAs, we are using Crawlbase Crawling API.

Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ScraperHub/farfetch-scrapers

Farfetch.com Search Listings Scraper and Product Details Page Scraper. Scrapers effectively handle JS rendering and CAPTCHA using Crawlbase Crawling API.

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ScraperHub/google-hotels-scrapers

Google Hotels Search Listing and Hotel Details Page Scraper. To handle JS rendering, Pagination, and CAPTCHAs, we are using Crawlbase Crawling API.

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

a-endari/Learning_German

A comprehensive toolkit for learning German that combines automated translation, audio pronunciation, and flashcard generation. This project streamlines the process of creating study materials by extracting definitions, examples, and audio from online sources and formatting them into structured markdown notes and Anki flashcards.

Language: Python - Size: 1.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Mindful-AI-Assistants/SP2024-Election-Analysis

📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 86.6 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

wuhulamb/miguvideo-catalog

A Python scraper that collects structured catalog data from Migu Video.

Language: Python - Size: 314 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ReXiOP/Daraz-Global-WebScraper

🔥 Daraz Scraper – Extract product data, prices, ratings & images from Daraz with Python & Playwright. Export to Excel or MongoDB effortlessly! 🚀

Language: Python - Size: 734 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Tiffano-Dev/Web_Scraping_Automation_GPT

A versatile and resilient web scraping tool designed to allow for automation and scaling of web scraping jobs. Leveraging Selenium for browser automation and gpt-4o for cost-effective and scalable data processing, autoScraper automatically cleans and formats the extracted data for easy analysis.

Language: Python - Size: 181 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

loganbarron1/web-crawler-openalex-semantic-research-papers-public

📚 Fetch and visualize research papers from OpenAlex and Semantic Scholar to enhance your academic exploration and analysis.

Size: 1.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

HERALDEXX/tmdb-movie-scraper

Python scraper to collect movie data from TMDb API. Includes dataset of up to 10,000 popular movies.

Language: Python - Size: 9.66 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

riyanshibariyaa/Web-Scrapping-Text-Analysis

Python-based application designed to scrape web pages, extract textual content, and perform advanced text analysis

Language: Python - Size: 31.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

erfancode83/product-scraper-dashboard

A web scraping and dashboard project using Python & Streamlit.

Size: 7.12 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

farukalamai/yelp-scraper-scrapy-python

Yelp Restaurant data scraping using python, scrapy spider

Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

zenny455/weather-checker

Scrapes real-time weather data from timeanddate.com for any city and country, displays results in a clean Tkinter GUI, with error handling for invalid or unreachable locations.

Language: Python - Size: 3.91 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Kamran6789/Airbnb-Data-Scraper

This project automates the process of collecting Airbnb listing data across multiple cities and varying guest counts using Selenium WebDriver. It scrapes availability, pricing, and listing volumes for different weekends and guest configurations to analyze trends in Airbnb accommodation data.

Language: Python - Size: 9.77 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Powerostad/JobVision-Crawler

Simple Async Crawler for JobVision JobPosts

Language: Python - Size: 209 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Nitya1950/Keylogger-Detection (fork of sreya-kambhatla/Keylogger-Detection)

Multi-layer keylogger detection system combining heuristic monitoring, simulated adversarial activity, and machine learning classification.

Language: Python - Size: 310 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Royalkavya/Web_Scraping_Flask_App

A Flask web app that scrapes motivational quotes and displays them beautifully on a webpage.

Language: Python - Size: 13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

lynkos/downloader

Basic web scraper to download media from websites. Supports .pdf generation and vertical image stacking; useful for downloading manga, comics, etc.

Language: Python - Size: 68.4 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

erikmilesi-data/fin-sentiment-analysis

Stock sentiment analysis (Ibovespa, S&P 500, Nasdaq) using multiple news sources, with an interactive Jupyter Notebook interface.

Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

hangngdata/IMDb_reviews_scraper

Python-based scraper for collecting movie data from The Numbers and user reviews from IMDb, useful for text mining, sentiment analysis, and forecasting

Language: Jupyter Notebook - Size: 94.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

WhiteeRabbit/dork-seeker

Simple automated Google dorking script written in Python

Language: Python - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

gomgomnbn/municipal-bid-tracker

Automate procurement tracking with Municipal Bid Tracker. Scrape RFPs and RFQs across California cities, empowering small businesses to compete effectively. 🛠️📊

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

subhanalii/instagram-scraper

A Python automation tool that logs into Instagram, searches profiles via Bing, scrapes public data like bio, followers, and emails, and saves the results. Demo included. Full script available on request.

Language: Python - Size: 4.03 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

pdoup/PyScraperX

Resilient, powerful asynchronous web scraping framework in Python with a real-time UI for monitoring, scheduling, and managing concurrent JSON scraping tasks

Language: Python - Size: 415 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rushi-analytics/Selenium-Mini-Project

A mini project using Selenium to scrape product data from Croma and visualize insights with Power BI.

Language: Jupyter Notebook - Size: 159 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Karthick-840/AH_Recommendation_with_LLM

Python scripts to scrape, clean, and merge airline crash data from Wikipedia and other web sources for practice and potential Kaggle publication.

Language: Python - Size: 2.73 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

oxylabs/python-cache-tutorial

A guide to caching web scraping scripts in Python.

Language: Python - Size: 421 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

oxylabs/curl-with-python

Master cURL in Python by using the PycURL library. Learn to send GET and POST requests, custom HTTP headers, and how to fix common problems.

Language: Python - Size: 30.3 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 1

00ryanwelzel/minionProfitsCalculator

Accesses an online database for current item prices in hypixel skyblock, then calculates the most profitable minions.

Language: Python - Size: 4.88 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

sycstitch/truecar-webscraper

Web scraper that collects and analyzes car listings from TrueCar. School assignment turned market analysis tool with data cleaning & visualization for car shopping research.

Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Majmal66/Apple-Watch-Price-Analysis

Scraping & Comparing Apple Watch Prices from Amazon & Noon using Python.

Language: Jupyter Notebook - Size: 376 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Dagvadorj1120/python-hltv-scraper

A straightforward web scraper for HLTV.org that uses AsyncCamoufox and BeautifulSoup. This project offers a reliable way to gather data on matches, teams, and players. 🐍💻

Language: Python - Size: 70.3 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ClassicalClemi/python-hltv-scraper

A simple and open-source HLTV.org web scraper built with Camoufox and BeautifulSoup, written entirely in Python.

Language: Python - Size: 41 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Gauff/BelgianElectricCarMarketAnalyser

Python tool for analyzing the Belgian second-hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.

Language: Python - Size: 1.47 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

shawnCaza/compodio

Putting the podcast in community radio

Language: Python - Size: 186 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 2

zumatt/msa

Multi Search Aggregator is a python script to perform systematic literature review on multiple platforms

Language: Python - Size: 12.7 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

tarqhilmarsiregar/fashion-scraping-etl

A simple ETL pipeline implementation for web scraping fashion data, covering extraction, cleaning, transformation, and storage to CSV, a PostgreSQL database, and Google Sheets as a basis for data insights

Language: Python - Size: 6.84 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

DishaAggarwal31/Job-Market-Data-Analysis

An interactive job market analytics dashboard built with Python, Matplotlib, and ipywidgets. Explore job trends by industry, location, and experience with dynamic filters and visual insights.

Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Decodo/Python-scraper-tutorial

A short introduction to scraping with Python with given steps and an example scraper script.

Language: Python - Size: 106 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 30 - Forks: 5

samshad/Data_Scrape_Auto_Tinder

Data Scrape & Auto‑Swipe for Tinder – Python scripts that authenticate with Tinder’s unofficial API, save profile metadata to CSV, and auto‑like/pass based on simple filters. For educational use only; automation violates Tinder’s ToS.

Language: Python - Size: 33.2 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

Yahia-kilany/Oscar-Nominations-Database

Oscars Database Project is a comprehensive system designed to store, manage, and query detailed data about the Academy Awards (Oscars). This project includes both terminal-based and web-based applications to interact with the data, which covers Oscar-related information from the 10th to the 96th iteration.

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Blisse1/caprendizaje-web-scraping

Python script that automates the extraction of companies with apprentice vacancies on 'Caprendizaje' (SENA's platform)

Language: Python - Size: 11.7 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap

Roadmap for Data Science circle associated with CAT Reloaded.

Size: 83 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 26 - Forks: 1

mohammed-Alhusini/movie-info-agent

Scrapes VOX Cinemas to show live movie listings with a Gradio interface

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

JoyalMPaul/Coursicle-Ratings

Web scraping application using Coursicle to organize professors' ratings

Language: Python - Size: 6.08 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

herrerovir/Web-scraping-chemical-producers

End-to-end data analysis project on the largest chemical companies in the world using Python.

Language: Jupyter Notebook - Size: 2.47 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

2003HARSH/SkillHorizon

Skill Horizon is an AI-based tool that uses real job data and course reviews to identify skill gaps and recommend personalized courses based on user queries and real learner feedback.

Language: Python - Size: 16.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

amehnd/Data_mining_R_n_Python

A learning project dedicated to data mining using R and Python. This repository contains scripts for web scraping, data retrieval, and data preparation, with the goal of creating datasets for future machine learning model training. The project is designed to help develop skills in data handling, processing, and structuring

Language: R - Size: 14.6 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Related Keywords
python (141), web-scraping (134), web-scraper (45), python3 (33), selenium (29), beautifulsoup (28), webscraping (22), scraper (21), scraping (21), beautifulsoup4 (20), web-scraping-api (18), automation (17), crawler (17), selenium-python (15), requests (14), data-extraction (14), data-science (13), selenium-webdriver (13), data-analysis (12), flask (11), web-scrapping (11), python-scraper (11), machine-learning (11), python-web-scraper (11), data-visualization (10), python-script (9), web-crawler (9), scraping-python (9), web-crawler-python (9), scrapy (8), webscraper (8), pandas (8), web-scraping-tutorials (7), web-scraping-software (7), playwright (7), web (7), crawling (7), scraping-websites (7), data-mining (7), web-crawling (6), cloudflare-bypass (6), css (6), python-web-crawler (6), jupyter-notebook (6), postgresql (5), requests-library-python (5), captcha-solving (5), python-web-scraping (5), sql (5), data (5), web-scraping-project (5), crawling-python (5), data-analysis-python (4), data-collection (4), spider (4), finance (4), api (4), telegram-bot (4), captcha-solver (4), web-scrapper (4), flask-application (4), django (4), bot (4), hacktoberfest (4), visualization (4), scrapy-crawler (4), data-scraping (4), sentiment-analysis (4), ai (4), semantic-scholar (3), captcha (3), bot-detection (3), research-tool (3), web-scraping-solution (3), captcha-recognition (3), vite (3), playwright-python (3), web-scraping-nodejs (3), amazon (3), gui (3), discord-bot (3), scraper-python (3), json-database-python (3), webdriver-manager (3), web-scrapers (3), puppeteer (3), docker (3), chrome (3), funcaptcha-twitter (3), funcaptcha-amazon-captcha-solver (3), amazon-captcha-solving (3), amazon-captcha-solver (3), scrapy-spider (3), html (3), gui-application (3), json (3), csv (3), llm (3), matplotlib (3), fastapi (3)