An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-scraping-python

tinyfish-io/agentql

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.

Language: Python - Size: 868 KB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 958 - Forks: 120

seleniumbase/SeleniumBase

Python APIs for web automation, testing, and bypassing bot-detection.

Language: Python - Size: 13.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 11,686 - Forks: 1,425

Sumdiboii/web-crawler-openalex-semantic-research-papers-public

Full-stack FastAPI + React app to search, filter, and analyze papers from OpenAlex & Semantic Scholar. Features charts, bookmarks, CSV export, and advanced filters for streamlined academic research.

Size: 1.71 MB - Last synced at: about 16 hours ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

jsem-nerad/strava-cz-python

High level API pro interakci s webovou aplikaci Strava.cz udelane v Pythonu

Language: Python - Size: 72.3 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

omkarcloud/botasaurus

The All in One Framework to Build Undefeatable Scrapers

Language: Python - Size: 85.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,089 - Forks: 258

haslanin/strava-cz-python

🍽️ Simplify interactions with Strava.cz using Python, featuring login, menu retrieval, and meal ordering through a straightforward API.

Language: Python - Size: 48.8 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

0x676e67/rnet

A blazing-fast Python HTTP Client with TLS fingerprint

Language: Rust - Size: 3.06 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 874 - Forks: 67

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Language: Python - Size: 27.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 58,424 - Forks: 11,069

RNFS/google_trends_scraper

Google Trends Scraper

Size: 5.86 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

Language: Python - Size: 6.47 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 661 - Forks: 154

CPalmer3200/Destiny_Scraping_Tools

Web scraping tools designed to assemble automated daily/monthly literature reviews

Language: Python - Size: 7.74 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

vlagehj/5chsita_mpcrwrl

community response crawler for MapleStort, Nexon

Language: Python - Size: 81.1 KB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Haimonmon/snippy

A Book scraping bot that ables to give you alot of data, but be cautious as may result this a banning of your ip.

Language: Python - Size: 101 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Language: Python - Size: 3.87 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 7,383 - Forks: 417

vladislavpyatnitskiy/datapy

Language: Python - Size: 4.88 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

lombardo-luca/LePrAn

Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.

Language: Python - Size: 102 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 1

oxylabs/asynchronous-web-scraping-python

A comparison of asynchronous and synchronous web scraping methods with practical examples.

Language: Python - Size: 8.34 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 0

oxylabs/parse-html-pyquery

Learn to parse HTML using PyQuery, a Python library for web scraping and manipulating HTML.

Language: Python - Size: 25.4 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

oxylabs/oxylabs-ai-studio-py

Oxylabs AI Studio python SDK

Language: Python - Size: 1.65 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 85 - Forks: 0

oxylabs/oxylabs-ai-studio-js

Oxylabs AI Studio JS SDK

Language: TypeScript - Size: 1.67 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

oxylabs/web-scraping-selenium-python

Web Scraping with Python Selenium: Tutorial for Beginners

Language: Python - Size: 15.6 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 2 - Forks: 0

oxylabs/web-scraping-google-sheets

Guide to Using Google Sheets for Basic Web Scraping

Size: 30.3 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 83 - Forks: 3

oxylabs/how-to-scrape-amazon-product-data

The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.

Size: 2.43 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 103 - Forks: 3

GoncaloMark/CobWeb-lnx

CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

Language: Python - Size: 7.75 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 2

oxylabs/Python-Web-Scraping-Tutorial

In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.

Language: Python - Size: 117 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 294 - Forks: 32

sarperavci/kick-unofficial-api

🛡️ Unofficial Kick.com API wrapper with automatic bypass protection.

Language: Python - Size: 15.6 KB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 9 - Forks: 4

irfanalidv/trustpilot_scraper

A Python library for scraping Trustpilot reviews.

Language: Python - Size: 56.6 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 9

vishwajeetdabholkar/eGet-Crawler-for-ai

Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.

Language: Python - Size: 292 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 46 - Forks: 17

DataCrawl-AI/datacrawl

A simple and easy to use web crawler for Python

Language: Python - Size: 2.16 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 64 - Forks: 11

GaelGil/web-scraper

A web scraper I created using selenium. Its intended to scrape items from several pages. I am using it to scrape books from goodreads.

Language: Python - Size: 1.27 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

thewebscraping/tls-requests

TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.

Language: Python - Size: 3.7 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 78 - Forks: 6

ScraperHub/goodfirms-scraper

Goodfirms.com Search Listing and Company Page Scraper. To handle JS rendering and CAPTCHAs, we are using Crawlbase Crawling API.

Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ScraperHub/farfetch-scrapers

Farfetch.com Search Listings Scraper and Product Details Page Scraper. Scrapers effectively handle JS rendering and CAPTCHA using Crawlbase Crawling API.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ScraperHub/google-hotels-scrapers

Google Hotels Search Listing and Hotel Details Page Scraper. To handle JS rendering, Pagination, and CAPTCHAs, we are using Crawlbase Crawling API.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

a-endari/Learning_German

A comprehensive toolkit for learning German that combines automated translation, audio pronunciation, and flashcard generation. This project streamlines the process of creating study materials by extracting definitions, examples, and audio from online sources and formatting them into structured markdown notes and Anki flashcards.

Language: Python - Size: 1.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

frarlo/garfield_bluesky_bot

Simple Python Bluesky bot to post random Garfield comics every four hours.

Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Mindful-AI-Assistants/SP2024-Election-Analysis

📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 86.6 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 3

wuhulamb/miguvideo-catalog

A Python scraper that collects structured catalog data from Migu Video.

Language: Python - Size: 314 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ReXiOP/Daraz-Global-WebScraper

🔥 Daraz Scraper – Extract product data, prices, ratings & images from Daraz with Python & Playwright. Export to Excel or MongoDB effortlessly! 🚀

Language: Python - Size: 734 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Maksym-TopDev/Web_Scraping_Automation_GPT

A versatile and resilient web scraping tool designed to allow for automation and scaling of web scraping jobs. Leveraging Selenium for browser automation and gpt-4o for cost-effective and scalable data processing, autoScraper automatically cleans and formats the extracted data for easy analysis.

Language: Python - Size: 181 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

loganbarron1/web-crawler-openalex-semantic-research-papers-public

📚 Fetch and visualize research papers from OpenAlex and Semantic Scholar to enhance your academic exploration and analysis.

Size: 1.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

HERALDEXX/tmdb-movie-scraper

Python scraper to collect movie data from TMDb API. Includes dataset of up to 10,000 popular movies.

Language: Python - Size: 9.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

riyanshibariyaa/Web-Scrapping-Text-Analysis

Python-based application designed to scrape web pages, extract textual content, and perform advanced text analysis

Language: Python - Size: 31.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

r0hankrishnan/racket-semantic-search

(WIP) Using semantic search to find the right tennis racket from Tennis Warehouse.

Language: Jupyter Notebook - Size: 3.22 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

davidteather/everything-web-scraping

Learn everything web scraping with David Teather Codes on YouTube

Language: HTML - Size: 7.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 407 - Forks: 81

erfancode83/product-scraper-dashboard

A web scraping and dashboard project using Python & Streamlit.

Size: 7.12 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Mharfe23/fantasy-writer-ai

AI-powered writing platform for fantasy storytellers. Generate images, audio narration, and summaries inside a creative writing environment.

Language: TypeScript - Size: 27.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

farukalamai/yelp-scraper-scrapy-python

Yelp Restaurant data scraping using python, scrapy spider

Language: Python - Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

zenny455/weather-checker

Srapes real-time weather data from timeanddate.com for any city and country, displays results in a clean Tkinter GUI, with error handling for invalid or unreachable locations.

Language: Python - Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Kamran6789/Airbnb-Data-Scraper

This project automates the process of collecting Airbnb listing data across multiple cities and varying guest counts using Selenium WebDriver. It scrapes availability, pricing, and listing volumes for different weekends and guest configurations to analyze trends in Airbnb accommodation data.

Language: Python - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Powerostad/JobVision-Crawler

Simple Async Crawler for JobVision JobPosts

Language: Python - Size: 209 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Nitya1950/Keylogger-Detection Fork of sreya-kambhatla/Keylogger-Detection

Multi-layer keylogger detection system combining heuristic monitoring, simulated adversarial activity, and machine learning classification.

Language: Python - Size: 310 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Royalkavya/Web_Scraping_Flask_App

A Flask web app that scrapes motivational quotes and displays them beautifully on a webpage.

Language: Python - Size: 13 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

lynkos/downloader

Basic web scraper to download media from websites. Supports .pdf generation and vertical image stacking; useful for downloading manga, comics, etc.

Language: Python - Size: 68.4 KB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

erikmilesi-data/fin-sentiment-analysis

Análise de sentimento de ações (Ibovespa, S&P 500, Nasdaq) usando múltiplas fontes de notícias com interface interativa em Jupyter Notebook.

Language: Python - Size: 15.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

z1shivam/peapix-scraper

A simple web scraper which scrapes peapix website and collect data of bing wallpapers and also download them automatically.

Language: Python - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hangngdata/IMDb_reviews_scraper

Python-based scraper for collecting movie data from The Numbers and user reviews from IMDb, useful for text mining, sentiment analysis, and forecasting

Language: Jupyter Notebook - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

WhiteeRabbit/dork-seeker

Simple Automatizated Google dorker script written in python

Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

gomgomnbn/municipal-bid-tracker

Automate procurement tracking with Municipal Bid Tracker. Scrape RFPs and RFQs across California cities, empowering small businesses to compete effectively. 🛠️📊

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pdoup/PyScraperX

Resilient, powerful asynchronous web scraping framework in Python with a real-time UI for monitoring, scheduling, and managing concurrent JSON scraping tasks

Language: Python - Size: 415 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rushi-analytics/Selenium-Mini-Project

A mini project using Selenium to scrape product data from Croma and visualize insights with Power BI.

Language: Jupyter Notebook - Size: 159 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Karthick-840/AH_Recommendation_with_LLM

Very Small Description: Python scripts to scrape, clean, and merge airline crash data from Wikipedia and other web sources for practice and potential Kaggle publication.

Language: Python - Size: 2.73 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

oxylabs/python-cache-tutorial

A guide to caching web scraping scripts in Python.

Language: Python - Size: 421 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

oxylabs/curl-with-python

Master cURL in Python by using the PycURL library. Learn to send GET and POST requests, custom HTTP headers, and how to fix common problems.

Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

00ryanwelzel/minionProfitsCalculator

Accesses an online database for current item prices in hypixel skyblock, then calculates the most profitable minions.

Language: Python - Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sycstitch/truecar-webscraper

Web scraper that collects and analyzes car listings from TrueCar. School assignment turned market analysis tool with data cleaning & visualization for car shopping research.

Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Majmal66/Apple-Watch-Price-Analysis

Scraping & Comparing Apple Watch Prices from Amazon & Noon using Python.

Language: Jupyter Notebook - Size: 376 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Dagvadorj1120/python-hltv-scraper

A straightforward web scraper for HLTV.org that uses AsyncCamoufox and BeautifulSoup. This project offers a reliable way to gather data on matches, teams, and players. 🐍💻

Language: Python - Size: 70.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ClassicalClemi/python-hltv-scraper

A simple and open-source HLTV.org web scraper built with Camoufox and BeautifulSoup, written entirely in Python.

Language: Python - Size: 41 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Gauff/BelgianElectricCarMarketAnalyser

Python tool for analyzing the belgian second hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.

Language: Python - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

shawnCaza/compodio

Putting the podcast in community radio

Language: Python - Size: 186 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 2

zumatt/msa

Multi Search Aggregator is a python script to perform systematic literature review on multiple platforms

Language: Python - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

tarqhilmarsiregar/fashion-scraping-etl

Implementasi ETL pipeline sederhana untuk web scraping data fashion, meliputi ekstraksi, pembersihan, transformasi, dan penyimpanan ke format CSV, Database postgreSQL, serta Google Sheets sebagai dasar insight data

Language: Python - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

DishaAggarwal31/Job-Market-Data-Analysis

An interactive job market analytics dashboard built with Python, Matplotlib, and ipywidgets. Explore job trends by industry, location, and experience with dynamic filters and visual insights.

Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Decodo/Python-scraper-tutorial

A short introduction to scraping with Python with given steps and an example scraper script.

Language: Python - Size: 106 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 5

samshad/Data_Scrape_Auto_Tinder

Data Scrape & Auto‑Swipe for Tinder – Python scripts that authenticate with Tinder’s unofficial API, save profile metadata to CSV, and auto‑like/pass based on simple filters. For educational use only, automation violates Tinder’s ToS.

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

subhanalii/instagram-scraper

A Python automation tool that logs into Instagram, searches profiles via Bing, scrapes public data like bio, followers, and emails, and saves the results. Demo included. Full script available on request.

Language: Python - Size: 4.03 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Yahia-kilany/Oscar-Nominations-Database

Oscars Database Project is a comprehensive system designed to store, manage, and query detailed data about the Academy Awards (Oscars). This project includes both terminal-based and web-based applications to interact with the data, which covers Oscar-related information from the 10th to the 96th iteration.

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Blisse1/caprendizaje-web-scraping

Script en Python que automatiza la extracción de empresas con vacantes para aprendices en 'Caprendizaje' (plataforma del SENA)

Language: Python - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap

Roadmap for Data Science circle associated with CAT Reloaded.

Size: 83 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 26 - Forks: 1

AsmrCodeZ-YT/WebScrappers

Welcome to this repository! 🎉 Here, you will find a collection of 10 free scrapers for extracting data from various websites. This project aims to help developers, researchers, and web scraping enthusiasts.

Language: Jupyter Notebook - Size: 109 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mohammed-Alhusini/movie-info-agent

Scrapes VOX Cinemas to show live movie listings with a Gradio interface

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

JoyalMPaul/Coursicle-Ratings

Web Scraping Application using Coursicle to organize Professors ratings

Language: Python - Size: 6.08 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

herrerovir/Web-scraping-chemical-producers

End to end data analysis project on the largest chemical companies in the world using Python.

Language: Jupyter Notebook - Size: 2.47 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

2003HARSH/SkillHorizon

Skill Horizon is an AI-based tool that uses real job data and course reviews to identify skill gaps and recommend personalized courses based on user queries and real learner feedback.

Language: Python - Size: 16.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

amehnd/Data_mining_R_n_Python

A learning project dedicated to data mining using R and Python. This repository contains scripts for web scraping, data retrieval, and data preparation, with the goal of creating datasets for future machine learning model training. The project is designed to help develop skills in data handling, processing, and structuring

Language: R - Size: 14.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ghxstling/pc-part-hunter

Language: TypeScript - Size: 129 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Karthick-840/Crawl4ai-RAG-with-Local-LLM

A tool for scraping web documentation using Crawl4AI, converting it to Markdown, and preparing it for integration with local LLMs (like Ollama) to enhance their knowledge for learning and "vibe coding" workflows.

Language: Python - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

omkarcloud/gitpod-selenium

Run Python Selenium in GitPod

Language: Dockerfile - Size: 4.88 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 8

ANONYMOUSx46/Advanced-Web-Scrapping-Tool

A web-scrapping-tool I built to automate the process with advanced techniques, ready to use in your Kali Linux Terminal!

Language: Python - Size: 20.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Jesjsssi/Web-Scrapper

Website Scraper Using Python

Language: HTML - Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

hari7261/Webinsight-Automation

The project processes these tasks asynchronously in the background, allowing users to check the status of their analyses and download the results (both HTML content and screenshots).

Language: HTML - Size: 2.78 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

toricodesthings/Discord-Bot-Statistify

Spotify Web API wrapped to a Discord Bot with ability to Scrape for Monthly Listener & Track Playcount (Web Application version coming soon)

Language: Python - Size: 171 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

WLXie-Tony/Movie_Review_Analysis

A comprehensive pipeline for scraping, structuring, and analyzing IMDb movie reviews. This repository includes automated web scraping scripts, structured datasets, and advanced large language model (LLM)-based sentiment analysis to extract insights from user reviews.

Language: Python - Size: 120 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

chiemekaifemegbulem/Useful_tools

Advanced Web Scraping

Language: Python - Size: 252 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

omkarcloud/gitpod-botasaurus

Run Botasaurus in GitPod

Language: Dockerfile - Size: 7.81 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

NickTurenne/UFC_Data_Scraper

A Web Scraper that extracts fighter information from matchups for the next upcoming UFC event and graphs the data.

Language: Python - Size: 17.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Joao-Pedro-P-Holanda/gh-education-offers-scrapper

Simple python script for extracting all offers from the student pack on Github Education

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

novianggita/Web-Scraping

Traveloka web scraping using python (selenium and bs4) and look for insights using SQL. This data analysis aims to determine the extent of the availability of adequate accommodation information in Lombok Island.

Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

jakbin/pcdt-scraper

A PyChromeDevTools based WebScraper and selenium like syntax.

Language: Python - Size: 6.84 KB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Related Keywords
web-scraping-python 254 python 133 web-scraping 130 web-scraper 42 python3 31 selenium 27 beautifulsoup 26 webscraping 22 scraping 21 scraper 20 beautifulsoup4 19 automation 17 web-scraping-api 17 crawler 16 selenium-python 13 data-science 13 requests 13 data-extraction 13 selenium-webdriver 12 python-web-scraper 11 python-scraper 11 machine-learning 11 web-scrapping 11 data-analysis 11 flask 10 data-visualization 9 scraping-python 9 web-crawler-python 9 python-script 9 web-crawler 9 webscraper 8 pandas 8 crawling 7 web-scraping-software 7 scrapy 7 web-scraping-tutorials 7 web 7 scraping-websites 7 data-mining 7 web-crawling 6 jupyter-notebook 6 python-web-crawler 6 cloudflare-bypass 5 playwright 5 captcha-solving 5 data 5 crawling-python 5 web-scraping-project 5 css 5 requests-library-python 5 sentiment-analysis 4 bot 4 visualization 4 hacktoberfest 4 flask-application 4 web-scrapper 4 spider 4 finance 4 python-web-scraping 4 postgresql 4 api 4 sql 4 telegram-bot 4 data-scraping 4 data-analysis-python 4 ai 4 scrapy-crawler 4 fastapi 3 funcaptcha-amazon-captcha-solver 3 funcaptcha-twitter 3 scraper-python 3 html 3 web-scraping-solution 3 playwright-python 3 web-scraping-nodejs 3 json-database-python 3 github-python 3 amazon-captcha-solver 3 nlp 3 data-cleaning 3 matplotlib 3 json 3 stock-market 3 ai-scraping 3 amazon-captcha-solving 3 javascript 3 proxy-scraper 3 data-collection 3 scrapy-spider 3 django 3 gui 3 bing-search 3 bypass-cloudflare 3 discord-bot 3 amazon 3 data-mining-python 3 webdriver-manager 3 web-scrapers 3 mongodb 3 puppeteer 3