GitHub topics: web-scraping-python
tinyfish-io/agentql
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.
Language: Python - Size: 868 KB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 958 - Forks: 120

seleniumbase/SeleniumBase
Python APIs for web automation, testing, and bypassing bot-detection.
Language: Python - Size: 13.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 11,686 - Forks: 1,425

Sumdiboii/web-crawler-openalex-semantic-research-papers-public
Full-stack FastAPI + React app to search, filter, and analyze papers from OpenAlex & Semantic Scholar. Features charts, bookmarks, CSV export, and advanced filters for streamlined academic research.
Size: 1.71 MB - Last synced at: about 16 hours ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

jsem-nerad/strava-cz-python
High level API pro interakci s webovou aplikaci Strava.cz udelane v Pythonu
Language: Python - Size: 72.3 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

omkarcloud/botasaurus
The All in One Framework to Build Undefeatable Scrapers
Language: Python - Size: 85.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,089 - Forks: 258

haslanin/strava-cz-python
🍽️ Simplify interactions with Strava.cz using Python, featuring login, menu retrieval, and meal ordering through a straightforward API.
Language: Python - Size: 48.8 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

0x676e67/rnet
A blazing-fast Python HTTP Client with TLS fingerprint
Language: Rust - Size: 3.06 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 874 - Forks: 67

scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Language: Python - Size: 27.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 58,424 - Forks: 11,069

RNFS/google_trends_scraper
Google Trends Scraper
Size: 5.86 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

scrapfly/scrapfly-scrapers
Scalable Python web scraping scripts for +40 popular domains
Language: Python - Size: 6.47 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 661 - Forks: 154

CPalmer3200/Destiny_Scraping_Tools
Web scraping tools designed to assemble automated daily/monthly literature reviews
Language: Python - Size: 7.74 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

vlagehj/5chsita_mpcrwrl
community response crawler for MapleStort, Nexon
Language: Python - Size: 81.1 KB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Haimonmon/snippy
A Book scraping bot that ables to give you alot of data, but be cautious as may result this a banning of your ip.
Language: Python - Size: 101 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

D4Vinci/Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Language: Python - Size: 3.87 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 7,383 - Forks: 417

vladislavpyatnitskiy/datapy
Language: Python - Size: 4.88 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

lombardo-luca/LePrAn
Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.
Language: Python - Size: 102 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 1

oxylabs/asynchronous-web-scraping-python
A comparison of asynchronous and synchronous web scraping methods with practical examples.
Language: Python - Size: 8.34 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 0

oxylabs/parse-html-pyquery
Learn to parse HTML using PyQuery, a Python library for web scraping and manipulating HTML.
Language: Python - Size: 25.4 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

oxylabs/oxylabs-ai-studio-py
Oxylabs AI Studio python SDK
Language: Python - Size: 1.65 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 85 - Forks: 0

oxylabs/oxylabs-ai-studio-js
Oxylabs AI Studio JS SDK
Language: TypeScript - Size: 1.67 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

oxylabs/web-scraping-selenium-python
Web Scraping with Python Selenium: Tutorial for Beginners
Language: Python - Size: 15.6 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 2 - Forks: 0

oxylabs/web-scraping-google-sheets
Guide to Using Google Sheets for Basic Web Scraping
Size: 30.3 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 83 - Forks: 3

oxylabs/how-to-scrape-amazon-product-data
The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.
Size: 2.43 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 103 - Forks: 3

GoncaloMark/CobWeb-lnx
CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.
Language: Python - Size: 7.75 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 2

oxylabs/Python-Web-Scraping-Tutorial
In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.
Language: Python - Size: 117 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 294 - Forks: 32

sarperavci/kick-unofficial-api
🛡️ Unofficial Kick.com API wrapper with automatic bypass protection.
Language: Python - Size: 15.6 KB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 9 - Forks: 4

irfanalidv/trustpilot_scraper
A Python library for scraping Trustpilot reviews.
Language: Python - Size: 56.6 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 9

vishwajeetdabholkar/eGet-Crawler-for-ai
Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.
Language: Python - Size: 292 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 46 - Forks: 17

DataCrawl-AI/datacrawl
A simple and easy to use web crawler for Python
Language: Python - Size: 2.16 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 64 - Forks: 11

GaelGil/web-scraper
A web scraper I created using selenium. Its intended to scrape items from several pages. I am using it to scrape books from goodreads.
Language: Python - Size: 1.27 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

thewebscraping/tls-requests
TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.
Language: Python - Size: 3.7 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 78 - Forks: 6

ScraperHub/goodfirms-scraper
Goodfirms.com Search Listing and Company Page Scraper. To handle JS rendering and CAPTCHAs, we are using Crawlbase Crawling API.
Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ScraperHub/farfetch-scrapers
Farfetch.com Search Listings Scraper and Product Details Page Scraper. Scrapers effectively handle JS rendering and CAPTCHA using Crawlbase Crawling API.
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ScraperHub/google-hotels-scrapers
Google Hotels Search Listing and Hotel Details Page Scraper. To handle JS rendering, Pagination, and CAPTCHAs, we are using Crawlbase Crawling API.
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

a-endari/Learning_German
A comprehensive toolkit for learning German that combines automated translation, audio pronunciation, and flashcard generation. This project streamlines the process of creating study materials by extracting definitions, examples, and audio from online sources and formatting them into structured markdown notes and Anki flashcards.
Language: Python - Size: 1.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

frarlo/garfield_bluesky_bot
Simple Python Bluesky bot to post random Garfield comics every four hours.
Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Mindful-AI-Assistants/SP2024-Election-Analysis
📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.
Language: HTML - Size: 86.6 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 3

wuhulamb/miguvideo-catalog
A Python scraper that collects structured catalog data from Migu Video.
Language: Python - Size: 314 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ReXiOP/Daraz-Global-WebScraper
🔥 Daraz Scraper – Extract product data, prices, ratings & images from Daraz with Python & Playwright. Export to Excel or MongoDB effortlessly! 🚀
Language: Python - Size: 734 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Maksym-TopDev/Web_Scraping_Automation_GPT
A versatile and resilient web scraping tool designed to allow for automation and scaling of web scraping jobs. Leveraging Selenium for browser automation and gpt-4o for cost-effective and scalable data processing, autoScraper automatically cleans and formats the extracted data for easy analysis.
Language: Python - Size: 181 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

loganbarron1/web-crawler-openalex-semantic-research-papers-public
📚 Fetch and visualize research papers from OpenAlex and Semantic Scholar to enhance your academic exploration and analysis.
Size: 1.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

HERALDEXX/tmdb-movie-scraper
Python scraper to collect movie data from TMDb API. Includes dataset of up to 10,000 popular movies.
Language: Python - Size: 9.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

riyanshibariyaa/Web-Scrapping-Text-Analysis
Python-based application designed to scrape web pages, extract textual content, and perform advanced text analysis
Language: Python - Size: 31.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

r0hankrishnan/racket-semantic-search
(WIP) Using semantic search to find the right tennis racket from Tennis Warehouse.
Language: Jupyter Notebook - Size: 3.22 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

davidteather/everything-web-scraping
Learn everything web scraping with David Teather Codes on YouTube
Language: HTML - Size: 7.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 407 - Forks: 81

erfancode83/product-scraper-dashboard
A web scraping and dashboard project using Python & Streamlit.
Size: 7.12 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Mharfe23/fantasy-writer-ai
AI-powered writing platform for fantasy storytellers. Generate images, audio narration, and summaries inside a creative writing environment.
Language: TypeScript - Size: 27.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

farukalamai/yelp-scraper-scrapy-python
Yelp Restaurant data scraping using python, scrapy spider
Language: Python - Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

zenny455/weather-checker
Srapes real-time weather data from timeanddate.com for any city and country, displays results in a clean Tkinter GUI, with error handling for invalid or unreachable locations.
Language: Python - Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Kamran6789/Airbnb-Data-Scraper
This project automates the process of collecting Airbnb listing data across multiple cities and varying guest counts using Selenium WebDriver. It scrapes availability, pricing, and listing volumes for different weekends and guest configurations to analyze trends in Airbnb accommodation data.
Language: Python - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Powerostad/JobVision-Crawler
Simple Async Crawler for JobVision JobPosts
Language: Python - Size: 209 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Nitya1950/Keylogger-Detection Fork of sreya-kambhatla/Keylogger-Detection
Multi-layer keylogger detection system combining heuristic monitoring, simulated adversarial activity, and machine learning classification.
Language: Python - Size: 310 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Royalkavya/Web_Scraping_Flask_App
A Flask web app that scrapes motivational quotes and displays them beautifully on a webpage.
Language: Python - Size: 13 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

lynkos/downloader
Basic web scraper to download media from websites. Supports .pdf generation and vertical image stacking; useful for downloading manga, comics, etc.
Language: Python - Size: 68.4 KB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

erikmilesi-data/fin-sentiment-analysis
Análise de sentimento de ações (Ibovespa, S&P 500, Nasdaq) usando múltiplas fontes de notícias com interface interativa em Jupyter Notebook.
Language: Python - Size: 15.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

z1shivam/peapix-scraper
A simple web scraper which scrapes peapix website and collect data of bing wallpapers and also download them automatically.
Language: Python - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hangngdata/IMDb_reviews_scraper
Python-based scraper for collecting movie data from The Numbers and user reviews from IMDb, useful for text mining, sentiment analysis, and forecasting
Language: Jupyter Notebook - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

WhiteeRabbit/dork-seeker
Simple Automatizated Google dorker script written in python
Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

gomgomnbn/municipal-bid-tracker
Automate procurement tracking with Municipal Bid Tracker. Scrape RFPs and RFQs across California cities, empowering small businesses to compete effectively. 🛠️📊
Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pdoup/PyScraperX
Resilient, powerful asynchronous web scraping framework in Python with a real-time UI for monitoring, scheduling, and managing concurrent JSON scraping tasks
Language: Python - Size: 415 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rushi-analytics/Selenium-Mini-Project
A mini project using Selenium to scrape product data from Croma and visualize insights with Power BI.
Language: Jupyter Notebook - Size: 159 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Karthick-840/AH_Recommendation_with_LLM
Very Small Description: Python scripts to scrape, clean, and merge airline crash data from Wikipedia and other web sources for practice and potential Kaggle publication.
Language: Python - Size: 2.73 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

oxylabs/python-cache-tutorial
A guide to caching web scraping scripts in Python.
Language: Python - Size: 421 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

oxylabs/curl-with-python
Master cURL in Python by using the PycURL library. Learn to send GET and POST requests, custom HTTP headers, and how to fix common problems.
Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

00ryanwelzel/minionProfitsCalculator
Accesses an online database for current item prices in hypixel skyblock, then calculates the most profitable minions.
Language: Python - Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sycstitch/truecar-webscraper
Web scraper that collects and analyzes car listings from TrueCar. School assignment turned market analysis tool with data cleaning & visualization for car shopping research.
Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Majmal66/Apple-Watch-Price-Analysis
Scraping & Comparing Apple Watch Prices from Amazon & Noon using Python.
Language: Jupyter Notebook - Size: 376 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Dagvadorj1120/python-hltv-scraper
A straightforward web scraper for HLTV.org that uses AsyncCamoufox and BeautifulSoup. This project offers a reliable way to gather data on matches, teams, and players. 🐍💻
Language: Python - Size: 70.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ClassicalClemi/python-hltv-scraper
A simple and open-source HLTV.org web scraper built with Camoufox and BeautifulSoup, written entirely in Python.
Language: Python - Size: 41 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Gauff/BelgianElectricCarMarketAnalyser
Python tool for analyzing the belgian second hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.
Language: Python - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

shawnCaza/compodio
Putting the podcast in community radio
Language: Python - Size: 186 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 2

zumatt/msa
Multi Search Aggregator is a python script to perform systematic literature review on multiple platforms
Language: Python - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

tarqhilmarsiregar/fashion-scraping-etl
Implementasi ETL pipeline sederhana untuk web scraping data fashion, meliputi ekstraksi, pembersihan, transformasi, dan penyimpanan ke format CSV, Database postgreSQL, serta Google Sheets sebagai dasar insight data
Language: Python - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

DishaAggarwal31/Job-Market-Data-Analysis
An interactive job market analytics dashboard built with Python, Matplotlib, and ipywidgets. Explore job trends by industry, location, and experience with dynamic filters and visual insights.
Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Decodo/Python-scraper-tutorial
A short introduction to scraping with Python with given steps and an example scraper script.
Language: Python - Size: 106 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 5

samshad/Data_Scrape_Auto_Tinder
Data Scrape & Auto‑Swipe for Tinder – Python scripts that authenticate with Tinder’s unofficial API, save profile metadata to CSV, and auto‑like/pass based on simple filters. For educational use only, automation violates Tinder’s ToS.
Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

subhanalii/instagram-scraper
A Python automation tool that logs into Instagram, searches profiles via Bing, scrapes public data like bio, followers, and emails, and saves the results. Demo included. Full script available on request.
Language: Python - Size: 4.03 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Yahia-kilany/Oscar-Nominations-Database
Oscars Database Project is a comprehensive system designed to store, manage, and query detailed data about the Academy Awards (Oscars). This project includes both terminal-based and web-based applications to interact with the data, which covers Oscar-related information from the 10th to the 96th iteration.
Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Blisse1/caprendizaje-web-scraping
Script en Python que automatiza la extracción de empresas con vacantes para aprendices en 'Caprendizaje' (plataforma del SENA)
Language: Python - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap
Roadmap for Data Science circle associated with CAT Reloaded.
Size: 83 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 26 - Forks: 1

AsmrCodeZ-YT/WebScrappers
Welcome to this repository! 🎉 Here, you will find a collection of 10 free scrapers for extracting data from various websites. This project aims to help developers, researchers, and web scraping enthusiasts.
Language: Jupyter Notebook - Size: 109 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mohammed-Alhusini/movie-info-agent
Scrapes VOX Cinemas to show live movie listings with a Gradio interface
Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

JoyalMPaul/Coursicle-Ratings
Web Scraping Application using Coursicle to organize Professors ratings
Language: Python - Size: 6.08 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

herrerovir/Web-scraping-chemical-producers
End to end data analysis project on the largest chemical companies in the world using Python.
Language: Jupyter Notebook - Size: 2.47 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

2003HARSH/SkillHorizon
Skill Horizon is an AI-based tool that uses real job data and course reviews to identify skill gaps and recommend personalized courses based on user queries and real learner feedback.
Language: Python - Size: 16.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

amehnd/Data_mining_R_n_Python
A learning project dedicated to data mining using R and Python. This repository contains scripts for web scraping, data retrieval, and data preparation, with the goal of creating datasets for future machine learning model training. The project is designed to help develop skills in data handling, processing, and structuring
Language: R - Size: 14.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ghxstling/pc-part-hunter
Language: TypeScript - Size: 129 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Karthick-840/Crawl4ai-RAG-with-Local-LLM
A tool for scraping web documentation using Crawl4AI, converting it to Markdown, and preparing it for integration with local LLMs (like Ollama) to enhance their knowledge for learning and "vibe coding" workflows.
Language: Python - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

omkarcloud/gitpod-selenium
Run Python Selenium in GitPod
Language: Dockerfile - Size: 4.88 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 8

ANONYMOUSx46/Advanced-Web-Scrapping-Tool
A web-scrapping-tool I built to automate the process with advanced techniques, ready to use in your Kali Linux Terminal!
Language: Python - Size: 20.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Jesjsssi/Web-Scrapper
Website Scraper Using Python
Language: HTML - Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

hari7261/Webinsight-Automation
The project processes these tasks asynchronously in the background, allowing users to check the status of their analyses and download the results (both HTML content and screenshots).
Language: HTML - Size: 2.78 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

toricodesthings/Discord-Bot-Statistify
Spotify Web API wrapped to a Discord Bot with ability to Scrape for Monthly Listener & Track Playcount (Web Application version coming soon)
Language: Python - Size: 171 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

WLXie-Tony/Movie_Review_Analysis
A comprehensive pipeline for scraping, structuring, and analyzing IMDb movie reviews. This repository includes automated web scraping scripts, structured datasets, and advanced large language model (LLM)-based sentiment analysis to extract insights from user reviews.
Language: Python - Size: 120 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

chiemekaifemegbulem/Useful_tools
Advanced Web Scraping
Language: Python - Size: 252 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

omkarcloud/gitpod-botasaurus
Run Botasaurus in GitPod
Language: Dockerfile - Size: 7.81 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

NickTurenne/UFC_Data_Scraper
A Web Scraper that extracts fighter information from matchups for the next upcoming UFC event and graphs the data.
Language: Python - Size: 17.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Joao-Pedro-P-Holanda/gh-education-offers-scrapper
Simple python script for extracting all offers from the student pack on Github Education
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

novianggita/Web-Scraping
Traveloka web scraping using python (selenium and bs4) and look for insights using SQL. This data analysis aims to determine the extent of the availability of adequate accommodation information in Lombok Island.
Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

jakbin/pcdt-scraper
A PyChromeDevTools based WebScraper and selenium like syntax.
Language: Python - Size: 6.84 KB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
