GitHub topics: data-scraper
Police-Data-Accessibility-Project/docs
Documentation for the Police Data Accessibility Project.
Size: 74.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 6

AIxBlock-2023/awesome-ai-dev-platform-opensource
An On-Chain Open-Source Platform for Rapid AI Model Productization Using Decentralized Resources with Flexibility and Scalability
Language: TypeScript - Size: 178 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 463 - Forks: 91

iamdulanga/iqair-data-scraper
Air quality data scraping tool for get real-time data from https://www.iqair.com/ in Sri Lanka by FECTSL
Language: HTML - Size: 50 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

egbertbouman/youtube-comment-downloader
Simple script for downloading Youtube comments without using the Youtube API
Language: Python - Size: 60.5 KB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 1,053 - Forks: 241

je-suis-tm/web-scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Language: Python - Size: 1.88 MB - Last synced at: 27 days ago - Pushed at: over 3 years ago - Stars: 787 - Forks: 177

farukalamai/yelp-scraper-scrapy-python
Yelp Restaurant data scraping using python, scrapy spider
Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

JasonG7234/NBA-Draft-Model
This is a Python college basketball data scraper + draft model.
Language: Python - Size: 5.54 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

kaymen99/ai-web-scraper
AI web scraper built with Crawl4AI for extracting structured leads data from websites.
Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 14 - Forks: 1

ShahrozAtiq/Hevy-Exercises-Data-Scraper
This bot automates the process of logging into the Hevy website using your email and password, navigating to the exercises section, and extracting detailed data for each exercise. The extracted attributes include: Name Equipment Primary Muscle Secondary Muscle Source Source Type Thumbnail The data is systematically stored in a CSV file
Language: Python - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

ranbot-ai/instagram-scraper
A Nodejs script that scrapes data from instagram profiles.
Language: TypeScript - Size: 78.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Antkky/Go_Crypto_Scraper
Highly efficient crypto data scraper built on go, works on multiple exchanges at once
Language: Go - Size: 1.09 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Antkky/Python_Crypto_Scraper
Python crypto data scraper that scrapes across multiple exchanges
Language: PowerShell - Size: 472 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ranbot-ai/web-scraper
A NodeJS script that scrapes metadata from public websites | 2025
Language: TypeScript - Size: 72.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

bitArtisan1/netDigger
A .NET 8.0 C# WPF desktop application for web scraping data into structured databases with a modern UI, comprehensive logging and optimized high performance.
Language: C# - Size: 375 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 1

eVowIO/Data-Tracking-Sentinel
eVow.io Data Tracking Sentinel
Size: 39.1 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

r7avi/Google-Maps-Data-Scrapper
Google Maps/Place Business Data Scrapper, Scrape data from any google business listed on google maps. Get Name, Address, Phone Number, Plus Code, Reviews, Longitude and Latitude etc. Scrape Emails from Available Websites and Docker Support for automation.
Language: Python - Size: 240 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0

abtsousa/clippy
A simple file downloader / scraper for NOVA School of Science and Technology's own e-learning platform, CLIP.
Language: Python - Size: 4.54 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

shivam-moray/Data-Scraper-Geocoding-Neighbourhood-Clustering
Scraping neighbourhood data, analyzing and clustering similar neighbourhoods.
Language: Jupyter Notebook - Size: 41 KB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

lukas-buergi/km-stat
This project is intended to provide accurate and accessible information about arms exports
Language: HTML - Size: 183 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

OKEYResidentialProxies/Scrape-Facebook
Scraping Facebook can offer a wealth of information for various applications, from market research to academic studies. However, it’s essential to approach this task responsibly, adhering to legal and ethical guidelines to ensure that the data is used appropriately and securely.
Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

danielfrees/scrapemed
ScrapeMed: Data scraping for PubMed Central.
Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

ProMahmudul/goodreads-book-scraper
The "GoodReads Book Scraper" is a project developed using Laravel 11 framework. This tool aims to extract and organize data from the popular book review platform, GoodReads. With its intuitive interface and powerful scraping capabilities, users can effortlessly gather information such as book original titles, language, ISBN, and published data.
Language: PHP - Size: 73.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adil6572/PropertyPulse
PropertyPulse is a versatile web scraping tool designed to streamline the process of gathering real estate data from various websites.
Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adil6572/qbcc-local-contractor-scraper
This Python-based web scraper automates the collection of contractor details from the Queensland Building and Construction Commission's (QBCC) website, simplifying data gathering for analysis and database creation in the construction industry.
Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

adil6572/pages24-scraper
Pages24 Scraper is a Python tool designed to efficiently extract valuable information from the Pages24 website. This scraper simplifies the process of collecting diverse data, including names, URLs, addresses, and more, from Pages24 listings.
Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ohiTuna/DATA2000
tool for scraping state physician DATA2000 waiver counts from SAMHSA
Language: Go - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

debajit13/books-data-scrapper
A nodejs based webscrapper for books.toscrape.com website
Language: TypeScript - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mertaybat/boardgamegeek-downloader
Boardgamegeek data downloader
Language: TypeScript - Size: 18.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ortanaV2/Data-Scraper
A data-scraper that makes it possible to filter out the most important information from huge amounts of text based data.
Language: Python - Size: 6.84 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

FoamoftheSea/tda_scraper
A web-scraper which uses selenium webdriver to obtain in-depth data from TD Ameritrade website for use with python.
Language: Jupyter Notebook - Size: 190 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 3

kanitsharma/mangafox-scraper
Data scraper for mangafox
Language: JavaScript - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

oxygenkun/YetAnotherDataScraper
Yet another data scraper for you-know-what
Language: Python - Size: 53.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

logan-lauton/nfl_webscrape
web scrape performed for Kaggle dataset.
Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

logan-lauton/nba_webscrape
web scrapes performed for Kaggle datasets.
Language: Jupyter Notebook - Size: 86.9 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

You-sha/Naukri.com-Scraper
Scraper for Naukri.com | BeautifulSoup and Selenium
Language: Python - Size: 1.95 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

encoreshao/puppeteer-typescript-starter
The most basic Puppeteer TypeScript starter
Size: 2.93 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

h26k2/node-gui-scraper
A GUI Based Web-Scraper for scraping e-commerce website using NodeJS
Language: JavaScript - Size: 151 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

sallamy2580/python-web-scrapping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Language: Python - Size: 1.56 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 0

Rafaqfg/web-scraping-project-Python
In this project I created a python script using data scraping techniques to extract HTML content data from the Trybe's blog and stored in a MongoDB database.
Language: Python - Size: 3.56 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lucifermorningstar1305/scrappy
Web Scrapper for any website
Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

aarhank/Periculum-API
A working API for accessing/scraping kickasstorrent Torrents data.
Language: JavaScript - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

luka-j/UpisScraper
Crawls upin.mpn.gov.rs for data.
Language: Java - Size: 180 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

SabujXi/Python-Scraper-and-Data-Analysts-Admin-Panel-in-Django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
Language: Python - Size: 1.18 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

christiancameron/websitescraping
Data scraper and analysis from the top 500 websites
Language: Python - Size: 252 KB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

lurodrigo/anw-cue-sheet
Generates cue sheets from .edl files and Audio Network metadata
Language: Clojure - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

DanMesh/ParkrunData
Data scraper for Parkrun results
Language: Java - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0
