An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-scraper

Police-Data-Accessibility-Project/docs

Documentation for the Police Data Accessibility Project.

Size: 74.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 6

AIxBlock-2023/awesome-ai-dev-platform-opensource

An On-Chain Open-Source Platform for Rapid AI Model Productization Using Decentralized Resources with Flexibility and Scalability

Language: TypeScript - Size: 178 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 463 - Forks: 91

iamdulanga/iqair-data-scraper

Air quality data scraping tool for get real-time data from https://www.iqair.com/ in Sri Lanka by FECTSL

Language: HTML - Size: 50 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

egbertbouman/youtube-comment-downloader

Simple script for downloading Youtube comments without using the Youtube API

Language: Python - Size: 60.5 KB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 1,053 - Forks: 241

je-suis-tm/web-scraping

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

Language: Python - Size: 1.88 MB - Last synced at: 27 days ago - Pushed at: over 3 years ago - Stars: 787 - Forks: 177

farukalamai/yelp-scraper-scrapy-python

Yelp Restaurant data scraping using python, scrapy spider

Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

JasonG7234/NBA-Draft-Model

This is a Python college basketball data scraper + draft model.

Language: Python - Size: 5.54 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

kaymen99/ai-web-scraper

AI web scraper built with Crawl4AI for extracting structured leads data from websites.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 14 - Forks: 1

ShahrozAtiq/Hevy-Exercises-Data-Scraper

This bot automates the process of logging into the Hevy website using your email and password, navigating to the exercises section, and extracting detailed data for each exercise. The extracted attributes include: Name Equipment Primary Muscle Secondary Muscle Source Source Type Thumbnail The data is systematically stored in a CSV file

Language: Python - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

ranbot-ai/instagram-scraper

A Nodejs script that scrapes data from instagram profiles.

Language: TypeScript - Size: 78.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Antkky/Go_Crypto_Scraper

Highly efficient crypto data scraper built on go, works on multiple exchanges at once

Language: Go - Size: 1.09 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Antkky/Python_Crypto_Scraper

Python crypto data scraper that scrapes across multiple exchanges

Language: PowerShell - Size: 472 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ranbot-ai/web-scraper

A NodeJS script that scrapes metadata from public websites | 2025

Language: TypeScript - Size: 72.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

bitArtisan1/netDigger

A .NET 8.0 C# WPF desktop application for web scraping data into structured databases with a modern UI, comprehensive logging and optimized high performance.

Language: C# - Size: 375 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 1

eVowIO/Data-Tracking-Sentinel

eVow.io Data Tracking Sentinel

Size: 39.1 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

r7avi/Google-Maps-Data-Scrapper

Google Maps/Place Business Data Scrapper, Scrape data from any google business listed on google maps. Get Name, Address, Phone Number, Plus Code, Reviews, Longitude and Latitude etc. Scrape Emails from Available Websites and Docker Support for automation.

Language: Python - Size: 240 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0

abtsousa/clippy

A simple file downloader / scraper for NOVA School of Science and Technology's own e-learning platform, CLIP.

Language: Python - Size: 4.54 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

shivam-moray/Data-Scraper-Geocoding-Neighbourhood-Clustering

Scraping neighbourhood data, analyzing and clustering similar neighbourhoods.

Language: Jupyter Notebook - Size: 41 KB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

lukas-buergi/km-stat

This project is intended to provide accurate and accessible information about arms exports

Language: HTML - Size: 183 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

OKEYResidentialProxies/Scrape-Facebook

Scraping Facebook can offer a wealth of information for various applications, from market research to academic studies. However, it’s essential to approach this task responsibly, adhering to legal and ethical guidelines to ensure that the data is used appropriately and securely.

Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

danielfrees/scrapemed

ScrapeMed: Data scraping for PubMed Central.

Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

ProMahmudul/goodreads-book-scraper

The "GoodReads Book Scraper" is a project developed using Laravel 11 framework. This tool aims to extract and organize data from the popular book review platform, GoodReads. With its intuitive interface and powerful scraping capabilities, users can effortlessly gather information such as book original titles, language, ISBN, and published data.

Language: PHP - Size: 73.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adil6572/PropertyPulse

PropertyPulse is a versatile web scraping tool designed to streamline the process of gathering real estate data from various websites.

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adil6572/qbcc-local-contractor-scraper

This Python-based web scraper automates the collection of contractor details from the Queensland Building and Construction Commission's (QBCC) website, simplifying data gathering for analysis and database creation in the construction industry.

Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

adil6572/pages24-scraper

Pages24 Scraper is a Python tool designed to efficiently extract valuable information from the Pages24 website. This scraper simplifies the process of collecting diverse data, including names, URLs, addresses, and more, from Pages24 listings.

Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ohiTuna/DATA2000

tool for scraping state physician DATA2000 waiver counts from SAMHSA

Language: Go - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

debajit13/books-data-scrapper

A nodejs based webscrapper for books.toscrape.com website

Language: TypeScript - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mertaybat/boardgamegeek-downloader

Boardgamegeek data downloader

Language: TypeScript - Size: 18.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ortanaV2/Data-Scraper

A data-scraper that makes it possible to filter out the most important information from huge amounts of text based data.

Language: Python - Size: 6.84 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

FoamoftheSea/tda_scraper

A web-scraper which uses selenium webdriver to obtain in-depth data from TD Ameritrade website for use with python.

Language: Jupyter Notebook - Size: 190 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 3

kanitsharma/mangafox-scraper

Data scraper for mangafox

Language: JavaScript - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

oxygenkun/YetAnotherDataScraper

Yet another data scraper for you-know-what

Language: Python - Size: 53.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

logan-lauton/nfl_webscrape

web scrape performed for Kaggle dataset.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

logan-lauton/nba_webscrape

web scrapes performed for Kaggle datasets.

Language: Jupyter Notebook - Size: 86.9 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

You-sha/Naukri.com-Scraper

Scraper for Naukri.com | BeautifulSoup and Selenium

Language: Python - Size: 1.95 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

encoreshao/puppeteer-typescript-starter

The most basic Puppeteer TypeScript starter

Size: 2.93 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

h26k2/node-gui-scraper

A GUI Based Web-Scraper for scraping e-commerce website using NodeJS

Language: JavaScript - Size: 151 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

sallamy2580/python-web-scrapping

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

Language: Python - Size: 1.56 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 0

Rafaqfg/web-scraping-project-Python

In this project I created a python script using data scraping techniques to extract HTML content data from the Trybe's blog and stored in a MongoDB database.

Language: Python - Size: 3.56 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lucifermorningstar1305/scrappy

Web Scrapper for any website

Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

aarhank/Periculum-API

A working API for accessing/scraping kickasstorrent Torrents data.

Language: JavaScript - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

luka-j/UpisScraper

Crawls upin.mpn.gov.rs for data.

Language: Java - Size: 180 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

SabujXi/Python-Scraper-and-Data-Analysts-Admin-Panel-in-Django

A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data

Language: Python - Size: 1.18 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

christiancameron/websitescraping

Data scraper and analysis from the top 500 websites

Language: Python - Size: 252 KB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

lurodrigo/anw-cue-sheet

Generates cue sheets from .edl files and Audio Network metadata

Language: Clojure - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

DanMesh/ParkrunData

Data scraper for Parkrun results

Language: Java - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0