GitHub topics: crawling-sites
Onixx241/GuineaWebCrawler
A C# Web Crawler named after my favorite animal that crawls !🐹🐾
Language: C# - Size: 1.82 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

RevoltSecurities/SpideyX
SpideyX a multipurpose Web Penetration Testing tool with asynchronous concurrent performance with multiple mode and configurations.
Language: Python - Size: 973 KB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 164 - Forks: 28

fernandod1/ProductHunt-scraper
Producthunt.com famous website scraper script. Scrap all offers and save in spreadsheet excel file.
Language: Python - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 9

yan043/tlkm_leak
Language: PHP - Size: 22.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

kingschan1204/easyCrawl
A crawler toolkit implemented in Java
Language: Java - Size: 351 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 65 - Forks: 11

BaseMax/StockExchangeCrawler
A crawler program to extract all of the data and the price for symbols in the global stock exchange.
Language: PHP - Size: 30.3 KB - Last synced at: 10 days ago - Pushed at: almost 6 years ago - Stars: 9 - Forks: 8

myawesomebike/Text-Extraction-and-Processing
Crawl websites and extract meaningful information from HTML and site content
Language: Python - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

P4o1o/Dysdera
dysdera web crawler
Language: Python - Size: 2.91 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mcxiaoxiao/xiaohongshuCrawler
🍠小红书 简易爬虫 获取文章title、文章id、文章内容、话题标签 👌🏻 三步实现
Language: JavaScript - Size: 7.19 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 18 - Forks: 2

miroshnikov/scrapyteer
Web crawling & scraping framework for Node.js on top of headless Chrome browser
Language: TypeScript - Size: 384 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 0

zhaotianff/Qzone
想起那天夕阳下的奔跑,那是我逝去的青春
Language: C# - Size: 427 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mustafadalga/website-crawler
Hedef web sitesini tarayarak linklerini listeleyen bir web crawler scripti || A web crawler script that lists links by scanning the target website.
Language: Python - Size: 19.5 KB - Last synced at: about 13 hours ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 3

chandrasekharan98/Multisite-Python-Crawler
An almost generic web crawler built using Scrapy and Python 3.7 to recursively crawl entire websites.
Language: Python - Size: 15.6 KB - Last synced at: 8 months ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 5

BaseMax/NetPHP
Useful functions for connecting to the network in the PHP based applications.
Language: PHP - Size: 23.4 KB - Last synced at: 10 days ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

viclafouch/Fetch-Crawler
📌 A Node.JS Web crawler using the API Fetch to scrap static websites
Language: JavaScript - Size: 690 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 3

KatrojuSaiChaitanya/Webscraping_email_phone
Web scraping of Emails and Phone numbers from various websites
Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 24 - Forks: 8

spypunk/sponge
sponge is a website crawler and links downloader command-line tool
Language: Kotlin - Size: 267 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

krespers/python-TJ-karaoke-songlist-maker
[python] TJ노래방 노래번호 순서대로 제목과 가수 리스트를 출력합니다.
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Team-CMD/SPTJ_Web-Crawling
This Project is "SPTJ_Web-Crawling" that result of one of the activity in Team CMD.
Language: HTML - Size: 13.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 2

rpant1728/InstagramCrawler
A python script to crawl the Instagram profiles and scrape information (posts, followers, following, comments etc.)
Language: Python - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 3

gabfl/sitecrawl
Simple Python module to crawl a website and extract URLs
Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

talhapythoneer/yellowpages_scraper
It's a python based scraper to scrape leads from yellowpages.
Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

talhapythoneer/allrecipes_scraper
This is a Python(Scrapy) based scraper to scrape Recipes information in detail from AllRecipes which is the world's largest community-driven food brand which publishes home cooks and recipes with detail.
Language: Python - Size: 5.47 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 2

BoloniniD/XmlSiteMapper-rs
A sitemapper written in Rust
Language: Rust - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

talhapythoneer/foreclosure_property_scraper
This scraper is built to scrape Foreclosure for property listings which is a login based website.
Language: Python - Size: 26.4 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

talhapythoneer/redfinScraper
This scraper is built to scrape Redfin for property listings which is a Captcha protected website.
Language: Python - Size: 138 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

talhapythoneer/realtor_property_scraper
This script is built to scrape property data from realtor for property listings. We have used ScrapingBee to render JS on this website. It scrapes listings for targetted postal codes from targetCodes.txt file.
Language: Python - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

talhapythoneer/imovirtual_property_scraper
This scraper is built to scrape Imovirtual for property listings.
Language: Python - Size: 50.8 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

talhapythoneer/opensea_activity_scraper
It's a Python(selenium) based scraper to get trade activites for a given collection URL from Opensea which is the world's first and largest web3 marketplace for NFTs and crypto collectibles.
Language: Python - Size: 42 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

BoyanDimov20/OzoneCrawler
Open source app crawler for Ozone.bg
Language: C# - Size: 271 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

devidw/google-untitled-spam-spider
A spam spider which is targeting 'Untitled' spam pages from the Google search results.
Language: Python - Size: 6.84 KB - Last synced at: 8 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

dansari2020/web-crawler
Web Crawler is the automated fetching of all products of a web pages by a software process.
Language: Ruby - Size: 3.32 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

EbraLim/dss_project_Crawling
Crawling hotel data on 3 hotel reservation platforms in realtime, enabling users to compare them and reserve a room with the best price.
Language: Jupyter Notebook - Size: 6.3 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

FachriezalNugraha/Crawling_Twitter
Crawling data Twitter dengan mengggunakan JupyterNotebook dan Library tweepy
Language: Jupyter Notebook - Size: 702 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 4

Tejas07PSK/scrapo
A simple webapp to crawl through google-images and extract a desired number of image-url results, based on an input search-key !! This project was the development task, assigned to me for zillion.io 's hiring challenge !!
Language: JavaScript - Size: 118 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Shikhar-S/Site-Custom-Search
Training
Language: HTML - Size: 1.49 GB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

kapsali29/Crawler-for-Greek-business
Crawler which extract business data from their websites
Language: Jupyter Notebook - Size: 2.12 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

somdipdey/Scrapping_And_Crawling_FinancialNews
Language: Python - Size: 568 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0
