GitHub topics: webcrawler
dbaofd/pushingbarriers-webcrawler
It can be used to grab fixtures for different club teams in Queensland.
Language: Python - Size: 95.7 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Corin-R/stadtradeln
This is a private project to crawl the stadtradeln event for a single city.
Language: Python - Size: 2.21 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

LOKESH-loky/Concurrent-Web-Crawler
The Concurrent Web Crawler is a Go-based application designed to crawl web pages efficiently using Go's powerful concurrency features.
Language: Go - Size: 12.7 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Lehoczky/apro-scrape
Helpful web scraper for hardverapro.hu
Language: TypeScript - Size: 5.09 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 0

crawlab-team/crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Language: Go - Size: 23.6 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 11,737 - Forks: 1,833

WebCrawlerAPI/webcrawlerapi-js-sdk
A WebcrawlerAPI SDK for Node JS
Language: TypeScript - Size: 41 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

deviknitkkr/Jemini
This project provides a REST API that allows users to submit URLs for crawling. The app internally uses RabbitMQ to publish the URLs, and then listens back to fetch the contents of the URLs using Jsoup. The app also scrapes links and indexes the content using Apache Lucene.
Language: Java - Size: 112 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

Aravindha1234u/SocialScraper
Social Scraper is a python tool meant for Detection of Child Predators/Cyber Harassers on Social Media
Language: Python - Size: 739 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 13

dipanshuchaubey/ecom-price-crawler
A simple web crawler which tracks price of products on Flipkart and Amazon.
Language: JavaScript - Size: 8.79 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

ptrumpis/snap-lens-web-crawler
JavaScript library to crawl and download Snap Lenses from lens.snapchat.com with ease.
Language: JavaScript - Size: 183 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 7 - Forks: 4

Galarzaa90/TibiaKt
Kotlin library to fetch and parse Tibia.com pages.
Language: Kotlin - Size: 32.9 MB - Last synced at: about 18 hours ago - Pushed at: 10 days ago - Stars: 1 - Forks: 2

swalsh76/SillySpider
Got bored so wrote a crawler / downloader with GPT4o. Maybe someone can use it for something ¯\_(ツ)_/¯
Language: Python - Size: 13.7 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

yufree/scifetch
webpage crawling tools for pubmed, google scholar and rss
Language: R - Size: 44.9 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 13 - Forks: 6

lgcarmo/WebHunterScreen
This program aims to check active targets by saving screenshots in a project.
Language: Python - Size: 5.57 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 13 - Forks: 0

whats2000/CodeBRT
CodeBRT is an AI program generation plugin for VSCode. It helps you quickly generate code through AI, thus improving development efficiency.
Language: TypeScript - Size: 7.09 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 25 - Forks: 4

Lucs1590/cobWeb
🌧 🐛.🌿 Web crawler to get data from weather, bugs and plant!
Language: Python - Size: 9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

z0m31en7/Uscrapper
Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.
Language: Python - Size: 438 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 590 - Forks: 61

JaCraig/Spidey
A multi threaded web crawler library that is generic enough to allow different engines to be swapped in.
Language: C# - Size: 23.9 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 12 - Forks: 3

Jeanetted3v/Web-Crawler-Playground
A playground to testing out website crawling tools
Language: Python - Size: 16.6 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

amirgamil/apollo
A Unix-style personal search engine and web crawler for your digital footprint.
Language: Go - Size: 532 KB - Last synced at: about 3 hours ago - Pushed at: over 1 year ago - Stars: 1,373 - Forks: 52

openviglet/turing
:sparkles: :dna: Turing ES - Enterprise Search, Semantic Navigation, Chatbot using Search Engine and Generative AI.
Language: Java - Size: 298 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 58 - Forks: 9

xyzxyq/Deepseek-QandA-system-based-on-web-crawler-knowledge-base
该项目是基于网络爬虫,首先通过对百度搜索引擎对关键字进行搜索进而获取最实时性的消息,然后将获取到的消息创建为知识库,在调用deepseek模型时使用知识库中的内容让模型基于该内容进行回答
Language: Python - Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

nayandas69/SEO-Sentinel
SEO Sentinel Your site’s SEO glow-up BFF! Sniff out broken links, missing metadata, & keyword drama while serving fire HTML reports.
Size: 188 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 11 - Forks: 0

rohitajariwal/web-app-security-scanner
A web crawler and vulnerability scanner tool developed by Rohit Ajariwal
Language: Python - Size: 32.2 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 1

NSYSU-OpenDev/NSYSUCourseAPI
中山大學選課列表API
Language: Python - Size: 258 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 2 - Forks: 0

fengqimin/WebAnts
一个用httpx实现的简单异步网络爬虫框架。
Language: Python - Size: 211 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

Nqhuy300106/open_deep_research
Together Open Deep Research
Language: Python - Size: 236 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

iBz-04/Hudgent
Official code implementation for my ready tensor publication, an ai agent that retrieves data from an islamic website -> uses the data as alignment criteria to answer the user
Language: Python - Size: 32.3 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

hesamz3090/Moss
Moss is a lightweight, efficient, and modular web crawler designed to explore, analyze, and extract data from the vast landscape of the internet.
Language: Python - Size: 14.6 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

pyoneerC/Mercadix
Price histogram generator for MercadoLibre product listings.
Language: HTML - Size: 19.4 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 16 - Forks: 0

sudonym-i/Web-Scraper
A web scraper that follows a chain specifid in 'crawlchain.txt', and collects data using start/stop points (ex <a> and </a>).
Language: Makefile - Size: 36.9 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 2 - Forks: 0

MertenD/node-crawler
Node-Crawler is a highly customizable, Node-based web application for creating web crawlers and further processing and transforming the retrieved data.
Language: TypeScript - Size: 1.25 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 2

Conso1eCowb0y/Deepminer 📦
Deep web crawler and search engine
Language: Python - Size: 14.6 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 50 - Forks: 12

GeneralNewsExtractor/GeneralNewsExtractor
新闻网页正文通用抽取器 Beta 版.
Language: Python - Size: 17.4 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 3,715 - Forks: 538

jishnukoliyadan/NAV_Scrapper
NAV Scraper is a Python tool that fetches real-time stock and mutual fund NAVs, merges them with holdings data, and stores results in a database or exports to JSON. It supports automated daily updates via cron jobs and uses rate-limited API calls for efficient scraping.
Language: Python - Size: 24.4 KB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

zorlan/skycaiji
蓝天采集器是一款开源免费的爬虫系统,仅需点选编辑规则即可采集数据,可运行在本地、虚拟主机或云服务器中,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Language: PHP - Size: 24.9 MB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 1,986 - Forks: 593

GeminidSystems/GoogleNewsScraper
A Python package that scrapes Google News article data while remaining undetected by Google. Our scraper can scrape page data up until the last page and never trigger a CAPTCHA (download stats: https://pepy.tech/project/GoogleNewsScraper)
Language: Python - Size: 15.3 MB - Last synced at: 30 days ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 5

ssssssss-team/spider-flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Language: Java - Size: 3.23 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 9,902 - Forks: 1,908

MCStreetguy/Crawler
An advanced web-crawler written in PHP.
Language: PHP - Size: 224 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 0

iiicebearrr/spiders-for-all
A set of useful and scalable spiders to crawl data/videos from bilibili, xiaohongshu, etc.
Language: Python - Size: 1.06 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 11

Aavache/LLMWebCrawler
A Web Crawler based on LLMs implemented with Ray and Huggingface. The embeddings are saved into a vector database for fast clustering and retrieval. Use it for your RAG.
Language: Python - Size: 20.5 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 10

Acollie/Go-Webcrawler
Webcrawler in Go with a graph database and DynamoDB for backing
Language: Go - Size: 2.18 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

kingname/SourceCodeOfBook
《Python爬虫开发 从入门到实战》配套源代码。
Language: Python - Size: 85.1 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 364 - Forks: 127

KELVI23/Java-Web-Crawler
Web Crawler project that navigates the web and indexes pages. Project makes use of Jsoup (Java html parsing library). It crawls webpages at the depth of 2 and returns target title, links and text then saves them to a file.
Language: HTML - Size: 43 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

LanHao99/pubfetch
a simple python-based pubmed abstract fetcher
Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Avesay/imdb_search
Web crawler designed for scraping data about top 250 movies on IMDb.
Language: Python - Size: 123 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

itsNavinSingh/crawler
A Web Crawler for crawling the internet
Language: C++ - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

scrapinghub/scrapyrt
HTTP API for Scrapy spiders
Language: Python - Size: 233 KB - Last synced at: 28 days ago - Pushed at: 11 months ago - Stars: 852 - Forks: 160

XenosWarlocks/MultiCrawl
MultiCrawl is a powerful and flexible web crawling framework that provides multiple crawling strategies to suit different use cases and performance requirements. The library supports sequential, threaded, and asynchronous crawling methods, making it adaptable to various data extraction needs.
Language: Python - Size: 567 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

3nock/SpiderSuite
Advance web security spider/crawler
Size: 6.98 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 634 - Forks: 70

leticosta4/API_dados_processos
API Flask com web crawling para coleta de dados sobre processos jurídicos
Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mehmetozkaya/DotnetCrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Language: C# - Size: 70.3 KB - Last synced at: about 21 hours ago - Pushed at: over 2 years ago - Stars: 176 - Forks: 66

pavlovtech/WebReaper
Web scraper, crawler and parser in C#. Designed as simple, declarative and scalable web scraping solution.
Language: C# - Size: 37.3 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 119 - Forks: 28

havardnyboe/dagenidag
Gjenskapning av NRKs side 199 fra Tekst-TV
Language: TypeScript - Size: 4.27 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

mominurr/Web-Scraping-Projects
Explore a variety of web scraping projects showcasing my skills and experience in extracting valuable data and solving complex challenges.
Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/realSelf.com_scraper
realself.com data scraper that scrape website all information and bypass ip blocking and press & hold captcha.
Size: 146 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

mominurr/Real-Estate-Web-Scraping
Real Estate Web Scraping – Collects comprehensive property and agent data while bypassing IP blocking measures
Size: 1.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/Google-Map-Scraping
google map scraper collect google map all available data and collect email from business website.
Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/Social-Media-Scraping
Social Media Scraping – Scrapes data from TikTok, LinkedIn, Facebook, and Twitter (X.com), including user profiles, posts, engagement metrics, and comments.
Size: 324 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/cars.com
Cars.com Scraper – Extracts car listings (make, model, year, price, seller details) from cars.com using Selenium and BeautifulSoup, saving data in CSV format.
Language: Python - Size: 555 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

salimk/Rcrawler
An R web crawler and scraper
Language: R - Size: 597 KB - Last synced at: 30 days ago - Pushed at: about 3 years ago - Stars: 354 - Forks: 92

mominurr/Yellow-Pages-Data-Scraping
Yellow Pages Data Scraping – Automates the extraction of business details (name, email, phone, address, website) from Yellow Pages directories, providing structured and accurate data.
Size: 191 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

code-yeongyu/TrackPurchase
단 몇줄의 코드로 다양한 쇼핑 플랫폼에서 결제 내역을 긁어오자!
Language: TypeScript - Size: 619 KB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 0

antsinar/CrawlerAPI
An async web crawler implemented as a web API, mainly for educational purposes.
Language: Python - Size: 152 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

voliveirajr/seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 127 - Forks: 45

jaeksoft/opensearchserver
Open-source Enterprise Grade Search Engine Software
Language: Java - Size: 498 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 505 - Forks: 189

kleindasash/Content-Grabber
Content Grabber is a powerful software for automatic data extraction from websites.
Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

DedSecInside/gotor
This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.
Language: Go - Size: 10.7 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 166 - Forks: 44

mominurr/stackoverflow.com
A web scraper collecting Stack Overflow questions for NLP, using threading and user-agent rotation
Language: Python - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

gdgd009xcd/RequestRecorder
A ZAPROXY Add-on that allows testing of web application vulnerabilities by recording complex multi-step sequences. You can test applications that need to access pages in a specific order, such as shopping carts or registration of member information.
Language: Java - Size: 50.7 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 4

Ns81000/ai-chat-web-crawler
🤖 Chat with AI models that respond in real-time to your questions and prompts. 🕸️ Crawl websites to extract valuable information with adjustable depth and limits. 📄 Process documents like PDFs and text files to include in your AI conversations.
Language: Python - Size: 26.4 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

adrianosferreira/afrodite.json
O maior livro de receitas culinárias em língua portuguesa
Size: 540 KB - Last synced at: about 1 month ago - Pushed at: almost 9 years ago - Stars: 187 - Forks: 43

devBhas/DevCrawler
DevCrawler - An LLM Friendly Web Crawler & Data Scraper
Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

rodri-santos/SD-2024
Webcrawler feito no âmbito da cadeira Sistemas Distribuídos do 2º semestre de 2023/2024 do 3º ano da Licenciatura em Engenharia e Ciência de Dados
Language: Java - Size: 946 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

glaubermag/reddit_web_crawler
Este projeto Python coleta dados do Reddit (posts e comentários), armazena em um banco de dados PostgreSQL e fornece ferramentas para análise de sentimentos e previsão de tendências. Inclui scripts para coleta de dados, manipulação de banco de dados e consultas.
Language: Python - Size: 36.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

shinrenpan/WebParser
網頁爬蟲
Language: Swift - Size: 39.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

JacobTaylor3/Web-Crawler
Web Crawler for ethical hackers / pen testers
Language: Python - Size: 40 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

OsamaNagi/http-health-checker
🕷️ Go Web Crawler - A lightning-fast concurrent web crawler for performing deep health checks on websites. Built with Go's powerful concurrency primitives.
Language: Go - Size: 32.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SlidrusForeal/Webcrawler
Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

datacollectionspecialist/Web-Crawler-in-Python
Learn how to build a web crawler in Python with this step-by-step guide for 2025.
Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

shubhampandit/ai-web-scraper
Web Scraper using Gen-AI
Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

manishkolla/Multi-Threaded-Web-Crawler
This project is a multi-threaded web crawler implemented in Java that efficiently explores websites using Jsoup for HTML parsing and ExecutorService for concurrent URL processing. It supports depth control, manages crawled URLs, and ensures that the crawler can resume from a previous state using a persistent state file.
Language: Java - Size: 9.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JoshuaWink/WebCrawler
A Python-based web crawler that maps website structure and extracts content. This tool can generate both text and Excel outputs of crawled pages along with visual sitemaps.
Language: Python - Size: 8.33 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

LucasMendesl/mugiwara
:tophat: a simple web scraping to extract and download videos from animesproject.com
Language: JavaScript - Size: 46.9 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 4

meffordh/KnowledgeHunter
Open Source Deep Research
Language: TypeScript - Size: 3.75 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

pythagoras-19/CrawlerRust
Another web crawler... but rusty
Language: Rust - Size: 67.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

deep5050/Abosar
অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.
Language: Python - Size: 88.3 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 13 - Forks: 2

hfreire/browser-as-a-service
A web browser :earth_americas: hosted as a service, to render your JavaScript web pages as HTML
Language: JavaScript - Size: 3.88 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 55 - Forks: 12

sebastianenger1981/CPAN
Webcrawler and SEO Web Spider: Software, die ich auf CPAN.org und METACPAN.org veröffentlicht habe
Language: Perl - Size: 101 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

iloveuic/UIC-MIS-ROBBER
🏫 At BNU-HKBU UIC, 100% course selected guarantee. 在北师港浸大,给你100%的保证抢到课。
Language: Python - Size: 10.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

So-Sori/webcrawlerhttp
What began as a straightforward web crawler has evolved into a versatile and feature-rich tool. It now provides users with the ability to extract, analyze, and present insightful data from various websites. The tool adapts to multiple needs, from data extraction to content discovery, providing a user-friendly interface for seamless interaction.
Language: JavaScript - Size: 788 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

oussemabenhassena5/Laptop-Scraper
🕸️ Advanced web scraper for extracting comprehensive laptop product information from TunisiaNet using Python, Selenium, and multi-format data export.
Language: Python - Size: 338 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

paganini2008/greenfinger
GreenFinger is a cutting-edge distributed web crawling framework built on Spring Cloud, PostgreSQL, and Elasticsearch, powered by the high-performance Netty NIO engine. It features an intuitive Web UI for managing and monitoring tasks, dynamic node scaling, and real-time data processing.
Language: Java - Size: 273 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

MadExploits/madrawler
Web crawler for finding easy endpoint
Language: Python - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

hedii/php-crawler 📦
A php crawler that finds emails on the internets
Language: PHP - Size: 1.42 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 134 - Forks: 65

VickyDev810/WebOasis-web-crawler
A web crawler made using QT framework for C++ uses Unsplash, Bing & Wikipedia API for working at the back-end crawls informtion based on the given query return images & a summary about the search query a user-friendly UI.
Language: C++ - Size: 4.28 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Andriy-Bilenko/WebLinkExtractor
Multithreaded C++ utility for recursively extracting webpage links, with PostgreSQL integration and Python bindings.
Language: C++ - Size: 76 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

FehmiTahsinDemirkan/Mindsite-Case
Mindsite Interview Task : Powerful web scraping tool for e-commerce data with email notifications and flexible data export. Supports N11 and Trendyol.
Language: Python - Size: 51.1 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Ahmard/queliwrap
QueryList PHP web scrapper wrapper
Language: PHP - Size: 93.8 KB - Last synced at: 23 days ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

shenxiangzhuang/PythonDataAnalysis
The data and code that used in my book.
Language: Jupyter Notebook - Size: 8.52 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 71 - Forks: 46
