GitHub topics: webcrawler | Ecosyste.ms: Repos

dbaofd/pushingbarriers-webcrawler

It can be used to grab fixtures for different club teams in Queensland.

Language: Python - Size: 95.7 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Corin-R/stadtradeln

This is a private project to crawl the stadtradeln event for a single city.

Language: Python - Size: 2.21 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

LOKESH-loky/Concurrent-Web-Crawler

The Concurrent Web Crawler is a Go-based application designed to crawl web pages efficiently using Go's powerful concurrency features.

Language: Go - Size: 12.7 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Lehoczky/apro-scrape

Helpful web scraper for hardverapro.hu

Language: TypeScript - Size: 5.09 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 0

crawlab-team/crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Language: Go - Size: 23.6 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 11,737 - Forks: 1,833

WebCrawlerAPI/webcrawlerapi-js-sdk

A WebcrawlerAPI SDK for Node JS

Language: TypeScript - Size: 41 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

This project provides a REST API that allows users to submit URLs for crawling. The app internally uses RabbitMQ to publish the URLs, and then listens back to fetch the contents of the URLs using Jsoup. The app also scrapes links and indexes the content using Apache Lucene.

Language: Java - Size: 112 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

Aravindha1234u/SocialScraper

Social Scraper is a python tool meant for Detection of Child Predators/Cyber Harassers on Social Media

Language: Python - Size: 739 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 13

dipanshuchaubey/ecom-price-crawler

A simple web crawler which tracks price of products on Flipkart and Amazon.

Language: JavaScript - Size: 8.79 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

ptrumpis/snap-lens-web-crawler

JavaScript library to crawl and download Snap Lenses from lens.snapchat.com with ease.

Language: JavaScript - Size: 183 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 7 - Forks: 4

Galarzaa90/TibiaKt

Kotlin library to fetch and parse Tibia.com pages.

Language: Kotlin - Size: 32.9 MB - Last synced at: about 18 hours ago - Pushed at: 10 days ago - Stars: 1 - Forks: 2

swalsh76/SillySpider

Got bored so wrote a crawler / downloader with GPT4o. Maybe someone can use it for something ¯\_(ツ)_/¯

Language: Python - Size: 13.7 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

yufree/scifetch

webpage crawling tools for pubmed, google scholar and rss

Language: R - Size: 44.9 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 13 - Forks: 6

lgcarmo/WebHunterScreen

This program aims to check active targets by saving screenshots in a project.

Language: Python - Size: 5.57 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 13 - Forks: 0

whats2000/CodeBRT

CodeBRT is an AI program generation plugin for VSCode. It helps you quickly generate code through AI, thus improving development efficiency.

Language: TypeScript - Size: 7.09 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 25 - Forks: 4

Lucs1590/cobWeb

🌧 🐛.🌿 Web crawler to get data from weather, bugs and plant!

Language: Python - Size: 9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

z0m31en7/Uscrapper

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

Language: Python - Size: 438 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 590 - Forks: 61

JaCraig/Spidey

A multi threaded web crawler library that is generic enough to allow different engines to be swapped in.

Language: C# - Size: 23.9 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 12 - Forks: 3

Jeanetted3v/Web-Crawler-Playground

A playground to testing out website crawling tools

Language: Python - Size: 16.6 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

amirgamil/apollo

A Unix-style personal search engine and web crawler for your digital footprint.

Language: Go - Size: 532 KB - Last synced at: about 3 hours ago - Pushed at: over 1 year ago - Stars: 1,373 - Forks: 52

openviglet/turing

:sparkles: :dna: Turing ES - Enterprise Search, Semantic Navigation, Chatbot using Search Engine and Generative AI.

Language: Java - Size: 298 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 58 - Forks: 9

xyzxyq/Deepseek-QandA-system-based-on-web-crawler-knowledge-base

该项目是基于网络爬虫，首先通过对百度搜索引擎对关键字进行搜索进而获取最实时性的消息，然后将获取到的消息创建为知识库，在调用deepseek模型时使用知识库中的内容让模型基于该内容进行回答

Language: Python - Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

nayandas69/SEO-Sentinel

SEO Sentinel Your site’s SEO glow-up BFF! Sniff out broken links, missing metadata, & keyword drama while serving fire HTML reports.

Size: 188 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 11 - Forks: 0

rohitajariwal/web-app-security-scanner

A web crawler and vulnerability scanner tool developed by Rohit Ajariwal

Language: Python - Size: 32.2 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 1

NSYSU-OpenDev/NSYSUCourseAPI

中山大學選課列表API

Language: Python - Size: 258 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 2 - Forks: 0

fengqimin/WebAnts

一个用httpx实现的简单异步网络爬虫框架。

Language: Python - Size: 211 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

Nqhuy300106/open_deep_research

Together Open Deep Research

Language: Python - Size: 236 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

iBz-04/Hudgent

Official code implementation for my ready tensor publication, an ai agent that retrieves data from an islamic website -> uses the data as alignment criteria to answer the user

Language: Python - Size: 32.3 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

hesamz3090/Moss

Moss is a lightweight, efficient, and modular web crawler designed to explore, analyze, and extract data from the vast landscape of the internet.

Language: Python - Size: 14.6 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

pyoneerC/Mercadix

Price histogram generator for MercadoLibre product listings.

Language: HTML - Size: 19.4 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 16 - Forks: 0

sudonym-i/Web-Scraper

A web scraper that follows a chain specifid in 'crawlchain.txt', and collects data using start/stop points (ex <a> and </a>).

Language: Makefile - Size: 36.9 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 2 - Forks: 0

MertenD/node-crawler

Node-Crawler is a highly customizable, Node-based web application for creating web crawlers and further processing and transforming the retrieved data.

Language: TypeScript - Size: 1.25 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 2

Conso1eCowb0y/Deepminer 📦

Deep web crawler and search engine

Language: Python - Size: 14.6 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 50 - Forks: 12

GeneralNewsExtractor/GeneralNewsExtractor

新闻网页正文通用抽取器 Beta 版.

Language: Python - Size: 17.4 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 3,715 - Forks: 538

jishnukoliyadan/NAV_Scrapper

NAV Scraper is a Python tool that fetches real-time stock and mutual fund NAVs, merges them with holdings data, and stores results in a database or exports to JSON. It supports automated daily updates via cron jobs and uses rate-limited API calls for efficient scraping.

Language: Python - Size: 24.4 KB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

zorlan/skycaiji

蓝天采集器是一款开源免费的爬虫系统，仅需点选编辑规则即可采集数据，可运行在本地、虚拟主机或云服务器中，几乎能采集所有类型的网页，无缝对接各类CMS建站程序，免登录实时发布数据，全自动无需人工干预！是网页大数据采集软件中完全跨平台的云端爬虫系统

Language: PHP - Size: 24.9 MB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 1,986 - Forks: 593

GeminidSystems/GoogleNewsScraper

A Python package that scrapes Google News article data while remaining undetected by Google. Our scraper can scrape page data up until the last page and never trigger a CAPTCHA (download stats: https://pepy.tech/project/GoogleNewsScraper)

Language: Python - Size: 15.3 MB - Last synced at: 30 days ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 5

ssssssss-team/spider-flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

Language: Java - Size: 3.23 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 9,902 - Forks: 1,908

MCStreetguy/Crawler

An advanced web-crawler written in PHP.

Language: PHP - Size: 224 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 0

iiicebearrr/spiders-for-all

A set of useful and scalable spiders to crawl data/videos from bilibili, xiaohongshu, etc.

Language: Python - Size: 1.06 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 11

Aavache/LLMWebCrawler

A Web Crawler based on LLMs implemented with Ray and Huggingface. The embeddings are saved into a vector database for fast clustering and retrieval. Use it for your RAG.

Language: Python - Size: 20.5 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 10

Acollie/Go-Webcrawler

Webcrawler in Go with a graph database and DynamoDB for backing

Language: Go - Size: 2.18 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

kingname/SourceCodeOfBook

《Python爬虫开发从入门到实战》配套源代码。

Language: Python - Size: 85.1 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 364 - Forks: 127

KELVI23/Java-Web-Crawler

Web Crawler project that navigates the web and indexes pages. Project makes use of Jsoup (Java html parsing library). It crawls webpages at the depth of 2 and returns target title, links and text then saves them to a file.

Language: HTML - Size: 43 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

LanHao99/pubfetch

a simple python-based pubmed abstract fetcher

Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Avesay/imdb_search

Web crawler designed for scraping data about top 250 movies on IMDb.

Language: Python - Size: 123 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

itsNavinSingh/crawler

A Web Crawler for crawling the internet

Language: C++ - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

scrapinghub/scrapyrt

HTTP API for Scrapy spiders

Language: Python - Size: 233 KB - Last synced at: 28 days ago - Pushed at: 11 months ago - Stars: 852 - Forks: 160

XenosWarlocks/MultiCrawl

MultiCrawl is a powerful and flexible web crawling framework that provides multiple crawling strategies to suit different use cases and performance requirements. The library supports sequential, threaded, and asynchronous crawling methods, making it adaptable to various data extraction needs.

Language: Python - Size: 567 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

3nock/SpiderSuite

Advance web security spider/crawler

Size: 6.98 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 634 - Forks: 70

leticosta4/API_dados_processos

API Flask com web crawling para coleta de dados sobre processos jurídicos

Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mehmetozkaya/DotnetCrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Language: C# - Size: 70.3 KB - Last synced at: about 21 hours ago - Pushed at: over 2 years ago - Stars: 176 - Forks: 66

pavlovtech/WebReaper

Web scraper, crawler and parser in C#. Designed as simple, declarative and scalable web scraping solution.

Language: C# - Size: 37.3 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 119 - Forks: 28

havardnyboe/dagenidag

Gjenskapning av NRKs side 199 fra Tekst-TV

Language: TypeScript - Size: 4.27 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

mominurr/Web-Scraping-Projects

Explore a variety of web scraping projects showcasing my skills and experience in extracting valuable data and solving complex challenges.

Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/realSelf.com_scraper

realself.com data scraper that scrape website all information and bypass ip blocking and press & hold captcha.

Size: 146 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

mominurr/Real-Estate-Web-Scraping

Real Estate Web Scraping – Collects comprehensive property and agent data while bypassing IP blocking measures

Size: 1.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/Google-Map-Scraping

google map scraper collect google map all available data and collect email from business website.

Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/Social-Media-Scraping

Social Media Scraping – Scrapes data from TikTok, LinkedIn, Facebook, and Twitter (X.com), including user profiles, posts, engagement metrics, and comments.

Size: 324 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mominurr/cars.com

Cars.com Scraper – Extracts car listings (make, model, year, price, seller details) from cars.com using Selenium and BeautifulSoup, saving data in CSV format.

Language: Python - Size: 555 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

salimk/Rcrawler

An R web crawler and scraper

Language: R - Size: 597 KB - Last synced at: 30 days ago - Pushed at: about 3 years ago - Stars: 354 - Forks: 92

mominurr/Yellow-Pages-Data-Scraping

Yellow Pages Data Scraping – Automates the extraction of business details (name, email, phone, address, website) from Yellow Pages directories, providing structured and accurate data.

Size: 191 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

code-yeongyu/TrackPurchase

단 몇줄의 코드로 다양한 쇼핑 플랫폼에서 결제 내역을 긁어오자!

Language: TypeScript - Size: 619 KB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 0

antsinar/CrawlerAPI

An async web crawler implemented as a web API, mainly for educational purposes.

Language: Python - Size: 152 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

voliveirajr/seleniumcrawler

An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site

Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 127 - Forks: 45

jaeksoft/opensearchserver

Open-source Enterprise Grade Search Engine Software

Language: Java - Size: 498 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 505 - Forks: 189

kleindasash/Content-Grabber

Content Grabber is a powerful software for automatic data extraction from websites.

Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

DedSecInside/gotor

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Language: Go - Size: 10.7 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 166 - Forks: 44

mominurr/stackoverflow.com

A web scraper collecting Stack Overflow questions for NLP, using threading and user-agent rotation

Language: Python - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

gdgd009xcd/RequestRecorder

A ZAPROXY Add-on that allows testing of web application vulnerabilities by recording complex multi-step sequences. You can test applications that need to access pages in a specific order, such as shopping carts or registration of member information.

Language: Java - Size: 50.7 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 4

Ns81000/ai-chat-web-crawler

🤖 Chat with AI models that respond in real-time to your questions and prompts. 🕸️ Crawl websites to extract valuable information with adjustable depth and limits. 📄 Process documents like PDFs and text files to include in your AI conversations.

Language: Python - Size: 26.4 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

adrianosferreira/afrodite.json

O maior livro de receitas culinárias em língua portuguesa

Size: 540 KB - Last synced at: about 1 month ago - Pushed at: almost 9 years ago - Stars: 187 - Forks: 43

devBhas/DevCrawler

DevCrawler - An LLM Friendly Web Crawler & Data Scraper

Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

rodri-santos/SD-2024

Webcrawler feito no âmbito da cadeira Sistemas Distribuídos do 2º semestre de 2023/2024 do 3º ano da Licenciatura em Engenharia e Ciência de Dados

Language: Java - Size: 946 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

glaubermag/reddit_web_crawler

Este projeto Python coleta dados do Reddit (posts e comentários), armazena em um banco de dados PostgreSQL e fornece ferramentas para análise de sentimentos e previsão de tendências. Inclui scripts para coleta de dados, manipulação de banco de dados e consultas.

Language: Python - Size: 36.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

shinrenpan/WebParser

網頁爬蟲

Language: Swift - Size: 39.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

JacobTaylor3/Web-Crawler

Web Crawler for ethical hackers / pen testers

Language: Python - Size: 40 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

OsamaNagi/http-health-checker

🕷️ Go Web Crawler - A lightning-fast concurrent web crawler for performing deep health checks on websites. Built with Go's powerful concurrency primitives.

Language: Go - Size: 32.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SlidrusForeal/Webcrawler

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

datacollectionspecialist/Web-Crawler-in-Python

Learn how to build a web crawler in Python with this step-by-step guide for 2025.

Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

shubhampandit/ai-web-scraper

Web Scraper using Gen-AI

Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

manishkolla/Multi-Threaded-Web-Crawler

This project is a multi-threaded web crawler implemented in Java that efficiently explores websites using Jsoup for HTML parsing and ExecutorService for concurrent URL processing. It supports depth control, manages crawled URLs, and ensures that the crawler can resume from a previous state using a persistent state file.

Language: Java - Size: 9.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JoshuaWink/WebCrawler

A Python-based web crawler that maps website structure and extracts content. This tool can generate both text and Excel outputs of crawled pages along with visual sitemaps.

Language: Python - Size: 8.33 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

LucasMendesl/mugiwara

:tophat: a simple web scraping to extract and download videos from animesproject.com

Language: JavaScript - Size: 46.9 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 4

meffordh/KnowledgeHunter

Open Source Deep Research

Language: TypeScript - Size: 3.75 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

pythagoras-19/CrawlerRust

Another web crawler... but rusty

Language: Rust - Size: 67.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

deep5050/Abosar

অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.

Language: Python - Size: 88.3 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 13 - Forks: 2

hfreire/browser-as-a-service

A web browser :earth_americas: hosted as a service, to render your JavaScript web pages as HTML

Language: JavaScript - Size: 3.88 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 55 - Forks: 12

sebastianenger1981/CPAN

Webcrawler and SEO Web Spider: Software, die ich auf CPAN.org und METACPAN.org veröffentlicht habe

Language: Perl - Size: 101 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

iloveuic/UIC-MIS-ROBBER

🏫 At BNU-HKBU UIC, 100% course selected guarantee. 在北师港浸大，给你100%的保证抢到课。

Language: Python - Size: 10.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

So-Sori/webcrawlerhttp

What began as a straightforward web crawler has evolved into a versatile and feature-rich tool. It now provides users with the ability to extract, analyze, and present insightful data from various websites. The tool adapts to multiple needs, from data extraction to content discovery, providing a user-friendly interface for seamless interaction.

Language: JavaScript - Size: 788 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

oussemabenhassena5/Laptop-Scraper

🕸️ Advanced web scraper for extracting comprehensive laptop product information from TunisiaNet using Python, Selenium, and multi-format data export.

Language: Python - Size: 338 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

paganini2008/greenfinger

GreenFinger is a cutting-edge distributed web crawling framework built on Spring Cloud, PostgreSQL, and Elasticsearch, powered by the high-performance Netty NIO engine. It features an intuitive Web UI for managing and monitoring tasks, dynamic node scaling, and real-time data processing.

Language: Java - Size: 273 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1