Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: crawler4j

Muhammad-Elgendi/Distributed-crawler4j

Distributed crawler4j using the Java Agent Development Environment (JADE framework)

Language: Java - Size: 9.54 MB - Last synced: about 2 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 1

rzo1/crawler4j (fork of yasserg/crawler4j)

Open Source Web Crawler for Java - A maintained fork of yasserg/crawler4j

Language: Java - Size: 1.9 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 22 - Forks: 4

brianmadden/krawler

A web crawling framework written in Kotlin

Language: Kotlin - Size: 403 KB - Last synced: about 2 months ago - Pushed: almost 3 years ago - Stars: 130 - Forks: 16

peterchenhdu/future-framework

future-framework project. https://issues.sonatype.org/browse/OSSRH-41434

Language: Java - Size: 3.47 MB - Last synced: 4 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

yasirerkam/YSRsearch

Search Engine

Language: CSS - Size: 26.4 MB - Last synced: 4 months ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1

chanddu/Book-Search-Engine

Search Engine for Books (Java, Apache Lucene, crawler4j, Apache Spark)

Language: Java - Size: 6.84 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 9 - Forks: 0

tirthmehta/Google-Cloud-Platform-based-Hadoop-Map-Reduce

Determines which words occur in a dataset of textbooks, along with each word's occurrence count, using a Dataproc cluster on Google Cloud Platform.

Language: Java - Size: 1010 KB - Last synced: 8 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

Keerthivasan13/CSCI572-Information_Retrieval_And_Web_Search_Engines

Search Engine projects

Language: Java - Size: 34.5 MB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 11 - Forks: 17

asifzubair/information_retrieval

Information Retrieval and Web Search Engines

Language: PHP - Size: 2.25 MB - Last synced: 10 months ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

wwyqianqian/information-retrieval

Information retrieval.

Language: Java - Size: 271 KB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

javagaorui5944/ProxyIpPool

The Crawler Proxy IP Pool Component

Language: Java - Size: 102 KB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 65 - Forks: 26

soberqian/Java-Carwler-Technology

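Web Data Collection Technology: Java Web Crawlers (complete code for the book manuscript, covering the various techniques and concepts of web crawling)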

Language: Java - Size: 31.3 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 61 - Forks: 20

guillevc/eli5-crawling

Crawling and searching reddit.com/r/explainlikeimfive

Language: Java - Size: 2.18 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

AMOOOMA/StockDataCrawler

Stock Data Crawler made with crawler4j, data from wsj.com

Language: Java - Size: 8.78 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

fedor-malyshkin/story_line2_crawler

StoryLine 2. News site crawler (based on my own fork of edu.uci.ics:crawler4j)

Language: Java - Size: 240 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

LUMR/crawler-job

Distributed web crawler

Language: Java - Size: 85 KB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1

mahimagupta/WebCrawler

Language: Java - Size: 1.19 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

mukeshkdangi/nypost_crawler

Language: Java - Size: 144 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

renuka-raju/Building-a-Web-Search-Engine

Language: Java - Size: 11.7 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0