An open API service providing repository metadata for many open source software ecosystems.

Topic: "hbase"

heibaiying/BigData-Notes

大数据入门指南 :star:

Language: Java - Size: 22.9 MB - Last synced at: about 9 hours ago - Pushed at: over 1 year ago - Stars: 16,368 - Forks: 4,277

zhisheng17/flink-learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Language: Java - Size: 41.6 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 14,744 - Forks: 3,931

aalansehaiyang/technology-talk

【大厂面试专栏】一份Java程序员需要的技术指南,这里有面试题、系统架构、职场锦囊、主流中间件等,让你成为更牛的自己!

Size: 127 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 14,404 - Forks: 3,789

wangzhiwubigdata/God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Size: 66.3 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 10,043 - Forks: 3,201

JanusGraph/janusgraph

JanusGraph: an open-source, distributed graph database

Language: Java - Size: 58 MB - Last synced at: 2 days ago - Pushed at: 22 days ago - Stars: 5,476 - Forks: 1,190

apache/hbase

Apache HBase

Language: Java - Size: 475 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 5,328 - Forks: 3,347

dunwu/db-tutorial

📚 后端程序员应该掌握的主流数据库知识

Language: Java - Size: 12.4 MB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 4,585 - Forks: 588

MoRan1607/BigDataGuide

大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

Size: 154 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 2,871 - Forks: 898

vector4wang/spring-boot-quick

:herb: 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、k3s、k3d、k8s、mybatis加解密插件、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等:pushpin:

Language: Java - Size: 5.15 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 2,611 - Forks: 915

geekyouth/SZT-bigdata

深圳地铁大数据客流分析系统🚇🚄🌟

Language: Scala - Size: 42.1 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 2,344 - Forks: 608

baidu/tera

An Internet-Scale Database.

Language: C++ - Size: 15.7 MB - Last synced at: 14 days ago - Pushed at: 11 months ago - Stars: 1,900 - Forks: 435

farmerjohngit/myblog

有深度的Java技术博客

Size: 13.7 KB - Last synced at: 17 days ago - Pushed at: about 6 years ago - Stars: 1,841 - Forks: 289

gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties

Language: Java - Size: 218 MB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1,781 - Forks: 359

water8394/BigData-Interview

:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Size: 6.59 MB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 1,610 - Forks: 446

collabH/bigdata-growth

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

Language: Shell - Size: 221 MB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 1,579 - Forks: 377

HariSekhon/Dockerfiles

50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak

Language: Shell - Size: 7.73 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 1,339 - Forks: 472

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

Size: 938 KB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 1,303 - Forks: 465

docs4dev/docs4dev

后端开发常用框架文档及中文翻译,包含 Spring 系列文档(Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session),大数据(Apache Hive, HBase, Apache Flume),日志(Log4j2, Logback),Http Server(NGINX,Apache),Python,数据库(OpenTSDB,MySQL,PostgreSQL)等最新官方文档以及对应的中文翻译。

Size: 1.54 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 1,302 - Forks: 221

HariSekhon/Nagios-Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Language: Python - Size: 8.83 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 1,145 - Forks: 507

apache/phoenix

Apache Phoenix

Language: Java - Size: 80.7 MB - Last synced at: about 5 hours ago - Pushed at: about 6 hours ago - Stars: 1,037 - Forks: 1,007

HariSekhon/DevOps-Python-tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Language: Python - Size: 3.11 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 794 - Forks: 347

sunnyandgood/BigData

💎🔥大数据学习笔记

Language: Java - Size: 316 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 647 - Forks: 222

gangly/datafaker

Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具

Language: Python - Size: 1.36 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 617 - Forks: 167

TurboWay/spiderman

基于 scrapy-redis 的通用分布式爬虫框架

Language: Python - Size: 4.26 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 602 - Forks: 128

locationtech/geowave

GeoWave provides geospatial and temporal indexing on top of Accumulo, HBase, BigTable, Cassandra, Kudu, Redis, RocksDB, and DynamoDB.

Language: Java - Size: 937 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 508 - Forks: 191

Raray-chuan/xichuan_note

xichuan的学习总结笔记,覆盖了java、spring、java其他常用框架,以及大数据相关组件等📚

Language: Java - Size: 10.9 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 492 - Forks: 95

fabiogjardim/bigdata_docker

Big Data Ecosystem Docker

Language: VBA - Size: 126 MB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 407 - Forks: 319

imposter-project/imposter-jvm-engine

Scriptable, multipurpose mock server. Run standalone mock servers, or embed mocks within your tests or CI/CD pipeline.

Language: Kotlin - Size: 14.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 387 - Forks: 62

dajobe/hbase-docker

HBase running in Docker

Language: Shell - Size: 36.1 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 331 - Forks: 188

datawhalechina/juicy-bigdata

🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉

Language: Python - Size: 27.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 312 - Forks: 43

nerdammer/spark-hbase-connector

Connect Spark to HBase for reading and writing data with ease

Language: Scala - Size: 127 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 299 - Forks: 108

hbase-rdd/hbase-rdd

Spark RDD to read, write and delete from HBase

Language: Scala - Size: 607 KB - Last synced at: 19 days ago - Pushed at: over 4 years ago - Stars: 276 - Forks: 114

rayokota/hgraphdb

HBase as a TinkerPop Graph Database

Language: Java - Size: 825 KB - Last synced at: 17 days ago - Pushed at: 21 days ago - Stars: 256 - Forks: 54

paypal/gimel

Big Data Processing Framework - Unified Data API or SQL on Any Storage

Language: Scala - Size: 62.1 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 244 - Forks: 81

adaltas/node-hbase

Asynchronous HBase client for NodeJs using REST

Language: CoffeeScript - Size: 368 KB - Last synced at: about 9 hours ago - Pushed at: 11 months ago - Stars: 242 - Forks: 73

apache/hbase-connectors

Apache HBase Connectors

Language: Scala - Size: 1.04 MB - Last synced at: about 18 hours ago - Pushed at: about 2 months ago - Stars: 240 - Forks: 179

huangfox/dpkb

大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse

Size: 81.1 KB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 229 - Forks: 60

HariSekhon/HAProxy-configs

80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.

Language: Shell - Size: 496 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 220 - Forks: 79

Chabane/bigdata-playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Language: TypeScript - Size: 3.08 MB - Last synced at: 13 days ago - Pushed at: about 6 years ago - Stars: 209 - Forks: 74

LinMingQiang/sparkstreaming

:boom: :rocket: 封装sparkstreaming动态调节batch time(有数据就执行计算);:rocket: 支持运行过程中增删topic;:rocket: 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。

Language: Scala - Size: 258 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 182 - Forks: 83

apachecn/hbase-doc-zh 📦

:book: HBase 中文参考指南

Language: JavaScript - Size: 3.07 MB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 181 - Forks: 58

apache/hbase-operator-tools

Apache HBase Operator Tools

Language: Java - Size: 575 KB - Last synced at: about 18 hours ago - Pushed at: 5 months ago - Stars: 177 - Forks: 147

rayokota/awesome-hbase

A curated list of awesome HBase projects and resources.

Size: 81.1 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 172 - Forks: 38

singgel/SpringBoot-Templates

springboot和dubbo、netty的集成,redis mongodb的nosql模板, kafka rocketmq rabbit的MQ模板, solr solrcloud elasticsearch查询引擎

Language: Java - Size: 2.31 MB - Last synced at: 23 days ago - Pushed at: almost 3 years ago - Stars: 171 - Forks: 101

HY-ZhengWei/HBaseClient

HBase客户端数据管理软件

Language: Java - Size: 344 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 161 - Forks: 69

sburn/docker-apache-atlas

This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.

Language: Shell - Size: 131 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 135 - Forks: 70

phelps-sg/python-bigdata

Data science and Big Data with Python

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 129 - Forks: 197

TFdream/blog

个人技术博客,博文写在 Issues 里。

Size: 60.4 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 124 - Forks: 18

kakao/hbase-region-inspector

A visual dashboard of HBase region statistics

Language: Clojure - Size: 543 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 107 - Forks: 29

waterguo/antsdb

AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase

Language: Java - Size: 1.08 GB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 105 - Forks: 28

gglinux/wifi

基于wifi抓取信息的大数据查询分析系统

Language: Java - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 105 - Forks: 64

LuckyZXL2016/Cloud-Note

基于分布式的云笔记(参考某道云笔记),数据存储在redis与hbase中

Language: Java - Size: 3.23 MB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 98 - Forks: 44

HariSekhon/DevOps-Perl-tools

25+ DevOps CLI Tools - Anonymizer, SQL ReCaser (MySQL, PostgreSQL, AWS Redshift, Snowflake, Apache Drill, Hive, Impala, Cassandra CQL, Microsoft SQL Server, Oracle, Couchbase N1QL, Dockerfiles), Hadoop HDFS & Hive tools, Solr/SolrCloud CLI, Nginx stats & HTTP(S) URL watchers for load-balanced web farms, Linux tools etc.

Language: Perl - Size: 2.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 94 - Forks: 43

pinterest/orion

Management and automation platform for Stateful Distributed Systems

Language: Java - Size: 1.09 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 32

tlhhup/litemall-dw

基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。

Language: Java - Size: 9.98 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 82 - Forks: 43

flipkart-incubator/hbase-orm

A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)

Language: Java - Size: 363 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 81 - Forks: 41

cdapio/hadoop_cookbook

Cookbook to install Hadoop 2.0+ using Chef

Language: Ruby - Size: 1.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 81 - Forks: 77

greenplum-db/pxf

Platform Extension Framework: Federated Query Engine

Language: Java - Size: 27.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 79 - Forks: 60

IBM/sparksql-for-hbase

Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers

Size: 614 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 69 - Forks: 27

hw2499/etl-engine

etl engine 轻量级 跨平台 流批一体ETL引擎 数据抽取-转换-装载 ETL engine lightweight cross platform batch flow integration ETL engine data extraction transformation loading

Language: Go - Size: 1.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 68 - Forks: 13

TurboWay/pybigdata

使用 python 操作大数据的各种组件

Language: Python - Size: 85 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 63 - Forks: 18

zdkzdk/aaocp

一个对用户行为日志进行分析的大数据项目

Language: PLpgSQL - Size: 74.8 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 56 - Forks: 20

pkeropen/BigData-News

基于Spark2.2新闻网大数据实时系统项目

Language: Scala - Size: 187 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 55 - Forks: 21

chenxingxing6/disk

基于hadoop+hbase+springboot实现分布式网盘系统

Language: JavaScript - Size: 51.4 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 55 - Forks: 28

v5tech/cloud

云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件

Language: Shell - Size: 31.7 MB - Last synced at: 7 days ago - Pushed at: about 8 years ago - Stars: 54 - Forks: 43

barseghyanartur/starbase

DEPRECATED - HBase Stargate (REST API) client wrapper for Python.

Language: Python - Size: 255 KB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 53 - Forks: 32

Maicius/WebLogsAnalysisSystem

A big data platform for analyzing web access logs

Language: Java - Size: 3.79 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 35

liumingmusic/HadoopLearning

全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn、hbase、kafka、scala、sparkcore、sparkstreaming、sparksql。教程包含所有的源代码演示以及在线文档说明。

Language: Scala - Size: 5.95 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 24

ColumbiaDVMM/ColumbiaImageSearch

Columbia Image and Face Search tool for MEMEX

Language: Python - Size: 30.5 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 52 - Forks: 30

sergevs/ansible-cloudera-hadoop

ansible playbook to deploy cloudera hadoop components to the cluster

Language: Shell - Size: 6.3 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 52 - Forks: 41

novabyte/diver

A HBase driver for Erlang/Elixir using Jinterface and the Asynchbase Java client to query the database.

Language: Java - Size: 24.9 MB - Last synced at: 16 days ago - Pushed at: over 8 years ago - Stars: 51 - Forks: 8

DarkPhoenixs/connection-pool-client

💥 A simple multi-purpose connection pool client (Kafka & Hbase & Redis & RMDB & Socket & Http)

Language: Java - Size: 899 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 48 - Forks: 50

asdf2014/yuzhouwan

Code Library for My Blog

Language: Java - Size: 43.8 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 47 - Forks: 23

dunwu/bigdata-tutorial

Language: Java - Size: 8.81 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 46 - Forks: 16

vivek2319/Learn-Hadoop-and-Spark

This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.

Language: Python - Size: 211 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 46 - Forks: 39

3601314/hbase-python

hbase-python is a pure python package used to access HBase.

Language: Python - Size: 209 KB - Last synced at: 12 days ago - Pushed at: 12 months ago - Stars: 42 - Forks: 18

mysql-time-machine/replicator

MySQL Replicator. Replicates MySQL tables to Kafka and HBase, keeping the data changes history in HBase.

Language: Java - Size: 34.6 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 25

xgugeng/dev-notes

开发进阶笔记

Language: Objective-C - Size: 22.1 MB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 42 - Forks: 17

kakao/mango 📦

Core utility library & data connectors designed for simpler usage in Scala

Language: Scala - Size: 2.66 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 41 - Forks: 13

LB-Yu/data-systems-learning

Learning summary and examples about data systems.

Language: Java - Size: 984 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 40 - Forks: 34

DarkPhoenixs/hbase-meta-repair

Repair hbase metadata table from hdfs.

Language: Java - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 31

kailanyue/SZ-Metro

深圳地铁大数据客流分析系统

Language: Java - Size: 20.6 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 39 - Forks: 11

bigdata-labs/spark2-hadoop2.6-hbase-labs

Language: Shell - Size: 1.18 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 39 - Forks: 13

baifendian/swordfish

Open-source distribute workflow schedule tools, also support streaming task.

Language: Java - Size: 3.84 MB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 38 - Forks: 18

junneyang/xxhadoop

Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !

Language: Java - Size: 16.3 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 37 - Forks: 16

29DCH/Real-time-log-analysis-system

:penguin:基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)

Language: Java - Size: 357 KB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 34

kakao/hbase-packet-inspector

Analyzes network traffic of HBase RegionServers

Language: Clojure - Size: 490 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 5

apache/hbase-native-client

Apache HBase Native Client

Language: C++ - Size: 829 KB - Last synced at: 19 minutes ago - Pushed at: about 3 years ago - Stars: 36 - Forks: 19

winstonelei/BigDataTools

tools for bigData

Language: Java - Size: 235 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 36 - Forks: 24

kplxq/talos

Language: Java - Size: 2.37 MB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 36 - Forks: 12

jorgeacf/dockerfiles

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Language: Shell - Size: 479 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 34 - Forks: 11

agile-lab-dev/darwin

Avro Schema Evolution made easy

Language: Scala - Size: 2.67 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 34 - Forks: 10

yuan-more/bigdata-book

上百本大数据电子书,附带下载链接,包括计算机基础,Java,hadoop,spark,flink,kafka,hbase,hive,数仓等

Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 10

kakao/cmux

A set of commands for managing CDH clusters using Cloudera Manager REST API.

Language: Ruby - Size: 113 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 33 - Forks: 8

apache/flink-connector-hbase

Apache flink

Language: Java - Size: 607 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 32 - Forks: 34

rainmaple/WIFI_BussinessBigDataAnalyseSystem

A System is designed to analyse BigData collect from Wifi probe

Language: JavaScript - Size: 98.6 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 31 - Forks: 18

agile-lab-dev/wasp

WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.

Language: Scala - Size: 7.59 MB - Last synced at: 17 days ago - Pushed at: 19 days ago - Stars: 30 - Forks: 11

OrangeDrk/JavaNotes

Java后端学习笔记。包括Linux、maven、git、互联网架构、大数据体系等

Size: 149 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 29 - Forks: 9

huangjianqin/SparkRecommerSystem

基于Spark的实时推荐系统,使用MovieLens作为测试数据集

Language: JavaScript - Size: 34.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 13

bomeng/Heracles

High performance HBase / Spark SQL engine

Language: Scala - Size: 488 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 29 - Forks: 12