An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hadoop-ecosystem

madd86/awesome-system-design

A curated list of awesome System Design (A.K.A. Distributed Systems) resources.

Size: 1.71 MB - Last synced at: about 21 hours ago - Pushed at: about 1 year ago - Stars: 10,464 - Forks: 1,174

Flixteu356/BigData-Architecture

Big Data system predicts pandemic risk (COVID-19) via data analysis, ML modeling, and real-time dashboard.

Language: Python - Size: 35.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Hazim-HF/Data-Management

This repository covers data management and big data technologies, including databases, querying, and big data processing. Topics include Hadoop (MapReduce, HDFS), Apache Spark, data security, and optimization techniques. Students will learn Spark’s architecture, data distribution, parallel computing, and memory caching to enhance big data solutions

Language: Jupyter Notebook - Size: 65.2 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

dhkdn9192/data_engineer_career

DE직무에 필요한 모든 것

Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 201 - Forks: 28

ZuInnoTe/hadoopoffice

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)

Language: Java - Size: 7.61 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 64 - Forks: 31

jodth07/hadoop-installation

Instructions on setting up Hadoop, HDFS, java, sbt, kafka, scala, spark and flume on Ubuntu 18.04

Language: Shell - Size: 61.5 KB - Last synced at: 11 months ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 15

ArwaEiad/TMDB-Project

This project focuses on analyzing movie data using Pyspark tailored for efficient data processing on Hadoop Distributed File System (HDFS)

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

hyeonsangjeon/dataplatform

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

Language: Shell - Size: 549 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 1

keyhong/keyhong.github.repo

항상 꾸준히 공부하고, 기록하고, 배움을 쌓아 올리는 나만의 지식 메모리

Language: HTML - Size: 1.63 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SarahAyaz/YouTube_Data_Analysis

Analysis of YouTube Data using Hadoop Mapreduce framework in Java.

Language: Java - Size: 24.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

Jayvardhan-Reddy/BigData-Ecosystem-Architecture

Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.

Language: Shell - Size: 562 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 13 - Forks: 16

satyajeetmaharana/floodprediction

The goal of this project is to identify the flood-prone areas with probabilities of flood in counties in a future date, using Spark MLLib.

Language: Scala - Size: 3.46 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Cigna/ibis

IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.

Language: Python - Size: 749 KB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 51 - Forks: 15

oykuyildirim/Flume-Service

Getting tweets using Flume service and analyzing tweets

Size: 288 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

tingjhenjiang/bigdata_docker_images

資料平行批次與串流處理以及搭建機器學習環境會用到的container

Language: Dockerfile - Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

pfisterer/apache-knox-helm

Helm chart for Apache Knox

Language: Mustache - Size: 1020 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

pfisterer/apache-knox-docker

Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker

Language: Dockerfile - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 7

meliodaseren/mapreduce-demo

Hadoop MapReduce

Language: Java - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mayankskb/Hadoop-Times

Practise programs in hadoop ecosystem for refrence

Size: 4.29 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

Rohit-Jain-2801/HadoopInstallGuide

Apache Hadoop Components Installation Guide on Windows

Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

AnkitaSinha98/Customer360-Data-Analysis

Big Data is Stored and analyzed of various Customer using Hadoop and other tools like Hive, Zookeeper, Hbase and sqoop and all details of the customer is analyzed then result are given.This result is very useful for companies.

Size: 292 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

meliodaseren/avro-file-format

Avro File Format Quick Start Tutorial

Language: Java - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

PrathameshNimkar/Big-Data-Analysis-using-the-Hadoop-Ecosystem

Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.

Size: 8.61 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

alex-ber/docker-hive Fork of ops-guru/docker-hive

EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5

Language: Shell - Size: 45.9 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

DiegoBulhoes/hadoop-ansible-single-node

Ambiente com o objetivo de praticar o uso das ferramentas Ansible e Hadoop usando uma única instância

Language: Shell - Size: 23.4 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

reggert/cumulative

[Work in progress] Client library for simplified access to Apache Accumulo

Language: Scala - Size: 176 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

m-r-tanha/Hadoop-Ecosystem

This repository is going to update based on my challenges in installing and using the Hadoop's tools Spark

Size: 395 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

simple-learning/Hadoop

Hadoop Projects

Language: Java - Size: 28.7 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

meliodaseren/spark-sql-demo

SparkSQL Quick Start Tutorial

Language: Scala - Size: 95.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 2

f2e-awesome/HadoopEcosystem

Hadoop 生态体系(ecosystem)

Language: JavaScript - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

meliodaseren/spark-streaming-kafka-demo

Spark Streaming & Kafka Quick Start Tutorial

Language: Scala - Size: 9.79 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

PykaAlexandro/A-MapReduce-Vademecum-via-Hadoop

Some basic procedures for parallel computing in the Hadoop environment

Language: Python - Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

meliodaseren/hadoop-2.7.3-ha Fork of reaganwei0216/hadoop-2.7.3-ha

This is the hadoop configuration files for system HA

Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

meliodaseren/spark-demo

Learning Spark 2 on Cloudera, programming with scala 2.10.

Language: Java - Size: 18.6 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

meliodaseren/hive-udf-demo

Hive

Language: Java - Size: 18.6 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

meliodaseren/structure-streaming-demo

Structure Streaming Quick Start Tutorial

Language: Scala - Size: 10.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0