An open API service providing repository metadata for many open source software ecosystems.

Topic: "hdfs-dfs"

linkedin/dynamometer

A tool for scale and performance testing of HDFS with a specific focus on the NameNode.

Language: Java - Size: 297 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 129 - Forks: 36

AhmetFurkanDEMIR/Data-Engineering-Project-with-HDFS-and-Kafka

Data Engineering Project with Hadoop HDFS and Kafka

Language: Python - Size: 3.46 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 25

Subham2S/BigData-Engineering-Capstone-Project-1

BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git

Language: Python - Size: 15.2 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

guangie88/hdfs-to-local

Go program to copy/sync directory recursively from HDFS server to local storage.

Language: Go - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

ShreevathsaBK/Mimic-HDFS Fork of isj25/hadoop2021_bigdata

Simulation of a Hadoop distributed file system

Language: Python - Size: 85.9 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

Youssef22Ashraf/-Electricity-Consumption-Prediction-in-Egypt

This project aims to address Egypt's energy challenges by leveraging data-driven solutions. With increasing demand from urban centers and industries, conventional approaches such as random power cuts have proven ineffective. To tackle this issue, we are adopting a proactive strategy grounded in data analytics.

Language: Jupyter Notebook - Size: 188 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

brunoleomenezes/ararajuba

The Ararajuba script aims to identify whether Optimized Row Columnar files in the Hadoop Distributed File System are corrupted, for this purpose it uses the count method and analyzes the difference in schemas in the tables.

Language: Shell - Size: 227 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

monoid/hdfesse

WIP: hdfs/libhdfs drop-in replacements without Java

Language: Rust - Size: 525 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ekane3/HiveSQL

Simple Hive CRUD queries

Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

jakekemple/Hadoop-Tweet-Wordcounter

A Hadoop Wordcounter Job - Retrieves tweets and runs a MapReduce wordcounter for sentimental analysis

Language: Python - Size: 537 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 1

milenkovicm/testcontainers-minidfs-rs

Rust MiniDFS (local HDFS) Testcontainer

Language: Rust - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

GirishCodeAlchemy/News-sentiment-ML-ETL-pipeline

News Sentiment Analysis using ETL pipeline

Language: Jupyter Notebook - Size: 37.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sinevmaxim/WebHDFSClient

Big Data project. Web client for HDFS. Working in the terminal. Has ability to manipulate local and Hadoop storage

Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ekane3/MapReduce

A project displaying examples of MapReduce jobs, using the "Remarkable Trees of Paris" dataset (https://opendata.paris.fr/explore/dataset/arbresremarquablesparis/information/,

Language: Java - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

jaipal24/Consumer-Behavioral-System

This is a sample data analysis project on consumer behavior system using HDFS,HIVE and PYSPARK.

Size: 2.93 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

amineelalaoui/Hadoop-clone

Language: Java - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

sulaiman-muhammad/Hadoop-KNN-Map-Reduce

Map-Reduce paradigm in Apache Hadoop for KNN algorithm based on Kaggle Titanic Dataset

Language: Python - Size: 1.75 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

mohammedyunus009/hadoop_learn_old

This is old repository from my archive . Hope this might help me in near future

Language: Java - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

SakhriHoussem/MapReduce-Python

MapReduce Python Example

Language: Python - Size: 20.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 4