gitlab.com topics: hdfs
yhm-amber/chrislusf.seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. https://github.com/chrislusf/seaweedfs.wiki.git
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

leo-plese/big-data-algorithms/apache-hadoop-framework-hdfs-mapreduce-programming-model
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

rychly-edu/theses/dist-forensic-digital-data-repo
Distributed storage for digital forensic data with data/metadata repository, API for queries and incoming/outgoing data, indexing, plug-in system for yet unsupported data-types, etc.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
amit-kamat/Word-counting-hadoop
A coherent introduction to the Hadoop environment and HDFS.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

amit-kamat/Map-Reduce-Ukraine
This project aggregates trending data from Ukraine based Twitter accounts. The raw aggregated data is cleansed before analysis using some Big-data methods. The purpose of this project is to familiarize myself with the workings of Hadoop for HDFS and Map-Reduce infrastructure.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

braineering/moviedoop
Map/Reduce application that analyzes movie ratings collected by Movielens, leveraging Hadoop MapReduce, Hadoop Distributed File System and Apache Flume. Coursework in Structures and Architectures for Big Data 2016/2017.
Last synced at: over 2 years ago - Stars: 1 - Forks: 0

ccis-irad/spark-analytics
Analytics developed for platform
Last synced at: over 2 years ago - Stars: 0 - Forks: 1