An open API service providing repository metadata for many open source software ecosystems.

gitlab.com topics: hdfs

yhm-amber/chrislusf.seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lakes, built to handle billions of files. The blob store has O(1) disk seek and cloud tiering. The Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, and Erasure Coding. https://github.com/chrislusf/seaweedfs.wiki.git

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

tayoso/big-data-jobs

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

rychly-edu/theses/dist-forensic-digital-data-repo

Distributed storage for digital forensic data with a data/metadata repository, an API for queries and incoming/outgoing data, indexing, a plug-in system for as-yet-unsupported data types, etc.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

amit-kamat/Word-counting-hadoop

A coherent introduction to the Hadoop environment and HDFS.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

amit-kamat/Map-Reduce-Ukraine

This project aggregates trending data from Ukraine-based Twitter accounts. The raw aggregated data is cleansed before analysis using big-data methods. The purpose of this project is to familiarize myself with the workings of Hadoop, specifically HDFS and the MapReduce infrastructure.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

braineering/moviedoop

MapReduce application that analyzes movie ratings collected by MovieLens, leveraging Hadoop MapReduce, the Hadoop Distributed File System (HDFS), and Apache Flume. Coursework in Structures and Architectures for Big Data 2016/2017.

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

ccis-irad/spark-analytics

Analytics developed for platform

Last synced at: over 2 years ago - Stars: 0 - Forks: 1