Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mapreduce

WonYong-Jang/Hadoop-labs

Hadoop programming (using MapReduce)

Language: Java - Size: 1.15 MB - Last synced: about 2 hours ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

casangi/graphviper

Dask Based MapReduce for Multi Xarray Datasets.

Language: Python - Size: 2.13 MB - Last synced: about 3 hours ago - Pushed: about 7 hours ago - Stars: 1 - Forks: 0

redisson/redisson

Redisson - Easy Redis Java client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...

Language: Java - Size: 25.2 MB - Last synced: about 6 hours ago - Pushed: about 14 hours ago - Stars: 22,789 - Forks: 5,266

collabH/bigdata-growth

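A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.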

Language: Shell - Size: 215 MB - Last synced: about 1 hour ago - Pushed: about 15 hours ago - Stars: 1,294 - Forks: 324

apache/incubator-uniffle

Uniffle is a high performance, general purpose Remote Shuffle Service.

Language: Java - Size: 10.1 MB - Last synced: about 15 hours ago - Pushed: about 18 hours ago - Stars: 357 - Forks: 131

RedisGears/RedisGears

Dynamic execution framework for your Redis data

Language: Rust - Size: 4.8 MB - Last synced: about 4 hours ago - Pushed: about 15 hours ago - Stars: 355 - Forks: 62

cwensel/cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.

Language: Java - Size: 32 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 342 - Forks: 222

groda/big_data

Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.

Language: Jupyter Notebook - Size: 46.2 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 61 - Forks: 23

Jack-Christopher/PyDoop

PyDoop: Revolutionizing Big Data Processing 🚀

Language: Python - Size: 13.7 KB - Last synced: 8 days ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

datawhalechina/juicy-bigdata

🎉🎉🐳 Datawhale Introduction to Big Data Processing tutorial | The opening course of the big data technology track 🎉🎉

Language: Python - Size: 27.4 MB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 233 - Forks: 34

oaarnikoivu/mapreduce

MapReduce architecture in Python

Language: Python - Size: 1.19 MB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0

Yaon-C2H8N2/Projet-SGD

Project carried out as part of the Document Management Systems (Systèmes de Gestion de Documents) course unit at the University of Burgundy

Language: Python - Size: 1.57 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0

Jacob12138xieyuan/hadoop-mapreduce-with-python

Hadoop MapReduce algorithm with Hadoop Streaming (Python)

Language: Jupyter Notebook - Size: 16.6 KB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 0

jamestiotio/dbsys

SUTD 2021 50.043 Database and Big Data Systems Code Dump

Language: Java - Size: 69.7 MB - Last synced: 14 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 3

JordiCorbilla/MapReduce

Data parallel text processing with MapReduce

Language: C# - Size: 17.2 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 2

wtanaka/ansible-role-apache-spark

Ansible role to install Apache Spark

Language: Shell - Size: 15.6 KB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0

windson/ReviewsByDate

Get the number of reviews per date, in descending order, using HDInsight

Language: C# - Size: 45.5 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

windson/HDInsight-TopN-Reviews-MapReduce

TopN Products by category using HDInsight Streaming MapReduce

Language: C# - Size: 53.2 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

windson/HDInsight-Top-N-OverPriced-Products-MapReduce

Top N OverPriced Products Using HDInsight streaming MapReduce Job

Language: C# - Size: 77.1 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

ArangoGutierrez/GoReduce

A map reduce example made in pure Go

Language: Go - Size: 9.16 MB - Last synced: 14 days ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

nellore/rail

Scalable RNA-seq analysis

Language: Python - Size: 249 MB - Last synced: 2 days ago - Pushed: over 3 years ago - Stars: 72 - Forks: 11

AlbertSuarez/CBDE-MapReduce

🗺️ Laboratory 4 of CBDE subject.

Language: Java - Size: 29.3 KB - Last synced: 15 days ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0

sujilnt/PythonCourserwork

Python coursework on big data and MapReduce, related to distributed computing

Language: Python - Size: 37.1 KB - Last synced: 15 days ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

jayybhatt/LS-Bloom-Filter

Implementation of bloom filter for large scale data using MapReduce

Language: Java - Size: 4.88 KB - Last synced: 16 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

gmarciani/mapreduce-app

Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.

Language: Shell - Size: 1000 Bytes - Last synced: 16 days ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0

deepjyotiroy079/native-mapreduce-hadoop

Native MapReduce Implementation in Hadoop for weather data.

Language: Java - Size: 6.84 KB - Last synced: 16 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

deepjyotiroy079/big-data-stack

Codes created while learning Big Data Stack.

Language: Jupyter Notebook - Size: 949 KB - Last synced: 16 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

Walrussin/MapReduce-Examples

Analyzing air quality index of eight states

Language: Java - Size: 35.6 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

pkill37/articles

A demo of MongoDB and the MapReduce programming model on a dummy articles dataset.

Language: JavaScript - Size: 169 KB - Last synced: 18 days ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

Howeng98/MapReduce

CS542200 Parallel Programming HW4. Implement MapReduce by MPI + Pthread.

Language: C++ - Size: 5.25 MB - Last synced: 18 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

big-data-team/big-data-course

Practice course on Big Data

Language: Jupyter Notebook - Size: 310 KB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 14 - Forks: 36

CamDavidsonPilon/tdigest

t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed environments like PySpark

Language: Python - Size: 91.8 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 376 - Forks: 53

donnemartin/data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Language: Python - Size: 46.8 MB - Last synced: 20 days ago - Pushed: about 2 months ago - Stars: 26,475 - Forks: 7,726

RababKaf/Projet_Gestion_Bancaire_BigData

Our project blends tech innovation and financial management for an epic banking experience. With MongoDB, MapReduce, and Node.js, our app will delight even the most demanding users. Ready for bank account management that blows you away?

Language: EJS - Size: 11.6 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 0 - Forks: 0

kunxlv/stack-overflow-data-analysis

Data analysis of top 200,000 Stack Overflow queries with respect to their view count.

Language: Python - Size: 1.05 MB - Last synced: 21 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

Kaushal1011/CS441SimRankForGraphs

This is the implementation of an algorithm that finds traceability links in two graphs such that the other graph is a perturbed version of the original graph.

Language: Scala - Size: 1.41 MB - Last synced: 22 days ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

asuiu/streamerate

Iterable Java8 style Streams for Python

Language: Python - Size: 435 KB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 6 - Forks: 2

saarthak2002/serverless-mr

A Serverless implementation of MapReduce

Language: TeX - Size: 1.27 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0

jordicenzano/hadoop-tutorial

Initial experiments with Hadoop

Language: Java - Size: 271 KB - Last synced: 22 days ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0

N0-man/Kofun

Functional Programming concepts using Kotlin

Size: 2.93 KB - Last synced: 22 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

shellyln/go-graphdt

A datatable that represents object graphs for Go.

Language: Go - Size: 720 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

feng-li/Distributed-Statistical-Computing

Teaching Materials for Distributed Statistical Computing (teaching materials for big data distributed computing)

Language: HTML - Size: 46.9 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 104 - Forks: 65

harrykieu/mpi-mapreduce-avgpts

Calculate Average Points using MapReduce + MPI

Language: Java - Size: 2.57 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 0 - Forks: 0

BobErgot/OTT-Movies-Insights-to-Recommendations

Analyze movie ratings and build a recommendation system using MapReduce. This project utilizes the Apriori algorithm, optimized for handling large datasets like the Netflix prize data, to provide personalized movie recommendations.

Language: Java - Size: 1.48 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 0 - Forks: 0

jishnub/ParallelUtilities.jl

Fast and easy parallel mapreduce on HPC clusters

Language: Julia - Size: 992 KB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 31 - Forks: 0

microsoft/Mobius

C# and F# language binding and extensions to Apache Spark

Language: C# - Size: 6.44 MB - Last synced: 11 days ago - Pushed: 4 months ago - Stars: 937 - Forks: 212

rodrigoorf/HadoopStudies

Repo with a few Hadoop exercises

Language: Java - Size: 72.3 KB - Last synced: 27 days ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

ImagineJHY/Imagine_MapReduce

MapReduce framework with C++11

Language: C++ - Size: 4.05 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 3 - Forks: 0

jaidevd/ipec-fdp

Language: Jupyter Notebook - Size: 1.34 MB - Last synced: 27 days ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

dedalozzo/eoc-server

A complete CouchDB Query Server written in PHP.

Language: PHP - Size: 150 KB - Last synced: 27 days ago - Pushed: almost 8 years ago - Stars: 4 - Forks: 0

klebermagno/Artificial-Inteligence

The purpose of this project is to organize some artificial intelligence techniques and projects. It is structured as git submodules pointing to other projects, with a README.md documenting some of the techniques.

Size: 5.86 KB - Last synced: 28 days ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 0

xiangkangjw/assignments_template Fork of COS418F18/assignments_template

Language: Go - Size: 6.09 MB - Last synced: 29 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

changfubai/hadoop-wordcount

Language: Java - Size: 3.64 MB - Last synced: 29 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

cdapio/cdap

An open source framework for building data analytic applications.

Language: Java - Size: 608 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 735 - Forks: 339

wangz315/MapReduceApriori

MapReduce Apriori Algorithm

Language: C++ - Size: 4.99 MB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

minkcv/cloudal

The most popular Repo as a Service

Language: HTML - Size: 64.5 KB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0

saleyn/etran

Erlang Parse Transforms Including Fold (MapReduce) comprehension, Elixir-like Pipeline, and default function arguments

Language: Erlang - Size: 130 KB - Last synced: 17 days ago - Pushed: 7 months ago - Stars: 27 - Forks: 2

yennanliu/spark_emr_dev

Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce) | #DE

Language: Scala - Size: 3.72 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 3 - Forks: 1

oryband/go-web-mapreduce 📦

A MapReduce server using web browsers as workers, written in Go.

Language: Go - Size: 8.08 MB - Last synced: about 1 month ago - Pushed: over 8 years ago - Stars: 2 - Forks: 2

christian-konrad/mapreduce-invertedindexer-example

Simplified example of an Inverted Indexer for plain text documents built on Hadoop's MapReduce framework.

Language: Java - Size: 61.5 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

jsteinberg4/icarus

I Could Actually Really Use Support (ICARUS): A custom implementation of MapReduce

Language: C++ - Size: 97.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

GoCollaborate/src

A light-weight distributed stream computing framework for Golang

Language: Go - Size: 9.33 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 84 - Forks: 24

kevwan/mapreduce

An in-process MapReduce library to help you optimize service response time or concurrent task processing.

Language: Go - Size: 44.9 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 160 - Forks: 23

ak1103dev/219351_homework

Web Application Development Homework

Language: Java - Size: 12.7 KB - Last synced: about 1 month ago - Pushed: almost 8 years ago - Stars: 1 - Forks: 0

PowerJob/PowerJob

Enterprise job scheduling middleware with distributed computing ability.

Language: Java - Size: 20.1 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 6,443 - Forks: 1,141

douban/dpark

Python clone of Spark, a MapReduce-like framework in Python

Language: Python - Size: 2.65 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 2,693 - Forks: 535

cubefs/compass

Compass is a task diagnosis platform for bigdata

Language: Java - Size: 5.88 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 304 - Forks: 118

mahmoudparsian/big-data-mapreduce-course

Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University

Language: HTML - Size: 549 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 141 - Forks: 142

benedekh/bigdata-projects

Student projects in Big Data field.

Language: Java - Size: 184 KB - Last synced: 29 days ago - Pushed: about 2 months ago - Stars: 13 - Forks: 12

liboz/MIT-6.824

Go Concurrent Systems Projects

Language: Go - Size: 2.02 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

H1ghBre4k3r/rust-map-reduce

A small hobby implementation of MapReduce that I hacked together at 2am.

Language: Rust - Size: 27.3 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

goku321/dist-map-reduce

Distributed MapReduce word count application.

Language: Go - Size: 63.5 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

BobErgot/Large-Scale-Data-Processing-Design-Patterns

Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.

Language: Java - Size: 37 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

antoinebqt/Hadoop-MapReduce

School project to obtain a sorted list (in ascending order) of the films that have been a user's favorite the most times, using Hadoop MapReduce.

Language: Java - Size: 1.88 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

mimecast/dtail

DTail is a distributed DevOps tool for tailing, grepping, catting logs and other text files on many remote machines at once.

Language: Go - Size: 12.3 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 123 - Forks: 8

grycap/marla

MApReduce on AWS LAmbda

Language: Shell - Size: 17 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 3 - Forks: 5

CocaineCong/tangseng

Tangseng search engine, including full-text search and vector search, based on Go. A Go-based search engine and information retrieval system.

Language: Go - Size: 6.07 MB - Last synced: 29 days ago - Pushed: about 2 months ago - Stars: 95 - Forks: 27

iflytek/Guitar

A Simple and Efficient Distributed Multidimensional BI Analysis Engine.

Language: Java - Size: 1.5 MB - Last synced: 28 days ago - Pushed: over 2 years ago - Stars: 85 - Forks: 22

ptobarra/Business-Intelligence-on-Big-Data-_-U-TAD-2017-Big-Data-Master-Final-Project

This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.

Language: Jupyter Notebook - Size: 130 MB - Last synced: 23 days ago - Pushed: about 6 years ago - Stars: 6 - Forks: 1

zrq166/distributed-system

WordCount using MapReduce

Language: Go - Size: 1.65 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

patrickakak/6.824-golab-2020

MIT 6.824-golab-2020

Language: Go - Size: 1.27 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

prem11k/Top-K-Heavy-Hitters

Language: Go - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

anchitrao/OrangeSyrup

A parallel cloud computing framework based on the core principles of Apache Hadoop.

Language: Go - Size: 130 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

bergstartup/Map-Reduce

An implementation to orchestrate map-reduce jobs among servers

Language: Go - Size: 35.4 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

aaqib-ahmed-nazir/BDA_Assignment02

This repository aims to develop a basic search engine utilizing Hadoop's MapReduce framework to index and process extensive text corpora efficiently. The dataset used for this project is a subset of the English Wikipedia dump, totaling 5.2 GB in size. The project focuses on implementing a naive search algorithm to address challenges in information.

Language: Jupyter Notebook - Size: 120 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

ahtezaz123/hadoop-mapreduce-on-wikipedia-articles-

Big Data Analytics Assignment on Hadoop MapReduce

Language: Jupyter Notebook - Size: 5.54 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

jsgygujun/bigdata-study

Big data study

Language: Scala - Size: 8.03 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

chaokunyang/athena

A task scheduler for spark, flink, mapreduce, java, python, bash

Language: Java - Size: 176 KB - Last synced: 30 days ago - Pushed: about 1 month ago - Stars: 3 - Forks: 3

flipkart-incubator/hbase-orm

A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)

Language: Java - Size: 363 KB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 77 - Forks: 41

mohammad-malik/naive-search

This repository houses a naïve search engine utilising MapReduce technology which leverages a 5GB csv file as dataset. It makes use of the Vector Space Model for Information Retrieval. This was developed as part of an assignment for the course Fundamentals of Big Data Analytics (DS2004).

Language: Python - Size: 983 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

Tencent/Firestorm

Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers

Language: Java - Size: 1.63 MB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 248 - Forks: 75

grailbio/bigslice

A serverless cluster computing system for the Go programming language

Language: Go - Size: 2.66 MB - Last synced: 28 days ago - Pushed: 12 months ago - Stars: 545 - Forks: 35

mmd-nemati/OS-Course-CAs

A collection of projects for the Operating Systems course at the University of Tehran, Fall 2023. Featuring a networked restaurant application in C, a utility bill calculator using MapReduce in C++, and an image processing suite with both serialized and parallelized versions in C++.

Language: C - Size: 883 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

manojpannala/da-projects

List of projects based on Big data ecosystem.

Language: R - Size: 362 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

juagarmar/Reression-methods-with-pbdR

Regression methods with pbdR.

Language: R - Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 3 - Forks: 0

tahoe01/Passionfruit

Distributed Storage & Big Data Processing Framework

Language: Java - Size: 413 KB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

CSCLabTW/CloudDOE

CloudDOE is a user friendly software package to deploy, operate and extend a MapReduce-based bioinformatics environment, which is collectively denoted as a CloudDOE Cloud.

Language: Java - Size: 49.5 MB - Last synced: about 2 months ago - Pushed: about 10 years ago - Stars: 1 - Forks: 1

CSCLabTW/CloudEC

CloudEC is a MapReduce-based algorithm for correcting errors in next-generation sequencing big data.

Language: Java - Size: 107 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

Abdurrehman7452/search-engine-utilising-hadoop-MapReduce-technology-with-python-on-wikipedia-articles

Developing a naive search engine using Apache Hadoop MapReduce on a dataset in comma-separated values (CSV) format containing around 5 million Wikipedia articles provided by Wikimedia, as part of an assignment for the Fundamentals of Big Data Analytics (DS2004) course.

Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

goldmansachs/MRWord2Vec

A MapReduce / Hadoop implementation of Word2Vec

Language: Java - Size: 52.7 KB - Last synced: about 2 months ago - Pushed: about 2 years ago - Stars: 17 - Forks: 11