Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: mapreduce
WonYong-Jang/Hadoop-labs
Hadoop Programming ( using Mapreduce)
Language: Java - Size: 1.15 MB - Last synced: about 2 hours ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0
casangi/graphviper
Dask Based MapReduce for Multi Xarray Datasets.
Language: Python - Size: 2.13 MB - Last synced: about 3 hours ago - Pushed: about 7 hours ago - Stars: 1 - Forks: 0
redisson/redisson
Redisson - Easy Redis Java client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...
Language: Java - Size: 25.2 MB - Last synced: about 6 hours ago - Pushed: about 14 hours ago - Stars: 22,789 - Forks: 5,266
collabH/bigdata-growth
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Language: Shell - Size: 215 MB - Last synced: about 1 hour ago - Pushed: about 15 hours ago - Stars: 1,294 - Forks: 324
apache/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
Language: Java - Size: 10.1 MB - Last synced: about 15 hours ago - Pushed: about 18 hours ago - Stars: 357 - Forks: 131
RedisGears/RedisGears
Dynamic execution framework for your Redis data
Language: Rust - Size: 4.8 MB - Last synced: about 4 hours ago - Pushed: about 15 hours ago - Stars: 355 - Forks: 62
cwensel/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Language: Java - Size: 32 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 342 - Forks: 222
groda/big_data
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Language: Jupyter Notebook - Size: 46.2 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 61 - Forks: 23
Jack-Christopher/PyDoop
PyDoop: Revolutionizing Big Data Processing 🚀
Language: Python - Size: 13.7 KB - Last synced: 8 days ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
datawhalechina/juicy-bigdata
🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉
Language: Python - Size: 27.4 MB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 233 - Forks: 34
oaarnikoivu/mapreduce
MapReduce architecture in Python
Language: Python - Size: 1.19 MB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0
Yaon-C2H8N2/Projet-SGD
Projet réalisé dans le cadre de l'UE Systèmes de Gestion de Documents à l'université de Bourgogne
Language: Python - Size: 1.57 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0
Jacob12138xieyuan/hadoop-mapreduce-with-python
hadoop mapreduce algorithm with hadoop streaming (Python)
Language: Jupyter Notebook - Size: 16.6 KB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 0
jamestiotio/dbsys
SUTD 2021 50.043 Database and Big Data Systems Code Dump
Language: Java - Size: 69.7 MB - Last synced: 14 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 3
JordiCorbilla/MapReduce
Data parallel text processing with MapReduce
Language: C# - Size: 17.2 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 2
wtanaka/ansible-role-apache-spark
Ansible role to install Apache Spark
Language: Shell - Size: 15.6 KB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0
windson/ReviewsByDate
Get date wise number of reviews in the descending order using HDInsight
Language: C# - Size: 45.5 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0
windson/HDInsight-TopN-Reviews-MapReduce
TopN Products by category using HDInsight Streaming MapReduce
Language: C# - Size: 53.2 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0
windson/HDInsight-Top-N-OverPriced-Products-MapReduce
Top N OverPriced Products Using HDInsight streaming MapReduce Job
Language: C# - Size: 77.1 MB - Last synced: 14 days ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0
ArangoGutierrez/GoReduce
A map reduce example made in pure Go
Language: Go - Size: 9.16 MB - Last synced: 14 days ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0
nellore/rail
Scalable RNA-seq analysis
Language: Python - Size: 249 MB - Last synced: 2 days ago - Pushed: over 3 years ago - Stars: 72 - Forks: 11
AlbertSuarez/CBDE-MapReduce
🗺️ Laboratory 4 of CBDE subject.
Language: Java - Size: 29.3 KB - Last synced: 15 days ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0
sujilnt/PythonCourserwork
A python Coursework on Bigdata and Mapreduce , This coursework related to distribute computing
Language: Python - Size: 37.1 KB - Last synced: 15 days ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0
jayybhatt/LS-Bloom-Filter
Implementation of bloom filter for large scale data using MapReduce
Language: Java - Size: 4.88 KB - Last synced: 16 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
gmarciani/mapreduce-app
Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.
Language: Shell - Size: 1000 Bytes - Last synced: 16 days ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0
deepjyotiroy079/native-mapreduce-hadoop
Native MapReduce Implementation in Hadoop for weather data.
Language: Java - Size: 6.84 KB - Last synced: 16 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
deepjyotiroy079/big-data-stack
Codes created while learning Big Data Stack.
Language: Jupyter Notebook - Size: 949 KB - Last synced: 16 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
Walrussin/MapReduce-Examples
Analyzing air quality index of eight states
Language: Java - Size: 35.6 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
pkill37/articles
A demo of MongoDB and the MapReduce programming model on a dummy articles dataset.
Language: JavaScript - Size: 169 KB - Last synced: 18 days ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0
Howeng98/MapReduce
CS542200 Parallel Programming HW4. Implement MapReduce by MPI + Pthread.
Language: C++ - Size: 5.25 MB - Last synced: 18 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1
big-data-team/big-data-course
Practice course on Big Data
Language: Jupyter Notebook - Size: 310 KB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 14 - Forks: 36
CamDavidsonPilon/tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Language: Python - Size: 91.8 KB - Last synced: 14 days ago - Pushed: about 1 year ago - Stars: 376 - Forks: 53
donnemartin/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Language: Python - Size: 46.8 MB - Last synced: 20 days ago - Pushed: about 2 months ago - Stars: 26,475 - Forks: 7,726
RababKaf/Projet_Gestion_Bancaire_BigData
Notre projet fusionne l'innovation tech et la gestion financière pour une expérience bancaire épique. Avec MongoDB, MapReduce et Node.js, notre app va ravir même les utilisateurs les plus exigeants. Prêts pour une gestion des comptes bancaires qui décoiffe ?
Language: EJS - Size: 11.6 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 0 - Forks: 0
kunxlv/stack-overflow-data-analysis
Data analysis of top 200,000 Stack Overflow queries with respect to their view count.
Language: Python - Size: 1.05 MB - Last synced: 21 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
Kaushal1011/CS441SimRankForGraphs
This is the implementation of an algorithm that finds traceability links in two graphs such that the other graph is a perturbed version of the original graph.
Language: Scala - Size: 1.41 MB - Last synced: 22 days ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
asuiu/streamerate
Iterable Java8 style Streams for Python
Language: Python - Size: 435 KB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 6 - Forks: 2
saarthak2002/serverless-mr
A Serverless implementation of MapReduce
Language: TeX - Size: 1.27 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0
jordicenzano/hadoop-tutorial
Initial experiments with Hadoop
Language: Java - Size: 271 KB - Last synced: 22 days ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0
N0-man/Kofun
Functional Programming concepts using Kotlin
Size: 2.93 KB - Last synced: 22 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
shellyln/go-graphdt
A datatable that represents object graphs for Go.
Language: Go - Size: 720 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
feng-li/Distributed-Statistical-Computing
Teaching Materials for Distributed Statistical Computing (大数据分布式计算教学材料)
Language: HTML - Size: 46.9 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 104 - Forks: 65
harrykieu/mpi-mapreduce-avgpts
Calculate Average Points using MapReduce + MPI
Language: Java - Size: 2.57 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 0 - Forks: 0
BobErgot/OTT-Movies-Insights-to-Recommendations
Analyze movie ratings and build a recommendation system using MapReduce. This project utilizes the Apriori algorithm, optimized for handling large datasets like the Netflix prize data, to provide personalized movie recommendations.
Language: Java - Size: 1.48 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 0 - Forks: 0
jishnub/ParallelUtilities.jl
Fast and easy parallel mapreduce on HPC clusters
Language: Julia - Size: 992 KB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 31 - Forks: 0
microsoft/Mobius
C# and F# language binding and extensions to Apache Spark
Language: C# - Size: 6.44 MB - Last synced: 11 days ago - Pushed: 4 months ago - Stars: 937 - Forks: 212
rodrigoorf/HadoopStudies
Repo with a few Hadoop exercises
Language: Java - Size: 72.3 KB - Last synced: 27 days ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
ImagineJHY/Imagine_MapReduce
MapReduce framework with C++11
Language: C++ - Size: 4.05 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 3 - Forks: 0
jaidevd/ipec-fdp
Language: Jupyter Notebook - Size: 1.34 MB - Last synced: 27 days ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0
dedalozzo/eoc-server
A complete CouchDB Query Server written in PHP.
Language: PHP - Size: 150 KB - Last synced: 27 days ago - Pushed: almost 8 years ago - Stars: 4 - Forks: 0
klebermagno/Artificial-Inteligence
The propouse of this project is organize some Artificial Inteligenc techinics and projects. So this project is structured in git submodules to others projects and README.md to documentate some techinics.
Size: 5.86 KB - Last synced: 28 days ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 0
xiangkangjw/assignments_template Fork of COS418F18/assignments_template
Language: Go - Size: 6.09 MB - Last synced: 29 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
changfubai/hadoop-wordcount
Language: Java - Size: 3.64 MB - Last synced: 29 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
cdapio/cdap
An open source framework for building data analytic applications.
Language: Java - Size: 608 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 735 - Forks: 339
wangz315/MapReduceApriori
MapReduce Apriori Algorithm
Language: C++ - Size: 4.99 MB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
minkcv/cloudal
The most popular Repo as a Service
Language: HTML - Size: 64.5 KB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0
saleyn/etran
Erlang Parse Transforms Including Fold (MapReduce) comprehension, Elixir-like Pipeline, and default function arguments
Language: Erlang - Size: 130 KB - Last synced: 17 days ago - Pushed: 7 months ago - Stars: 27 - Forks: 2
yennanliu/spark_emr_dev
Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce) | #DE
Language: Scala - Size: 3.72 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 3 - Forks: 1
oryband/go-web-mapreduce 📦
A MapReduce server using web browsers as workers, written in Go.
Language: Go - Size: 8.08 MB - Last synced: about 1 month ago - Pushed: over 8 years ago - Stars: 2 - Forks: 2
christian-konrad/mapreduce-invertedindexer-example
Simplified example of an Inverted Indexer for plain text documents built on Hadoop's MapReduce framework.
Language: Java - Size: 61.5 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
jsteinberg4/icarus
I Could Actually Really Use Support (ICARUS): A custom implementation of MapReduce
Language: C++ - Size: 97.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
GoCollaborate/src
A light-weight distributed stream computing framework for Golang
Language: Go - Size: 9.33 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 84 - Forks: 24
kevwan/mapreduce
A in-process MapReduce library to help you optimizing service response time or concurrent task processing.
Language: Go - Size: 44.9 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 160 - Forks: 23
ak1103dev/219351_homework
Web Application Development Homework
Language: Java - Size: 12.7 KB - Last synced: about 1 month ago - Pushed: almost 8 years ago - Stars: 1 - Forks: 0
PowerJob/PowerJob
Enterprise job scheduling middleware with distributed computing ability.
Language: Java - Size: 20.1 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 6,443 - Forks: 1,141
douban/dpark
Python clone of Spark, a MapReduce alike framework in Python
Language: Python - Size: 2.65 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 2,693 - Forks: 535
cubefs/compass
Compass is a task diagnosis platform for bigdata
Language: Java - Size: 5.88 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 304 - Forks: 118
mahmoudparsian/big-data-mapreduce-course
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Language: HTML - Size: 549 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 141 - Forks: 142
benedekh/bigdata-projects
Student projects in Big Data field.
Language: Java - Size: 184 KB - Last synced: 29 days ago - Pushed: about 2 months ago - Stars: 13 - Forks: 12
liboz/MIT-6.824
Go Concurrent Systems Projects
Language: Go - Size: 2.02 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
H1ghBre4k3r/rust-map-reduce
A small hobby implementation of MapReduce that I hacked together at 2am.
Language: Rust - Size: 27.3 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
goku321/dist-map-reduce
Distributed MapReduce word count application.
Language: Go - Size: 63.5 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
BobErgot/Large-Scale-Data-Processing-Design-Patterns
Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.
Language: Java - Size: 37 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
antoinebqt/Hadoop-MapReduce
School project to obtain a sorted list (in ascending order) of the films that have been a user's favorite the most times, using Hadoop MapReduce.
Language: Java - Size: 1.88 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
mimecast/dtail
DTail is a distributed DevOps tool for tailing, grepping, catting logs and other text files on many remote machines at once.
Language: Go - Size: 12.3 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 123 - Forks: 8
grycap/marla
MApReduce on AWS LAmbda
Language: Shell - Size: 17 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 3 - Forks: 5
CocaineCong/tangseng
Tangseng search engine including full text search and vector search base on golang. 基于go语言的搜索引擎,信息检索系统
Language: Go - Size: 6.07 MB - Last synced: 29 days ago - Pushed: about 2 months ago - Stars: 95 - Forks: 27
iflytek/Guitar
A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Language: Java - Size: 1.5 MB - Last synced: 28 days ago - Pushed: over 2 years ago - Stars: 85 - Forks: 22
ptobarra/Business-Intelligence-on-Big-Data-_-U-TAD-2017-Big-Data-Master-Final-Project
This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.
Language: Jupyter Notebook - Size: 130 MB - Last synced: 23 days ago - Pushed: about 6 years ago - Stars: 6 - Forks: 1
zrq166/distributed-system
WordCount using MapReduce
Language: Go - Size: 1.65 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
patrickakak/6.824-golab-2020
MIT 6.824-golab-2020
Language: Go - Size: 1.27 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
prem11k/Top-K-Heavy-Hitters
Language: Go - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
anchitrao/OrangeSyrup
A parallel cloud computing framework based on the core principles of Apache Hadoop.
Language: Go - Size: 130 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
bergstartup/Map-Reduce
An implementation to orchestrate map-reduce jobs among servers
Language: Go - Size: 35.4 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
aaqib-ahmed-nazir/BDA_Assignment02
This repository aims to develop a basic search engine utilizing Hadoop's MapReduce framework to index and process extensive text corpora efficiently. The dataset used for this project is a subset of the English Wikipedia dump, totaling 5.2 GB in size. The project focuses on implementing a naive search algorithm to address challenges in information.
Language: Jupyter Notebook - Size: 120 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0
ahtezaz123/hadoop-mapreduce-on-wikipedia-articles-
Big Data Analytics Assignment on Hadoop MapReduce
Language: Jupyter Notebook - Size: 5.54 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
jsgygujun/bigdata-study
大数据学习
Language: Scala - Size: 8.03 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
chaokunyang/athena
A task scheduler for spark, flink, mapreduce, java, python, bash
Language: Java - Size: 176 KB - Last synced: 30 days ago - Pushed: about 1 month ago - Stars: 3 - Forks: 3
flipkart-incubator/hbase-orm
A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)
Language: Java - Size: 363 KB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 77 - Forks: 41
mohammad-malik/naive-search
This repository houses a naïve search engine utilising MapReduce technology which leverages a 5GB csv file as dataset. It makes use of the Vector Space Model for Information Retrieval. This was developed as part of an assignment for the course Fundamentals of Big Data Analytics (DS2004).
Language: Python - Size: 983 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
Tencent/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Language: Java - Size: 1.63 MB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 248 - Forks: 75
grailbio/bigslice
A serverless cluster computing system for the Go programming language
Language: Go - Size: 2.66 MB - Last synced: 28 days ago - Pushed: 12 months ago - Stars: 545 - Forks: 35
mmd-nemati/OS-Course-CAs
A collection of projects for the Operating Systems course at the University of Tehran, Fall 2023. Featuring a networked restaurant application in C, a utility bill calculator using MapReduce in C++, and an image processing suite with both serialized and parallelized versions in C++.
Language: C - Size: 883 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
manojpannala/da-projects
List of projects based on Big data ecosystem.
Language: R - Size: 362 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
juagarmar/Reression-methods-with-pbdR
Reression methods with pbdR.
Language: R - Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 3 - Forks: 0
tahoe01/Passionfruit
Distributed Storage & Big Data Processing Framework
Language: Java - Size: 413 KB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
CSCLabTW/CloudDOE
CloudDOE is a user friendly software package to deploy, operate and extend a MapReduce-based bioinformatics environment, which is collectively denoted as a CloudDOE Cloud.
Language: Java - Size: 49.5 MB - Last synced: about 2 months ago - Pushed: about 10 years ago - Stars: 1 - Forks: 1
CSCLabTW/CloudEC
CloudEC is a MapReduce-based algorithm for correcting errors in next-generation sequencing big data.
Language: Java - Size: 107 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0
Abdurrehman7452/search-engine-utilising-hadoop-MapReduce-technology-with-python-on-wikipedia-articles
Developing a Naive Search Engine Utilising Apache Hadoop MapReduce Technology on a dataset in comma-separated values (CSV) format containing around 5 million Wikipedia articles provided by Wikimedia, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
goldmansachs/MRWord2Vec
A MapReduce / Hadoop implementation of Word2Vec
Language: Java - Size: 52.7 KB - Last synced: about 2 months ago - Pushed: about 2 years ago - Stars: 17 - Forks: 11