Topic: "hadoop-streaming"
monisjaved/Data-Processing-With-Hadoop
Text Processing Using Hadoop
Language: Jupyter Notebook - Size: 21 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 2

LMAPcoder/Hadoop-on-Colab
Installation and configuration of Hadoop on Google Colaboratory
Language: Jupyter Notebook - Size: 620 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 5

adrian83/go-hadoop-streaming
Hadoop Streaming example written in Go
Language: Go - Size: 1.24 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0

peloyeje/map543-dijkstra-mapreduce-spark
[MAP543] Hadoop Streaming (MapReduce) and Spark implementations of the Dijkstra shortest path algorithm
Language: Jupyter Notebook - Size: 23.4 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 4

Lapis-Hong/ctrcount
Python MapReduce statistics
Language: Python - Size: 3.58 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 0

thedatasociety/lab-hadoop
Language: PLpgSQL - Size: 4.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 7

sreetamparida/Hiraishin
A REST-based service that translates SQL queries into MapReduce and Spark jobs, runs those jobs, and returns the results as a JSON object.
Language: Python - Size: 194 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

jomarsilio/Bootcamp-IGTI-Analista-de-Dados
Bootcamp taught by IGTI that intensively covers the concepts and practices of data analysis, preparing students to work professionally in the field.
Language: Jupyter Notebook - Size: 127 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

highoncarbs/hadoopwithpy
🐘 ➕ 🐍 Learning Hadoop with Python
Language: Python - Size: 86.6 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

MandarGogate/Association-Rule-Mining-Hadoop-Python
A case study on mining association rules between different factors related to deaths of people in the United States
Language: Python - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 2

krishnadey30/Intro-to-Hadoop-and-MapReduce
Language: Python - Size: 6.54 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

GINK03/aws-emr-streaming-templates
AWS Elastic Map Reduce Streaming Templates
Language: Python - Size: 9.72 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

KeerthanaJ-rec/210701118-CS19P16-DA-Lab
Data Analytics Laboratory
Language: R - Size: 23.1 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

Jacob12138xieyuan/hadoop-mapreduce-with-python
Hadoop MapReduce algorithms with Hadoop Streaming (Python)
Language: Jupyter Notebook - Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

simple-learning/Hadoop
Hadoop Projects
Language: Java - Size: 28.7 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

BurraAbhishek/Python_Hadoop_MapReduce_MarketBasketAnalysis
Market Basket Analysis using Hadoop MapReduce in Python
Language: Python - Size: 103 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

vbugaevskii/hadoop-streaming-protoseq
A small example library showing how to work with binary files in Hadoop Streaming.
Language: Java - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

6vedant/TwitterAnalyticsHadoop
Twitter Streaming Analytics Project (Big Data Analysis using Hadoop)
Language: Java - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

tertiarycourses/ApacheHadoop
Exercise files for Apache Hadoop Big Data Training
Size: 63.5 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

yanglgtm/hadoop-skeleton
A skeleton Hadoop Streaming script
Language: Shell - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

niz-ka/pbd-project
Repository for a university Big Data course
Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

YMaher99/Parallelizing-the-Feedforward-Operation-of-Neural-Networks-in-Hadoop-MapReduce
Leveraging the MapReduce paradigm, we propose a solution to parallelize the feedforward operation of neural networks, speeding it up for sufficiently large NN architectures and sufficiently large datasets. Tested using the MNIST dataset; results can be found in the results.html and results.ipynb files.
Language: HTML - Size: 2.1 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

shreyasnagare/GCP-Dataproc-Hadoop-MapReduce
A Hadoop MapReduce application that finds the maximum temperature for each day of the years 1901 and 1902 from the NCDC weather records.
Language: Python - Size: 11.2 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KhanShaheb34/MapReduce
Learning Hadoop MapReduce Using Python
Language: Python - Size: 54.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Sidl419/hadoop_streaming
Building a recommender system based on the collaborative filtering algorithm and Hadoop Streaming
Language: Python - Size: 1.94 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

AleksaMCode/odabrana-poglavlja-iz-operativnih-sistema
Solutions to exam sessions for the course Selected Topics in Operating Systems (Odabrana poglavlja iz operativnih sistema) at the Faculty of Electrical Engineering in Banja Luka.
Language: C# - Size: 41 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

aditeyabaral/mapreduce-word2vec
Implementation of Word2Vec for large datasets as a Map-Reduce Job using Hadoop Streaming.
Language: Python - Size: 1.45 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

crazypegasusvv/Mutations 📦
Mutations
Size: 1.56 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

NilufaYeasmin/MapReduce
This repo contains implementations of MapReduce programs run on a large text corpus in an Apache Hadoop environment | Nilufa Yeasmin | https://www.linkedin.com/in/nilufayeasmin/
Language: CSS - Size: 3.53 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

antoinewg/ocr-page-rank
PageRank algorithm using Hadoop Streaming
Language: Python - Size: 438 KB - Last synced at: 16 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

antoinewg/ocr-tfidf
TF-IDF with Hadoop Streaming
Language: Python - Size: 64.5 KB - Last synced at: 16 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

bug-data/Big_Data_First_Project
First project for Big Data course held at Roma Tre University
Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

ggodreau/huhdewp
Hadoop streaming EMR job
Language: Python - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

amitthere/clustering-algorithms
K-Means, Hierarchical Agglomerative, Density-based, and MapReduce K-Means clustering implemented on 2 gene datasets in Python
Language: Python - Size: 81.1 KB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

HarigovindV10/NYC-Subway-Data-Analysis
An analysis of NYC Subway Data using Hadoop Map Reduce
Language: Jupyter Notebook - Size: 529 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

SakhriHoussem/MapReduce-Python
MapReduce Python Example
Language: Python - Size: 20.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 4

ahmedhumza94/udacity-intro-to-hadoop-and-mapreduce
Repository containing Python code for MapReduce jobs that answer questions about Udacity forum data.
Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

rmcnew/Wiki_Edit_MapReduce
Simple MapReduce code for use with Hadoop, written for a graduate-school Distributed Systems class
Language: Java - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

dishaumarwani/Parking_Data_Analysis
Parking Data Analysis in Hadoop MapReduce Framework
Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

amcquade/TweetleStreams
Map and Reduce algorithm on a pool of tweets.
Language: C++ - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

imdeepanshugpt/Hadoop
Hadoop-Cluster
Language: Python - Size: 887 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

E7su/rhyhorn
Easy examples with Hadoop's Java API and Hadoop Streaming
Language: Java - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

avikdatta/hadoop_streaming_python_script
A repository for Hadoop Streaming Python scripts
Language: Shell - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

penguin138/hadoop-tasks
Hadoop tasks repository for Parallel and Distributed Computing course at MIPT 2015
Language: Java - Size: 237 KB - Last synced at: 5 months ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0
