Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: big-data-analytics

nickenshidqia/Big_Data_Analytics_Kimia_Farma

A Big Data Analytics project that poses the challenge of creating a data mart design and dashboard for Kimia Farma

Size: 5.52 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

Fili-ai/knn_cuda

KNN written in CUDA without any external library such as cuBLAS

Language: Cuda - Size: 3.53 MB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

KOHENOORKEN/KGS-Global

All sub projects of KGS Global will be kept here

Language: Solidity - Size: 5.69 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

Laetitia-Deken/Chicago_Taxi_Trips

Exploration of Chicago Taxi Trips - BigQuery Data with Python (January 2013 - October 2023)

Language: Jupyter Notebook - Size: 1.58 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

marcocolangelo/Big-Data-processing-and-Analytics

The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark

Language: Java - Size: 6.1 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

arxiver/Airbnb-EDA-and-Regression

Big data exploration and analysis on Airbnb dataset as well as regression model for price prediction of entities

Language: Jupyter Notebook - Size: 3.11 MB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1

IsaacMwendwa/Big-Data-with-PySpark

This repository contains the materials (code & theory) I compiled while undertaking DataCamp's Big Data with PySpark Learning Track

Size: 147 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

aeronaut2001/Car-Insurance-Cold-Calls-Data-Analysis

Car Insurance Cold Calls Data Analysis using Apache Hive

Language: HiveQL - Size: 1.17 MB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

aeronaut2001/Telecom-Data-Analysis

Telecom Data Analysis with Apache Hive

Language: HiveQL - Size: 345 KB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

Language: Python - Size: 12.3 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 289 - Forks: 91

tekdogan/iccbdc-21

Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"

Language: Python - Size: 111 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

harshh351998/Market-Basket-Items-Recommendation

This project provides the retailer with information to understand a buyer's purchase behaviour and recommends products to users based on their purchase history.

Language: Jupyter Notebook - Size: 1.11 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

anshul1004/MutualFriends

Implementation of Hadoop and Spark

Language: Java - Size: 23 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0

XuanyouLiu/US-Real-Estate-Analysis

US Real Estate Rental Price Analysis

Language: Jupyter Notebook - Size: 23 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 10 - Forks: 1

waseemsalami/project-Big-Data-in-behavioral-science-

An exciting Big Data project done during a course I took at the Technion university

Language: HTML - Size: 31.8 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

125ryun/Espresso

Sogang University, 2023 second semester: team 'Espresso' for the course 'Understanding Big Data and Its Educational Applications (Capstone Design)'

Language: Python - Size: 7.32 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

abroniewski/IdleCompute-Data-Management-Architecture

Implementation of a big data management and analysis backbone architecture using PySpark for distributed and scalable data ingestion and MLlib for machine learning analysis. Part of Big Data Management and Analytics (BDMA) program.

Language: Jupyter Notebook - Size: 34.8 MB - Last synced: 23 days ago - Pushed: 6 months ago - Stars: 1 - Forks: 1

sparkerhoney/BDC-KR

Repo. Big Data Certification KR (Big Data Analysis Engineer certification exam)

Language: Python - Size: 15.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

SaiprakashShetty/Big-Data-Airline-Delay-Prediction

Predicting US airline delays using Spark (PySpark) and Apache Arrow. The objective of this project is to analyze historical flight data to gain valuable insights and build a model that predicts whether a flight will be delayed, given a set of flight characteristics.

Language: Jupyter Notebook - Size: 68.9 MB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2

mdafer/Machine-Learning-For-Big-Data-Project

Language: Python - Size: 5.84 MB - Last synced: 6 months ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1

mdafer/Machine-Learning-For-Big-Data-Assignment-1

Language: Python - Size: 1.26 MB - Last synced: 6 months ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0

Rifat392000/BigDataAnalytics

Language: Jupyter Notebook - Size: 18.6 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

gangodu/cloud

AWS Cloudera Hadoop setup with H2O, Spark, MR

Language: Java - Size: 49.1 MB - Last synced: 7 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0

ssiarhei115/Customer-Classification

Developing an ML model that predicts a bank customer's inclination to open a deposit

Language: Jupyter Notebook - Size: 0 Bytes - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

adriana-takahagui/mba-big-data

Final project for the "Big Data" course of the MBA in Data Science

Language: Jupyter Notebook - Size: 1.18 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

pdoup/avoulos

Big Data Analytics Project - Fall '21

Language: Scala - Size: 3.84 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

kaushik03/Modern-Big-Data-Analysis-using-SQL

RDBMS techniques for Big Data analysis

Size: 1.57 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 7 - Forks: 1

Akande-hub/Python--codes

Some of my programming experiences using Python

Language: Jupyter Notebook - Size: 889 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

ronellsalunke/Titanic-BigData

Java Hadoop MapReduce code for my Big Data Analytics Project using the Titanic dataset

Language: Java - Size: 41 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

karan-owalekar/Movie-Recommendation-System

Language: Jupyter Notebook - Size: 61.2 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

askmrsinh/kibitzer

Media Recommendations Using Big Data Analytics.

Language: Scala - Size: 35.3 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

ManinderpreetPuri/Big-Data-Manipulation-On-Cloud

I used big data tools (Hive, Spark RDDs, and Spark SQL) to solve challenging big data processing tasks with highly efficient solutions. The work covers four different types of real data: standard multi-attribute data (video game sales data), time series data (Twitter feed), bag-of-words data, and a news aggregation corpus.

Language: Scala - Size: 500 KB - Last synced: 8 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

tirthmehta/Big-Data-Analysis-with-Apache-Hadoop-Pig-Latin

Big Data analysis of datasets with respect to character occurrences.

Language: PigLatin - Size: 1000 Bytes - Last synced: 8 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

Dammonoit/Student-performance-analysis-using-Big-data

This project analyses student performance and correlates it with different attributes. Finally, it determines the most suitable algorithm among several candidates.

Language: Python - Size: 1.48 MB - Last synced: 8 months ago - Pushed: over 6 years ago - Stars: 12 - Forks: 11

hello-albesta/Python-BDAPyspark-UniversityDataAnalysisSystem

This repository houses my project for a university data analysis system that utilizes PySpark.

Language: Jupyter Notebook - Size: 4.16 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

okazaki0/PARALLEL-COMPUTING

Big data Algorithm

Language: CSS - Size: 13.6 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

noobpk/gemini-bigdata

Gemini-Big Data (G-BD)

Language: CSS - Size: 2.78 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

amitkedia007/Analysis-of-AirBnB-data-Hadoop-Mapreduce

This repo explains an implementation of the MapReduce algorithm on Airbnb data to understand consumer satisfaction by region and country. It demonstrates the effective use of parallel distributed computing to solve big data problems.

Language: Java - Size: 1.8 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

nikpapage23/Big-Data-Analytics-project

Using Python and Apache Spark framework to run queries on a large MovieLens dataset.

Language: Jupyter Notebook - Size: 462 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

SwapnilNair/McFlAi-OTPMS

An on-time performance management system for airlines using Spark and Kafka streaming

Language: Python - Size: 15.6 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

ferkuellar/practica_advanced_sql

The main objective of the project is to develop a robust and efficient data model for analyzing and understanding customer interactions through the IVR (Interactive Voice Response) system.

Size: 5.13 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Mikel-UA/BigData_Analysis_Beer_Dataset

Cleaning, exploratory analysis and drawing conclusions from data from: https://www.kaggle.com/rdoume/beerreviews

Size: 6.84 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

tatsuiman/rpot2

Real-time Packet Observation Tool

Language: Bro - Size: 145 MB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 40 - Forks: 6

AthinaKyriakou/mrbox

An open source experimental application aiming to simplify working with remote heterogeneous analytics and storage services via the file system of the Linux operating system.

Language: Python - Size: 219 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1

rbalbinotti/Prevendo_Cons_Energia_Carros

Course - Big Data Analytics with R and Microsoft Azure Machine Learning - Final Project

Language: R - Size: 2.54 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

h-fuzzy-logic/technical-writing

Technical writing samples. Includes walkthroughs and tutorials around data engineering and cloud architectures.

Size: 5.86 KB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

Heisenberg0203/Apache_Spark

Apache Spark projects: from beginner to advanced level

Language: Java - Size: 64.5 KB - Last synced: 9 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

Syed-Bakhtawar-Fahim/DataVisualization

Data Visualization with Python

Language: Jupyter Notebook - Size: 2.39 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

leorrose/BGU-Big-Data-Course

Ben Gurion University "The Art of Analyzing Big Data - The Data Scientist’s Toolbox (372.2.5401)" course assignments & solutions

Language: Jupyter Notebook - Size: 23.3 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

Gugo-le/student-performance-predict

Big data was learned using TensorFlow.

Language: Jupyter Notebook - Size: 1.31 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0

PeterSchuld/Sparkify

Capstone Project in the Udacity Data Scientist Nanodegree program. We manipulate large and realistic datasets with Spark to engineer relevant features for predicting churn. We'll learn how to use Spark MLlib to build machine learning models with large datasets, far beyond what could be done with non-distributed technologies like scikit-learn.

Language: HTML - Size: 2.44 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

vishu-tyagi/BigQuery-ELT

BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP

Language: Python - Size: 1.19 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Snigda0402/Education-trends-on-Twitter

Size: 7.5 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

pattyjula/pandas_lambda

Apply lambda function to Pandas value_counts

Language: Python - Size: 1000 Bytes - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

anshhagrawal/BigData-Analysis

In this Jupyter notebook, fictional data on football players is used to perform big data analytics in Python. It uses libraries such as pandas and matplotlib.

Language: Jupyter Notebook - Size: 3.03 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

mrjakhi/MADS-milestone-2-SIADS-696-EHRAnalysis-ICD_Code_Prediction

Milestone 2 project - Electronic Health Record Analysis and ICD Code Prediction

Language: Jupyter Notebook - Size: 134 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

garynth41/-Java-Programming-and-Software-Engineering-Fundamentals-Specialization

About this Specialization: Take your first step towards a career in software development with this introduction to Java—one of the most in-demand programming languages and the foundation of the Android operating system. Designed for beginners, this Specialization will teach you core programming concepts and equip you to write programs to solve complex problems. In addition, you will gain the foundational skills a software engineer needs to solve real-world problems, from designing algorithms to testing and debugging your programs.

Language: JavaScript - Size: 375 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

zulfiqarAlibalti/PyTorch

This repo contains PyTorch projects from basic to advanced

Language: Jupyter Notebook - Size: 8.79 KB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

galib360/BigData_Project

Language: Jupyter Notebook - Size: 3.89 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

furqan-software-engineer/Spark-BigData-Twitter-Sentiment-Analyzer

Twitter tweet stream sentiment analyser using Apache Spark - Spark Stream, Spark SQL, Stanford NLP (Natural Language Processing)

Language: XSLT - Size: 733 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

DeepthiSudharsan/Big-Data-Analytics-Assignment

Language: Scala - Size: 15.9 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

DeepthiSudharsan/Analyzing-Marketing-Customer-Values-using-Spark

(Semester 4) Big Data Analytics - End Semester Project

Language: Scala - Size: 2.38 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 1

bumbitzu/Big_Integers_Class

A big integer class for cases that exceed a standard integer data type, such as when working with very large prime numbers or performing other types of mathematical operations that involve large numbers.

Language: C++ - Size: 26.4 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

sagar8080/Data-Engineering

A comprehensive guide to learn Data-Engineering from scratch.

Size: 124 KB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

msafiullah/excel_to_parquet

Convert excel to parquet for quick loading into Hive table.

Language: Python - Size: 10.7 KB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 1

Pedro-Hdez/BigDataPipeline

The objective of this project is to create a pipeline using optimized, free tools for the collection, processing, storage, and analysis of large volumes of data in real time

Language: Python - Size: 17.2 MB - Last synced: 10 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 1

PanPapag/Topic-Identification

📁 Multi-label classification of printed media articles to topics

Language: Python - Size: 52.6 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

lstasiak/Big-Data-Algorithms-exercises

Set of tasks solved in Big Data Algorithms course

Language: Scala - Size: 3.06 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

lstasiak/Bloom-Filter

Implementation of simple Bloom Filter

Language: Scala - Size: 1.95 KB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

akshaytambe/Big-Data-Scripts

Python Scripts for working with Big Data Files

Language: Python - Size: 193 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1

geoanalytics-ca/documentation

GEOAnalytics Canada Documentation and Tutorials

Size: 357 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

sahith/Link-Prediction-for-Citation-Networks-using-Apache-Spark

Link prediction is about predicting future connections in a graph. In this project, it means predicting whether two authors will collaborate on a future paper, given the graph of authors who have collaborated on at least one paper together.

Language: Scala - Size: 6.41 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 5 - Forks: 1

shinde-chandrakant/BigData-Ops-on-TLC-Yellow-Taxi

Analysed New York City's Yellow taxi data set with Big Data tools such as Hadoop, HBase, Sqoop, MapReduce and AWS Cloud Infrastructure.

Language: Python - Size: 7.19 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

Balazs-Nagy/elte-ai-ml

Collection of submissions prepared for the Mathematics Expert in Data Analytics and Machine Learning postgraduate specialization program of the Institute of Mathematics of Eötvös Loránd University in 2021/22.

Language: Jupyter Notebook - Size: 12.4 MB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1

pmihsan/Game-Review-Analysis

Sentiment Analysis and Topic Modeling on the Steam Game Reviews using Hadoop and Mahout

Language: Python - Size: 88.7 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Sourabh-Marne/PySpark-Project

PySpark in Big Data Processing including Lambda Functions, filter, map and reduce functions.

Language: Python - Size: 74.2 KB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

sadnanMohosin/Covid-19-Predictive-analysis-of-Severity-Illness

Language: Jupyter Notebook - Size: 1.67 MB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

claudianopl/heart-disease-data-analysis

Repository created to version the content of the practical activities of the course Interdisciplinary Project for Information Systems III (PISI III), offered by the Bachelor's program in Information Systems at UFRPE.

Language: Python - Size: 71.6 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 2 - Forks: 1

neoreuvenla/msc-comp-sci

A repository to hold lecture and activity notes from the University of York MSc Computer Science course

Size: 284 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1

sirmammingtonham/vector-borne-disease-analytics

Dataset and Code for 2021 IEEE International Conference on Big Data Paper - Scraping Unstructured Data to Explore the Relationship between Rainfall Anomalies and Vector-Borne Disease Outbreaks

Language: Jupyter Notebook - Size: 5.47 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

Robertfarry157/Roberts_projects

This is my repository where I store all of the coding and data analysis that I do for fun

Size: 1000 Bytes - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

sirmammingtonham/promed_scraper

Code for 2021 IEEE International Conference on Big Data Paper - Scraping Unstructured Data to Explore the Relationship between Rainfall Anomalies and Vector-Borne Disease Outbreaks

Language: Jupyter Notebook - Size: 5.56 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

Jimmy-Sudoku/Dashboard-Google-Trend-in-Canada-April-2023

April 2023 dashboard of content ideas for Canada using Google Trends

Size: 7.81 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

darrenxx3/big-data-analytics-4th-semester-final-exam

My final exam, which continues the scenario of a data scientist working at a retail business called "KimochiMart", implemented with Big Data Analytics.

Language: SAS - Size: 4.96 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

darrenxx3/big-data-analytics-4th-semester-mid-exam

My midterm exam, in which I act as a data scientist working at a retail business called "KimochiMart", implemented with Big Data Analytics.

Language: SAS - Size: 2.16 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

klugem/watchdog

Workflow management system for the automated and distributed analysis of large-scale experimental data.

Language: Java - Size: 193 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 12 - Forks: 4

SinghHarshita/Clustering-Algorithms-Spark

KMeans, CURE and Canopy algorithms are demonstrated using PySpark.

Language: Jupyter Notebook - Size: 150 KB - Last synced: 4 months ago - Pushed: about 3 years ago - Stars: 5 - Forks: 0

seeratawan01/autocapture.js

Build your own analytics - A single library that grabs every click, touch, page-view, and fill, forever.

Language: TypeScript - Size: 554 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1

FTiniNadhirah/Coursera-and-EdX-courses-answers

This covers courses taken on Coursera. All the answers given were written by me.

Language: HTML - Size: 476 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 74 - Forks: 40

BhagiaSheri/apache-spark-SQL

Big Data Pipeline | Querying Data from Hive Table Phase

Language: Java - Size: 262 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

Amir79Naziri/TwitterSentimentAnalysisWithSpark_Project

A sentiment analyzer using Spark ML library for Twitter Dataset

Language: Jupyter Notebook - Size: 13.7 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

oprecomp/oprecomp

The Horizon 2020 Open Transprecision Computing project

Size: 16.6 KB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 4

ajyanand/ProjectReports

Contains Reports of my projects

Language: Jupyter Notebook - Size: 65.9 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

matteocereda/GSECA

Gene Set Enrichment Class Analysis for heterogeneous RNA sequencing data

Language: R - Size: 56.4 MB - Last synced: 6 minutes ago - Pushed: almost 4 years ago - Stars: 5 - Forks: 1

SaurabhKoli74/Hadoop

It contains step-by-step explanations of some Big Data Analytics experiments.

Size: 193 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

tabletop-labs/tabletop

A curated selection of tools, libraries and services that help tame your dataflow to productively build ambitious, data driven & reactive applications on a streaming lakehouse

Language: Go - Size: 290 KB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0

PrachetShah/ethAnalytics

Real-Time Eth Transactions Analysis using Big Data Techniques

Language: HTML - Size: 1.58 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

big-data-lab-team/accident-prediction-montreal

Language: Jupyter Notebook - Size: 65 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 9 - Forks: 7

srinathsai/Google-pagerank-algorithm-on-Wikipedia

A memory-efficient algorithm for determining which pages should be given importance in recommendations

Language: Jupyter Notebook - Size: 40.6 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

DavidMouse1118/ECE454-Projects

Distributed Computing

Language: Java - Size: 264 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0