Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: big-data-analytics
nickenshidqia/Big_Data_Analytics_Kimia_Farma
A Big Data Analytics project that sets the challenge of creating a data mart design and dashboard for Kimia Farma
Size: 5.52 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
Fili-ai/knn_cuda
KNN written in CUDA without any external libraries such as cuBLAS
Language: Cuda - Size: 3.53 MB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
KOHENOORKEN/KGS-Global
All sub projects of KGS Global will be kept here
Language: Solidity - Size: 5.69 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
Laetitia-Deken/Chicago_Taxi_Trips
Exploration of Chicago Taxi Trips - BigQuery Data with Python (January 2013 - October 2023)
Language: Jupyter Notebook - Size: 1.58 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
marcocolangelo/Big-Data-processing-and-Analytics
The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark
Language: Java - Size: 6.1 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
arxiver/Airbnb-EDA-and-Regression
Big data exploration and analysis of an Airbnb dataset, plus a regression model for price prediction of entities
Language: Jupyter Notebook - Size: 3.11 MB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1
IsaacMwendwa/Big-Data-with-PySpark
This repository contains the materials (code & theory) I compiled while undertaking DataCamp's Big Data with PySpark Learning Track
Size: 147 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
aeronaut2001/Car-Insurance-Cold-Calls-Data-Analysis
Car Insurance Cold Calls Data Analysis using Apache Hive
Language: HiveQL - Size: 1.17 MB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
aeronaut2001/Telecom-Data-Analysis
Telecom Data Analysis with Apache Hive
Language: HiveQL - Size: 345 KB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.3 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 289 - Forks: 91
tekdogan/iccbdc-21
Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"
Language: Python - Size: 111 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
harshh351998/Market-Basket-Items-Recommendation
This project provides the retailer with information to understand a buyer's purchase behaviour and recommends products to users based on their purchase history.
Language: Jupyter Notebook - Size: 1.11 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
anshul1004/MutualFriends
Implementation of Hadoop and Spark
Language: Java - Size: 23 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0
XuanyouLiu/US-Real-Estate-Analysis
US Real Estate Rental Price Analysis
Language: Jupyter Notebook - Size: 23 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 10 - Forks: 1
waseemsalami/project-Big-Data-in-behavioral-science-
An exciting Big Data project done during a course I took at the Technion university
Language: HTML - Size: 31.8 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
125ryun/Espresso
Sogang University, 2023 second semester, course 'Understanding Big Data and Its Educational Applications (Capstone Design)', team 'Espresso'
Language: Python - Size: 7.32 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
abroniewski/IdleCompute-Data-Management-Architecture
Implementation of a big data management and analysis backbone architecture using PySpark for distributed and scalable data ingestion and MLlib for machine learning analysis. Part of Big Data Management and Analytics (BDMA) program.
Language: Jupyter Notebook - Size: 34.8 MB - Last synced: 23 days ago - Pushed: 6 months ago - Stars: 1 - Forks: 1
sparkerhoney/BDC-KR
Repo for Big Data Certification KR (Korean Big Data Analysis Engineer certification exam)
Language: Python - Size: 15.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
SaiprakashShetty/Big-Data-Airline-Delay-Prediction
Predicting US airline delays using Spark (PySpark) and Apache Arrow. The objective of this project is to analyze historical flight data to gain valuable insights and build a predictive model that predicts whether a flight will be delayed for a given set of flight characteristics.
Language: Jupyter Notebook - Size: 68.9 MB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2
mdafer/Machine-Learning-For-Big-Data-Project
Language: Python - Size: 5.84 MB - Last synced: 6 months ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1
mdafer/Machine-Learning-For-Big-Data-Assignment-1
Language: Python - Size: 1.26 MB - Last synced: 6 months ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0
Rifat392000/BigDataAnalytics
Language: Jupyter Notebook - Size: 18.6 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
gangodu/cloud
AWS Cloudera Hadoop setup with H2O, Spark, MR
Language: Java - Size: 49.1 MB - Last synced: 7 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0
ssiarhei115/Customer-Classification
Developing an ML model that predicts a bank customer's inclination to open a deposit
Language: Jupyter Notebook - Size: 0 Bytes - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
adriana-takahagui/mba-big-data
Final project for the "Big Data" course of the MBA in Data Science
Language: Jupyter Notebook - Size: 1.18 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
pdoup/avoulos
Big Data Analytics Project - Fall '21
Language: Scala - Size: 3.84 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
kaushik03/Modern-Big-Data-Analysis-using-SQL
RDBMS techniques for Big Data analysis
Size: 1.57 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 7 - Forks: 1
Akande-hub/Python--codes
Some of my programming experiences using Python
Language: Jupyter Notebook - Size: 889 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
ronellsalunke/Titanic-BigData
Java Hadoop MapReduce code for my Big Data Analytics Project using the Titanic dataset
Language: Java - Size: 41 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
karan-owalekar/Movie-Recommendation-System
Language: Jupyter Notebook - Size: 61.2 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
askmrsinh/kibitzer
Media Recommendations Using Big Data Analytics.
Language: Scala - Size: 35.3 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
ManinderpreetPuri/Big-Data-Manipulation-On-Cloud
I used big data tools (Hive, Spark RDDs, and Spark SQL) to solve challenging big data processing tasks with highly efficient solutions. The work covers four different types of real data: standard multi-attribute data (video game sales data), time series data (Twitter feed), bag-of-words data, and a news aggregation corpus.
Language: Scala - Size: 500 KB - Last synced: 8 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
tirthmehta/Big-Data-Analysis-with-Apache-Hadoop-Pig-Latin
Big data analysis of datasets with respect to character occurrences.
Language: PigLatin - Size: 1000 Bytes - Last synced: 8 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0
Dammonoit/Student-performance-analysis-using-Big-data
This project analyses and correlates student performance with different attributes. Finally, it determines the most suitable algorithm from a set of candidates.
Language: Python - Size: 1.48 MB - Last synced: 8 months ago - Pushed: over 6 years ago - Stars: 12 - Forks: 11
hello-albesta/Python-BDAPyspark-UniversityDataAnalysisSystem
This repository houses my project for a university data analysis system that utilizes PySpark.
Language: Jupyter Notebook - Size: 4.16 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
okazaki0/PARALLEL-COMPUTING
Big data Algorithm
Language: CSS - Size: 13.6 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
noobpk/gemini-bigdata
Gemini-Big Data (G-BD)
Language: CSS - Size: 2.78 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
amitkedia007/Analysis-of-AirBnB-data-Hadoop-Mapreduce
This repo explains the implementation of the MapReduce algorithm on Airbnb data to understand consumer satisfaction by region and country. It demonstrates the effective use of parallel distributed computing to solve big data problems.
Language: Java - Size: 1.8 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
nikpapage23/Big-Data-Analytics-project
Using Python and Apache Spark framework to run queries on a large MovieLens dataset.
Language: Jupyter Notebook - Size: 462 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
SwapnilNair/McFlAi-OTPMS
An on-time performance management system for airlines using Spark and Kafka streaming
Language: Python - Size: 15.6 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
ferkuellar/practica_advanced_sql
The main objective of the project is to develop a robust and efficient data model for analyzing and understanding customer interactions through the IVR (Interactive Voice Response) system.
Size: 5.13 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
Mikel-UA/BigData_Analysis_Beer_Dataset
Cleaning, exploratory analysis and drawing conclusions from data from: https://www.kaggle.com/rdoume/beerreviews
Size: 6.84 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
tatsuiman/rpot2
Real-time Packet Observation Tool
Language: Bro - Size: 145 MB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 40 - Forks: 6
AthinaKyriakou/mrbox
An open source experimental application aiming to simplify working with remote heterogeneous analytics and storage services via the file system of the Linux operating system.
Language: Python - Size: 219 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1
rbalbinotti/Prevendo_Cons_Energia_Carros
Course - Big Data Analytics with R and Microsoft Azure Machine Learning - Final Project
Language: R - Size: 2.54 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
h-fuzzy-logic/technical-writing
Technical writing samples. Includes walkthroughs and tutorials around data engineering and cloud architectures.
Size: 5.86 KB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
Heisenberg0203/Apache_Spark
Apache Spark projects: from beginner to advanced level
Language: Java - Size: 64.5 KB - Last synced: 9 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0
Syed-Bakhtawar-Fahim/DataVisualization
Data Visualization with Python
Language: Jupyter Notebook - Size: 2.39 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
leorrose/BGU-Big-Data-Course
Ben Gurion University "The Art of Analyzing Big Data - The Data Scientist’s Toolbox (372.2.5401)" course assignments & solutions
Language: Jupyter Notebook - Size: 23.3 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
Gugo-le/student-performance-predict
Big data was learned using tensorflow.
Language: Jupyter Notebook - Size: 1.31 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0
PeterSchuld/Sparkify
Capstone Project in the Udacity Data Scientist Nanodegree program. We manipulate large and realistic datasets with Spark to engineer relevant features for predicting churn. We'll learn how to use Spark MLlib to build machine learning models with large datasets, far beyond what could be done with non-distributed technologies like scikit-learn.
Language: HTML - Size: 2.44 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0
vishu-tyagi/BigQuery-ELT
BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP
Language: Python - Size: 1.19 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
Snigda0402/Education-trends-on-Twitter
Size: 7.5 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
pattyjula/pandas_lambda
Apply lambda function to Pandas value_counts
Language: Python - Size: 1000 Bytes - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
anshhagrawal/BigData-Analysis
In this Jupyter notebook, fictional data on football players is used to perform big data analytics in Python. It uses libraries such as pandas and matplotlib.
Language: Jupyter Notebook - Size: 3.03 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
mrjakhi/MADS-milestone-2-SIADS-696-EHRAnalysis-ICD_Code_Prediction
Milestone 2 project - Electronic Health Record Analysis and ICD Code Prediction
Language: Jupyter Notebook - Size: 134 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1
garynth41/-Java-Programming-and-Software-Engineering-Fundamentals-Specialization
About this Specialization: Take your first step towards a career in software development with this introduction to Java—one of the most in-demand programming languages and the foundation of the Android operating system. Designed for beginners, this Specialization will teach you core programming concepts and equip you to write programs to solve complex problems. In addition, you will gain the foundational skills a software engineer needs to solve real-world problems, from designing algorithms to testing and debugging your programs.
Language: JavaScript - Size: 375 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
zulfiqarAlibalti/PyTorch
This repo contains PyTorch projects from basic to advanced
Language: Jupyter Notebook - Size: 8.79 KB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
galib360/BigData_Project
Language: Jupyter Notebook - Size: 3.89 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
furqan-software-engineer/Spark-BigData-Twitter-Sentiment-Analyzer
Twitter tweet stream sentiment analyser using Apache Spark: Spark Streaming, Spark SQL, and Stanford NLP (Natural Language Processing)
Language: XSLT - Size: 733 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
DeepthiSudharsan/Big-Data-Analytics-Assignment
Language: Scala - Size: 15.9 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
DeepthiSudharsan/Analyzing-Marketing-Customer-Values-using-Spark
(Semester 4) Big Data Analytics - End Semester Project
Language: Scala - Size: 2.38 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 1
bumbitzu/Big_Integers_Class
A big-integer class for cases where a standard integer data type is insufficient, such as when working with very large prime numbers or performing other types of mathematical operations that involve large numbers.
Language: C++ - Size: 26.4 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
sagar8080/Data-Engineering
A comprehensive guide to learn Data-Engineering from scratch.
Size: 124 KB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
msafiullah/excel_to_parquet
Convert Excel to Parquet for quick loading into a Hive table.
Language: Python - Size: 10.7 KB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 1
Pedro-Hdez/BigDataPipeline
The goal of this project is to create a pipeline using optimized, free tools for collecting, processing, storing, and analyzing large volumes of data in real time
Language: Python - Size: 17.2 MB - Last synced: 10 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 1
PanPapag/Topic-Identification
📁 Multi-label classification of printed media articles to topics
Language: Python - Size: 52.6 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
lstasiak/Big-Data-Algorithms-exercises
Set of tasks solved in Big Data Algorithms course
Language: Scala - Size: 3.06 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
lstasiak/Bloom-Filter
Implementation of simple Bloom Filter
Language: Scala - Size: 1.95 KB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
akshaytambe/Big-Data-Scripts
Python Scripts for working with Big Data Files
Language: Python - Size: 193 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1
geoanalytics-ca/documentation
GEOAnalytics Canada Documentation and Tutorials
Size: 357 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
sahith/Link-Prediction-for-Citation-Networks-using-Apache-Spark
Link prediction is about predicting future connections in a graph. In this project, it means predicting whether two authors will collaborate on a future paper, given the graph of authors who have collaborated on at least one paper together.
Language: Scala - Size: 6.41 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 5 - Forks: 1
shinde-chandrakant/BigData-Ops-on-TLC-Yellow-Taxi
Analysed New York City's Yellow taxi data set with Big Data tools such as Hadoop, HBase, Sqoop, MapReduce and AWS Cloud Infrastructure.
Language: Python - Size: 7.19 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
Balazs-Nagy/elte-ai-ml
Collection of submissions prepared for the Mathematics Expert in Data Analytics and Machine Learning postgraduate specialization program of the Institute of Mathematics of Eötvös Loránd University in 2021/22.
Language: Jupyter Notebook - Size: 12.4 MB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1
pmihsan/Game-Review-Analysis
Sentiment Analysis and Topic Modeling on the Steam Game Reviews using Hadoop and Mahout
Language: Python - Size: 88.7 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
Sourabh-Marne/PySpark-Project
PySpark in Big Data Processing including Lambda Functions, filter, map and reduce functions.
Language: Python - Size: 74.2 KB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
sadnanMohosin/Covid-19-Predictive-analysis-of-Severity-Illness
Language: Jupyter Notebook - Size: 1.67 MB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
claudianopl/heart-disease-data-analysis
Repository created to version the content of the practical activities for the course Projeto Interdisciplinar para Sistemas de Informação III (PISI III, Interdisciplinary Project for Information Systems III), offered by the Bachelor's program in Information Systems at UFRPE.
Language: Python - Size: 71.6 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 2 - Forks: 1
neoreuvenla/msc-comp-sci
A repository to hold lecture and activity notes from the University of York MSc Computer Science course
Size: 284 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1
sirmammingtonham/vector-borne-disease-analytics
Dataset and Code for 2021 IEEE International Conference on Big Data Paper - Scraping Unstructured Data to Explore the Relationship between Rainfall Anomalies and Vector-Borne Disease Outbreaks
Language: Jupyter Notebook - Size: 5.47 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
Robertfarry157/Roberts_projects
This is my repository where I store all of the coding and data analysis that I do for fun
Size: 1000 Bytes - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
sirmammingtonham/promed_scraper
Code for 2021 IEEE International Conference on Big Data Paper - Scraping Unstructured Data to Explore the Relationship between Rainfall Anomalies and Vector-Borne Disease Outbreaks
Language: Jupyter Notebook - Size: 5.56 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
Jimmy-Sudoku/Dashboard-Google-Trend-in-Canada-April-2023
2023 April Dashboard for Canada content ideas using google trends
Size: 7.81 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
darrenxx3/big-data-analytics-4th-semester-final-exam
My final exam, continuing the scenario of a data scientist working at a retail business called "KimochiMart", implemented with Big Data Analytics.
Language: SAS - Size: 4.96 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
darrenxx3/big-data-analytics-4th-semester-mid-exam
My mid exam, based on the scenario of a data scientist working at a retail business called "KimochiMart", implemented with Big Data Analytics.
Language: SAS - Size: 2.16 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
klugem/watchdog
Workflow management system for the automated and distributed analysis of large-scale experimental data.
Language: Java - Size: 193 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 12 - Forks: 4
SinghHarshita/Clustering-Algorithms-Spark
KMeans, CURE, and Canopy algorithms are demonstrated using PySpark.
Language: Jupyter Notebook - Size: 150 KB - Last synced: 4 months ago - Pushed: about 3 years ago - Stars: 5 - Forks: 0
seeratawan01/autocapture.js
Build your own analytics - A single library that grabs every click, touch, page view, and fill, forever.
Language: TypeScript - Size: 554 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1
FTiniNadhirah/Coursera-and-EdX-courses-answers
Materials from courses taken on Coursera. All the answers given were written by me.
Language: HTML - Size: 476 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 74 - Forks: 40
BhagiaSheri/apache-spark-SQL
Big Data Pipeline | Querying Data from Hive Table Phase
Language: Java - Size: 262 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0
Amir79Naziri/TwitterSentimentAnalysisWithSpark_Project
A sentiment analyzer using Spark ML library for Twitter Dataset
Language: Jupyter Notebook - Size: 13.7 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0
oprecomp/oprecomp
The Horizon 2020 Open Transprecision Computing project
Size: 16.6 KB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 4
ajyanand/ProjectReports
Contains Reports of my projects
Language: Jupyter Notebook - Size: 65.9 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
matteocereda/GSECA
Gene Set Enrichment Class Analysis for heterogeneous RNA sequencing data
Language: R - Size: 56.4 MB - Last synced: 6 minutes ago - Pushed: almost 4 years ago - Stars: 5 - Forks: 1
SaurabhKoli74/Hadoop
Contains step-by-step explanations of some Big Data Analytics experiments.
Size: 193 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
tabletop-labs/tabletop
A curated selection of tools, libraries and services that help tame your dataflow to productively build ambitious, data driven & reactive applications on a streaming lakehouse
Language: Go - Size: 290 KB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0
PrachetShah/ethAnalytics
Real-Time Eth Transactions Analysis using Big Data Techniques
Language: HTML - Size: 1.58 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
big-data-lab-team/accident-prediction-montreal
Language: Jupyter Notebook - Size: 65 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 9 - Forks: 7
srinathsai/Google-pagerank-algorithm-on-Wikipedia
A memory-efficient algorithm for determining which pages should be given importance in recommendations
Language: Jupyter Notebook - Size: 40.6 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
DavidMouse1118/ECE454-Projects
Distributed Computing
Language: Java - Size: 264 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0