Topic: "spark-ml"
wzhe06/SparkCTR
CTR prediction models based on Spark (LR, GBDT, DNN)
Language: Scala - Size: 35 MB - Last synced at: 13 days ago - Pushed at: about 5 years ago - Stars: 912 - Forks: 260

qubole/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
Language: Scala - Size: 175 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 574 - Forks: 141

lifeomic/sparkflow
Easy-to-use library to bring TensorFlow to Apache Spark
Language: Python - Size: 8.79 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 296 - Forks: 45

locationtech/rasterframes
Geospatial Raster support for Spark DataFrames
Language: Jupyter Notebook - Size: 102 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 251 - Forks: 45

titicaca/spark-iforest
Isolation Forest on Spark
Language: Scala - Size: 74.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 221 - Forks: 91

CognonicLabs/awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Size: 326 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 129 - Forks: 45

databrickslabs/geoscan
Geospatial clustering at massive scale
Language: Scala - Size: 2.44 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 99 - Forks: 19

titicaca/spark-gbtlr
Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Language: Scala - Size: 520 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 89 - Forks: 27

xiaogp/customer_churn_prediction
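Retail e-commerce customer churn model: implements LR, FM, GBDT, and RF with tensorflow, xgboost4j-spark, and spark-ml, compares model performance, and summarizes offline/online deployment approaches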
Language: Python - Size: 205 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 58 - Forks: 17

dimajix/spark-training
Repository used for Spark Trainings
Language: Jupyter Notebook - Size: 9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 66

typesafehub/fdp-modelserver
An umbrella project for multiple implementations of model serving
Language: Scala - Size: 163 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 46 - Forks: 20

aipredict/ai-models-serialization
Summary of AI model serialization
Size: 111 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 45 - Forks: 7

jubins/Spark-And-MLlib-Projects
This repository contains Spark, MLlib, PySpark and DataFrames projects
Language: Jupyter Notebook - Size: 101 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 39 - Forks: 97

feng-li/dlsa
Distributed least squares approximation (dlsa) implemented with Apache Spark
Language: Python - Size: 276 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 46

isarn/isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Language: Scala - Size: 1.33 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 29 - Forks: 12

roshankoirala/pySpark_tutorial
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
Language: Jupyter Notebook - Size: 202 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 26

uosdmlab/nsmc-zeppelin-notebook
Movie review dataset Word2Vec & sentiment classification Zeppelin notebook
Size: 107 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 26 - Forks: 6

lynnlangit/Spark-Scala-EKS
Spark Scala docker container sample for AWS testing - EKS & S3
Language: HCL - Size: 5.77 MB - Last synced at: about 24 hours ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 13

pierrenodet/spark-ensemble
Ensemble Learning for Apache Spark 🌲
Language: Scala - Size: 3.61 MB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 23 - Forks: 7

mahmoudparsian/machine-learning-course
Machine Learning Course @ Santa Clara University
Size: 194 MB - Last synced at: 18 days ago - Pushed at: almost 5 years ago - Stars: 23 - Forks: 16

autodeployai/pmml4s-spark
PMML scoring library for Spark as SparkML Transformer
Language: Scala - Size: 50.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 8

s22s/pre-lt-raster-frames 📦
Spark DataFrames for earth observation data
Language: Scala - Size: 16.9 MB - Last synced at: 2 days ago - Pushed at: almost 7 years ago - Stars: 19 - Forks: 5

dvgodoy/YelpDatasetChallenge
Restaurant recommendations and review text-based quality predictions
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 16 days ago - Pushed at: about 8 years ago - Stars: 19 - Forks: 12

FlorentF9/sparkml-som
✨ Spark ML implementation of SOM algorithm (Kohonen self-organizing map)
Language: Scala - Size: 29.3 KB - Last synced at: 26 days ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 6

JohnSnowLabs/spark-nlp-streamlit 📦
Spark NLP for Streamlit
Language: Python - Size: 8.7 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 12

akashsethi24/Machine-Learning
Examples of all machine learning algorithms in Apache Spark
Language: Scala - Size: 3.64 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 10

lp-dataninja/SparkML
Detailed notes and code to learn machine learning with Apache Spark.
Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 17

Rohini2505/Lending-Club-Loan-Analysis
Exploratory data analysis and ML model building using Apache Spark and PySpark
Language: HTML - Size: 6.26 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 10 - Forks: 12

QuanLab/node2vec-spark
Implementation of the node2vec algorithm (http://snap.stanford.edu/node2vec/) using Spark 2
Language: Scala - Size: 196 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 10 - Forks: 5

josemarialuna/ExternalValidity
This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.
Language: Scala - Size: 146 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 1

BlancRay/PUAdapter 📦
A PU-learning tool on spark
Language: Scala - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

datastaxdevs/workshop-introduction-to-machine-learning
Come ready to discover the goals and approaches of machine learning, and how to build effective algorithms and solutions!
Language: Jupyter Notebook - Size: 17.2 MB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 6

dvgodoy/DSR-Spark-AppliedML
DSR Class - Applied Machine Learning with Apache Spark
Language: Jupyter Notebook - Size: 26.2 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 3

DavideNardone/TwitterSentimentAnalysis
A Spark Streaming implementation for Online Twitter Sentiment Analysis.
Language: Python - Size: 1.78 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 3

maziyarpanahi/spark2-template
Intellij template to develop Apache Spark 2.x applications
Language: Scala - Size: 20.5 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 41

NashTech-Labs/Sparkathon
A library of Java and Scala examples for Spark 2.x
Language: Java - Size: 113 MB - Last synced at: 20 days ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 9

IBM/sms-spam-filter-using-hortonworks 📦
Build Spam Filter Model on HDP using Watson Studio Local
Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 4 months ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 10

lykmapipo/Python-Spark-Log-Analysis
Python scripts to process and analyze log files using PySpark.
Language: Python - Size: 131 KB - Last synced at: about 14 hours ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

dongkelun/spark-scala
Spark Scala code, including all code from the author's personal blog as well as everyday study and test code
Language: Scala - Size: 316 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 4

pierrenodet/spark-smile 📦
Integrating SMILE and Spark
Language: Scala - Size: 341 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

alivcor/SMORK
Implementation of SMOTE - Synthetic Minority Over-sampling Technique in SparkML / MLLib
Language: Scala - Size: 165 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

giovannigarifo/bigdata
Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark
Language: Java - Size: 69.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

zhangruipython/ETLPlatform
Multi-source, large-scale data extraction, transformation, and loading (ETL)
Language: Java - Size: 78.1 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MHassaanButt/Crime-Spark-ML
In this project I stream data and do crime classification using Spark. This dataset contains incidents derived from the SFPD Crime Incident Reporting system. The data ranges from 1/1/2003 to 5/13/2015. I do some data analysis of crime scenes in different areas and with respect to other parameters.
Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

theGuyWithBlackTie/electricChargingStations
Language: Jupyter Notebook - Size: 8.68 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

shutterstock/spark-phrases
phrase detection using Google's Word2phrase
Language: Scala - Size: 4.88 KB - Last synced at: 13 days ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

siddsax/D-Driven-IS
Instance Selection for Big Data
Language: Python - Size: 729 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

desaiankitb/spark-mllib
Apache Spark is one of the most widely used and supported open-source tools for machine learning and big data. This repo shows how to work with this platform for machine learning. It covers MLlib, the Spark machine learning library, which provides tools for data scientists and analysts who would rather find solutions to business problems than code, test, and maintain their own machine learning libraries. The repo shows how to use DataFrames to organize data, and covers data preparation and the most commonly used types of machine learning algorithms: clustering, classification, regression, and recommendations. You will gain experience loading data into Spark, preprocessing data as needed to apply MLlib algorithms, and applying those algorithms to a variety of machine learning problems.
Language: Python - Size: 150 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 5

felidsche/mail-spam-filter
An email spam filter using Apache Spark’s ML library
Language: Python - Size: 212 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 1

mnassrib/pyspark-examples
This tutorial presents some examples in order to give a quick overview of the Spark APIs.
Language: Jupyter Notebook - Size: 8.48 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mnassrib/categorical-data-python
A simple demo repository showing how to handle categorical data in Python
Language: Jupyter Notebook - Size: 2.88 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

sirCamp/spark-pspectrum
P-spectrum embedding and sequence relaxation for NLP in Spark
Language: Scala - Size: 87.9 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

LuisFalva/ophelia
Ophelian On Mars! More than a simple framework.
Language: Python - Size: 2.16 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 5

vasnake/spark.ml.SpatialJoinTransformer
spark.ml.transformer: join two datasets using spatial relations
Language: Scala - Size: 120 KB - Last synced at: 17 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

JBris/docker-spark-sparklyr
Docker setup for Apache Spark and the R sparklyr package
Language: Dockerfile - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

q-maze/sparkHospital
Project to create models to predict ICU patient mortality based on demographic, diagnostic, and other factors utilizing Apache Spark.
Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 29 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

seahrh/bad-renter
Working examples of Spark ML Pipeline and SMOTE algorithm for synthetic data augmentation
Language: Scala - Size: 59.6 KB - Last synced at: 28 days ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

adityajn105/Apache-Spark-Tutorials
Apache Spark is a big data analysis framework.
Language: Jupyter Notebook - Size: 789 KB - Last synced at: 19 days ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 5

multivacplatform/multivac-ml
Pre-trained ML models for Apache Spark
Language: Scala - Size: 947 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

Hydrospheredata/fastserving
Spark MLlib serving library
Language: Scala - Size: 77.1 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

joyoyoyoyoyo/emojipasta-topic-modeling
😅 A topic model of reddit.com/r/EmojiPasta trained with Spark and an LDA model (NSFW) - Trigger Warning: The r/emojipasta subreddit posts controversial content, and everything I have crawled is there only to make the topics in that content visible. Unfortunately, it also contains discriminatory speech, which must be called out!
Language: Scala - Size: 700 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

zhaoyongchuang/spark-ml
study spark ml
Language: Scala - Size: 208 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

AbdelmajidLh/Spark_ML_Weather
Scala and Spark learning project: predicting tomorrow's rain from historical data
Language: Scala - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

John-CYHui/PySpark-Code
Code for PySpark Tutorial
Language: Python - Size: 38.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

giabaohb48/ReviewProduct
Evaluate products on Shopee based on comments
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

aabdel-kader/Apache-Spark
A repository for my practice exercises and projects using PySpark
Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

szaher/spark
Playing with Spark using Java
Language: Java - Size: 424 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

angeligareta/machine-learning-spark
Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.
Language: Scala - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

mnassrib/bankmarketing-sparkml-databricks
This tutorial walks through a binary classification example with Spark ML, written in Python and run on a Databricks Community Edition cluster.
Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

anmolmore/Enzyme-Classifier-Using-ML
Classify enzymes from genomic sequences using spark-ML
Language: Jupyter Notebook - Size: 719 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

yukia3e/learning-spark-3
Language: Python - Size: 289 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Dragon1573/Online-Works 📦
Backup of work from the Teddy (泰迪) online internship
Language: Scala - Size: 2.24 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

maheshg23/Life-Expectancy-Predictor-BigDataProject
Life Expectancy Predictor - Big Data final course project using Big Data Technologies.
Language: Scala - Size: 376 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

Evegen55/mastering-spark
mastering spark
Language: Java - Size: 1.38 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

hypnosapos/sparknetes
Spark on Kubernetes PoCs
Language: Makefile - Size: 1.12 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

hsuanhauliu/yelp-recommendation-system
Yelp recommendation system using collaborative-filtering algorithms.
Language: Python - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

corneliouzbett/Master-Apache-Spark
Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming
Language: Python - Size: 889 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

hasitha087/NPSClassification
A PySpark-based NPS (Net Promoter Score) text classification model developed using a Naive Bayes classifier.
Language: Python - Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

multivacplatform/multivac-nlp
Testing and benchmarking some of the existing NLP libraries in Apache Spark
Language: Scala - Size: 12 MB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

AndrewKuzmin/spark-ml-pipelines-with-structured-streaming-examples
Examples of using Apache Spark MLlib Pipelines and Structured Streaming on version 2.4.0
Language: Shell - Size: 1020 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

maikeffi/spark_ml_boston
Load a CSV from Hadoop/Hive and perform ML functions
Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

necosta/forecast-x
Tennis data exploration and forecasting
Language: Scala - Size: 40 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Jayasagar/sparkml-regression-models-movie-revenue-predictions
TMDb movie dataset revenue predictions
Language: Jupyter Notebook - Size: 277 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

mkuthan/example-spark-ml
Machine learning, feature engineering examples using Spark ML
Language: Jupyter Notebook - Size: 129 KB - Last synced at: 14 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

EmmaMuhleman1/emmamuhlemantest1.github.io
Language: HTML - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

krobee/Stock-Price-Predictor
A system that can predict stock price based on Twitter data
Language: Scala - Size: 897 KB - Last synced at: 11 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

sueszli/sparkly-svm
Distributed training of an SVM with sparkML
Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hazecodeio/spark-sandbox
Language: Scala - Size: 13.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

glimmerphoenix/dataeng_book
Book: Fundamentals of Data Engineering (Fundamentos de Ingeniería de Datos)
Language: TeX - Size: 634 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AdrielC/skynet
Machine Learning (MLeap) Model Serving application for Scala
Language: Scala - Size: 150 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rahult18/NYC-Yellow-Taxi-Trip-Data-Pipeline
This is an end-to-end data pipeline that processes and analyzes NYC Yellow Taxi trip data. It includes data ingestion, cleaning, feature engineering, machine learning model training, and a REST API for fare amount prediction.
Language: Jupyter Notebook - Size: 81.8 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cmpt-732/Amazon_Product_Analysis
Amazon Product analysis
Language: Python - Size: 5.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lediau/bigdata-data-engineering-ai-masters
Language: Python - Size: 122 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Siddharth1989/ProspectiveTopUpCustomerPrediction
Developed a model/Spark ML pipeline stream to identify potential customers that may purchase top up services in the future.
Language: Jupyter Notebook - Size: 6.17 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Amir79Naziri/TwitterSentimentAnalysisWithSpark_Project
A sentiment analyzer using Spark ML library for Twitter Dataset
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pmb-7684/IBM-Data-Engineering-Professional-Certificate
Learning materials, assignments, and helpful resources for professional certification. Expected Completion June 2023
Size: 9.77 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Jayveersinh-Raj/trip_duration_big_data
Taxi trip duration forecasting using big data and Spark ML
Language: Jupyter Notebook - Size: 203 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

chandni-s/BigDataSystems
Big Data Management Systems & Tools
Language: HTML - Size: 781 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

yyqcs/spark-example
Common Spark 2.4.x examples, using Scala
Language: Scala - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

avisionary/reddit-comments-analysis
Big data project to analyze comments from the NoStupidQuestions subreddit
Language: HTML - Size: 5.68 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0
