GitHub topics: sparkml-pipelines
lykmapipo/Python-Spark-Log-Analysis
Python scripts to process, and analyze log files using PySpark.
Language: Python - Size: 131 KB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 0

Pirata-Codex/Sentiment-Analysis-SparkML
Using SparkML to build different machine learning models for simulating a small scale of big data management
Language: Jupyter Notebook - Size: 172 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

khaz-dev/Airfoil_ML_Pipeline
This Project about Build Machine Learning Pipeline using SparkML with Jupyter Notebook
Language: Jupyter Notebook - Size: 96.7 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dlleonardo/spark-de-ml-assignments
Spark DE&ML assignments from the "Data Engineering and Machine Learning with Spark" course (offered by IBM Skills Network)
Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ph2017001/FuzzyMatch_Spark
FuzzyMatch a Query Set with a Reference Set Using Spark
Language: Python - Size: 99.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

NijatZeynalov/Apache-SparkML-Pipelines
In this notebook I’ll use the HMP dataset and perform some basic operations using Apache SparkML Pipeline component. This dataset is a public collection of labelled accelerometer data recordings to be used for the creation and validation of acceleration models of human motion primitives.
Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0
