ecosyste.ms

Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: sparkify

Repositories

pratikwatwani/ETL-pipeline-for-Sparkify

An ETL model designed using Postgres SQL for Sparkify database 🗄, modeling user activity data to create a database and ETL pipeline🔀 for a music streaming app 🎼.

Language: Jupyter Notebook - Size: 677 KB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

alessiococchieri/BDA-project-sparkify

This Git repo showcases my analysis of Sparkify dataset with PySpark on Apache Spark cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.

Language: Jupyter Notebook - Size: 4.11 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

SimplifyData/Cloud-Data-Warehouse-with-Redshift-AWS

Cloud Data Warehouse of Sparkify Data using Redshift

Language: Python - Size: 1.2 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

brunowdev/sparkify

This is the final project for the Data Scientist Nanodegree, where our goal is to predict churn for a fictional streaming service called Sparkify.

Language: HTML - Size: 6.33 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Mcamin/User-Churn-Prediction

Data Analysis in Spark to Identify Customer Churn for a fictional music service.

Language: Jupyter Notebook - Size: 254 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Related Keywords

sparkify 5 pyspark 3 data-modeling 2 database 2 etl-pipeline 2 churn-prediction 2 data-warehouses 1 dimension-tables 1 music-database 1 redshift 1 redshift-aws 1 staging-tables 1 data-science-capstone 1 pyspark-mllib 1 udacity 1 gradient-boosting 1 logistic-regression 1 python 1 support-vector-machines 1 tuning 1 data-lake 1 data-engineering 1 aws-redshift 1 analytics-tables 1 spark 1 machine-learning 1 churn-analysis 1 big-data-processing 1 big-data-analytics 1 big-data 1 apache-spark 1 postgresql 1 etl 1 datamodel 1