GitHub / TahirZia-1 / EDA-Netflix-Dataset-using-PySpark-on-Docker
This project demonstrates how to perform Exploratory Data Analysis (EDA) on the Netflix dataset using PySpark in a Jupyter Notebook environment. It involves setting up Spark, loading a dataset, performing basic data cleaning, and visualizing the results. All of it is runnning on a container in Docker.
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 1.75 MB
Dependencies parsed at: Pending
Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: 23 days ago
Topics: dataset, docker, docker-image, eda, jupyter-notebook, netflix, pyspark, pyspark-notebook, python