GitHub / huzaifakhan04 / exploratory-data-analysis-on-amazon-review-data-using-mongodb-and-pyspark
This repository showcases the outcomes of an Exploratory Data Analysis (EDA), including visualisation, conducted on the comprehensive Amazon Review Data (2018) dataset, consisting of nearly 233.1 million records and occupying approximately 128 gigabytes (GB) of data storage, using MongoDB and PySpark.
Stars: 0
Forks: 0
Open issues: 0
License: bsd-3-clause
Language: Jupyter Notebook
Size: 875 KB
Dependencies parsed at: Pending
Created at: almost 2 years ago
Updated at: almost 2 years ago
Pushed at: almost 2 years ago
Last synced at: almost 2 years ago
Topics: amazon, amazon-reviews, customer-analysis, customer-reviews, data-analysis, data-science, data-visualisation, eda, exploratory-data-analysis, inferential-statistics, mongodb, nosql, product-analysis, product-reviews, pyspark, statistical-analysis, statistical-inference, statistics