GitHub / yvgupta03 / Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis
A big-data application of machine learning to sentiment classification of US airline tweets. The focus is on using PySpark's MLlib libraries to solve a machine learning problem on large data, not on the choice of algorithm used to build the model. It also covers data pre-processing with NLP techniques, cross-validation, and a parameter-grid builder.
Stars: 2
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 1.83 MB
Created at: almost 3 years ago
Updated at: about 2 years ago
Pushed at: almost 3 years ago
Last synced at: almost 2 years ago
Topics: big-data, databricks-notebooks, ml-pipelines, pyspark-mllib, twitter-sentiment-analysis