GitHub / BhagyashriT / DICLAB2-DataAggregationBigDataAnalysisAndVisualization
Collected data about from three sources, one opinion-based social media in twitter, research data in New York Times, and the third is the common crawl data for the same topic or key phrase, and from similar time periods. Processed the three data sets collected individually using classical big data methods like Map Reduce in Google Dataproc Clusters. And then compared the outcomes using popular visualization methods in tableau.
Stars: 2
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 39.8 MB
Dependencies parsed at: Pending
Created at: almost 6 years ago
Updated at: over 3 years ago
Pushed at: over 5 years ago
Last synced at: over 1 year ago
Topics: commoncrawl, crawler, dataproc, google, mapreduce, nytimes-apis, tableau, twitter-api