An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cloudera-hadoop-framework

jigyasaG18/Airline-Performance-And-Passenger-Satisfaction-Project-Using-Big-Data-Analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

Language: HiveQL - Size: 21.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gowriaddepalli/movie_lens_analysis

mini project for big data elective in final year using cloudera hadoop's framework with pig ,hive and data visualization with tableau connected to hadoop with hive odbc driver.

Size: 1.8 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 3

marycboardman/Assessment-Attempts

Data processing using docker containers, kafka, spark, and hadoop

Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

SakhriHoussem/SparkSQL-Tutorial

a Simple SparkSQL Tutorial

Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

SakhriHoussem/Apache-Spark-Tutorial

a Simple Apache Spark Tutorial

Size: 5.57 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0