GitHub topics: hiveql
ethanwebber123/Airflow-ETL-ELT
Airflow-ETL-ELT is a robust data pipeline tool that enables efficient extraction, transformation, loading, and orchestration of data workflows. It offers a scalable and customizable solution for managing complex ETL and ELT processes with ease.
Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

huandu/go-sqlbuilder
A flexible and powerful SQL string builder library plus a zero-config ORM.
Language: Go - Size: 329 KB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 1,528 - Forks: 123

git4additi/Indian-food-prices
A case study on daily Indian Food Prices Analysis using Hive
Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 8 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

DTStack/monaco-sql-languages
SQL languages for monaco-editor
Language: TypeScript - Size: 64.7 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 48

macbre/sql-metadata
Uses tokenized query returned by python-sqlparse and generates query metadata
Language: Python - Size: 886 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 848 - Forks: 126

sudo-which-qp/hive_note_app
Simple note app using Hive and Flutter
Language: Dart - Size: 1.3 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 16 - Forks: 2

camilesing/Hive-Spark-SQL-Helper-VSCode
Hive & Spark SQL extension for Visual Studio Code
Language: TypeScript - Size: 7.08 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 4 - Forks: 0

vickyjkwan/sqlanalyzer
A SQL parser and analyzer for sql flavors including MySQL, PostgreSQL, BigQuery Standard SQL, Presto SQL and Hive SQL.
Language: Jupyter Notebook - Size: 474 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 1

RubensZimbres/Repo-2019
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Language: Jupyter Notebook - Size: 57.8 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 138 - Forks: 73

Pirate-Emperor/BigData-Pipeline
BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.
Language: Dockerfile - Size: 7.95 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

giuseppericcio/BigData
Solutions to the homework assigned in the Big Data Engineering course taught by Prof. Vincenzo Moscato, Università degli Studi di Napoli "Federico II", academic year 2022-23
Language: Jupyter Notebook - Size: 58.7 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

andresilvase/tasky
Simple offline task manager.
Language: Dart - Size: 4.93 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Narius2030/Hive-DataWarehouse-Analysis
Implements a Hive data warehouse to store meaningful data and applies machine learning, such as clustering or regression, to business problems
Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 3

madhurimarawat/Big-Data-Analytics
This repository demonstrates big data processing, visualization, and machine learning using tools such as Hadoop, Spark, Kafka, and Python.
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

seanpm2001/Bliss_Browser_HiveQL
🌳️🌐️#️⃣️ The Bliss Browser HiveQL language support module, allowing HiveQL programs to be written and run within the browser.
Language: HiveQL - Size: 1.74 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

uber/athenadriver
A fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader)
Language: Go - Size: 2.96 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 156 - Forks: 37

developer-sdk/beginner-bigdata-example
Examples of Hadoop, Hive, and Spark jobs
Language: Java - Size: 3.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

Mariam-iftikhar/BigDataProjects
The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.
Size: 10.4 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

marcelmittelstaedt/BigData
Lecture: Big Data
Language: HTML - Size: 588 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 15 - Forks: 12

HabibAroua/Newspaper-analysis
Language: Java - Size: 12.5 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

aymane-maghouti/Mobile-Data-Hive-Insights
This project demonstrates extracting data from a MySQL database, transferring it with Apache Sqoop, storing it in a Hive data warehouse (the data is actually stored in the Hadoop Distributed File System (HDFS)), and analyzing it with Hive Query Language (HiveQL), a language close to SQL. The data is then visualized in Power BI,
Language: HiveQL - Size: 691 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

omkaarlavangare/pig-hive-movielens
Basic Analysis of MovieLens Dataset with Pig and Hive
Language: PigLatin - Size: 6.84 KB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

subhanjandas/User-Occupation-and-Movies-Ratings-Data-Exploration-using-Apache-Hive
In this project, the objective was to analyze the "User, Occupation, Movies, and Ratings" dataset using Apache Hive. The data was processed and analyzed using Hive's SQL-like query language and MapReduce framework, making it easier to handle large datasets. The focus of the analysis was to provide a comprehensive breakdown of the data
Language: JavaScript - Size: 1.05 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

GoldenDein/Clinicaltrial-Data-analysis
This project uses big data tools such as Hive, PySpark, and AWS Glue to explore clinical trial data and gain further insights into the trials
Language: HTML - Size: 12.9 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

VictoriaGomesDS/Intro_Ecossistema_Hadoop
Size: 233 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

rishuatgithub/hive-custom-udfs
This is a repository for custom user defined functions used in Apache Hive
Language: Java - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

barathsuresh/HexScript
A simple note-taking app with Material You theming. Notes are backed up to Firebase and also stored offline. Stored notes are encrypted by default.
Language: Dart - Size: 668 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

RoshnaKU/AzureDataEngineering-ECDC-CovidSpreadAnalytics
Analytics on COVID data from the ECDC website using Azure services: ADF, Databricks, HDInsight
Language: PowerShell - Size: 942 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Ashutosh27ind/hiveCaseStudyNYCYellowTripData
The purpose of this dataset is to get a better understanding of the taxi system so that the city of New York can improve the efficiency of in-city commutes.
Language: HiveQL - Size: 17.8 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

miguel617/MovieLens-Data-Engineer-Analytics-Project
The objective of this project is to build a data pipeline to show and analyse the results in PowerBI from the MovieLens 25M database, using Hive and Python.
Language: Jupyter Notebook - Size: 25.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

manojkumarvohra/hive-blocks
The project supports running conditional blocks with hive queries
Language: Java - Size: 77.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

Eng-ZeyadTarek/hadoop-dojo
Implementation of some tasks in the Hadoop framework (hive-spark-pig-spark)
Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

leovan/hive-functions
Hive functions
Language: Java - Size: 197 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 2

lulumengyi/Hive_SQL_AST
Parses HiveSQL logs with Druid SQL Parser to automatically build column-level lineage and extract primary and foreign keys
Language: Java - Size: 97.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 21

claireboyd/311requests_chicago
Created a simple web app which gives users a summary of the types of 311 requests in their Chicago neighborhood, built with Lambda Architecture principles using Apache's tech stack
Language: HiveQL - Size: 27.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mikeroyal/Apache-Hive-Guide
Apache Hive Guide
Size: 357 KB - Last synced at: 21 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

nourhansowar/Hadoop-Hive-Cluster
Hadoop Hive Cluster
Language: Shell - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Dhi13man/ycombinator_news_client
A Flutter based client app functioning as a Real-Time News Forum viewer, using YCombinator's Hacker News API. Can work both using Firebase Cloud Features, or using a Local Hive Database, as user desires.
Language: Dart - Size: 53.2 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

aminscientist/Fraud-Detection-in-Financial-Transactions
Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elouardyabderrahim/fraud-detection-in-financial-transactions
FinTech Innovations faces a rising challenge of fraudulent transactions impacting trust and causing financial losses. We propose a real-time fraud detection system analyzing transactional, customer, and external data to minimize false alerts.
Size: 183 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sahilbhange/hive-sql-slowly-changing-dimension
Slowly Changing Dimension Type 2 in Hive query language using an exclusive join technique with ORC Hive tables, plus a performance comparison of partitioned and clustered Hive tables
Language: Python - Size: 308 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 10

codyle50/Airbnb-Big-Data-Management
To develop an Airbnb database and create a pipeline using MongoDB and Hadoop architecture to ease the process of managing, loading, processing, querying, and analyzing Airbnb data based on location
Language: Jupyter Notebook - Size: 377 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

toukirnaim08/HiveQL-Hadoop-MapReduce
A HiveQL script with Hadoop/MapReduce Program to find out the most popular movies for different age groups.
Language: HiveQL - Size: 5.52 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

ssandeep858/Big-Data-Management
sql
Language: PLSQL - Size: 723 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

dhritimannath/sales-analysis-hive
Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

inferrinizzard/prettier-sql Fork of sql-formatter-org/sql-formatter 📦
[ARCHIVED] Please use https://github.com/sql-formatter-org/sql-formatter
Language: TypeScript - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 21 - Forks: 5

seanpm2001/AI2001_Category-Source_Code-SC-HiveQL
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:HiveQL category for AI2001, containing HiveQL programming language datasets
Language: R - Size: 2.46 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

natg76/AssignmentsNG
Individual assignments
Language: R - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

mishraapoorva/AmazonReviewAnalysis
A big data project with MapReduce code in Java that analyzes Amazon customer reviews and provides some insights
Language: Java - Size: 2.14 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vivek2319/Learn-Hadoop-and-Spark
This repository gathers a curated list of resources for learning Hadoop for free.
Language: Python - Size: 211 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 46 - Forks: 39

ashok-amsamani/Hive-Sqoop-Integration-2
Steps for moving data from MySQL to Hive, and from Hive back to MySQL, using Sqoop. Follow the steps given in script.txt. It has Hive, Sqoop, Linux, and Hadoop scripts. You should know where to run each script to try this exercise. It covers:
Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-Schema-Migration-Xml-to-Json
Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/Hive-Schema-Evolution
Details how to dynamically load columns in Hive using Avro.
Language: HiveQL - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-ORC-PARQUET-BENCHMARK
Finding storage space requirement and data retrieval time for ORC and Parquet.
Language: HiveQL - Size: 2.08 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-DML
Documented my learnings - how to perform DML operations in HIVE.
Language: HiveQL - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-SQOOP-Integration
Steps for moving data from MySQL to Hive, and from Hive back to MySQL, using Sqoop.
Size: 216 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

favouribude1/Analysis-of-Road-Accidents-in-the-UK
Analyzed a dataset of over 16 million road accident records using big data tools, developed a data mart for the UK transport sector using an ETL approach, and built a dashboard in Tableau to visualize the insights, trends, and patterns
Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Eeun-ju/SQL-Study
A space for organizing SQL syntax that is essential for practical work
Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

kinszee/MySQL-Hive-PowerBI-Pipeline
Built a data pipeline by creating tables in MySQL DB, ingested tables to Hadoop for data warehousing and built HiveQL views. Hive views in Linux VM were connected to Power BI application in Windows to create visualizations.
Size: 2.17 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SyaneAndrade/RunSqlHive
Connects to Hive through a JDBC connector and executes queries supplied in a file.
Language: Java - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

MagdaleneHo/MapReduce
A simple project on the use of map and reduce in Hadoop.
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

Rishi500067313/Corona_data_analysis_using_hive
Size: 955 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

kr900910/hospital_data_analysis
ETL process which loads and transforms Medicare hospital data using Python and Hive
Language: Shell - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

globosc/bigdata
Analysis of the GDELT Project with Hadoop-based big data tools on the Microsoft Azure cloud
Language: Python - Size: 3.56 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

rmartinezcu/ProyectoHadoop
Coursework for the Hadoop course, completed by Group 5
Language: Shell - Size: 17.9 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

phaniteja5789/Hive
Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Mathews-Tom/MSc-in-Machine-Learning-and-Artificial-Intelligence
Master of Science in Machine Learning & Artificial Intelligence - Indian Institute of Technology Madras & Liverpool John Moores University
Language: Jupyter Notebook - Size: 2.12 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 7

Subham2S/BigData-Engineering-Capstone-Project-1
BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git
Language: Python - Size: 15.2 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

rishabmenon/YouTube-Data-Analysis-Hadoop
This Hadoop project involves analysing the YouTube dataset to solve a few problem statements.
Size: 1.75 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 5

slatawa/Forex-Currency-Processing-Airflow-Hdfs-Hive-Spark
We build a Forex currency rates pipeline that gets currency rates from an external API and loads the data into HDFS, from where a PySpark job massages the data and inserts it into a Hive table. The objective of this pipeline is to get the data ready for any downstream machine learning pipeline.
Language: Shell - Size: 643 KB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Antonio-Borges-Rufino/Dados_Aeronauticos_Data_Pipeline
Type: Data engineering. Technologies: Apache NiFi, Apache Druid, REST API, MySQL, Apache Hadoop, Apache Hive, Apache Kafka.
Language: Jupyter Notebook - Size: 5.24 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

brightjonathan/Note_List
With Note list you can quickly write notes and to-do items and jot down all your ideas and reminders. If you are looking for a simple yet powerful Material Design notepad app for your Android device, Note list is it.
Language: C++ - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Nandan9911/Big-Data-minor-projects
Problems on Hadoop-MapReduce, Hive and PySparkSQL
Language: Java - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

tanjiarui/Artefact-case
Case study of Artefact
Language: Python - Size: 2.12 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

seanpm2001/Learn-HiveQL
A repository for showcasing my knowledge of the HiveQL programming language, and continuing to learn the language
Language: HiveQL - Size: 529 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

alvarofpp/4linux-hadoop 📦
Scripts used during the "Big Data Analytics com Hadoop" course offered by 4Linux
Language: PigLatin - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

FareedKhan-dev/apache-hive-guide
This project is based on the Docker image of Apache Hive and shows the basic commands needed to understand Hive queries.
Size: 10.4 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

wzqwtt/BigData
Big data study notes for beginners :star:
Language: Java - Size: 128 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

canaytore/hive-learnings
Apache Hive - an SQL-like interface to query data stored in various databases and file systems that integrate with Apache Hadoop
Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

rishabmenon/Airlines-Analysis-Hadoop
This Hadoop project involves analysing the airline datasets to solve a few problem statements.
Size: 2.22 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 5

avatime/gamul-gamul
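🏆 Gamul-Gamul: a web app service that provides food ingredient price information based on consumer prices, using distributed big data processing - 🥇 SSAFY 7th cohort specialized project, 1st place Excellence Award (2022.10.07)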
Language: Java - Size: 190 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

seanpm2001/SNU_2D_ProgrammingTools_IDE_HiveQL
The HiveQL Programming language IDE submodule for SNU Programming Tools (2D Mode)
Language: HiveQL - Size: 1.32 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

harshbg/PUBG-Game-Data-Analysis
An analysis of PlayerUnknown's Battlegrounds (PUBG) game data using Hive and Spark.
Language: Scala - Size: 5.99 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 4

kaushik-prasad-dey/Market-Analysis-in-Banking-Domain-Big-data-Hadoop-Spark
Language: HTML - Size: 1.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

wushengyeyouya/Hive-JDBC-Proxy
Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, with load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.
Language: Scala - Size: 74.2 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 30 - Forks: 15

ai1138/Analyzing_Brooklyn
For this project we studied 3 data sets revolving around neighborhoods in New York City. We hope to learn what neighborhoods in Brooklyn are good to live in
Language: HiveQL - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

teepha/clinicaltrial_data_analysis
Analysis of clinical trial data
Language: HTML - Size: 10.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

aphp/HiveQLKernel
HiveQL Jupyter Kernel
Language: Python - Size: 61.5 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 5

AdamJeddy/Zeppelin-Notebook-Archive
Big Data Management related Zeppelin notebooks
Language: Java - Size: 2.04 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

zahir2000/spotify
WQD7007
Language: PigLatin - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

WesleySiNeves/AprendendoPython
Language: Jupyter Notebook - Size: 1.65 GB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

uzeziogho/ClinicalTrial
Data Analysis using Databricks
Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

puneetnegi002/analytics-using-clickstream-data
Analytics on clickstream data, visualized using Tableau
Size: 7.42 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 3

gabriel-solon-padilha/criando_um_datalakehouse_databricks
My eleventh project, in which I build a data lakehouse using distributed computing on Databricks
Language: HTML - Size: 2.1 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 2

samye760/Wikipedia-Big-Data-Analysis
Script for analyzing big data sets from Wiki dumps.
Language: HiveQL - Size: 7.22 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ChoiSol24/LYRICS_DATA_ANALYSIS
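Extracts emotion words from the lyrics of songs in the top 100 of the Melon weekly chart from 2019 to 2021, then analyzes the counts of positive and negative words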
Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

rishabmenon/Analysing-Book-Dataset-Hadoop
This Hadoop project involves analysing the book datasets to solve a few problem statements.
Size: 26.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

cevheri/hive-java-example
Hive Query Language example with Apache Hive, Apache Hadoop, Java
Language: Java - Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

ManikHossain08/Bixi-Cloud-ETL-Data-Pipeline-using-Scala-Hive-AWS_Athena_JDBC-Driver
An automated ETL data pipeline that extracts complex JSON data from a web API service (GBFS Bixi data) and converts it to CSV for loading into the HDFS data warehouse. Hive then processes the data further using external and managed tables. The same procedure is also applied with AWS S3 and Athena.
Language: Scala - Size: 117 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

YassineMarroun/SSBBDD-Practica2020
Introduction to handling massive data with Hadoop and SQL-inspired tools. Coursework for Sistemas de Bases de Datos (Database Systems), academic year 2019/2020, UNED
Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0
