An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hiveql

ethanwebber123/Airflow-ETL-ELT

Airflow-ETL-ELT is a robust data pipeline tool that enables efficient extraction, transformation, loading, and orchestration of data workflows. It offers a scalable and customizable solution for managing complex ETL and ELT processes with ease.

Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

huandu/go-sqlbuilder

A flexible and powerful SQL string builder library plus a zero-config ORM.

Language: Go - Size: 329 KB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 1,528 - Forks: 123

git4additi/Indian-food-prices

A case study on daily Indian Food Prices Analysis using Hive

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 8 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

DTStack/monaco-sql-languages

SQL languages for monaco-editor

Language: TypeScript - Size: 64.7 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 48

macbre/sql-metadata

Uses tokenized query returned by python-sqlparse and generates query metadata

Language: Python - Size: 886 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 848 - Forks: 126

sudo-which-qp/hive_note_app

SImple Note App, using Hive and Flutter

Language: Dart - Size: 1.3 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 16 - Forks: 2

camilesing/Hive-Spark-SQL-Helper-VSCode

Hive & Spark SQL extension for Visual Studio Code

Language: TypeScript - Size: 7.08 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 4 - Forks: 0

vickyjkwan/sqlanalyzer

A SQL parser and analyzer for sql flavors including MySQL, PostgreSQL, BigQuery Standard SQL, Presto SQL and Hive SQL.

Language: Jupyter Notebook - Size: 474 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 1

RubensZimbres/Repo-2019

BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics

Language: Jupyter Notebook - Size: 57.8 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 138 - Forks: 73

Pirate-Emperor/BigData-Pipeline

BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.

Language: Dockerfile - Size: 7.95 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

giuseppericcio/BigData

Svolgimento degli homeworks assegnati nell'ambito del corso di Big Data Engineering del prof. Vincenzo Moscato, Università degli Studi di Napoli "Federico II", a.a. 2022-23

Language: Jupyter Notebook - Size: 58.7 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

andresilvase/tasky

Simple offline task manager.

Language: Dart - Size: 4.93 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Narius2030/Hive-DataWarehouse-Analysis

Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems

Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 3

madhurimarawat/Big-Data-Analytics

This repository demonstrates big data processing, visualization, and machine learning using tools such as Hadoop, Spark, Kafka, and Python.

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

seanpm2001/Bliss_Browser_HiveQL

🌳️🌐️#️⃣️ The Bliss Browser HiveQL language support module, allowing HiveQL programs to be written in and ran within the browser.

Language: HiveQL - Size: 1.74 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

uber/athenadriver

A fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader)

Language: Go - Size: 2.96 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 156 - Forks: 37

developer-sdk/beginner-bigdata-example

Hadoop, Hive, Spark 작업의 예제들

Language: Java - Size: 3.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

Mariam-iftikhar/BigDataProjects

The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.

Size: 10.4 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

marcelmittelstaedt/BigData

Lecture: Big Data

Language: HTML - Size: 588 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 15 - Forks: 12

HabibAroua/Newspaper-analysis

Language: Java - Size: 12.5 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

aymane-maghouti/Mobile-Data-Hive-Insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,

Language: HiveQL - Size: 691 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

omkaarlavangare/pig-hive-movielens

Basic Analysis of MovieLens Dataset with Pig and Hive

Language: PigLatin - Size: 6.84 KB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

subhanjandas/User-Occupation-and-Movies-Ratings-Data-Exploration-using-Apache-Hive

In this project, the objective was to analyze the "User, Occupation, Movies, and Ratings" dataset using Apache Hive. The data was processed and analyzed using Hive's SQL-like query language and MapReduce framework, making it easier to handle large datasets. The focus of the analysis was to provide a comprehensive breakdown of the data

Language: JavaScript - Size: 1.05 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

GoldenDein/Clinicaltrial-Data-analysis

This project utilizes Big data tools like Hive, Pyspark and AWS Glue to explore Clinical trial data to gain further insights into the clinical trials

Language: HTML - Size: 12.9 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

VictoriaGomesDS/Intro_Ecossistema_Hadoop

Size: 233 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

rishuatgithub/hive-custom-udfs

This is a repository for custom user defined functions used in Apache Hive

Language: Java - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

barathsuresh/HexScript

A simple note taking app with Material you theming. All your notes will be backed up by firebase also stored in offline. Notes stored are Encrypted by default.

Language: Dart - Size: 668 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

RoshnaKU/AzureDataEngineering-ECDC-CovidSpreadAnalytics

Performed Analytics on covid data from ECDC website utilizing Azure capabilities - ADF, Databricks, HDInsights

Language: PowerShell - Size: 942 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Ashutosh27ind/hiveCaseStudyNYCYellowTripData

The purpose of this dataset is to get a better understanding of the taxi system so that the city of New York can improve the efficiency of in-city commutes.

Language: HiveQL - Size: 17.8 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

miguel617/MovieLens-Data-Engineer-Analytics-Project

The objective of this project is to build a data pipeline to show and analyse the results in PowerBI from the MovieLens 25M database, using Hive and Python.

Language: Jupyter Notebook - Size: 25.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

manojkumarvohra/hive-blocks

The project supports running conditional blocks with hive queries

Language: Java - Size: 77.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

Eng-ZeyadTarek/hadoop-dojo

implementation of some tasks in hadoop framework (hive-spark-pig-spark)

Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

leovan/hive-functions

Hive 函数

Language: Java - Size: 197 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 2

lulumengyi/Hive_SQL_AST

利用Druid SQL Parser解析HiveSQL日志,自动构建字段级别的血缘关系及主外键的自动抽取

Language: Java - Size: 97.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 21

claireboyd/311requests_chicago

Created a simple web app which gives users a summary of the types of 311 requests in their Chicago neighborhood, built with Lambda Architecture principles using Apache's tech stack

Language: HiveQL - Size: 27.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mikeroyal/Apache-Hive-Guide

Apache Hive Guide

Size: 357 KB - Last synced at: 21 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

nourhansowar/Hadoop-Hive-Cluster

Haddop Hive Cluster

Language: Shell - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Dhi13man/ycombinator_news_client

A Flutter based client app functioning as a Real-Time News Forum viewer, using YCombinator's Hacker News API. Can work both using Firebase Cloud Features, or using a Local Hive Database, as user desires.

Language: Dart - Size: 53.2 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

aminscientist/Fraud-Detection-in-Financial-Transactions

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elouardyabderrahim/fraud-detection-in-financial-transactions

FinTech Innovations faces a rising challenge of fraudulent transactions impacting trust and causing financial losses. We propose a real-time fraud detection system analyzing transactional, customer, and external data to minimize false alerts.

Size: 183 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sahilbhange/hive-sql-slowly-changing-dimension

Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered hive table performance comparison

Language: Python - Size: 308 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 10

codyle50/Airbnb-Big-Data-Management

To develop an Airbnb database and create a pipeline using MongoDB and Hadoop architecture to ease the process of managing, loading, processing, querying, and analyzing Airbnb data based on location

Language: Jupyter Notebook - Size: 377 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

toukirnaim08/HiveQL-Hadoop-MapReduce

A HiveQL script with Hadoop/MapReduce Program to find out the most popular movies for different age groups.

Language: HiveQL - Size: 5.52 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

ssandeep858/Big-Data-Management

sql

Language: PLSQL - Size: 723 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

dhritimannath/sales-analysis-hive

Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

inferrinizzard/prettier-sql Fork of sql-formatter-org/sql-formatter 📦

[ARCHIVED] Please use https://github.com/sql-formatter-org/sql-formatter

Language: TypeScript - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 21 - Forks: 5

seanpm2001/AI2001_Category-Source_Code-SC-HiveQL

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:HiveQL category for AI2001, containing HiveQL programming language datasets

Language: R - Size: 2.46 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

natg76/AssignmentsNG

Individual assignments

Language: R - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

mishraapoorva/AmazonReviewAnalysis

A BIG DATA project having map reduce code in Java to perform analysis on the Amazon customer review and provide some insights

Language: Java - Size: 2.14 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vivek2319/Learn-Hadoop-and-Spark

This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.

Language: Python - Size: 211 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 46 - Forks: 39

ashok-amsamani/Hive-Sqoop-Integration-2

Listed steps about how to move data from Mysql to HIVE using Sqoop and Hive to Mysql using sqoop. Follow steps given in script.txt. It has HIVE/SQOOP/Linux and Hadoop scripts. You should have a knowledge on where to run what to try this excercise. It covers:

Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-Schema-Migration-Xml-to-Json

Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/Hive-Schema-Evolution

Detailed about how we can dynamically load columns in HIVE using AVRO.

Language: HiveQL - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-ORC-PARQUET-BENCHMARK

Finding storage space requirement and data retrieval time for ORC and Parquet.

Language: HiveQL - Size: 2.08 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-DML

Documented my learnings - how to perform DML operations in HIVE.

Language: HiveQL - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ashok-amsamani/HIVE-SQOOP-Integration

Listed steps about how to move data from Mysql to HIVE using Sqoop and Hive to Mysql using sqoop.

Size: 216 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

favouribude1/Analysis-of-Road-Accidents-in-the-UK

Performed analysis on over 16 million road accident dataset using big data tools, developed a data mart for UK transport sector using ETL approach and built a dashboard in tableau to visualize the insight, trends, and patterns

Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Eeun-ju/SQL-Study

실무에 필수인 SQL문법을 정리해두는 공간

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

kinszee/MySQL-Hive-PowerBI-Pipeline

Built a data pipeline by creating tables in MySQL DB, ingested tables to Hadoop for data warehousing and built HiveQL views. Hive views in Linux VM were connected to Power BI application in Windows to create visualizations.

Size: 2.17 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SyaneAndrade/RunSqlHive

Faz a conexão e execução de querys disponibilizadas por meio de um arquivo utilizando um conector JDBC para o hive.

Language: Java - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

MagdaleneHo/MapReduce

A simple project on the use of map and reduce in Hadoop.

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

Rishi500067313/Corona_data_analysis_using_hive

Size: 955 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

kr900910/hospital_data_analysis

ETL process which loads and transforms Medicare hospital data using Python and Hive

Language: Shell - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

globosc/bigdata

Análisis al Proyecto GDELT con herramientas bigdata basadas den hadoop en nube Microsoft Azure

Language: Python - Size: 3.56 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

rmartinezcu/ProyectoHadoop

Trabajo para el curso de Hadoop, realizado por el Grupo 5

Language: Shell - Size: 17.9 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

phaniteja5789/Hive

Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Mathews-Tom/MSc-in-Machine-Learning-and-Artificial-Intelligence

Master of Science in Machine Learning & Artificial Intelligence - Indian Institute Technology Madras & Liverpool John Moores University

Language: Jupyter Notebook - Size: 2.12 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 7

Subham2S/BigData-Engineering-Capstone-Project-1

BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git

Language: Python - Size: 15.2 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

rishabmenon/YouTube-Data-Analysis-Hadoop

This Hadoop project involves analysing the YouTube dataset to solve a few problem statements.

Size: 1.75 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 5

slatawa/Forex-Currency-Processing-Airflow-Hdfs-Hive-Spark

We build a Forex-currency rates pipeline to get currency rates from an external API and load the data into HDFS from where we use pyspark job to massage the data and insert it into a Hive table. The objective of this pipeline is to get the data ready for any downstream machine learning pipeline.

Language: Shell - Size: 643 KB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Antonio-Borges-Rufino/Dados_Aeronauticos_Data_Pipeline

Tipo: Engenharia de dados. Tecnologias: Apache Nifi, Apache Druid, Api Rest, MySql, Apache Hadoop, Apache Hive, Apache Kafka.

Language: Jupyter Notebook - Size: 5.24 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

brightjonathan/Note_List

With Note list you can quickly write notes, todo items and write down all your ideas and reminders. Looking for a simple yet powerful material design notepad app for your Android device, Note list is it.

Language: C++ - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Nandan9911/Big-Data-minor-projects

Problems on Hadoop-MapReduce, Hive and PySparkSQL

Language: Java - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

tanjiarui/Artefact-case

Case study of Artefact

Language: Python - Size: 2.12 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

seanpm2001/Learn-HiveQL

A repository for showcasing my knowledge of the HiveQL programming language, and continuing to learn the language

Language: HiveQL - Size: 529 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

alvarofpp/4linux-hadoop 📦

Scipts usados durante o curso Big Data Analytics com Hadoop oferecido pela 4Linux

Language: PigLatin - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

FareedKhan-dev/apache-hive-guide

This project is based on docker image of apache hive, showing all the basic commands to understand hive queries.

Size: 10.4 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

wzqwtt/BigData

小白大数据学习笔记 :star:

Language: Java - Size: 128 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

canaytore/hive-learnings

Apache Hive - an SQL-like interface to query data stored in various databases and file systems that integrate with Apache Hadoop

Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

rishabmenon/Airlines-Analysis-Hadoop

This Hadoop project involves analysing the airline datasets to solve a few problem statements.

Size: 2.22 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 5

avatime/gamul-gamul

🏆가물가물 : 빅데이터 분산 처리를 활용한 물가기반 식재료 가격 정보 제공 웹앱 서비스 - 🥇SSAFY 7기 특화프로젝트 우수상 1등(2022.10.07)

Language: Java - Size: 190 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

seanpm2001/SNU_2D_ProgrammingTools_IDE_HiveQL

The HiveQL Programming language IDE submodule for SNU Programming Tools (2D Mode)

Language: HiveQL - Size: 1.32 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

harshbg/PUBG-Game-Data-Analysis

An Analysis of Player Unknown's Battle Grounds (PUBG) Game Data using Hive and Spark.

Language: Scala - Size: 5.99 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 4

kaushik-prasad-dey/Market-Analysis-in-Banking-Domain-Big-data-Hadoop-Spark

Language: HTML - Size: 1.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

wushengyeyouya/Hive-JDBC-Proxy

Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。

Language: Scala - Size: 74.2 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 30 - Forks: 15

ai1138/Analyzing_Brooklyn

For this project we studied 3 data sets revolving around neighborhoods in New York City. We hope to learn what neighborhoods in Brooklyn are good to live in

Language: HiveQL - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

teepha/clinicaltrial_data_analysis

Analysis of clinical trial data

Language: HTML - Size: 10.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

aphp/HiveQLKernel

HiveQL Jupyter Kernel

Language: Python - Size: 61.5 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 5

AdamJeddy/Zeppelin-Notebook-Archive

Big Data Management related Zeppelin notebooks

Language: Java - Size: 2.04 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

zahir2000/spotify

WQD7007

Language: PigLatin - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

WesleySiNeves/AprendendoPython

Language: Jupyter Notebook - Size: 1.65 GB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

uzeziogho/ClinicalTrial

Data Analysis using Databricks

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

puneetnegi002/analytics-using-clickstream-data

Analytics using clickstream data and visualise it using tableau

Size: 7.42 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 3

gabriel-solon-padilha/criando_um_datalakehouse_databricks

Meu décimo primeiro projeto em que crio um datalakehouse usando computação distribuído no databricks

Language: HTML - Size: 2.1 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 2

samye760/Wikipedia-Big-Data-Analysis

Script for analyzing Wiki dumps big data sets.

Language: HiveQL - Size: 7.22 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ChoiSol24/LYRICS_DATA_ANALYSIS

2019부터 2021까지 멜론 주간차트 100위 내의 음원 가사 감정어 추출 후, 긍정/부정어 개수 데이터 분석

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

rishabmenon/Analysing-Book-Dataset-Hadoop

This Hadoop project involves analysing the book datasets to solve a few problem statements.

Size: 26.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

cevheri/hive-java-example

Hive Query Language example with Apache Hive, Apache Hadoop, Java

Language: Java - Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

ManikHossain08/Bixi-Cloud-ETL-Data-Pipeline-using-Scala-Hive-AWS_Athena_JDBC-Driver

An Automated ETL Data pipeline which extract complex json data from web API service (GBFS-bixi Data) and convert to CSV for loading into Data-warehouse HDFS. After-that, Hive will process the further by external and managed table. Same procedure is also applied with AWS S3 and Athena.

Language: Scala - Size: 117 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

YassineMarroun/SSBBDD-Practica2020

Introducción al manejo de datos masivos con Hadoop y herramientas inspiradas en SQL, Práctica de Sistemas de Bases de Datos - Curso 2019/2020 - UNED

Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0