An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cloudera

hortonworks/cloudbreak

CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.

Language: Java - Size: 216 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 357 - Forks: 236

tspannhw/FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...

Size: 767 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 20 - Forks: 0

HariSekhon/DevOps-Bash-tools

1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..

Language: Shell - Size: 11.4 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 6,673 - Forks: 1,242

HariSekhon/Nagios-Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Language: Python - Size: 8.83 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,145 - Forks: 507

DashDipti/cdw-workshop

This workshop aims to make use of airlines data set that is publicly available and showcase how one can make use of CDW for Open Data Lakehouse using Apache Iceberg.

Size: 33.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 8

cloudera/cdp-sdk-java

Cloudera CDP SDK for Java

Language: Java - Size: 161 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 11

cloudera/cdpcli

CDP command line interface (CLI)

Language: Python - Size: 1.42 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 15

frischHWC/cldr-playbook

Roles & Playbooks in Ansible to deploy CDP

Language: Jinja - Size: 1020 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 7

HariSekhon/lib

Perl Utility Library for my other repos

Language: Perl - Size: 1.88 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 33

severstal-digital/wunderkafka

The power of librdkafka for pythons

Language: Python - Size: 1.49 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4 - Forks: 1

bantone/llama32-vision-amp

Cloudera image-to-text AMP (Accelerator for Machine Learning Projects) leveraging Llama 3.2 11b & 90b

Language: Python - Size: 6.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

OryxProject/oryx 📦

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Language: Java - Size: 7.12 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 1,781 - Forks: 404

cloudera/tutorial-assets

Assets used in Cloudera Tutorials

Language: Python - Size: 31.2 MB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 23

san089/Cloudera_Material

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

Size: 9.02 MB - Last synced at: 12 days ago - Pushed at: almost 5 years ago - Stars: 37 - Forks: 30

frischHWC/datagen

Datagenerator for Data Services

Language: Java - Size: 6.69 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 15 - Forks: 5

thammuio/doc-genius-ai

DocGenius AI - Generative AI Chatbot for your Documents - powered by Cloudera

Language: Python - Size: 4.48 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 11 - Forks: 6

myndaaa/BigDataArchitecture-COS20028-Swinburne

Apache Hadoop – A course for undergraduates | along with Apache Pig and Hive

Language: Java - Size: 2.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

wmudge/cldr-runner Fork of cloudera-labs/cldr-runner

Ansible Execution Environment images for Cloudera Data Platform (CDP) Public and Private Cloud

Language: Shell - Size: 201 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

achraf-oujjir/ChatGPT-Users-Tweets-Pipeline

🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊

Language: Python - Size: 7.69 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

HariSekhon/HAProxy-configs

80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.

Language: Shell - Size: 496 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 220 - Forks: 79

timveil/hive-jdbc-uber-jar

Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version

Language: Java - Size: 3.79 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 265 - Forks: 96

DashDipti/cdf-workshop

This workshop aims to make use of log data to help practitioners gain understanding of CDF and to show the value it brings to enterprises who understand that data in motion related use cases can add value to their business.

Size: 28.5 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

VaishnavJois/CLOUDERA

Cloudera commands used for Big Data Analytics

Size: 13.7 KB - Last synced at: 12 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

oracle-quickstart/oci-cloudera 📦

Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)

Language: Python - Size: 1.67 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 6

miguel617/MovieLens-Data-Engineer-Analytics-Project

The objective of this project is to build a data pipeline to show and analyse the results in PowerBI from the MovieLens 25M database, using Hive and Python.

Language: Jupyter Notebook - Size: 25.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ssheremeta/airflow-cloudera

Apache Airflow parcel and CSD for Cloudera Manager

Language: Shell - Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 3

tspannhw/nifi-mxnetinference-processor

Apache NiFi Processor For Apache MXNet Inference

Language: Java - Size: 124 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

dmilan77/cloudera-phoenix Fork of apache/phoenix

CDH compliant Apache Phoenix

Language: Java - Size: 46.9 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 4

phdata/retirement-age 📦

phData Retirement Age Hadoop row based data lifecycle management

Language: Scala - Size: 107 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 3

ptobarra/Business-Intelligence-on-Big-Data-_-U-TAD-2017-Big-Data-Master-Final-Project

This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.

Language: Jupyter Notebook - Size: 130 MB - Last synced at: 12 months ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 1

Meetrics/cloudera-manager-tools

Cloudera Manager CLI tools to easily perform common operations using its API interface

Language: Python - Size: 36.1 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Ranjandas/Dirty-CDH-Docker

A quick and dirty CDH cluster skeleton using Docker for Testing

Language: Shell - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 2

fusiled/hadoop-pig-matrix-multiplication-benchmark

Language: Shell - Size: 2.38 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

juan-jose-vivas/Introduccion-Al-BIG-DATA-CLOUDERA-Ecosistema-HADOOP

El Big Data surgió cuando Google estaba en el proceso de indexar toda la web. Se encontró con ficheros enormes que no cabían en ningún servidor

Size: 647 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

zz22394/cdf-workshop

Cloudera CDP/CDF Workshop

Size: 16.4 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

achintya-kumar/SMM-with-Spark-Streaming

Scalable Map Matching with Apache Spark Streaming

Language: Java - Size: 47.7 MB - Last synced at: 12 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 2

HuemulSolutions/huemul-bigdatagovernance

Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.

Language: Scala - Size: 1.27 MB - Last synced at: about 22 hours ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 7

cloudera/observability

Cloudera Observability related artifacts including Grafana charts and Alert definitions

Language: Shell - Size: 55.7 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

cloudera/cdpcurl

Curl like tool with CDP request signing.

Language: Python - Size: 57.6 KB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 6 - Forks: 8

hysl/BigDataAnalytics

YouTube Trending Videos Project; Big data analysis and computing; Hadoop, Spark, classification, clustering, mapreduce

Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

chaimaebouyarmane/Big_Data

This repository serves as a hands-on implementation of a Big Data platform focused on processing parliamentary data from the website of the Moroccan Parliament. The project aims to calculate Key Performance Indicators (KPIs) to evaluate the engagement level of each government.

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

teamclairvoyant/hadoop-deployment-bash

Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.

Language: Shell - Size: 770 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 39

dileepe-projects/CCA131_Prep

My Notes for CCA 131 certification

Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

tspannhw/gtfs

GTFS / ProtoBuf Data

Language: Java - Size: 43 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

academyofdata/clusterdock

scripts for working with Cloudera's dockerized cluster - clusterdock

Language: Shell - Size: 78.1 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

academyofdata/clusterdock-with-zeppelin

clusterdock + zeppelin

Language: Shell - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

rurumimic/apache-impala

How to build

Language: Shell - Size: 16.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

France1/Hadoop-Spark-Training

Tutorials and example code to prepare for Cloudera "CCA Spark and Hadoop Developer" Certification

Language: Java - Size: 10.3 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 4

kushalhebbar/Big-data-project

Optimizing the storage capability of HDFS and HBase through data size factor with integrated security feature

Size: 48.9 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

vcputtini/impala-udf-cpp

Development of native C++ UDFs/UDAFs for Apache Impala.

Language: C++ - Size: 146 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

gitosamakhan/Multinode-Ambari-Cluster

Deploying a multinode ambari cluster on Linux (CentOS7) (Documentation)

Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

gitosamakhan/Multinode-Cloudera-Cluster

Deploying a multinode cloudera cluster on Linux(CentOS7) (Documentation)

Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

tspannhw/ClouderaFlowManagementWorkshop

Cloudera Flow Management Workshop with Apache NiFi

Language: Python - Size: 40.7 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 12

tspannhw/NiFItoKafkaConnect

NiFi -> Kafka Connect -> HDFS

Language: Shell - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 2

parth-code/Hadoop-XML

Extracting data from dblp.xml using Hadoop MapReduce

Language: XSLT - Size: 193 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ramapilli16/CCA175-PySpark-Practice-with-solutions

CCA175-PySpark-Practice-with-solutions

Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

akfincode/gcp-cloudera

Cloudera Install and Setup on Google Cloud (GCP)

Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

shreyas15/Ranked-File-Search

Information retrieval (IR) is concerned with finding material (e.g., documents) of an unstructured nature (usually text) in response to an information need (e.g., a query) from large collections. One approach to identify relevant documents is to compute scores based on the matches between terms in the query and terms in the documents. For example, a document with words such as ball ​ , team ​ , score ​ , championship ​ is likely to be about sports. It is helpful to define a weight for each term in a document that can be meaningful for computing such a score. I use popular information retrieval metrics such as term frequency, inverse document frequency, and their product, term frequency-inverse document frequency (TF-IDF), that are used to define weights for terms.

Language: Java - Size: 974 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

smartlin5228/CCA175

Language: Java - Size: 107 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 10

rafariva/Cloudera-Hadoop-Wordcount

Tutorial for beginners for installing hadoop on a virtualmachine and run the "hello world" of hadoop (wordcount)

Language: Java - Size: 332 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

xtutran/env-setup

How to install some software without Administrator privileges in Linux

Language: Shell - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

teamclairvoyant/apache-airflow-cloudera-parcel

Parcel for Apache Airflow

Language: Dockerfile - Size: 311 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 10

AlionSSS/CDH-Install-Manual

CDH安装手册

Size: 10.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 76 - Forks: 28

dimajix/vagrant-cloudera 📦

A Vagrant setup to run a virtual Cloudera cluster

Language: Puppet - Size: 31.3 KB - Last synced at: about 1 year ago - Pushed at: almost 9 years ago - Stars: 2 - Forks: 4

oleewere/cmctl

CLI tool for managing multiple Cloudbreak deployed CM instances

Language: Go - Size: 2.47 MB - Last synced at: 10 months ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

sihamhafsi/projet-big-data_analyse-des-donnees-youtube

Language: Java - Size: 5.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

thammuio/bigdata-cluster-ansible-playbook

Ansible Playbook's for building Big Data (Hadoop, Kafka, HBase) Clusters

Language: Shell - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

ryandawsonuk/data-platforms-tools

Guide to data platforms and tools

Size: 270 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 3

tspannhw/airline-sentiment-streaming

Streaming with Airline Sentiment. Utilizing Cloudera Machine Learning, Apache NiFi, Apache Hue, Apache Impala, Apache Kudu

Language: Jupyter Notebook - Size: 121 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 3

srowen/quatrains-rnn 📦

Simple example applying Keras, TensorFlow to Nostradamus's prophecies with Cloudera Data Science Workbench

Language: Python - Size: 75.2 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 5

srowen/cdsw-simple-serving 📦

Modeling Lifecycle with ACME Occupancy Detection and Cloudera

Language: Scala - Size: 76 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 18

chezou/homebrew-cloudera 📦

Homebrew Formulas for cloudera tools

Language: Ruby - Size: 12.7 KB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 10 - Forks: 7

Powerspace/kudu-from-avro 📦

A small Command Line tool to create an Kudu table from an Avro schema or from SQL script

Language: Scala - Size: 180 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 6 - Forks: 3

NFPA/LocationTools

Geocoding and Reverse Geocoding at Scale

Language: Java - Size: 25.4 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

mohan-aditya05/cloudera-medicare-challenge

Language: Jupyter Notebook - Size: 99.6 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

limz1986/GCP_Databricks_Hadoop

Hadoop_MapReduce_Google_Cloud_Cloudera_VM_Databricks

Language: Python - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

opencollector/ansible-dynamic-inventory-cloudera-scm

Ansible dynamic inventory script that retrieves hosts using Cloudera Manager API

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

Treydone/dek

Easily connect to multiple Hadoop clusters

Language: Java - Size: 677 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1

JohnnyFoulds/local-hadoop

This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.

Language: Python - Size: 216 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

ummmme/setup_cdh

CDH5.16.2 离线安装脚本

Language: Shell - Size: 21.8 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 15

haspdecrypted/OS-for-Big-Data-and-Hadoop

Getting Started with Hadoop and Big Data

Size: 23.4 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

teamclairvoyant/apache-airflow-cloudera-csd

CSD for Apache Airflow

Language: Shell - Size: 132 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 19 - Forks: 13

xiaojie-qian/Rail-tunnel-recommendation-SQL

Modern Big Data Analysis: recommend which pair of United States airports should be connected with a high-speed passenger rail tunnel.

Language: Shell - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

thammuio/bigdata-upgrade-suite

Suite of tools to help with HDP, CDH, CDP Hadoop Cluster Upgrades; CDH to CDP Migration; HDP to CDP Migration; CDP7 Migration

Language: Java - Size: 3.4 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 3

mchristian279/ConfigHostCloudera5.16.2

Provisionamento de Vms em ambiente KVM via terraform e ansible-playbook para configuração ambiente Cloudera 5.16.2.

Language: HCL - Size: 6.91 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

kikejimenez/nifi_api

NIFI API for a Cloudera Project

Language: Python - Size: 741 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Sathiyarajan/big-data-pipeline

Big Data

Language: Java - Size: 705 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 6

tspannhw/kafkarest-processor

Apache NiFi 1.10.0 Processor to consume 1 Kafka message at a time, easily to tie into a REST Proxy

Language: Java - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

tspannhw/MmFLaNK

Mm FLaNK Stack (MXNet, MiNiFi, Flink, NiFi, Kafka, Kudu) for AI-IoT

Language: Java - Size: 3.79 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 1

tspannhw/minifi-xaviernx

NVIDIA XAVIER NX - MiNiFi - NiFi - Kafka - Flink

Language: Python - Size: 29.3 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

tspannhw/minifi-jetson-nano

MiNiFi Agent Configuration and Scripts for NVidia Jetson Nano device

Language: Python - Size: 310 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

tspannhw/retail-dynamic-shelf-pricing

Retail - Dynamic Shelf Pricing

Language: Python - Size: 126 KB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

tspannhw/dws-coldsupplychain-hyperledger

Apache NiFi and Hyperledger Fabric for Cold Supply Chain Logistics

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

kongyew/greenplum-dockers

Create Greenplum docker files

Language: Python - Size: 32.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

generaliinformatik/hdfs-over-ftp Fork of iponweb/hdfs-over-ftp

FTP server which works on a top of HDFS and supports Kerberos authentication

Language: Java - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

razorsedge/hadoop-deployment-bash Fork of teamclairvoyant/hadoop-deployment-bash

Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.

Language: Shell - Size: 883 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

rajib1007/PROJECT1

An enrollment system is to help admission teams ultimately enroll more students.

Size: 10.6 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

Trupti2502/Associate-Management-System

This project created in hive. Project is about create and maintain a database for candidate enrollment where they provide training for various courses. In this project we created five tables. We performed various operations in this project like table join, partitioning, updating etc. We also performed queries in tables where we extracted the data from it. We run this project in hive terminal in cdh5 and hue from browser.

Size: 85.9 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

Trupti2502/Hive_mini_project

In this project, we are dealing with the database of about 50000 movie records in which there are attributes like movie name, release year, ratings, the release year and the time duration of the movie in Seconds. Using this database as a text file. So using file handling in python we are executing some queries to fetch details of movies using the given records.

Size: 1.08 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

kaminduN/cloudera-dev-cluster-setup

Automation scripts for a local cloudera 6 cluster for development purposes

Language: HTML - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0