GitHub topics: cloudera
tspannhw/FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
Size: 767 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 22 - Forks: 0
HariSekhon/DevOps-Bash-tools
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
Language: Shell - Size: 11.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,211 - Forks: 1,351
hortonworks/cloudbreak
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
Language: Java - Size: 225 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 360 - Forks: 235
HariSekhon/Nagios-Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Language: Python - Size: 8.91 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,145 - Forks: 502
HariSekhon/lib
Perl Utility Library for my other repos
Language: Perl - Size: 1.88 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 18 - Forks: 33
Fradhyle/Voo-ong
인공지능을 활용한 개인화 영화 추천 시스템
Language: Jupyter Notebook - Size: 57.8 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0
cloudera/cdp-sdk-java
Cloudera CDP SDK for Java
Language: Java - Size: 194 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 15 - Forks: 11
frischHWC/one-script-deploy
One Click Script to Deploy CDP (CDP PvC & HDP & CDH)
Language: Shell - Size: 2.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 32 - Forks: 25
cloudera/cdpcli
CDP command line interface (CLI)
Language: Python - Size: 1.56 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 11 - Forks: 17
wmudge/cldr-runner Fork of cloudera-labs/cldr-runner
Ansible Execution Environment images for Cloudera Data Platform (CDP) Public and Private Cloud
Language: Shell - Size: 208 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
DashDipti/cdw-workshop
This workshop aims to make use of airlines data set that is publicly available and showcase how one can make use of CDW for Open Data Lakehouse using Apache Iceberg.
Size: 44.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 11
OryxProject/oryx 📦
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Language: Java - Size: 7.12 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 1,786 - Forks: 404
cloudera-labs/cdpy
A Simple Pythonic Client wrapper for Cloudera on Cloud
Language: Python - Size: 133 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 22
cloudera/observability
Cloudera Observability related artifacts including Grafana charts and Alert definitions
Language: Shell - Size: 55.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0
rodoac89/cloudera-private-base-installation
This repo is intended for help every mortal that try to install Cloudera on Bare Metal clusters and not die in the process
Language: Shell - Size: 12.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
thammuio/doc-genius-ai
DocGenius AI - Generative AI Chatbot for your Documents
Language: Python - Size: 4.48 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 6
jigyasaG18/Airline-Performance-And-Passenger-Satisfaction-Project-Using-Big-Data-Analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
Language: HiveQL - Size: 21.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
MasterPandaa/AirBNB_Cloudera_Hadoop
Pengolahan Dataset dan Analisis Tren Harga Sewa Properti AirBNB Menggunakan Cloudera Hadoop
Size: 30 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
bantone/llama32-vision-amp
Cloudera image-to-text AMP (Accelerator for Machine Learning Projects) leveraging Llama 3.2 11b & 90b
Language: Python - Size: 6.94 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
HariSekhon/HAProxy-configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Language: Shell - Size: 623 KB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 252 - Forks: 81
ahmedhany/cloudera-rag-lab
Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1
frischHWC/cldr-playbook
Roles & Playbooks in Ansible to deploy CDP
Language: Jinja - Size: 1020 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 7
cloudera/tutorial-assets
Assets used in Cloudera Tutorials
Language: Python - Size: 31.2 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 23
san089/Cloudera_Material
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
Size: 9.02 MB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 37 - Forks: 30
frischHWC/datagen
Datagenerator for Data Services
Language: Java - Size: 6.69 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 15 - Forks: 5
kikejimenez/nifi_api
NIFI API for a Cloudera Project
Language: Python - Size: 741 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0
myndaaa/BigDataArchitecture-COS20028-Swinburne
Apache Hadoop – A course for undergraduates | along with Apache Pig and Hive
Language: Java - Size: 2.4 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0
wikitops/ansible_cloudera
Ansible playbook to deploy a Cloudera cluster on Linux Vagrant instances.
Size: 35.2 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 1
achraf-oujjir/ChatGPT-Users-Tweets-Pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
Language: Python - Size: 7.69 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
timveil/hive-jdbc-uber-jar
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Language: Java - Size: 3.79 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 265 - Forks: 96
DashDipti/cdf-workshop
This workshop aims to make use of log data to help practitioners gain understanding of CDF and to show the value it brings to enterprises who understand that data in motion related use cases can add value to their business.
Size: 28.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1
VaishnavJois/CLOUDERA
Cloudera commands used for Big Data Analytics
Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
oracle-quickstart/oci-cloudera 📦
Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Language: Python - Size: 1.67 MB - Last synced at: 7 months ago - Pushed at: about 4 years ago - Stars: 20 - Forks: 6
miguel617/MovieLens-Data-Engineer-Analytics-Project
The objective of this project is to build a data pipeline to show and analyse the results in PowerBI from the MovieLens 25M database, using Hive and Python.
Language: Jupyter Notebook - Size: 25.8 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
ssheremeta/airflow-cloudera
Apache Airflow parcel and CSD for Cloudera Manager
Language: Shell - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 3
tspannhw/nifi-mxnetinference-processor
Apache NiFi Processor For Apache MXNet Inference
Language: Java - Size: 124 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 2
dmilan77/cloudera-phoenix Fork of apache/phoenix
CDH compliant Apache Phoenix
Language: Java - Size: 46.9 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 12 - Forks: 4
phdata/retirement-age 📦
phData Retirement Age Hadoop row based data lifecycle management
Language: Scala - Size: 107 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 3
ptobarra/Business-Intelligence-on-Big-Data-_-U-TAD-2017-Big-Data-Master-Final-Project
This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.
Language: Jupyter Notebook - Size: 130 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 1
Meetrics/cloudera-manager-tools
Cloudera Manager CLI tools to easily perform common operations using its API interface
Language: Python - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0
Ranjandas/Dirty-CDH-Docker
A quick and dirty CDH cluster skeleton using Docker for Testing
Language: Shell - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 9 years ago - Stars: 6 - Forks: 2
fusiled/hadoop-pig-matrix-multiplication-benchmark
Language: Shell - Size: 2.38 MB - Last synced at: almost 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0
juan-jose-vivas/Introduccion-Al-BIG-DATA-CLOUDERA-Ecosistema-HADOOP
El Big Data surgió cuando Google estaba en el proceso de indexar toda la web. Se encontró con ficheros enormes que no cabían en ningún servidor
Size: 647 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
zz22394/cdf-workshop
Cloudera CDP/CDF Workshop
Size: 16.4 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
achintya-kumar/SMM-with-Spark-Streaming
Scalable Map Matching with Apache Spark Streaming
Language: Java - Size: 47.7 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 2
HuemulSolutions/huemul-bigdatagovernance
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
Language: Scala - Size: 1.27 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 7
cloudera/cdpcurl
Curl like tool with CDP request signing.
Language: Python - Size: 57.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 10
teamclairvoyant/apache-airflow-cloudera-csd
CSD for Apache Airflow
Language: Shell - Size: 132 KB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 20 - Forks: 12
hysl/BigDataAnalytics
YouTube Trending Videos Project; Big data analysis and computing; Hadoop, Spark, classification, clustering, mapreduce
Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0
chaimaebouyarmane/Big_Data
This repository serves as a hands-on implementation of a Big Data platform focused on processing parliamentary data from the website of the Moroccan Parliament. The project aims to calculate Key Performance Indicators (KPIs) to evaluate the engagement level of each government.
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
dileepe-projects/CCA131_Prep
My Notes for CCA 131 certification
Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0
tspannhw/gtfs
GTFS / ProtoBuf Data
Language: Java - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 1
academyofdata/clusterdock
scripts for working with Cloudera's dockerized cluster - clusterdock
Language: Shell - Size: 78.1 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1
academyofdata/clusterdock-with-zeppelin
clusterdock + zeppelin
Language: Shell - Size: 64.5 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1
rurumimic/apache-impala
How to build
Language: Shell - Size: 16.6 KB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
France1/Hadoop-Spark-Training
Tutorials and example code to prepare for Cloudera "CCA Spark and Hadoop Developer" Certification
Language: Java - Size: 10.3 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 4
kushalhebbar/Big-data-project
Optimizing the storage capability of HDFS and HBase through data size factor with integrated security feature
Size: 48.9 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0
vcputtini/impala-udf-cpp
Development of native C++ UDFs/UDAFs for Apache Impala.
Language: C++ - Size: 146 KB - Last synced at: 8 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1
tspannhw/ClouderaFlowManagementWorkshop
Cloudera Flow Management Workshop with Apache NiFi
Language: Python - Size: 40.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 12
tspannhw/NiFItoKafkaConnect
NiFi -> Kafka Connect -> HDFS
Language: Shell - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2
parth-code/Hadoop-XML
Extracting data from dblp.xml using Hadoop MapReduce
Language: XSLT - Size: 193 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0
ramapilli16/CCA175-PySpark-Practice-with-solutions
CCA175-PySpark-Practice-with-solutions
Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 2
akfincode/gcp-cloudera
Cloudera Install and Setup on Google Cloud (GCP)
Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0
shreyas15/Ranked-File-Search
Information retrieval (IR) is concerned with finding material (e.g., documents) of an unstructured nature (usually text) in response to an information need (e.g., a query) from large collections. One approach to identify relevant documents is to compute scores based on the matches between terms in the query and terms in the documents. For example, a document with words such as ball , team , score , championship is likely to be about sports. It is helpful to define a weight for each term in a document that can be meaningful for computing such a score. I use popular information retrieval metrics such as term frequency, inverse document frequency, and their product, term frequency-inverse document frequency (TF-IDF), that are used to define weights for terms.
Language: Java - Size: 974 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0
smartlin5228/CCA175
Language: Java - Size: 107 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 7 - Forks: 10
rafariva/Cloudera-Hadoop-Wordcount
Tutorial for beginners for installing hadoop on a virtualmachine and run the "hello world" of hadoop (wordcount)
Language: Java - Size: 332 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1
teamclairvoyant/apache-airflow-cloudera-parcel
Parcel for Apache Airflow
Language: Dockerfile - Size: 311 MB - Last synced at: 4 months ago - Pushed at: about 6 years ago - Stars: 17 - Forks: 10
AlionSSS/CDH-Install-Manual
CDH安装手册
Size: 10.8 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 76 - Forks: 28
dimajix/vagrant-cloudera 📦
A Vagrant setup to run a virtual Cloudera cluster
Language: Puppet - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 9 years ago - Stars: 2 - Forks: 4
oleewere/cmctl
CLI tool for managing multiple Cloudbreak deployed CM instances
Language: Go - Size: 2.47 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0
matkosoric/Data-Visualizations
Various data visualizations using Databricks, Zeppelin, ggplot2, matplotlib, Impala, Splunk...
Language: Jupyter Notebook - Size: 81.9 MB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0
sihamhafsi/projet-big-data_analyse-des-donnees-youtube
Language: Java - Size: 5.21 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0
thammuio/bigdata-cluster-ansible-playbook
Ansible Playbook's for building Big Data (Hadoop, Kafka, HBase) Clusters
Language: Shell - Size: 43 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0
ryandawsonuk/data-platforms-tools
Guide to data platforms and tools
Size: 270 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 22 - Forks: 3
tspannhw/airline-sentiment-streaming
Streaming with Airline Sentiment. Utilizing Cloudera Machine Learning, Apache NiFi, Apache Hue, Apache Impala, Apache Kudu
Language: Jupyter Notebook - Size: 121 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 3
srowen/cdsw-simple-serving 📦
Modeling Lifecycle with ACME Occupancy Detection and Cloudera
Language: Scala - Size: 76 MB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 14 - Forks: 17
srowen/quatrains-rnn 📦
Simple example applying Keras, TensorFlow to Nostradamus's prophecies with Cloudera Data Science Workbench
Language: Python - Size: 75.2 KB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 5
chezou/homebrew-cloudera 📦
Homebrew Formulas for cloudera tools
Language: Ruby - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 7
Powerspace/kudu-from-avro 📦
A small Command Line tool to create an Kudu table from an Avro schema or from SQL script
Language: Scala - Size: 180 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 3
teamclairvoyant/hadoop-deployment-bash
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
Language: Shell - Size: 880 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 34 - Forks: 39
NFPA/LocationTools
Geocoding and Reverse Geocoding at Scale
Language: Java - Size: 25.4 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2
mohan-aditya05/cloudera-medicare-challenge
Language: Jupyter Notebook - Size: 99.6 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0
limz1986/GCP_Databricks_Hadoop
Hadoop_MapReduce_Google_Cloud_Cloudera_VM_Databricks
Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0
opencollector/ansible-dynamic-inventory-cloudera-scm
Ansible dynamic inventory script that retrieves hosts using Cloudera Manager API
Language: Python - Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 1
Treydone/dek
Easily connect to multiple Hadoop clusters
Language: Java - Size: 688 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1
JohnnyFoulds/local-hadoop
This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.
Language: Python - Size: 216 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1
ummmme/setup_cdh
CDH5.16.2 离线安装脚本
Language: Shell - Size: 21.8 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 16 - Forks: 15
haspdecrypted/OS-for-Big-Data-and-Hadoop
Getting Started with Hadoop and Big Data
Size: 23.4 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1
xiaojie-qian/Rail-tunnel-recommendation-SQL
Modern Big Data Analysis: recommend which pair of United States airports should be connected with a high-speed passenger rail tunnel.
Language: Shell - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0
thammuio/bigdata-upgrade-suite
Suite of tools to help with HDP, CDH, CDP Hadoop Cluster Upgrades; CDH to CDP Migration; HDP to CDP Migration; CDP7 Migration
Language: Java - Size: 3.4 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 3
mchristian279/ConfigHostCloudera5.16.2
Provisionamento de Vms em ambiente KVM via terraform e ansible-playbook para configuração ambiente Cloudera 5.16.2.
Language: HCL - Size: 6.91 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0
Sathiyarajan/big-data-pipeline
Big Data
Language: Java - Size: 705 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 6
tspannhw/kafkarest-processor
Apache NiFi 1.10.0 Processor to consume 1 Kafka message at a time, easily to tie into a REST Proxy
Language: Java - Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1
osama127001/Multinode-Cloudera-Cluster
Deploying a multinode cloudera cluster on Linux(CentOS7) (Documentation)
Size: 8.7 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1
tspannhw/MmFLaNK
Mm FLaNK Stack (MXNet, MiNiFi, Flink, NiFi, Kafka, Kudu) for AI-IoT
Language: Java - Size: 3.79 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1
tspannhw/minifi-xaviernx
NVIDIA XAVIER NX - MiNiFi - NiFi - Kafka - Flink
Language: Python - Size: 29.3 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1
tspannhw/minifi-jetson-nano
MiNiFi Agent Configuration and Scripts for NVidia Jetson Nano device
Language: Python - Size: 310 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 2
tspannhw/retail-dynamic-shelf-pricing
Retail - Dynamic Shelf Pricing
Language: Python - Size: 126 KB - Last synced at: 8 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1
tspannhw/dws-coldsupplychain-hyperledger
Apache NiFi and Hyperledger Fabric for Cold Supply Chain Logistics
Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0
kongyew/greenplum-dockers
Create Greenplum docker files
Language: Python - Size: 32.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 4