Topic: "bigdataanalytics"
open-metadata/openmetadata-site
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Language: TypeScript - Size: 54.6 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 14 - Forks: 11

shreyamalogi/Retail-Pipeline
Retail insights at cloud scale — 5M+ records analyzed with PySpark on GCP
Language: Python - Size: 12.7 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 12 - Forks: 0

Amey-Thakur/OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-K-MEANS-CLUSTERING
Big Data Analytics [BDA] Mini Project
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 1

WaniaKhance/Smart-Glove-Sign-Language-Translator
In this project, we have developed 'Smart Glove' - a sign language translator. It consists of both hardware and software modules for translation and prediction analysis. We perform real-time translation of hand gesture into speech and text. In this project, we have used different machine learning models for prediction analysis including KNN, SVM, Decision Tree, Naive Bayes, Random Forest and Logistic Regression.
Language: Python - Size: 188 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

Amey-Thakur/BIG-DATA-ANALYTICS-AND-COMPUTATIONAL-LAB-I
CSDLO7032: Big Data Analytics & CSL704: Computational Lab - I <Semester VII>
Language: Jupyter Notebook - Size: 183 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

Amey-Thakur/HADOOP
HADOOP
Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

WaniaKhance/Real-Time_Data_Center_Energy_Management
This project is Master thesis research conducted at ENEA Portici Research Center, Italy. The data is obtained from the HPC CRESCO6 cluster at ENEA Portici Research Center. The aim is to identify energy consuming areas within the data center. In this project, real-time dataset from ENEA Portici Research Center is used. There are several techniques implemented including big data analytics and AI technology.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

rapticore/aws-onboarding
AWS Cloud Account Onboarding for Rapticore - www.rapticore.com
Size: 201 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 2

brittojo7n/BigDataProject-StockMarketTrendAnalysisAndPrediction
A comprehensive project leveraging big data techniques for stock market prediction and analysis. This repository includes data collection, processing, and visualization tools, alongside machine learning models for predicting stock prices and analyzing market trends. Ideal for financial analytics and investment strategies.
Language: Jupyter Notebook - Size: 57.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

meetpatel0963/KMeans-Spark
KMeans Clustering using Spark on Uber's ride share data - Case Study (Big Data Analytics @Uber)
Language: Python - Size: 10.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

ShrishtiHore/Target_Brazil_Sales_Analysis
Target is one of the world’s most recognized brands and one of America’s leading retailers.This business case has information of 100k orders from 2016 to 2018 made at Target in Brazil. Its features allows viewing an order from multiple dimensions.
Language: Jupyter Notebook - Size: 29.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

divithraju/divith-raju-OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Language: Python - Size: 68.4 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

MaulikBhalani/US-Airline-Delay-Analysis-And-Prediction
Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

kunalk3/Prep-And-Practice
Your go-to repository for interview preparation, use-case, research and practice where you’ll find curated notes, sample QA and code strategies in notebooks, text and pdf to help you navigate next level technical stacks. | BigData | GenAI | AI-ML | Java | Docker | DB | Cloud | Data Science | SQL | CPP
Language: HTML - Size: 626 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

AbhishekPardhi/steam-analytics
Unraveling Game Market Dynamics
Language: Jupyter Notebook - Size: 7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

cc59chong/Big-Data-Fundamentals-with-PySpark
Language: Jupyter Notebook - Size: 7.23 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

AliArain87/DataScienceProjects
Hye.!!! Hope you all are doing well.. Here you will find amazing data science projects that will boost-up your knowledge and level of interest.
Language: Jupyter Notebook - Size: 36.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 41

andrefilipefmsilva/airbnb_mongodb_learning
Learning mongoDB
Language: Jupyter Notebook - Size: 34.6 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

JuanParias29/BigDataProcessing
Repositorio con proyectos y laboratorios de procesamiento de datos utilizando Databricks, Apache Spark y Python. Incluye conceptos clave de Big Data, almacenamiento, procesamiento, análisis y aprendizaje automático.
Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

withrvr/bda_mini_project
ipl data analysis
Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Pongpang-2102/Big-Data-Analytics_KDAI-Project
This Repository stores any Project & Assignment from Course : Big Data Analytics from KDAI Curriculum
Size: 4.59 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

npdao1992/BigdataAnalysis_CounterfeitCreditCards-Kafka_SparkStreaming
Analyze fraudulent credit card transactions using Kafka, Spark Streaming, and Random Forest Classifier algorithms in PySpark
Language: Python - Size: 24.9 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

eudavidreis-odev/python_data_manage_example
Explore Python para análise de dados com este repositório. Desenvolva um modelo de regressão linear para prever casos de dengue e utilize aprendizado de máquina para classificar dados do conjunto Iris.
Language: Python - Size: 1.95 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BobErgot/Large-Scale-Data-Processing-Design-Patterns
Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.
Language: Java - Size: 37 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

anon1303/DataWarehouseAnalyzer
Language: Python - Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nardyjh/Home_Sales
Spark Home Sales Analysis utilizes Apache Spark to explore and analyze home sales data, providing insights into average prices based on various criteria. The project employs Spark SQL queries for efficient data processing and is designed for easy setup and usage.
Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

d-rushma/MovieRecommendationSystem
Project Related to BigDataSets with MachineLearning
Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

RushikeshShinde14/Decoding-Hotel-Reservation-Patterns-Data-Analysis
The aim of the project is to gain insights into customers' booking behaviors, preferences, and decision-making processes when reserving hotel accommodations.The data analysis process involves cleaning and organizing the hotel reservation data and generating visualizations and reports to present the findings.
Language: Python - Size: 11.2 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vmtamburro/Pharma-Sales-Analysis
This analysis was pereformed as a final project for Rutgers MSBA Course "Big Data Analytics". It consists of a data analysis and generated machine learning models based on open source research data collected by researcher Milanz Dravkovic from a single pharmacy's point-of-sales system.
Language: Jupyter Notebook - Size: 4.22 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Mukta-glitch/BrainTumor
Brain Tumor detection and classification , Website for Hospital Management taking real time data from users , Tableau Visualizations of reports
Language: Jupyter Notebook - Size: 86 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

KidusMT/BDT-TwitterDataAnalysis
Big Data Technology: Using Kafka, Zookeeper, Apache Spark, Apache Hive, Apache Hadoop to process Twitter Data
Language: Java - Size: 1.39 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

t0re199/BDMNGT_PROJECT
Big Data Analysis on a Covid-19 Dataset
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

nachogtan/Worldwide-Petrol-Gas-prices
A quick look at Petrol/gas prices and distribution in 181 different countries.
Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ft-abhx/Retail-Sales-Analytics-Using-Apache-Spark
The project deals with analyzing Retail-Dataset using Apache Spark.
Language: Jupyter Notebook - Size: 7.38 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mdsenelen/Credit-Card-Freud-Detection
Detection of credit card freuds and big data analysis of transactions. (csv files were too large to upload on github.)
Language: Jupyter Notebook - Size: 2.28 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

stephperillo/Amazon_Vine_Analysis
An analysis of Amazon reviews to determine if there is bias amongst the Vine review program.
Language: Jupyter Notebook - Size: 2.61 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

apophis-web/Page-Rank-Algorithm
Implementation of the Page Rank Algorithm in Python
Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

radityohanif/Pemberdayaan-Masyarakat-Kalimantan-Barat
UTS Mata Kuliah Praktikum Big Data
Language: Jupyter Notebook - Size: 636 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

JakobLS/billion_taxi_trips
Predicting the Fare on a Billion Taxi Trips with BigQuery. How long time does it take and how much does it cost to analyse and train a model on a billion taxi trips in the cloud?
Language: Jupyter Notebook - Size: 5.62 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

lucacorbucci/BigDataAnalytics
🎓 Implementation of all the milestones of the Big Data Analytics course @ UniPi Department of Computer Science
Language: Jupyter Notebook - Size: 47.9 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
