An open API service providing repository metadata for many open source software ecosystems.

Topic: "bigdataanalytics"

open-metadata/openmetadata-site

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Language: TypeScript - Size: 54.6 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 14 - Forks: 11

shreyamalogi/Retail-Pipeline

Retail insights at cloud scale — 5M+ records analyzed with PySpark on GCP

Language: Python - Size: 12.7 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 12 - Forks: 0

Amey-Thakur/OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-K-MEANS-CLUSTERING

Big Data Analytics [BDA] Mini Project

Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 1

WaniaKhance/Smart-Glove-Sign-Language-Translator

In this project, we developed 'Smart Glove', a sign language translator. It consists of both hardware and software modules for translation and prediction analysis. We perform real-time translation of hand gestures into speech and text. We used several machine learning models for prediction analysis, including KNN, SVM, Decision Tree, Naive Bayes, Random Forest and Logistic Regression.

Language: Python - Size: 188 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

Amey-Thakur/BIG-DATA-ANALYTICS-AND-COMPUTATIONAL-LAB-I

CSDLO7032: Big Data Analytics & CSL704: Computational Lab - I <Semester VII>

Language: Jupyter Notebook - Size: 183 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

Amey-Thakur/HADOOP

HADOOP

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

WaniaKhance/Real-Time_Data_Center_Energy_Management

This project is master's thesis research conducted at ENEA Portici Research Center, Italy. The data is obtained from the HPC CRESCO6 cluster at ENEA Portici Research Center. The aim is to identify energy-consuming areas within the data center. A real-time dataset from ENEA Portici Research Center is used. Several techniques are implemented, including big data analytics and AI technology.

Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

rapticore/aws-onboarding

AWS Cloud Account Onboarding for Rapticore - www.rapticore.com

Size: 201 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 2

brittojo7n/BigDataProject-StockMarketTrendAnalysisAndPrediction

A comprehensive project leveraging big data techniques for stock market prediction and analysis. This repository includes data collection, processing, and visualization tools, alongside machine learning models for predicting stock prices and analyzing market trends. Ideal for financial analytics and investment strategies.

Language: Jupyter Notebook - Size: 57.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

meetpatel0963/KMeans-Spark

KMeans Clustering using Spark on Uber's ride share data - Case Study (Big Data Analytics @Uber)

Language: Python - Size: 10.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

ShrishtiHore/Target_Brazil_Sales_Analysis

Target is one of the world’s most recognized brands and one of America’s leading retailers. This business case has information on 100k orders from 2016 to 2018 made at Target in Brazil. Its features allow viewing an order from multiple dimensions.

Language: Jupyter Notebook - Size: 29.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

divithraju/divith-raju-OpenMetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Language: Python - Size: 68.4 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

MaulikBhalani/US-Airline-Delay-Analysis-And-Prediction

Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

kunalk3/Prep-And-Practice

Your go-to repository for interview preparation, use-case, research and practice where you’ll find curated notes, sample QA and code strategies in notebooks, text and pdf to help you navigate next level technical stacks. | BigData | GenAI | AI-ML | Java | Docker | DB | Cloud | Data Science | SQL | CPP

Language: HTML - Size: 626 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

AbhishekPardhi/steam-analytics

Unraveling Game Market Dynamics

Language: Jupyter Notebook - Size: 7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

cc59chong/Big-Data-Fundamentals-with-PySpark

Language: Jupyter Notebook - Size: 7.23 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

AliArain87/DataScienceProjects

Hey! Hope you are all doing well. Here you will find amazing data science projects that will boost your knowledge and level of interest.

Language: Jupyter Notebook - Size: 36.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 41

andrefilipefmsilva/airbnb_mongodb_learning

Learning MongoDB

Language: Jupyter Notebook - Size: 34.6 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

JuanParias29/BigDataProcessing

Repository with data processing projects and labs using Databricks, Apache Spark and Python. Covers key Big Data concepts, storage, processing, analysis and machine learning.

Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

withrvr/bda_mini_project

IPL data analysis

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Pongpang-2102/Big-Data-Analytics_KDAI-Project

This repository stores projects and assignments from the Big Data Analytics course in the KDAI curriculum.

Size: 4.59 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

npdao1992/BigdataAnalysis_CounterfeitCreditCards-Kafka_SparkStreaming

Analyze fraudulent credit card transactions using Kafka, Spark Streaming, and Random Forest Classifier algorithms in PySpark

Language: Python - Size: 24.9 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

eudavidreis-odev/python_data_manage_example

Explore Python for data analysis with this repository. Develop a linear regression model to predict dengue cases and use machine learning to classify data from the Iris dataset.

Language: Python - Size: 1.95 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BobErgot/Large-Scale-Data-Processing-Design-Patterns

Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.

Language: Java - Size: 37 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

anon1303/DataWarehouseAnalyzer

Language: Python - Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nardyjh/Home_Sales

Spark Home Sales Analysis utilizes Apache Spark to explore and analyze home sales data, providing insights into average prices based on various criteria. The project employs Spark SQL queries for efficient data processing and is designed for easy setup and usage.

Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

d-rushma/MovieRecommendationSystem

Project related to big datasets with machine learning

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

RushikeshShinde14/Decoding-Hotel-Reservation-Patterns-Data-Analysis

The aim of the project is to gain insights into customers' booking behaviors, preferences, and decision-making processes when reserving hotel accommodations. The data analysis process involves cleaning and organizing the hotel reservation data and generating visualizations and reports to present the findings.

Language: Python - Size: 11.2 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vmtamburro/Pharma-Sales-Analysis

This analysis was performed as a final project for the Rutgers MSBA course "Big Data Analytics". It consists of a data analysis and machine learning models built on open source research data collected by researcher Milanz Dravkovic from a single pharmacy's point-of-sale system.

Language: Jupyter Notebook - Size: 4.22 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Mukta-glitch/BrainTumor

Brain tumor detection and classification, a website for hospital management taking real-time data from users, and Tableau visualizations of reports

Language: Jupyter Notebook - Size: 86 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

KidusMT/BDT-TwitterDataAnalysis

Big Data Technology: Using Kafka, Zookeeper, Apache Spark, Apache Hive, Apache Hadoop to process Twitter Data

Language: Java - Size: 1.39 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

t0re199/BDMNGT_PROJECT

Big Data Analysis on a Covid-19 Dataset

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

nachogtan/Worldwide-Petrol-Gas-prices

A quick look at Petrol/gas prices and distribution in 181 different countries.

Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ft-abhx/Retail-Sales-Analytics-Using-Apache-Spark

The project analyzes a retail dataset using Apache Spark.

Language: Jupyter Notebook - Size: 7.38 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mdsenelen/Credit-Card-Freud-Detection

Detection of credit card fraud and big data analysis of transactions. (CSV files were too large to upload to GitHub.)

Language: Jupyter Notebook - Size: 2.28 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

stephperillo/Amazon_Vine_Analysis

An analysis of Amazon reviews to determine whether there is bias in the Vine review program.

Language: Jupyter Notebook - Size: 2.61 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

apophis-web/Page-Rank-Algorithm

Implementation of the Page Rank Algorithm in Python

Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

radityohanif/Pemberdayaan-Masyarakat-Kalimantan-Barat

Midterm exam (UTS) for the Big Data Practicum course

Language: Jupyter Notebook - Size: 636 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

JakobLS/billion_taxi_trips

Predicting the Fare on a Billion Taxi Trips with BigQuery. How long does it take, and how much does it cost, to analyse and train a model on a billion taxi trips in the cloud?

Language: Jupyter Notebook - Size: 5.62 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

lucacorbucci/BigDataAnalytics

🎓 Implementation of all the milestones of the Big Data Analytics course @ UniPi Department of Computer Science

Language: Jupyter Notebook - Size: 47.9 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0