GitHub topics: big-data-analytics
LatiefDataVisionary/big-data-and-data-analytics-college-task
Language: Jupyter Notebook - Size: 63.2 MB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 0 - Forks: 0

jdvelasq/courses
Material de apoyo para cursos, Facultad de Minas, Universidad Nacional de Colombia
Language: Python - Size: 470 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 16 - Forks: 7

JKA098/Pokemon-Feistiness-MapReduce-Job
This Project aims to implement a **Hadoop MapReduce job in Pseudo-Distributed Mode** to determine the **feistiest Pokémon** based on their **type**. The job processes the Pokémon dataset (`pokemon.csv`) and outputs a CSV file containing Pokémon **type1, type2, name, and feistiness score**.
Language: Python - Size: 220 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

JKA098/CSTADS-2021-22-Substance-Use-Analysis
The **Canadian Student Tobacco, Alcohol and Drugs Survey (CSTADS)** 2021–22 dataset is analyzed to explore: * Provincial variation in youth **cannabis**, **alcohol**, and **tobacco** use * The impact of **cannabis legalization** * Access networks for each substance * Regional policy implications using **geospatial** and **network** analysis
Size: 4.37 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

MrXujiang/v6.dooring.public
可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Language: TypeScript - Size: 36 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 651 - Forks: 146

SepidehHayati/Academic-Projects-and-Assignments
Reports, assignments, and projects completed at the University of Pavia.
Language: Jupyter Notebook - Size: 26.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ingef/conquery
Visual, interactive queries against big databases
Language: Java - Size: 48.7 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 37 - Forks: 13

adwaiy2912/BDA-Lab
Repository contains weekly lab work and assignments for the Big Data Analytics (BDA) course
Language: Python - Size: 7.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Lara33779/Alibaba-Cloud-Useful-Resources
This repository shares useful resources, updates, and tips to help you navigate the world of cloud computing with Alibaba Cloud.
Size: 416 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Language: Python - Size: 840 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,884 - Forks: 1,711

vinay-ram1999/data-engineer-playground
Language: TypeScript - Size: 9.4 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

prof-rjimenez/cit_bigdata_basico
Repositorio para las clases de laboratorio del curso básico de introducción a Big Data.
Language: Python - Size: 97.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 4

LatiefDataVisionary/big-data-for-data-science-college-task
Language: Mermaid - Size: 3.73 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

y0nil/kusto.blog
A technical blog about Kusto
Language: HTML - Size: 2.78 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 11 - Forks: 2

MrHAM17/Spotify_Streaming_Analytics
This is my Sem 7 BDA Lab Project. For complete details, kindly check the below README File.
Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

exajobs/data-engineering-collection
A collection of awesome software, libraries, Learning Tutorials, documents, books, resources and interesting stuff about Big Data Science & Engineering
Size: 241 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

rouyang2017/SISSO
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
Language: Fortran - Size: 3.88 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 276 - Forks: 85

K-G-PRAJWAL/Big-Data-Engineering
Language: PLpgSQL - Size: 254 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 22 - Forks: 14

caioricciuti/ch-ui
Use CH-UI to work with your data from Click House self-hosted with a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platform for querying ClickHouse databases, executing queries, and visualizing metrics about your instance.
Language: TypeScript - Size: 24.1 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 346 - Forks: 26

trieu/leo-cdp-free-edition
The binary build of LEO CDP Free Edition for training purposes
Language: HTML - Size: 782 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 38 - Forks: 14

lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.9 MB - Last synced at: 29 days ago - Pushed at: 2 months ago - Stars: 329 - Forks: 111

v6d-io/v6d
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
Language: C++ - Size: 19.3 MB - Last synced at: 29 days ago - Pushed at: about 2 months ago - Stars: 878 - Forks: 124

yaoguangluo/ChromosomeDNA
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
Language: Java - Size: 676 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 2

Yash22222/Olympic-Games-Analytics-Using-Apache-Spark
The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.
Language: HTML - Size: 18.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Houssam-11/BigData-Architecture
Big Data system predicts pandemic risk (COVID-19) via data analysis, ML modeling, and real-time dashboard.
Language: Python - Size: 29 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
Language: Jupyter Notebook - Size: 8.96 MB - Last synced at: 27 days ago - Pushed at: 4 months ago - Stars: 1,217 - Forks: 475

ICT-BDA/EasyML
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
Language: Java - Size: 14.9 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 1,978 - Forks: 440

bydevmar/Master_MASD_FPO
Ce dépôt GitHub regroupe tous les cours, TP, TD, projets, et exercices de ma formation en master en mathématiques appliquées pour la science des données. Parcourez-le pour une vue complète de mon parcours académique, offrant une perspective détaillée de mon apprentissage dans ce domaine.
Language: Jupyter Notebook - Size: 155 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 0

bilgeswe/BigDataManagement
Building a Data Pipeline with Lakehouse Architecture on Microsoft Azure Platform
Language: TSQL - Size: 2.02 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

SrLozano/Tinder-Big-Data-Analysis
Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech
Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 5

FastTrack-Academy/adolescent-suicide-dashboard
An interactive data visualization and analytics tool designed to analyze risk factors, trends, and disparities in adolescent suicide rates. Using machine learning and open data, this dashboard helps policymakers, educators, and mental health professionals identify patterns and develop prevention strategies to support adolescent well-being. 🚀
Language: HTML - Size: 11.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

chandrahask535/Big-Data-Analysis-to-Identify-Adverse-Effects-of-Covid-19-Vaccines2.0
This project utilizes big data analytics, machine learning, and statistical methods to identify and classify adverse effects of COVID-19 vaccinations. By analyzing large datasets, it aims to uncover patterns and correlations, providing valuable insights into vaccine safety and efficacy.
Language: Python - Size: 5.71 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

ToheedAsghar/Smart-Meters-London-Analytics
This project analyzes the Smart Meters in London dataset, performing data preprocessing, EDA, and predictive modeling to forecast energy usage and identify optimization opportunities. It demonstrates my expertise in transforming raw data into actionable insights for improving energy efficiency using AI and real-world datasets.
Language: Jupyter Notebook - Size: 2.17 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rapticore/ssvc_ore_miner
SSVC Ore Miner - www.rapticore.com
Language: Python - Size: 433 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 1

archivesunleashed/aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Language: Scala - Size: 39.5 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 143 - Forks: 32

Wittline/pyspark-on-aws-emr
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
Language: Python - Size: 3.61 MB - Last synced at: 28 days ago - Pushed at: almost 3 years ago - Stars: 27 - Forks: 13

yuvrajsaraogi/Unemployment-Analysis-with-Python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
Language: Jupyter Notebook - Size: 244 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

OwenOrcan/YiraBot-Crawler
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, offering command-line ease and Python integration. Ideal for research, SEO, and data collection.
Language: Python - Size: 221 KB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 19 - Forks: 0

dongsuo/vue-data-board
A Data Analysis Board in Vue.
Language: Vue - Size: 10.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,326 - Forks: 292

JoseRuiz01/AirlineOn-TimePerformanceAnalysis
Airline on-time performance analysis using Spark Machine Learning libraries
Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SohelRana-aiub-Pro/Traffic-Forecasting-Graph-Neural-Networks-LSTM
https://docs.omniverse.nvidia.com/prod_install-guide/prod_install-guide/overview.html
Language: Jupyter Notebook - Size: 1.07 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

JuanParias29/BigDataProcessingProject
Este repositorio contiene un proyecto de análisis y procesamiento de datos a gran escala basado en la metodología CRISP-DM, enfocado en resolver preguntas de negocio dentro del ámbito educativo.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

FreeIPCC/FreeWorkPhone
企业手机,工作手机,商务手机,企业数据沉淀,销冠手机,定制版企业手机,智能手机。
Size: 191 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

deepaiimpactx/BARS
Language: Python - Size: 16.2 MB - Last synced at: 29 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

BasharatWali/Medicine_Rec_System
Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

khushi-sabarad/PsyliqIntenshipDataAnalysis
Big Data Analysis Internship. Diabetes Prediction, HR & Employee Data Analysis. Tools: SQL, Power BI and Excel
Size: 22.5 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JosepSampe/lithops Fork of lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

Advaitiyer/advaitiyer.github.io
Data Scientist's Portfolio covering the topics: Big Data Analytics, Information Visualization, Advanced Data Mining, Applied Data Analytics, Financial, and Marketing Analytics, Artificial Intelligence, and Deep Learning.
Language: HTML - Size: 53.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ellie991/Titanic-Dataset-Analysis
Big Data Analysis on Titanic Dataset
Language: R - Size: 190 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ellie991/Spark-Spotify-Analysys
SPOTIFY - Big Data Analysis w/ Spark
Language: Python - Size: 11 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

GMAP/DSPBench
a suite of benchmark applications for distributed data stream processing systems
Language: Java - Size: 250 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 28 - Forks: 3

Matt-J-Dong/Top-Towns-To-Take-Over-Tech
Which American cities are the best for tech jobs?
Language: Scala - Size: 12.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

Kaustubh-Indulkar/TE-IT-DSBDA-Assignmnets
This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.
Language: Jupyter Notebook - Size: 9.71 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lesleyzhao/Bus_Delays_Analysis
Bus Delays Analysis is a big data analytics project designed to do ETL and analyze bus delays using Scala, Apache Spark, and HDFS.
Language: Scala - Size: 12.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

msche81/2-Jedha_Fullstack
450h Data Scientist training - Collect and store large amounts of data - Build prediction models in Machine Learning and Deep Learning - Deploy your models in real conditions
Language: Jupyter Notebook - Size: 248 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

subhanjandas/RDBMS-to-GraphDB---Big-Data-Analytics-using-Neo4j
This project involves migration from a traditional RDBMS to Neo4j for big data analytics. Using graph database technology, various business-critical questions are addressed, including identifying the employees who sold Tofu, the products sold with Tofu, the total number of products, top 5 products by sales, and the category with the highest sales.
Language: JavaScript - Size: 668 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

dvarshith/yelp-business-analysis
Big Data analysis on Yelp reviews/businesses for Arizona. Using Hadoop, Spark, PySpark.
Language: Jupyter Notebook - Size: 686 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

tashi-2004/Apache-Hadoop-Spark-Hive-CyberAnalytics
This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.
Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

OuchenOussama/hespressence
Kappa Architecture Based Sentiment Analysis System for User Comments
Language: Python - Size: 10.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

mohamedsaleh1984/twitter-spark
Fetch data from Twitter and push it through Kafka to Spark then HDFS
Language: Java - Size: 7.82 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Dare-marvel/Big-Data-Analytics--BDA--
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
Language: Java - Size: 174 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

shivatejapecheti/Twitter-Live-Feed-Analysis-and-Streaming-for-Movies
Bigdata Analysis Project
Language: Jupyter Notebook - Size: 165 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

AbdullahKhurshid/ecommerce-marketing-analytics
Using Apache Spark for marketing analytics
Language: R - Size: 2.3 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

madhurimarawat/Big-Data-Analytics
This repository demonstrates big data processing, visualization, and machine learning using tools such as Hadoop, Spark, Kafka, and Python.
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

sxhixho/Preprocessing_Analysis
A project that demonstrates data storage, preprocessing, and analysis using tools like HDFS, Apache Pig, and Hive, executed in an Azure virtual machine environment. The project includes cleaning and aggregating a Spotify dataset and running Hive queries to extract meaningful insights.
Size: 4.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

madhurimarawat/Madhurima-Mindscape
This is a personal blog where I share a variety of content, including personal reflections, tech insights, project diaries, and creative photography. Explore different categories such as personal growth, tech insights, and project experiences.
Language: HTML - Size: 27.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

notnayan/WLV_HCK
You're welcome.
Language: Jupyter Notebook - Size: 99.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

wandersonlira/state-of-data-brazil-2023
Este repositório abriga o projeto acadêmico da disciplina de Tópicos de Big Data em Python. O projeto analisa os dados da pesquisa anual "State of Data Brazil", realizada pela comunidade Data Hackers em parceria com a Bain & Company.
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 3

Amey-Thakur/BIG-DATA-ANALYTICS-AND-COMPUTATIONAL-LAB-I
CSDLO7032: Big Data Analytics & CSL704: Computational Lab - I <Semester VII>
Language: Jupyter Notebook - Size: 183 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

metatron-app/metatron-discovery
Powerful & Easy way for big data discovery
Language: TypeScript - Size: 93.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 444 - Forks: 111

Radhikareddy-chintareddy/Big-Data-Analysis-NY-Weather-Air-Quality-2022
End-to-end workflow showcasing database setup, API development, and interactive data retrieval of large datasets. Includes integration and analysis of 2022 SURFACE HOURLY weather data (global, US, and NY) merged with NY air pollution data from the EPA to uncover actionable insights.
Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Radhikareddy-chintareddy/Big-Data-Insights-NYC-Taxi-Trips-2013-
A project showcasing memory-efficient big data processing using Python, focusing on scalable data handling to overcome memory constraints. Includes anomaly detection, efficient visualizations, and actionable insights from the 2013 NYC Taxi Trip dataset.
Language: Jupyter Notebook - Size: 2.49 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

haustcsa/SocialSituSecu
SocialSituSecu is a project exploring the social network security, computing and intelligence basd on social situational metadata, which is sponsored by National Natural Science Foundation of China Grant No.61972133, and Project of Leading Talents in Science and Technology Innovation for Thousands of People Plan in Henan Province Grant No.204200510021.
Language: Python - Size: 87.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 1

jaanli/american-community-survey
American Community Survey data on people and households
Language: Jupyter Notebook - Size: 142 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 19 - Forks: 1

sanketrs/implementation-of-modern-data-engineering-architecture-with-fabric_analytics
Building a next-generation hybrid data pipeline architecture that combines the power of Microsoft Fabric, Azure Cloud, and Power BI. This pipeline is engineered to tackle the challenges of real-time data ingestion, multi-layered processing, and analytics, delivering business-critical insights.
Language: Python - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

AAlkiyumi/Predicting-Hospital-Readmission-Risk
This project aims to create a predictive model that forecasts the likelihood of a patient being readmitted to the hospital within 30 days of discharge.
Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

tabletop-labs/tabletop
A curated selection of tools, libraries and services that help tame your dataflow to productively build ambitious, data driven & reactive applications on a streaming lakehouse
Language: Go - Size: 290 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

bibekbhatta/BusinessAnalytics
Anyone (including beginners) can use these resources to get started with accessing, cleaning, and analysing different kinds of data in Python. No installation required. No registration required.
Language: Jupyter Notebook - Size: 84.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

bryanfks-dev/Klempoken-Analysis
Analysis and forcasting model for Klempoken MSMEs
Language: Jupyter Notebook - Size: 6.19 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mehwishferoz/BDA-project
A Hadoop MapReduce project analyzing the Consumer Complaints dataset with five queries to extract insights like complaints by product, state, company, tags, and timely responses.
Language: Java - Size: 7.42 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Amey-Thakur/OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-K-MEANS-CLUSTERING
Big Data Analytics [BDA] Mini Project
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 1

waseemsalami/project-Big-Data-in-behavioral-science-
An exciting Big Data project done during a course I took at the Technion university
Language: HTML - Size: 31.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

madhurimarawat/Python-Projects
This repository contains the projects that I made in the Python programming language.
Language: Jupyter Notebook - Size: 17.6 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

MSUSAzureAccelerators/Workplace-Intelligence-Accelerator
The Workplace Intelligence Accelerator leverages machine learning and big data analytics to combine and transform data, allowing customer to easily identify factors that influence how people work in their organization.
Language: TSQL - Size: 22.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 3

chaaalistaa/Thelookecommerce---Project
Analysis "TheLook" eCommerce with highlight goals such as identifying sales trends, understanding customer behaviors, enhancing customer retention, and driving repeat purchases.
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Ashish7129/Graph_Sampling
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Language: Python - Size: 4.91 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 161 - Forks: 50

BhushanSagar/Telecom-Data-Analysis
Telecom Data Analysis with Apache Hive
Language: HiveQL - Size: 357 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Ren294/Covid-Data-Process
This project integrates real-time data processing and analytics using Apache NiFi, Kafka, Spark, Hive, and AWS services for comprehensive COVID-19 data insights.
Language: Shell - Size: 6.22 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 6 - Forks: 0

hatoonguls/Big-Data-Analytics
The repositary contains big data analytics projects using Apache Spark, SQL, and Machine Learning models.
Language: Python - Size: 197 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AAlkiyumi/Project-4-Big-Data-Analysis-with-PySpark-on-Weather-Data
In this project, I analyzed weather data from the NCEI Global Surface Summary of Day dataset using PySpark in Jupyter Notebook. Tasks included data cleaning, statistical analysis, and forecasting for temperature, wind speed, precipitation, and extreme weather events. The project also predicts future weather patterns for Cincinnati and Florida.
Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1

NathanVilbert/Kaiture-Agriculture-Business-Reports-with-Power-BI
The project "Kaiture-Agriculture-Business-Reports-with-Power-BI" focuses on utilizing Business Intelligence to optimize agricultural yield and productivity. By integrating Power BI for data analysis, this project provides comprehensive insights into crop production patterns, market trends, and key factors affecting yield.
Size: 7.8 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

adityakamble49/loss-ratio-prediction
Predicting Loss Ratios for Auto Insurance Portfolios - ITCS 6100 Big Data Analytics for Competitive Advantage
Language: Jupyter Notebook - Size: 71.8 MB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 2

saraasgari99/customer-big-data-analytics
In-depth analysis of customer behavior in e-commerce using big data analytics, visualization, and machine learning in Python (PCA, time-series, exploratory, sentiment, and predictive analysis)
Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

windi-wulandari/PBI_Kimia-Farma-x-Rakamin
A data-driven analytics project for Kimia Farma to evaluate business performance from 2020-2023 using BigQuery. Focused on transaction data, inventory, branch operations, and product insights. Results were visualized through an interactive dashboard to support strategic decisions and optimizations.
Size: 12.7 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

ayshikakap31/Strategic-Exposure-of-Participant-s-Data-for-Federated-Learning-based-Urban-Sensing-Applications
This project provides a computation and communication efficient approach for federated learning based urban sensing applications against inference attacks
Language: Jupyter Notebook - Size: 40.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sofiaheinmachado/sofiaheinmachado.github.io
Projects developed with R
Size: 2.93 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

seeratawan01/autocapture.js
Build your own analytics - A single library to grabs every click, touch, page-view, and fill — forever.
Language: TypeScript - Size: 554 KB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

nconnector/automotive-market-analysis-platform
Quantitative decision making in automotive industry 🚘📊
Language: Python - Size: 1.1 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

NiveditaSureshK/Air-Quality-Analysis-using-Big-Data-Analytics
Examined the air quality data of a few Indian states to uncover underlying principles or patterns that may give insight into the severity of the problem.
Language: HTML - Size: 15.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Ren294/Log-Analysis-Project
This project builds a scalable log analytics pipeline use Lambda architecture for real-time and batch processing of NASA server logs.
Language: Python - Size: 2.88 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 5 - Forks: 1
