Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: big-data-analytics
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Language: Python - Size: 594 MB - Last synced: about 11 hours ago - Pushed: 7 days ago - Stars: 12,090 - Forks: 1,631
JosepSampe/lithops Fork of lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.6 MB - Last synced: about 7 hours ago - Pushed: about 17 hours ago - Stars: 2 - Forks: 0
artemi8/SST-forecast-ML
SST Forecasting System: A robust forecasting platform leveraging ERA5 reanalysis data and big data tools (Airflow, Spark, Cassandra, PostgreSQL) to predict Sea Surface Temperatures. Utilizes Facebook's Prophet and Vector Auto Regressive models for precise predictions, integrated with Tableau for real-time data visualization.
Language: HTML - Size: 4.1 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 0 - Forks: 0
Dare-marvel/Big-Data-Analytics--BDA--
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
Language: Java - Size: 146 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 1
bydevmar/Master_MASD_FPO
Ce dépôt GitHub regroupe tous les cours, TP, TD, projets, et exercices de ma formation en master en mathématiques appliquées pour la science des données. Parcourez-le pour une vue complète de mon parcours académique, offrant une perspective détaillée de mon apprentissage dans ce domaine.
Language: Jupyter Notebook - Size: 148 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 5 - Forks: 0
mikan-senpai/sales-analysis
Python , PySpark , Big-Data
Language: Jupyter Notebook - Size: 4.23 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 0
sh16ma/gitpress
TIL(=Today I learned.)
Size: 1.5 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1 - Forks: 1
noobpk/gemini-web-vulnerability-detection
Gemini-Web Vulnerability Detection (G-WVD) detecting web application vulnerabilities with deep learning
Language: Python - Size: 50.8 KB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 5 - Forks: 0
I2DSR/data-science-ipython-notebooks
Data science encompasses a wide range of areas, topics, and sub-domains such as Big Data, Machine & Deep learning (ETL, TensorFlow, Keras), Data Mining/Visualization (EDA), BI, Predictive Analytics, Statistical Analytics, etc.
Size: 5.86 KB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0
jamestiotio/dbsys
SUTD 2021 50.043 Database and Big Data Systems Code Dump
Language: Java - Size: 69.7 MB - Last synced: 13 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 3
trieu/leo-cdp-free-edition
The binary build of LEO CDP Free Edition for training purposes
Language: HTML - Size: 719 MB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 25 - Forks: 11
DrSnowbird/SANSA-RDF Fork of SANSA-Stack/SANSA-Stack
SANSA RDF Library
Language: Scala - Size: 1.14 MB - Last synced: 13 days ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0
archivesunleashed/aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Language: Scala - Size: 39.5 MB - Last synced: 3 days ago - Pushed: 3 months ago - Stars: 133 - Forks: 33
OwenOrcan/YiraBot-Crawler
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, offering command-line ease and Python integration. Ideal for research, SEO, and data collection.
Language: Python - Size: 207 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 13 - Forks: 0
alexsuakim/MachineLearning
Machine learning model implementations from scratch in Python
Language: Jupyter Notebook - Size: 10.4 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0
yaoguangluo/ChromosomeDNA
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
Language: Java - Size: 670 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 7 - Forks: 2
ninad-moree/TE-Lab-Work-Sem-6
This repository contains a collection of all the third-year lab work (SEM 6) for the Computer branch at (SPPU).
Language: Jupyter Notebook - Size: 24.3 MB - Last synced: 27 days ago - Pushed: 30 days ago - Stars: 1 - Forks: 0
v6d-io/v6d
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
Language: C++ - Size: 18.7 MB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 802 - Forks: 117
jaygala24/covid19-data-analysis
This repository contains data analysis on COVID-19 as a part of Big Data mini project
Language: Jupyter Notebook - Size: 32.2 KB - Last synced: 20 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1
ingef/conquery
Visual, interactive queries against big databases
Language: Java - Size: 47.9 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 35 - Forks: 12
jackkolokasis/teraheap
TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks
Size: 536 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 28 - Forks: 11
HarryZhangHH/Large-Scala-Data-Engineering
Analysis and visualization of comparison between air cargo and passenger flights (pyspark and scala)
Language: Jupyter Notebook - Size: 40.5 MB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
fosfrancesco/tweet-popularity
Predict the number of retweets that a tweet about a specific museum will have.
Language: HTML - Size: 521 KB - Last synced: 24 days ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
jowilf/big-data-showcase
This repository contains a project showcasing the use of Big Data technologies in processing and visualizing real-time data from an eCommerce electronics store using tools such as Apache Kafka, Spark Streaming, Spark SQL, HBase, and Plotly
Language: Java - Size: 2.7 MB - Last synced: 25 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
Ashish7129/Graph_Sampling
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Language: Python - Size: 4.91 MB - Last synced: 17 days ago - Pushed: over 3 years ago - Stars: 155 - Forks: 47
angeligareta/spark-flight-prediction
Assignment for Cloud Computing And Big Data Ecosystems Design subject that aims to predict flight arrival time using Apache Spark and Scala.
Language: Scala - Size: 54.8 MB - Last synced: 27 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 1
CirsteanPaul/pyspark-project
Big data management with PySpark
Language: Jupyter Notebook - Size: 251 KB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 0 - Forks: 0
mehrotrasan16/TwitterAnalyser_StormBoi
A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.
Language: Java - Size: 396 KB - Last synced: 28 days ago - Pushed: 6 months ago - Stars: 0 - Forks: 1
jaanli/american-community-survey
American Community Survey data on people and households
Language: Jupyter Notebook - Size: 142 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 14 - Forks: 1
MrXujiang/v6.dooring.public
可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Language: TypeScript - Size: 243 KB - Last synced: about 1 month ago - Pushed: almost 3 years ago - Stars: 440 - Forks: 92
rouyang2017/SISSO
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
Language: Fortran - Size: 2.12 MB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 212 - Forks: 71
whoami-anoint/EasyHadoop
Simplified Hadoop Setup and Configuration Automation
Language: Shell - Size: 12 MB - Last synced: 28 days ago - Pushed: 9 months ago - Stars: 2 - Forks: 0
metatron-app/metatron-discovery
Powerful & Easy way for big data discovery
Language: TypeScript - Size: 93.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 428 - Forks: 107
drshahizan/BDM
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
Language: Jupyter Notebook - Size: 102 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 48 - Forks: 46
GameTuner/query-engine
GameTuner Query Engine is responsible for executing the queries that are built from API requests.
Language: Python - Size: 104 KB - Last synced: 26 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
GameTuner/enrich-bad-sink
GameTuner Enrich Bad Sink loads data from bad events topic to BigQuery
Language: Python - Size: 7.81 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
GameTuner/bigquery-loader
GameTuner BigQuery Loader is application that loads enriched event to BigQuery
Language: Scala - Size: 122 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
GameTuner/collector
GameTuner Scala Stream Collector is project for collecting raw events from tracker
Language: Scala - Size: 73.2 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
Abdurrehman7452/search-engine-utilising-hadoop-MapReduce-technology-with-python-on-wikipedia-articles
Developing a Naive Search Engine Utilising Apache Hadoop MapReduce Technology on a dataset in comma-separated values (CSV) format containing around 5 million Wikipedia articles provided by Wikimedia, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
mgiridhar/PySpark-Projects
Toy projects to learn apache spark (py-spark) udemy course.
Language: Python - Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1
adityakamble49/loss-ratio-prediction
Predicting Loss Ratios for Auto Insurance Portfolios - ITCS 6100 Big Data Analytics for Competitive Advantage
Language: Jupyter Notebook - Size: 71.8 MB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 8 - Forks: 2
AdrianaMacc/Covid-19-BigData-Project
SARS-COV-2 genome analysis using Big Data algorithms in order to find clusters of similar mutations that belongs to different clades which mutate together and generate the correspondent clade.
Language: Jupyter Notebook - Size: 513 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
enggiqbal/uamap-web
UAMAP / KMAP web module (old version)
Language: JavaScript - Size: 78.4 MB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 1
GlulkAlex/turing-big-data-challenge
Turing Data Engineering Challenge
Language: Scala - Size: 138 KB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
Language: Jupyter Notebook - Size: 8.97 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 1,092 - Forks: 449
ICT-BDA/EasyML
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
Language: Java - Size: 14.9 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 1,964 - Forks: 441
dgkanatsios/GameAnalyticsEventHubFunctionsCosmosDatalake
Big data reference architecture and implementation for an online multiplayer game
Language: JavaScript - Size: 563 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 3 - Forks: 0
rapticore/ssvc_ore_miner
SSVC Ore Miner - www.rapticore.com
Language: Python - Size: 424 KB - Last synced: 16 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 1
AdityaMore7000/PICT-CE-SEM-6
Practicals of PICT SEM 6
Language: Jupyter Notebook - Size: 3.33 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
epidataio/epidata-community
EpiData IoT Data Science Platform - Community Edition
Language: Python - Size: 7.56 MB - Last synced: about 2 months ago - Pushed: 10 months ago - Stars: 8 - Forks: 7
codeincorp/falcon
Falcon: The world fastest data analytics engine
Language: C++ - Size: 188 KB - Last synced: about 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
dongsuo/vue-data-board
A Data Analysis Board in Vue.
Language: Vue - Size: 10.4 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 1,301 - Forks: 291
Amey-Thakur/BIG-DATA-ANALYTICS-AND-COMPUTATIONAL-LAB-I
CSDLO7032: Big Data Analytics & CSL704: Computational Lab - I <Semester VII>
Language: Jupyter Notebook - Size: 183 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 7 - Forks: 1
KayvanShah1/Big-Data-Specialization-Coursera
Repository for the Big Data Specialization from University of California San Diego on Coursera
Language: Python - Size: 20 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 5 - Forks: 0
Amey-Thakur/HADOOP
HADOOP
Language: Jupyter Notebook - Size: 12.7 KB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 7 - Forks: 1
Amey-Thakur/OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-K-MEANS-CLUSTERING
Big Data Analytics [BDA] Mini Project
Language: Jupyter Notebook - Size: 2.55 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 9 - Forks: 0
enars/Data-exploration-of-NPM
A Big Data Analytics project exploring the security, general trends and depedencies of all JavaScript packages on the NPM ecosystem
Language: Jupyter Notebook - Size: 277 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
aanujkhurana/BigData-Analysis
SocialMedia Big Data Analysis for Eminem (music artist), using RStudio and R lang
Language: R - Size: 17.6 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
Srking501/csc8101_coursework
A summative coursework for CSC8101 Engineering for AI
Language: Jupyter Notebook - Size: 168 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
mwrks/UAS-EDA
This repo intended to fulfill our big data subject group task
Language: Jupyter Notebook - Size: 9.26 MB - Last synced: 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
zhuoqunw/Big-data-analysis-on-movie-dataset
SI 618 final project (PySpark, SparkSQL, Hadoop, Altair)
Language: Jupyter Notebook - Size: 1.08 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
LimatanL/Kimia_farma_VIX
Project Based Internship Big Data Analyst at Kimia Farma
Size: 0 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
rishabhshirke10/BIG_DATA_ANALYSIS
Data Analysis
Language: Jupyter Notebook - Size: 29.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
Wittline/pyspark-on-aws-emr
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
Language: Python - Size: 3.61 MB - Last synced: 13 days ago - Pushed: almost 2 years ago - Stars: 24 - Forks: 13
starryjay/PSTAT135Final
This is my final project for PSTAT 135, Big Data Analytics, using PySpark to conduct county-wide voter turnout regression analysis by demographic. This project was done in collaboration with Tyler Kim and Erasmo Rivas. The GCP storage bucket linked below contains the full project, while the Jupyter notebook and exported PDF are included here.
Language: Jupyter Notebook - Size: 1.87 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
scalytics/SDE
Scalytics Connect development environment, pre-build
Language: Jupyter Notebook - Size: 34 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 22 - Forks: 8
anon1303/DataWarehouseAnalyzer
Language: Python - Size: 1000 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
RomanDataLab/Barcelona_Hightech_Clusters
Clustering advanced industries to facilitate technology transfer in the Barcelona metropolitan region
Language: Jupyter Notebook - Size: 57.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
RomanDataLab/Urbansprawl_of_Melbourne
Remote sensing project in Python/ QGIS/ Grasshopper. Analysis and prediction.
Language: Jupyter Notebook - Size: 9.72 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
ixgnoy/Visualize_movie_with_rating
By using Hadoop, visualization.
Size: 0 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
panstacks/pandata
The Pandata scalable open-source analysis stack
Size: 17.6 KB - Last synced: 3 months ago - Pushed: 7 months ago - Stars: 59 - Forks: 1
JSM03/Data-Science
Welcome to My Data Science Projects
Language: Jupyter Notebook - Size: 1.48 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
deeplaxmilambture/DataDynamo
Transitioning into Data field
Language: Python - Size: 21.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
asavinov/bistro
A general-purpose data analysis engine radically changing the way batch and stream data is processed
Language: Java - Size: 2.16 MB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 7 - Forks: 0
LimatanL/Project_based_kimia_farma
Dashboard Visualization
Size: 544 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
arakat-community/arakat 📦
ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform
Language: Python - Size: 31.6 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 26 - Forks: 21
IBM/ibmpairs
open source tools for interaction with IBM PAIRS:
Language: Jupyter Notebook - Size: 66.2 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 34 - Forks: 18
ShaharShc/BigDataCourse
Ben Gurion University "The Art of Analyzing Big Data - The Data Scientist’s Toolbox (372.2.5401)" course assignments & solutions
Language: Jupyter Notebook - Size: 39.4 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
SrinithiSaiprasath/Covid_Analysis_Using_Tableau 📦
This visualization dashboard made using Tableau enables stakeholders to grasp key pandemic metrics, understanding the nuanced dynamics of the virus across various dimensions.
Size: 509 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
benkeyben/10alytics_air_realtor
10alytics_air_realtor ,a dynamic repository, hosts an AWS-driven data pipeline. Utilizing Apache Airflow, AWS S3, and EC2, it performs efficient ETL operations, extracting comprehensive real estate data from the Realty Mole Property API via RapidAPI. This tool empowers real estate professionals with timely insights for strategic decision-making.
Language: Jupyter Notebook - Size: 656 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
shivatejapecheti/Twitter-Live-Feed-Analysis-and-Streaming-for-Movies
Bigdata Analysis Project
Language: Jupyter Notebook - Size: 162 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
jdvelasq/courses
Material de apoyo para cursos, Facultad de Minas, Universidad Nacional de Colombia
Language: Python - Size: 470 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 10 - Forks: 6
SkipToYourSoul/keep-hungry-stay-foolish
A Personal Work Notebook on Gitbook. 1)编程知识点总结;2)大数据场景下的用户数据解决方案实例
Language: HTML - Size: 5.44 MB - Last synced: 28 days ago - Pushed: almost 4 years ago - Stars: 9 - Forks: 0
Lakshmiec/Big-Data-Sentiment-Analysis-of-Amazon-Reviews-for-Seller-and-Brand-Empowerment
Language: Jupyter Notebook - Size: 1.53 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
Azure/AzureKusto
R interface to Azure Data Explorer, aka Kusto
Language: R - Size: 400 KB - Last synced: 15 days ago - Pushed: 7 months ago - Stars: 17 - Forks: 2
welingtonfonsec/Meu-Guia-SQL-Server
Compilação que vai do básico ao avançado de conhecimentos adquiridos em diversos cursos online sobre SQL Server.
Language: TSQL - Size: 1.01 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
madihaiqbal/suicide_data_analysis
Language: Jupyter Notebook - Size: 1.15 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
RJBarker/home_sales
Use PySpark and SparkSQL to execute SQL queries through a temporary view of the DataFrame created. Conduct additional queries on cached and partitioned data to determine runtime comparisons.
Language: Jupyter Notebook - Size: 146 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
nickenshidqia/Big_Data_Analytics_Kimia_Farma
Big Data Analytics Project gives challenges to create data mart design and dashboard on Kimia Farma
Size: 5.52 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Fili-ai/knn_cuda
KNN written in CUDA without any external library like CUBLAS or anything else
Language: Cuda - Size: 3.53 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
KOHENOORKEN/KGS-Global
All sub projects of KGS Global will be kept here
Language: Solidity - Size: 5.69 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Laetitia-Deken/Chicago_Taxi_Trips
Exploration of Chicago Taxi Trips - BigQuery Data with Python (January 2013 - October 2023)
Language: Jupyter Notebook - Size: 1.58 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
marcocolangelo/Big-Data-processing-and-Analytics
The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark
Language: Java - Size: 6.1 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
arxiver/Airbnb-EDA-and-Regression
Big data exploration and analysis on Airbnb dataset as well as regression model for price prediction of entities
Language: Jupyter Notebook - Size: 3.11 MB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1
IsaacMwendwa/Big-Data-with-PySpark
This repository contains the materials (code & theory) I compiled while undertaking DataCamp's Big Data with PySpark Learning Track
Size: 147 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
aeronaut2001/Car-Insurance-Cold-Calls-Data-Analysis
Car Insurance Cold Calls Data Analysis using Apache Hive
Language: HiveQL - Size: 1.17 MB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
aeronaut2001/Telecom-Data-Analysis
Telecom Data Analysis with Apache Hive
Language: HiveQL - Size: 345 KB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.3 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 289 - Forks: 91
tekdogan/iccbdc-21
Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"
Language: Python - Size: 111 KB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
harshh351998/Market-Basket-Items-Recommendation
This project provide the retailer with information to understand the purchase behaviour of a buyer and recommends products to user on their purchase history.
Language: Jupyter Notebook - Size: 1.11 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0