Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: big-data-analytics

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Language: Python - Size: 594 MB - Last synced: about 11 hours ago - Pushed: 7 days ago - Stars: 12,090 - Forks: 1,631

JosepSampe/lithops Fork of lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

Language: Python - Size: 12.6 MB - Last synced: about 7 hours ago - Pushed: about 17 hours ago - Stars: 2 - Forks: 0

artemi8/SST-forecast-ML

SST Forecasting System: A robust forecasting platform leveraging ERA5 reanalysis data and big data tools (Airflow, Spark, Cassandra, PostgreSQL) to predict Sea Surface Temperatures. Utilizes Facebook's Prophet and Vector Auto Regressive models for precise predictions, integrated with Tableau for real-time data visualization.

Language: HTML - Size: 4.1 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 0 - Forks: 0

Dare-marvel/Big-Data-Analytics--BDA--

💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍

Language: Java - Size: 146 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 1

bydevmar/Master_MASD_FPO

This GitHub repository gathers all the courses, practical work (TP), tutorials (TD), projects, and exercises from my master's program in applied mathematics for data science. Browse it for a complete view of my academic path and a detailed perspective on what I learned in this field.

Language: Jupyter Notebook - Size: 148 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 5 - Forks: 0

mikan-senpai/sales-analysis

Python, PySpark, Big-Data

Language: Jupyter Notebook - Size: 4.23 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 0

sh16ma/gitpress

TIL (= Today I Learned)

Size: 1.5 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1 - Forks: 1

noobpk/gemini-web-vulnerability-detection

Gemini-Web Vulnerability Detection (G-WVD) detecting web application vulnerabilities with deep learning

Language: Python - Size: 50.8 KB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 5 - Forks: 0

I2DSR/data-science-ipython-notebooks

Data science encompasses a wide range of areas, topics, and sub-domains such as Big Data, Machine & Deep learning (ETL, TensorFlow, Keras), Data Mining/Visualization (EDA), BI, Predictive Analytics, Statistical Analytics, etc.

Size: 5.86 KB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0

jamestiotio/dbsys

SUTD 2021 50.043 Database and Big Data Systems Code Dump

Language: Java - Size: 69.7 MB - Last synced: 13 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 3

trieu/leo-cdp-free-edition

The binary build of LEO CDP Free Edition for training purposes

Language: HTML - Size: 719 MB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 25 - Forks: 11

DrSnowbird/SANSA-RDF Fork of SANSA-Stack/SANSA-Stack

SANSA RDF Library

Language: Scala - Size: 1.14 MB - Last synced: 13 days ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Language: Scala - Size: 39.5 MB - Last synced: 3 days ago - Pushed: 3 months ago - Stars: 133 - Forks: 33

OwenOrcan/YiraBot-Crawler

YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, offering command-line ease and Python integration. Ideal for research, SEO, and data collection.

Language: Python - Size: 207 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 13 - Forks: 0

alexsuakim/MachineLearning

Machine learning model implementations from scratch in Python

Language: Jupyter Notebook - Size: 10.4 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0

yaoguangluo/ChromosomeDNA

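"DNA Element-Base Catalysis and Peptide Computing": in evolutionary computation, optimizing software function files through PDE metabolism with DNA semantic element-base index encoding is an effective evolutionary approach.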

Language: Java - Size: 670 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 7 - Forks: 2

ninad-moree/TE-Lab-Work-Sem-6

This repository contains a collection of all the third-year lab work (SEM 6) for the Computer branch at SPPU.

Language: Jupyter Notebook - Size: 24.3 MB - Last synced: 27 days ago - Pushed: 30 days ago - Stars: 1 - Forks: 0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

Language: C++ - Size: 18.7 MB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 802 - Forks: 117

jaygala24/covid19-data-analysis

This repository contains data analysis on COVID-19 as part of a Big Data mini project

Language: Jupyter Notebook - Size: 32.2 KB - Last synced: 20 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

ingef/conquery

Visual, interactive queries against big databases

Language: Java - Size: 47.9 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 35 - Forks: 12

jackkolokasis/teraheap

TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks

Size: 536 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 28 - Forks: 11

HarryZhangHH/Large-Scala-Data-Engineering

Analysis and visualization of comparison between air cargo and passenger flights (pyspark and scala)

Language: Jupyter Notebook - Size: 40.5 MB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

fosfrancesco/tweet-popularity

Predict the number of retweets that a tweet about a specific museum will have.

Language: HTML - Size: 521 KB - Last synced: 24 days ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

jowilf/big-data-showcase

This repository contains a project showcasing the use of Big Data technologies in processing and visualizing real-time data from an eCommerce electronics store using tools such as Apache Kafka, Spark Streaming, Spark SQL, HBase, and Plotly

Language: Java - Size: 2.7 MB - Last synced: 25 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Ashish7129/Graph_Sampling

Graph Sampling is a Python package containing various approaches that sample the original graph according to different sample sizes.

Language: Python - Size: 4.91 MB - Last synced: 17 days ago - Pushed: over 3 years ago - Stars: 155 - Forks: 47

angeligareta/spark-flight-prediction

Assignment for Cloud Computing And Big Data Ecosystems Design subject that aims to predict flight arrival time using Apache Spark and Scala.

Language: Scala - Size: 54.8 MB - Last synced: 27 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 1

CirsteanPaul/pyspark-project

Big data management with PySpark

Language: Jupyter Notebook - Size: 251 KB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 0 - Forks: 0

mehrotrasan16/TwitterAnalyser_StormBoi

A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.

Language: Java - Size: 396 KB - Last synced: 28 days ago - Pushed: 6 months ago - Stars: 0 - Forks: 1

jaanli/american-community-survey

American Community Survey data on people and households

Language: Jupyter Notebook - Size: 142 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 14 - Forks: 1

MrXujiang/v6.dooring.public

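A large-screen data visualization solution that provides a visual editing engine, helping individuals and companies easily build their own large-screen visualization applications.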

Language: TypeScript - Size: 243 KB - Last synced: about 1 month ago - Pushed: almost 3 years ago - Stars: 440 - Forks: 92

rouyang2017/SISSO

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

Language: Fortran - Size: 2.12 MB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 212 - Forks: 71

whoami-anoint/EasyHadoop

Simplified Hadoop Setup and Configuration Automation

Language: Shell - Size: 12 MB - Last synced: 28 days ago - Pushed: 9 months ago - Stars: 2 - Forks: 0

metatron-app/metatron-discovery

Powerful & Easy way for big data discovery

Language: TypeScript - Size: 93.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 428 - Forks: 107

drshahizan/BDM

Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.

Language: Jupyter Notebook - Size: 102 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 48 - Forks: 46

GameTuner/query-engine

GameTuner Query Engine is responsible for executing the queries that are built from API requests.

Language: Python - Size: 104 KB - Last synced: 26 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

GameTuner/enrich-bad-sink

GameTuner Enrich Bad Sink loads data from the bad events topic into BigQuery

Language: Python - Size: 7.81 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

GameTuner/bigquery-loader

GameTuner BigQuery Loader is an application that loads enriched events into BigQuery

Language: Scala - Size: 122 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

GameTuner/collector

GameTuner Scala Stream Collector is a project for collecting raw events from the tracker

Language: Scala - Size: 73.2 KB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Abdurrehman7452/search-engine-utilising-hadoop-MapReduce-technology-with-python-on-wikipedia-articles

Developing a Naive Search Engine Utilising Apache Hadoop MapReduce Technology on a dataset in comma-separated values (CSV) format containing around 5 million Wikipedia articles provided by Wikimedia, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.

Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

mgiridhar/PySpark-Projects

Toy projects for learning Apache Spark (PySpark) from a Udemy course.

Language: Python - Size: 1.95 KB - Last synced: about 2 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1

adityakamble49/loss-ratio-prediction

Predicting Loss Ratios for Auto Insurance Portfolios - ITCS 6100 Big Data Analytics for Competitive Advantage

Language: Jupyter Notebook - Size: 71.8 MB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 8 - Forks: 2

AdrianaMacc/Covid-19-BigData-Project

SARS-CoV-2 genome analysis using Big Data algorithms to find clusters of similar mutations that belong to different clades, mutate together, and generate the corresponding clade.

Language: Jupyter Notebook - Size: 513 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

enggiqbal/uamap-web

UAMAP / KMAP web module (old version)

Language: JavaScript - Size: 78.4 MB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 1

GlulkAlex/turing-big-data-challenge

Turing Data Engineering Challenge

Language: Scala - Size: 138 KB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

Language: Jupyter Notebook - Size: 8.97 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 1,092 - Forks: 449

ICT-BDA/EasyML

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

Language: Java - Size: 14.9 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 1,964 - Forks: 441

dgkanatsios/GameAnalyticsEventHubFunctionsCosmosDatalake

Big data reference architecture and implementation for an online multiplayer game

Language: JavaScript - Size: 563 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 3 - Forks: 0

rapticore/ssvc_ore_miner

SSVC Ore Miner - www.rapticore.com

Language: Python - Size: 424 KB - Last synced: 16 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 1

AdityaMore7000/PICT-CE-SEM-6

Practicals of PICT SEM 6

Language: Jupyter Notebook - Size: 3.33 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

epidataio/epidata-community

EpiData IoT Data Science Platform - Community Edition

Language: Python - Size: 7.56 MB - Last synced: about 2 months ago - Pushed: 10 months ago - Stars: 8 - Forks: 7

codeincorp/falcon

Falcon: The world's fastest data analytics engine

Language: C++ - Size: 188 KB - Last synced: about 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

dongsuo/vue-data-board

A Data Analysis Board in Vue.

Language: Vue - Size: 10.4 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 1,301 - Forks: 291

Amey-Thakur/BIG-DATA-ANALYTICS-AND-COMPUTATIONAL-LAB-I

CSDLO7032: Big Data Analytics & CSL704: Computational Lab - I <Semester VII>

Language: Jupyter Notebook - Size: 183 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 7 - Forks: 1

KayvanShah1/Big-Data-Specialization-Coursera

Repository for the Big Data Specialization from University of California San Diego on Coursera

Language: Python - Size: 20 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 5 - Forks: 0

Amey-Thakur/HADOOP

HADOOP

Language: Jupyter Notebook - Size: 12.7 KB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 7 - Forks: 1

Amey-Thakur/OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-K-MEANS-CLUSTERING

Big Data Analytics [BDA] Mini Project

Language: Jupyter Notebook - Size: 2.55 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 9 - Forks: 0

enars/Data-exploration-of-NPM

A Big Data Analytics project exploring the security, general trends, and dependencies of all JavaScript packages in the NPM ecosystem

Language: Jupyter Notebook - Size: 277 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

aanujkhurana/BigData-Analysis

Social media Big Data analysis for Eminem (music artist), using RStudio and R

Language: R - Size: 17.6 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

Srking501/csc8101_coursework

A summative coursework for CSC8101 Engineering for AI

Language: Jupyter Notebook - Size: 168 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

mwrks/UAS-EDA

This repo is intended to fulfill our group task for the big data subject

Language: Jupyter Notebook - Size: 9.26 MB - Last synced: 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

zhuoqunw/Big-data-analysis-on-movie-dataset

SI 618 final project (PySpark, SparkSQL, Hadoop, Altair)

Language: Jupyter Notebook - Size: 1.08 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

LimatanL/Kimia_farma_VIX

Project Based Internship Big Data Analyst at Kimia Farma

Size: 0 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

rishabhshirke10/BIG_DATA_ANALYSIS

Data Analysis

Language: Jupyter Notebook - Size: 29.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Wittline/pyspark-on-aws-emr

The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.

Language: Python - Size: 3.61 MB - Last synced: 13 days ago - Pushed: almost 2 years ago - Stars: 24 - Forks: 13

starryjay/PSTAT135Final

This is my final project for PSTAT 135, Big Data Analytics, using PySpark to conduct county-wide voter turnout regression analysis by demographic. This project was done in collaboration with Tyler Kim and Erasmo Rivas. The GCP storage bucket linked below contains the full project, while the Jupyter notebook and exported PDF are included here.

Language: Jupyter Notebook - Size: 1.87 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

scalytics/SDE

Scalytics Connect development environment, pre-built

Language: Jupyter Notebook - Size: 34 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 22 - Forks: 8

anon1303/DataWarehouseAnalyzer

Language: Python - Size: 1000 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

RomanDataLab/Barcelona_Hightech_Clusters

Clustering advanced industries to facilitate technology transfer in the Barcelona metropolitan region

Language: Jupyter Notebook - Size: 57.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

RomanDataLab/Urbansprawl_of_Melbourne

Remote sensing project in Python/ QGIS/ Grasshopper. Analysis and prediction.

Language: Jupyter Notebook - Size: 9.72 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

ixgnoy/Visualize_movie_with_rating

By using Hadoop, visualization.

Size: 0 Bytes - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

panstacks/pandata

The Pandata scalable open-source analysis stack

Size: 17.6 KB - Last synced: 3 months ago - Pushed: 7 months ago - Stars: 59 - Forks: 1

JSM03/Data-Science

Welcome to My Data Science Projects

Language: Jupyter Notebook - Size: 1.48 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

deeplaxmilambture/DataDynamo

Transitioning into Data field

Language: Python - Size: 21.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

asavinov/bistro

A general-purpose data analysis engine radically changing the way batch and stream data is processed

Language: Java - Size: 2.16 MB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 7 - Forks: 0

LimatanL/Project_based_kimia_farma

Dashboard Visualization

Size: 544 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

arakat-community/arakat 📦

ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform

Language: Python - Size: 31.6 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 26 - Forks: 21

IBM/ibmpairs

Open source tools for interaction with IBM PAIRS

Language: Jupyter Notebook - Size: 66.2 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 34 - Forks: 18

ShaharShc/BigDataCourse

Ben Gurion University "The Art of Analyzing Big Data - The Data Scientist’s Toolbox (372.2.5401)" course assignments & solutions

Language: Jupyter Notebook - Size: 39.4 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

SrinithiSaiprasath/Covid_Analysis_Using_Tableau 📦

This visualization dashboard made using Tableau enables stakeholders to grasp key pandemic metrics, understanding the nuanced dynamics of the virus across various dimensions.

Size: 509 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

benkeyben/10alytics_air_realtor

10alytics_air_realtor, a dynamic repository, hosts an AWS-driven data pipeline. Utilizing Apache Airflow, AWS S3, and EC2, it performs efficient ETL operations, extracting comprehensive real estate data from the Realty Mole Property API via RapidAPI. This tool empowers real estate professionals with timely insights for strategic decision-making.

Language: Jupyter Notebook - Size: 656 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

shivatejapecheti/Twitter-Live-Feed-Analysis-and-Streaming-for-Movies

Bigdata Analysis Project

Language: Jupyter Notebook - Size: 162 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

jdvelasq/courses

Support material for courses, Facultad de Minas, Universidad Nacional de Colombia

Language: Python - Size: 470 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 10 - Forks: 6

SkipToYourSoul/keep-hungry-stay-foolish

A personal work notebook on GitBook. 1) Summary of programming knowledge points; 2) Examples of user data solutions in big data scenarios

Language: HTML - Size: 5.44 MB - Last synced: 28 days ago - Pushed: almost 4 years ago - Stars: 9 - Forks: 0

Lakshmiec/Big-Data-Sentiment-Analysis-of-Amazon-Reviews-for-Seller-and-Brand-Empowerment

Language: Jupyter Notebook - Size: 1.53 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

Azure/AzureKusto

R interface to Azure Data Explorer, aka Kusto

Language: R - Size: 400 KB - Last synced: 15 days ago - Pushed: 7 months ago - Stars: 17 - Forks: 2

welingtonfonsec/Meu-Guia-SQL-Server

A compilation, ranging from basic to advanced, of knowledge acquired in various online courses on SQL Server.

Language: TSQL - Size: 1.01 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

madihaiqbal/suicide_data_analysis

Language: Jupyter Notebook - Size: 1.15 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

RJBarker/home_sales

Use PySpark and SparkSQL to execute SQL queries through a temporary view of the DataFrame created. Conduct additional queries on cached and partitioned data to determine runtime comparisons.

Language: Jupyter Notebook - Size: 146 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

nickenshidqia/Big_Data_Analytics_Kimia_Farma

A Big Data Analytics project that poses the challenge of creating a data mart design and dashboard for Kimia Farma

Size: 5.52 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Fili-ai/knn_cuda

KNN written in CUDA without any external library like CUBLAS or anything else

Language: Cuda - Size: 3.53 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

KOHENOORKEN/KGS-Global

All sub projects of KGS Global will be kept here

Language: Solidity - Size: 5.69 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Laetitia-Deken/Chicago_Taxi_Trips

Exploration of Chicago Taxi Trips - BigQuery Data with Python (January 2013 - October 2023)

Language: Jupyter Notebook - Size: 1.58 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

marcocolangelo/Big-Data-processing-and-Analytics

The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark

Language: Java - Size: 6.1 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

arxiver/Airbnb-EDA-and-Regression

Big data exploration and analysis of an Airbnb dataset, as well as a regression model for price prediction of entities

Language: Jupyter Notebook - Size: 3.11 MB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1

IsaacMwendwa/Big-Data-with-PySpark

This repository contains the materials (code & theory) I compiled while undertaking DataCamp's Big Data with PySpark Learning Track

Size: 147 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

aeronaut2001/Car-Insurance-Cold-Calls-Data-Analysis

Car Insurance Cold Calls Data Analysis using Apache Hive

Language: HiveQL - Size: 1.17 MB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

aeronaut2001/Telecom-Data-Analysis

Telecom Data Analysis with Apache Hive

Language: HiveQL - Size: 345 KB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

Language: Python - Size: 12.3 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 289 - Forks: 91

tekdogan/iccbdc-21

Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"

Language: Python - Size: 111 KB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

harshh351998/Market-Basket-Items-Recommendation

This project provides the retailer with information to understand a buyer's purchase behaviour and recommends products to users based on their purchase history.

Language: Jupyter Notebook - Size: 1.11 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0