gitlab.com topics: Apache Spark
arise-biodiversity/biocloud/docker-compose-biocloud-dta
Store and find Arise data: Delta Lake and PostgreSQL
Last synced at: about 2 years ago - Stars: 0 - Forks: 0
edu-career/edu-apachespark
Educational repo for Apache Spark
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

dars1608/geographically-weighted-regression-in-apache-spark
Implementation of Geographically Weighted Regression (GWR) using Apache Spark, Spark ML and Apache Sedona.
Last synced at: 10 months ago - Stars: 0 - Forks: 0
cecilegltslmcs/electricity_consumption_production
The aim of this project is to collect information on energy consumption and production in France. The data are collected with Apache Kafka, processed by Apache Spark, and stored in a NoSQL database: MongoDB.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

cecilegltslmcs/twitter_sentimentanalysis
The aim of this project is to collect tweets by using the Twitter API and display the results of a sentiment analysis on a dashboard.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

progxaker/sparkplugin
The "Stage Metrics" plugin for Apache Spark, which creates metrics by stage status
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

leo-plese/artificial-intelligence-machine-learning-deep-learning/machine-learning/apache-spark-python-framework-machine-learning-data-pipeline
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
saeideh_ab/spark-test
Sentiment analysis using the Spark ML library. Implements classic ML models: SVM, Logistic Regression, Naive Bayes and Random Forest. Implements text representations: Word2Vec and TF-IDF. Ensemble and hybrid (ML and lexicon-based) methods are also implemented.
Last synced at: over 2 years ago - Stars: 0 - Forks: 1
zero323/pyspark-asyncactions
Mirror of https://github.com/zero323/pyspark-asyncactions
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
leliac/ganymede
Execute Hadoop and Spark applications on the BigData@Polito cluster with a single command
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

siddie/stackexchange-dump-spark-research-tools
Stack Exchange releases "data dumps" of all its publicly available content roughly every three months via archive.org. This project is an example and a framework for building ETL for this data with Apache Spark and Java.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
