An open API service providing repository metadata for many open source software ecosystems.

gitlab.com topics: Apache Spark

arise-biodiversity/biocloud/docker-compose-biocloud-dta

Store and find Arise data: Delta Lake and PostgreSQL

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

edu-career/edu-apachespark

Educational repo for Apache Spark

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

dars1608/geographically-weighted-regression-in-apache-spark

Implementation of Geographically Weighted Regression (GWR) using Apache Spark, Spark ML and Apache Sedona.

Last synced at: 10 months ago - Stars: 0 - Forks: 0

cecilegltslmcs/electricity_consumption_production

This aim of this project is to collect informations related to energy consumption and production in France. This collection is realized by using Apache Kafka, the data are processed by Apache Spark and they are storaged in a NoSQL Database : MongoDB.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

cecilegltslmcs/twitter_sentimentanalysis

The aim of this project is to collect tweets by using the Twitter API and display the results of a sentiment analysis on a dashboard.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

progxaker/sparkplugin

The "Stage Metrics" plugin for Apache Spark to creating metrics by stage status

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

saeideh_ab/spark-test

sentiment analysis using spark ml library. implemented classic ml models: SVM, Logistic Regression, Naive Bayes and Random Forest. implemented embedding: Word2Vec and TF-IDF. also ensemble and hybrid (ml and lexicon based) methods were implemented

Last synced at: over 2 years ago - Stars: 0 - Forks: 1

zero323/pyspark-asyncactions

Mirror of https://github.com/zero323/pyspark-asyncactions

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

leliac/ganymede

Execute Hadoop and Spark applications on the BigData@Polito cluster with a single command

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

siddie/stackexchange-dump-spark-research-tools

Stack Exchange releases "data dumps" of all its publicly available content roughly every three months via archive.org. This project is an example and a framework for building ETL for this data with Apache Spark and Java.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0