An open API service providing repository metadata for many open source software ecosystems.

Topic: "massive-datasets"

polardb/polardbx-sql

PolarDB-X is a cloud-native distributed SQL database designed for high concurrency, massive storage, and complex querying scenarios.

Language: Java - Size: 94.4 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 1,584 - Forks: 326

helmholtz-analytics/heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

Language: Python - Size: 21 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 219 - Forks: 53

polardb/polardbx

PolarDB-X is a cloud-native distributed SQL database designed for high concurrency, massive storage, and complex querying scenarios.

Language: Makefile - Size: 193 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 79 - Forks: 20

simkarwin/mimo_keras

TF-Package: Multiple-Input Multiple-Output Keras Data-Generator for massive and complex datasets

Language: Python - Size: 62.5 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

joshuaboud/gen-dataset

Command line tool to quickly generate a lot of files in a lot of directories

Language: C++ - Size: 267 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

FedericoBruzzone/anti-money-laundering

The project is based on the analysis of the "IBM Transactions for Anti Money Laundering" dataset published on Kaggle. The task is to implement a model which predicts whether or not a transaction is illicit, using the attribute "Is Laundering" as a label to be predicted.

Language: Jupyter Notebook - Size: 41.6 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

FedericoBruzzone/algorithms-for-massive-datasets

This repository contains a LaTeX file that generates a PDF document comprising comprehensive notes for the course "Algorithms for Massive Datasets"

Language: TeX - Size: 2.67 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 2

gmalik9/floating_point_data_compressor

gipa -- compression/decompression tool to package, compress, and encode massive archive files with floating-point data

Language: Python - Size: 31.3 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

manuparra/hadoop-statistics

Calculate statistical measures of one column in big data datasets with this simple Hadoop application

Language: Java - Size: 42 KB - Last synced at: 2 months ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 1

Sabaudian/AMD_Market_Basket_Analysis

Algorithms for Massive Datasets (AMD) -- Market-baskets analysis project

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SJ22032003/massive-data-streaming-nodejs

Stream, parse, manipulate, and transform extremely large data (from 1 GB to 1 TB) in Node.js at peak performance, without blocking the process, overflowing memory, or creating bottlenecks. The results are also shown in a UI with the help of web streams.

Language: JavaScript - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

nelsonstos/bulk-load-api-multivende

This project facilitates the efficient mass registration of products using RabbitMQ, managing loads exceeding 50,000 products.

Language: JavaScript - Size: 368 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

KolwaBrad/massivedataset

Training on the MASSIVE dataset by Amazon (English-US, German-DE, and Swahili-KE)

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 4

miguel-kjh/Machine-Translation

Language: Jupyter Notebook - Size: 715 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

INFJakZda/Processing-Massive-Data-Sets

University lab exercises on processing big data.

Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

pero5ar/FER.AVSP

Lab assignments for the Analysis of Massive Data Sets course @ FER, University of Zagreb

Language: C# - Size: 17.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

dhruv3/MRbasedFriendRecommender

MapReduce program that suggests new friends based on the count of mutual friends

Language: Java - Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0