An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: large-scale-machine-learning

EthicalML/awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Size: 2.39 MB - Last synced at: 2 days ago - Pushed at: 14 days ago - Stars: 18,585 - Forks: 2,365

openpsi-project/ReaLHF πŸ“¦

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Language: Python - Size: 8.75 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 299 - Forks: 19

akashsonowal/interview-prep

Crack SWE (ML) / DS MAANG Interviews

Language: Python - Size: 1.03 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 4 - Forks: 0

terrytangyuan/distributed-ml-patterns

Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo

Language: Python - Size: 7.2 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 437 - Forks: 40

natnew/Awesome-Data-Science

Carefully curated list of awesome data science resources.

Size: 3.44 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 137 - Forks: 22

swghosh/DeepFace

Keras implementation of the renowned publication "DeepFace: Closing the Gap to Human-Level Performance in Face Verification" by Taigman et al. Pre-trained weights on VGGFace2 dataset.

Language: Python - Size: 471 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 198 - Forks: 62

astorfi/Large-Scale-AI-Blueprint

A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.

Size: 7.56 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 3

shaheennabi/Machine-Learning-The-Untold-System-Design

🌌 πŸƒ Focused on overlooked system design principles crucial for large-scale AI applications. πŸŒΈπŸš€βœ¨ This repository explores architecture, infrastructure, and integration, helping developers build scalable, efficient AI systems for real-world demands. it highlights key innovations and their impact on AI system design. πŸŒŸπŸŽ‡πŸŒ πŸŒŒ

Size: 4.88 KB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

asigalov61/Perceiver-Music-Transformer

SOTA Google's Perceiver-AR Music Transformer Implementation and Model

Language: Python - Size: 968 MB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 93 - Forks: 9

knagrecha/hydra

Execution framework for multi-task model parallelism. Enables the training of arbitrarily large models with a single GPU, with linear speedups for multi-gpu multi-task execution.

Language: Python - Size: 98.4 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 3

swghosh/FaceNet

Keras implementation of the renowned publication "FaceNet: A Unified Embedding for Face Recognition and Clustering" by Schroff et al.

Language: Python - Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

anirudhb11/LEVER

Official Code Base for ICLR 2024 paper Enhancing Tail Performance in Extreme Classifiers by Label Variance Reduction

Language: Python - Size: 294 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

asigalov61/Euterpe

[DEPRECEATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs

Language: Python - Size: 626 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 2

ishugaepov/MLBD

Materials for "Machine Learning on Big Data" course

Language: Jupyter Notebook - Size: 88.6 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 60

asigalov61/GIGA-Piano

[DEPRECEATED] Piano Transformer model trained on 2.6GB of MIDI piano music

Language: Python - Size: 889 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 3

alexrenz/AdaPM

A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training

Language: C++ - Size: 2.41 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 8

khushnood/DeepLearningJavaFromScratch

This project is for developing a deep neural networks and its variant from scratch. No external libraries are used except for GPU operations.

Language: Java - Size: 49 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

RajeshThallam/fastertransformer-converter

This repository is a code sample to serve Large Language Models (LLM) on a Google Kubernetes Engine (GKE) cluster with GPUs running NVIDIA Triton Inference Server with FasterTransformer backend.

Language: Python - Size: 139 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

nilesh2797/zestxml

This is the official codebase for KDD 2021 paper Generalized Zero-Shot Extreme Multi-Label Learning

Language: C++ - Size: 7.12 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 15 - Forks: 2

Related Keywords
large-scale-machine-learning 19 machine-learning 7 deep-learning 5 distributed-systems 3 mlops 3 music-transformer 3 music-generation 3 midi 3 music-composition 3 music-ai 3 python 2 pytorch 2 tensorflow 2 music-ai-architectures 2 data-science 2 distributed-machine-learning 2 extreme-classification 2 face-recognition 2 large-scale 2 sota 2 piano-transformer 2 multi-instrumental 2 google 2 artificial-intelligence 2 awesome 2 awesome-list 2 responsible-ai 2 production-ml 2 machine-learning-operations 2 ml-operations 2 distributed-computing 2 llm 2 large-language-models 2 machine-learning-library 1 regression 1 model-parallel 1 systems-engineering 1 java 1 task-parallel 1 face-verification 1 facenet 1 googlenet 1 gpu 1 text-to-music 1 fastertransformer 1 gke 1 perceiver-ar 1 music-generation-deep-learning 1 googlecloudplatform 1 inference 1 triton-inference-server 1 generalized-zero-shot-learning 1 classification 1 parameter-server 1 parameter-management 1 distributed-training 1 distributed-ml 1 symbolic-music-data 1 symbolic-music 1 piano 1 music-composer 1 midi-music 1 spark 1 mapreduce 1 big-data 1 deep-neural-network 1 music 1 muse 1 euterpea 1 euterpe 1 information-retrieval 1 gpu-computing 1 distillation 1 triplet-loss 1 keras-tensorflow 1 cloud-computing 1 book 1 argo-workflows 1 argo 1 ml-engineering 1 jax 1 data-structures 1 algorithms 1 transformers 1 reinforcement-learning-from-human-feedback 1 reinforcement-learning 1 megatron-lm 1 llm-training 1 llm-framework 1 deepspeed 1 production-machine-learning 1 privacy-preserving-ml 1 privacy-preserving-machine-learning 1 privacy-preserving 1 ml-ops 1 large-scale-ml 1 interpretability 1 explainability 1 data-mining 1 system-design 1