An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: machine-learning-systems

inclusionAI/AReaL

Distributed RL System for LLM Reasoning

Language: Python - Size: 9.61 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,583 - Forks: 77

AIoT-MLSys-Lab/Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Size: 3.96 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,164 - Forks: 97

fla-org/flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Language: Python - Size: 4.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,456 - Forks: 175

mental2008/awesome-papers

Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).

Size: 15.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 105 - Forks: 4

AIoT-MLSys-Lab/Efficient-Diffusion-Model-Survey

[TMLR 2025] Efficient Diffusion Models: A Survey

Size: 320 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 60 - Forks: 3

1duo/awesome-ai-infrastructures

Infrastructures™ for Machine Learning Training/Inference in Production.

Size: 11.8 MB - Last synced at: 23 days ago - Pushed at: about 6 years ago - Stars: 416 - Forks: 74

matteo-spadaccia/Machine-Learning-Systems-Project-2025 Fork of ed-aisys/edin-mls-25-spring

ML Systems group project, completed during MSc degree at the University of Edinburgh

Language: Python - Size: 7.12 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

byungsoo-oh/ml-systems-papers

Curated collection of papers in machine learning systems

Size: 178 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 325 - Forks: 17

tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

a curated list of high-quality papers on resource-efficient LLMs 🌱

Size: 336 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 117 - Forks: 7

wepe/dive-into-ml-system

Dive into machine learning system, start from reinventing the wheel.

Language: C++ - Size: 1.52 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 231 - Forks: 28

Ziyang-Yu/Awesome-Resource-Efficient-LLM-Papers Fork of tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

a curated list of high-quality papers on resource-efficient LLMs 🌱

Size: 252 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

elinx/ugrad

A C++ implementation of the scalar-valued autograd engine micrograd

Language: C++ - Size: 2.49 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 23 - Forks: 2

fla-org/flash-bidirectional-linear-attention

Triton implement of bi-directional (non-causal) linear attention

Language: Python - Size: 78.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 40 - Forks: 1

patternex/awesome-ml-for-threat-detection

A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.

Size: 46.9 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 1

MLSys-Learner-Resources/Awesome-MLSys-Blogger

The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)

Language: HTML - Size: 940 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Moe-Zbeeb/TAI

TAI

Language: Python - Size: 23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pooyanjamshidi/mls

CSCE 585 - Machine Learning Systems

Language: TeX - Size: 921 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 40 - Forks: 8

ibragim-bad/machine-learning-design-primer

Learn how to design and implement effective Machine Learning systems from start to finish.

Size: 17.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 210 - Forks: 19

thaisaraujom/machine-learning

Projects and summaries for the Machine Learning [PPGEEC2318] course at UFRN, taught by Professor Ivanovitch Silva.

Size: 5.58 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Relaxed-System-Lab/COMP4901Y_Course_HKUST

Course Material for the UG Course COMP4901Y

Language: Python - Size: 26.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 0

meton-robean/Machine-Learning-System-Notes

搜集和机器学习系统相关的项目与研究,利用issues记录论文阅读的总结

Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

jared-ni/PRIMES-cs243-final

Price Incentive Model Efficiency System: 1.5-2x higher accuracy per training round than random/Oort, up to 30% decrease in convergence time.

Language: Python - Size: 22.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

withsmilo/When-ML-pipeline-meets-Hydra

:cyclone:

Language: Python - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 2

SymbioticLab/ModelKeeper

A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup

Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 5

SymbioticLab/Oort

Oort: Efficient Federated Learning via Guided Participant Selection

Language: Python - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 94 - Forks: 25

l1nkr/DL-Compiler-Navigation

Machine Learning Compiler Road Map

Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 2

csce585-mlsystems/project-athena

This is the course project for CSCE585: ML Systems. Students will build their machine learning systems based on the provided infrastructure --- Athena.

Language: Python - Size: 3.97 GB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 20

wi-pi/rethinking-image-scaling-attacks

[ICML 2022] Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems

Language: Python - Size: 5.97 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

sreenivasanramesh/MachineLearningSystems

Assignments for Data Intensive Systems for Machine Learning Coursework

Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

sagniknitr/torch-WiT

A tool to predict the efficacy of DNN optimizations

Language: Python - Size: 4.17 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

Related Keywords
machine-learning-systems 30 machine-learning 12 deep-learning 5 federated-learning 4 efficient-deep-learning 4 generative-ai 4 large-language-models 4 survey 4 llm 3 awesome-list 3 pytorch 2 tvm 2 adversarial-attacks 2 artificial-intelligence 2 autodiff 2 paper-notes 2 mlsys 2 cloud-computing 2 knowledge-distillation 1 llm-inference 1 llm-training 1 inference 1 end-to-end 1 high-performance-computing 1 cuda 1 nsdi 1 ml-pipeline 1 facebook-hydra 1 machine-learning-algorithms 1 course-materials 1 large-language-model 1 foundation-models 1 interview-questions 1 design-system 1 distributed-machine-learning 1 computer-systems 1 torch 1 performance-prediction 1 performance 1 gpu 1 dnn-optimization-study 1 deep-neural-networks 1 matrix-multiplication 1 graph-executor 1 glow 1 autodifferentiation 1 preprocessors 1 preprocessing-defenses 1 preprocessing 1 decision-based-attacks 1 black-box-attacks 1 adversarial-examples 1 adversarial-machine-learning 1 adversarial-example 1 adversarial-defense 1 deep-learning-framework 1 apache-spark 1 apache-mesos 1 apache-arrow 1 diffusion-models 1 systems-for-machine-learning 1 system-design 1 resource-scheduler 1 resource-management 1 research 1 remote-direct-memory-access 1 reading-notes 1 reading-list 1 paper-list 1 gpu-virtualization 1 natural-language-processing 1 rl 1 reinforcement-learning 1 llm-reasoning 1 threat-detection 1 papers 1 machine-learning-operations 1 cybersecurity 1 applied-machine-learning 1 triton-lang 1 computer-vision 1 tinygrad 1 micrograd 1 autograd 1 openmp 1 eigen 1 ctypes 1 resource-efficient 1 awesome-papers 1 gpu-computing 1 quantization 1 pruning 1 model-compression 1 kubernetes 1