GitHub topics: machine-learning-systems
inclusionAI/AReaL
Distributed RL System for LLM Reasoning
Language: Python - Size: 9.61 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,583 - Forks: 77

AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
Size: 3.96 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,164 - Forks: 97

fla-org/flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Language: Python - Size: 4.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,456 - Forks: 175

mental2008/awesome-papers
My personal paper reading notes (covering cloud computing, resource management, systems, machine learning, deep learning, and other interesting topics).
Size: 15.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 105 - Forks: 4

AIoT-MLSys-Lab/Efficient-Diffusion-Model-Survey
[TMLR 2025] Efficient Diffusion Models: A Survey
Size: 320 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 60 - Forks: 3

1duo/awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
Size: 11.8 MB - Last synced at: 23 days ago - Pushed at: about 6 years ago - Stars: 416 - Forks: 74

matteo-spadaccia/Machine-Learning-Systems-Project-2025 (fork of ed-aisys/edin-mls-25-spring)
ML Systems group project, completed during an MSc degree at the University of Edinburgh
Language: Python - Size: 7.12 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

byungsoo-oh/ml-systems-papers
Curated collection of papers in machine learning systems
Size: 178 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 325 - Forks: 17

tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers
A curated list of high-quality papers on resource-efficient LLMs 🌱
Size: 336 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 117 - Forks: 7

wepe/dive-into-ml-system
Dive into machine learning systems, starting by reinventing the wheel.
Language: C++ - Size: 1.52 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 231 - Forks: 28

Ziyang-Yu/Awesome-Resource-Efficient-LLM-Papers (fork of tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers)
A curated list of high-quality papers on resource-efficient LLMs 🌱
Size: 252 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

elinx/ugrad
A C++ implementation of the scalar-valued autograd engine micrograd
Language: C++ - Size: 2.49 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 23 - Forks: 2

fla-org/flash-bidirectional-linear-attention
Triton implementation of bidirectional (non-causal) linear attention
Language: Python - Size: 78.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 40 - Forks: 1

patternex/awesome-ml-for-threat-detection
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
Size: 46.9 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 1

MLSys-Learner-Resources/Awesome-MLSys-Blogger
A collection of noteworthy MLSys bloggers (Algorithms/Systems)
Language: HTML - Size: 940 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Moe-Zbeeb/TAI
TAI
Language: Python - Size: 23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pooyanjamshidi/mls
CSCE 585 - Machine Learning Systems
Language: TeX - Size: 921 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 40 - Forks: 8

ibragim-bad/machine-learning-design-primer
Learn how to design and implement effective Machine Learning systems from start to finish.
Size: 17.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 210 - Forks: 19

thaisaraujom/machine-learning
Projects and summaries for the Machine Learning [PPGEEC2318] course at UFRN, taught by Professor Ivanovitch Silva.
Size: 5.58 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Relaxed-System-Lab/COMP4901Y_Course_HKUST
Course Material for the UG Course COMP4901Y
Language: Python - Size: 26.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 0

meton-robean/Machine-Learning-System-Notes
Collects projects and research related to machine learning systems, using issues to record paper-reading summaries
Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

jared-ni/PRIMES-cs243-final
Price Incentive Model Efficiency System: 1.5-2x higher accuracy per training round than random/Oort, up to 30% decrease in convergence time.
Language: Python - Size: 22.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

withsmilo/When-ML-pipeline-meets-Hydra
🌀
Language: Python - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 2

SymbioticLab/ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 5

SymbioticLab/Oort
Oort: Efficient Federated Learning via Guided Participant Selection
Language: Python - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 94 - Forks: 25

l1nkr/DL-Compiler-Navigation
Machine Learning Compiler Road Map
Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 2

csce585-mlsystems/project-athena
The course project for CSCE585: ML Systems. Students build their own machine learning systems on top of the provided infrastructure, Athena.
Language: Python - Size: 3.97 GB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 20

wi-pi/rethinking-image-scaling-attacks
[ICML 2022] Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems
Language: Python - Size: 5.97 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

sreenivasanramesh/MachineLearningSystems
Assignments for Data Intensive Systems for Machine Learning Coursework
Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

sagniknitr/torch-WiT
A tool to predict the efficacy of DNN optimizations
Language: Python - Size: 4.17 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0
