GitHub topics: ai-infra
biubiutomato/TME-Agent
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.
Language: Python - Size: 1.66 MB - Last synced at: about 1 hour ago - Pushed at: about 5 hours ago - Stars: 15 - Forks: 2

NexusGPU/vgpu.rs
vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust
Language: Rust - Size: 269 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 12 - Forks: 4

thu-ml/SpargeAttn
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
Language: Cuda - Size: 55.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 568 - Forks: 38

HuaizhengZhang/AI-Infra-from-Zero-to-Hero
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
Size: 891 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,938 - Forks: 324

tensorchord/ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
Language: HTML - Size: 23.8 MB - Last synced at: 19 days ago - Pushed at: 8 months ago - Stars: 145 - Forks: 57

leonardocremasco/TME-Agent
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.
Language: Python - Size: 1.65 MB - Last synced at: 24 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

ForceInjection/AI-fundermentals
AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 128 - Forks: 10

awesomelistsio/awesome-ai-infrastructure
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
Language: Python - Size: 4.88 KB - Last synced at: 24 days ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

oliverlabs/alz-catalogue
This repository contains a list of various service-specific Azure Landing Zone implementation options.
Size: 67.4 KB - Last synced at: 19 days ago - Pushed at: 29 days ago - Stars: 10 - Forks: 0

jinbooooom/OriginDL
OriginDL: A distributed deep learning framework Built from scratch
Language: C++ - Size: 17.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 7

jinbooooom/ai-infra-hpc
hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
Language: Cuda - Size: 103 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 2

raptor-ml/raptor
Transform your pythonic research to an artifact that engineers can deploy easily.
Language: Go - Size: 4.55 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 153 - Forks: 12

ChaoHsin-fang/MlxPersistentNaming
Persistent Naming for Mellanox Ethernet Interfaces
Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

kleveross/klever
Cloud Native ML/DL Platform
Size: 85.9 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 133 - Forks: 20

lastmover/aiomegacycle
visualize ai omegacycle
Language: HTML - Size: 426 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

yuelinxin/lisa
The Lisa programming language.
Language: C++ - Size: 135 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 3 - Forks: 1

memas-ai/MeMaS 📦
Memory Management Service, a Long Term Memory Solution for AI
Language: Python - Size: 145 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 8 - Forks: 0

DeCenter-AI/decenter-ai.streamlit.app
Decenteralized AI training platform for all
Language: Python - Size: 79.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2
