GitHub topics: openai-triton
ModelTC/LightLLM
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language: Python - Size: 7.35 MB - Last synced at: about 4 hours ago - Pushed at: about 6 hours ago - Stars: 3,412 - Forks: 270

BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Language: Python - Size: 188 KB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 563 - Forks: 30

DeepAuto-AI/hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Language: Python - Size: 40.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 141 - Forks: 14

chengzeyi/stable-fast
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Language: Python - Size: 409 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1,278 - Forks: 79

multi-modal-ai/ai-programming-hub
Learn and experiment with new techniques and programming languages with a focus on ML
Language: Jupyter Notebook - Size: 163 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 1
