An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: openai-triton

ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python - Size: 7.35 MB - Last synced at: about 4 hours ago - Pushed at: about 6 hours ago - Stars: 3,412 - Forks: 270

BobMcDear/attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Language: Python - Size: 188 KB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 563 - Forks: 30

DeepAuto-AI/hip-attention

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Language: Python - Size: 40.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 141 - Forks: 14

chengzeyi/stable-fast

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python - Size: 409 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1,278 - Forks: 79

multi-modal-ai/ai-programming-hub

Learn and experiment with new techniques and programming languages with a focus on ML

Language: Jupyter Notebook - Size: 163 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 1