Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: streamingllm
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Size: 114 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,183 - Forks: 87
intel/intel-extension-for-transformers
âš¡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsâš¡
Language: Python - Size: 543 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,909 - Forks: 180