Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: streamingllm

DefTruth/Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

Size: 114 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,183 - Forks: 87

intel/intel-extension-for-transformers

âš¡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsâš¡

Language: Python - Size: 543 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,909 - Forks: 180