GitHub / vllm-project / llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vllm-project%2Fllm-compressor
Stars: 1,220
Forks: 114
Open issues: 83
License: apache-2.0
Language: Python
Size: 22.5 MB
Dependencies parsed at: Pending
Created at: 10 months ago
Updated at: 6 days ago
Pushed at: 6 days ago
Last synced at: 5 days ago
Topics: compression, quantization, sparsity
Funding Links: https://github.com/sponsors/vllm-project
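
The JSON API URL listed above encodes the owner/name pair as a single path segment, with the slash written as %2F. A minimal sketch of how such a URL could be built and fetched from Python follows. The helper names `repository_api_url` and `fetch_repository` are hypothetical and are not part of any published client. The URL pattern is inferred only from the one example on this page, and the shape of the response is not shown here, so the fetch helper simply returns whatever JSON the server sends.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the JSON API line above; assumed to be the same for other repositories.
API_BASE = "http://repos.ecosyste.ms/api/v1"


def repository_api_url(host: str, full_name: str) -> str:
    """Build the JSON API URL for a repository such as 'vllm-project/llm-compressor'."""
    # safe="" makes quote() encode "/" as %2F, so owner/name stays one path segment.
    encoded_host = quote(host, safe="")
    encoded_name = quote(full_name, safe="")
    return f"{API_BASE}/hosts/{encoded_host}/repositories/{encoded_name}"


def fetch_repository(host: str, full_name: str, timeout: float = 10.0):
    """Fetch and decode the repository record (requires network access)."""
    with urlopen(repository_api_url(host, full_name), timeout=timeout) as response:
        # json.load accepts the binary response object directly.
        return json.load(response)
```

Calling `repository_api_url("GitHub", "vllm-project/llm-compressor")` reproduces the JSON API URL shown above. `fetch_repository` is left uncalled here because it depends on the service being reachable.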