Topic: "low-bit"
intel/neural-speed 📦
An innovative library for efficient LLM inference via low-bit quantization
Language: C++ - Size: 16.2 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 350 - Forks: 38

fdbtrs/QuantFace
QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit Quantization
Language: Python - Size: 5.47 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 30 - Forks: 3

pyjhzwh/low-bit-quant-admm
ADMM for CNN layer-wise weight low-bit quantization
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 1
