Topic: "quantized-networks"
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Language: Python - Size: 2.22 MB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 1,531 - Forks: 325

google/qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
Language: Python - Size: 1.53 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 562 - Forks: 105

bytedance/ABQ-LLM
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
Language: C++ - Size: 53.9 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 221 - Forks: 21

HuangCongQing/model-compression-optimization
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏,量化,剪枝)
Language: Python - Size: 20 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 18 - Forks: 2
