An open API service providing repository metadata for many open source software ecosystems.

Topic: "sparsegpt"

intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language: Python - Size: 469 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2,400 - Forks: 267