ecosyste.ms

Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: int3

Repositories

intel/neural-speed 📦

An innovative library for efficient LLM inference via low-bit quantization

Language: C++ - Size: 16.2 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 350 - Forks: 38

Related Keywords

cpu 1 fp4 1 fp8 1 gaudi2 1 gpu 1 int1 1 int2 1 int3 1 int4 1 int5 1 int6 1 int7 1 int8 1 llamacpp 1 llm-fine-tuning 1 llm-inference 1 low-bit 1 mxformat 1 nf4 1 sparsity 1