quantization-fundamentals | Topic

Topic: "quantization-fundamentals"

ksm26/Quantization-Fundamentals-with-Hugging-Face

Learn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.

Language: Jupyter Notebook - Size: 205 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 9

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

Topic: "quantization-fundamentals"

ksm26/Quantization-Fundamentals-with-Hugging-Face