Topic: "quantization-fundamentals"
ksm26/Quantization-Fundamentals-with-Hugging-Face
Learn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.
Language: Jupyter Notebook - Size: 205 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 9
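For readers unfamiliar with the term, here is a minimal sketch of what linear quantization means, written in plain Python rather than with the Quanto library. It is not taken from the repository: the function names `linear_quantize` and `linear_dequantize`, the asymmetric min/max scheme, and the sample weights are all assumptions chosen for illustration.

```python
def linear_quantize(values, num_bits=8):
    """Map floats to signed integers via r ~= scale * (q - zero_point).

    Hypothetical helper for illustration; not the Quanto API.
    """
    q_min = -(2 ** (num_bits - 1))
    q_max = 2 ** (num_bits - 1) - 1
    r_min, r_max = min(values), max(values)

    # Scale maps the float range onto the integer range.
    # Fall back to 1.0 when all values are identical to avoid dividing by zero.
    scale = (r_max - r_min) / (q_max - q_min) if r_max != r_min else 1.0

    # Zero point is the integer that represents (approximately) r_min at q_min.
    zero_point = int(round(q_min - r_min / scale))
    zero_point = max(q_min, min(q_max, zero_point))

    quantized = []
    for v in values:
        q = int(round(v / scale + zero_point))
        quantized.append(max(q_min, min(q_max, q)))  # clamp to the integer range
    return quantized, scale, zero_point


def linear_dequantize(quantized, scale, zero_point):
    """Recover approximate floats from the quantized integers."""
    return [scale * (q - zero_point) for q in quantized]


# Assumed sample weights, purely illustrative.
weights = [-0.8, -0.3, 0.0, 0.45, 1.2]
q_weights, scale, zero_point = linear_quantize(weights)
restored = linear_dequantize(q_weights, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each restored value should differ from its original by roughly one quantization step (`scale`) at most, which is the precision given up in exchange for storing 8-bit integers instead of 32-bit floats.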
