An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: quantization-fundamentals

ksm26/Quantization-Fundamentals-with-Hugging-Face

Learn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.

Language: Jupyter Notebook - Size: 205 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 9