An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llm-quantization

snu-mllab/GuidedQuant

Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025)

Language: Python - Size: 3.38 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 31 - Forks: 0

MagicTeaMC/AutoGGUF

Let me make GGUF files quickly

Language: Python - Size: 19.5 KB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

GongCheng1919/bias-compensation

[CAAI AIR'24] Minimize Quantization Output Error with Bias Compensation

Language: Python - Size: 918 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 1

paraglondhe098/sentiment-classification-llm

Implemented and fine-tuned BERT for a custom sequence classification task, leveraging LoRA adapters for efficient parameter updates and 4-bit quantization to optimize performance and resource utilization.

Language: Jupyter Notebook - Size: 6.66 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nagababumo/Quantization-in-Depth

Language: Jupyter Notebook - Size: 5.81 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0