GitHub topics: spinquant
ModelTC/llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Language: Python - Size: 29.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 474 - Forks: 53
