GitHub / codewithdark-git / QuantLLM
QuantLLM is a Python library designed for developers, researchers, and teams who want to fine-tune and deploy large language models (LLMs) efficiently using 4-bit and 8-bit quantization techniques.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codewithdark-git%2FQuantLLM
PURL: pkg:github/codewithdark-git/QuantLLM
Stars: 5
Forks: 0
Open issues: 1
License: None
Language: Python
Size: 294 KB
Dependencies parsed at: Pending
Created at: 2 months ago
Updated at: 16 days ago
Pushed at: 16 days ago
Last synced at: 11 days ago
Topics: 4bit, 8bit, huggingface, llm, pypi, pypi-package, pypi-packages, python, python-lambda, python3, pytorch, quantization, quantum, torch, transformers
Funding Links: https://github.com/sponsors/codewithdark-git, https://patreon.com/codewithdark, https://buymeacoffee.com/codewithdark
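
The JSON API URL listed above percent-encodes the slash in the repository's full name (`codewithdark-git%2FQuantLLM`), while the PURL keeps it literal. As a minimal sketch of how the two identifiers could be rebuilt from a host and an `owner/name` string: the helper names `repo_api_url` and `repo_purl` are hypothetical, and the URL layout is simply copied from the two lines in this listing, not taken from any official client.

```python
from urllib.parse import quote

# Base path copied from the "JSON API" line in this listing.
API_ROOT = "http://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the repository lookup URL (hypothetical helper).

    safe="" makes quote() encode "/" as %2F, so "owner/name"
    stays a single path segment, as in the listed URL.
    """
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def repo_purl(full_name: str) -> str:
    """Build the pkg:github identifier (hypothetical helper).

    The slash is left literal here, mirroring the "PURL" line.
    """
    return f"pkg:github/{full_name}"


url = repo_api_url("GitHub", "codewithdark-git/QuantLLM")
purl = repo_purl("codewithdark-git/QuantLLM")
print(url)
print(purl)
```

The two printed strings should match the JSON API and PURL fields of this listing; fetching the URL and reading the response is left out because the response schema is not shown here.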