An open API service providing repository metadata for many open source software ecosystems.

Topic: "ptq"

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

Language: Python - Size: 20.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,292 - Forks: 210

Bobo-y/flexible-yolov5

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt

Language: Python - Size: 12.4 MB - Last synced at: 40 minutes ago - Pushed at: 8 months ago - Stars: 673 - Forks: 118

sony/model_optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

Language: Python - Size: 22 MB - Last synced at: about 10 hours ago - Pushed at: about 11 hours ago - Stars: 386 - Forks: 66

TsingmaoAI/MI-optimize

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniques empowers users to customize their approaches according to specific requirements and constraints, providing a high level of flexibility.

Language: Python - Size: 18.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 19 - Forks: 3

MAGICS-LAB/OutEffHop

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

Language: Python - Size: 249 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 2

lix19937/tensorrt-insight

Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda

Language: C++ - Size: 7.27 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 0

yester31/TensorRT_EX

Deep Learning Model Optimization Using by TensorRT API, window

Language: Python - Size: 160 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 3

yester31/TensorRT_Examples

All useful sample codes of tensorrt models using onnx

Language: Python - Size: 240 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

amajji/LLM-Quantization-Techniques-Absmax-Zeropoint-GPTQ-GGUF

LLM quantization techniques: absmax, zero-point, GPTQ and GGUF

Language: Jupyter Notebook - Size: 182 KB - Last synced at: 26 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 1

OmidGhadami95/EfficientNetV2_Quantization_CK

EfficientNetV2 (Efficientnetv2-b2) and quantization int8 and fp32 (QAT and PTQ) on CK+ dataset . fine-tuning, augmentation, solving imbalanced dataset, etc.

Language: Jupyter Notebook - Size: 344 KB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

yester31/TensorRT_ONNX

Generating tensorrt model using onnx

Language: C++ - Size: 91.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ambideXtrous9/Quantization-of-Models-PTQ-and-QAT

Quantization of Models : Post-Training Quantization(PTQ) and Quantize Aware Training(QAT)

Language: Jupyter Notebook - Size: 5.1 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

smpanaro/norm-tweaking

Post post-training-quantization (PTQ) method for improving LLMs. Unofficial implementation of https://arxiv.org/abs/2309.02784

Language: Python - Size: 32.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

yester31/TensorRT_Sparse

inference with the structured sparsity and quantization

Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BlindOver/blindover_AI

Build AI model to classify beverages for blind individuals

Language: Python - Size: 590 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

yester31/Quantization_EX

quantization example for pqt & qat

Language: Python - Size: 94.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Topics