Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ptq

sony/model_optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

Language: Python - Size: 8.57 MB - Last synced: about 9 hours ago - Pushed: about 9 hours ago - Stars: 270 - Forks: 42

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

Language: Python - Size: 19 MB - Last synced: about 7 hours ago - Pushed: about 9 hours ago - Stars: 1,107 - Forks: 177

lix19937/tensorrt-insight

deep insight tensorrt

Language: C++ - Size: 4.09 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1 - Forks: 0

MAGICS-LAB/OutEffHop

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

Language: Python - Size: 241 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 4 - Forks: 0

OmidGhadami95/EfficientNetV2_Quantization_CK

EfficientNetV2 (Efficientnetv2-b2) and quantization int8 and fp32 (QAT and PTQ) on CK+ dataset . fine-tuning, augmentation, solving imbalanced dataset, etc.

Language: Jupyter Notebook - Size: 344 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

Bobo-y/flexible-yolov5

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt

Language: Python - Size: 12.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 641 - Forks: 117

smpanaro/norm-tweaking

Post post-training-quantization (PTQ) method for improving LLMs. Unofficial implementation of https://arxiv.org/abs/2309.02784

Language: Python - Size: 32.2 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

yester31/TensorRT_Sparse

inference with the structured sparsity and quantization

Language: Python - Size: 21.5 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

BlindOver/blindover_AI

Build AI model to classify beverages for blind individuals

Language: Python - Size: 590 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1

yester31/Quantization_EX

quantization example for pqt & qat

Language: Python - Size: 94.7 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

yester31/TensorRT_ONNX

Generating tensorrt model using onnx

Language: C++ - Size: 91.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

yester31/TensorRT_EX

Deep Learning Model Optimization Using by TensorRT API, window

Language: Python - Size: 160 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 3