Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: ptq
sony/model_optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
Language: Python - Size: 8.57 MB - Last synced: about 9 hours ago - Pushed: about 9 hours ago - Stars: 270 - Forks: 42
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
Language: Python - Size: 19 MB - Last synced: about 7 hours ago - Pushed: about 9 hours ago - Stars: 1,107 - Forks: 177
lix19937/tensorrt-insight
deep insight tensorrt
Language: C++ - Size: 4.09 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1 - Forks: 0
MAGICS-LAB/OutEffHop
[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
Language: Python - Size: 241 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 4 - Forks: 0
OmidGhadami95/EfficientNetV2_Quantization_CK
EfficientNetV2 (EfficientNetV2-B2) with int8 and fp32 quantization (QAT and PTQ) on the CK+ dataset. Fine-tuning, augmentation, handling the imbalanced dataset, etc.
Language: Jupyter Notebook - Size: 344 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
Bobo-y/flexible-yolov5
More readable and flexible YOLOv5 with more backbones (gcn, resnet, shufflenet, mobilenet, efficientnet, hrnet, swin-transformer, etc.), extra modules (cbam, dcn, and so on), and TensorRT
Language: Python - Size: 12.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 641 - Forks: 117
smpanaro/norm-tweaking
Post post-training-quantization (PTQ) method for improving LLMs. Unofficial implementation of https://arxiv.org/abs/2309.02784
Language: Python - Size: 32.2 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
yester31/TensorRT_Sparse
Inference with structured sparsity and quantization
Language: Python - Size: 21.5 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
BlindOver/blindover_AI
Build AI model to classify beverages for blind individuals
Language: Python - Size: 590 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1
yester31/Quantization_EX
Quantization example for PTQ & QAT
Language: Python - Size: 94.7 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
yester31/TensorRT_ONNX
Generating tensorrt model using onnx
Language: C++ - Size: 91.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
yester31/TensorRT_EX
Deep learning model optimization using the TensorRT API on Windows
Language: Python - Size: 160 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 3