GitHub topics: int8-inference
jahongir7174/YOLOv8-qat
Quantization Aware Training
Language: Python - Size: 9.31 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 70 - Forks: 11

DerryHub/BEVFormer_tensorrt
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
Language: Python - Size: 403 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 466 - Forks: 76

BUG1989/caffe-int8-convert-tools
Generate a quantization parameter file for ncnn framework int8 inference
Language: Python - Size: 622 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 519 - Forks: 154

anilsathyan7/Portrait-Segmentation
Real-time portrait segmentation for mobile devices
Language: Jupyter Notebook - Size: 495 MB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 645 - Forks: 135

ENOT-AutoDL/gpt-j-6B-tensorrt-int8
GPT-J 6B inference on TensorRT with INT-8 precision
Language: Python - Size: 24.4 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

JohnClaw/chatllm.vb
VB.NET api wrapper for llm-inference chatllm.cpp
Language: Visual Basic .NET - Size: 6.84 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

akashAD98/yolov7_vino_with_object_tracking
it has support for openvino converted model of yolov7-int.xml ,yolov7x,
Language: Python - Size: 673 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 2

JohnClaw/chatllm.cs
C# api wrapper for llm-inference chatllm.cpp
Language: C# - Size: 779 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

daniel-rychlewski/cnn-planesnet
Compressed CNNs for airplane classification in satellite images (APoZ-based parameter pruning, INT8 weight quantization)
Language: Python - Size: 497 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

ENOT-AutoDL/ENOT-transformers
Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 1

yester31/TensorRT_ONNX
Generating tensorrt model using onnx
Language: C++ - Size: 91.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

whitelok/tensorrt-int8-python-sample
TensorRT Int8 Python version sample. TensorRT Int8 Python 实现例子。TensorRT Int8 Pythonの例です
Language: Python - Size: 1.5 MB - Last synced at: 9 months ago - Pushed at: about 6 years ago - Stars: 14 - Forks: 1

Howell-Yang/onnx2trt
将端上模型部署过程中,常见的问题以及解决办法记录并汇总,希望能给其他人带来一点帮助。
Language: Python - Size: 258 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
