GitHub topics: tensorrt-inference
asitiaf/llm-getting-started
Practical, beginner-friendly LLM projects using Python, LangChain, and LangSmith. Modular, reusable, and easy to run.
Language: Python - Size: 12.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 1

Sinapsis-AI/sinapsis-retina-face-trt
Sinapsis repo with templates for face detection, face recognition and face verification
Language: Python - Size: 274 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 6

laugh12321/TensorRT-YOLO
🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️
Language: C++ - Size: 23.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,146 - Forks: 121

emptysoal/TensorRT-YOLOv8
Based on tensorrt v8.0+, deploy detect, pose, segment, tracking of YOLOv8 with C++ and python api.
Language: C++ - Size: 212 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 95 - Forks: 17

emptysoal/Deepsort-YOLOv5-TensorRT
An object tracking project with YOLOv5-v5.0 and Deepsort, speed up by C++ and TensorRT.
Language: C++ - Size: 599 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 15 - Forks: 4

jagennath-hari/DepthStream-Accelerator-ROS2-Integrated-Monocular-Depth-Inference
DepthStream Accelerator: A TensorRT-optimized monocular depth estimation tool with ROS2 integration for C++. It offers high-speed, accurate depth perception, perfect for real-time applications in robotics, autonomous vehicles, and interactive 3D environments.
Language: C++ - Size: 10.9 MB - Last synced at: 29 days ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 0

jolibrain/deepdetect
Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
Language: C++ - Size: 103 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2,529 - Forks: 560

emptysoal/YOLOv5-TensorRT-lib-Python
The code of YOLOv5 inferencing with TensorRT C++ api is packaged into a dynamic link library , then called through Python.
Language: Cuda - Size: 748 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 14 - Forks: 1

lona-cn/vision-simple
a lightweight C++ cross-platform vision inference library,support YOLOv10 YOLOv11 PaddleOCR EasyOCR ,using ONNXRuntime/TVM with multiple exectuion providers.
Language: C++ - Size: 53.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 2

emptysoal/tensorrt-experiment
Base on tensorrt version 8.2.4, compare inference speed for different tensorrt api.
Language: Python - Size: 92 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 4

emptysoal/TensorRT-v8-YOLOv5-v5.0
Based on TensorRT v8.2, build network for YOLOv5-v5.0 by myself, speed up YOLOv5-v5.0 inferencing
Language: C++ - Size: 430 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 1

Qengineering/YoloV8-TensorRT-Jetson_Nano
A lightweight C++ implementation of YoloV8 running on NVIDIAs TensorRT engine
Language: C++ - Size: 328 MB - Last synced at: 28 days ago - Pushed at: 11 months ago - Stars: 18 - Forks: 1

mohamedsamirx/YOLOv12-TensorRT-CPP
YOLOv12 Inference Using CPP, Tensorrt, And CUDA
Language: C++ - Size: 1.98 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 32 - Forks: 1

Koldim2001/TrafficAnalyzer
Анализ трафика на круговом движении с использованием компьютерного зрения
Language: Python - Size: 401 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 82 - Forks: 6

fanpeace/dbnet_torch
you can use dbnet to detect word or bar code,Knowledge Distillation is provided,also python tensorrt inference is provided.
Language: Python - Size: 215 MB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 45 - Forks: 10

LCH1238/bevdet-tensorrt-cpp
BEVDet implemented by TensorRT, C++; Achieving real-time performance on Orin
Language: C++ - Size: 6.98 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 253 - Forks: 45

NVIDIA-ISAAC-ROS/isaac_ros_dnn_inference
NVIDIA-accelerated DNN model inference ROS 2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU
Language: C++ - Size: 405 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 111 - Forks: 16

yuvraj108c/ComfyUI-Depth-Anything-Tensorrt
ComfyUI Depth Anything (v1/v2) Tensorrt Custom Node (up to 14x faster)
Language: Python - Size: 3.19 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 95 - Forks: 8

liangxiansheng093/3d-sort
For 3d dynamic target detection and motion tracking includes target detection, 3D tracking program.
Language: C++ - Size: 11.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

NotAP0tato/awesome-ai-toolkit
An extensive curated list of AI tools, frameworks, and libraries to supercharge your artificial intelligence projects.
Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

parlaynu/inference-tensorrt
Convert ONNX models to TensorRT engines and run inference in containerized environments
Language: Python - Size: 9.77 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 0

xstongxue/TensorRT-Alpha-Plus
TensorRT for Web-RTDETR、Web-YOLOv8、RTDETR、YOLOv8、YOLOv8-Pose、YOLOv8-Seg、YOLOv8-Cls、YOLOv7、YOLOv6、YOLOv5......⭐
Language: C++ - Size: 106 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mingj2021/segment-anything-tensorrt
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 77 - Forks: 9

kamalkraj/stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tenorrt + tritonserver
Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 123 - Forks: 19

yester31/TensorRT_Examples
All useful sample codes of tensorrt models using onnx
Language: Python - Size: 240 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

cuixing158/yolo-tensorRT-cpp
部署量化库,适合pc,jetson,int8量化, yolov3/v4/v5
Language: C++ - Size: 610 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

PrinceP/tensorrt-sample-on-threads
A tutorial for getting started on running Tensorrt engine and Deep Learning Accelerator (DLA) models on threads
Language: C++ - Size: 2.93 KB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

spacewalk01/nanosam-cpp
C++ TensorRT Implementation of NanoSAM
Language: C++ - Size: 7.1 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 3

ggluo/TensorRT-Cpp-Example
C++/C TensorRT Inference Example for models created with Pytorch/JAX/TF
Language: C++ - Size: 767 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

YuzhouPeng/unet-tensorrt
this is a tensorrt version unet, inspired by tensorrtx
Language: C++ - Size: 32.2 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 35 - Forks: 11

ce-dric/TensorRT-Batch
TensorRT capable of processing batch units
Language: Jupyter Notebook - Size: 145 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

djetshu/yolo_nas_trt_training
Anaconda environment to train YOLONAS, to convert yolonas.onnx to TensorRT model and to test it with webcam in real time.
Language: Jupyter Notebook - Size: 4.26 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

umitkacar/ai-edge-computing-tiny-embedded
Size: 289 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 11 - Forks: 0

BlueMirrors/Yolov5-TensorRT
Yolov5 TensorRT Implementations
Language: Python - Size: 46.9 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 67 - Forks: 13

vossr/YOLOX-custom Fork of Megvii-BaseDetection/YOLOX
YOLOX TensorRT object detection
Language: Python - Size: 7.43 MB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

surajiitd/NVIDIA_Jetson_Inference
This repo contains model compression(using TensorRT) and documentation of running various deep learning models on NVIDIA Jetson Orin, Nano (aarch64 architectures)
Language: Makefile - Size: 1.39 GB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 2

gpastal24/ViTPose-Pytorch
VitPose without MMCV dependencies
Language: Python - Size: 152 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 8

gitctrlx/xtrt
A lightweight, high-performance deep learning inference tool.
Language: C++ - Size: 265 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

rolson24/ImprovedAssociationTrack-cpp
C++ implementation of An Improved Association Pipeline for Multi-Person Tracking
Language: C++ - Size: 27.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

CactusQ/TensorRT-LLM-Tutorial
Getting started with TensorRT-LLM using BLOOM as a case study
Language: Jupyter Notebook - Size: 85 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teamunitlab/yolo8-segmentation-deploy
Production-ready YOLO8 Segmentation deployment with TensorRT and ONNX support for CPU/GPU, including AI model integration guidance for Unitlab Annotate.
Language: Python - Size: 1.58 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 5

MaximeDebarbat/Dolphin
Dolphin is a python toolkit meant to speed up inference of TensorRT by providing CUDA-Accelerated processing.
Language: Python - Size: 588 KB - Last synced at: 26 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

Jayveersinh-Raj/cross-lingual-zero-shot-transfer
This is cross based project for industrial purposes. This would then be integrated as API, and repository would be made private accordingly
Language: Jupyter Notebook - Size: 5.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

edicek/JetsonNanoUbuntu20_Disk
32 GB SD card image for Jetson Nano based on Ubuntu 20 and compatible Yolov8 Ultralytics library
Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

yunkai1841/eipl-deploy
Inference code of `ogata-lab/eipl`. Control robots with machine learning models on edge computer.
Language: Python - Size: 247 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

gitctrlx/NarrativeCanvas
"Narrative Canvas" project is an edge computing project based on Nvidia Jetson. It can transform uploaded images into captivating stories and artworks.
Language: Python - Size: 94.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

col-in-coding/Tensorrt-CV
Using TensorRT for Inference Model Deployment.
Language: Cuda - Size: 90.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 9

thaitc-hust/Yolo-TensorRT
Convert yolo models to ONNX, TensorRT add NMSBatched.
Language: Python - Size: 5.46 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 4

littletomatodonkey/model_inference
不同backend的模型转换与推理代码
Language: Python - Size: 48.8 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ltkong218/FastFlowNet
FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation (ICRA 2021)
Language: Python - Size: 37.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 196 - Forks: 39

ycchen218/Pytorch-to-TensorRT-example
This is an mnist example of how to transfer a .pt file to .onnx, then transfer .onnx file to .trt file.
Language: Python - Size: 37.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

uschen/face-inference-rs
Rust GRPC server for face recognition, face detection and face alignment using TensorRT, Cuda on JetPack SDK (Jetson Nano, Jetson Xavier NX)
Language: C++ - Size: 39.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Rumeysakeskin/Speech-Command-Recognition
Classify input audio segment into categories for keyword spotting with MatchboxNet with training, exporting onnx model, accelerating inference via TensorRT
Language: Python - Size: 349 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 0

lzh420202/TensorRT_Inference
An oriented object detection framework based on TensorRT
Language: Python - Size: 57.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

Rumeysakeskin/NLP-Onnx-TensorRT
Joint Intent/Slot Classification for Jetson Nano, TX1/TX2, Xavier NX, and AGX Xavier
Language: Jupyter Notebook - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

leandro-svg/SparseInst_TensorRT
The real-time Instance Segmentation Algorithm SparseInst running on TensoRT and ONNX
Language: Python - Size: 57.6 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 3

storrrrrrrrm/tensorrt_smoke
3d object detection model smoke c++ inference code
Language: C++ - Size: 68.9 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

yester31/TensorRT_ONNX
Generating tensorrt model using onnx
Language: C++ - Size: 91.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

frannecki/trt_classification
TensorRT example for image classification inference
Language: C++ - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

CuteBoiz/TensorRT_Parser_Python
Export (from Onnx) and Inference TensorRT engine with Python
Language: Python - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

k9ele7en/ONNX-TensorRT-Inference-CRAFT-pytorch 📦
Advance inference performance using TensorRT for CRAFT Text detection. Implemented modules to convert Pytorch -> ONNX -> TensorRT, with dynamic shapes (multi-size input) inference.
Language: Python - Size: 12.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 2

vnghia/minrt
MIN(R)T: Minimal TensorRT inference
Language: C++ - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ashBabu/object_detection
Deployment using torchscript and tensorrt
Language: C++ - Size: 35.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

CuteBoiz/TensorRT_Parser_Cpp
Export (from Onnx) and Inference TensorRT engine with C++.
Language: C++ - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 4
