GitHub topics: serving-pytorch-models
clearml/clearml-serving
ClearML - Model-Serving Orchestration and Repository Solution
Language: Python - Size: 1.85 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 151 - Forks: 42

Lake-Wang/MLops_System_NBA_Attendance Fork of jasonmoon97/dynamic_nba_scheduling
End-to-end NBA analytics pipeline for predicting game outcomes and attendance using PyTorch, MLflow, and ONNX. Includes data scraping, model training, quantization, and scalable deployment with FastAPI and Triton Inference Server.
Language: Jupyter Notebook - Size: 28.7 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

ahkarami/Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
Size: 315 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 4,345 - Forks: 691

fabridamicelli/torchserve-docker
TorchServe images with specific Python version working out-of-the-box.
Language: Python - Size: 374 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

balavenkatesh3322/model_deployment
A collection of model deployment library and technique.
Size: 175 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 9

SapienzaNLP/usea
Universal Semantic Annotator (LREC 2022)
Size: 1.13 MB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 17 - Forks: 1

gasparian/PicsArtHack-binary-segmentation
Segmenting people on photos using IOS devices [Pytorch; Unet]
Language: Python - Size: 8.67 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 61 - Forks: 8

IonBoleac/serve-torch-deployments
A proof-of-concept on how to install and use Torchserve in various mode
Language: Python - Size: 12.8 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lukedeo/torch-serving
Simple HTTP serving for PyTorch 🚀
Language: C++ - Size: 34.5 MB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 2

trinhtuanvubk/Wav2Vec2-Triton-Serving
Serve Wav2Vec2 model using Triton Inference Server
Language: Python - Size: 623 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nikitajz/pytorch-flask-inference
Serving PyTorch model using flask and docker
Language: Python - Size: 89.8 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bcaitech1/p3-dst-chatting-day
Chatting-Day's Dialogue State Tracking (DST)
Language: Python - Size: 17.3 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 2

ingyuseong/rabbitmq-inference
A message queue based server architecture to asynchronously handle resource-intensive tasks (e.g., ML inference)
Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
