An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: serving-pytorch-models

clearml/clearml-serving

ClearML - Model-Serving Orchestration and Repository Solution

Language: Python - Size: 1.85 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 151 - Forks: 42

Lake-Wang/MLops_System_NBA_Attendance Fork of jasonmoon97/dynamic_nba_scheduling

End-to-end NBA analytics pipeline for predicting game outcomes and attendance using PyTorch, MLflow, and ONNX. Includes data scraping, model training, quantization, and scalable deployment with FastAPI and Triton Inference Server.

Language: Jupyter Notebook - Size: 28.7 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

ahkarami/Deep-Learning-in-Production

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

Size: 315 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 4,345 - Forks: 691

fabridamicelli/torchserve-docker

TorchServe images with specific Python version working out-of-the-box.

Language: Python - Size: 374 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

balavenkatesh3322/model_deployment

A collection of model deployment library and technique.

Size: 175 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 9

SapienzaNLP/usea

Universal Semantic Annotator (LREC 2022)

Size: 1.13 MB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 17 - Forks: 1

gasparian/PicsArtHack-binary-segmentation

Segmenting people on photos using IOS devices [Pytorch; Unet]

Language: Python - Size: 8.67 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 61 - Forks: 8

IonBoleac/serve-torch-deployments

A proof-of-concept on how to install and use Torchserve in various mode

Language: Python - Size: 12.8 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lukedeo/torch-serving

Simple HTTP serving for PyTorch 🚀

Language: C++ - Size: 34.5 MB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 2

trinhtuanvubk/Wav2Vec2-Triton-Serving

Serve Wav2Vec2 model using Triton Inference Server

Language: Python - Size: 623 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nikitajz/pytorch-flask-inference

Serving PyTorch model using flask and docker

Language: Python - Size: 89.8 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bcaitech1/p3-dst-chatting-day

Chatting-Day's Dialogue State Tracking (DST)

Language: Python - Size: 17.3 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 2

ingyuseong/rabbitmq-inference

A message queue based server architecture to asynchronously handle resource-intensive tasks (e.g., ML inference)

Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0