GitHub / clam004 / triton-ft-api
tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clam004%2Ftriton-ft-api
PURL: pkg:github/clam004/triton-ft-api
Stars: 5
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 52.7 KB
Dependencies parsed at: Pending
Created at: over 2 years ago
Updated at: about 2 years ago
Pushed at: over 2 years ago
Last synced at: 4 months ago
Topics: fastapi, fastertransformer, gpt, huggingface, nvidia, nvidia-docker, nvidia-gpu