GitHub / clam004 / triton-ft-api

Tutorial on how to deploy a scalable autoregressive causal language model transformer using NVIDIA Triton Inference Server

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clam004%2Ftriton-ft-api
PURL: pkg:github/clam004/triton-ft-api

Stars: 5
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 52.7 KB
Dependencies parsed at: Pending

Created at: over 2 years ago
Updated at: about 2 years ago
Pushed at: over 2 years ago
Last synced at: 4 months ago

Topics: fastapi, fastertransformer, gpt, huggingface, nvidia, nvidia-docker, nvidia-gpu
