GitHub / aws-samples / optimize-foundation-models-deployment-on-amazon-sagemaker
In this workshop, we demonstrate how to choose the right container and instance types, optimize container parameters, set up appropriate autoscaling policies, and use Amazon SageMaker APIs to get deployment recommendations. Sketches of the autoscaling and recommendation APIs are shown below the repository metadata.
Stars: 3
Forks: 0
Open issues: 1
License: MIT-0
Language: Jupyter Notebook
Size: 1.25 MB
Created at: over 1 year ago
Updated at: 8 months ago
Pushed at: about 1 month ago
Topics: llm-inference, sagemaker, sagemaker-deployment
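
The workshop drives these steps from notebooks; as a minimal sketch of the autoscaling step, the snippet below registers a SageMaker endpoint production variant with Application Auto Scaling and attaches a target-tracking policy on invocations per instance. The endpoint name, variant name, capacity limits, and target value are placeholders, not values taken from the workshop.

```python
import boto3

autoscaling = boto3.client("application-autoscaling")

# Placeholder endpoint and variant names.
resource_id = "endpoint/my-llm-endpoint/variant/AllTraffic"

# Register the variant's instance count as a scalable target.
autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=4,
)

# Target-tracking policy: scale on invocations per instance.
autoscaling.put_scaling_policy(
    PolicyName="invocations-target-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 10.0,  # invocations per instance; tune per model and load test
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
        "ScaleOutCooldown": 60,
        "ScaleInCooldown": 300,
    },
)
```

For instance-type recommendations, SageMaker Inference Recommender can be invoked through the boto3 SageMaker client. The sketch below assumes a registered model package ARN, an execution role ARN, and a job name, all of which are placeholders.

```python
import boto3

sagemaker = boto3.client("sagemaker")

# Launch a default Inference Recommender job for a registered model package.
sagemaker.create_inference_recommendations_job(
    JobName="llm-recommendation-job",
    JobType="Default",
    RoleArn="arn:aws:iam::111122223333:role/SageMakerExecutionRole",
    InputConfig={
        "ModelPackageVersionArn": (
            "arn:aws:sagemaker:us-east-1:111122223333:model-package/my-llm/1"
        ),
    },
)

# Once the job completes, inspect the recommended instance types and metrics.
result = sagemaker.describe_inference_recommendations_job(
    JobName="llm-recommendation-job"
)
for rec in result.get("InferenceRecommendations", []):
    endpoint_cfg = rec["EndpointConfiguration"]
    print(endpoint_cfg["InstanceType"], rec["Metrics"]["CostPerHour"])
```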