Optimized inference for LLMs with SGLang
Create an API key for your Baseten account
Add an access token for Hugging Face
hf_access_token
secret to your Baseten workspace.Install Truss in your local development environment
config.yaml
we created above.