Baseten home page
Search...
⌘K
Get started
Overview
Quick start
Concepts
Why Baseten
How Baseten works
Development
Concepts
Model APIs
Developing a model
Developing a Chain
Deployment
Concepts
Deployments
Environments
Resources
Autoscaling
Inference
Concepts
Call your model
Streaming
Async inference
Structured LLM output
Output formats
Integrations
Training
Overview
Getting started
Concepts
Management
Deploying checkpoints
Observability
Metrics
Status and health
Security
Exporting metrics
Tracing
Billing and usage
Troubleshooting
Deployments
Inference
Support
Return to Baseten
Baseten home page
Search...
⌘K
Ask AI
Support
Return to Baseten
Return to Baseten
Search...
Navigation
Quick start
Documentation
Examples
Reference
Status
Documentation
Examples
Reference
Status
Quick start
1
What modality are you working with?
Select a different modality
Compound AI
Build real-time AI-native applications
2
Select a model or guide to get started...
Get started quickly
by deploying a model from our library in seconds.
Whisper V3
Explore model library
Or choose
a step-by-step guide to help you get started.
Building your first Chain
A quickstart guide to building your first Chain
Building a RAG pipeline
An example of a RAG pipeline built with Chains
Was this page helpful?
Yes
No
Assistant
Responses are generated using AI and may contain mistakes.