Examples
Building with Baseten
These examples cover a variety of use cases on Baseten, from deploying your first LLM and image generation to transcription, embeddings, and RAG pipelines. Whether you’re optimizing inference with TensorRT-LLM or deploying a model with Truss, these guides help you build and scale efficiently.
Featured examples
Deploy your first model
Fast LLMs with TensorRT-LLM
Run any LLM with vLLM
Deploy LLMs with SGLang
Transcribe audio with a Chain
Embeddings with BEI
Model library
For a quick start, explore the model library, which offers prebuilt models such as DeepSeek, Llama, and Qwen that are ready to deploy in one click.
DeepSeek R1
Whisper V3
Qwen 2.5 32B Coder Instruct
Llama 3.3 70B Instruct
flux-schnell
MARS6