This reference section documents our API, CLI, and Python SDK for deploying models, managing inference chains, and calling endpoints in production.Documentation Index
Fetch the complete documentation index at: https://docs.baseten.co/llms.txt
Use this file to discover all available pages before exploring further.
API Reference
Baseten provides two sets of API endpoints:
Inference API
For calling deployed models and chains.

Management API
For managing models, workspaces, and training jobs.
CLI Reference
The CLI provides a command-line interface for managing deployments, running local inference, and configuring Truss models.- Truss CLI reference: Commands for initializing, deploying, and managing models.
- Chains CLI reference: Commands for orchestrating multi-model workflows.
- Training CLI reference: Commands for managing training jobs.
SDK Reference
The Python SDK provides an abstraction for deploying models, managing deployments, and interacting with models via code.- Truss SDK reference: Deploy and interact with Truss models using Python.
- Chains SDK reference: Build and manage inference chains programmatically.
- Training SDK reference: Deploy and interact with trained models using Python.