Reference
Reference documentation
For deploying, managing, and interacting with machine learning models on Baseten.
This reference section documents our API, CLI, and Python SDK—whether you’re deploying models, managing inference chains, or calling endpoints in production.
API Reference
Baseten provides two sets of API endpoints:

Inference API
For calling deployed models and chains.

Management API
For managing models and workspaces.
CLI Reference
The CLI provides a command-line interface for managing deployments, running local inference, and configuring Truss models.
- Truss CLI reference – Commands for initializing, deploying, and managing models.
- Chains CLI reference – Commands for orchestrating multi-model workflows.
SDK Reference
The Python SDK provides an abstraction for deploying models, managing deployments, and interacting with models via code.
- Truss SDK reference – Deploy and interact with Truss models using Python.
- Chains SDK reference – Build and manage inference chains programmatically.
Was this page helpful?