This reference section documents our API, CLI, and Python SDK for deploying models, managing inference chains, and calling endpoints in production.

API Reference

Baseten provides two sets of API endpoints:

Inference API

For calling deployed models and chains.

Management API

For managing models, workspaces, and training jobs.
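
As a rough illustration of what calling a deployed model through the Inference API might look like, the sketch below builds (but does not send) an HTTP request using only the Python standard library. The URL pattern, the `Api-Key` authorization scheme, the helper name `build_predict_request`, and the payload shape are assumptions made for this example, not details stated on this page; check the Inference API reference for the exact endpoint and headers.

```python
import json
import urllib.request


def build_predict_request(model_id: str, api_key: str, payload: dict) -> urllib.request.Request:
    """Build a POST request for a deployed model (hypothetical helper)."""
    # ASSUMPTION: this URL pattern and the "Api-Key" header scheme are
    # illustrative; confirm both against the Inference API reference.
    url = f"https://model-{model_id}.api.baseten.co/environments/production/predict"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder model ID, key, and payload; nothing is sent over the network here.
request = build_predict_request("abc123", "YOUR_API_KEY", {"prompt": "Hello"})
```

To actually send the request, pass it to `urllib.request.urlopen(request)` and read the response body. Keeping construction separate from sending makes the request easy to inspect before any network call is made.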

CLI Reference

Use the CLI to manage deployments, run local inference, and configure Truss models from the command line.

SDK Reference

Use the Python SDK to deploy models, manage deployments, and call models from code.