Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.baseten.co/llms.txt

Use this file to discover all available pages before exploring further.

This reference section documents our API, CLI, and Python SDK for deploying models, managing inference chains, and calling endpoints in production.

API Reference

Baseten provides two sets of API endpoints:
inference-api

Inference API

For calling deployed models and chains.
management-api

Management API

For managing models, workspaces, and training jobs.

CLI Reference

The CLI provides a command-line interface for managing deployments, running local inference, and configuring Truss models.

SDK Reference

The Python SDK provides an abstraction for deploying models, managing deployments, and interacting with models via code.