By name
Fetch a Model API by name. When the workspace has added the Model API, the response also includes workspace-specific state.
Authorizations
Pass your Baseten API key. Clients automatically send Authorization: Bearer <key>. Direct callers can also use Authorization: Api-Key <key>; both schemes are accepted.
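As a minimal sketch of the two accepted schemes, the helper below builds the Authorization header for a direct caller. The function name `auth_headers` and its `scheme` parameter are illustrative assumptions, not part of any Baseten client.

```python
def auth_headers(api_key: str, scheme: str = "Bearer") -> dict[str, str]:
    """Build an Authorization header using one of the two accepted schemes.

    `auth_headers` is a hypothetical helper for illustration only.
    """
    if scheme not in ("Bearer", "Api-Key"):
        raise ValueError(f"unsupported scheme: {scheme!r}")
    return {"Authorization": f"{scheme} {api_key}"}


# Either header form should be accepted, per the description above.
bearer = auth_headers("YOUR_API_KEY")
api_key_style = auth_headers("YOUR_API_KEY", scheme="Api-Key")
```

The resulting dictionary can be passed as the headers argument of whichever HTTP library you use.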
Path Parameters
Response
A Model API catalog row, optionally enriched with workspace-specific state.
Identifier of the Model API. Stable, URL-safe slug used as the public identifier.
"llama-3-3-70b-instruct"
Human-readable name of the Model API.
"Llama 3.3 70B Instruct"
Description of the Model API.
Date the Model API was made available.
Base URL for invoking the Model API. OpenAI-shaped routes (e.g. /v1/chat/completions) live underneath this host.
"https://inference.baseten.co"
The model's context window length, in tokens.
8192
Cost per million input tokens, in dollars.
"0.13"
Cost per million output tokens, in dollars.
"0.50"
Rate limits in effect for the workspace. Workspace-specific overrides are returned when the workspace has added this Model API and configured them; otherwise the catalog default rate limits are returned.
Family the underlying model belongs to.
"META"
Workspace-specific state. Null when the workspace has not added this Model API.