Every model running on Baseten is accessible over HTTPS through the inference API. The API provides two paths depending on how your model is served. Model APIs offer managed, high-performance LLMs through a single OpenAI-compatible endpoint, with no deployment step required. Deployed model endpoints serve custom models and chains that you package and deploy with Truss, each routed through a dedicated subdomain.

Model APIs

Model APIs give you instant access to popular open-source LLMs with optimized serving. Baseten manages the infrastructure — shared GPU clusters, model weights, and serving configuration — so there is no deployment step and nothing to configure. The supported catalog includes models like DeepSeek, GLM, and Kimi, with all models supporting tool calling and most supporting structured outputs. Pricing is per million tokens. Because Model APIs implement the OpenAI chat completions format, switching from OpenAI to Baseten requires only changing the base URL and API key in your existing client. All requests route through a single endpoint:
https://inference.baseten.co/v1/chat/completions
The Chat Completions reference covers request and response schemas. For usage details including structured outputs and tool calling, refer to the Model APIs guide.
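Because the endpoint speaks the OpenAI chat completions format, any OpenAI-style client works once it points at the URL above. Below is a minimal sketch in Python that assembles such a request; the model slug `deepseek-ai/DeepSeek-V3` is an illustrative placeholder, so check the catalog for the exact identifiers available to you:

```python
import os

BASE_URL = "https://inference.baseten.co/v1/chat/completions"

def build_chat_request(model: str, messages: list, api_key: str) -> dict:
    """Assemble keyword arguments for an OpenAI-compatible chat request."""
    return {
        "url": BASE_URL,
        "headers": {
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        "json": {"model": model, "messages": messages},
    }

if __name__ == "__main__":
    req = build_chat_request(
        model="deepseek-ai/DeepSeek-V3",  # placeholder model slug
        messages=[{"role": "user", "content": "Hello!"}],
        api_key=os.environ.get("BASETEN_API_KEY", ""),
    )
    # Send with any HTTP client, e.g.: requests.post(**req)
```

The only Baseten-specific pieces are the base URL and the API key; the payload is the standard chat completions schema.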

Deployed model endpoints

When you deploy a custom model or chain with Truss, Baseten assigns it a dedicated subdomain for routing. This is the path for models that aren’t in the Model APIs catalog — models with custom serving logic, fine-tuned weights, or multi-step inference pipelines built as chains. You control the hardware, scaling behavior, and serving engine. Each endpoint URL includes a deployment target: an environment name like production, the development deployment, or a specific deployment ID. For models:
https://model-{model_id}.api.baseten.co/{deployment_type_or_id}/{endpoint}
For chains:
https://chain-{chain_id}.api.baseten.co/{deployment_type_or_id}/{endpoint}
  • model_id: the model’s alphanumeric ID, found in your model dashboard.
  • chain_id: the chain’s alphanumeric ID, found in your chain dashboard.
  • deployment_type_or_id: either development, production, or a specific deployment’s alphanumeric ID.
  • endpoint: the API action, such as predict.
For long-running tasks, the inference API supports asynchronous inference with priority queuing.
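An async call posts to an `async_predict` path instead of `predict` and returns immediately with a request ID rather than the model output. The payload shape sketched here (a `model_input` wrapper plus an optional `webhook_endpoint` callback field) is an assumption based on common async-webhook patterns, not a confirmed schema; consult the async inference reference for the exact request body:

```python
from typing import Optional

def build_async_predict(model_input: dict, webhook: Optional[str] = None) -> dict:
    """Wrap a model input for an async_predict request body.

    ASSUMPTION: the body nests inputs under "model_input" and accepts an
    optional "webhook_endpoint" to be called when the request completes.
    """
    body = {"model_input": model_input}
    if webhook is not None:
        body["webhook_endpoint"] = webhook
    return body

# "https://example.com/hook" is a placeholder callback URL.
print(build_async_predict({"prompt": "hi"}, "https://example.com/hook"))
```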

Predict endpoints

Method       Endpoint                                         Description
POST         /environments/{env_name}/predict                 Call an environment.
POST         /development/predict                             Call the development deployment.
POST         /deployment/{deployment_id}/predict              Call any deployment.
POST         /environments/{env_name}/async_predict           For async inference, call the deployment associated with the specified environment.
POST         /development/async_predict                       For async inference, call the development deployment.
POST         /deployment/{deployment_id}/async_predict        For async inference, call any published deployment of your model.
WEBSOCKET    /environments/{env_name}/websocket               For WebSockets, connect to an environment.
WEBSOCKET    /development/websocket                           For WebSockets, connect to the development deployment.
WEBSOCKET    /deployment/{deployment_id}/websocket            For WebSockets, connect to a deployment.
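The rows above all follow the same URL shapes. Here is a small helper that assembles them, shown with a placeholder model ID (`abcd1234`):

```python
from typing import Optional

def predict_url(kind: str, resource_id: str, *,
                environment: Optional[str] = None,
                deployment_id: Optional[str] = None,
                action: str = "predict") -> str:
    """Build an inference URL for a model or chain subdomain.

    With neither environment nor deployment_id given, the development
    deployment is targeted. action is e.g. "predict" or "async_predict".
    """
    base = f"https://{kind}-{resource_id}.api.baseten.co"
    if environment is not None:
        return f"{base}/environments/{environment}/{action}"
    if deployment_id is not None:
        return f"{base}/deployment/{deployment_id}/{action}"
    return f"{base}/development/{action}"

# "abcd1234" is a placeholder ID.
print(predict_url("model", "abcd1234", environment="production"))
# https://model-abcd1234.api.baseten.co/environments/production/predict
```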

Async status endpoints

Method       Endpoint                                         Description
GET          /async_request/{request_id}                      Get the status of a model async request.
GET          /async_request/{request_id}                      Get the status of a chain async request (same path, sent to the chain's subdomain).
DELETE       /async_request/{request_id}                      Cancel an async request.
GET          /environments/{env_name}/async_queue_status      Get the async queue status for a model associated with the specified environment.
GET          /development/async_queue_status                  Get the status of the development deployment's async queue.
GET          /deployment/{deployment_id}/async_queue_status   Get the status of a deployment's async queue.
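Checking on an async request is a plain authenticated GET against the same subdomain that accepted it. A hedged sketch using only the standard library; the IDs are placeholders, and the response schema is not shown here, so consult the async inference reference for the fields it contains:

```python
import json
import urllib.request

def status_url(model_id: str, request_id: str) -> str:
    """Build the async status URL for a request made to a model."""
    return f"https://model-{model_id}.api.baseten.co/async_request/{request_id}"

def get_status(model_id: str, request_id: str, api_key: str) -> dict:
    """Fetch and decode the status of an async request."""
    req = urllib.request.Request(
        status_url(model_id, request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# "abcd1234" and "req-001" are placeholder IDs.
print(status_url("abcd1234", "req-001"))
# https://model-abcd1234.api.baseten.co/async_request/req-001
```

For a chain's request, swap the `model-{model_id}` subdomain for `chain-{chain_id}`; the path is identical.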

Wake endpoints

Method       Endpoint                                         Description
POST         /production/wake                                 Wake the production environment of your model.
POST         /development/wake                                Wake the development deployment of your model.
POST         /deployment/{deployment_id}/wake                 Wake any deployment of your model.
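Waking a scaled-to-zero deployment ahead of traffic is just a POST to the matching wake path. A minimal URL-building sketch with a placeholder model ID:

```python
def wake_url(model_id: str, target: str = "production") -> str:
    """Build a wake URL.

    target is "production", "development", or "deployment/{deployment_id}".
    """
    return f"https://model-{model_id}.api.baseten.co/{target}/wake"

# "abcd1234" is a placeholder model ID.
print(wake_url("abcd1234"))
# https://model-abcd1234.api.baseten.co/production/wake
```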