Get all model deployments

cURL

curl --request GET \
--url https://api.baseten.co/v1/models/{model_id}/deployments \
--header "Authorization: Api-Key $BASETEN_API_KEY"

{
  "deployments": [
    {
      "id": "<string>",
      "created_at": "2023-11-07T05:31:56Z",
      "name": "<string>",
      "model_id": "<string>",
      "is_production": true,
      "is_development": true,
      "status": "BUILDING",
      "active_replica_count": 123,
      "autoscaling_settings": {
        "min_replica": 123,
        "max_replica": 123,
        "autoscaling_window": 123,
        "scale_down_delay": 123,
        "concurrency_target": 123,
        "target_utilization_percentage": 123
      },
      "instance_type_name": "<string>",
      "environment": "<string>"
    }
  ]
}

Authorizations

Authorization

string

header

required

You must specify the scheme 'Api-Key' in the Authorization header. For example, Authorization: Api-Key <Your_Api_Key>

Path Parameters

model_id

string

required

Response

200 - application/json

A list of deployments of a model.

deployments

DeploymentV1 · object[]

required

A list of deployments of a model

Show child attributes

ProductionGets a model's production deployment and returns the deployment.

⌘I

cURL

curl --request GET \
--url https://api.baseten.co/v1/models/{model_id}/deployments \
--header "Authorization: Api-Key $BASETEN_API_KEY"

{
  "deployments": [
    {
      "id": "<string>",
      "created_at": "2023-11-07T05:31:56Z",
      "name": "<string>",
      "model_id": "<string>",
      "is_production": true,
      "is_development": true,
      "status": "BUILDING",
      "active_replica_count": 123,
      "autoscaling_settings": {
        "min_replica": 123,
        "max_replica": 123,
        "autoscaling_window": 123,
        "scale_down_delay": 123,
        "concurrency_target": 123,
        "target_utilization_percentage": 123
      },
      "instance_type_name": "<string>",
      "environment": "<string>"
    }
  ]
}

Reference

Inference API

Management API

CLI reference

SDK reference

Get all model deployments

Authorizations

Path Parameters

Response