Update model deployment autoscaling settings
Any model deployment by ID
Updates a deployment's autoscaling settings and returns the update status.
Authorizations
Authorization
string
header, required
You must specify the scheme 'Api-Key' in the Authorization header. For example, Authorization: Api-Key <Your_Api_Key>
Path Parameters
model_id
string
required
deployment_id
string
required
Body
application/json
A request to update autoscaling settings for a deployment. All fields are optional; only the fields included in the request are updated.
min_replica
integer | null
Minimum number of replicas
max_replica
integer | null
Maximum number of replicas
autoscaling_window
integer | null
Timeframe of traffic considered for autoscaling decisions
scale_down_delay
integer | null
Waiting period before scaling down any active replica
concurrency_target
integer | null
Number of requests per replica before scaling up
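The request described above can be sketched in Python using only the standard library. This is a minimal illustration, not the official client: the HTTP method (PATCH), the URL path, the `base_url` argument, and the helper name `build_autoscaling_request` are all assumptions, since this page does not state them. Only the `Api-Key` authorization scheme and the five body field names come from the reference above.

```python
import json
import urllib.request

# Body fields documented on this page.
ALLOWED_FIELDS = {
    "min_replica",
    "max_replica",
    "autoscaling_window",
    "scale_down_delay",
    "concurrency_target",
}


def build_autoscaling_request(base_url, model_id, deployment_id, api_key, **settings):
    """Build (but do not send) a request that updates autoscaling settings.

    ASSUMPTIONS: the PATCH method and the URL path below are illustrative
    guesses; check the endpoint definition before using them.
    """
    unknown = set(settings) - ALLOWED_FIELDS
    if unknown:
        raise ValueError(f"Unknown autoscaling fields: {sorted(unknown)}")

    # Only the fields passed in are serialized; omitted fields are left out
    # of the body so the server does not update them.
    body = json.dumps(settings).encode("utf-8")

    # Hypothetical path built from the two documented path parameters.
    url = (
        f"{base_url.rstrip('/')}/models/{model_id}"
        f"/deployments/{deployment_id}/autoscaling_settings"
    )
    return urllib.request.Request(
        url,
        data=body,
        method="PATCH",  # assumption, not stated on this page
        headers={
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
    )
```

To send the request, pass the returned object to `urllib.request.urlopen`. Building the request separately from sending it makes the headers and body easy to inspect before any network call is made.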
Response
200 - application/json
The response to a request to update autoscaling settings.
status
enum<string>
required
Status of the request to update autoscaling settings
Available options: ACCEPTED, QUEUED, UNCHANGED
message
string
required
A message describing the status of the request to update autoscaling settings
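A caller may want to validate the 200 response before acting on it. The sketch below assumes the response body is a JSON object carrying exactly the two documented fields; the helper name `parse_update_response` is hypothetical, and the example message text in the usage note is invented for illustration.

```python
import json

# Status values listed under "Available options" on this page.
KNOWN_STATUSES = {"ACCEPTED", "QUEUED", "UNCHANGED"}


def parse_update_response(payload):
    """Parse the JSON response body and return (status, message).

    Raises ValueError if a required field is missing or the status is not
    one of the documented options.
    """
    data = json.loads(payload)
    for field in ("status", "message"):
        if field not in data:
            raise ValueError(f"Missing required field: {field}")
    if data["status"] not in KNOWN_STATUSES:
        raise ValueError(f"Unexpected status: {data['status']}")
    return data["status"], data["message"]
```

For example, `parse_update_response('{"status": "QUEUED", "message": "Update queued"}')` returns the tuple `("QUEUED", "Update queued")`.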