Update model deployment autoscaling settings
Production model deployment
Updates a production deployment's autoscaling settings and returns the update status.
Authorizations
You must specify the scheme 'Api-Key' in the Authorization header. For example, Authorization: Api-Key <Your_Api_Key>
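As an illustration of the header format above, here is a minimal Python sketch that builds the request headers. The helper name `auth_headers` is hypothetical, and including a `Content-Type` header is an assumption based on the JSON body this endpoint accepts.

```python
from __future__ import annotations


def auth_headers(api_key: str) -> dict[str, str]:
    """Build headers that use the 'Api-Key' authorization scheme.

    `auth_headers` is a hypothetical helper, not part of any official client.
    """
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    return {
        # Scheme name 'Api-Key' followed by a space and the key itself.
        "Authorization": f"Api-Key {api_key}",
        # Assumption: the body is sent as JSON, as listed under Body.
        "Content-Type": "application/json",
    }
```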
Path Parameters
Body
application/json
A request to update autoscaling settings for a deployment. All fields are optional; only the fields included in the request are updated.
Minimum number of replicas
Example: 0
Maximum number of replicas
Example: 7
Timeframe of traffic considered for autoscaling decisions
Example: 600
Waiting period before scaling down any active replica
Example: 120
Number of requests per replica before scaling up
Example: 2
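Because only the fields present in the request are updated, a client should leave out anything it does not intend to change. The sketch below builds such a partial JSON body in Python. The field names (`min_replica`, `max_replica`, `autoscaling_window`, `scale_down_delay`, `concurrency_target`) are assumptions, since this page does not list them; check them against the actual request schema before use.

```python
import json
from typing import Optional


def build_autoscaling_body(
    min_replica: Optional[int] = None,
    max_replica: Optional[int] = None,
    autoscaling_window: Optional[int] = None,
    scale_down_delay: Optional[int] = None,
    concurrency_target: Optional[int] = None,
) -> str:
    """Serialize only the settings that were explicitly passed.

    All field names here are assumed, not taken from this page.
    """
    candidates = {
        "min_replica": min_replica,
        "max_replica": max_replica,
        "autoscaling_window": autoscaling_window,
        "scale_down_delay": scale_down_delay,
        "concurrency_target": concurrency_target,
    }
    # Compare against None rather than truthiness so that a legitimate
    # value of 0 (for example, a minimum of 0 replicas) is still sent.
    body = {name: value for name, value in candidates.items() if value is not None}
    return json.dumps(body)
```

Calling `build_autoscaling_body(min_replica=0, max_replica=7)` yields a body containing only those two settings, so the other three are left untouched by the update.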
Response
200 - application/json
The response to a request to update autoscaling settings.
Status of the request to update autoscaling settings
Available options: ACCEPTED, QUEUED, UNCHANGED
A message describing the status of the request to update autoscaling settings
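A client can validate the returned status against the three documented options. The following Python sketch parses a response body; the JSON keys `status` and `message` are assumptions, since this page describes the two response fields without naming them.

```python
from __future__ import annotations

import json
from enum import Enum


class UpdateStatus(str, Enum):
    """The status options listed for this endpoint."""

    ACCEPTED = "ACCEPTED"
    QUEUED = "QUEUED"
    UNCHANGED = "UNCHANGED"


def parse_update_response(raw: str) -> tuple[UpdateStatus, str]:
    """Parse a response body into (status, message).

    Assumes the keys are named 'status' and 'message'. Raises ValueError
    if the status is not one of the documented options.
    """
    data = json.loads(raw)
    status = UpdateStatus(data["status"])
    message = data.get("message", "")
    return status, message
```

Treating an unrecognized status as an error, rather than ignoring it, makes it obvious if the set of options ever changes.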