Autoscaling dynamically adjusts the number of active replicas to handle variable traffic while minimizing idle compute costs.
0
(scale to zero).1
.10
by default (contact support to increase).1
replica (or the minimum count, if higher). As traffic increases, additional replicas scale up until the maximum count is reached. When traffic decreases, replicas scale down to match demand.
[Coldboost]
.
0
1
60 seconds
900 seconds (15 min)
1 request