Persist data across replicas or deployments
torch.compile
results in a cache that can speed up future torch.compile
on the same function. This can speed up other replicas’ cold start times.
These files can be stored via b10cache. b10cache is a volume mounted over the network onto each of your pods. There are two ways files can be stored:
/cache/org/
/cache/model/