Skip to main content

Early Access

Please contact our support team for access to b10cache.
Deployments sometimes have cache or other files that are useful to other replicas. Using torch.compile results in a cache that can speed up future torch.compile on the same function. This can speed up other replicas’ cold start times. These files can be stored via b10cache. b10cache is a volume mounted over the network onto each of your pods. There are two ways files can be stored:

1. /cache/org/

This directory is shared, and can be written to or accessed by every pod you deploy. Simply move a file into here and it will be accessible.

2. /cache/model/

This directory is shared by every pod within the scope of your deployment. This is excellent for keeping filesystems clean and limiting access.

Not a persistent object storage

While b10cache is very reliable, it should not be used as a persistent object storage or database. It should be considered a cache that can be shared by deployments, meaning there should always be a fallback plan if the b10cache path does not exist.
See two features built on b10cache:
  1. model cache
  2. torch compile cache
⌘I