Please contact our support team for access to b10cache.
Deployments sometimes have cache or other files that are useful to other replicas. Using torch.compile results in a cache that can speed up future torch.compile on the same function. This can speed up other replicasβ cold start times.These files can be stored via b10cache. b10cache is a volume mounted over the network onto each of your pods. There are two ways files can be stored:
While b10cache is very reliable, it should not be used as a persistent object storage or database. It should be considered a cache that can be shared by deployments, meaning there should always be a fallback plan if the b10cache path does not exist.