To deploy from Baseten Training checkpoints instead, see Deploy with optimized inference engines.
Storage sources
Thecheckpoint_repository field in your config specifies where the engine pulls weights from. The source field accepts the following providers:
S3: Amazon S3 buckets.GCS: Google Cloud Storage.AZURE: Azure Blob Storage.HF: Hugging Face repositories.
revision field pins a specific commit or branch. For Hugging Face repos, this is a git ref (branch name, tag, or commit SHA). If unset, the engine uses the default branch. For cloud storage sources (S3, GCS, Azure), revision is not applicable. The repo path points to a specific prefix.
Here’s a minimal example using S3:
config.yaml
Private storage credentials
To access private storage, add a JSON secret to your Baseten secrets manager and reference it withruntime_secret_name in your config.
- S3
- GCS
- Azure
- Hugging Face
Add a secret with your AWS credentials:Then reference the secret in your config:See AWS S3 authentication for full setup details including OIDC.
config.yaml
Related
- Configure Engine-Builder-LLM deployments: Complete build and runtime options for LLMs.
- Configure BEI deployments: Complete configuration for encoder models.
- Set up cloud storage authentication: OIDC and service account authentication for cloud storage.
- Manage deployment secrets: Configure credentials for private storage.