Training checkpoint deployment
Deploy fine-tuned models from Baseten Training with Engine-Builder-LLM. SpecifyBASETEN_TRAINING as the source:
config.yaml
base_model:decoderfor LLMs,encoder/encoder_bertfor embeddingssource:BASETEN_TRAININGfor Baseten Training checkpointsrepo: Your training job IDrevision: Checkpoint folder name (e.g.,checkpoint-100,checkpoint-final)
Cloud storage deployment
Deploy models directly from S3, GCS, or Azure. Specify the storage source and bucket path:config.yaml
S3: Amazon S3 bucketsGCS: Google Cloud StorageAZURE: Azure Blob StorageHF: Hugging Face repositories
Private storage setup
All runtimes use the same downloader system as model_cache. As a result, you configure theruntime_secret_name and repo identically across model_cache and runtimes like Engine-Builder-LLM or BEI.
Secret Setup:
Add these JSON secrets to your Baseten secrets manager.
For more details, refer to the documentation in model_cache.
S3:
Further reading
- Engine-Builder-LLM configuration: Complete build and runtime options.
- Model cache documentation: Caching strategies used by the engines.
- Secrets management: Configure credentials for private storage.