hf_access_token in workspace secrets, because the deployment downloads the base weights from Hugging Face.
Deploy from the CLI
Runtruss loops checkpoints deploy with the checkpoint’s globally unique id (the id field from listing checkpoints, not the checkpoint name):
--checkpoint-ids flag skips the checkpoint picker, but the command still prompts for a model name, GPU type, GPU count, and the Hugging Face secret name (default hf_access_token). A successful deploy prints the IDs you need to call the model:
ACTIVE.
To inspect the generated Truss config without deploying, pass --dry-run.
Deploy from the dashboard
Every Loop has a page in the dashboard atapp.baseten.co/training/loop/<run_id>, reachable from the Training tab. Its Checkpoints table lists each saved checkpoint with its bt:// path, type, and size, and is where you deploy one; the flow matches deploying job checkpoints.
Call the deployed model
Call the deployment’s OpenAI-compatible chat completions route. The URL includes the deployment ID, andmodel is the checkpoint name (checkpoint_id), not its globally unique id:
Delete the deployment
The deployment bills for its GPU while it’s live. Delete it withDELETE /v1/models/{model_id} when you’re done:
Next steps
- Train on a dataset: Move from the quickstart’s single example to a real training loop.
- Loops concepts: How checkpoints relate to sessions, trainers, and samplers.