Loops is in early access. Fill out the signup form to request access for your workspace.
How Loops works
Loops provides API-driven training infrastructure by deploying training servers that execute traditional forward and backward passes plus optimizer steps. It runs RL rollouts on separate sampling servers that scale independently of the trainer, which keeps generation fast for compute-intensive workloads. Trainer and sampler stay synchronized through weight transfers that you can await synchronously or asynchronously, so you can stay on-policy or run bounded off-policy algorithms. In Loops, you own your checkpoints. You can download them as presigned URLs or deploy them onto Baseten’s Inference Stack through the UI, CLI, or API. If you’re not sure Loops is the right path for your team, the Training overview compares Loops withtruss train (the bring-your-own-container alternative) side by side.