Skip to main content
Welcome to Baseten Training, a powerful product designed to streamline and manage the entire lifecycle of model training.

Who it’s for

Baseten Training is for MLEs, Engineers, and Developers who are looking for a fast, flexible, and scalable platform for training and finetuning models – and getting these models into production.

Use cases

Training on Baseten allows you to easily:
  • Optimize on cost and latency: Train a smaller, faster, and cheaper model from a larger, more general, expensive one.
  • Develop specialized and agentic models: Finetune a model with RL to do specific tasks, like code completion and tool calling.
  • Craft customized voice models: Finetune a voice model like Orpheus to speak with specific intonations and accents.
  • Get to prod: Productionize trained models to scalable deployments with a click of a button.

Why Baseten Training

Streamline your path from training to prod

We want to make sure you’re getting to impact as fast as possible. Our training platform prioritizes the end-to-end model development cycle with these key features:
  • Instantly up and running: Check out our getting started guide for a step-by-step how-to and our ml-cookbook for recipes and examples spanning Supervised Finetuning, Reinforcement Learning, and a variety of frameworks.
  • Flexibility: Stay up to date with the most impactful training recipes and techniques across text, vision, and audio with our framework-agnostic training API.
  • Seamless Deploys: Transition from training to inference and evals seamlessly within the Baseten ecosystem.

Seamlessly scale your experimentation

We know training and finetuning are experimental in nature. Our platform provides the infrastructure you need to scale out and scale up:
  • Reproducibility: Ensure consistent training runs by precisely defining your environment, code, and configurations.
  • Scalability: Easily scale your training jobs from single-gpu, to multi-gpu, and even to multi-node distributed training. Handle large datasets, large sequence lengths, and complex models - all without any commits.
  • Simplified Management: Organize, monitor, and manage your training projects and jobs in a centralized platform.
  • Artifact Management: Expedite handling of large artifacts like models, checkpoints, and datasets efficiently with Baseten storage.

Get Started

Check out our Getting Started guide to get started with training on Baseten.

Go deeper

Use the following resources to learn more about training on Baseten: