Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.baseten.co/llms.txt

Use this file to discover all available pages before exploring further.

Each row below is a Hugging Face repo ID you can pass as base_model when starting a Loops run, along with the maximum supported sequence length. Baseten adds rows as new models are validated end to end.

Models

ModelMax sequence length
Qwen/Qwen3.6-35B-A3B131,072
Qwen/Qwen3.6-27B131,072
Qwen/Qwen3.5-9B131,072
Qwen/Qwen3.5-4B131,072
Qwen/Qwen3.5-2B131,072
Qwen/Qwen3.5-0.8B131,072
Qwen/Qwen3.5-122B-A10BContact support
Qwen/Qwen3.5-397B-A17BContact support
moonshotai/Kimi-K2.6Contact support
Qwen/Qwen3-30B-Instruct-2507131,072
deepseek-ai/DeepSeek-V4-ProContact support
deepseek-ai/DeepSeek-V4-FlashContact support
zai-org/GLM-5.1Contact support
MiniMaxAI/MiniMax-M2.7Contact support

Pass a model to Loops

Pass the table value verbatim as base_model through any of the following entry points:
  • The Python SDK, via tinker.ServiceClient.create_lora_training_client(base_model=...). See the Loops quickstart.
  • The HTTP API, via POST /v1/loops/runs.
  • The CLI, via truss loops push <base_model>, which provisions a session, run, and paired sampler in one call.
The minimal HTTP call provisions a run and its paired sampler against an existing session. Replace sess_xyz789 with the session.id returned by POST /v1/loops/sessions:
curl --request POST \
  --url https://api.baseten.co/v1/loops/runs \
  --header "Authorization: Bearer $BASETEN_API_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "session_id": "sess_xyz789",
    "base_model": "Qwen/Qwen3.5-9B"
  }'
For the full request body, response shape, and remaining routes, see the Loops API reference.

Request a model

To request a base model that isn’t listed, contact support.