Make raw HTTP requests to Baseten management or inference APIs. The HTTP method defaults to GET, or POST when —field, —raw-field, or —input is provided. JSON responses are pretty-printed by default; non-JSON responses are streamed raw. Use —jq to filter JSON responses.Documentation Index
Fetch the complete documentation index at: https://docs.baseten.co/llms.txt
Use this file to discover all available pages before exploring further.
management
Options
Add a string field (key=value), parsed as JSON value
Add a request header (key:value)
Read request body from file (use - for stdin)
Filter JSON output with a jq expression; implies —output json (or jsonl for streamed commands)
HTTP method, defaults to GET or POST if fields are provided
Output formatOne of:
text, json, jsonl, noneAdd a raw string field (key=value)
Baseten remote URL, overrides BASETEN_REMOTE_URL (default https://app.baseten.co)
Enable verbose logging
Examples
GET a management resourceFilter output with --jq
List model IDs from /v1/models
Output
Text mode (--output text): The HTTP response body, passed through verbatim. JSON responses are pretty-printed; non-JSON responses are streamed raw to stdout.
JSON mode (--output json): payload type cmd.JSONUndefined.
Shape depends on the requested endpoint. See the management API OpenAPI spec at https://api.baseten.co/v1/spec.
inference
Options
Chain ID to target
Environment name (e.g. production)
Add a string field (key=value), parsed as JSON value
Add a request header (key:value)
Read request body from file (use - for stdin)
Filter JSON output with a jq expression; implies —output json (or jsonl for streamed commands)
HTTP method, defaults to GET or POST if fields are provided
Model ID to target
Output formatOne of:
text, json, jsonl, noneAdd a raw string field (key=value)
Baseten remote URL, overrides BASETEN_REMOTE_URL (default https://app.baseten.co)
Enable verbose logging
Examples
POST a predict body to a modelFilter output with --jq
Filter a JSON predict response
Output
Text mode (--output text): The inference endpoint’s response body, passed through verbatim. JSON responses are pretty-printed; non-JSON responses are streamed raw.
JSON mode (--output json): payload type cmd.JSONUndefined.
Shape depends on the model and endpoint. See the inference API OpenAPI spec at https://api.baseten.co/inference-spec.