Exporting metrics is in beta mode.

Baseten exposes an endpoint that returns real-time metrics in the Prometheus format. By using this endpoint as a Prometheus / OpenMetrics scrape endpoint, you can integrate Baseten metrics with compatible software like Prometheus, OpenTelemetry Collector / Agent, Datadog Agent, Vector, and more.

To scrape metrics from Baseten:

  1. Set the scrape endpoint to app.baseten.co/metrics.
  2. Configure the Authorization header for scrape requests. This should be set to your Baseten API key, prefixed withย Api-Keyย (e.g.ย {"Authorization": "Api-Key abcd1234.abcd1234"}).
  3. Set an appropriate scrape interval. We recommend scraping metrics every minute.

Supported services

The Baseten metrics endpoint can be integrated with OpenTelemetry Collector by configuring a Prometheus receiver that scrapes the endpoint. Baseten metrics can be pushed to any exporter on the OpenTelemetry registry. We integrate with providers including:

For a list of supported metrics, see the supported metrics reference.

Rate limits

Calls to the Baseten metrics endpoint are limited on a per-organization basis to 6 requests per minute. If this limit is exceeded, subsequent calls will result in 429 responses.

To avoid hitting this limit, we recommend setting your scrape interval to 1 minute.