Asynchronously call a specific deployment of a model.
Authorizations
Pass your Baseten API key. Clients automatically send Authorization: Bearer <key>. Direct callers can also use Authorization: Api-Key <key>; both schemes are accepted.
Path Parameters
The alphanumeric ID of the deployment.
Body
There is a 256 KiB size limit on async predict request payloads.
JSON-serializable model input.
HTTPS URL to receive the prediction result via webhook. Both HTTP/2 and HTTP/1.1 are supported. If omitted, the model must save outputs so they can be accessed later.
Priority of the request. Lower values are higher priority.
0 <= x <= 2Maximum time in seconds a request will spend in the queue before expiring. Must be between 10 seconds and 72 hours.
10 <= x <= 259200Exponential backoff parameters for retrying predict requests.
Response
Async predict request enqueued.
The ID of the async request.