Asynchronously call a regional environment of a model.
Enqueues an asynchronous predict request via a regional hostname. The environment is determined by the hostname, not the path.
Authorizations
Pass your Baseten API key. Clients automatically send Authorization: Bearer <key>. Direct callers can also use Authorization: Api-Key <key>; both schemes are accepted.
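As a minimal sketch of the two accepted schemes, the header can be built as below. The helper name `auth_headers` is hypothetical, and the default of `Api-Key` is an arbitrary choice for illustration.

```python
def auth_headers(api_key: str, scheme: str = "Api-Key") -> dict[str, str]:
    """Return an Authorization header using one of the two accepted schemes."""
    if scheme not in ("Api-Key", "Bearer"):
        raise ValueError(f"unsupported scheme: {scheme!r}")
    if not api_key:
        raise ValueError("api_key must be non-empty")
    # Header shape is "<scheme> <key>", matching both forms described above.
    return {"Authorization": f"{scheme} {api_key}"}
```

Either `auth_headers(key)` or `auth_headers(key, "Bearer")` should be accepted, since both schemes are documented as valid.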
Body
There is a 256 KiB size limit on async predict request payloads.
JSON-serializable model input.
HTTPS URL to receive the prediction result via webhook. Both HTTP/2 and HTTP/1.1 are supported. If omitted, the model must save outputs so they can be accessed later.
Priority of the request. Lower values are higher priority.
0 <= x <= 2
Maximum time in seconds a request will spend in the queue before expiring. Must be between 10 seconds and 72 hours.
10 <= x <= 259200
Exponential backoff parameters for retrying predict requests.
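The body constraints can be checked client-side before sending. The sketch below is illustrative only: the field names (`model_input`, `webhook_endpoint`, `priority`, `max_time_in_queue_seconds`) are assumptions, since this section does not name them, and the retry parameters are left out because their structure is not described here. The size check measures the UTF-8 JSON encoding, which may not match exactly how the server measures the payload.

```python
import json

MAX_PAYLOAD_BYTES = 256 * 1024  # 256 KiB limit on async predict payloads


def build_async_body(model_input, webhook_endpoint=None, priority=None,
                     max_time_in_queue_seconds=None) -> bytes:
    """Validate the documented limits and return the JSON-encoded body."""
    body = {"model_input": model_input}  # assumed field name

    if webhook_endpoint is not None:
        # The webhook must be an HTTPS URL.
        if not webhook_endpoint.startswith("https://"):
            raise ValueError("webhook endpoint must be an HTTPS URL")
        body["webhook_endpoint"] = webhook_endpoint  # assumed field name

    if priority is not None:
        # Lower values are higher priority; allowed range is 0..2.
        if not 0 <= priority <= 2:
            raise ValueError("priority must satisfy 0 <= x <= 2")
        body["priority"] = priority  # assumed field name

    if max_time_in_queue_seconds is not None:
        # 10 seconds up to 72 hours (259200 seconds).
        if not 10 <= max_time_in_queue_seconds <= 259200:
            raise ValueError("queue time must satisfy 10 <= x <= 259200")
        body["max_time_in_queue_seconds"] = max_time_in_queue_seconds  # assumed

    encoded = json.dumps(body).encode("utf-8")
    if len(encoded) > MAX_PAYLOAD_BYTES:
        raise ValueError("payload exceeds the 256 KiB limit")
    return encoded
```

For example, `build_async_body({"prompt": "hello"}, priority=0, max_time_in_queue_seconds=600)` returns bytes ready to use as the request body, while a priority of 3 raises `ValueError` before any request is made.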
Response
Async predict request enqueued.
The ID of the async request.
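A hedged sketch of sending the request and reading back the ID, using only the Python standard library. The response field name `request_id` is an assumption, as is the `Api-Key` scheme chosen for the header; the regional URL is passed in by the caller because this section does not spell out the hostname or path.

```python
import json
import urllib.request


def extract_request_id(raw: bytes) -> str:
    """Pull the async request ID out of a JSON response body."""
    data = json.loads(raw)
    # "request_id" is an assumed field name for the returned ID.
    request_id = data.get("request_id") if isinstance(data, dict) else None
    if not isinstance(request_id, str) or not request_id:
        raise ValueError("response did not contain a request ID")
    return request_id


def enqueue(url: str, api_key: str, body: bytes, timeout: float = 30.0) -> str:
    """POST an already-encoded JSON body and return the async request ID."""
    req = urllib.request.Request(
        url,  # caller supplies the regional endpoint URL
        data=body,
        headers={
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return extract_request_id(resp.read())
```

Keeping the ID is useful when no webhook is configured, since it is the only handle returned for the enqueued request.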