Development deployment
Use this endpoint to call the development deployment of your model asynchronously.
Parameters
The ID of the model you want to call.
Headers
Your Baseten API key, formatted with the prefix Api-Key (e.g. {"Authorization": "Api-Key abcd1234.abcd1234"}).
Body
Request payloads to /async_predict are limited to 256 KiB.
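As a rough illustration of the size limit, a client could measure the serialized payload before sending it. This is a minimal sketch: the helper names are hypothetical, and it assumes the limit applies to the UTF-8 encoded JSON body and is inclusive, which the text above does not spell out.

```python
import json

# 256 KiB, the limit stated above (262,144 bytes).
MAX_PAYLOAD_BYTES = 256 * 1024


def payload_size_bytes(payload: dict) -> int:
    """Return the size of the payload once serialized to UTF-8 JSON."""
    return len(json.dumps(payload).encode("utf-8"))


def fits_async_limit(payload: dict) -> bool:
    """Return True if the serialized payload is within the limit.

    Assumption: the limit is measured on the serialized request body
    and a payload of exactly 256 KiB is accepted.
    """
    return payload_size_bytes(payload) <= MAX_PAYLOAD_BYTES
```

Checking locally avoids a round trip for requests that would be rejected anyway; large inputs would typically be passed by reference (for example, as a URL) rather than inline.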
JSON-serializable model input.
URL of the webhook endpoint. Webhook endpoints must use HTTPS.
Priority of the request. A lower value corresponds to a higher priority (e.g. requests with priority 0 are scheduled before requests with priority 1). priority must be between 0 and 2, inclusive.
Maximum time a request will spend in the queue before expiring. max_time_in_queue_seconds must be between 10 seconds and 12 hours, inclusive.
Exponential backoff parameters used to retry the model predict request.
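The constraints on the body fields can be checked client-side before a request is sent. The sketch below is illustrative only: `build_async_body` is a hypothetical helper, and the keys `model_input` and `webhook_endpoint` are assumed names for the model input and webhook URL fields, since only `priority` and `max_time_in_queue_seconds` are named in the text above. The retry parameters are left out because their field names are not given here.

```python
MIN_QUEUE_SECONDS = 10
MAX_QUEUE_SECONDS = 12 * 60 * 60  # 12 hours = 43,200 seconds


def build_async_body(model_input, webhook_endpoint=None,
                     priority=None, max_time_in_queue_seconds=None) -> dict:
    """Build a request body, enforcing the documented ranges.

    Assumption: "model_input" and "webhook_endpoint" are the field names;
    they are not stated in the surrounding text.
    """
    body = {"model_input": model_input}

    if webhook_endpoint is not None:
        # Webhook endpoints are required to use HTTPS.
        if not webhook_endpoint.startswith("https://"):
            raise ValueError("webhook endpoint must use HTTPS")
        body["webhook_endpoint"] = webhook_endpoint

    if priority is not None:
        # Lower value means higher priority; valid range is 0 to 2 inclusive.
        if not 0 <= priority <= 2:
            raise ValueError("priority must be between 0 and 2, inclusive")
        body["priority"] = priority

    if max_time_in_queue_seconds is not None:
        if not MIN_QUEUE_SECONDS <= max_time_in_queue_seconds <= MAX_QUEUE_SECONDS:
            raise ValueError(
                "max_time_in_queue_seconds must be between 10 and 43200, inclusive"
            )
        body["max_time_in_queue_seconds"] = max_time_in_queue_seconds

    return body
```

For example, `build_async_body({"prompt": "hi"}, priority=0, max_time_in_queue_seconds=600)` yields a body that asks for the highest priority and expires after ten minutes in the queue.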
Response
The ID of the async request.
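Putting the pieces together, a call might be assembled roughly as follows. This is a sketch under several assumptions that the text above does not confirm: the URL pattern `https://model-{model_id}.api.baseten.co/development/async_predict`, the body shape, and the response field name `request_id` are all assumed, and both helper functions are hypothetical. Only the Authorization header format and the /async_predict path come from this page.

```python
import json
import urllib.request


def build_async_request(model_id: str, api_key: str, body: dict) -> urllib.request.Request:
    """Prepare (but do not send) a POST to the development deployment."""
    # Assumed URL pattern; check it against your account's endpoint.
    url = f"https://model-{model_id}.api.baseten.co/development/async_predict"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Header format as documented: the key is prefixed with "Api-Key".
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_request_id(response_text: str) -> str:
    """Pull the async request ID out of a JSON response.

    Assumption: the ID is returned under the key "request_id".
    """
    return json.loads(response_text)["request_id"]
```

The prepared request could then be sent with `urllib.request.urlopen(req)` and the decoded response body passed to `extract_request_id`. Keeping construction separate from sending makes the request easy to inspect or log before any network call is made.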