Async Inference API
Cancel async request
Use this endpoint to cancel a queued async request.
Only QUEUED
requests may be canceled.
Parameters
model_id
string
requiredThe ID of the model that executed the request.
request_id
string
requiredThe ID of the async request.
Headers
Authorization
string
requiredYour Baseten API key, formatted with prefix Api-Key
(e.g. {"Authorization": "Api-Key abcd1234.abcd1234"}
).
Response
request_id
string
requiredThe ID of the async request.
canceled
boolean
requiredWhether the request was canceled.
message
string
requiredAdditional details about whether the request was canceled.
Rate limits
Calls to the cancel async request status endpoint are limited to 20 requests per second. If this limit is exceeded, subsequent requests will receive a 429 status code.