image
domains, see the Hub Catalog.
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
Request to the VLM API (i.e. structured prediction).
List of base64 encoded images or URLs to the images.
The domain identifier (e.g. image.caption
).
aerospace.remote-sensing
, document.invoice
, document.markdown
, document.receipt
, document.resume
, image.caption
, experimental.image.object-detection
, experimental.image.person-detection
, retail.product-catalog
, image.tv-news
, video.caption
Optional metadata to pass to the model.
The VLM generation config to be used for /<dtype>/generate.
Unique identifier of the request.
Date and time when the request was created (in UTC timezone)
The URL to call when the request is completed.
1
The model to use for generating the response.
"vlm-1"
Whether to process the image in batch mode (async).
Response
Successful Response
Base prediction response for all API responses.
The usage metrics for the request.
Unique identifier of the response.
Date and time when the request was created (in UTC timezone)
Date and time when the response was completed (in UTC timezone)
The response from the model.
The status of the job.
pending
, enqueued
, running
, completed
, failed
, paused
The domain of the prediction (e.g. document.invoice, image.caption).