Image -> JSON
Generate a structured prediction for the given image.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Request to the VLM API (i.e. structured prediction).
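As a rough illustration of how the body fields in this section fit together with the Bearer header, here is a minimal Python sketch that builds the request without sending it. This reference describes the fields but does not name them, so the endpoint URL and every JSON key below (image, domain, detail, json_schema, metadata, callback_url, model) are assumptions made for illustration only.

```python
import base64
import json
import urllib.request

# Hypothetical endpoint: the real URL is not given in this reference.
API_URL = "https://api.example.com/v1/image/generate"

# Detail levels listed in this reference.
ALLOWED_DETAIL = {"auto", "hi", "lo"}


def build_request(image_bytes, token, domain, detail="auto",
                  json_schema=None, metadata=None, callback_url=None,
                  model="vlm-1"):
    """Return an unsent POST request carrying the JSON body.

    All JSON key names are assumed; adjust them to the real API.
    """
    if detail not in ALLOWED_DETAIL:
        raise ValueError(f"detail must be one of {sorted(ALLOWED_DETAIL)}")

    body = {
        # The image is sent as a Base64-encoded string.
        "image": base64.b64encode(image_bytes).decode("ascii"),
        "domain": domain,
        "detail": detail,
        "model": model,
    }
    # Optional fields are included only when the caller provides them.
    if json_schema is not None:
        body["json_schema"] = json_schema
    if metadata is not None:
        body["metadata"] = metadata
    if callback_url is not None:
        body["callback_url"] = callback_url

    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would send it. Building the request separately from sending it makes the payload easy to inspect before any network call is made.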
Base64 encoded image.
The URL to call when the request is completed.
1
Date and time when the request was created (in UTC timezone)
The detail level to use for the model.
Allowed values: auto, hi, lo
The domain identifier.
Allowed values: document.generative, document.invoice, document.markdown, document.presentation, document.receipt, document.resume, document.utility-bill, video.tv-news, video.tv-intelligence
Unique identifier of the request.
The JSON schema to use for the model.
Optional metadata to pass to the model.
The model to use for generating the response.
"vlm-1"
Response
Base prediction response for all API responses.
Date and time when the response was completed (in UTC timezone)
Date and time when the request was created (in UTC timezone)
Unique identifier of the response.
The response from the model.
The status of the job.
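The response fields listed above could be read into a small typed record along the following lines. This is only a sketch: the key names (id, status, created_at, completed_at, response), the ISO 8601 timestamp format, and the sample status strings are all assumptions, since this reference gives descriptions but no field names or example payloads.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Optional


@dataclass
class Prediction:
    id: str
    status: str
    created_at: datetime
    completed_at: Optional[datetime]  # None while the job is unfinished
    response: Any                     # the model output, if any


def _parse_ts(value: Optional[str]) -> Optional[datetime]:
    # Assumes ISO 8601 strings with an explicit offset such as "+00:00".
    return datetime.fromisoformat(value) if value else None


def parse_prediction(payload: dict) -> Prediction:
    """Convert a decoded JSON response (hypothetical keys) into a Prediction."""
    return Prediction(
        id=payload["id"],
        status=payload["status"],
        created_at=_parse_ts(payload["created_at"]),
        completed_at=_parse_ts(payload.get("completed_at")),
        response=payload.get("response"),
    )
```

Keeping completed_at optional lets the same record represent a job that has been accepted but has not finished yet, for example when a callback URL is used.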