Image -> JSON
Generate structured prediction for the given image.
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
Request to the VLM API (i.e. structured prediction).
Base64 encoded image.
Unique identifier of the request.
Date and time when the request was created (in UTC timezone)
The URL to call when the request is completed.
1
The model to use for generating the response.
"vlm-1"
The domain identifier.
document.generative
, document.invoice
, document.markdown
, document.presentation
, document.receipt
, document.resume
, document.utility-bill
, video.tv-news
, video.tv-intelligence
The JSON schema to use for the model.
The detail level to use for the model.
auto
, hi
, lo
Optional metadata to pass to the model.
Response
Base prediction response for all API responses.
Unique identifier of the response.
Date and time when the request was created (in UTC timezone)
Date and time when the response was completed (in UTC timezone)
The response from the model.
The status of the job.