Finetuned Inference
Generate a response from a fine-tuned model.
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
Request to the fine-tuned VLM API for querying an image.
Optional metadata to pass to the model.
The VLM generation config to be used for /<dtype>/generate.
Unique identifier of the request.
Date and time when the request was created (in UTC timezone)
The URL to call when the request is completed.
1
List of base64 encoded images or URLs to the images.
List of base64 encoded videos or URLs to the videos.
The model to use for generating the response.
Whether to process the image in batch mode (async).
Response
Base prediction response for all API responses.
The usage metrics for the request.
Unique identifier of the response.
Date and time when the request was created (in UTC timezone)
Date and time when the response was completed (in UTC timezone)
The response from the model.
The status of the job.
enqueued
, pending
, running
, completed
, failed
, paused