POST /v1/fine_tuning/generate
curl --request POST \
  --url https://api.vlm.run/v1/fine_tuning/generate \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "metadata": {
    "environment": "dev",
    "session_id": "<string>",
    "allow_training": true
  },
  "config": {
    "prompt": "<string>",
    "detail": "auto",
    "response_model": "<any>",
    "json_schema": {},
    "max_retries": 3,
    "max_tokens": 4096,
    "temperature": 0,
    "confidence": true,
    "grounding": true
  },
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "callback_url": "<string>",
  "images": [
    "<string>"
  ],
  "videos": [
    "<string>"
  ],
  "model": "vlm-1",
  "batch": false
}'

{
  "usage": {
    "elements_processed": 123,
    "element_type": "image",
    "credits_used": 123
  },
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "completed_at": "2023-11-07T05:31:56Z",
  "response": "<any>",
  "status": "pending"
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
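
As a minimal sketch of the header format described above, the following Python helper builds the same two headers the curl sample sends. The function name `auth_headers` is hypothetical and not part of any official client.

```python
def auth_headers(token: str) -> dict:
    """Build the request headers shown in the curl sample.

    `auth_headers` is an illustrative helper, not part of an official SDK.
    """
    if not token:
        raise ValueError("an API token is required")
    return {
        # Bearer scheme, exactly as in 'Authorization: Bearer <token>'
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
```

In practice the token would typically come from an environment variable or a secrets store rather than being hard-coded.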

Body

application/json

Request to the fine-tuned VLM API for querying an image.

metadata
object

Optional metadata to pass to the model.

config
object

The VLM generation config to be used for /<dtype>/generate.

id
string

Unique identifier of the request.

created_at
string

Date and time when the request was created (in UTC).

callback_url
string | null

The URL to call when the request is completed.

Minimum length: 1

images
string[] | null

List of base64-encoded images or URLs to the images.

videos
string[] | null

List of base64-encoded videos or URLs to the videos.
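
Since `images` and `videos` accept either URLs or base64-encoded content, a small helper can normalize both cases. This is a sketch under one assumption: the API takes the plain base64 string of the file bytes. The reference does not say whether a data-URI prefix is expected, so check that before relying on it. The name `to_media_entry` is hypothetical.

```python
import base64
from pathlib import Path


def to_media_entry(source: str) -> str:
    """Return a string suitable for the `images` or `videos` list.

    URLs are passed through unchanged; anything else is treated as a
    local file path and its bytes are base64-encoded.
    Assumption: plain base64 (no data-URI prefix) is what the API expects.
    """
    if source.startswith(("http://", "https://")):
        return source
    return base64.b64encode(Path(source).read_bytes()).decode("ascii")
```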

model
string
default: vlm-1

The model to use for generating the response.

batch
boolean
default: false

Whether to process the image in batch mode (async).
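
The body fields above can be assembled into a request as in the following sketch, which uses only the Python standard library. It assumes that `model`, `images`, `batch`, and `config.prompt` are enough for a basic call; the reference does not mark which body fields are required. The helper name `build_generate_request` is hypothetical.

```python
import json
import urllib.request
from typing import List, Optional

API_URL = "https://api.vlm.run/v1/fine_tuning/generate"


def build_generate_request(
    token: str,
    images: List[str],
    prompt: str,
    model: str = "vlm-1",       # documented default
    batch: bool = False,        # documented default
    callback_url: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for the generate endpoint."""
    payload = {
        "model": model,
        "images": images,
        "batch": batch,
        "config": {"prompt": prompt},
    }
    # callback_url is nullable; include it only when the caller sets one.
    if callback_url is not None:
        payload["callback_url"] = callback_url
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The returned object can be passed to `urllib.request.urlopen` to send it. Building the request separately from sending it keeps the payload easy to inspect and test.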

Response

200
application/json
Successful Response

Base prediction response for all API responses.

usage
object

The usage metrics for the request.

id
string

Unique identifier of the response.

created_at
string

Date and time when the request was created (in UTC).

completed_at
string | null

Date and time when the response was completed (in UTC).
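
Because `created_at` and `completed_at` are both UTC timestamps and `completed_at` may be null, computing how long a request took needs a small guard. The sketch below assumes the timestamps follow the format shown in the sample response (`2023-11-07T05:31:56Z`). The helper names are hypothetical.

```python
from datetime import datetime
from typing import Optional


def parse_utc(ts: str) -> datetime:
    """Parse a timestamp such as '2023-11-07T05:31:56Z' into an aware datetime."""
    # Replace the trailing 'Z' so older Python versions can parse it too.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def elapsed_seconds(created_at: str, completed_at: Optional[str]) -> Optional[float]:
    """Seconds between creation and completion, or None if not completed yet."""
    if completed_at is None:
        return None
    return (parse_utc(completed_at) - parse_utc(created_at)).total_seconds()
```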

response
any | null

The response from the model.

status
enum<string>
default: pending

The status of the job.

Available options: enqueued, pending, running, completed, failed, paused
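
A client will usually want to branch on `status` before reading `response`. The following sketch parses a response body and summarizes it. Treating `completed` and `failed` as the only final states is an assumption; the reference lists the status values but does not say which ones are terminal. The name `summarize_prediction` is hypothetical.

```python
import json

KNOWN_STATUSES = {"enqueued", "pending", "running", "completed", "failed", "paused"}
# Assumption: a job in one of these states will not change any further.
TERMINAL_STATUSES = {"completed", "failed"}


def summarize_prediction(body: str) -> dict:
    """Extract the fields a caller typically needs from a response body."""
    data = json.loads(body)
    status = data.get("status", "pending")  # "pending" is the documented default
    if status not in KNOWN_STATUSES:
        raise ValueError(f"unexpected status: {status!r}")
    usage = data.get("usage") or {}
    return {
        "id": data.get("id"),
        "status": status,
        "done": status in TERMINAL_STATUSES,
        # Only expose the model output once the job has completed.
        "response": data.get("response") if status == "completed" else None,
        "credits_used": usage.get("credits_used"),
    }
```

With `batch` set to true, the first response would presumably arrive in a non-final state, so a caller would either wait for the `callback_url` notification or check the job again later.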