Agent Execute

POST

agent

execute

curl --request POST \
  --url https://api.vlm.run/v1/agent/execute \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "metadata": {
    "environment": "dev",
    "session_id": "<string>",
    "allow_training": true
  },
  "config": {
    "prompt": "<string>",
    "detail": "auto",
    "response_model": "<any>",
    "json_schema": {},
    "gql_stmt": "<string>",
    "max_retries": 3,
    "max_tokens": 4096,
    "temperature": 0,
    "confidence": false,
    "grounding": false
  },
  "urls": [
    "<string>"
  ],
  "file_ids": [
    "<string>"
  ],
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "callback_url": "<string>",
  "name": "<string>",
  "version": "<string>",
  "batch": true
}'

{
  "usage": {
    "elements_processed": 123,
    "element_type": "image",
    "credits_used": 123,
    "steps": 123,
    "message": "<string>"
  },
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "completed_at": "2023-11-07T05:31:56Z",
  "response": "<any>",
  "status": "pending"
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json

Request to execute an agent.

name

string

required

Name of the agent

metadata

object

Optional metadata to pass to the model.

config

object

The VLM generation config to be used for /<dtype>/generate.

urls

string[] | null

The URLs of the files.

file_ids

string[] | null

The IDs of the uploaded files.

string

Unique identifier of the request.

created_at

string

Date and time when the request was created (in UTC timezone)

callback_url

string | null

The URL to call when the request is completed.

Minimum length: 1

version

string | null

default:latest

Version of the agent, defaults to latest.

batch

boolean

default:true

Whether to process the document in batch mode (async).

Response

201

application/json

Successful Response

Base prediction response for all API responses.

usage

object

The usage metrics for the request.

string

Unique identifier of the response.

created_at

string

Date and time when the request was created (in UTC timezone)

completed_at

string | null

Date and time when the response was completed (in UTC timezone)

response

any | null

The response from the model.

status

enum<string>

default:pending

The status of the job.

Available options:

pending,

enqueued,

running,

completed,

failed,

paused

Audio Generate Health

curl --request POST \
  --url https://api.vlm.run/v1/agent/execute \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "metadata": {
    "environment": "dev",
    "session_id": "<string>",
    "allow_training": true
  },
  "config": {
    "prompt": "<string>",
    "detail": "auto",
    "response_model": "<any>",
    "json_schema": {},
    "gql_stmt": "<string>",
    "max_retries": 3,
    "max_tokens": 4096,
    "temperature": 0,
    "confidence": false,
    "grounding": false
  },
  "urls": [
    "<string>"
  ],
  "file_ids": [
    "<string>"
  ],
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "callback_url": "<string>",
  "name": "<string>",
  "version": "<string>",
  "batch": true
}'

{
  "usage": {
    "elements_processed": 123,
    "element_type": "image",
    "credits_used": 123,
    "steps": 123,
    "message": "<string>"
  },
  "id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "completed_at": "2023-11-07T05:31:56Z",
  "response": "<any>",
  "status": "pending"
}

Models

Hub

Generate

Predictions

Files

OpenAI Compatibility

files

openai

hub

image

document

video

audio

agent

API Reference

Authorizations

Body

Response