Use this file to discover all available pages before exploring further.
The VLM Run API is a unified platform for production-ready multimodal AI. Use it to extract structured data from documents, images, videos, and audio — or run complex multi-step workflows with visual agents.
Use the Chat Completions endpoint for interactive multi-modal conversations, or the Agent Executions endpoint for batch execution workflows.
from vlmrun.client import VLMRun# Initialize the VLM Run clientclient = VLMRun(api_key="<VLMRUN_API_KEY>")# Create a chat completionresponse = client.agent.completions.create( model="vlmrun-orion-1:auto", messages=[ { "role": "user", "content": [ {"type": "text", "text": "What do you see in this image?" }, {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}} ] } ], max_tokens=1000)