Chat - VLM Run

What You Can Do

Chat with PDFs and documents

Upload a PDF, invoice, medical form, or any multi-page document. Ask Orion to summarize it, extract specific fields, compare sections, or answer questions grounded in the document’s content. Orion processes every page and returns structured, cited responses.

Analyze and caption images

Drop in a photo, screenshot, or diagram. Orion can caption the image, detect and label objects, identify UI elements, extract text via OCR, or answer open-ended questions about what it sees. Results include bounding boxes and visual annotations when relevant.

Ask questions about videos

Attach a video and ask Orion to summarize it, identify key moments, describe actions, or extract metadata across frames. Video understanding works across formats and durations, from short clips to multi-minute recordings.

Re-use a skill

Select a skill from the skill picker to guide Orion’s response. Skills constrain the output to a specific schema (invoice line items, receipt totals, document classifications) so you get consistent, structured data every time. You can also create new skills directly from a conversation.

Compose multi-step visual workflows

Chain operations in a single conversation: detect objects, crop a region, enhance it, then analyze. Orion maintains context across turns, so each step builds on the last.

Key Features

Feature	Description
File attachments	Upload images, PDFs, videos, and audio directly in the chat window
Skill picker	Select a skill to constrain outputs to a specific schema
Structured responses	Get JSON-structured results alongside natural language explanations
Conversation history	Resume past conversations and iterate on results
Create skills from chat	Turn a successful conversation into a reusable skill in one click
Real-time streaming	Responses stream token-by-token for immediate feedback

From Playground to Production

Chat is designed to be the bridge between exploration and integration. A typical workflow looks like:

Explore: Upload a sample file and ask Orion a question to see what’s possible.

Refine: Iterate on your prompt, select or create a skill to structure the output.

Integrate: Take the skill and model configuration from Chat and wire it into your API calls using the Python SDK, Node.js SDK, or REST API.

Every chat conversation is logged in the Observe dashboard, so you can inspect completions, debug responses, and track costs as you go.

Agent Chat (API)

Use the Orion agent in conversational chat mode via the API.

Chat Completions API

Explore the chat completions REST endpoint.

CLI Chat

Run chat sessions from the command line.

Skills Overview

Learn how to create and manage skills on the platform.

​What You Can Do

​Key Features

​From Playground to Production

​Related Pages

Agent Chat (API)

Chat Completions API

CLI Chat

Skills Overview

What You Can Do

Key Features

From Playground to Production

Related Pages