Skip to main content

What is VLM Run Agents?

VLM Run Agents is an advanced platform for developers to create, deploy, and manage intelligent AI agents that process documents, images, and videos with custom prompts and structured outputs. We aim to make VLM Run Agents the go-to platform for building sophisticated visual AI workflows with a unified API that’s versatile, powerful and developer-friendly. VLM Run Agents is powered by vlm-agent-1, a cutting-edge Visual Reasoning Agent that supports mixed-modality inputs and multi-turn visual reasoning. By leveraging vlm-agent-1, enterprises can effortlessly build intelligent automation workflows that understand, analyze, and process visual content at scale, transforming complex multi-modal data into actionable insights and automated actions.

Overview of AI agent capabilities with VLM Run Agents.

What makes VLM Run Agents unique?

Here are some key features of VLM Run Agents that set it apart from other AI agent platforms:

Installation

Get started with VLM Run Agents using your preferred SDK:
pip install vlmrun

Let’s get started!

Below you’ll find the API reference and code samples so you can start building intelligent agents for your use case. Sign up for an API key on our platform, then check out some of our cookbooks to learn how to use VLM Run Agents to build sophisticated visual AI workflows.
I