Connect your AI agent to VLM Run’s remote MCP server and start building agentic workflows with visual AI in minutes.

Installation

Get your API key

Head over to the VLM Run Dashboard to get your API key ($VLMRUN_API_KEY). We’ll use this to authenticate your requests to the MCP server next.

Ping the MCP server to test

Copy-paste the server URL linked below in your browser and you should see a ping response from the MCP server.

https://mcp.vlm.run/${VLMRUN_API_KEY}/sse

Authentication using the above approach is purely experimental, and is subject to change. We’ll be announcing our OAuth 2.1-based authentication, as per the MCP spec, soon.

Add the server to your MCP client

Add this configuration to your Claude Desktop MCP settings:

  • Location (Mac) ~/Library/Application Support/Claude/claude_desktop_config.json
  • Location (Windows) %APPDATA%/Claude/claude_desktop_config.json See docs for more details.
{
  "mcpServers": {
    "vlm-run-mcp": {
      "args": [
        "mcp-remote",
        "https://mcp.vlm.run/${VLMRUN_API_KEY}/sse",
        "--transport",
        "sse-only",
      ],
      "env": {
        "VLMRUN_API_KEY": "${VLMRUN_API_KEY}"
      }
    }
  }
}

Restart Claude Desktop after adding the configuration.

Test the connection

Ask your AI agent to list available tools

List available VLM Run tools

Expected response: A list of available tools like put_image_url, put_document_url, detect_faces, etc.

Try your VLM Run MCP tool interaction

Let’s extract details from an invoice image using VLM Run MCP tools:

Extract invoice data from this image (https://storage.googleapis.com/vlm-data-public-prod/hub/examples/document.invoice/invoice_1.jpg) using the document.invoice domain

Claude will automatically use the VLM Run MCP server to process your image and return structured JSON data.

Troubleshooting


Need help? Join our Discord channel or contact our support team to get help.

Connect your AI agent to VLM Run’s remote MCP server and start building agentic workflows with visual AI in minutes.

Installation

Get your API key

Head over to the VLM Run Dashboard to get your API key ($VLMRUN_API_KEY). We’ll use this to authenticate your requests to the MCP server next.

Ping the MCP server to test

Copy-paste the server URL linked below in your browser and you should see a ping response from the MCP server.

https://mcp.vlm.run/${VLMRUN_API_KEY}/sse

Authentication using the above approach is purely experimental, and is subject to change. We’ll be announcing our OAuth 2.1-based authentication, as per the MCP spec, soon.

Add the server to your MCP client

Add this configuration to your Claude Desktop MCP settings:

  • Location (Mac) ~/Library/Application Support/Claude/claude_desktop_config.json
  • Location (Windows) %APPDATA%/Claude/claude_desktop_config.json See docs for more details.
{
  "mcpServers": {
    "vlm-run-mcp": {
      "args": [
        "mcp-remote",
        "https://mcp.vlm.run/${VLMRUN_API_KEY}/sse",
        "--transport",
        "sse-only",
      ],
      "env": {
        "VLMRUN_API_KEY": "${VLMRUN_API_KEY}"
      }
    }
  }
}

Restart Claude Desktop after adding the configuration.

Test the connection

Ask your AI agent to list available tools

List available VLM Run tools

Expected response: A list of available tools like put_image_url, put_document_url, detect_faces, etc.

Try your VLM Run MCP tool interaction

Let’s extract details from an invoice image using VLM Run MCP tools:

Extract invoice data from this image (https://storage.googleapis.com/vlm-data-public-prod/hub/examples/document.invoice/invoice_1.jpg) using the document.invoice domain

Claude will automatically use the VLM Run MCP server to process your image and return structured JSON data.

Troubleshooting


Need help? Join our Discord channel or contact our support team to get help.