Try the Chat Playground
Navigate over to the Chat Playground to interact with our visual AI models in real-time and explore their capabilities.
vlm-agent-1
through natural conversation. Upload images, documents, or videos and engage in dynamic conversations that leverage our advanced vision-language models for comprehensive analysis and structured data extraction.
VLM Run Chat Playground in action
Our Chat Playground is built on the same powerful foundation as our Structured Responses API, but provides an intuitive conversational interface that makes visual AI accessible to everyone - from developers to business users.We are working on adding more capabilities to the Chat Playground, and you can expect to see more features and capabilities added in the coming months.
Capability Showcase
Explore the full range of visual AI capabilities through interactive examples:Document Analysis
Document Analysis
Upload invoices, receipts, contracts, or any document and extract structured data automatically. The playground supports all our pre-built document domains including invoices, receipts, forms, and more.

Document Analysis in Chat Playground
Image Understanding
Image Understanding
Analyze images with detailed captions, object detection, segmentation, and visual question answering. Perfect for content moderation, accessibility, and automated image analysis.

Image Understanding Capabilities
Video Generation
Video Generation
Edit and generate images, and videos from text prompts or images. Get creative with your videos and images.
Image-to-Video Generation Features
Multi-Modal Conversations
Multi-Modal Conversations
Engage in rich conversations that combine text, images, and structured data. Ask follow-up questions, request modifications, and explore different analysis approaches.

Multi-Modal Chat Interface
Supported Content Types
The Chat Playground supports a wide range of visual content:- Images: JPG, PNG, GIF, WebP (up to 10MB)
- Documents: PDF (up to 25MB, 30 pages max)
- Videos: MP4, MOV, AVI (up to 100MB)
- Audio: MP3, WAV, M4A (up to 25MB)