Client Reference
Detailed guide to the VLM Run Python SDK client
This guide provides detailed examples for using the VLM Run client, organized from basic to advanced usage.
Basic Client Setup
Initialization
Configuration Options
| Parameter | Type | Default | Description |
|---|---|---|---|
| api_key | str | None | Your API key (falls back to the VLMRUN_API_KEY env var) |
| base_url | str | "https://api.vlm.run/v1" | API endpoint URL |
| timeout | float | 120.0 | Request timeout in seconds |
| max_retries | int | 5 | Maximum retry attempts for failed requests |
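To make the defaults and the environment-variable fallback in the table concrete, here is a minimal sketch. `ClientConfig` and `resolve_config` are hypothetical names used only for illustration and are not part of the SDK; only the option names, the defaults, and the `VLMRUN_API_KEY` variable come from the table above.

```python
import os
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ClientConfig:
    """Hypothetical stand-in mirroring the documented options (not an SDK class)."""

    api_key: Optional[str] = None
    base_url: str = "https://api.vlm.run/v1"
    timeout: float = 120.0  # seconds
    max_retries: int = 5


def resolve_config(api_key: Optional[str] = None, **overrides) -> ClientConfig:
    """Prefer an explicit key, then fall back to the VLMRUN_API_KEY env var."""
    key = api_key or os.environ.get("VLMRUN_API_KEY")
    if not key:
        raise ValueError("No API key: pass api_key or set VLMRUN_API_KEY")
    return ClientConfig(api_key=key, **overrides)
```

Calling `resolve_config(timeout=30.0)` in this sketch keeps every other default and changes only the timeout.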
Media Processing
Image Processing
Document Processing
Audio Processing
Video Processing
Working with Predictions
Retrieving Predictions
Waiting for Completion
Working with Response Data
File Management
Uploading Files
Managing Files
Domain and Schema Management
Working with Domains
Working with Schemas
Custom Schemas
Defining Custom Schemas
Using Auto-casting
Common Workflows
Upload and Process
Batch Processing
Error Handling
Handling Common Errors
Implementing Retries
Advanced Features
Custom Timeouts
Request Metadata
Best Practices
Client Lifecycle
Create a single client instance and reuse it across your application:
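One common way to get a single shared instance is a cached factory function. The sketch below assumes nothing about the SDK: `FakeClient` is a hypothetical placeholder for the real client class, and `get_client` is an illustrative helper name.

```python
from functools import lru_cache


class FakeClient:
    """Hypothetical placeholder for the real SDK client, for illustration only."""

    instances = 0  # counts how many clients were constructed

    def __init__(self) -> None:
        FakeClient.instances += 1


@lru_cache(maxsize=1)
def get_client() -> FakeClient:
    """Build the client on the first call and return the same object afterwards."""
    return FakeClient()
```

Every module that calls `get_client()` then shares one object, so connection setup and configuration happen once per process.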
API Key Management
Keep your API key secure:
- Use environment variables instead of hardcoding
- Use a secrets manager for production applications
- Regularly rotate API keys for security
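As a rough illustration of the first point, the sketch below reads the key from the environment, fails fast when it is missing, and wraps it so that it does not appear in logs or tracebacks by accident. `Secret` and `load_api_key` are hypothetical helpers, not SDK features; the masking step is an extra precaution assumed here rather than something this guide prescribes.

```python
import os


class Secret:
    """Hypothetical wrapper that keeps a key out of repr() and str() output."""

    def __init__(self, value: str) -> None:
        self._value = value

    def reveal(self) -> str:
        """Return the raw key; call this only where the key is actually needed."""
        return self._value

    def __repr__(self) -> str:
        return "Secret('****')"

    def __str__(self) -> str:
        return "****"


def load_api_key(env_var: str = "VLMRUN_API_KEY") -> Secret:
    """Read the key from the environment and fail fast if it is missing."""
    value = os.environ.get(env_var)
    if not value:
        raise RuntimeError(f"{env_var} is not set")
    return Secret(value)
```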
Resource Cleanup
Clean up resources when they’re no longer needed:
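A context manager is one way to make cleanup reliable even when processing fails partway through. This is a sketch under assumptions: `temporary_resources` is a hypothetical helper, and the `delete` callable stands in for whatever deletion call your code uses.

```python
from contextlib import contextmanager
from typing import Callable, Iterator, List


@contextmanager
def temporary_resources(delete: Callable[[str], None]) -> Iterator[List[str]]:
    """Yield a list for collecting resource IDs; delete each one on exit."""
    ids: List[str] = []
    try:
        yield ids
    finally:
        for resource_id in ids:
            try:
                delete(resource_id)
            except Exception:
                # Best-effort cleanup: keep going so one failure
                # does not leave the remaining resources behind.
                pass
```

Inside the `with` block, append each ID as soon as the resource is created; the `finally` clause runs whether the block exits normally or with an exception.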
Performance Optimization
For high-volume applications:
- Reuse the client instance
- Use batch processing for multiple files
- Implement exponential backoff for retries
- Use async processing for non-blocking operations
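The exponential backoff mentioned above can be sketched as a small wrapper around any callable. `retry_with_backoff` is a hypothetical helper, not an SDK function; the delay parameters are illustrative assumptions, and the default of 5 retries simply mirrors the documented `max_retries` default.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    fn: Callable[[], T],
    max_retries: int = 5,
    base_delay: float = 0.5,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on any exception with exponential backoff and full jitter."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except Exception:
            if attempt == max_retries:
                raise  # out of retries: surface the last error
            # The delay cap doubles on each attempt, up to max_delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: sleep a random amount in [0, cap].
            sleep(random.uniform(0, cap))
    raise AssertionError("unreachable")
```

Because the client already accepts a `max_retries` option, wrapping its calls in a helper like this multiplies the total number of attempts, so it is worth choosing one layer to own retries. In real code, catch only the exceptions you consider transient rather than every `Exception`.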