Pricing Breakdown
Flexible pricing plans for developers and enterprises to build with VLM Run.
Simple, transparent pricing
Choose the plan that works best for your project. All plans include access to our core Vision Language Model capabilities with structured JSON outputs.
Credit-based pricing
All applications use a credits-based system with three detail levels and optional grounding. document/image/healthcare domains charge per page, while audio and video domains charge per duration-based segment:
Credit costs by domain
Domain | Credits per Page/Image/Segment | Grounding (Add-on) | ||
---|---|---|---|---|
lo | auto | hi | ||
document.* | 1 | 2 | 4 | +2 |
document.markdown | 1 | 4 | 6 | - |
image.* | 1 | 2 | 4 | +2 |
healthcare.* | 1 | 2 | 4 | +2 |
audio.* | 1 | 1 | 2 | - |
video.* | 10 | 10 | 20 | - |
- document.*, image.*, and healthcare.* domains support the Grounding add-on, which provides visual bounding boxes and confidence scores for detected entities. This add-on costs an additional 2 credits per page/image.
- document.markdown is optimized for markdown content and does not support the Grounding add-on.
- audio.* and video.* domains are charged per 5-minute segment. For example, a 12-minute audio file will be billed as 3 segments.
- The lo, auto, and hi columns represent increasing levels of processing quality and computational cost.
- For all domains, a minimum of 2 segments is charged for audio/video files shorter than 10 minutes.
- If you need help estimating credits for your use case, please contact us.
Custom models
Custom models and specialized applications are charged at a fixed premium rate per page regardless of detail level used.
Domain/Application | Price Per Page | Notes |
---|---|---|
Custom fine-tuned models | 6 | Your specialized models trained on custom data |
Enterprise applications | 8 | Custom-built applications for specific use cases |
Why fixed pricing? Custom models and applications use specialized processing that requires consistent computational resources regardless of detail settings. All custom models are charged at the equivalent of hi + grounding (8 credits) to ensure optimal performance.
Processing levels
- lo: Low-resolution processing
- Documents/Images/Healthcare: 1 credit
- Audio: 1 credit per 5-minute segment
- Video: 10 credits per 5-minute segment
- auto: Automatically optimized processing quality
- Documents/Images/Healthcare: 2 credits
- Audio: 1 credit per 5-minute segment
- Video: 10 credits per 5-minute segment
- hi: High-resolution processing for complex content
- Documents/Images/Healthcare: 4 credits
- Audio: 2 credits per 5-minute segment
- Video: 20 credits per 5-minute segment
- Grounding: Visual grounding with bounding boxes and confidence scores
- Documents/Images/Healthcare: +2 credits
- Audio/Video: Not available
Examples
Documents:
- Invoice with lo: 1 credit = $0.01 per page
- Invoice with auto: 2 credits = $0.02 per page
- Invoice with hi + grounding: 6 credits = $0.06 per page
Images:
- Classification with lo: 1 credit = $0.01 per image
Audio/Video:
- 1-hour audio: 12 credits = $0.12
- 1-hour video: 120 credits = $1.20
Custom models:
- Fine-tuned models: 8 credits = $0.08 per page
Plan comparison
Feature | Free | Pro | Enterprise |
---|---|---|---|
Credits/month | 100 (sign-up bonus) | 100,000 included | Unlimited |
Rate Limit | 10/min | 100/min | No limits |
Custom Models | Pre-configured only | Up to 5 | Unlimited |
Support | Community Discord | Dedicated Slack | Dedicated Slack |
Data Retention | Basic logs | Zero-Data Retention (ZDR) | Zero-Data Retention (ZDR) |
Compliance | Basic | BAA | SOC2, HIPAA, BAA |
Deployments | Cloud only | Cloud only | In-VPC available |
Service Level Agreement | Community support | Standard support | Custom SLAs |
Compare features across all pricing tiers
FAQ
How does the credits-based pricing work?
How does the credits-based pricing work?
For document/image/healthcare domains, each API call consumes fixed credits: lo (1 credit), auto (2 credits), hi (4 credits), with grounding adding +2 credits. For audio domains, you pay per 5-minute segment: lo (1 credit per segment), auto (1 credit per segment), hi (2 credits per segment). For video domains, you pay per 5-minute segment: lo (10 credits per segment), auto (10 credits per segment), hi (20 credits per segment). Credits are converted to dollars at $0.01 per credit.
What's the difference between lo, auto, and hi levels?
What's the difference between lo, auto, and hi levels?
For documents/images: lo (1 credit) offers basic processing, auto (2 credits) automatically optimizes quality, hi (4 credits) uses high-resolution processing. For audio: lo (1 credit per segment), auto (1 credit per segment), hi (2 credits per segment). For video: lo (10 credits per segment), auto (10 credits per segment), hi (20 credits per segment). hi is recommended for complex content with small text, detailed layouts, or when maximum accuracy is required.
When should I use visual grounding?
When should I use visual grounding?
Visual grounding provides bounding boxes and confidence scores for extracted data, making it ideal for compliance, audit trails, and quality assurance. It’s particularly valuable in healthcare, finance, and legal applications where data accuracy verification is critical. Note: Grounding is only available for document, image, and healthcare processing, not for audio or video.
How does audio and video pricing work?
How does audio and video pricing work?
Audio and video processing uses 5-minute segment pricing instead of per-call pricing. Both audio and video files are charged per 5-minute segments (300 seconds each), with a minimum of 2 segments. For audio: cost is 1 credit per segment for lo, 1 credit for auto, 2 credits for hi. For video: cost is 10 credits per segment for lo, 10 credits for auto, 20 credits for hi. Grounding is not available for audio or video processing.
Can I upgrade or downgrade my plan anytime?
Can I upgrade or downgrade my plan anytime?
Yes! You can upgrade or downgrade your plan at any time. Changes take effect immediately, and we’ll prorate any billing adjustments.
What happens if I exceed my plan limits?
What happens if I exceed my plan limits?
If you exceed your monthly credit limit, you’ll be charged our pay-as-you-scale rates for the additional usage. You can set usage alerts to monitor your consumption. Free plan users can purchase additional credits as needed.
Why do custom models cost more?
Why do custom models cost more?
Custom requested models and fine-tuned applications require specialized computational resources and processing pipelines. They’re charged at a fixed premium rate of 6 credits per call regardless of detail level to ensure consistent, high-quality performance for specialized use cases.
What payment methods do you accept?
What payment methods do you accept?
We accept all major credit cards (Visa, MasterCard, American Express) and can arrange invoice billing for Enterprise customers.
Get started
Choose your plan and start building with VLM Run today. Need help deciding? Book a demo with our team.