> ## Documentation Index > Fetch the complete documentation index at: https://docs.vlm.run/llms.txt > Use this file to discover all available pages before exploring further. # Pricing > Flexible pricing plans for developers and enterprises to build with VLM Run. ## Simple, transparent pricing Choose the plan that works best for your project. All plans include access to our core Vision Language Model capabilities with structured JSON outputs. ## Credit-based pricing All applications use a credits-based system with three detail levels and optional grounding. The current conversion rate is:

100 credits = \$1

`document.*`, `image.*`, and `healthcare.*` domains charge **per page**, while `audio.*` and `video.*` domains charge **per duration-based segment**: ### Domain Based Pricing The pricing below applies to all document.generate and image.generate API calls.

Domain	Credits per Page / Image / 5-min Segment			Grounding (Add-on)
Domain	`lo`	`auto`	`hi`	Grounding (Add-on)
`document.\*`	1	2	4	+2
`document.markdown`	1	4	6	-
`image.\*`	1	2	4	+2
`healthcare.\*`	1	2	4	+2
`audio.\*`	1	1	2	-
`video.\*`	10	10	20	-

* **Grounding add-on**: `document.*`, `image.*`, and `healthcare.*` domains support the *Grounding* add-on, which provides visual bounding boxes and confidence scores for detected entities. This add-on costs an additional **2 credits per page or image**. * **Document Markdown**: `document.markdown` is optimized for markdown content and does not support the *Grounding* add-on. * **Audio / Video Transcription**: `audio.*` and `video.*` domains are charged per 5-minute segment. For example, a 12-minute audio file will be billed for 15-minutes, or 3 segments. * **Processing levels**: The `lo`, `auto`, and `hi` columns represent increasing levels of processing quality and computational cost. * For all domains, a minimum of 2 segments is charged for audio/video files shorter than 10 minutes. * If you need help estimating credits for your use case, please [contact us](mailto:support@vlm.run). ### Service Tiers Every prediction, agent execution, and chat completion can be routed through one of three delivery tiers by setting `service_tier` (Python SDK) / `serviceTier` (Node.js SDK) on the request's `GenerationConfig`. The tier governs **both** how the request is routed (latency / availability) **and** how credits are billed.

Tier	Multiplier	When to use
`standard` (default)	1.0×	Baseline rates and latency. Used when `service\_tier` is omitted, `null`, `"auto"`, or `"default"`.
`flex`	0.5× (50% off)	Batch / background workloads that can tolerate higher and more variable latency. Great for nightly document processing, large video backfills, or bulk evaluation runs.
`priority`	1.8×	User-facing or latency-sensitive workloads that require the highest reliability and lowest queue times.

The tier multiplier is applied **uniformly** on top of the domain / agent credit cost — including LLM, media, and tool credits. For example, an `image.*` `hi` call with grounding normally costs `4 + 2 = 6` credits per image; at `flex` it costs `3` credits, and at `priority` it costs `10.8` credits. ```python Python theme={"theme":{"light":"github-light","dark":"dark-plus"}} from vlmrun.client import VLMRun from vlmrun.client.types import GenerationConfig client = VLMRun() # Flex tier — 50% discount, higher latency response = client.document.generate( file="invoice.pdf", domain="document.invoice", config=GenerationConfig(service_tier="flex"), ) ``` ```javascript Node.js theme={"theme":{"light":"github-light","dark":"dark-plus"}} // Priority tier — 1.8x premium, lowest queue times await client.document.generate({ fileId: "file_...", domain: "document.invoice", config: { serviceTier: "priority" }, }); ``` ```bash cURL theme={"theme":{"light":"github-light","dark":"dark-plus"}} # Flex tier — 50% discount, higher latency curl -X POST https://api.vlm.run/v1/document/generate \ -H "Authorization: Bearer $VLMRUN_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "file_id": "", "domain": "document.invoice", "config": { "service_tier": "flex" } }' ``` `service_tier` is accepted on all prediction routes (image, document, healthcare, audio, video, web), on agent executions, and on OpenAI-compatible chat completions. Omitting the field (or setting it to "auto", "default", or null) falls back to the server default, which is currently standard. ### Agent-Based Pricing Agent pricing is determined by the type of agent and the tools it uses:

Agent Type / Application	Price Per Page or Image	Notes
All Redaction & Edit Domains `healthcare.phi-redaction`, `healthcare.phi-edit-replace`, `insurance.document-redaction`, etc.	8	Flat rate for all domains that perform PHI redaction or PHI edit-replace, regardless of detail level or options.
Other Agent Applications	Varies	The credit cost is based on the number of tools (sub-models or APIs) called by the agent during processing. Each tool call incurs an additional credit cost. Final cost = Base agent cost + Tool cost × Number of tools used

All PHI redaction agents are charged a fixed premium rate of 8 credits per page to reflect the additional compliance and security requirements. For other agent-based applications, the total credit cost will depend on the number and type of tools invoked by the agent during execution. Please refer to the agent documentation or contact support for detailed cost breakdowns for your specific use case. ### Examples **Documents:** * Invoice with `lo`: **1 credit** = \$0.01 per page * Invoice with `auto`: **2 credits** = \$0.02 per page * Invoice with `hi` + `grounding`: **6 credits** = \$0.06 per page **Images:** * Classification with `lo`: **1 credit** = \$0.01 per image **Audio/Video:** * 1-hour audio: **12 credits** = \$0.12 * 1-hour video: **120 credits** = \$1.20 **Custom models:** * Fine-tuned models: **8 credits** = \$0.08 per page **Service tiers:** * Invoice with `auto` at `flex`: **2 × 0.5 = 1 credit** = \$0.01 per page * Invoice with `hi` + grounding at `priority`: **6 × 1.8 = 10.8 credits** ≈ \$0.11 per page * 1-hour video at `flex`: **120 × 0.5 = 60 credits** = \$0.60 *** ## Plan comparison Here's a full comparison of the features across all our current pricing tiers.

Feature	Starter	Pro	Enterprise
Credits / Month	100 (sign-up bonus)	100,000 included	Unlimited
Rate Limit	10 / min	100 / min	No limits
Custom Models	Pre-configured only	Up to 5	Unlimited
Support	Discord	Dedicated Slack	Dedicated Slack
Data Retention	Basic logs	Zero-Data Retention	Zero-Data Retention
Compliance	Basic	BAA	SOC2, HIPAA, BAA
Deployments	Cloud only	Cloud only	In-VPC available
Service Level Agreement	Community support	Standard support	Custom SLAs

## FAQ For `document.*`, `image.*`, and `healthcare.*` domains, each API call consumes fixed credits: `lo` (1 credit), `auto` (2 credits), `hi` (4 credits), with grounding adding +2 credits. For `audio.*` domains, you pay per 5-minute segment: `lo` (1 credit per segment), `auto` (1 credit per segment), `hi` (2 credits per segment). For `video.*` domains, you pay per 5-minute segment: `lo` (10 credits per segment), `auto` (10 credits per segment), `hi` (20 credits per segment). Credits are converted to dollars at \$0.01 per credit. For documents/images: `lo` (1 credit) offers basic processing, `auto` (2 credits) automatically optimizes quality, `hi` (4 credits) uses high-resolution processing. For audio: `lo` (1 credit per segment), `auto` (1 credit per segment), `hi` (2 credits per segment). For video: `lo` (10 credits per segment), `auto` (10 credits per segment), `hi` (20 credits per segment). `hi` is recommended for complex content with small text, detailed layouts, or when maximum accuracy is required. Visual grounding provides bounding boxes and confidence scores for extracted data, making it ideal for compliance, audit trails, and quality assurance. It's particularly valuable in healthcare, finance, and legal applications where data accuracy verification is critical. Note: Grounding is only available for `document.*`, `image.*`, and `healthcare.*` domains, not for `audio.*` or `video.*`. Audio and video processing uses 5-minute segment pricing instead of per-call pricing. Both audio and video files are charged per 5-minute segments (300 seconds each), with a minimum of 2 segments. For audio: cost is 1 credit per segment for `lo`, 1 credit for `auto`, 2 credits for `hi`. For video: cost is 10 credits per segment for `lo`, 10 credits for `auto`, 20 credits for `hi`. Grounding is not available for audio or video processing. Every request can be routed through one of three delivery tiers by setting `service_tier` (Python SDK) / `serviceTier` (Node.js SDK) on your `GenerationConfig`: * **`standard`** (default) — baseline rate (1.0×) and latency. * **`flex`** — **50% off** (0.5× multiplier) with higher, more variable latency. Best for batch or background workloads. * **`priority`** — 1.8× premium for the lowest queue times and highest reliability. Best for latency-sensitive, user-facing workloads. The multiplier is applied uniformly to the domain, grounding, and agent/tool credits. Omitting the field (or passing `"auto"` / `null`) uses the server default, which is currently `standard`. See the [Service Tiers](#service-tiers) section above for code examples. Yes! You can upgrade or downgrade your plan at any time. Changes take effect immediately, and we'll prorate any billing adjustments. If you exceed your monthly credit limit, you'll be charged our pay-as-you-scale rates for the additional usage. You can set usage alerts to monitor your consumption. Free plan users can purchase additional credits as needed. Custom requested models and fine-tuned applications require specialized computational resources and processing pipelines. They're charged at a fixed premium rate of 6 credits per call regardless of detail level to ensure consistent, high-quality performance for specialized use cases. We bill using Stripe, and accept all major credit cards (Visa, MasterCard, American Express) and can arrange invoice billing for Enterprise customers. ## Get started Choose your plan and start building with VLM Run today. Need help deciding? [Book a demo](https://cal.com/team/vlm-run/demo) with our team. Sign up for free and get 100 credits free to start prototyping. Discuss Enterprise plans and custom pricing options.