The following is a list of recent changes to the VLM Run Platform and API.Documentation Index
Fetch the complete documentation index at: https://docs.vlm.run/llms.txt
Use this file to discover all available pages before exploring further.
- New Platform tab: a single place to learn how the VLM Run Platform works.
- Service Tiers (Standard/Flex/Priority): a new
service_tierparameter is now supported end-to-end: from the backend, through the Python SDK’sGenerationConfig, to full documentation with cURL and SDK examples. - New evaluations guide: learn how to measure skill accuracy, collect feedback, run evaluations, and use results to optimize your skills over time.
- Improved video processing and transcription with longer, richer output.
- Process specific document pages with the new
page_indicesparameter inGenerationConfig. - LaTeX rendering in chat messages for math and scientific content.
- Lossless PNG support for segmentation masks with color keys, unlocking more segmentation use cases.
- Evaluations: Measure workflow improvements using skills in the Evaluations tab.
- Skills:
- Support for uploading skills via zip files
- Refer to skills quickly in chat by typing
/and selecting a skill you’ve used - Python and Node SDKs support skills management, inline skills, and skill-driven extraction
- Improved chat completion latency
- Improved multimodal content handling
- New
/document/executeendpoint for running arbitrary VLM DAGs over documents. - Added support for
document.markdowndomain for extracting structured markdown from documents.
- Fixes for multi-page grounding support with custom JSON schemas in Document -> JSON.
- Added support for
audio.transcriptiondomain for long-form audio transcription (upto 12 hours of audio). - Added support for
video.transcriptiondomain for long-form video transcription (upto 3 hours of video). - Added cookbook for
video transcription.