MarkdownDocument schema is the cornerstone of VLM Run’s document processing system, providing a standardized, machine-readable representation of complex documents. This technical reference guide details the schema’s architecture, components, and implementation patterns.
MarkdownDocument Data Model
The MarkdownDocument schema addresses the fundamental challenges in document processing:
- Structural Preservation: Maintains document hierarchy and relationships
- Content Extraction: Handles mixed content types (text, tables, figures, code)
- Spatial Understanding: Preserves layout and positioning information
- Data Integrity: Ensures accurate representation of structured elements
- Extensibility: Supports custom annotations and metadata
1. MarkdownPage
A MarkdownDocument is a list of MarkdownPage objects, each representing a page in the document.
2. MarkdownTable
Tables are represented with a <Table id="tb-{id}"/> tag in the markdown content, with the actual table content stored in the tables list. This allows for rich representation of table’s data while maintaining the document’s flow.
3. Charts and Figures
Charts and figures are represented with a<Chart id="ch-{id}"/> tag in the content. The chart details are stored in the figures list, including properties like:
Example Usage
Here’s an example of how theMarkdownPage model is used to process a document: