Parsing Documents
Extract structured data from long documents and reports.
Getting Started
vlm-1 can extract structured markdown from long documents and reports. Here's a rough breakdown of the steps involved in parsing a document:
Upload Document
Use the /v1/files endpoint to upload the document you want to parse.
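As a minimal sketch of the upload step, the request might be built as follows with only the Python standard library. Only the /v1/files path comes from this guide. The host in BASE_URL is a placeholder, and the bearer-token header, the multipart encoding, and the form field name "file" are assumptions to check against the API reference.

```python
import uuid
import urllib.request

# Placeholder host: substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_upload_request(filename: str, content: bytes, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a multipart POST request to /v1/files."""
    boundary = uuid.uuid4().hex
    # Assumption: the document is sent as a multipart form field named "file".
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        b"Content-Type: application/octet-stream\r\n\r\n",
        content,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        f"{BASE_URL}/v1/files",
        data=body,
        method="POST",
        headers={
            # Assumption: the API uses bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


# Sending the request (requires network access and a valid key):
# with open("report.pdf", "rb") as f:
#     req = build_upload_request("report.pdf", f.read(), "YOUR_API_KEY")
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode())
```

Building the request separately from sending it makes it easy to inspect the URL, headers, and body before any network call is made.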
The response should include an identifier for the uploaded file, which the next step uses as its file_id.
Submit the Document AI Job
Submit the uploaded file (via its file_id) to the /v1/document/generate endpoint to start the document parsing job. For long documents, you should set batch=True to submit the job to a queue for processing.
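The submission step could be sketched like this, again using only the standard library. The /v1/document/generate path, file_id, and batch come from this guide. The JSON body format, the "model" and "domain" key names, the bearer-token header, and the placeholder host are assumptions for illustration.

```python
import json
import urllib.request

# Placeholder host: substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_generate_request(file_id: str, api_key: str, batch: bool = True) -> urllib.request.Request:
    """Build (but do not send) a POST request to /v1/document/generate."""
    payload = {
        "file_id": file_id,  # ID of the previously uploaded file
        "batch": batch,      # True queues the job, as recommended for long documents
        # Assumptions: these key names are illustrative, not confirmed by this guide.
        "model": "vlm-1",
        "domain": "document.markdown",
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1/document/generate",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            # Assumption: the API uses bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


# Sending the request (requires network access and a valid key):
# req = build_generate_request("YOUR_FILE_ID", "YOUR_API_KEY")
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))
```

Note that batch=True in the text is Python notation; in a JSON body it serializes to true.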
The response should include a request ID for the job, which the next step uses as request_id.
Fetch the Results
Use the /v1/document/{request_id} endpoint to fetch the results of the document parsing job. The results of the extraction job will be in JSON format under the response field.
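Because a batch job is queued, the results may not be available on the first request. A rough polling sketch follows. The /v1/document/{request_id} path and the response field come from this guide. The assumption that response is missing or null until the job finishes, the bearer-token header, the placeholder host, and the helper names are illustrative only.

```python
import json
import time
import urllib.request
from typing import Callable, Optional

# Placeholder host: substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_result_request(request_id: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request to /v1/document/{request_id}."""
    return urllib.request.Request(
        f"{BASE_URL}/v1/document/{request_id}",
        method="GET",
        # Assumption: the API uses bearer-token authentication.
        headers={"Authorization": f"Bearer {api_key}"},
    )


def fetch_json(req: urllib.request.Request) -> dict:
    """Send a request and decode the JSON body."""
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_response(
    fetch: Callable[[], dict],
    interval_s: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[dict]:
    """Call fetch() until the payload has a non-null "response" field.

    Assumption: "response" is absent or null while the job is still running.
    Returns the extracted results, or None if max_attempts is exhausted.
    """
    for attempt in range(max_attempts):
        result = fetch().get("response")
        if result is not None:
            return result
        if attempt < max_attempts - 1:
            sleep(interval_s)
    return None


# Polling a real job (requires network access and a valid key):
# req = build_result_request("YOUR_REQUEST_ID", "YOUR_API_KEY")
# results = wait_for_response(lambda: fetch_json(req))
```

Passing fetch and sleep in as callables keeps the polling loop independent of the network, so it can be exercised with canned payloads.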
For the structure of the results in the document.markdown domain, see the MarkdownPage Schema guide.
Try our Document -> JSON API today
Head over to our Document -> JSON API to start building your own document processing pipeline with VLM Run. Sign up for access on our platform.