Extract structured data from long documents and reports.
vlm-1
can extract structured markdown from long documents and reports. Here’s a rough breakdown of the steps involved in parsing a document:
Upload Document
/v1/files
endpoint to upload the document you want to parse.Submit the Document AI Job
file_id
) to the /v1/document/generate
endpoint to start the document parsing job. For long documents, you should set batch=True
to submit the job to a queue for processing.Fetch the Results
/v1/document/{request_id}
endpoint to fetch the results of the document parsing job. The results of the extraction job will be in JSON format under the response
field.document.markdown
domain, see the MarkdownPage
Schema guide.