Parsing Documents
Extract structured data from long documents and reports.
Getting Started
VLM-1 can extract structured data from long documents and reports. Here’s a rough breakdown of the steps involved in parsing a document:
Upload Document
Use the /v1/files endpoint to upload the document you want to parse.
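As a minimal sketch of the upload step, the snippet below builds a multipart POST to /v1/files using only the Python standard library. The host in BASE_URL, the bearer-token Authorization header, and the multipart field name "file" are assumptions made for illustration and are not confirmed by this page; the helper names are hypothetical.

```python
import json
import urllib.request
import uuid

BASE_URL = "https://api.example.com"  # placeholder host (assumption): use your real API host


def build_upload_request(pdf_bytes: bytes, filename: str, api_key: str,
                         base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a multipart/form-data POST to /v1/files.

    The form field name "file" and bearer-token auth are assumptions.
    """
    boundary = uuid.uuid4().hex
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        b"Content-Type: application/pdf\r\n\r\n",
        pdf_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(
        f"{base_url}/v1/files", data=body, headers=headers, method="POST"
    )


def upload_file(path: str, api_key: str) -> dict:
    """Send the upload and return the parsed JSON response body."""
    with open(path, "rb") as f:
        req = build_upload_request(f.read(), path.rsplit("/", 1)[-1], api_key)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Calling upload_file("report.pdf", api_key) would send the request; the file identifier needed for the next step is read from the returned JSON.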
You should see a response like this:
Submit the Document AI Job
Submit the uploaded file (via its file_id) to the /v1/document/generate endpoint to start the document parsing job. Currently, this endpoint supports only PDF files and submits the job to a queue for processing (batch=True).
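The submission step might look like the sketch below, which posts the file_id to /v1/document/generate with batch set to true. Sending file_id and batch as JSON body fields, the host in BASE_URL, and the bearer-token header are assumptions; the endpoint may expect additional fields not described on this page.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder host (assumption): use your real API host


def build_generate_request(file_id: str, api_key: str,
                           base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a JSON POST to /v1/document/generate for an uploaded PDF."""
    # batch=True queues the job instead of processing it inline.
    payload = {"file_id": file_id, "batch": True}
    headers = {
        "Authorization": f"Bearer {api_key}",  # assumption: bearer-token auth
        "Content-Type": "application/json",
    }
    return urllib.request.Request(
        f"{base_url}/v1/document/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def submit_job(file_id: str, api_key: str) -> dict:
    """Send the request and return the parsed JSON response body."""
    req = build_generate_request(file_id, api_key)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Because the job is queued, the response to this call is not the extraction itself; keep the returned job record so its request identifier can be used to fetch the results.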
You should see a response like this:
Fetch the Results
Use the /v1/document/{request_id} endpoint to fetch the results of the document parsing job. The results of the extraction job will be in JSON format under the response field.
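Since the job runs from a queue, the results are usually fetched by polling. The sketch below is one way to do that under stated assumptions: it treats a missing or null response field as "not finished yet", which this page does not confirm, and the host, auth header, and helper names are hypothetical.

```python
import json
import time
import urllib.request
from typing import Callable

BASE_URL = "https://api.example.com"  # placeholder host (assumption): use your real API host


def result_url(request_id: str, base_url: str = BASE_URL) -> str:
    """Return the URL of the results endpoint for one job."""
    return f"{base_url}/v1/document/{request_id}"


def fetch_job(request_id: str, api_key: str) -> dict:
    """GET the job record and return it as a dict."""
    req = urllib.request.Request(
        result_url(request_id),
        headers={"Authorization": f"Bearer {api_key}"},  # assumption: bearer-token auth
        method="GET",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_response(fetch: Callable[[], dict], interval_s: float = 5.0,
                      max_attempts: int = 60,
                      sleep: Callable[[float], None] = time.sleep) -> dict:
    """Call fetch() until the job record carries a non-null "response" field."""
    for _ in range(max_attempts):
        job = fetch()
        # Assumption: "response" is absent or null while the job is still queued.
        if job.get("response") is not None:
            return job["response"]
        sleep(interval_s)
    raise TimeoutError(f"no response after {max_attempts} attempts")
```

A typical call would be wait_for_response(lambda: fetch_job(request_id, api_key)). Passing fetch and sleep as parameters keeps the polling loop easy to exercise without network access.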
You should see a response like this:
Notebook Example
Illustrative Examples
Here are some examples of the structured JSON output that VLM-1 can extract from long documents and reports:
Document AI with VLM-1 - Example 1.
Document AI with VLM-1 - Example 2.
Document AI with VLM-1 - Example 3.
Document AI with VLM-1 - Example 4.
Get Started with our Document -> JSON API
Head over to our Document -> JSON API page to start building your own document processing pipeline with VLM-1. Sign up for access to our API here.