Parsing Invoices
Extract structured data from invoices.
vlm-1
can extract structured data from invoices in PDF or image format. Here’s a step-by-step guide on how to parse an invoice:
For higher-quality results, you can enable Visual Grounding to help the model understand the invoice and extract more accurate information. See High-Quality Invoice Parsing with Grounding for more details.
Parsing Invoices in 2 Steps
Submit an Invoice Parsing Job
You should see a response like this:
Wait for the Job to Complete
You can now wait for the job to complete by calling the predictions.wait
method:
Illustrative Example
Here is an example of the structured JSON output that vlm-1
can extract from an invoice:
Parsing an invoice with `vlm-1`
You should see a response like this:
High-Accuracy Parsing with Grounding
For higher-quality results, you can enable Visual Grounding to help the model understand the invoice and extract more accurate information. You can do this by setting the config=GenerationConfig(grounding=True)
parameter when submitting the job (as shown below).
Try our Document -> JSON API today
Head over to our Document -> JSON to start building your own document processing pipeline with VLM Run. Sign-up for access on our platform.