Document Redaction
Automatically detect and redact personally identifiable information (PII) from documents using VLM Run MCP tools.
This example demonstrates how to use VLM Run MCP tools to automatically detect and redact personally identifiable information (PII) from document images. This is particularly useful for healthcare, legal, and financial documents where privacy compliance is critical.
Use Case Overview
Document redaction is essential for privacy protection and regulatory compliance. This workflow automatically identifies sensitive information like names, addresses, social security numbers, and other PII, then creates a redacted version suitable for sharing while maintaining document readability.
Workflow Steps
Try it out yourself with the following prompt (in Claude Desktop or Web):
“Can you redact all the personally identifiable information of the patient in the following image https://www.carepatron.com/files/physical-therapy-referral-form-sample-template.jpg and provide a link to the redacted image.”
The agent should be able to use the following tools to accomplish the task:
I’ll help you redact the personally identifiable information from that medical form. Let me first load the image and then identify the PII that needs to be redacted.
Load the document image
Use the put_image_url
tool to load the document image into the system for processing.
Response:
Extract text and identify PII
Use the detect_texts
tool to extract all text from the document and identify regions containing personally identifiable information.
Response: The tool returns bounding boxes for all detected text regions, including sensitive information like patient names, addresses, and medical record numbers.
Create redacted version
Use the blur_bboxes
tool to blur the identified sensitive regions, creating a privacy-compliant version of the document.
Response: Creates a new redacted image with sensitive information blurred out.
Generate preview
Use the preview_object_ref
tool to generate a shareable URL for the redacted document.
Response:
Example Result
Original Document
Redacted Document
Document Redaction Results
Build-your-own Redaction Workflow
In this example, we showed how easy it is to build your own document-redaction pipeline with VLM Run MCP tools. You can use the same approach to build your own document processing pipeline for any industry-vertical.
- 🏥 Healthcare: Patient intake forms, medical records, insurance documents
- 📄 Legal: Contracts, court documents, legal correspondence
- 💰 Financial: Bank statements, tax documents, loan applications
- 💼 HR: Employee records, payroll documents, benefits information
- 📦 Marketing: Product catalogs, marketing materials, sales proposals
- 🏠 Real Estate: Property listings, lease agreements, inspection reports
- 📊 Insurance: Policy documents, claims forms, insurance applications
Try our MCP server today
Head over to our MCP server to start building your own document processing pipeline with VLM Run. Sign-up for access on our platform.