Document Redaction

This example demonstrates how to use VLM Run MCP tools to automatically detect and redact personally identifiable information (PII) from document images. This is particularly useful for healthcare, legal, and financial documents where privacy compliance is critical.

Use Case Overview

Document redaction is essential for privacy protection and regulatory compliance. This workflow automatically identifies sensitive information like names, addresses, social security numbers, and other PII, then creates a redacted version suitable for sharing while maintaining document readability.

Workflow Steps

Try it yourself with the following prompt in Claude Desktop or on the web:

“Can you redact all the personally identifiable information of the patient in the following image https://www.carepatron.com/files/physical-therapy-referral-form-sample-template.jpg and provide a link to the redacted image.”

The agent typically opens with a reply like the one below, then calls the tools described in the following steps to accomplish the task:

“I’ll help you redact the personally identifiable information from that medical form. Let me first load the image and then identify the PII that needs to be redacted.”

Load the document image

Use the put_image_url tool to load the document image into the system for processing.

{
  "url": "https://www.carepatron.com/files/physical-therapy-referral-form-sample-template.jpg"
}

Response:

{
  "id": "img_b3a7",
  "type": "image"
}

Extract text and identify PII

Use the detect_texts tool to extract all text from the document and identify regions containing personally identifiable information.

{
  "image": {
    "id": "img_b3a7",
    "type": "image"
  }
}

Response: The tool returns bounding boxes for all detected text regions, including sensitive information like patient names, addresses, and medical record numbers.

[
  {"xywh": [0.2898, 0.1956, 0.2521, 0.0181]},
  {"xywh": [0.2898, 0.1956, 0.2521, 0.0181]},
  ...
]
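To inspect or post-process these boxes locally, it can help to convert them to pixel coordinates. The sketch below assumes that each xywh entry is a normalized [x, y, width, height] box expressed as fractions of the image size with a top-left origin; the sample values are consistent with that reading, but the source does not state it. The helper name xywh_to_pixels is hypothetical and is not part of the VLM Run tools.

```python
from typing import Sequence, Tuple


def xywh_to_pixels(
    xywh: Sequence[float], image_width: int, image_height: int
) -> Tuple[int, int, int, int]:
    """Convert a normalized [x, y, w, h] box to pixel (left, top, right, bottom).

    Assumption: values are fractions of the image size, measured from the
    top-left corner. This is an illustrative helper, not a VLM Run API.
    """
    x, y, w, h = xywh
    left = round(x * image_width)
    top = round(y * image_height)
    right = round((x + w) * image_width)
    bottom = round((y + h) * image_height)
    # Clamp to the image bounds in case a box slightly overshoots an edge.
    return (
        max(0, left),
        max(0, top),
        min(image_width, right),
        min(image_height, bottom),
    )


if __name__ == "__main__":
    # The first box from the sample response, on a hypothetical 1000x1000 image.
    print(xywh_to_pixels([0.2898, 0.1956, 0.2521, 0.0181], 1000, 1000))
```

Pixel boxes in this form can be drawn on the original image to check which text regions were detected before anything is blurred.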

Create redacted version

Use the blur_bboxes tool to blur the identified sensitive regions, creating a privacy-compliant version of the document.

{
  "image": {
    "id": "img_b3a7",
    "type": "image"
  },
  "bboxes": [
    {"xywh": [0.2898, 0.1956, 0.2521, 0.0181]},
    ...
  ]
}
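If you assemble this request in your own code rather than letting the agent do it, a small helper can build the payload from the detection output. The sketch below is illustrative only: build_blur_request is a hypothetical name, and it assumes the caller has already chosen which detected boxes to redact. It also drops exact duplicate boxes, since a detection list may contain the same region more than once.

```python
from typing import Any, Dict, Iterable, List


def build_blur_request(
    image_ref: Dict[str, str], detections: Iterable[Dict[str, Any]]
) -> Dict[str, Any]:
    """Assemble a blur_bboxes payload, dropping exact duplicate boxes.

    Hypothetical helper: it only shapes the JSON shown above and does not
    decide which boxes contain PII.
    """
    seen = set()
    bboxes: List[Dict[str, Any]] = []
    for detection in detections:
        key = tuple(detection["xywh"])
        if key in seen:
            continue  # Skip a box we have already added.
        seen.add(key)
        bboxes.append({"xywh": list(key)})
    return {"image": image_ref, "bboxes": bboxes}


if __name__ == "__main__":
    detections = [
        {"xywh": [0.2898, 0.1956, 0.2521, 0.0181]},
        {"xywh": [0.2898, 0.1956, 0.2521, 0.0181]},
    ]
    request = build_blur_request({"id": "img_b3a7", "type": "image"}, detections)
    print(request)
```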

Response: Creates a new redacted image with sensitive information blurred out.

{
  "id": "img_ae91",
  "type": "image"
}

Generate preview

Use the preview_object_ref tool to generate a shareable URL for the redacted document.

{
  "obj": {
    "id": "img_ae91",
    "type": "image"
  }
}

Response:

https://mcp.vlm.run/files/img_ae91
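The four steps above can be chained in a few lines. The following is a minimal sketch, not an official client: it assumes you supply a call_tool function (hypothetical) that sends a tool name and its JSON arguments to the MCP server and returns the parsed response. The tool names and argument shapes are taken from the steps above; the select_boxes parameter is also hypothetical and stands in for whatever logic decides which detected regions contain PII.

```python
from typing import Any, Callable, Dict, List

# Hypothetical transport: (tool_name, arguments) -> parsed tool response.
ToolCaller = Callable[[str, Dict[str, Any]], Any]
BoxSelector = Callable[[List[Dict[str, Any]]], List[Dict[str, Any]]]


def redact_document(
    call_tool: ToolCaller,
    url: str,
    select_boxes: BoxSelector = lambda boxes: boxes,
) -> str:
    """Load an image, detect text, blur the selected boxes, return a preview URL.

    By default every detected text region is blurred; pass select_boxes to
    keep only the regions you consider sensitive.
    """
    image = call_tool("put_image_url", {"url": url})
    boxes = call_tool("detect_texts", {"image": image})
    redacted = call_tool(
        "blur_bboxes", {"image": image, "bboxes": select_boxes(boxes)}
    )
    return call_tool("preview_object_ref", {"obj": redacted})


if __name__ == "__main__":
    # Stand-in for a real MCP client, returning canned responses for a dry run.
    canned = {
        "put_image_url": {"id": "img_b3a7", "type": "image"},
        "detect_texts": [{"xywh": [0.2898, 0.1956, 0.2521, 0.0181]}],
        "blur_bboxes": {"id": "img_ae91", "type": "image"},
        "preview_object_ref": "https://mcp.vlm.run/files/img_ae91",
    }
    print(redact_document(lambda name, args: canned[name], "https://example.invalid/form.jpg"))
```

Passing the transport in as a function keeps the sketch independent of any particular MCP client library and makes the sequence easy to dry-run with canned responses.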

Example Result

[Figure: Document Redaction Results, comparing the Original Document with the Redacted Document]

Build-your-own Redaction Workflow

This example showed how to build a document-redaction pipeline with VLM Run MCP tools. The same approach applies to document processing pipelines in other industry verticals:

🏥 Healthcare: Patient intake forms, medical records, insurance documents
📄 Legal: Contracts, court documents, legal correspondence
💰 Financial: Bank statements, tax documents, loan applications
💼 HR: Employee records, payroll documents, benefits information
📦 Marketing: Product catalogs, marketing materials, sales proposals
🏠 Real Estate: Property listings, lease agreements, inspection reports
📊 Insurance: Policy documents, claims forms, insurance applications

Try our MCP server today

Head over to our MCP server to start building your own document processing pipeline with VLM Run. Sign up for access on our platform.
