Use Case Overview
Content creators and marketers often need to extract key moments from longer videos to share as highlights or promotional clips. This workflow automates that process. The agent first isolates a 15-second segment from a longer video based on user-provided timestamps. Then, it transcribes the audio from that clip and adds the text as synchronized, easy-to-read subtitles.
Workflow Steps
Try it out yourself with the following sequence of prompts (in Claude Desktop or Web):
- Load: “Load this video (https://storage.googleapis.com/vlm-data-public-prod/hub/examples/video/test_video.mp4)”
- Trim: “Ok, now trim this video from 00:30 to 00:45 s”
- Subtitle: “Ok, now add subtitles to this trimmed video.”
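The trim prompt gives its window as `MM:SS` timestamps. As a minimal sketch, the timestamps can be converted to seconds and checked before trimming. The helper names `parse_timestamp` and `clip_duration` are hypothetical and are not part of any tool described here.

```python
def parse_timestamp(ts: str) -> int:
    """Convert 'SS', 'MM:SS', or 'HH:MM:SS' into whole seconds."""
    parts = [int(p) for p in ts.strip().split(":")]
    if not 1 <= len(parts) <= 3 or any(p < 0 for p in parts):
        raise ValueError(f"invalid timestamp: {ts!r}")
    seconds = 0
    for part in parts:
        # Each field is worth 60 times the field to its right.
        seconds = seconds * 60 + part
    return seconds


def clip_duration(start: str, end: str) -> int:
    """Return the clip length in seconds, rejecting empty or reversed windows."""
    start_s, end_s = parse_timestamp(start), parse_timestamp(end)
    if end_s <= start_s:
        raise ValueError("end timestamp must come after start timestamp")
    return end_s - start_s


print(clip_duration("00:30", "00:45"))  # 15
```

Checking the window up front catches a reversed or zero-length range before any video processing is requested.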
1. Load the source video
Use the put_file_url tool to load the source video into the system for processing.
2. Trim the video to a segment
Use the trim_video tool to create a new video clip from the original source, specifying the start and end times.
Response: A new video object is created containing only the 15-second segment.
3. Add automatic subtitles
Use the add_video_captions tool on the trimmed video clip. The tool automatically transcribes the audio and burns the captions into the video frames.
Response: A final, new video object is created that includes the subtitles.
4. Generate preview
Use the preview_object_ref tool to generate a shareable URL for the final, subtitled video clip.
Example Result
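The four workflow steps can be sketched as an ordered list of tool-call requests. This is an illustration under stated assumptions, not the tools' documented interface. It assumes the tools are exposed through a Model Context Protocol (MCP) server, which the text does not state, and wraps each call in MCP's JSON-RPC `tools/call` envelope. The tool names come from the steps, but every argument name (`url`, `video`, `start_time`, `end_time`, `object_ref`) and the `<...>` placeholder references are hypothetical.

```python
import json

SOURCE_URL = (
    "https://storage.googleapis.com/vlm-data-public-prod/"
    "hub/examples/video/test_video.mp4"
)


def tool_call(request_id: int, name: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 request in the shape MCP uses for `tools/call`."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


def build_workflow(source_url: str, start: str, end: str) -> list:
    """Return the four requests in order. Argument names are assumptions."""
    steps = [
        ("put_file_url", {"url": source_url}),
        # Placeholders stand in for whatever reference the previous step returns.
        ("trim_video", {"video": "<video from step 1>",
                        "start_time": start, "end_time": end}),
        ("add_video_captions", {"video": "<video from step 2>"}),
        ("preview_object_ref", {"object_ref": "<video from step 3>"}),
    ]
    return [tool_call(i, name, args) for i, (name, args) in enumerate(steps, start=1)]


if __name__ == "__main__":
    for request in build_workflow(SOURCE_URL, "00:30", "00:45"):
        print(json.dumps(request, indent=2))
```

In practice the agent chooses these calls itself from the prompts, so the sketch only makes the data flow explicit: each step consumes the object produced by the step before it.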
Build Your Own Video Editing Pipeline
The tools for trimming, captioning, and manipulating videos can be combined to create a wide variety of automated editing workflows.
- 📣 Marketing: Create social media-ready highlights from long-form interviews or product demos.
- 🎓 Education: Generate short, subtitled educational clips from lengthy lectures or documentaries.
- Content Moderation: Isolate specific segments of a video for review based on transcript content.
- 🎬 Media & Entertainment: Quickly assemble rough cuts or trailers by stitching multiple clips together.
- ✍️ Accessibility: Batch process video libraries to add subtitles, making them accessible to a wider audience.
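One way to think about combining the tools is as a per-video plan that includes or skips steps depending on the use case. The sketch below only orders the tool names mentioned in this guide. The `Job` class, the `plan` function, and the example URLs are hypothetical, and no actual tool calls are made.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Job:
    """A hypothetical description of what to do with one video."""
    url: str
    trim: Optional[Tuple[str, str]] = None  # (start, end) timestamps, if any
    captions: bool = True


def plan(job: Job) -> List[str]:
    """Return the ordered tool names for one job."""
    steps = ["put_file_url"]
    if job.trim is not None:
        steps.append("trim_video")
    if job.captions:
        steps.append("add_video_captions")
    steps.append("preview_object_ref")
    return steps


# Accessibility batch: caption whole videos, no trimming (URLs are made up).
library = [Job(f"https://example.com/lecture-{n:02d}.mp4") for n in range(1, 4)]
# Marketing highlight: trim first, then caption.
highlight = Job("https://example.com/demo.mp4", trim=("00:30", "00:45"))

for job in library + [highlight]:
    print(job.url, "->", " > ".join(plan(job)))
```

Keeping the plan as plain data makes it easy to review a batch before running it.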