Core concepts and components of the VLM Run Python SDK
- document.invoice - Extract data from invoices
- image.caption - Extract a caption from the image
- audio.transcription - Transcribe spoken content
- video.dashcam-analytics - Analyze dashcam footage

The VLMRun client gives you access to all functionality:
Predictions return a PredictionResponse with a consistent structure:
- enqueued - Waiting to be processed
- pending - Ready to start processing
- running - Currently being processed
- completed - Processing finished successfully
- failed - Processing encountered an error
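
The status values above split into in-progress states (enqueued, pending, running) and terminal states (completed, failed). The following is a minimal sketch of how calling code might act on that split. It does not use the SDK itself: `Status`, `is_terminal`, `wait_until_done`, and the `fetch_status` callable are hypothetical names introduced for illustration, and `fetch_status` stands in for however your code re-fetches a prediction's current status.

```python
import time
from enum import Enum


class Status(str, Enum):
    """The five status values listed above (hypothetical helper enum)."""
    ENQUEUED = "enqueued"
    PENDING = "pending"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"


# Statuses after which no further processing happens.
TERMINAL = {Status.COMPLETED, Status.FAILED}


def is_terminal(status: str) -> bool:
    """Return True once processing has finished, successfully or not."""
    # Status(...) raises ValueError for a string outside the five values.
    return Status(status) in TERMINAL


def wait_until_done(fetch_status, max_polls: int = 10, delay: float = 0.0) -> str:
    """Poll until a terminal status is seen, then return it.

    fetch_status is an assumed zero-argument callable that returns the
    current status string; it is not part of the SDK.
    """
    for _ in range(max_polls):
        status = fetch_status()
        if is_terminal(status):
            return status
        time.sleep(delay)  # wait before checking again
    raise TimeoutError("prediction did not finish within max_polls")


# Usage with a simulated sequence of status values.
states = iter(["enqueued", "pending", "running", "completed"])
print(wait_until_done(lambda: next(states)))  # completed
```

Treating both completed and failed as terminal keeps the polling loop from waiting on a prediction that has already errored; the caller can then branch on the returned value.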