The client.audio object allows you to process audio files and extract structured data.

This feature is currently available only to enterprise-tier customers. To request access, please contact us.

Generate Predictions

from vlmrun.client import VLMRun
from vlmrun.client.types import PredictionResponse

# Initialize the client
client = VLMRun()

# Process an audio file using the predefined "audio.transcription" domain schema
response: PredictionResponse = client.audio.generate(
    audio="path/to/audio.mp3",
    domain="audio.transcription",
)
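
When the audio path comes from user input, it can help to check it before making the request. The following is a minimal sketch, not part of the SDK: `generate_from_file` is a hypothetical helper name, and it assumes only that `client.audio.generate` accepts the `audio` and `domain` keyword arguments shown in the example above.

```python
from pathlib import Path


def generate_from_file(client, audio_path, domain="audio.transcription"):
    """Validate a local audio path, then call client.audio.generate().

    Hypothetical helper for illustration; it is not provided by the SDK.
    """
    path = Path(audio_path)
    if not path.is_file():
        # Fail early with a clear message instead of sending a bad path
        raise FileNotFoundError(f"Audio file not found: {path}")
    # Same keyword arguments as the example above (assumed signature)
    return client.audio.generate(audio=str(path), domain=domain)
```

With a client initialized as above, this would be called as `generate_from_file(client, "path/to/audio.mp3")`.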

Get Usage

from vlmrun.client.types import CreditUsage

# `response` is the PredictionResponse returned by client.audio.generate() above
usage: CreditUsage = response.usage
print(usage)
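
If you log usage for many requests, a small wrapper can keep the output uniform. This is a sketch under one assumption that the source does not confirm: that `usage` may be missing or `None` on some responses. `format_usage` is a hypothetical helper, not an SDK function.

```python
def format_usage(response):
    """Return a printable usage line, tolerating a missing usage field."""
    # Assumption: `usage` may be absent or None on some responses
    usage = getattr(response, "usage", None)
    if usage is None:
        return "usage: not reported"
    return f"usage: {usage}"
```

Calling `print(format_usage(response))` then behaves like the `print(usage)` call above, with a fallback message when no usage is attached.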