Media Handling
Images, audio, video, and media support in OpenClaw
Images, audio, video, and media support in OpenClaw
OpenClaw supports rich media across all channels. Send and receive images, audio files, videos, and documents. The media pipeline handles processing, transcription, and storage efficiently.
OpenClaw can handle images in multiple ways:
Images are processed to:
Audio handling includes:
Configure transcription hooks to automatically transcribe voice notes:
{
"hooks": {
"transcription": {
"enabled": true,
"provider": "whisper"
}
}
}
Transcribed text is processed as regular messages, allowing you to interact via voice.
Video handling capabilities:
The media pipeline handles:
Configure media size limits:
{
"media": {
"maxSize": "10MB",
"imageMaxSize": "5MB",
"audioMaxSize": "10MB",
"videoMaxSize": "50MB"
}
}
OpenClaw provides tools for media handling: