Chatzuri
Pricing
Guides
Guides
Introduction
  1. 1Getting Started
  2. 2Your Agent
    • Create an agent
    • AI configuration
    • Personality and presentation
    • Customise the chat widget
    • Test in the Playground
    • Voice, video, images, and files
    • How memory works
    • Multi-agent orchestration
    • Performance and cost controls
    • Security and access control
    • Compliance and guardrails
    • Improve response quality
  3. 3Knowledge & Sources
  4. 4Agent actions & Tools
  5. 5Agent Tasks
  6. 6WorkflowsBeta
  7. 7Channels
  8. 8Customers & Conversations
  9. 9Run Your Team
  10. 10Developer Tools
Developer reference →Showcase →
Guides2. Your AgentVoice, video, images, and files
Chapter 2 · Your Agent

Voice, video, images, and files

Turn on multimodal capabilities so customers can speak, send pictures, or share documents.

4 min read

Multimodal means your agent can do more than text. Customers can speak to it, send pictures, share documents, or even video — and your agent can respond in kind.

Find these settings under Settings → Multimodal on any agent.

Voice

Turn on Voice to let customers send audio messages and hear the agent's reply spoken back. Pick a voice from the dropdown — preview each one to find the right tone.

  • Works on the website widget, WhatsApp (voice notes), and any channel that supports audio.
  • Real-time speech: customers can interrupt, just like a phone call.
  • Adds a small per-minute cost — see your team's plan for details.

Images

With Images on, customers can attach photos to a message. Your agent can describe what it sees, identify products, read text from a screenshot, or extract details from a receipt.

Useful for:

  • Customer support ("here's the error I'm seeing")
  • E-commerce ("do you have this in red?")
  • Field operations ("this is the part that broke")
  • Education ("can you explain this diagram?")

Files

With Files on, customers can attach PDFs, Word docs, and spreadsheets directly to a chat. Your agent reads the contents and answers questions about them.

Heads up
Files attached in chat are not added to the agent's permanent knowledge — they stay scoped to that conversation. Use the Sources tab for permanent training data.

Video

Video support lets customers share a clip — your agent extracts audio and key frames, then reasons about both. Best for product demos, onboarding walkthroughs, or troubleshooting.

Streaming

Already enabled by default. Replies appear word-by-word so the conversation feels alive. Turn off only for SMS-style channels where partial messages don't make sense.

Channel support

Not every channel supports every modality. Roughly:

  • Website widget — supports everything
  • WhatsApp — text, images, voice notes, files
  • Telegram — text, images, voice notes, files, video
  • SMS — text only (and short text at that)
  • Email — text and file attachments
  • Slack / Messenger / Instagram — text, images, files
Previous · Your AgentTest in the PlaygroundNext · Your AgentHow memory works
Chatzuri

AI-powered agents are transforming customer interactions by providing instant, intelligent responses around the clock. They help businesses reduce operational costs, improve response times, and scale support without compromising quality. These agents understand natural language, learn from conversations, and integrate with existing systems to offer personalized experiences that enhance customer satisfaction and loyalty.

Chatzuri

AI-powered agents are transforming customer interactions by providing instant, intelligent responses around the clock. They help businesses reduce operational costs, improve response times, and scale support without compromising quality. These agents understand natural language, learn from conversations, and integrate with existing systems to offer personalized experiences that enhance customer satisfaction and loyalty.

Product

  • Pricing
  • Security
  • Affiliates

Resources

  • API
  • Guides
  • Blog
  • Help

Company

  • About us
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • DPA

About

  • Teams
  • Singapore, Nairobi

© 2026 Chatzuri. All rights reserved.

Chatzuri uses AI and can make mistakes.

Terms of ServicePrivacy PolicyCookie PolicyChatzuri