Chatzuri
Pricing
Guides
Guides
Introduction
  1. 1Getting Started
  2. 2Your Agent
    • Create an agent
    • AI configuration
    • Personality and presentation
    • Customise the chat widget
    • Test in the Playground
    • Voice, video, images, and files
    • How memory works
    • Multi-agent orchestration
    • Performance and cost controls
    • Security and access control
    • Compliance and guardrails
    • Improve response quality
  3. 3Knowledge & Sources
  4. 4Agent actions & Tools
  5. 5Agent Tasks
  6. 6WorkflowsBeta
  7. 7Channels
  8. 8Customers & Conversations
  9. 9Run Your Team
  10. 10Developer Tools
Developer reference →Showcase →
Guides2. Your AgentPerformance and cost controls
Chapter 2 · Your Agent

Performance and cost controls

Set a fallback model and cap daily or monthly token usage per agent.

4 min read

Two settings on this page protect you from runaway costs and provider outages: a fallback model for when the primary fails, and per-agent token caps that hard-stop usage once thresholds are hit.

Find these settings under Settings → Performance on any agent.

Fallback model

Choose a second model the agent should switch to if the primary fails — for example a cheaper or faster option from another provider.

The fallback kicks in when:

  • The primary model returns an error or rate-limits.
  • The provider is temporarily unavailable.

The retry runs once. If both models fail, the customer sees an error. Token usage and the recorded Message.model reflect the model that actually answered, so analytics stay accurate.

Tip
Picking a different provider for the fallback (e.g. Claude as fallback for GPT-5) protects you against single-provider outages — the most common cause of message failures in production.

Token monitoring

Turn on Enable Token Monitoring to put hard caps on per-agent token usage. Two windows are independently configurable:

  • Daily token limit — resets at 00:00 UTC. Leave empty for no daily cap.
  • Monthly token limit — resets on the 1st of the month at 00:00 UTC. Leave empty for no monthly cap.

Usage is summed from completed messages on this agent. When a cap is reached, the next request is refused with an HTTP 429 status — the customer sees a rate-limit response, and you get clear telemetry that a cap fired rather than a silent failure.

How limits compare

Two layers of usage protection apply in order:

  1. Plan limits (team-wide) — your subscription's monthly message and API quotas. Enforced first.
  2. Agent token caps (per-agent) — only fire if the plan check passed. These let you stop one chatty agent from consuming the team's whole budget.
Note
Token monitoring is a hard cap, not a soft alert. Configure it conservatively at first — you can always raise the ceiling, but if the cap fires mid-customer-conversation the customer is blocked.

Reading usage

Usage is tracked on every message via input/output/total token counts. You can query usage directly from the Messages table, or open the agent's analytics view to see daily totals broken down by model.

Previous · Your AgentMulti-agent orchestrationNext · Your AgentSecurity and access control
Chatzuri

AI-powered agents are transforming customer interactions by providing instant, intelligent responses around the clock. They help businesses reduce operational costs, improve response times, and scale support without compromising quality. These agents understand natural language, learn from conversations, and integrate with existing systems to offer personalized experiences that enhance customer satisfaction and loyalty.

Chatzuri

AI-powered agents are transforming customer interactions by providing instant, intelligent responses around the clock. They help businesses reduce operational costs, improve response times, and scale support without compromising quality. These agents understand natural language, learn from conversations, and integrate with existing systems to offer personalized experiences that enhance customer satisfaction and loyalty.

Product

  • Pricing
  • Security
  • Affiliates

Resources

  • API
  • Guides
  • Blog
  • Help

Company

  • About us
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • DPA

About

  • Teams
  • Singapore, Nairobi

© 2026 Chatzuri. All rights reserved.

Chatzuri uses AI and can make mistakes.

Terms of ServicePrivacy PolicyCookie PolicyChatzuri