Latency

The time between a request reaching an agent and the agent responding or acting on it.

Why it matters

Agent tasks often trade instant responses for more thorough, autonomous work, so the acceptable latency depends on the use case.

In practice

Our chat widget responds in under 2 s for FAQ matches, in 3-5 s when the query goes to Ollama, and in 5-10 s when the Claude API is needed.
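
As a rough illustration, the figures above can be treated as per-tier latency budgets and checked against measured response times. The sketch below is not the widget's actual code: the tier names, `LATENCY_BUDGET_S`, `timed_call`, `within_budget`, and `fake_faq_handler` are all hypothetical, and the upper end of each quoted range is assumed to be the budget.

```python
import time
from typing import Callable, Tuple

# Assumed budgets in seconds: the upper end of each range quoted above.
LATENCY_BUDGET_S = {"faq": 2.0, "ollama": 5.0, "claude": 10.0}


def timed_call(handler: Callable[[str], str], query: str) -> Tuple[str, float]:
    """Run handler(query) and return (reply, elapsed seconds)."""
    start = time.perf_counter()  # monotonic clock, suited to measuring intervals
    reply = handler(query)
    elapsed = time.perf_counter() - start
    return reply, elapsed


def within_budget(tier: str, elapsed_s: float) -> bool:
    """True if the measured latency does not exceed the tier's budget."""
    return elapsed_s <= LATENCY_BUDGET_S[tier]


def fake_faq_handler(query: str) -> str:
    # Hypothetical stand-in for a real FAQ lookup.
    return "See our pricing page."


reply, elapsed = timed_call(fake_faq_handler, "How much does it cost?")
print(f"faq tier took {elapsed:.4f}s, within budget: {within_budget('faq', elapsed)}")
```

Timing the whole call this way measures what the user actually waits for, including any network round trip inside the handler.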
