Why In-App AI Agents Need Guardrails

The gap between "chat" and "do"

Every SaaS app is adding a chatbot. Most of them answer questions. A few can search your data. Almost none of them can do anything.

There's a good reason for that: letting an AI execute actions on behalf of a user is scary.

The models are actually quite good at understanding intent. The problem is blast radius. A wrong API call can create duplicate records, send an email to the wrong person, or delete something that shouldn't be deleted. Unlike a human clicking the wrong button, an agent can make mistakes at scale, confidently, without hesitation.

Why "just add tool calling" isn't enough

If you've used function calling or tool use in any LLM API, you know the basic pattern: describe your tools, let the model pick one, execute it, return the result. Works great in demos.

Production is not a demo. In production you need to answer:

Which endpoints can the agent call? Not all of them. Your API might have 200 endpoints, but the agent should have access to 15.
Who is the agent acting as? If it's using a service account, you've bypassed all your permission logic. It needs to run as the actual user.
What happens before a destructive action? The user should see exactly what's about to happen and say yes or no.
What happens after? Every tool call should be logged: what was called, what parameters, what the response was, how long it took. When something goes wrong, you need to replay and debug.

The guardrail stack

At Agent Koan, we think about safety as a stack:

Allowlists. Only endpoints you explicitly enable can be called. Everything else is off-limits. Read-only by default.
Risk levels. Each tool gets a risk level (low, medium, high). Low-risk tools execute right away. Medium and high require confirmation.
Confirmation UX. The widget shows a preview card with every field the agent is about to submit. The user confirms or rejects.
User-scoped auth. Every API call uses the end user's token. The agent can only do what the user can do.
Audit logs. Full request/response logging with latency metrics. Replay any tool call from the dashboard.

None of these are new ideas. They're borrowed from API gateways, RBAC systems, and SOC2 audit requirements. We just package them into a drop-in SDK so you don't have to build them yourself.

The point

When you have guardrails you trust, you can actually ship the feature. Your security team signs off. Your customers opt in. Your users get a copilot that doesn't just talk, it acts.

That's what Agent Koan is for. Import your schema. Configure your guardrails. Embed the widget. Ship it in a day.