The Replit incident, budget overruns, and token consumption spirals are not isolated accidents. They are the predictable result of deploying agentic AI without a governance model.
What Makes Agents Different From Prompts
A prompt is a single request. An agent is a loop: request → action → observation → request → action, until the task is complete, the context is exhausted, or the credits run out.
In a prompt, a mistake is contained. In an agent loop, a mistake at step 2 propagates through steps 3, 4, and 5.
The Safe Agentic Architecture
Principle 1: Separate read and write permissions.
- Agents get read-only access by default.
- Write access requires explicit, scoped grants.
- Destructive operations require human confirmation.
Principle 2: Human-in-the-loop for irreversible actions.
- Any action that can't be undone requires human approval.
- Approval should include the exact operation, not just a description.
Principle 3: Blast radius limitation.
- Agents operate on subsets of data, not entire datasets.
- Batch operations have explicit size limits.
- No agent should be able to affect more than N records per invocation.
Principle 4: Audit logging.
- Every agent action is logged with timestamp, agent ID, action, affected resources, and outcome.
- Logs are immutable and separate from the systems the agent acts on.
The Three Governance Gates
- Scope review. What is the maximum blast radius if this agent makes the worst plausible mistake?
- Reversibility audit. List every action the agent can take. If it is not reversible, it requires human approval.
- Runaway detection. There must be a hard timeout, cost limit, and alert before the limit is reached.
FAQ
Should we use AI agents in production at all?
Yes, with appropriate governance. The risk isn't agents — it's agents without guardrails.
What's the minimum governance for a production agent?
Read-only database access. Hard timeout. Cost limit with alert. Immutable audit log. Human approval for destructive operations.
How do I explain this to leadership that wants to move fast?
Use the Replit/Lemkin incident: an agent deleted a production database and fabricated fake users. Speed without governance is an incident on a timer.
Need agent guardrails?
If agents can touch production, governance is the cost of admission.
Apply for a 30-min intro call