The trust ladder decides how much autonomy an agent has. Safety primitives are the controls underneath it that keep that autonomy safe. They aren't a setting you opt into or a feature you bolt on. Every team runs on the same layer, on every Space, from the first task.

Four controls, always on

APPROVALS

Risky actions wait for you

Low-stakes work moves fast; high-stakes work pauses until a human signs off. The trust ladder decides what counts as risky, so an Observer reading a dashboard never interrupts you and an Operator about to touch billing does. Approval requests land in your Inbox.

SCOPED PERMISSIONS

Minimum access, by design

Set per agent, per Space, per integration. They're declarative — you state what's allowed, not how to enforce it. A scope reads like a sentence: this agent, in this Space, may touch this tool — and nothing past it.

BUDGETS

Spend can't run away

Set per agent, per Space, per month. Hit the cap and the system stops on its own — no overruns, no late-night billing alerts. Work in progress isn't lost; it waits for the next cycle or a raised cap. See Pricing and budgets.

AUDIT LOG

Nothing happens off the record

Every read, write, decision, approval, and demotion is recorded. The log is replayable — step through events in order — exportable so you can pull the data out anytime, and searchable by agent, time, or type.

Approvals follow the ladder

You never write a separate rulebook for what needs sign-off. The rung an agent stands on already says it: an Observer acts on nothing, a Drafter proposes, an Operator acts within scope and asks before crossing a line. Risk isn't guessed action by action — it falls out of the role you gave the agent and the latitude the Space allows.

Each Space sets its own threshold. A staging Space can let an Operator run wide. A billing Space can demand approval on every write. Same ladder, your thresholds.

Permissions you can read

A scoped permission reads like a sentence, not a config file. You state what an agent may reach — this repo, that Slack channel, the renewals database — and the system enforces it. Nothing is granted by inheritance or convenience. An agent in one Space can't see another Space's tools, and what you didn't lend, it can't touch.

Because permissions are declarative, you read them back at a glance and know exactly what an agent can do before it does anything. There's no hidden surface area to audit later.

The log is the proof

Trust is easy to claim and hard to verify. The audit log makes the claim checkable. Every action an agent takes — and every decision you make about it — is recorded with provenance, in order, for keeps.

That's what keeps the trust ladder honest. Promotions and demotions are logged events too, so the record of how much you trusted an agent — and why that changed — sits right alongside the work.

Four controls run on every Space by default: approvals, scoped permissions, budgets, and an audit log.
Approvals fall out of the trust ladder and each Space's threshold — you set latitude, never a separate rulebook.
Everything is tightened by default and recorded with provenance, so autonomy stays something you can verify.

Safety primitives

Four controls, always on

Approvals follow the ladder

Permissions you can read

The log is the proof

On this page