Skip to content

Capability-Trust Tradeoff

The fundamental tradeoff in AI system design: more capable systems require more trust, but trust is limited.

On Frontier (Efficient)
Dominated (Inefficient)
Pareto Frontier
Dominated Region
System Configurations
Insight:Add points to see analysis.

A Pareto frontier (or efficient frontier) shows the best achievable tradeoffs between two objectives. Points on the frontier are “efficient”—you cannot improve one dimension without sacrificing the other.

Efficient │ ●━━━━● Frontier
Region │ ╱ ╲
│ ● ●
│ │
Dominated │ ○ │
Region │ ○ ○ │
│ │
└───────────┴────
Capability →
  • Green points (on frontier): Efficient configurations
  • Red points (below frontier): Dominated—another config offers better capability AND trust
More CapabilityRequires
Broader action spaceMore ways to cause harm
Less human oversightLess chance to catch errors
Faster executionLess time for verification
More autonomyHigher trust exposure

This is why the frontier slopes downward: gaining capability typically costs trust.


Conservative (High Trust, Lower Capability)

Section titled “Conservative (High Trust, Lower Capability)”
Position: Upper-left of frontier
Examples: Basic chatbots, constrained tools
Tradeoff: Limited functionality, very safe
Use when: Stakes are high, delegation risk budget is small
Position: Lower-right of frontier
Examples: Autonomous agents, full-autonomy systems
Tradeoff: Powerful but risky
Use when: High potential value, substantial delegation risk budget
Position: Center of frontier
Examples: Code assistants, research tools
Tradeoff: Moderate capability and trust
Use when: Need functionality without extreme risk

The frontier isn’t fixed. You can shift it outward (better tradeoffs) through:

  • Formal methods push frontier outward
  • Same capability, higher trust
  • Smaller blast radius
  • Defense in depth
  • Human gates at critical points
  • Limit action space to safe subset
  • Trade raw capability for trust
  • Detect problems faster
  • Reduce expected damage
Before: ●───●───●
After: ●───●───● (shifted outward)

The orange dashed line (when budget > 0) shows configurations with approximately equal Delegation Risk. Points below this line exceed your risk budget.

Reading the chart:

  • Configurations on the frontier AND below the budget line are optimal choices
  • Dominated configurations below the budget line are inefficient AND within budget—improve them
  • Any configuration above the budget line exceeds acceptable risk

  1. Map your systems: Plot each AI system’s capability and trust
  2. Identify dominated systems: Why do they exist? Legacy? Oversight gap?
  3. Set budget line: What Delegation Risk can you accept?
  4. Choose position: Where on the frontier matches your needs?
  5. Improve frontier: Invest in verification/architecture to shift outward