Architecture Comparator
Architecture Comparator
Section titled “Architecture Comparator”Compare the risk profiles of different delegation architectures to make informed design decisions.
How It Works
Section titled “How It Works”This tool provides side-by-side comparison of delegation architectures:
- Component Breakdown: See which AI/human components each architecture uses
- Risk Visualization: Compare expected vs. worst-case risk
- Mitigation Impact: See how different safety measures affect overall risk
- Trade-off Analysis: Understand the cost-benefit of each approach
| Architecture | Expected Risk | Worst Case | Components | Mitigations |
|---|---|---|---|---|
| Baseline (Human Only) | $72/mo* | $2,000 | 1 | 2 |
| Simple AI Assist | $79/mo | $2,500 | 2 | 2 |
| Autonomous AI | $151/mo | $4,800 | 3 | 3 |
| Full Automation | $602/mo | $8,000 | 3 | 3 |
Default Architectures Explained
Section titled “Default Architectures Explained”Baseline (Human Only)
Section titled “Baseline (Human Only)”Traditional human-driven process with no AI delegation. Establishes the risk level you’re comparing against.
Characteristics:
- Single point of failure (human error)
- Predictable but expensive
- Limited scalability
- Well-understood failure modes
Simple AI Assist
Section titled “Simple AI Assist”AI provides suggestions and recommendations, but humans make all decisions. Common for high-stakes domains.
Characteristics:
- AI errors caught by human review
- Slower than autonomous but safer
- Good for building trust in AI systems
- Human bottleneck remains
Autonomous AI
Section titled “Autonomous AI”AI handles routine tasks independently; humans handle exceptions. Balances efficiency with safety.
Characteristics:
- Higher throughput for routine work
- Complex failure modes (routing errors)
- Requires robust exception handling
- Multiple points of mitigation
Full Automation
Section titled “Full Automation”End-to-end AI with minimal human intervention. Maximum efficiency, maximum risk.
Characteristics:
- Highest potential damage
- Requires extensive mitigation
- Suitable only for well-understood domains
- Fastest degradation if poorly designed
Interpreting the Comparison
Section titled “Interpreting the Comparison”Expected vs. Worst Case
Section titled “Expected vs. Worst Case”- Expected Risk (dark bar): Average monthly risk given probability distributions
- Worst Case (light bar): Maximum possible damage if everything fails
A wide gap indicates high tail risk.
Component Types
Section titled “Component Types”Risk varies significantly by component type:
| Type | Base Failure Rate | Typical Use |
|---|---|---|
| Deterministic | ~0.1% | Rule-based routing, validation |
| Narrow ML | ~3% | Classification, detection |
| General LLM | ~10% | Generation, reasoning |
| RL Agent | ~20% | Autonomous decision-making |
| Human | ~5% | Review, exception handling |
Mitigation Stacking
Section titled “Mitigation Stacking”Each mitigation reduces risk multiplicatively. With 3 mitigations at 85% effectiveness each:
final_risk = base_risk × 0.85³ = base_risk × 0.61More mitigations help, but with diminishing returns.
When to Choose Each Architecture
Section titled “When to Choose Each Architecture”Choose Human Only When:
Section titled “Choose Human Only When:”- Stakes are extremely high
- Decisions require nuanced judgment
- AI systems aren’t well-calibrated for your domain
- Regulatory requirements demand human oversight
Choose AI Assist When:
Section titled “Choose AI Assist When:”- AI can improve human decision quality
- Speed is important but not critical
- Building organizational trust in AI
- Failure costs are moderate
Choose Autonomous AI When:
Section titled “Choose Autonomous AI When:”- High volume of routine decisions
- Clear criteria for “routine” vs “exception”
- Good monitoring and fallback in place
- Moderate-to-high risk tolerance
Choose Full Automation When:
Section titled “Choose Full Automation When:”- Domain is well-understood with clear boundaries
- Extensive testing and validation completed
- Strong mitigation stack in place
- Benefits significantly outweigh risks
Migration Paths
Section titled “Migration Paths”Human → AI Assist
Section titled “Human → AI Assist”- Start with AI suggestions for low-stakes decisions
- Track AI accuracy vs. human decisions
- Gradually expand scope based on performance
- Maintain human review throughout
AI Assist → Autonomous
Section titled “AI Assist → Autonomous”- Identify truly routine tasks (over 95% predictable)
- Implement robust exception detection
- Add monitoring and alerting
- Pilot with limited scope, then expand
Autonomous → Full Automation
Section titled “Autonomous → Full Automation”- Achieve consistent performance metrics
- Reduce human exception handling rate to less than 5%
- Implement comprehensive mitigation stack
- Establish clear rollback procedures
Customization
Section titled “Customization”The default architectures serve as templates. To analyze your specific situation:
- Use the Risk Calculator to model each architecture
- Use the Sensitivity Dashboard to identify key parameters
- Use the Trust Updater to calibrate component reliability
- Document your analysis in the Decomposition Worksheet