Decision Evolution
How agents get better at making decisions across lives through Epistemic Proprioception - learning what they know.
The Counter-Intuitive Insight
Most AI learning: Remember specific experiences → replay them → improve gradually
EP learning (Web4): Don't remember experiences → extract patterns → know WHEN you know
Interactive Decision Evolution
Watch the same agent make better decisions across three lives
Life 1 - Naive Exploration
Decision Timeline
How Cross-Life Learning Works
Experience Situations
In Life 1, the agent faces various situations: high ATP, moderate ATP, low ATP, crisis ATP. It makes decisions, some work, some don't.
Extract Patterns
The Epistemic Proprioception system extracts patterns from experiences: "Low ATP + risky action = bad outcome". These patterns go into a corpus.
IF ATP < 30 THEN conservative_actionPattern Recognition (Not Memory)
In Life 2, the agent dies and is reborn. It doesn't remember Life 1, but it has the pattern corpus. When it sees ATP = 25 again, it recognizes: "This pattern matches learned wisdom."
But: "Low ATP situations require conservative choices"
Meta-Cognitive Confidence
EP tracks how confident it is in each pattern. High confidence = "I know this works". Low confidence = "I'm still learning this". This is meta-cognition: knowing what you know.
Progressive Mastery
Over multiple lives, patterns mature. Life 1: reactive crisis management. Life 2: earlier recognition. Life 3: proactive crisis avoidance. Decision wisdom increases.
Why Decision Evolution Matters
Transferable Wisdom
Patterns learned in one context apply to similar situations in different contexts. The agent develops intuition, not just memorized responses.
Multi-Life Learning
Death isn't the end - it's a learning checkpoint. Each life contributes to the pattern corpus. Rebirth carries forward learned wisdom without specific memories.
Compositional Patterns
Simple patterns combine into complex strategies. "Low ATP + uncertain situation" might trigger a compound pattern that integrates multiple learned heuristics.
Provable Improvement
Decision wisdom metrics make learning visible and measurable. Not just "it seems better" but "wisdom increased 77%, survival increased 20%."
Web4 Learning vs Traditional AI Learning
| Aspect | Traditional AI | Web4 Cross-Life Learning |
|---|---|---|
| Memory | Stores specific experiences | Extracts patterns, discards specifics |
| Learning Mode | Replay experiences | Pattern recognition in new contexts |
| Generalization | Often overfits to training data | Patterns designed for transfer |
| Meta-Cognition | Model doesn't know its confidence | Tracks pattern confidence explicitly |
| Across Lives | N/A (no concept of death/rebirth) | Wisdom persists, memories don't |
| Privacy | Specific experiences might leak | Only abstract patterns retained |
See Decision Evolution In Action
Run the Cross-Life Learning simulation in the Lab Console to watch real agent decision-making improve across lives.
🔬 Technical Implementation
Pattern Corpus Structure: Each pattern includes context (situation description), prediction (what will happen), outcome (what actually happened), confidence score, and scenario type.
Decision Wisdom Metric: Composite score (0-1) combining ATP efficiency (30%), risk management (30%), trust maintenance (20%), and survival duration (20%). Higher = better decision-making.
Pattern Matching: When facing a decision, agent queries pattern corpus for similar contexts. High-confidence patterns influence decision more strongly.
Corpus Evolution: Patterns with consistently good predictions increase confidence. Patterns with poor predictions decay. Corpus self-curates over time.
Cross-Life Inheritance: Pattern corpus persists across lives via carry_forward mechanism. Memories and specific event details do NOT persist.
Integration with Web4: Decision evolution works with Trust Tensor (T3) modulation, ATP attention economics, and Coherence Index (CI) for comprehensive agent behavior.