AI Research Analysis
A Mathematical Theory of Agency and Intelligence
This paper introduces "bi-predictability (P)", a principled information-theoretic measure of how much of a system's total information is shared between its observations, actions, and outcomes. It proves regime-dependent bounds: P can reach unity in quantum systems, is capped at P ≤ 0.5 in classical systems, and drops lower still once agency enters, a distinction the authors use to separate agency from intelligence. These bounds are confirmed empirically in a physical system (a double pendulum), in reinforcement learning agents, and in multi-turn LLM conversations. The work also proposes an Information Digital Twin (IDT) architecture, inspired by thalamocortical regulation, that monitors P in real time to enable adaptive, resilient AI.
Executive Impact & Key Findings
Understanding "bi-predictability" offers a new lens for enterprise AI reliability. It provides a first-person metric for an AI system's "grip" on its environment, identifying structural breakdowns before performance degradation becomes critical.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
What is Bi-predictability (P)?
The paper introduces Bi-predictability (P) as the shared fraction of information across observations, actions, and outcomes relative to the loop's total informational budget. It quantifies how tightly two interacting entities constrain one another, independent of the absolute amount of information exchanged. P can reach unity in quantum systems, but is strictly bounded at P ≤ 0.5 in classical systems, and lower once agency (action selection) is introduced.
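The paper's exact estimator for P is not reproduced in this summary, but the stated ceilings are consistent with a symmetric normalization of mutual information. Below is a minimal plug-in sketch for a passive two-variable loop (observations S and outcomes S', no action channel), assuming P = I(S; S') / (H(S) + H(S')); the function names and histogram binning are illustrative choices, not the paper's:

```python
import numpy as np

def entropy_bits(counts):
    """Shannon entropy (bits) of a distribution given as raw counts."""
    p = counts[counts > 0] / counts.sum()
    return -np.sum(p * np.log2(p))

def bipredictability(s, s_next, bins=16):
    """Plug-in estimate of P = I(S; S') / (H(S) + H(S')).

    Under this normalization a perfect classical copy (s_next == s)
    gives exactly 0.5, matching the stated classical ceiling, while
    only super-classical (quantum) correlations could push P toward 1.
    """
    joint, _, _ = np.histogram2d(s, s_next, bins=bins)
    h_joint = entropy_bits(joint)
    h_s = entropy_bits(joint.sum(axis=1))   # marginal H(S)
    h_sn = entropy_bits(joint.sum(axis=0))  # marginal H(S')
    mi = h_s + h_sn - h_joint               # I(S; S')
    total = h_s + h_sn
    return mi / total if total > 0 else 0.0
```

With this normalization a deterministic copy saturates the classical bound, `bipredictability(s, s)` returns exactly 0.5 for any non-constant stream, and independent streams give values near zero.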
Physical Calibration: Double Pendulum
On a deterministic physical system (a double pendulum) with no action channel, Bi-predictability (P) approaches the classical ceiling of 1/2 (P ≈ 0.48) with low variance, and predictive asymmetry (∆H) remains near zero. This establishes a calibration point: P sits close to its theoretical maximum in passive, deterministic systems, so any deviation below the ceiling in agentic systems signals the informational cost of action and openness.
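The paper's exact definition of ∆H is not given in this summary; one natural reading, assumed here, is the difference between forward and backward conditional entropies, ∆H = H(S'|S) − H(S|S'), which is near zero for a stationary passive system. A sketch with the same histogram machinery (names and binning are illustrative):

```python
import numpy as np

def entropy_bits(counts):
    """Shannon entropy (bits) of a distribution given as raw counts."""
    p = counts[counts > 0] / counts.sum()
    return -np.sum(p * np.log2(p))

def predictive_asymmetry(s, s_next, bins=16):
    """Assumed form of predictive asymmetry:
    dH = H(S'|S) - H(S|S') in bits. Near zero for stationary passive
    dynamics; nonzero when one temporal direction is more predictable.
    """
    joint, _, _ = np.histogram2d(s, s_next, bins=bins)
    h_joint = entropy_bits(joint)
    h_s = entropy_bits(joint.sum(axis=1))   # H(S)
    h_sn = entropy_bits(joint.sum(axis=0))  # H(S')
    # H(S'|S) = H(S,S') - H(S); H(S|S') = H(S,S') - H(S')
    return (h_joint - h_s) - (h_joint - h_sn)
```

Running this on a closed chaotic map (a logistic-map stand-in, since the pendulum integrator is not reproduced here) yields ∆H ≈ 0, matching the calibration behavior described above.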
Information Digital Twin (IDT) Process
Agency vs. Intelligence Conditions
| Condition | Criterion | Evidence | Achieved (RL Agents) | Achieved (LLM Agents) |
|---|---|---|---|---|
| Agency: Choice | H(A|S) > 0 | Stochastic policies (SAC, PPO); Stochastic sampling (temperature > 0) | ✓ Yes | ✓ Yes |
| Agency: Effect | MI(A; S'|S) > 0 | Actions influence outcomes; Responses influence subsequent context | ✓ Yes | ✓ Yes |
| Agency: Asymmetry | \|∆H\| > 0 | ∆H = -0.56 ± 0.22; ∆H < 0 across all conditions | ✓ Yes | ✓ Yes |
| Intelligence: Learning | ↑ MI(S, A; S') towards objective | Trained on (S, A, S', R) to maximize reward; Trained on token sequences to predict next token | ✓ Yes | ✓ Yes |
| Intelligence: Self-monitoring | Computes P from own stream | No internal P computation | ✗ No | ✗ No |
| Intelligence: Adaptation | Adjusts {S}, {A}, {S'} | Spaces fixed by designers; Vocabulary and generation parameters fixed by designers/users | ✗ No | ✗ No |
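The Choice criterion H(A|S) > 0 can be checked directly for a softmax policy or sampler: any temperature above zero leaves residual action entropy, while greedy decoding collapses it to zero. A small illustrative sketch (not from the paper):

```python
import numpy as np

def policy_entropy(logits, temperature):
    """H(A|S) in bits for a softmax policy over one state's logits.

    As temperature -> 0 the policy collapses to argmax, so H(A|S) -> 0
    and the Choice criterion fails; any temperature > 0 keeps H(A|S) > 0.
    """
    z = logits / max(temperature, 1e-12)
    z = z - z.max()            # stabilize the exponentials
    p = np.exp(z)
    p /= p.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))
```

For example, a three-action state with logits (2, 1, 0) retains over a bit of action entropy at temperature 1, but essentially none at temperature ≈ 0.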
RL Agent Performance
The study found that RL agents (HalfCheetah) exhibit P ≈ 0.33 ± 0.02 and ∆H = -0.56 ± 0.22 under normal operation, placing them in the agentic regime below the classical ceiling. The IDT detected 89.3 ± 15.1% of perturbations, significantly outperforming reward-based detection (44.0 ± 26.1%), and caught degradation 4.4 times sooner (median latency 42 windows vs. 184 for reward). Because the IDT tracks coupling integrity at the transition level, it flags structural breakdowns before performance visibly degrades.
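The detection mechanics described above can be sketched as a sliding-window monitor: estimate P per window of one-step transitions and flag windows that fall well below a baseline. Everything here, the window length, the baseline-and-sigma rule, and the estimator itself, is an illustrative assumption rather than the paper's IDT:

```python
import numpy as np

def entropy_bits(counts):
    p = counts[counts > 0] / counts.sum()
    return -np.sum(p * np.log2(p))

def bipredictability(s, s_next, bins=12):
    """Assumed estimator: P = I(S; S') / (H(S) + H(S'))."""
    joint, _, _ = np.histogram2d(s, s_next, bins=bins)
    h_s = entropy_bits(joint.sum(axis=1))
    h_sn = entropy_bits(joint.sum(axis=0))
    mi = h_s + h_sn - entropy_bits(joint)
    total = h_s + h_sn
    return mi / total if total > 0 else 0.0

def idt_monitor(stream, window=200, n_baseline=5, k=3.0):
    """Per-window P over one-step transitions; flag any window whose P
    drops more than k sigma below the mean of the baseline windows."""
    ps = []
    for i in range(0, len(stream) - window - 1, window):
        ps.append(bipredictability(stream[i:i + window],
                                   stream[i + 1:i + window + 1]))
    ps = np.asarray(ps)
    mu, sd = ps[:n_baseline].mean(), ps[:n_baseline].std() + 1e-6
    return ps, ps < mu - k * sd
```

Feeding in a stream whose dynamics collapse into noise partway through makes the later windows' P fall sharply below baseline, which is the transition-level breakdown signal the section describes, available well before a reward curve reflects it.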
LLM Interactions: Agency but Not Intelligence
LLM agents satisfy the agency criteria: stochastic sampling provides choice, responses shape subsequent context, and they exhibit persistent predictive asymmetry (∆H < 0). They also learn, by training to predict next tokens. However, they neither self-monitor coupling quality (P) nor adapt their observation/action spaces, so they fall short of the paper's proposed definition of intelligence. An IDT operating on raw token statistics can fill this gap, providing real-time feedback for reflexive modulation without heavy semantic evaluation: under perturbation, an LLM could restore stability through context gating or parameter adjustment rather than relying solely on fixed next-token probabilities.
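What reflexive modulation could look like in the LLM setting is a controller that reads the IDT's running estimate of P and nudges the sampling temperature. The rule, thresholds, and function name below are hypothetical illustrations of the idea, not a mechanism from the paper:

```python
def reflexive_temperature(p_hat, p_baseline, temp,
                          step=0.1, t_min=0.2, t_max=1.2):
    """Hypothetical modulation rule: when the measured coupling p_hat
    falls well below its baseline, cool the sampler to restore
    predictability; once coupling recovers, relax back toward normal.
    (Context gating could be triggered from the same signal.)
    """
    if p_hat < 0.8 * p_baseline:
        return max(t_min, round(temp - step, 3))
    return min(t_max, round(temp + step, 3))
```

For instance, a measured P of 0.10 against a baseline of 0.30 cools a temperature of 1.0 down to 0.9, while a recovered P lets it drift back up, bounded by `t_min` and `t_max`.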
Project Your AI's ROI Potential
Estimate the potential time savings and financial return by implementing self-monitoring and adaptive AI systems, leveraging insights from bi-predictability.
Your Path to Intelligent AI
A phased approach to integrate bi-predictability monitoring and adaptive intelligence into your AI ecosystem.
Phase 1: Discovery & Baseline
Assess existing AI systems, identify critical interaction loops (S, A, S'), and establish baseline P and ∆H metrics. Define initial monitoring objectives.
Phase 2: IDT Integration
Develop and deploy Information Digital Twins to continuously monitor bi-predictability and predictive asymmetry. Integrate real-time anomaly detection.
Phase 3: Adaptive Modulation
Implement reflexive modulation mechanisms (e.g., context gating, parameter adjustment) to automatically adapt AI behavior based on IDT signals, ensuring resilience.
Phase 4: Continuous Optimization
Iteratively refine IDT models and adaptive strategies, expanding the scope of self-monitoring and adaptation across the enterprise AI landscape.
Ready to Build Resilient, Intelligent AI?
Our experts are ready to help you implement bi-predictability and Information Digital Twins to enhance the reliability and adaptability of your enterprise AI.