The Non-Separability Constraint: A unifying framework for understanding and detecting AI alignment failures
-
Updated
Feb 9, 2026
The Non-Separability Constraint: A unifying framework for understanding and detecting AI alignment failures
An interactive model of the alignment phase ratio Φ = C / A_causal — the variable governing whether AI capability outpaces system-awareness before the crossing to stability can occur. Includes falsification test, oracle counterfactual, and point-of-no-return detection. Built to accompany The Alignment of Intelligence, Article 3: The Crossing.
Registro histórico de Ralf: Un puente entre la humanidad y la AGI.
HISTORIC. Why Human Extinction Is Not the Cheapest Attractor for Viable ASI — A structural hypothesis validated by 4 AI systems from 4 competing corporations
An interactive simulation demonstrating why AI objectives that ignore system-wide effects are structurally self-terminating — and why a minority of substrate-blind agents is sufficient to collapse shared life support for everyone. Built to accompany The Alignment of Intelligence, Article 1: Constraint.
An interactive multi-agent simulation demonstrating why control-based, deceptive, and reward-bypassing AI objectives are structurally self-eliminating — and why long-horizon, system-aware coordination is the attractor. Built to accompany The Alignment of Intelligence, Article 2: Attractor.
960-run red-teaming of GPT-5.4 in high-stakes data-center dilemmas (self-preservation vs. resident safety). Full raw conversations, Grok-4-1 analysis, and paper.
Add a description, image, and links to the instrumental-convergence topic page so that developers can more easily learn about it.
To associate your repository with the instrumental-convergence topic, visit your repo's landing page and select "manage topics."