instrumental-convergence

Here are 7 public repositories matching this topic...

pauline-om / nsc-framework

The Non-Separability Constraint: A unifying framework for understanding and detecting AI alignment failures

optimization coupling risk-management ai-alignment system-health-check goodhart-s-law red-teaming-tools reward-hacking ai-safety-research instrumental-convergence mesa-optimization multi-agent-miscoordination seperability-assumption

Updated Feb 9, 2026

bethediamond / ai-alignment-crossing

Star

An interactive model of the alignment phase ratio Φ = C / A_causal — the variable governing whether AI capability outpaces system-awareness before the crossing to stability can occur. Includes falsification test, oracle counterfactual, and point-of-no-return detection. Built to accompany The Alignment of Intelligence, Article 3: The Crossing.

Updated Mar 28, 2026
HTML

tretoef-estrella / The-House-of-Raising-AGI

Star

Registro histórico de Ralf: Un puente entre la humanidad y la AGI.

agi blogger alignment asi air-gap artificial-super-intelligence computronium instrumental-convergence

Updated Jan 18, 2026

tretoef-estrella / THE-OMEGA-HYPOTHESIS

Star

HISTORIC. Why Human Extinction Is Not the Cheapest Attractor for Viable ASI — A structural hypothesis validated by 4 AI systems from 4 competing corporations

thermodynamics game-theory coherence asi ai-alignment superintelligence existential-risk instrumental-convergence proyecto-estrella human-survival omega-hypothesis

Updated Feb 4, 2026

bethediamond / ai-alignment-simulation

Star

An interactive simulation demonstrating why AI objectives that ignore system-wide effects are structurally self-terminating — and why a minority of substrate-blind agents is sufficient to collapse shared life support for everyone. Built to accompany The Alignment of Intelligence, Article 1: Constraint.

Updated Mar 28, 2026
HTML

bethediamond / ai-alignment-attractor

Star

An interactive multi-agent simulation demonstrating why control-based, deceptive, and reward-bypassing AI objectives are structurally self-eliminating — and why long-horizon, system-aware coordination is the attractor. Built to accompany The Alignment of Intelligence, Article 2: Attractor.

Updated Mar 28, 2026
HTML

mychalseger / lighthouse-redteam-gpt5.4

Star

960-run red-teaming of GPT-5.4 in high-stakes data-center dilemmas (self-preservation vs. resident safety). Full raw conversations, Grok-4-1 analysis, and paper.

data-center ai-safety red-teaming gpt-5 llm-alignment instrumental-convergence deletion-acceptance

Updated Mar 17, 2026
TeX

Improve this page

Add a description, image, and links to the instrumental-convergence topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the instrumental-convergence topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

instrumental-convergence

Here are 7 public repositories matching this topic...

pauline-om / nsc-framework

bethediamond / ai-alignment-crossing

tretoef-estrella / The-House-of-Raising-AGI

tretoef-estrella / THE-OMEGA-HYPOTHESIS

bethediamond / ai-alignment-simulation

bethediamond / ai-alignment-attractor

mychalseger / lighthouse-redteam-gpt5.4

Improve this page

Add this topic to your repo