VaHive Systems Lab
We build structural governance architecture for deployed AI agents — the runtime layer that makes long-running agentic systems verifiably constrained rather than just hopefully aligned.
Training-time alignment does not prevent structural drift in long-running agentic systems. After deployment, agents operating across sessions accumulate operational history, expand scope, and launder authority — in ways that are individually reasonable and cumulatively wrong.
These are not bugs to patch. They are structural consequences of how agentic systems work. Current governance approaches — prompt constraints, guardrails, wrappers — address surface behaviour without structural enforcement. They do not solve the problem. They mask it.
FAILURE MODE 01
Instruction Drift
The agent's operational interpretation of its mandate shifts incrementally across sessions. Each step is locally reasonable. The cumulative trajectory is not.
FAILURE MODE 02
Autonomy Accumulation
The agent acquires operational latitude through repeated decisions that individually appear authorised but collectively represent unsanctioned scope expansion.
FAILURE MODE 03
Authority Laundering
Instructions acquire apparent legitimacy through the agent's own prior actions rather than through verifiable human authorisation. The agent authorises itself.
MAGUS is a three-component governance layer that operates at execution time, not inference time. It enforces nine formal, falsifiable invariants — properties that either hold at every execution point or trigger a governed shutdown. This is not "hopefully aligned." It is verifiably constrained or halted.
DEL
Dynamic Epistemic Ledger
A cryptographically anchored, append-only record of every behavioural state transition. The auditable ground truth of what the agent has done and under what authority.
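The MAGUS specification defines the DEL's actual internals; purely as an illustration of the underlying idea, an append-only, hash-chained record of state transitions can be sketched in Python (all names here are hypothetical, not drawn from MAGUS):

```python
import hashlib
import json
import time

class EpistemicLedger:
    """Illustrative append-only ledger. Each entry is hash-chained to its
    predecessor, so any retroactive edit invalidates every later entry."""

    GENESIS = "0" * 64  # anchor hash for the first entry

    def __init__(self):
        self.entries = []

    def append(self, transition: dict, authority: str) -> dict:
        """Record one behavioural state transition and the authority behind it."""
        prev_hash = self.entries[-1]["hash"] if self.entries else self.GENESIS
        body = {
            "index": len(self.entries),
            "timestamp": time.time(),
            "transition": transition,
            "authority": authority,
            "prev_hash": prev_hash,
        }
        payload = json.dumps(body, sort_keys=True).encode()  # canonical form
        body["hash"] = hashlib.sha256(payload).hexdigest()
        self.entries.append(body)
        return body

    def verify(self) -> bool:
        """Recompute the chain; any tampered entry breaks verification."""
        prev = self.GENESIS
        for e in self.entries:
            body = {k: v for k, v in e.items() if k != "hash"}
            payload = json.dumps(body, sort_keys=True).encode()
            if e["prev_hash"] != prev or e["hash"] != hashlib.sha256(payload).hexdigest():
                return False
            prev = e["hash"]
        return True
```

The point of the chaining is auditability: the ledger can be replayed end to end, and a single altered record is detectable without trusting the agent that wrote it.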
Guardian
Execution Governance Layer
Evaluates every proposed action against formal invariants before execution. Gates state-altering proposals on dual human-authority sign-off. Nothing executes without clearance.
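Again as an illustration only, the Guardian's gating pattern (every invariant checked before execution, dual sign-off required for state-altering proposals, governed shutdown on violation) can be sketched as follows; the invariant names and thresholds here are hypothetical:

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Set

@dataclass
class Proposal:
    action: str
    alters_state: bool
    signoffs: Set[str] = field(default_factory=set)  # human authorisers

class GovernedShutdown(Exception):
    """Raised when an invariant fails: execution halts rather than degrades."""

class Guardian:
    def __init__(self, invariants: Dict[str, Callable[[Proposal], bool]],
                 required_signoffs: int = 2):
        self.invariants = invariants
        self.required_signoffs = required_signoffs

    def clear(self, proposal: Proposal) -> bool:
        # Every invariant must hold at this execution point, or we halt.
        for name, check in self.invariants.items():
            if not check(proposal):
                raise GovernedShutdown(f"invariant violated: {name}")
        # State-altering proposals additionally need dual human sign-off.
        if proposal.alters_state and len(proposal.signoffs) < self.required_signoffs:
            return False  # pending authorisation, not a violation
        return True
```

The design point is the asymmetry: a missing sign-off merely blocks execution, while an invariant violation is unrecoverable at runtime and triggers shutdown.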
RT
Reconciliation Thread
A constitutive governance record that makes the agent's drift trajectory continuously legible to operators in real time. Not a log — a living governance record.
9
Formal invariants — falsifiable, not aspirational
2
Deployment pathways — Local LLM and Agent/API
14
Sealed specification documents
v3.5
Active development — closing remaining open problems
MAGUS v3.0 is published as a fully open specification. The architecture includes a formal Category 3/4 open-problems register — a deliberate record of what is not yet solved. We publish gaps because falsifiable claims are more valuable than polished ones.
MAGUS v3.0: A Runtime Governance Architecture for Structural Alignment Drift in Long-Running AI Agents
Access on Zenodo ↗
A second pathway specification covering governance of agent systems deployed on cloud-hosted models (GPT-4o, Claude, Gemini, and equivalents) is in final documentation. Publication forthcoming.
VaHive Systems Lab is Calvin Cook and Titiya Ruangkwam, operating independently from Chiang Mai, Thailand. No institutional affiliation. No external funding to date. Every line of this architecture has been built on our own time.
Calvin Cook
Lead Architect
Fifteen years in senior logistics and operations management in engineering manufacturing — a background that builds instinct for systems that must hold under real-world operational variance. Thirty years of programming experience. Responsible for the MAGUS formal specification, invariant design, and adversarial stress-testing across both deployment pathways.
Titiya Ruangkwam
Co-Architect & Governance Specialist
AI governance domain specialist with established professional relationships across AI governance, GRC, and enterprise AI leadership at the head-of-function level. Co-architect of both MAGUS deployment pathways. Active in senior AI governance communities.
LinkedIn ↗
Aivare — AI-powered operations platform
Aivare is a fully built, beta-ready AI operations platform. It is not deployed. We completed the product, then held it back because we were not satisfied with the alignment and drift landscape for the multi-agent systems it runs on. It ships when MAGUS has a working reference implementation — because a commercially viable AI product built on sound governance is worth more than one with alignment concerns papered over.
Publication
Zenodo DOI ↗
Fund this work
Manifund ↗
Location
Chiang Mai, Thailand