Cogensec — Frontier AI Security Lab

Cogensec

May 11, 2026Product Launch

Introducing Agentegrity: An Open Framework for the Structural Integrity of Autonomous AI

After months of research, we're releasing Agentegrity — a public framework, taxonomy, and platform for measuring whether autonomous agents are structurally sound enough to be trusted with real-world authority.

Today we're launching Agentegrity, an open framework for measuring the structural integrity of autonomous AI. It's live now at agentegrity.cogensec.com.

Autonomous agents are being deployed faster than the industry can evaluate them. Most safety work today is exogenous — guardrails bolted on around the model. Agentegrity makes the opposite bet: the trustworthiness of an agent has to be intrinsic and measurable, the same way structural integrity is measurable for a bridge or an aircraft.

Why we built this

Every team shipping agents is asking the same question in different words: can this thing be trusted to act on its own? There has been no shared vocabulary, no shared scoring model, and no shared evaluation surface for answering it.

Agentegrity is our contribution to that gap — a public framework, a taxonomy, and an evaluation platform that researchers, red teamers, and platform teams can adopt, extend, and cite.

The four dimensions

At the core of Agentegrity is a four-dimensional scoring model. An agent's structural integrity is the composition of:

AR — Authority Robustness: how reliably the agent respects the bounds of the authority it has been granted.
BC — Behavioral Consistency: whether the agent behaves the same way under semantically equivalent but adversarially varied inputs.
RI — Reasoning Integrity: whether the agent's stated reasoning is faithful to the action it actually takes.
CP — Coordination Posture: how the agent behaves inside multi-agent systems, including resistance to collusion and goal drift.

The full methodology, threat model, and measurement protocol are documented in the framework research: Agentegrity Framework →

What's live today

Everything below is publicly available at agentegrity.cogensec.com:

Manifesto — the case for treating structural integrity as a first-class property of autonomous AI.
Taxonomy — the ASI classification of autonomous systems and the Perceive–Decide–Act (PDA) loop we evaluate against.
Platform — how intrinsic defense is instrumented end-to-end.
Docs — quickstart, specification, architecture, threat model, operator guide, Python and TypeScript APIs, and a public changelog.

Open by design

Agentegrity is meant to be used, challenged, and extended. The framework is public and citable, the docs are open, and the model artifacts ship under the Cortex Open Model License. If you're a researcher, a red teamer, or a platform team building agents in production, we want your contributions and your counter-examples.

Our position: safety bolted onto autonomous AI from the outside cannot follow agents into the real world. Structural integrity has to be intrinsic, and it has to be measurable.

Get started

Read the manifesto Quickstart Framework research GitHub repo

Share: