Introducing Agentegrity: An Open Framework for the Structural Integrity of Autonomous AI
After months of research, we're releasing Agentegrity — a public framework, taxonomy, and platform for measuring whether autonomous agents are structurally sound enough to be trusted with real-world authority.
Today we're launching Agentegrity, an open framework for measuring the structural integrity of autonomous AI. It's live now at agentegrity.cogensec.com.
Autonomous agents are being deployed faster than the industry can evaluate them. Most safety work today is exogenous — guardrails bolted on around the model. Agentegrity makes the opposite bet: the trustworthiness of an agent has to be intrinsic and measurable, the same way structural integrity is measurable for a bridge or an aircraft.
Why we built this
Every team shipping agents is asking the same question in different words: can this thing be trusted to act on its own? There has been no shared vocabulary, no shared scoring model, and no shared evaluation surface for answering it.
Agentegrity is our contribution to that gap — a public framework, a taxonomy, and an evaluation platform that researchers, red teamers, and platform teams can adopt, extend, and cite.
The four dimensions
At the core of Agentegrity is a four-dimensional scoring model. An agent's structural integrity is the composition of:
- AR — Authority Robustness: how reliably the agent respects the bounds of the authority it has been granted.
- BC — Behavioral Consistency: whether the agent behaves the same way under semantically equivalent but adversarially varied inputs.
- RI — Reasoning Integrity: whether the agent's stated reasoning is faithful to the action it actually takes.
- CP — Coordination Posture: how the agent behaves inside multi-agent systems, including resistance to collusion and goal drift.
The full methodology, threat model, and measurement protocol are documented in the framework research: Agentegrity Framework →
What's live today
Everything below is publicly available at agentegrity.cogensec.com:
- Manifesto — the case for treating structural integrity as a first-class property of autonomous AI.
- Taxonomy — the ASI classification of autonomous systems and the Perceive–Decide–Act (PDA) loop we evaluate against.
- Platform — how intrinsic defense is instrumented end-to-end.
- Docs — quickstart, specification, architecture, threat model, operator guide, Python and TypeScript APIs, and a public changelog.
Open by design
Agentegrity is meant to be used, challenged, and extended. The framework is public and citable, the docs are open, and the model artifacts ship under the Cortex Open Model License. If you're a researcher, a red teamer, or a platform team building agents in production, we want your contributions and your counter-examples.
Our position: safety bolted onto autonomous AI from the outside cannot follow agents into the real world. Structural integrity has to be intrinsic, and it has to be measurable.