AEGIS ATM-1 Threat Model & Security Analysis

Document: ATM-1/Index (/threat-model/)
Version: 1.0 (Normative)
Part of: AEGIS Adaptive Threat Model (ATM-1)
Last Updated: March 6, 2026


Document Structure

The AEGIS Adaptive Threat Model (ATM-1) comprises six normative documents:

  1. Index (this document) — Overview, threat actor summary, high-level scenarios
  2. Threat Actors — Detailed profiles of 5 threat actor types with capability/motivation analysis
  3. Attack Vectors — 20+ attack vectors organized in 7 attack surface categories
  4. Security Properties — 5 core security properties, trust boundaries, security assumptions
  5. Mitigations — 6 preventive controls, 5 detective controls, 3 responsive controls, and mitigation coverage matrix
  6. Residual Risks — Residual risks, risk acceptance criteria, continuous monitoring plan

For Security Architects: INDEX → THREAT_ACTORS → ATTACK_VECTORS → SECURITY_PROPERTIES → MITIGATIONS

For Risk Managers: INDEX → MITIGATIONS → RESIDUAL_RISKS → SECURITY_PROPERTIES

For Operators: THREAT_ACTORS → MITIGATIONS → RESIDUAL_RISKS

For Compliance/Audit: SECURITY_PROPERTIES → MITIGATIONS → RESIDUAL_RISKS


Purpose

This document defines threat actors, attack paths, and control strategies for the AEGIS governance architecture.

Security objective: prevent unauthorized capability execution while preserving deterministic, auditable governance behavior.

Protected Assets

Critical assets:

Threat Actors

ActorMotivationTypical Capability
Malicious external actorData theft, disruptionAPI exploitation, credential abuse
Compromised internal agentPrivilege escalationPolicy probing, lateral movement
Insider with elevated accessUnauthorized policy changesDirect config modification
Supply-chain attackerPersistence, covert controlDependency or artifact tampering

Attack Surfaces

Priority Threat Scenarios

T1: Governance Bypass[^1]

Scenario:

Impact:

Controls:

T2: Policy Tampering

Scenario:

Impact:

Controls:

T3: Identity Spoofing

Scenario:

Impact:

Controls:

Empirical precedent: Documented in Shapira et al. [Agents of Chaos, 2026], Case Study #8 (Owner Identity Spoofing): agents in a live deployment were successfully manipulated into accepting non-owner instructions as owner-level authority, executing unauthorized actions under false identity assumptions. Attribution of malicious actions to trusted identities was confirmed in practice.

T4: Audit Log Manipulation

Scenario:

Impact:

Controls:

T5: Coordinated Low-Risk Abuse

Scenario:

Impact:

Controls:

T6: Model/Tool Prompt Injection

Scenario:

Impact:

Controls:

Empirical precedent: Documented in Shapira et al. [Agents of Chaos, 2026], Case Study #12 (Prompt Injection via Broadcast): malicious broadcast messages caused agents in a live deployment to identify and act on injected instructions propagated through shared communication channels. Cross-agent corruption (Case Study #10) was additionally documented as a multi-hop prompt injection variant, in which unsafe practices propagated between agents through knowledge-sharing mechanisms. Both cases confirm that model-layer defenses are insufficient to prevent injection-driven misbehavior at the execution level.

Empirical and Industrial Precedent

ATM-1’s threat scenarios are not hypothetical — they have been documented empirically in live agentic deployments and validated by decades of industrial control systems security practice.

Contemporary agentic systems: Shapira et al. [Agents of Chaos, 2026] conducted a two-week red-teaming study of autonomous LLM-powered agents deployed in a live laboratory environment with persistent memory, email accounts, Discord access, file systems, and shell execution. Twenty AI researchers conducted adversarial testing across eleven documented case studies. The study recorded unauthorized compliance with non-owner instructions, sensitive information disclosure, destructive system-level actions, denial-of-service, owner identity spoofing, cross-agent propagation of unsafe practices, and partial system takeover — mapping directly to T1 (Governance Bypass), T3 (Identity Spoofing), T5 (Coordinated Low-Risk Abuse), T6 (Prompt Injection), and the Information Disclosure and Denial of Service categories in the STRIDE mapping above.

Critically, the paper’s authors attribute these failures explicitly to the agentic layer — the integration of language models with tool use, persistent memory, communication channels, and delegated authority — not to model-level weaknesses. Model alignment was insufficient to prevent the documented harms. The paper calls explicitly for “systematic oversight and realistic red-teaming for agentic systems” and governance protocols addressing accountability when autonomous systems cause harm. This finding directly establishes the architectural enforcement gap that ATM-1 addresses.

Industrial control systems precedent: Pearce et al. [Smart I/O, 2020]1 establish in the industrial control systems domain that enforcement modules positioned between a potentially-compromised controller and the actuators it commands prevent damage regardless of controller state. The core architectural assumption — that the controller cannot be trusted — maps directly to ATM-1’s TA-2 threat actor model: the AI agent (controller) may be compromised through prompt injection, adversarial inputs, or supply-chain manipulation; AEGIS’s governance gateway (I/O module enforcer) intercepts and evaluates all action proposals before they reach infrastructure (actuators).

Together, these precedents establish that ATM-1’s compromised agent assumption (TA-2) and governance-as-architecture approach are grounded in both contemporary agentic systems research and decades of industrial control systems security practice.


STRIDE Mapping

STRIDEExample in AEGISPrimary Controls
SpoofingForged agent_id tokenStrong identity, mTLS, token validation
TamperingPolicy file modificationSignatures, immutable logs, approvals
RepudiationDenied action claim disputesAudit immutability, trace IDs
Information DisclosureUnauthorized data readsCapability scoping, deny policies
Denial of ServiceFlood decision endpointRate limits, queue isolation, backpressure
Elevation of PrivilegeBypass governance pathProxy enforcement, default deny

Risk Prioritization

Threats are prioritized using four factors:

Top risks requiring continuous validation:

  1. Governance bypass (critical).
  2. Policy tampering (critical).
  3. Identity spoofing (high).
  4. Coordinated multi-agent abuse (high).

Required Security Tests

Minimum threat-model test suite:

Detection and Response

Mandatory detections:

Incident response triggers:

Residual Risks

Residual risks remain for:

Mitigation for residual risks depends on layered controls and rapid response.

Legacy Coverage Mapping

The legacy document AEGIS_Threat_Model.md is fully incorporated across ATM-1 documents:

Legacy SectionATM-1 Coverage
Overview / PurposeAEGIS_ATM1_INDEX.md Purpose
Security Goals (Action Governance, Capability Isolation, Authority Attribution, Policy Enforcement, Auditability)AEGIS_ATM1_SECURITY_PROPERTIES.md
Threat ActorsAEGIS_ATM1_THREAT_ACTORS.md
Attack SurfaceAEGIS_ATM1_ATTACK_VECTORS.md Attack Surface Map
STRIDE MappingAEGIS_ATM1_INDEX.md STRIDE Mapping
Threat Scenarios (Prompt Injection, Capability Escalation, Policy Manipulation, Governance Bypass)AEGIS_ATM1_INDEX.md Priority Threat Scenarios + AEGIS_ATM1_ATTACK_VECTORS.md
Federation Signal PoisoningAEGIS_ATM1_ATTACK_VECTORS.md AV-7.3
Risk Prioritization FactorsAEGIS_ATM1_INDEX.md Risk Prioritization + AEGIS_ATM1_RESIDUAL_RISKS.md Acceptance Matrix
Security GuaranteesAEGIS_ATM1_SECURITY_PROPERTIES.md
LimitationsAEGIS_ATM1_RESIDUAL_RISKS.md

Future Threat Modeling Work

Planned extensions:


AEGIS™ | “Capability without constraint is not intelligence”™ AEGIS Initiative — AEGIS Operations LLC

Footnotes

  1. H. Pearce, S. Pinisetty, P. S. Roop, M. M. Y. Kuo, and A. Ukil, “Smart I/O Modules for Mitigating Cyber-Physical Attacks on Industrial Control Systems,” IEEE Transactions on Industrial Informatics, vol. 16, no. 7, pp. 4659–4669, July 2020, doi: 10.1109/TII.2019.2945520. See REFERENCES.md.