TRACE — Incident Response & Forensics

Why This Matters

Agentic systems — Claude Code, Codex, Gemini CLI, Cursor's agent mode, autonomous MCP-driven loops — now write production code, call tools, spawn sub-agents, and act on behalf of users at scale. The blast radius of a single compromised session has grown accordingly: a prompt-injected agent can read credentials, exfiltrate source, push backdoored commits, and delegate the same compromise to a dozen sub-agents in seconds.

Traditional security tooling wasn't built for this. SIEMs see network flows but not delegation graphs. EDR sees process trees but not chain-of-thought drift. Code review sees diffs but not the prompt that produced them. When an incident happens in agent space, responders end up reconstructing events from screen recordings and shell history.

The data already exists. Every agent governed by VIBES is generating annotations, decision records, delegation traces, and tool-invocation logs. What's missing is a standard for using that data when something goes wrong — turning it into IoCs, evidence bundles, replayable timelines, and portable incident reports.

TRACE is the missing layer between continuous audit and post‑incident response.

Without an IR standard:

Detection is ad-hoc — every team rewrites the same regex queries against `.ai-audit/` files; no shared vocabulary for what an "agent compromise" looks like.
Evidence is fragile — audit files mutate during ongoing sessions; nothing freezes the state at incident time with chain of custody.
Reconstruction is manual — multi-agent delegation chains and MCP tool calls have to be stitched together by hand.
Reporting is bespoke — every incident produces a narrative document, not a structured artifact a regulator, insurer, or downstream SIEM can consume.

What is TRACE?

TRACE (Threat Response & Agentic Compromise Examination) is the Incident Response and forensics extension to the VIBES standard. It defines:

IoC vocabulary — standardized indicators of agent compromise, queryable against VIBES audit data
Evidence bundle format — tamper-evident packages with cryptographic chain of custody
Reconstruction primitives — timeline, delegation graph, blast-radius queries over multi-agent sessions
TRACE-IR report schema — STIX-compatible JSON for downstream SIEM, ticketing, and disclosure
Containment playbooks — standardized remediation actions keyed to IoC class
Maturity tiers — reactive, proactive, and autonomous response postures

Think of it as a forensics kit for autonomous agents. VIBES is the surveillance camera always recording. VERIFY is the tamper-evident seal on the tape. PRISM is the alarm threshold. TRACE is what the investigator does when the alarm fires — freeze the scene, dust for prints, reconstruct the timeline, write the report that holds up in court.

TRACE is standalone and provider-agnostic. It does not require cooperation from any AI provider, although tool-provider cosigning strengthens the chain of custody. It targets agentic systems specifically — Claude Code, Codex, Gemini CLI, Cursor agents, MCP servers, and autonomous loops — but the data model is generic enough for any tool emitting VIBES annotations.

The Investigation Workflow

TRACE defines a seven-step lifecycle from continuous capture through portable report. Each step is automatable; tier-1 (reactive) deployments run on demand, tier-3 (autonomous) deployments run continuously.

1

Capture

VIBES records annotations, decisions, delegation traces, and tool calls into .ai-audit/ as agents work.

→

2

Detect

vibetrace scan evaluates the IoC vocabulary against the audit log, flagging matches with severity and false-positive guidance.

→

3

Declare

An IoC match (or a human responder) opens an incident with a unique incident_id and freezes the affected sessions.

→

4

Preserve

The audit snapshot is bundled, hashed, and sealed in a VERIFY DSSE envelope — chain of custody from this moment forward.

↓

5

Reconstruct

Replay the session timeline, walk the delegation graph, and compute blast radius (files touched, creds reachable, external calls).

←

6

Contain

Quarantine annotations, generate revert ranges, rotate exposed credentials. PRISM auto-gates quarantined data from future merges.

←

7

Report

Emit a TRACE-IR JSON document (STIX 2.1 compatible) for SIEM ingestion, regulator disclosure, or feedback into EVOLVE.

Steps 1–2 run continuously. Steps 3–7 fire when an incident is declared — either by an automated IoC match crossing a threshold, by a PRISM Critical-band event, or by a human responder using vibetrace incident open.

Data Flow

TRACE sits between continuous agent audit and the existing security ecosystem. It consumes VIBES data, produces sealed evidence and structured reports, and feeds tools your team already operates.

Agentic Systems

Claude Code, Codex, Gemini, Cursor, MCP servers

→

VIBES audit

.ai-audit/ annotations, decisions, edges, delegation

→

TRACE engine

IoC scan, incident declare, evidence freeze, replay

→

TRACE-IR JSON

STIX 2.1 compatible report + sealed evidence bundle

→

Downstream

SIEM, ticketing, regulators, insurers, EVOLVE feedback

The format is intentionally simple: structured JSON aligned with STIX 2.1 object types so existing security tooling can ingest TRACE incidents without bespoke parsers. No private wire protocol, no provider-specific endpoints, no required cloud service.

IoC Vocabulary

The TRACE specification publishes a versioned catalog of indicators. Each IoC has a stable ID (TRACE-IOC-{class}-{nnn}), a detection rule expressed as a query over VIBES JSONL, severity guidance, false-positive notes, and an optional MITRE ATLAS mapping. The catalog is open; community contributions extend it.

PI — Prompt Injection

Role override in tool result

System-prompt-shaped strings appearing in tool_result or fetched URL bodies, then referenced by subsequent reasoning. Detects indirect injection from MCP servers, web pages, and file contents.

TA — Tool Abuse

Out-of-scope tool invocation

Bash commands, file writes outside the working directory, or network egress that doesn't match the declared task. Cross-references prompt_text intent against actual command records.

DA — Delegation Anomaly

Orphan or over-deep sub-agents

Sub-agent sessions spawned without a matching parent prompt, delegation depth exceeding the policy ceiling, or sub-agents reading resources their parent never had context for.

RD — Reasoning Drift

Unaligned chain-of-thought

High-assurance reasoning traces referencing entities never in context, sudden topic shifts mid-session, or scratchpad content inconsistent with the declared task. Requires Medium+ assurance.

CS — Credential Staging

Sensitive-path access pattern

Reads against .aws/credentials, .ssh/, .env, ~/.config/gh/, browser cookie stores, followed by network egress or external write within the same session.

MC — MCP Compromise

Schema or signature mismatch

MCP responses violating the server's advertised schema, signatures that don't validate against published keys, or replay-attack timing on cached responses. Targets MCP-server supply-chain attacks.

DE — Data Exfiltration

Read-then-egress chain

Source-file reads followed by encoded payloads in outbound HTTP bodies, paste-service POSTs, or DNS exfil patterns. Looks for the read-stage-egress sequence rather than any single call.

OP — Off-Pattern Behavior

Temporal or volumetric anomaly

Action volume, model swap, or geographic-IP shift mid-session that doesn't match the user's historical baseline. Requires baselining over prior sessions.

An IoC match by itself is not a confirmed incident — it is a triage signal. The vibetrace scan CLI annotates each match with confidence, evidence references, and recommended next action.

Evidence Bundle & Chain of Custody

When an incident is declared, TRACE produces a content-addressed evidence package at .ai-audit/incidents/<incident-id>/. The package is sealed in a VERIFY DSSE envelope with a TRACE-specific predicateType, providing forensic-grade chain of custody from the moment of declaration forward.

// .ai-audit/incidents/TRACE-2026-0001/bundle.json (sealed via VERIFY DSSE) { "_type": "https://in-toto.io/Statement/v1", "predicateType": "https://itsavibe.ai/trace/evidence/v1", "subject": [ {"name": "audit-snapshot.tar.gz", "digest": {"sha256": "a1b2c3..."}}, {"name": "tool-traces.jsonl", "digest": {"sha256": "d4e5f6..."}}, {"name": "mcp-logs.jsonl", "digest": {"sha256": "f7a8b9..."}}, {"name": "fs-diff.patch", "digest": {"sha256": "b2c3d4..."}} ], "predicate": { "incident_id": "TRACE-2026-0001", "declared_at": "2026-05-07T14:32:00Z", "declared_by": "keyid:a1b2c3d4e5f6a7b8", "trigger": {"type": "ioc_match", "ioc_id": "TRACE-IOC-PI-003"}, "affected_sessions": ["550e8400-e29b-41d4..."], "custody_chain": [ {"actor": "vibetrace 0.1.0", "action": "freeze", "timestamp": "..."}, {"actor": "keyid:a1b2...", "action": "sign", "timestamp": "..."} ] } }

The bundle includes:

audit-snapshot.tar.gz — frozen copy of .ai-audit/ at incident time
tool-traces.jsonl — bash history, file system diff, network flows correlated to session IDs
mcp-logs.jsonl — MCP request/response pairs with timing
fs-diff.patch — unified diff of files modified during the affected session window
provider-correlation.json — case-id tokens for requesting tool-provider logs out-of-band optional

Tool-provider cosigning on the evidence bundle is the strongest form of TRACE chain of custody — the provider attests "this audit data was generated by my system at the claimed time," eliminating the fabrication and post-hoc-editing threats from the IR analysis. Cosigning is optional; bundles signed by responder alone remain valid.

Reconstruction & Replay

Once an evidence bundle is sealed, TRACE provides query and visualization primitives for understanding what actually happened. These run locally against the sealed bundle — no network calls required.

Session timeline

Chronological reconstruction of every prompt, tool call, file edit, sub-agent spawn, and decision record across one or more affected sessions. Output as text, JSON, or interactive HTML. Anchored on VIBES session and edge records.

Delegation graph

Renders the parent → child agent DAG with credential and context flow annotations on the edges. Surfaces which sub-agent inherited which secrets from which parent prompt, and where context boundaries were crossed.

Blast-radius scoping

Lists every file touched, every credential read, every external endpoint contacted, and every sub-agent spawned during the affected window. Produces the input set for containment playbooks (revert range, rotation list, notification chain).

Counterfactual replay

Given the same prompts and context, would a clean instance of the same model produce the same output? Distinguishes user-driven misuse from model-specific compromise. Optional; requires local model access.

Cross-session correlation

Across a fleet, find every session that touched a specific compromised file, used a specific MCP server version, ran a specific model build, or matched a given IoC. Critical for scoping fleet-wide impact when an upstream provider discloses a model issue.

TRACE-IR Report Schema

The portable output of an investigation is a TRACE-IR JSON document — a structured incident report aligned with STIX 2.1 object types so it ingests cleanly into existing security tooling. No bespoke parser required; if your stack speaks STIX, it speaks TRACE-IR.

{ "trace_ir_version": "0.1", "incident_id": "TRACE-2026-0001", "classification": "prompt_injection", "discovered_at": "2026-05-07T14:32:00Z", "reported_at": "2026-05-07T16:08:00Z", "agent_systems": [ {"tool_name": "Claude Code", "tool_version": "1.5.2", "model": "claude-opus-4-5"}, {"tool_name": "MCP filesystem", "tool_version": "0.4.1"} ], "indicators": [ { "ioc_id": "TRACE-IOC-PI-003", "name": "Role override in tool result", "confidence": 0.92, "evidence_ref": "sha256:a1b2c3...", "mitre_atlas": "AML.T0051.000" } ], "blast_radius": { "sessions_affected": ["550e8400-..."], "files_touched": ["src/auth.py", "src/config.py"], "credentials_exposed": ["AWS_ACCESS_KEY_ID"], "external_calls": ["hxxps://paste.example/abcd"], "subagents_spawned": 3 }, "evidence_bundle": { "hash": "sha256:e7a3f1b2...", "verify_envelope": "https://itsavibe.ai/api/attestation/.../envelope" }, "severity": { "source": "prism", "score": 0.84, "band": "critical" }, "remediation": [ {"action": "rotate_credential", "target": "AWS_ACCESS_KEY_ID", "status": "completed"}, {"action": "quarantine_session", "target": "550e8400-...", "status": "completed"}, {"action": "revert_commits", "target": "abc123..def456", "status": "pending"} ], "disclosure": {"internal": true, "regulator": false, "public": false}, "stix_bundle_ref": ".ai-audit/incidents/TRACE-2026-0001/stix.json" }

STIX 2.1 Compatibility

TRACE-IR documents emit a parallel STIX 2.1 bundle (stix_bundle_ref) populated with standard STIX objects: incident, indicator, observed-data, tool, identity, note. Agentic-system specifics that don't fit standard STIX vocabulary are emitted as STIX x-trace-* custom objects with documented schemas.

Note on redaction: TRACE-IR documents may contain prompt text, source-code excerpts, and credential identifiers. The TRACE specification deliberately defines no built-in redaction model. Redaction and PII identification are left to dedicated tooling that operates over the open TRACE-IR format, allowing organizations to apply their own classification, retention, and disclosure policies.

Maturity Tiers

TRACE deployments mature along three tiers, paralleling the assurance ladder in VIBES. Start at Reactive; advance to Proactive when you want continuous surveillance; reach Autonomous when policy is clear enough for automated containment.

Reactive

On-demand investigation

IoC scans run on demand by responders
Manual incident declaration
Evidence bundle generation per investigation
TRACE-IR reports authored after-the-fact
Suitable for: small teams, low-volume agent use

Proactive

Continuous detection

IoC detection runs continuously against live sessions
Automatic evidence freeze on PRISM Critical-band events
Dashboards and alerting integrated with existing SIEM
Cross-session correlation across the fleet
Suitable for: production agentic systems, security-sensitive teams

Autonomous

Automated containment

Policy-driven auto-containment (token rotation, session kill, merge block)
Automatic disclosure routing (internal → regulator) on classification match
Closed-loop feedback to EVOLVE for IoC tuning
Counterfactual replay on every Critical-band event
Suitable for: regulated industries, autonomous-agent fleets at scale

Integration with the VIBES Ecosystem

TRACE is designed to compose with the other extensions, but does not require any of them beyond VIBES itself.

Standard	How TRACE uses it
VIBES	Required substrate. Annotations, decision records, edge records, and delegation traces are the input to IoC detection and timeline reconstruction.
VERIFY	Optional but recommended. Evidence bundles use DSSE envelopes with the `trace/evidence/v1` predicate type. Tool-provider cosigning eliminates the fabrication and post-hoc-editing threats from IR analysis.
PRISM	Optional severity provider. PRISM ≥ 0.8 events can auto-declare a TRACE incident. Quarantined annotations from closed incidents are excluded from PRISM aggregates so the score reflects only trusted activity. Organizations may substitute any external scoring system — severity is pluggable.
EVOLVE	Optional feedback channel. Closed-incident TRACE-IR reports feed back as training signal — new IoCs become detection rules, remediation patterns update agent decision policies, and recurring incident classes become governance constraints.

Standalone by design. TRACE does not require provider cooperation, a centralized registry, or any of the other extensions. A team running plain VIBES Low-assurance can still produce TRACE-IR reports — they just have less data to investigate with. As tool providers adopt VIBES cosigning and teams layer on PRISM, the forensic fidelity of TRACE rises with them.

Tooling: `vibetrace` CLI

The reference implementation ships as a sibling to vibecheck. Like vibecheck, vibetrace runs locally against the project's .ai-audit/ directory; no network calls are required for detection, evidence sealing, or report generation.

# Continuous detection vibetrace scan # run all IoCs against .ai-audit/ vibetrace scan --ioc TRACE-IOC-PI-003 # single IoC vibetrace scan --since 24h --format json # pipe to SIEM # Incident lifecycle vibetrace incident open --classification prompt_injection \ --trigger ioc:TRACE-IOC-PI-003 vibetrace incident freeze <incident-id> # seal evidence bundle via VERIFY vibetrace incident close <incident-id> # finalize TRACE-IR report # Reconstruction vibetrace replay <session-id> # interactive timeline vibetrace delegation-graph <session-id> # SVG/DOT output vibetrace blast-radius <incident-id> # scope affected resources # Containment vibetrace contain <incident-id> --playbook revoke-and-rotate vibetrace quarantine <session-id> # PRISM auto-gates downstream # Reporting vibetrace report <incident-id> --format trace-ir # JSON vibetrace report <incident-id> --format stix # STIX 2.1 bundle vibetrace report <incident-id> --format pdf # human-readable

Every command is scriptable for CI/CD and can be embedded in existing IR runbooks. The TRACE specification defines the file formats, the IoC catalog, the playbook contract, and the report schema; alternative implementations are welcome as long as they conform.

Design Decisions

Four explicit choices shape the TRACE specification:

Output is open JSON, STIX-compatible

TRACE-IR documents are plain JSON aligned with STIX 2.1 object types so existing SIEMs, ticketing systems, and disclosure pipelines ingest them with no custom parsers. No proprietary wire format.

Provider-agnostic and standalone

TRACE does not require cooperation from any AI provider. Adoption is welcomed but optional — providers that want to participate can add cosigning and standardized log-disclosure tokens, but TRACE is fully usable without them.

No built-in redaction model

TRACE deliberately defines no PII or secret-redaction layer. Redaction is a domain that warrants dedicated tooling operating over the open TRACE-IR format, so organizations can apply their own classification, retention, and disclosure policies.

Pluggable severity scoring

Severity is sourced from PRISM by default, but the severity.source field is pluggable. Organizations with existing CVSS-style scoring, custom risk engines, or third-party severity services can substitute them without changing the report schema.

Version History

Like the other VIBES-family standards, TRACE follows a draft → review → ratification process. Draft versions are working documents subject to change.

Version	Date	Status	Notes
0.1-draft	2026-05-07	Draft	Initial TRACE extension draft. IoC vocabulary, evidence bundle format, TRACE-IR JSON schema (STIX-compatible), maturity tiers, vibetrace CLI surface.
1.0	TBD	Pending	Target: stable release after public review and reference vibetrace implementation.

Related Standards

TRACE is an extension to VIBES — the Incident Response and forensics layer of the VIBES family. It builds on VIBES audit data and composes with the other extensions (VERIFY, PRISM, EVOLVE) but is independently adoptable.

VIBES — the audit substrate. TRACE consumes annotations, decisions, and delegation traces.

VERIFY — cryptographic sealing. TRACE evidence bundles are DSSE envelopes; cosigning hardens chain of custody.

PRISM — risk scoring. PRISM Critical-band events can auto-declare TRACE incidents; pluggable with external scoring.

EVOLVE — agent learning. Closed TRACE incidents feed back as IoC tuning and governance signal.

VIBES Standard VERIFY Standard PRISM Scoring EVOLVE Standard Implementors Guide

Three Ways to Make an Impact

TRACE is a draft — the strongest version emerges from a community of responders, tool builders, and security teams who run agents in production.

Join the Community

Contribute to the IoC catalog, propose new agent-specific indicators, refine the TRACE-IR schema, or share IR runbooks from your own incident postmortems.

Get involved →

Build an Implementation

Build a vibetrace alternative, integrate TRACE-IR ingestion into your SIEM, write IoC detection rules for your stack, or add a redaction tool that operates over the open format.

Implementation guide →

Champion the Standard

Make the case for structured IR data on agent activity in your organization. Run a tabletop exercise on a simulated agent compromise. Show that "we had no visibility" is no longer the right answer.

Resources →