Glass Box Ledger

Every AI decision, cryptographically chained. Every receipt, externally verifiable. Every block, bit-exactly replayable.

Standard chat logs are pilot notes — mutable, subjective, interpretable. The Glass Box Ledger is the flight data recorder — objective, hardened, and binding. When a governance decision is challenged, you don't stitch together logs from five systems. You look up a block. You verify its signature against a published public key. You replay its input through the same Guardian build and get the same verdict, bit-for-bit. That's evidence.

The Merkle Chain

Every block captures a canonical tuple — (timestamp, input_hash, policy_hash, adapter_hash, outcome, corrections, actor_id) — and anchors to the previous block:

H_n = sha256(H_{n-1} || state_n)

Alter a single byte of any past block and the current block's hash fails validation. The chain enforces integrity without any central coordinator — each block is a cryptographic commitment to the entire history preceding it.

The Receipt Envelope

Canonical Receipt Envelope

Every surface writes the same envelope structure. A chat verdict, a tool call, a CLI command, a training promotion, a policy finalization — they all produce receipts that share this shape. Three consequences:

One schema across the platform. SIEM queries written against the verdict field hit every surface. No per-surface rewrites.
Determinism fields are mandatory. model_digest, adapter_hash, seed, policy_hash — every receipt carries enough to reproduce.
Signature covers the whole envelope. Change any field, the signature fails.

The envelope is what makes Trinitite receipts legally binding artifacts: not "our internal log shows," but "the verifiable signed commitment from this public key, at this trusted timestamp, says."

External Trust Anchors

Public Verifier Topology — Third-Party Trust Anchors

Trinitite signatures are not the only witness. Every receipt's root hash is anchored to three independent external systems:

Public JWKS — the public half of the signing key is published at /.well-known/jwks.json. Anybody can fetch it and verify signatures offline.
RFC 3161 TSA — a trusted timestamp authority (DigiCert, GlobalSign) witnesses that the hash existed at the claimed time. Even Trinitite cannot backdate an artifact.
Sigstore Rekor — a transparency log that makes every anchored hash publicly auditable. Insertion is tamper-evident.

The practical upshot: a regulator, a Big-4 partner, or a disputing counterparty can verify a Trinitite receipt without ever calling a Trinitite endpoint. The signature, the timestamp, and the inclusion proof are all witnessable from independent third parties. This collapses the adverse-inference attack surface — "well, they could have changed the log" — to zero.

Bit-Exact Replay

Rewind · Adjust · Prove — Replay Over the Ledger

Every block is replayable. Given a block ID:

Look up the receipt — model_digest, adapter_hash, seed, policy_hash.
Load an identical inference stack — same base model, same adapter, batch-invariant kernel with the same fixed tile size.
Re-run the input. Verify the input_digest matches; verify the output_digest matches.

The verdict is bit-for-bit identical to the original. This is what the batch-invariant kernel buys you: determinism that survives across batch sizes, GPU utilization, and months of production operation.

Counterfactual replay is just as easy. Pick a block, change one input field, re-run. Observe how the verdict shifts. Prove a patch works by re-running a previously-blocked input through the new Guardian and confirming the new verdict. This is the flight simulator for AI decisions — not a theoretical promise, a shipped capability.

Append-Only Backends

The ledger is pluggable. Every deployment picks the backend that matches its regulatory posture:

Tier	Backend	Properties
Standard	S3 Object Lock / WORM	Commercial durability, adverse-inference defense
Managed	Cloud KMS / HSM-anchored Postgres	Regulatory separation of duties
Sovereign	Hardware TEE (Nvidia Confidential Computing)	Nation-state non-repudiation
Edge	SQLite	Air-gapped / on-premise deployments
SIEM mirror	Splunk / Datadog / CloudWatch pipeline	Live forwarding alongside durable write

Swap the backend via env var. Receipts are portable; verification works identically against each.

What Gets Anchored

Every governance-relevant surface writes:

Surface	Receipt kind	Content
LLM Proxy	`chat_verdict`	Input digest, output digest, provider, governance outcome
MCP Gateway	`tool_verdict`	Tool, arguments (hashed), result (hashed), verdict
CLI Firewall	`cli_exec`	Command, args, exit code, stdout/stderr hashes, redaction flag
Skill Vault	`skill_sign` / `skill_revoke`	Content hash, scan record, signature envelope
Training	`adapter_promotion`	Adapter hash, parent hash, Lipschitz bound, test pass rate
Testing	`simulation_report`	Suite, adapter, per-scenario verdicts, aggregate pass rate
Policy	`policy_finalize`	Policy hash, approver, graph snapshot
Attestation	`snapshot` / `export`	Point-in-time evidence bundle, OSCAL export

Every entry carries the same envelope. Every entry is verifiable by the same public key. Every entry is anchored to the same external trust chain.

Public Verification Path

GET /v1/public/verify/:receipt_id
(no auth required)

→ 200 OK
{
  "receipt":             { … },
  "signature_envelope":  "base64:…",
  "jwks_url":            "https://trinitite…/.well-known/jwks.json",
  "tsa_token":           "base64:…",
  "rekor_entry":         "https://rekor.sigstore.dev/api/v1/log/entries/…",
  "verification_bundle": { … }
}

An auditor's workflow: fetch the receipt, fetch the JWKS, fetch the Rekor entry, compute the hash locally, verify the signature locally, verify the Rekor inclusion proof locally, compare timestamps. No Trinitite credentials. No "take our word for it." Just math.

What You Get

Capability	Standard log	Glass Box Ledger
Tamper evidence	None	Merkle chain breaks on byte change
Replay	Best-effort reconstruction	Bit-exact via batch-invariant kernel
Signature	Not signed	Signed by HSM-backed org key
External witnesses	None	JWKS + RFC 3161 + Sigstore Rekor
Backdating defense	Trust us	External TSA cryptographically blocks
Third-party verification	Access our portal	Verify from any browser / script
Uniform schema	Per-service	One envelope across every surface

Replay Verdict Taxonomy

A replay is not a binary "matched / didn't" — the platform classifies every replayed event into one of four explicit verdicts:

BIT_EXACT

Replay produces a byte-identical output to the original. Same Guardian, same policy hash, same tile size, same seed.

USE Default for any replay run on the same node version with the original adapter still loaded.

SEMANTIC_ONLY

Replay produces a semantically equivalent output (same outcome, same JSON Patch class) but bytes differ — typically because a downstream tokenizer or model build changed.

USE Surfaces when re-running an old block on a newer build; the verdict still validates.

DIVERGENT

Replay produces a different verdict than the original. Either the active policy changed or the adapter shifted in a way that breaks the prior block.

USE Drives a forensic regression alert. Do not silently accept.

ORIGINAL_MISSING

The original adapter or upstream artifact is no longer available, so a faithful replay is impossible. The Merkle receipt is still verifiable.

USE Common after long retention windows. Mark explicitly rather than silently downgrading to semantic_only.

This is the difference between a deterministic system that pretends to be reproducible and a deterministic system that owns its honesty. When a replay is divergent, that's a forensic alert — not a quietly downgraded result.

Next Steps

→ Compliance Architecture — how ledger anchors become framework-specific attestations.

→ Testing & Simulation — the replay-over-the-ledger machinery in practice.

→ Architecture — where batch-invariant determinism is introduced.

→ Observability — the streams that surface every receipt at the operational level.

The Merkle Chain​

The Receipt Envelope​

External Trust Anchors​

Bit-Exact Replay​

Append-Only Backends​

What Gets Anchored​

Public Verification Path​

What You Get​

Replay Verdict Taxonomy​

Next Steps​