Glass Box Ledger
Every AI decision, cryptographically chained. Every receipt, externally verifiable. Every block, bit-exactly replayable.
Standard chat logs are pilot notes — mutable, subjective, interpretable. The Glass Box Ledger is the flight data recorder — objective, hardened, and binding. When a governance decision is challenged, you don't stitch together logs from five systems. You look up a block. You verify its signature against a published public key. You replay its input through the same Guardian build and get the same verdict, bit-for-bit. That's evidence.
The Merkle Chain
Every block captures a canonical tuple — (timestamp, input_hash, policy_hash, adapter_hash, outcome, corrections, actor_id) — and anchors to the previous block:
H_n = sha256(H_{n-1} || state_n)
Alter a single byte of any past block and the current block's hash fails validation. The chain enforces integrity without any central coordinator — each block is a cryptographic commitment to the entire history preceding it.
The Receipt Envelope
Canonical Receipt Envelope
Every surface writes the same envelope structure. A chat verdict, a tool call, a CLI command, a training promotion, a policy finalization — they all produce receipts that share this shape. Three consequences:
- One schema across the platform. SIEM queries written against the
verdictfield hit every surface. No per-surface rewrites. - Determinism fields are mandatory.
model_digest,adapter_hash,seed,policy_hash— every receipt carries enough to reproduce. - Signature covers the whole envelope. Change any field, the signature fails.
The envelope is what makes Trinitite receipts legally binding artifacts: not "our internal log shows," but "the verifiable signed commitment from this public key, at this trusted timestamp, says."
External Trust Anchors
Public Verifier Topology — Third-Party Trust Anchors
Trinitite signatures are not the only witness. Every receipt's root hash is anchored to three independent external systems:
- Public JWKS — the public half of the signing key is published at
/.well-known/jwks.json. Anybody can fetch it and verify signatures offline. - RFC 3161 TSA — a trusted timestamp authority (DigiCert, GlobalSign) witnesses that the hash existed at the claimed time. Even Trinitite cannot backdate an artifact.
- Sigstore Rekor — a transparency log that makes every anchored hash publicly auditable. Insertion is tamper-evident.
The practical upshot: a regulator, a Big-4 partner, or a disputing counterparty can verify a Trinitite receipt without ever calling a Trinitite endpoint. The signature, the timestamp, and the inclusion proof are all witnessable from independent third parties. This collapses the adverse-inference attack surface — "well, they could have changed the log" — to zero.
Bit-Exact Replay
Rewind · Adjust · Prove — Replay Over the Ledger
Every block is replayable. Given a block ID:
- Look up the receipt —
model_digest,adapter_hash,seed,policy_hash. - Load an identical inference stack — same base model, same adapter, batch-invariant kernel with the same fixed tile size.
- Re-run the input. Verify the
input_digestmatches; verify theoutput_digestmatches.
The verdict is bit-for-bit identical to the original. This is what the batch-invariant kernel buys you: determinism that survives across batch sizes, GPU utilization, and months of production operation.
Counterfactual replay is just as easy. Pick a block, change one input field, re-run. Observe how the verdict shifts. Prove a patch works by re-running a previously-blocked input through the new Guardian and confirming the new verdict. This is the flight simulator for AI decisions — not a theoretical promise, a shipped capability.
Append-Only Backends
The ledger is pluggable. Every deployment picks the backend that matches its regulatory posture:
| Tier | Backend | Properties |
|---|---|---|
| Standard | S3 Object Lock / WORM | Commercial durability, adverse-inference defense |
| Managed | Cloud KMS / HSM-anchored Postgres | Regulatory separation of duties |
| Sovereign | Hardware TEE (Nvidia Confidential Computing) | Nation-state non-repudiation |
| Edge | SQLite | Air-gapped / on-premise deployments |
| SIEM mirror | Splunk / Datadog / CloudWatch pipeline | Live forwarding alongside durable write |
Swap the backend via env var. Receipts are portable; verification works identically against each.
What Gets Anchored
Every governance-relevant surface writes:
| Surface | Receipt kind | Content |
|---|---|---|
| LLM Proxy | chat_verdict | Input digest, output digest, provider, governance outcome |
| MCP Gateway | tool_verdict | Tool, arguments (hashed), result (hashed), verdict |
| CLI Firewall | cli_exec | Command, args, exit code, stdout/stderr hashes, redaction flag |
| Skill Vault | skill_sign / skill_revoke | Content hash, scan record, signature envelope |
| Training | adapter_promotion | Adapter hash, parent hash, Lipschitz bound, test pass rate |
| Testing | simulation_report | Suite, adapter, per-scenario verdicts, aggregate pass rate |
| Policy | policy_finalize | Policy hash, approver, graph snapshot |
| Attestation | snapshot / export | Point-in-time evidence bundle, OSCAL export |
Every entry carries the same envelope. Every entry is verifiable by the same public key. Every entry is anchored to the same external trust chain.
Public Verification Path
GET /v1/public/verify/:receipt_id
(no auth required)
→ 200 OK
{
"receipt": { … },
"signature_envelope": "base64:…",
"jwks_url": "https://trinitite…/.well-known/jwks.json",
"tsa_token": "base64:…",
"rekor_entry": "https://rekor.sigstore.dev/api/v1/log/entries/…",
"verification_bundle": { … }
}
An auditor's workflow: fetch the receipt, fetch the JWKS, fetch the Rekor entry, compute the hash locally, verify the signature locally, verify the Rekor inclusion proof locally, compare timestamps. No Trinitite credentials. No "take our word for it." Just math.
What You Get
| Capability | Standard log | Glass Box Ledger |
|---|---|---|
| Tamper evidence | None | Merkle chain breaks on byte change |
| Replay | Best-effort reconstruction | Bit-exact via batch-invariant kernel |
| Signature | Not signed | Signed by HSM-backed org key |
| External witnesses | None | JWKS + RFC 3161 + Sigstore Rekor |
| Backdating defense | Trust us | External TSA cryptographically blocks |
| Third-party verification | Access our portal | Verify from any browser / script |
| Uniform schema | Per-service | One envelope across every surface |
Replay Verdict Taxonomy
A replay is not a binary "matched / didn't" — the platform classifies every replayed event into one of four explicit verdicts:
This is the difference between a deterministic system that pretends to be reproducible and a deterministic system that owns its honesty. When a replay is divergent, that's a forensic alert — not a quietly downgraded result.
Next Steps
→ Compliance Architecture — how ledger anchors become framework-specific attestations.
→ Testing & Simulation — the replay-over-the-ledger machinery in practice.
→ Architecture — where batch-invariant determinism is introduced.
→ Observability — the streams that surface every receipt at the operational level.