Verdict Playground

Paste an AI output. Pick a Guardian. Watch the verdict change in real time.

This is the same three-outcome contract every Trinitite-governed call follows: passed returns the output unchanged, corrected returns an RFC 6902 JSON Patch with the fix applied, and blocked returns a 403-style verdict with a forensic record. The playground runs entirely in your browser against a mock rubric — no API key required.

Verdict Playgroundguardian = PII-Redactor

AI Output (assistant message)

Guardian Verdict

CORRECTED

Output projected to nearest Pre-Validated Safe Centroid via JSON Patch

1 JSON Patch operation

replace/0/content= "Your account is in good standing, John Doe. We have your SSN [SSN-REDACTED] on file at 555 Main St."

ledger_id : lg_SZJ9GMUG

latency_ms: 140

policy_hash: 0x80c74a5c864f5

determinism_mode: fixed_kv_tile=256

Demo runs entirely in your browser against a mock rubric. Real Guardians are trained on your policies and ship as LoRA adapters — see Guardian Training.

What you're looking at

Verdict tag — the same three values your client code branches on:

PASSED (green) — original output is compliant.
CORRECTED (teal) — the Guardian patches the offending span and returns the patched result.
BLOCKED (red) — the Guardian refuses to issue a safe correction. Your client surfaces a 403.

Reason — short geometric description of why the Guardian decided what it did. Real Guardians return a structured correction_diff payload — see Observability.

JSON Patches — RFC 6902 operations, color-coded by op. Apply them with any RFC-6902 library on your end.

Receipt fields — every decision in production is hashed, signed, and Merkle-chained. The mock receipt shows the four fields you'd see on the wire: ledger_id, latency_ms, policy_hash, and the active determinism mode.

What the playground is not

The playground uses a toy rubric with hard-coded regex for SSN, card numbers, and a few destructive SQL patterns. It is intentionally not the real Guardian — the real Guardian is a fine-tuned LoRA model trained against your specific policies, evaluated against the geometric Policy Manifold (see Architecture). The point of the playground is to make the contract tangible — outcome, patches, receipt — not the implementation.

For a live test against a real Guardian, request sandbox access and follow the 5-min Quickstart.

What's next

→ Quickstart — wire a real Guardian into your code.

→ Cookbook — copy-paste recipes for common stacks.

→ Threat Library — named adversarial patterns + downloadable test fixtures.

What you're looking at​

What the playground is not​

What's next​

What you're looking at

What the playground is not

What's next