Follow a single inference request through the gate. Each movement adds microseconds and removes risk. The same seven movements hold whether the request is served on sovereign capacity, routed in-region behind a de-identification gate, or refused outright.
The application calls heimdall.run() over mTLS with an idempotency key. No path to any model exists except through the gate. A malformed request is rejected before a cent is spent.
The task class is checked against the schema registry; an owned PHI detector confirms the data class. Declared de-identified but detected identifiable? Blocked at the gate, no upstream call, no payload kept.
The tenant's signed policy resolves residency, model allowlist, budget and escalation. Identifiable PHI ⇒ sovereign-only. If the engine cannot be evaluated, the request fails closed.
A depth ladder, owned distil → owned specialist → frontier via gate → human, resolves to the cheapest rung that satisfies the policy. Compliance outweighs cost; failover stays within tier, never across.
The envelope sets context caps, output schema, retry ceiling, agentic depth and a hard per-request budget. Breach the cap and the request is queued, not silently overspent.
Owned capacity on Bifrost serves in-tenancy. A non-sovereign route crosses the de-identification trust boundary first. The hard tail escalates to a consultant, by policy, not by failure.
The result returns through the gateway, re-identified there if it was de-identified, and one immutable, hash-chained record lands in the ledger. The per-study price accrues as a single, known line item.
Residency answers where. The de-identification gate answers what, stripping identity from any payload bound for a non-sovereign model, and restoring it only inside the border, only at the gateway, only on the way back.
| Path | Request | Decision | Supply | Outcome |
|---|---|---|---|---|
| A · Sovereign draft | CT chest + 3 priors, identifiable | PHI ⇒ sovereign-UK only | 3v-rad-32b on Bifrost, LHR | Structured draft in 1.74s; radiologist edits and signs. |
| B · Frontier synthesis | 9 studies / 4 yrs, oncology | Resident OK after de-ID gate | Frontier model, UK region | Re-identified at the gateway; flagged pending consultant. |
| C · Fail-closed | Confidence 0.71 < 0.80 threshold | Escalate per playbook | Human consultant queue | Reviewed, amended, signed, joins the eval bank. |
The amended case sharpens future routing. The gate does not just enforce policy: it learns the shape of the hard cases, and feeds them back into the evals that gate every model version change.
See the full reference architecture →