Architecture review dossier

Regulated AI that can be defended before engineering, security, risk, and operations.

This is the portfolio's system-level position: explicit decision authority, enforceable boundaries, measurable nonfunctional requirements, inspectable evidence, and operating ownership beyond launch.

6Published architecture decisions with explicit trade-offs

6Measurable nonfunctional requirement categories

8End-to-end control and evidence stages

12Public repositories supporting the reference system

Flagship Reference Architecture

A provider-portable architecture for healthcare RAG and agent workflows. Identity, PHI handling, policy, approval, evaluation, and evidence are platform contracts rather than optional prompt behavior.

Experience

Identity + channelsSSO, member portal, clinical workflow

Human approvalMedical, regulatory, operations

Control plane

Agent orchestrationState, cancellation, retry budgets

Policy engineDeny-by-default tool decisions

Evaluation gatesRegression and release criteria

Intelligence + integration

Healthcare RAGScoped retrieval and citations

FHIR servicesTyped exchange boundaries

Model adaptersProvider-portable inference

Evidence + operations

Signed receiptsCanonical execution evidence

MLOpsValidation, registry, deployment

ObservabilityTraces, cost, incidents, SLOs

Identity boundary

PHI boundary

Tool boundary

Evidence boundary

Scope statement: this is a public reference architecture, not client code. It composes production-derived patterns into an inspectable system and uses synthetic data for verification.

Download sanitized solution architecture brief

Architecture Decision Records

These decisions make the operating philosophy concrete and expose the costs accepted in exchange for control, portability, and evidence.

ADR-01

Keep policy enforcement outside the prompt

Choice: Deny-by-default tool-boundary policy engine

Trade-off: Adds orchestration latency and policy maintenance, but makes authorization deterministic, testable, and auditable.

ADR-02

Separate retrieval evidence from generated language

Choice: Citations, source metadata, and confidence travel with each answer

Trade-off: Increases payload size and UI complexity, but supports review, dispute, and post-incident reconstruction.

ADR-03

Use event-driven processing for long-running agent work

Choice: Durable queue with idempotency, cancellation, retry budgets, and dead-letter recovery

Trade-off: More operational components than synchronous APIs, but safer recovery and backpressure under variable model latency.

ADR-04

Make human approval a first-class state

Choice: Explicit escalation records rather than informal chat handoffs

Trade-off: Slower happy paths, but clear accountability for high-risk clinical, regulatory, and content decisions.

ADR-05

Keep the platform model-provider portable

Choice: Provider adapters behind evaluation and policy contracts

Trade-off: Constrains provider-specific features, but reduces lock-in and makes regulated change control practical.

ADR-06

Sign execution evidence

Choice: Canonical receipts with hash chaining and verifiable signatures

Trade-off: Requires key management and retention controls, but prevents silent mutation of agent evidence.

Nonfunctional Requirements

Architecture is accepted against measurable operational constraints, not diagrams alone.

Quality	Target	Architecture response	Verification
Availability	99.9% platform; 99.99% critical event path	Multi-zone services, queue durability, graceful degradation	Synthetic probes and recovery tests
Latency	<2s p95 retrieval; streaming first token <1.5s	Caching, bounded context, asynchronous tools	Load tests and trace percentiles
Privacy	No PHI in unapproved model or log paths	Classification, redaction, scoped retrieval, private endpoints	Policy tests and log sampling
Recovery	RTO 60 min; RPO 15 min for evidence stores	Cross-zone replicas, immutable backups, replayable events	Quarterly restore exercise
Auditability	Reconstruct every consequential action	Signed receipts, model and prompt versions, approval records	Evidence-bundle verification
Cost	Per-workflow budget and tenant visibility	Token budgets, model routing, caching, usage attribution	Cost telemetry and threshold alerts

Threat Model

Residual risk remains visible and owned. Controls reduce likelihood and blast radius; they do not turn probabilistic systems into risk-free systems.

Residual risk · Medium

Prompt injection

Scenario: Untrusted retrieved content attempts to redirect tools

Controls: Content isolation, instruction hierarchy, policy enforcement at tools

Residual risk · Low-Medium

PHI disclosure

Scenario: Sensitive data reaches logs, models, or unauthorized users

Controls: Classification, redaction, tenant filters, private endpoints

Residual risk · Low

Cross-tenant retrieval

Scenario: Identity or metadata defect exposes another tenant

Controls: Attribute-based access, namespace isolation, adversarial tests

Residual risk · Medium

Tool misuse

Scenario: Agent invokes a valid tool outside intended purpose

Controls: Deny-by-default policies, argument validation, approval thresholds

Residual risk · Low

Evidence tampering

Scenario: Trace or approval history is changed after execution

Controls: Canonical signed receipts, hash chains, immutable retention

Residual risk · Medium

Model/vendor drift

Scenario: Provider behavior changes without controlled acceptance

Controls: Pinned versions, regression suites, canary release, rollback

Capacity and Cost Envelope

Planning ranges make workload assumptions and FinOps choices discussable before vendor selection. Actual pricing depends on model, region, retention, and support requirements.

Operating tier	Monthly volume	Planning range	Primary controls
Pilot	5K workflows/mo	$1.5K-$3K/mo	Managed services, smaller models, shared non-production
Department	50K workflows/mo	$9K-$18K/mo	Caching, model routing, reserved database capacity
Enterprise	500K workflows/mo	$65K-$140K/mo	Tenant attribution, regional resilience, dedicated observability

Planning estimates are modeled reference ranges, not client invoices or guaranteed cloud quotations.

Synthetic Control Walkthrough

Run a credential-free example to see how identity, policy, human approval, evaluation, and signed evidence change with workflow risk.

Member-benefit explanation

Synthetic request with no patient or client data. Choose the risk tier, then inspect the resulting control path.

Awaiting review

No model or external service is called. This walkthrough demonstrates the architecture contract with deterministic synthetic evidence.

Inspectable Evidence Workflow

A five-minute technical review can follow one synthetic request from identity through retrieval, policy, approval, evaluation, signed evidence, and operations.

Authenticate

Identity and tenant claims establish the permitted data and tool scope.

Classify

Input is checked for PHI, intent, risk tier, and required approval path.

Retrieve

FHIR and knowledge sources are filtered before evidence enters model context.

Decide

Policy engine evaluates each consequential tool call outside the prompt.

Escalate

High-risk or uncertain work pauses for an accountable human decision.

Evaluate

Scenario and regression gates verify behavior against release criteria.

Sign

Trace, policy decisions, versions, and approvals become a verifiable receipt.

Operate

SLO, cost, drift, incident, and evidence telemetry feed ongoing ownership.

Healthcare RAG Model Governance Regulated MLOps FHIR Integration Policy Engine Evaluation Harness Signed Receipts Evidence Viewer

Review implementation evidence and executable tests.

Open Proof Lab