Architecture review dossier

Regulated AI that can be defended before engineering, security, risk, and operations.

This is the portfolio's system-level position: explicit decision authority, enforceable boundaries, measurable nonfunctional requirements, inspectable evidence, and operating ownership beyond launch.

6Published architecture decisions with explicit trade-offs
6Measurable nonfunctional requirement categories
8End-to-end control and evidence stages
12Public repositories supporting the reference system

Flagship Reference Architecture

A provider-portable architecture for healthcare RAG and agent workflows. Identity, PHI handling, policy, approval, evaluation, and evidence are platform contracts rather than optional prompt behavior.

Experience
Identity + channelsSSO, member portal, clinical workflow
Human approvalMedical, regulatory, operations
Control plane
Agent orchestrationState, cancellation, retry budgets
Policy engineDeny-by-default tool decisions
Evaluation gatesRegression and release criteria
Intelligence + integration
Healthcare RAGScoped retrieval and citations
FHIR servicesTyped exchange boundaries
Model adaptersProvider-portable inference
Evidence + operations
Signed receiptsCanonical execution evidence
MLOpsValidation, registry, deployment
ObservabilityTraces, cost, incidents, SLOs
Identity boundary
PHI boundary
Tool boundary
Evidence boundary

Scope statement: this is a public reference architecture, not client code. It composes production-derived patterns into an inspectable system and uses synthetic data for verification.

Download sanitized solution architecture brief

Architecture Decision Records

These decisions make the operating philosophy concrete and expose the costs accepted in exchange for control, portability, and evidence.

ADR-01

Keep policy enforcement outside the prompt

Choice: Deny-by-default tool-boundary policy engine

Trade-off: Adds orchestration latency and policy maintenance, but makes authorization deterministic, testable, and auditable.

ADR-02

Separate retrieval evidence from generated language

Choice: Citations, source metadata, and confidence travel with each answer

Trade-off: Increases payload size and UI complexity, but supports review, dispute, and post-incident reconstruction.

ADR-03

Use event-driven processing for long-running agent work

Choice: Durable queue with idempotency, cancellation, retry budgets, and dead-letter recovery

Trade-off: More operational components than synchronous APIs, but safer recovery and backpressure under variable model latency.

ADR-04

Make human approval a first-class state

Choice: Explicit escalation records rather than informal chat handoffs

Trade-off: Slower happy paths, but clear accountability for high-risk clinical, regulatory, and content decisions.

ADR-05

Keep the platform model-provider portable

Choice: Provider adapters behind evaluation and policy contracts

Trade-off: Constrains provider-specific features, but reduces lock-in and makes regulated change control practical.

ADR-06

Sign execution evidence

Choice: Canonical receipts with hash chaining and verifiable signatures

Trade-off: Requires key management and retention controls, but prevents silent mutation of agent evidence.

Nonfunctional Requirements

Architecture is accepted against measurable operational constraints, not diagrams alone.

QualityTargetArchitecture responseVerification
Availability99.9% platform; 99.99% critical event pathMulti-zone services, queue durability, graceful degradationSynthetic probes and recovery tests
Latency<2s p95 retrieval; streaming first token <1.5sCaching, bounded context, asynchronous toolsLoad tests and trace percentiles
PrivacyNo PHI in unapproved model or log pathsClassification, redaction, scoped retrieval, private endpointsPolicy tests and log sampling
RecoveryRTO 60 min; RPO 15 min for evidence storesCross-zone replicas, immutable backups, replayable eventsQuarterly restore exercise
AuditabilityReconstruct every consequential actionSigned receipts, model and prompt versions, approval recordsEvidence-bundle verification
CostPer-workflow budget and tenant visibilityToken budgets, model routing, caching, usage attributionCost telemetry and threshold alerts

Threat Model

Residual risk remains visible and owned. Controls reduce likelihood and blast radius; they do not turn probabilistic systems into risk-free systems.

Residual risk · Medium

Prompt injection

Scenario: Untrusted retrieved content attempts to redirect tools

Controls: Content isolation, instruction hierarchy, policy enforcement at tools

Residual risk · Low-Medium

PHI disclosure

Scenario: Sensitive data reaches logs, models, or unauthorized users

Controls: Classification, redaction, tenant filters, private endpoints

Residual risk · Low

Cross-tenant retrieval

Scenario: Identity or metadata defect exposes another tenant

Controls: Attribute-based access, namespace isolation, adversarial tests

Residual risk · Medium

Tool misuse

Scenario: Agent invokes a valid tool outside intended purpose

Controls: Deny-by-default policies, argument validation, approval thresholds

Residual risk · Low

Evidence tampering

Scenario: Trace or approval history is changed after execution

Controls: Canonical signed receipts, hash chains, immutable retention

Residual risk · Medium

Model/vendor drift

Scenario: Provider behavior changes without controlled acceptance

Controls: Pinned versions, regression suites, canary release, rollback

Capacity and Cost Envelope

Planning ranges make workload assumptions and FinOps choices discussable before vendor selection. Actual pricing depends on model, region, retention, and support requirements.

Operating tierMonthly volumePlanning rangePrimary controls
Pilot5K workflows/mo$1.5K-$3K/moManaged services, smaller models, shared non-production
Department50K workflows/mo$9K-$18K/moCaching, model routing, reserved database capacity
Enterprise500K workflows/mo$65K-$140K/moTenant attribution, regional resilience, dedicated observability

Planning estimates are modeled reference ranges, not client invoices or guaranteed cloud quotations.

Synthetic Control Walkthrough

Run a credential-free example to see how identity, policy, human approval, evaluation, and signed evidence change with workflow risk.

Member-benefit explanation

Synthetic request with no patient or client data. Choose the risk tier, then inspect the resulting control path.

Awaiting review

No model or external service is called. This walkthrough demonstrates the architecture contract with deterministic synthetic evidence.

Inspectable Evidence Workflow

A five-minute technical review can follow one synthetic request from identity through retrieval, policy, approval, evaluation, signed evidence, and operations.

01

Authenticate

Identity and tenant claims establish the permitted data and tool scope.

02

Classify

Input is checked for PHI, intent, risk tier, and required approval path.

03

Retrieve

FHIR and knowledge sources are filtered before evidence enters model context.

04

Decide

Policy engine evaluates each consequential tool call outside the prompt.

05

Escalate

High-risk or uncertain work pauses for an accountable human decision.

06

Evaluate

Scenario and regression gates verify behavior against release criteria.

07

Sign

Trace, policy decisions, versions, and approvals become a verifiable receipt.

08

Operate

SLO, cost, drift, incident, and evidence telemetry feed ongoing ownership.

Review implementation evidence and executable tests.

Open Proof Lab