AI Data EngProduction

Healthcare AI Data Platform

B1→B5 trust layer: ingest, validate, reason, survive, govern. Every number traces to a file.

View Code Live Demo

55.5K

Records Ingested

Violations to AI Layer

59.3%

AI Cost Reduction

99.0%

Pipeline Success Rate

Production AI data platform for healthcare — 55,500+ encounter records through a 5-layer trust stack: ingestion (B1), truth contracts (B2), semantic knowledge products (B3), self-healing reliability (B4), and AI spend governance with Vertex Context Caching (B5). Zero data-quality violations reached AI-facing endpoints across 1,000 fault-injection runs. 59.3% AI inference cost reduction via novelty-driven attention routing.

B1→B5: every record earns its way to Baymax. Bad data is quarantined before AI sees it. Good data gets the right compute budget. Fully auditable.

Tech Stack

PythonBigQuerydbtFeastFastAPIVertex AIGeminiGCPCloud RunGreat Expectations

Features

B1: Trusted Ingestion

55.5K records, 100% source-to-warehouse reconciliation. Entity resolution to 40,235 canonical patients. Domain-specific plausibility contracts (CLINICAL-002) quarantine impossible records before they reach AI.

B2: Truth Contracts

7 named truth contracts (6 BLOCKING), 48/48 GE checks, 52/52 dbt tests. Zero data-quality violations reached AI-facing endpoints across 1,000 fault-injection runs.

B3: Semantic Knowledge Products

4 versioned semantic profiles (PatientProfile, RiskProfile, MedicationProfile, EncounterSummary). BM25 RAG retrieval: Hit@5=0.95, MRR=0.90, NDCG@10=0.89. Grounded answers verified by Vertex Gen AI Eval.

B4: Self-Healing Reliability

99.0% pipeline success, 90% auto-recovery rate, 99.9% SLA compliance. 9-class failure taxonomy. Stale data incidents = 0 across 1,000 seeded fault-injection runs.

B5: AI Spend Governance

novelty_score (text-embedding-004 kNN) drives 89% of PRO-tier routing decisions. Vertex Context Caching: 61.7% per-call cost reduction (measured via count_tokens API). 59.3% cost savings vs naive all-Pro routing across 401 encounters.

How It Works

B1 Capture

Batch + stream + API ingestion → idempotent BigQuery merge → entity resolution → plausibility contracts quarantine impossible records

→

B2 Trust

7 named truth contracts → 48 GE checks → 52 dbt tests → promotion gate blocks every CRITICAL fault before the agent layer

→

B3 Understand

4 semantic knowledge products → Feast PIT-correct feature store → BM25 retrieval → Vertex Gemini grounded answers

→

B4+B5 Govern

Self-healing orchestration + novelty_score attention routing + Vertex Context Caching → fully auditable inference budget decisions