← case studies

A search platform that became an agentic platform

Trio Health builds custom datasets for highly specific rare diseases. How we designed an agentic search system capable of extracting meaningful insight from billions of clinical patient notes.

client
Trio Health
role
hands-on retrieval engineering, product strategy
date
May 16, 2026
stack
AWS EKS + KEDATurbopufferBM25 + semantic hybridGPU embedding pipeline

Trio Health’s customers needed on-demand insights from clinical patient notes. The nuance of a question like why a patient stopped taking a medication isn’t present in other structured data formats. That constraint, combined with the sheer data volume — billions of patient notes — meant our solution had to be incredibly price-performant.

the bet

A search API could be used by agents to answer these complicated questions at scale, cost-effectively.

the investment

I helped Trio build a two-stage indexing pipeline (extraction → embedding) with hybrid retrieval (BM25 + semantic + reranking) and classifiers on AWS EKS with KEDA autoscaling. This wrapped a custom API and an agent harness we used to perform searches and produce structured results.

the kpis

Launched January 23. We’ve indexed 137K patients, 37M notes, and 500M chunks — all on our own minimal compute spend, stored in S3 via Turbopuffer. Here’s a bit about the design constraints we cared about, and why we chose what we chose.

the learnings

Since we just launched, we’re still closing the loop on many of the early learnings. But here are several first impressions.

let’s talk

If you’re weighing an agentic-first retrieval platform — or trying to work out the unit economics of search at clinical-data scale — start a conversation.