AI Systems Architect

EricTetzlaff

Agentic Workflow Designer · Document Intelligence Engineer · Forensic AI Systems

I build production-grade AI systems that solve problems that actually matter — under real operational and legal pressure, for real clients.

What I've built.

Not proofs of concept. Not demos. Production systems and open-source tooling running under real operational and legal pressure.

01 MVP Launch — Mid-July 2026
BoardPath
Governance Document Intelligence Platform

Converts fragmented HOA and condominium governing documents into citation-grounded, meeting-ready answers. Three-stage OCR pipeline, hierarchical document weighting, and a proprietary confidence scoring algorithm that shows exactly how every answer was derived — so a board member who doesn't trust AI can see the work.

Read the case study →
View full diagram
INGESTION Q&A PIPELINE AUTOMATION OCR Router Quality Gate LlamaParse MistralAI GPT-4o Extract Supabase pgvector corpus Amend. chains Hybrid retrieval semantic + keyword Hierarchy gate MIN(rank) in code GPT-4o-mini answer gen TC™ scored answer The Boardroom Q&A · Topic Briefs · Correspondence · Chronicle Postmark Resend docx export Supabase WH INGESTION → RETRIEVAL → ANSWER → OUTPUT → CHANNELS
02 In Production
Auris Intelligence
Forensic Legal Document Intelligence Platform

SHA-256 chain-of-custody ingestion across a 110,000+ document corpus spanning PDFs, spreadsheets, Outlook email, and SMS corpora. Context-window-aware subagent scaffolding. AI-orchestrated case analysis mapped to specific legal complaint counts. In active production use by a law firm in civil litigation defense.

Read the case study →
PDF/DOCX XLSX/CSV MSG Email SMS/iOS SHA-256 Manifest Subagent Scaffold Claude Opus Coordinator 110K+ doc corpus Attorney-ready output
04 Working Prototype
Persisted Memory
Agent Memory Consolidation Sidecar

An append-only agent memory store grows forever, costing more to read the longer you work. A read-only sidecar folds raw observations into one evolving digest per project category — rewriting understanding in place instead of stacking notes. 137× compression on the heaviest project; flat read cost regardless of history depth.

Read the case study →
obs n−2 obs n−1 obs n append-only water- mark epoch > merge claude −p fold 1 digest rewritten in place O(observations) O(types) 137× compression at depth
03 Deployed 2022–2025
P2P Automation Stack
AI-Powered Operations Workflow System

A compounding stack of workflow automations — format-agnostic invoice ingestion with direct API integration, license plate recognition for parking enforcement, governing document Q&A with hallucination guardrails, and a voice-matched correspondence assistant trained on 40,000+ personal documents — that collectively reduced operating overhead by 41%.

Read the case study →
Invoice OCR Plate Vision Doc Q&A Voice Companion 41% Overhead ↓ Compounding workflow automation stack
05 Published · npm
transparent-confidence
Open-Source Confidence Scoring for RAG

The scoring engine behind BoardPath, extracted into a standalone npm package. Turns the signals a RAG pipeline already produces into a 0–100 scorecard where every point has a reason attached — and returns an answer / review / abstain action, not just a number. Eight dimensions, 412 tests, dual ESM/CJS, zero runtime dependencies. Apache-2.0.

Read the case study →
SIGNALS YOU ALREADY HAVE retrieval grounding metadata citations scorer no LLM call 0 deps SCORECARD 87 /100 answer 8 dimensions · runtime · zero deps SIGNALS → SCORE → ACTION (answer / review / abstain)

How I build.

I build systems that solve real problems under real pressure. Not prototypes. Not demos for the sake of demos. The things I've built have been used by law firms in active litigation and are weeks away from public product launch. Real pressure produces real design decisions.

My approach starts with the problem, not the technology. I spend more time understanding what's actually broken — and why — than I do selecting tools. The architecture follows from the problem definition. When you start with the tool, you end up bending the problem to fit it. That's how you get AI systems that look impressive in demos and fail in production.

I design for reliability, transparency, and human oversight. AI systems that don't tell you when they're uncertain, don't cite their sources, and don't keep humans in the loop where it matters aren't useful — they're liability. Every system I build reflects that conviction.

We're at an inflection point for this technology that is comparable in scope to the Industrial Revolution. The builders who matter in this moment aren't the ones who can predict exactly where it goes. They're the ones who understand the underlying mechanics deeply, build things that actually work, and keep building. That's the work.

  • Transparency first
    Every output should show its work. Confidence without citation is just a confident guess. I build scoring and attribution layers into answer systems so users can evaluate, not just accept.
  • Context economy
    Token costs and context window limits are real architectural constraints, not implementation details. I design subagent scaffolds specifically to keep expensive work out of coordinator context.
  • Humans stay in the loop
    Automation without human oversight at the right decision points isn't efficiency — it's risk transfer. I design human-in-the-loop checkpoints deliberately, not as an afterthought.
  • Domain knowledge is a design input
    The systems I've built encode real domain expertise — forensic accounting disambiguation, legal document hierarchy, governing authority rank. That knowledge shapes architecture decisions, not just prompts.

Design decisions, written down.

Short-form posts on specific architectural choices from real builds. How I think, not just what I shipped.

04
I spent 14 years solving a document intelligence problem. I just didn't have the right tools yet.
What three-inch binders, unresolvable board disputes, and a rough NotebookLM beta taught me about information hierarchy — and why they became the foundation of everything I build today.
BoardPath · Systems Thinking
01
Why I built a confidence scoring layer — and what it taught me about AI transparency
The question wasn't whether the answer was right. The question was whether a skeptical board member in their 70s would trust it enough to act on it. That's a different design problem.
BoardPath · Confidence Design
02
Context window economics: keeping 110,000 documents out of my coordinator's context
Feeding everything to the coordinator isn't a retrieval strategy. It's expensive, slow, and produces worse answers. Here's the subagent scaffold design that fixed it.
Auris · Subagent Architecture
03
Three-stage OCR or bust: why single-model document extraction fails at scale
A governing document scanned in 1987 on a photocopier that's seen better days is still a legally binding source. Your OCR pipeline has to handle it. Mine does.
BoardPath · OCR Architecture
05
The Roof That Wasn't a Repair
A board argued for weeks about whether a leak was a repair or a replacement. A unit owner hired his own contractor. That chain of events ended in court. The answer was in the documents on day one.
BoardPath · Document Hierarchy
06
The Proxy Holder Problem
49 questions. All correct. Then a proxy holder question with four conflicting documents sent the model sideways — and the harder I made the prompt, the more confidently it got the hierarchy backwards. Here's the deterministic fix.
BoardPath · Guardrail Design
07
How I stress-tested a production AI system before anyone else could
A formal hardening gate with ten behavioral categories, explicit disqualifying conditions, and five code patches — because the failure modes that matter in a governance context are confident, plausible, and cited.
BoardPath · QA Architecture
08
The amendment problem no attorney wants to solve for $400
A CC&Rs plus three amendments isn't a corpus. It's a dispute waiting to happen. Here's how BoardPath builds a single current-state view from the whole chain — and where the line is between AI inference and architectural guarantee.
BoardPath · Amendment Architecture
09
The Deterministic/Probabilistic Boundary
The agent extracts the deadline. Code activates it. The agent drafts the narrative. Code validates the billing entry. When an AI system touches real money and legal obligations, separating probabilistic reasoning from deterministic enforcement isn't a constraint — it's what makes the system auditable and safe to run in production.
Agent Architecture · Litt
10
The memory that folds into itself
An append-only agent memory store grows forever — costing more to read the longer you work, while silently dropping the decisions that matter most. A fold that rewrites understanding in place instead of stacking notes. 137× compression at depth.
Claude Code · Agent Memory
11
The confidence layer didn't belong to BoardPath
The part of the product I was proudest of turned out not to be governance logic at all. It was a domain-agnostic scorer I kept copy-pasting into other projects. Here's why I pulled it out, published it zero-dependency, and what extraction forced me to admit.
transparent-confidence · Library Design

Let's talk about
what you're building.

Open to full-time, contract, and fractional engagements across AI systems architecture, agentic workflow design, and document intelligence. Remote-first. Available for hybrid with reasonable travel.

Cleveland, OH  ·  Remote-first  ·  Available now