Skip to content
← Insights
6 May 20265 min

When not to use a model

Most of the work in a real operation is deterministic. Put it on a harness, and let the model handle only the thin slice that actually needs judgment.

DoctrineArchitectureReliability

There is a tempting way to build with AI: hand the whole task to a model, give it some tools, and let it figure out each step. It demos well. It also tends to be slow, expensive, hard to debug, and impossible to guarantee.

Most of the work in a real operation does not need a model at all. It needs a schedule, a few rules, and somewhere to put the exceptions.

The harness does the heavy lifting

Look at a typical back-office task and sort the steps into two piles. One pile is deterministic: read this system, match on these fields, write that record, send this on a schedule. The other pile is judgment: this one is ambiguous, a person or a model has to decide.

The first pile is almost always the larger one. Put it on an event-driven harness. It runs the same way every time, you can test it, and when it breaks it tells you exactly where. No model required.

The model earns a narrow seat

Now the second pile, the judgment. This is where a model earns its place: classifying a messy request, drafting language, extracting a field from a document that does not follow the template. Give it clear inputs and clear outputs, and put it behind the harness, not in charge of it.

The result is a system where the reliable part is reliable by construction, and the model is doing the one thing it is genuinely good at, in a place where a wrong answer is caught.

Why it matters for the people paying for it

An operator does not care about the architecture. They care that the thing works on Monday, that it does not cost a fortune per run, and that when it misbehaves someone can find out why before it matters.

A deterministic harness gives you all three. Agent soup, a model improvising every step, gives you a great demo and a system nobody can stand behind. We ship the first one. It is less impressive to describe and far easier to trust.

Start with the number.

The Audit maps your operation, ranks the builds with a clear line to the P&L, and tells you what each is worth. Then you decide.