Fraud detection · Rules

Rules That Learn: The Shadow Mode That Kills Fraud's Blind Spot

Elio Rincón Jun 24, 2026 6 min read

The rule you wrote eight months ago

It's still there, running. Blocking transactions every day. And nobody on your team can answer the one question that matters: is it still catching fraud, or just annoying good customers?

You don't know. Not because you're careless, but because the system won't let you know. A rule that blocks kills its own evidence: the case never executes, never reaches a chargeback, and you never find out whether that block was a win or a legitimate customer you scared off. You cut the end of the story before you could read it.

That's how nearly every fraud rule in the world operates: blind, by design.

A rule doesn't end when you create it: that's where it starts

Think about how a rule is born. Someone analyzed a case, spotted a pattern of possible or outright fraud, and wrote a rule to prevent it from happening again. Good work. The rule goes to production. And right there, for most teams, the story ends.

But that's exactly the moment the questions nobody answers begin:

How do you measure the real impact of that rule today? Do you have the full confusion matrix, or just an estimate?
Do you have a way to monitor its impact over time, without running a manual query every time?
How do you know if a rule is still performing well, or if it already went obsolete because the attacker changed methods three months ago?

The honest answer, on almost every team, is: you don't know, and nobody has time to find out. After creating the rule, the analyst moves on to ten other tasks, loses the context, and that rule keeps running on its own, with no owner, no metric, no review date. The time the team would spend evaluating and monitoring it simply doesn't exist, because there's always a more urgent fire. That's how zombie rules pile up: alive, blocking, and nobody knows whether they protect or get in the way.

The blind spot has a name

In the literature it's called selective labels: you only know the outcome of the cases you let through. Of the ones you blocked, nothing. And that poisons two things at once.

First, your metrics. You can estimate false positives by simulating against history, but that's a lab estimate, not the real behavior of the rule today, against today's traffic.

Second, and worse: your model. If you train the engine only on what the rules let through, the model learns a censored world. It never saw the cases the rule killed, so it goes blind exactly in the zone where the rule decides. It ends up being an echo of the rule, not an independent judge. Two systems staring at the same blind spot and agreeing with each other.

The idea: let the rule pass, on purpose

The only way to know whether a rule is right is to see what happens when it doesn't block. So that's exactly what we do: every rule can switch on an intelligent shadow mode.

Once active, the rule lets a small sample of the cases it would normally block through (every now and then) and watches the end of the story. Did that case you let through end in a chargeback? Then it was fraud: the rule would have been right. No chargeback? It was a legitimate customer: the rule would have blocked someone good.

This isn't giving up on blocking. It's blocking with an open channel to ground truth. The rule still protects you on 90-95% of cases, and the rest teaches you whether it's worth what it costs.

Per-rule chargebacks: closing the loop

Shadow mode lets the case through. Per-rule chargeback metrics tell you what happened to it. That's the piece that closes the loop: crossing every case a rule let through with its real outcome.

With that, the per-rule confusion matrix stops being an estimate and fills itself in, with real data:

Hit (the rule was right): it let through a case that ended in a chargeback. It would have blocked fraud.
False positive (the rule was wrong): it let through a case with no chargeback. It would have blocked a good customer.

For the first time, an honest answer to "is this rule any good?". Not an opinion, not a simulation. The data.

And along the way, the model stops being blind

There's a second effect, and it's the one that excites me most. The model learns from both sides of every rule: from the cases where it made a decision and from the ones it let through. Shadow mode, applied at random, is what fills that second side, and because it's random, the sample isn't biased by the rule itself.

Every case shadow mode lets through comes with its outcome known. That's a clean label: uncensored data, from exactly the zone where the model used to see nothing. So rules stop competing with models and start feeding them. The human writes the rule; the rule acts as a sensor; the sensor generates labeled data from the decision boundary; the model learns from that boundary.

And because the sampling is random and bounded, you can measure its business impact before it hurts: how much fraud letting that sample through implies, against how much learning you buy with it. The old "rules or machine learning?" debate was always a trap. The answer is: rules teach the model, if your architecture closes the loop.

The cost, no makeup

Letting fraud through on purpose costs money. We won't hide that.

But the cost is small and controlled: you don't let everything through, you let a sample through: 5%, 10%, whatever you configure per rule. It's the price of not operating blind, and it's absurdly cheaper than the alternative: a rule that's been blocking your best customers for eight months without anyone knowing, or a model that never improves because it never saw the truth. You pay a known, tiny loss today to avoid an invisible, enormous one every day.

Where it's going

This is what we're building at Frauddi: turning every rule from a switch you flip and forget into an organism with a life cycle: it's born observing in shadow, gets evaluated against real chargebacks, and graduates to hard blocking or gets discarded based on what the evidence says. And that evidence, along the way, feeds the engine.

It's the same thesis we repeat in everything we do: an antifraud system that learns at the attacker's pace, not one that executes blind and prays. The rule you write today should be smarter tomorrow. Not by magic, but because you finally let it see the outcome of its own decisions.

If you want to see what it looks like to operate rules that learn from themselves, book a demo.

Stop running rules blind

Frauddi turns every rule into an organism that learns: intelligent shadow mode, per-rule chargebacks, and a real confusion matrix, so you know, with data, whether a rule catches fraud or annoys customers.

Book a free demo