The Transparency Problem

Data-driven attribution uses machine learning to distribute conversion credit. The algorithm is proprietary. The weighting factors are hidden. You cannot audit the maths that determines where your budget flows.

How Data-Driven Attribution Works

Google's data-driven model analyses your conversion paths and compares them to paths that did not convert. It identifies which touchpoints appear more frequently in successful journeys and assigns credit accordingly.

The concept is sound. Machine learning can identify patterns humans miss. The problem is not the methodology. The problem is that you cannot verify it.

When Google tells you that a particular keyword deserves 30% credit for a conversion, you have no way to validate that claim. You cannot see the training data. You cannot inspect the model weights. You cannot run alternative scenarios.

"Trusting a black box attribution model from the company that profits from your spend is like asking a casino to calculate your odds of winning."

The Conflict of Interest

Google's revenue increases when you spend more on Google Ads. Data-driven attribution determines how credit flows across your campaigns, which directly influences where you invest budget.

We are not suggesting Google deliberately manipulates attribution to inflate spend. But we are suggesting that structural incentives matter. When the referee also sells tickets, scepticism is warranted.

What You Can See

Credit percentages assigned to each touchpoint. Overall conversion paths. Time to conversion. But no insight into why specific weights were chosen.

What You Cannot

Model training data. Feature weights. Confidence intervals. Comparison to counterfactual scenarios. The actual logic driving budget decisions.

The Data Quality Problem

Data-driven attribution requires sufficient conversion data to build a reliable model. Google's minimum is 300 conversions and 3,000 ad interactions within 30 days. But "minimum" does not mean "optimal."

For smaller advertisers scraping past the threshold, the model is training on limited data. The patterns it identifies might be statistical noise rather than genuine signal. You cannot tell the difference.

Even for larger advertisers, seasonal shifts, product changes, and marketing tests can corrupt the training data. The model continues making predictions based on patterns that may no longer apply.

Cross-Device Gaps

Modern customer journeys span devices. Someone might see an ad on mobile, research on desktop, and purchase via tablet. Data-driven attribution attempts to stitch these journeys together, but relies on Google account sign-ins for accuracy.

Users not signed into Google, using privacy browsers, or clearing cookies regularly create fragmented journeys. The model fills gaps with assumptions you cannot inspect. How much of your attributed revenue is based on inferences rather than observations?

Questions to Ask Your Agency

•What percentage of our conversions are cross-device inferences?
•How often does the attribution model retrain on our data?
•How would budget allocation differ under last-click versus data-driven?
•What is the confidence interval on attributed conversions?

The Alternative: Triangulation

Rather than trusting any single attribution model, sophisticated advertisers triangulate. They compare data-driven results against last-click, against position-based, and critically against holdout tests.

When different models tell similar stories, confidence increases. When they diverge dramatically, investigation is warranted. Attribution becomes one input among several rather than gospel truth.

Healthy Scepticism

Data-driven attribution is probably better than last-click for most advertisers. It attempts to distribute credit more fairly across the funnel. The intent is reasonable.

But "better than last-click" is a low bar. And blind trust in any model you cannot audit is a risk. Use data-driven attribution as one perspective among many, not as the definitive answer.

The Practical Approach

Accept that all attribution models are imperfect. Use data-driven as a directional indicator. Validate major budget decisions with holdout tests. Never make six-figure reallocation choices based solely on a black box.

Data-Driven Attribution Is a Black Box (And Google Built It)

The Transparency Problem

How Data-Driven Attribution Works

The Conflict of Interest

What You Can See

What You Cannot

The Data Quality Problem

Cross-Device Gaps

Questions to Ask Your Agency

The Alternative: Triangulation

Healthy Scepticism

The Practical Approach

Want transparent reporting?

Get our insights in your inbox

Related Reading

Every SKU Has a Job

ROAS Can Turn a Pound Into a Loss

The Case for JudeLuxe

Related Services

Book a Strategy Call

Google Ads Audit

Retained Google Ads