Skip to main content
    European Search Awards 2026 Winner - Best PPC Agency
    Attribution11 min read

    Data-Driven Attribution Is a Black Box (And Google Built It)

    Google recommends data-driven attribution. They also built it, control it, and benefit when it tells you to spend more. The lack of transparency should concern you.

    The Transparency Problem

    Data-driven attribution uses machine learning to distribute conversion credit. The algorithm is proprietary. The weighting factors are hidden. You cannot audit the maths that determines where your budget flows.

    How Data-Driven Attribution Works

    Google's data-driven model analyses your conversion paths and compares them to paths that did not convert. It identifies which touchpoints appear more frequently in successful journeys and assigns credit accordingly.

    The concept is sound. Machine learning can identify patterns humans miss. The problem is not the methodology. The problem is that you cannot verify it.

    When Google tells you that a particular keyword deserves 30% credit for a conversion, you have no way to validate that claim. You cannot see the training data. You cannot inspect the model weights. You cannot run alternative scenarios.

    "Trusting a black box attribution model from the company that profits from your spend is like asking a casino to calculate your odds of winning."

    The Conflict of Interest

    Google's revenue increases when you spend more on Google Ads. Data-driven attribution determines how credit flows across your campaigns, which directly influences where you invest budget.

    We are not suggesting Google deliberately manipulates attribution to inflate spend. But we are suggesting that structural incentives matter. When the referee also sells tickets, scepticism is warranted.

    What You Can See

    Credit percentages assigned to each touchpoint. Overall conversion paths. Time to conversion. But no insight into why specific weights were chosen.

    What You Cannot

    Model training data. Feature weights. Confidence intervals. Comparison to counterfactual scenarios. The actual logic driving budget decisions.

    The Data Quality Problem

    Data-driven attribution requires sufficient conversion data to build a reliable model. Google's minimum is 300 conversions and 3,000 ad interactions within 30 days. But "minimum" does not mean "optimal."

    For smaller advertisers scraping past the threshold, the model is training on limited data. The patterns it identifies might be statistical noise rather than genuine signal. You cannot tell the difference.

    Even for larger advertisers, seasonal shifts, product changes, and marketing tests can corrupt the training data. The model continues making predictions based on patterns that may no longer apply.

    Cross-Device Gaps

    Modern customer journeys span devices. Someone might see an ad on mobile, research on desktop, and purchase via tablet. Data-driven attribution attempts to stitch these journeys together, but relies on Google account sign-ins for accuracy.

    Users not signed into Google, using privacy browsers, or clearing cookies regularly create fragmented journeys. The model fills gaps with assumptions you cannot inspect. How much of your attributed revenue is based on inferences rather than observations?

    Questions to Ask Your Agency

    • What percentage of our conversions are cross-device inferences?
    • How often does the attribution model retrain on our data?
    • How would budget allocation differ under last-click versus data-driven?
    • What is the confidence interval on attributed conversions?

    The Alternative: Triangulation

    Rather than trusting any single attribution model, sophisticated advertisers triangulate. They compare data-driven results against last-click, against position-based, and critically against holdout tests.

    When different models tell similar stories, confidence increases. When they diverge dramatically, investigation is warranted. Attribution becomes one input among several rather than gospel truth.

    Healthy Scepticism

    Data-driven attribution is probably better than last-click for most advertisers. It attempts to distribute credit more fairly across the funnel. The intent is reasonable.

    But "better than last-click" is a low bar. And blind trust in any model you cannot audit is a risk. Use data-driven attribution as one perspective among many, not as the definitive answer.

    The Practical Approach

    Accept that all attribution models are imperfect. Use data-driven as a directional indicator. Validate major budget decisions with holdout tests. Never make six-figure reallocation choices based solely on a black box.

    Want transparent reporting?

    We show you multiple attribution perspectives, not just Google's preferred view.

    See Our Approach

    Get our insights in your inbox

    Plain-English thinking about Google Ads. No spam, unsubscribe anytime.

    We use cookies to improve your experience. Privacy Policy