Attribution data

Data
6 min read
Updated June 13, 2026

Why it matters

Ad platforms optimize on events they can tie back to an impression or click. When attribution data is missing, truncated, or stored in a different system than revenue, three problems stack up: under-reported conversions in the UI, weak Event Match Quality on server paths, and broken joins in your data warehouse between spend and outcomes.

Performance marketers feel this as "CAPI is live but results look flat." Engineering sees high send volume with low match rate. Analytics cannot reconcile platform cohorts to order-level LTV because the click ID never landed on the purchase row.

Attribution data is also the bridge between first-party data and platform learning. Your data warehouse knows who bought and who returned. Platforms know which ad click started the session. Attribution fields are the join keys that let you train models, send values back on the correct user, and read out incrementality with defensible grain.

Attribution data

Attribution data flows through the full pLTV stack:

  1. Data warehouse (input): Persist click and campaign identifiers on signup, install, and order events alongside stable user keys and net revenue fields.
  2. Model (Churney): Train user-level pLTV on labeled outcomes joined to the same attribution fields used at score time.
  3. Signal design: Choose anchor event timing and signal freshness so values arrive while identifiers are still valid for matching.
  4. Activation (output): Churney sends conversion and value payloads directly to ad networks via Meta CAPI or Google Ads Conversion API, including GCLID, fbc/fbp, and hashed customer parameters where supported.
  5. Readout: Compare matched event volume, Event Match Quality, and incremental ROAS vs BAU once cohort maturity allows.

The data warehouse is modeling input, not the delivery pipe. Attribution data must be captured at the edge (web, app, checkout) and stored in the data warehouse for training, then passed again on the server event at activation time.

Category variants

ChannelKey attribution fieldsCommon gap
Google Ads (web)GCLID, gbraid/wbraid on consent-limited trafficGCLID not stored on order or CRM row
Meta (web)fbc, fbp, campaign/ad set IDs from pixel or CAPIMissing fbc on server-only implementations
Mobile appMMP click ID, install referrer, SKAdNetwork postbacksWeb-to-app handoff drops identifiers
TikTok / otherttclid, network-specific click tokensUTMs used as substitute without click ID

Common mistakes

  1. UTM-only attribution. Campaign labels help analytics but do not replace click IDs for API matching.
  2. Storing IDs only in the ad platform. Data warehouse models cannot join revenue without the same keys locally.
  3. Truncated or hashed GCLID in logs. Breaks Google Ads Conversion API upload matching.
  4. Late ID capture. Click ID collected after redirect chain loses the original parameter.
  5. Cross-domain gaps. Checkout on a subdomain without shared cookie or server forwarding of fbc/fbp.
  6. Treating MMP postbacks as data warehouse truth. Postbacks activate platforms; the data warehouse still needs unified user grain for pLTV.

Advertiser lens

RoleCares about
UA / performanceMatch rate and attributed volume after CAPI go-live
Data engineeringID capture contracts across web, app, and OMS
Marketing analyticsJoin coverage between spend, clicks, and revenue tables
EngineeringServer-side forwarding of click parameters on conversion endpoints

FAQ

What is attribution data?

Identifiers and metadata that connect an ad touch to a later conversion event, used for platform matching, reporting, and data warehouse joins.

What fields matter most for server-side value signals?

Google: GCLID (and gbraid/wbraid where applicable). Meta: fbc, fbp, plus hashed email or phone when consented. Mobile: MMP device and click identifiers aligned to install time.

Why does pLTV need attribution data in the data warehouse?

Modeling and calibration require the same join keys you will send at activation. Training on orders without click IDs produces scores you cannot match back to campaigns.

Does attribution data replace hashed customer identifiers?

No. Click IDs tie events to ad delivery. Hashed PII improves match rate on top of click parameters when users consent.

How do I audit attribution data quality?

Measure fill rate on key fields at conversion, match rate after CAPI launch, and data warehouse join rate between orders and click ID tables.

What if iOS ATT reduces visible attribution?

Data warehouse-first modeling and modeled/consented signals still benefit from every identifier you can capture; readouts lean on holdouts and mature cohorts, not platform totals alone.

Not the same as

TermDifference
UTM parametersCampaign labeling for analytics; not a substitute for network click IDs
Multi-touch attribution (MTA)Statistical model across touches, not the raw identifier payload
Media mix modeling (MMM)Aggregate channel inference without user-level click keys
Hashed identifiersPrivacy-safe customer matching fields, complementary to click IDs