Contextualizing DiD for Marketing
Marketing panels feature rich adoption patterns, substantial heterogeneity, and pervasive threats to identification assumptions. Section 4.9 of the causal inference text illustrates how the staggered Difference-in-Differences (DiD) framework handles common marketing applications. Depending on the data structure and business question, the choice of estimands, diagnostic workflows, and estimators must be tailored significantly.
Here are three canonical scenarios and their specific causal inference challenges.
1. Loyalty Programme Rollouts
Consider a retail chain launching a loyalty programme in waves across 500 stores over 12 quarters.
- The Estimands: The primary target is often the overall ATT on sales. Secondary estimands include event-time effects $\theta_k$ (to trace how habits develop as customers accumulate points over time) and cohort-specific effects $\tau_g$ (are early adopting stores fundamentally different from late adopters?).
- The Identification Challenge: Staggered Parallel Trends assumes that timing is related to operational constraints, not anticipated sales growth.
- The Primary Threat: Spillovers. Customers might refer friends, artificially elevating control store sales (positive spillover, which attenuates the treatment effect). Conversely, treated stores might cannibalize sales from control stores (negative spillover, which inflates the effect).
- Solutions: SUTVA violations can be addressed via spatial models, defining broader geographic clusters, or implementing buffer zones that drop control stores situated too closely to treated stores.
2. Retail Pricing Policy Changes
Suppose a retailer shifts from frequent promotions to “everyday low pricing” across 200 product categories over 24 weeks.
- The Estimands: Overall ATT on profit margins, along with heterogeneous effects driven by category features like price elasticity or brand concentration.
- The Identification Challenge: Parallel trends are violated if the retailer specifically transitions their high-margin or declining-margin categories first.
- The Primary Threat: Substitution and Competitive Response. Reducing promotional frequency in one category has general equilibrium effects. Competitors might aggressively step up their promotions in that exact category, masking the true effect. Alternatively, consumers may simply substitute and shift their spending to untreated categories that are still heavily promoted.
- Solutions: Rely on modern DiD estimators but extend the analysis using explicit spillover models to capture cross-category substitution. Causal forests can be particularly useful here to flexibly explore heterogeneity across many category characteristics at once.
3. Platform Channel Expansion
A food delivery platform phases its entry into 30 different cities over 24 months, while 20 cities remain untreated.
- The Estimands: The ATT on total restaurant revenues, plus event-time estimates (which reflect the speed of market penetration) and heterogeneous effects by restaurant type (e.g., chains versus independent spots).
- The Identification Challenge: Platforms almost always enter larger, faster-growing cities first. This selection on both observables and unobservables fundamentally casts doubt on Parallel Trends in levels.
- The Primary Threat: Market Re-equilibration. Initial entry brings friction—pricing strategies shift and supply (driver availability) is volatile. The immediate short-term effect often looks very different from the long-run equilibrium. Additionally, measuring whether the platform expanded category demand overall, or just redistributed existing diners, is difficult.
- Solutions: Conditioning on observables like city size and macroeconomic factors via propensity scores is heavily recommended. When differential trends driven by latent shocks remain, standard DiD may fail; factor models (interactive fixed effects) or Generalized Synthetic Control methods are required to produce credible counterfactuals.
Summary
These applications show that DiD cannot be approached as a cookie-cutter calculation. Validating the business logic behind why the treatment was staggered is crucial. Estimators like Callaway–Sant’Anna or Sun–Abraham effectively aggregate multiple rollouts, but anticipating and modeling spillovers or selection bias remains the ultimate determinant of a credible result.