This document outlines the feature roadmap for diff-diff, prioritized by practitioner value and academic credibility.
For past changes and release history, see CHANGELOG.md.
diff-diff v2.6.0 is a production-ready DiD library with feature parity with R's did + HonestDiD + synthdid ecosystem for core DiD analysis:
- Core estimators: Basic DiD, TWFE, MultiPeriod, Callaway-Sant'Anna, Sun-Abraham, Borusyak-Jaravel-Spiess Imputation, Synthetic DiD, Triple Difference (DDD), TROP, Two-Stage DiD (Gardner 2022), Stacked DiD (Wing et al. 2024), Continuous DiD (Callaway, Goodman-Bacon & Sant'Anna 2024)
- Valid inference: Robust SEs, cluster SEs, wild bootstrap, multiplier bootstrap, placebo-based variance
- Assumption diagnostics: Parallel trends tests, placebo tests, Goodman-Bacon decomposition
- Sensitivity analysis: Honest DiD (Rambachan-Roth), Pre-trends power analysis (Roth 2022)
- Study design: Power analysis tools
- Data utilities: Real-world datasets (Card-Krueger, Castle Doctrine, Divorce Laws, MPDTA), DGP functions for all supported designs
- Performance: Optional Rust backend for accelerated computation; faster than R at scale (see CHANGELOG.md for benchmarks)
Extend the existing TripleDifference estimator to handle staggered adoption settings.
- Group-time ATT(g,t) for DDD designs with variation in treatment timing
- Event study aggregation and pre-treatment placebo effects
- Multiplier bootstrap for valid inference in staggered settings
Reference: Ortiz-Villavicencio & Sant'Anna (2025). Working Paper. R package: triplediff.
- Synthetic control weight visualization (bar chart of unit weights)
- Treatment adoption "staircase" plot for staggered designs
- Interactive plots with plotly backend option
Semiparametrically efficient versions of existing DiD/event-study estimators with 40%+ precision gains over current methods.
Reference: Chen, Sant'Anna & Xie (2025). Working Paper.
Handles treatment that switches on and off (reversible treatments), unlike most other methods.
- Allows units to move into and out of treatment
- Time-varying, heterogeneous treatment effects
- Comparison with never-switchers or flexible control groups
Reference: de Chaisemartin & D'Haultfœuille (2020, 2024). American Economic Review.
Implements local projections for dynamic treatment effects. Doesn't require specifying full dynamic structure.
- Flexible impulse response estimation
- Robust to misspecification of dynamics
- Natural handling of anticipation effects
Reference: Dube, Girardi, Jordà, and Taylor (2023).
For outcomes where linear models are inappropriate (binary, count, bounded).
- Logit/probit DiD for binary outcomes
- Poisson DiD for count outcomes
- Proper handling of incidence rate ratios and odds ratios
Reference: Wooldridge (2023). The Econometrics Journal.
Extends DiD to duration/survival outcomes where standard methods fail (hazard rates, time-to-event).
- Duration analogue of parallel trends on hazard rates
- Avoids distributional assumptions and hazard function specification
Reference: Deaner & Ku (2025). AEA Conference Paper.
Frontier methods requiring more research investment.
Standard DiD assumes SUTVA; spatial/network spillovers violate this. Two-stage imputation approach estimates treatment AND spillover effects under staggered timing.
Reference: Butts (2024). Working Paper.
Recover the full counterfactual distribution and quantile treatment effects (QTT), not just mean ATT. Goes beyond "what's the average effect" to "who gains, who loses."
- Changes-in-Changes (CiC) identification strategy
- QTT(τ) at user-specified quantiles
- Full counterfactual distribution function
- Two-period foundation, then staggered extension
Reference: Athey & Imbens (2006). Econometrica.
ML-powered conditional ATT — discover who benefits most from treatment using doubly robust meta-learner.
Reference: Lan, Chang, Dillon & Syrgkanis (2025). Working Paper.
Machine learning methods for discovering heterogeneous treatment effects in DiD settings.
- Estimate treatment effect heterogeneity across covariates
- Data-driven subgroup discovery
- Honest confidence intervals for discovered heterogeneity
References:
- Kattenberg, Scheer & Thiel (2023). CPB Discussion Paper.
- Athey & Wager (2019). Annals of Statistics.
Unified framework encompassing synthetic control and regression approaches.
- Nuclear norm regularization for low-rank structure
- Bridges synthetic control (few units, many periods) and regression (many units, few periods)
Reference: Athey et al. (2021). Journal of the American Statistical Association.
For high-dimensional settings with many potential confounders.
- ML for nuisance parameter estimation (propensity, outcome models)
- Cross-fitting for valid inference
Reference: Chernozhukov et al. (2018). The Econometrics Journal.
- Randomization inference: Exact p-values for small samples
- Bayesian DiD: Priors on parallel trends violations
- Conformal inference: Prediction intervals with finite-sample guarantees
- Video tutorials and worked examples
Interested in contributing? Features in the "Near-Term" and "Medium-Term" sections are good candidates. See the GitHub repository for open issues.
Key references for implementation:
- Roth et al. (2023). "What's Trending in Difference-in-Differences?" Journal of Econometrics.
- Baker et al. (2025). "Difference-in-Differences Designs: A Practitioner's Guide."