DoubleGen: Debiased Generative Modeling of Counterfactuals

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.0 Download Report PDF

generative modelingcounterfactualdoubly robustdebiased machine learning

Generative models for counterfactual outcomes face two key sources of bias. Confounding bias arises when approaches fail to account for systematic differences between those who receive the intervention and those who do not. Misspecification bias arises when methods attempt to address confounding through estimation of an auxiliary model, but specify it incorrectly. We introduce DoubleGen, a doubly robust framework that modifies generative modeling training objectives to mitigate these biases. The new objectives rely on two auxiliaries---a propensity and outcome model---and successfully address confounding bias even if only one of them is correct. We provide finite-sample guarantees for this robustness property. We further establish conditions under which DoubleGen achieves oracle optimality---matching the convergence rates standard approaches would enjoy if interventional data were available---and minimax rate optimality. We illustrate DoubleGen with three examples: diffusion models, flow matching, and autoregressive language models.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces DoubleGen, a framework that modifies generative model training objectives to achieve doubly robust counterfactual generation under confounding. It resides in the 'Doubly Robust and Oracle-Optimal Estimation' leaf, which contains only one sibling paper among the fifty surveyed. This sparse occupancy suggests the intersection of doubly robust theory and generative modeling remains relatively underexplored. The taxonomy shows that most work either focuses on theoretical identifiability without generative architectures or on generative designs without formal robustness guarantees, making DoubleGen's position at this intersection noteworthy.

The taxonomy reveals neighboring research directions that contextualize DoubleGen's contribution. Adjacent leaves include 'Identifiability under Hidden Confounding' (three papers on bounds and proxy methods) and 'Causal Structure Learning and Validation' (three papers on graph discovery). The broader 'Deconfounding via Auxiliary Models' branch contains nine papers across propensity weighting, latent confounder inference, and instrumental variables. DoubleGen bridges these areas by employing dual auxiliary models (propensity and outcome) within generative architectures, whereas neighboring work typically treats auxiliary modeling and generative synthesis as separate stages rather than unified training objectives.

Among thirty candidates examined, none clearly refute any of the three contributions. The first contribution (DoubleGen framework) examined ten candidates with zero refutable overlaps; the second (finite-sample guarantees) and third (unified application to diffusion, flow, and autoregressive models) each examined ten candidates with identical results. This limited search scope means the analysis captures top semantic matches and their citations but cannot claim exhaustive coverage. The absence of refutable candidates suggests that combining doubly robust estimation with generative model training objectives represents a relatively unexplored methodological direction within the examined literature.

Based on the top-thirty semantic matches and taxonomy structure, DoubleGen appears to occupy a sparsely populated niche. The single sibling paper and zero refutable candidates indicate limited prior work directly addressing doubly robust generative modeling. However, the search scope remains constrained, and the taxonomy shows substantial activity in adjacent areas (nine papers on auxiliary models, nine on generative architectures). A more exhaustive search might reveal closer precedents, particularly in recent conference proceedings or domain-specific venues not fully captured here.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Debiased generative modeling of counterfactual outcomes under confounding. The field addresses how to learn generative models that can predict what would have happened under alternative treatments or interventions when observational data suffer from hidden confounders. The taxonomy reveals several complementary research directions. Theoretical Frameworks and Identifiability establish formal conditions under which causal quantities can be recovered, often drawing on doubly robust estimation and identifiability results. Deconfounding via Auxiliary Models explores techniques that leverage proxy variables, instrumental variables, or learned representations to mitigate unmeasured confounding. Generative Architecture Designs focus on the modeling side, employing VAEs, GANs, diffusion models, and normalizing flows to synthesize counterfactual samples. Counterfactual Data Augmentation for Bias Mitigation uses generated counterfactuals to debias downstream predictors in vision, language, and recommendation tasks. Treatment Effect Estimation targets the direct quantification of causal effects, while Application Domains showcase deployments in healthcare, finance, and other real-world settings. Methodological Surveys and Comparative Studies provide overarching perspectives on progress and open challenges. A particularly active line of work centers on combining theoretical guarantees with flexible generative architectures. Some studies emphasize identifiability under minimal assumptions, such as Reconsidering Generative Objectives[3], which revisits how objective functions interact with causal structure. Others prioritize practical debiasing strategies, for instance Debiasing Generative Models[4] and Generative Counterfactual Augmentation[1], which apply counterfactual synthesis to reduce spurious correlations in classifiers. DoubleGen[0] sits within the Theoretical Frameworks branch, specifically under Doubly Robust and Oracle-Optimal Estimation, suggesting it integrates robust statistical principles with generative modeling. Compared to Reconsidering Generative Objectives[3], which interrogates foundational modeling choices, DoubleGen[0] appears to emphasize formal efficiency and robustness properties that protect against model misspecification. This positioning highlights an ongoing tension between achieving strong theoretical coverage and designing architectures that scale to complex, high-dimensional data.

Claimed Contributions

DoubleGen framework for debiased counterfactual generation

10 retrieved papers

The authors propose DoubleGen, a doubly robust framework that adapts standard generative modeling training objectives to generate counterfactual outcomes while mitigating confounding bias and misspecification bias. The framework uses two auxiliary models (propensity and outcome) and remains valid if at least one is correctly specified.

10 retrieved papers

Finite-sample statistical guarantees with oracle and minimax optimality

10 retrieved papers

The authors establish finite-sample guarantees for DoubleGen's double robustness property and provide conditions under which the method achieves oracle optimality (matching rates as if counterfactual data were available) and minimax rate optimality for the counterfactual generation problem.

10 retrieved papers

Unified application to multiple generative modeling paradigms

10 retrieved papers

The authors demonstrate how DoubleGen can be applied to three different generative modeling frameworks: diffusion models, flow matching, and autoregressive language models, providing a unified approach that can adapt to various generative modeling strategies.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[3] Reconsidering generative objectives for counterfactual reasoning PDF

Danni Lu, Chenyang Tao, Junya Chen, Fan Li, Feng Guo, Lawrence Carin (2020)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

DoubleGen framework for debiased counterfactual generation

[2] Conformal Counterfactual Inference under Hidden Confounding PDF

Cannot Refute

[71] Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings PDF

Cannot Refute

[72] Practical and Robust Safety Guarantees for Advanced Counterfactual Learning to Rank PDF

Cannot Refute

[73] CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation PDF

Cannot Refute

[74] Doubly robust estimation and inference for a log-concave counterfactual density PDF

Cannot Refute

[75] DiffPO: A causal diffusion model for learning distributions of potential outcomes PDF

Cannot Refute

[76] DR-VIDAL-Doubly Robust Variational Information-theoretic Deep Adversarial Learning for Counterfactual Prediction and Treatment Effect Estimation on Real World â¦ PDF

Cannot Refute

[77] Inferring Heterogeneous Treatment Effects of Crashes on Highway Traffic: A Doubly Robust Causal Machine Learning Approach PDF

Cannot Refute

[78] Counterfactual prediction under selective confounding PDF

Cannot Refute

[79] Doubly robust estimation of causal effects for random object outcomes with continuous treatments PDF

Cannot Refute

Contribution

Finite-sample statistical guarantees with oracle and minimax optimality

[51] Semiparametric Counterfactual Density Estimation PDF

Cannot Refute

[52] Towards optimal doubly robust estimation of heterogeneous causal effects PDF

Cannot Refute

[53] Policy learning âwithoutâ overlap: Pessimism and generalized empirical Bernstein's inequality PDF

Cannot Refute

[54] Policy learning with new treatments PDF

Cannot Refute

[55] Toward minimax off-policy value estimation PDF

Cannot Refute

[56] Who should be treated? empirical welfare maximization methods for treatment choice PDF

Cannot Refute

[57] Optimal statistical inference for individualized treatment effects in high-dimensional models PDF

Cannot Refute

[58] Minimax off-policy evaluation for multi-armed bandits PDF

Cannot Refute

[59] Minimax optimal nonparametric estimation of heterogeneous treatment effects PDF

Cannot Refute

[60] Causal inference with high-dimensional discrete covariates PDF

Cannot Refute

Contribution

Unified application to multiple generative modeling paradigms

[61] Discrete flow matching PDF

Cannot Refute

[62] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models PDF

Cannot Refute

[63] MonoFormer: One Transformer for Both Diffusion and Autoregression PDF

Cannot Refute

[64] Beyond next-token: Next-x prediction for autoregressive visual generation PDF

Cannot Refute

[65] Unigenx: Unified generation of sequence and structure with autoregressive diffusion PDF

Cannot Refute

[66] Generative Latent Neural PDE Solver using Flow Matching PDF

Cannot Refute

[67] Flowar: Scale-wise autoregressive image generation meets flow matching PDF

Cannot Refute

[68] Improving Progressive Generation with Decomposable Flow Matching PDF

Cannot Refute

[69] The Mathematics of Modern Generative Modeling: Normalizing Flows, Autoregressive and Diffusion Models PDF

Cannot Refute

[70] Fisher Flow Matching for Generative Modeling over Discrete Data PDF

Cannot Refute

DoubleGen: Debiased Generative Modeling of Counterfactuals

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[3] Reconsidering generative objectives for counterfactual reasoning PDF

Contribution Analysis

DoubleGen framework for debiased counterfactual generation

[2] Conformal Counterfactual Inference under Hidden Confounding PDF

[71] Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings PDF

[72] Practical and Robust Safety Guarantees for Advanced Counterfactual Learning to Rank PDF

[73] CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation PDF

[74] Doubly robust estimation and inference for a log-concave counterfactual density PDF

[75] DiffPO: A causal diffusion model for learning distributions of potential outcomes PDF

[76] DR-VIDAL-Doubly Robust Variational Information-theoretic Deep Adversarial Learning for Counterfactual Prediction and Treatment Effect Estimation on Real World â¦ PDF

[77] Inferring Heterogeneous Treatment Effects of Crashes on Highway Traffic: A Doubly Robust Causal Machine Learning Approach PDF

[78] Counterfactual prediction under selective confounding PDF

[79] Doubly robust estimation of causal effects for random object outcomes with continuous treatments PDF

Finite-sample statistical guarantees with oracle and minimax optimality

[51] Semiparametric Counterfactual Density Estimation PDF

[52] Towards optimal doubly robust estimation of heterogeneous causal effects PDF

[53] Policy learning âwithoutâ overlap: Pessimism and generalized empirical Bernstein's inequality PDF

[54] Policy learning with new treatments PDF

[55] Toward minimax off-policy value estimation PDF

[56] Who should be treated? empirical welfare maximization methods for treatment choice PDF

[57] Optimal statistical inference for individualized treatment effects in high-dimensional models PDF

[58] Minimax off-policy evaluation for multi-armed bandits PDF

[59] Minimax optimal nonparametric estimation of heterogeneous treatment effects PDF

[60] Causal inference with high-dimensional discrete covariates PDF

Unified application to multiple generative modeling paradigms

[61] Discrete flow matching PDF

[62] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models PDF

[63] MonoFormer: One Transformer for Both Diffusion and Autoregression PDF

[64] Beyond next-token: Next-x prediction for autoregressive visual generation PDF

[65] Unigenx: Unified generation of sequence and structure with autoregressive diffusion PDF

[66] Generative Latent Neural PDE Solver using Flow Matching PDF

[67] Flowar: Scale-wise autoregressive image generation meets flow matching PDF

[68] Improving Progressive Generation with Decomposable Flow Matching PDF

[69] The Mathematics of Modern Generative Modeling: Normalizing Flows, Autoregressive and Diffusion Models PDF

[70] Fisher Flow Matching for Generative Modeling over Discrete Data PDF

Table of Contents

[76] DR-VIDAL-Doubly Robust Variational Information-theoretic Deep Adversarial Learning for Counterfactual Prediction and Treatment Effect Estimation on Real World â¦ PDF

[53] Policy learning âwithoutâ overlap: Pessimism and generalized empirical Bernstein's inequality PDF