Joint Distribution–Informed Shapley Values for Sparse Counterfactual Explanations
Overview
Overall Novelty Assessment
The paper introduces COLA, a post-hoc refinement framework that uses optimal transport coupling and Shapley-based attribution to reduce feature edits in counterfactual explanations. Within the taxonomy, it resides in the 'Optimal Transport and Coupling-Based Refinement' leaf under 'Optimization-Based Counterfactual Generation Methods'. Notably, this leaf contains only the original paper itself—no sibling papers are present. This isolation suggests the specific combination of optimal transport coupling with Shapley-driven feature selection for counterfactual refinement represents a relatively sparse research direction within the broader optimization-based counterfactual landscape.
The taxonomy reveals that COLA's parent branch, 'Optimization-Based Counterfactual Generation Methods', contains several neighboring leaves: 'Gradient-Based and Perturbation Optimization' (3 papers), 'Shapley Value-Guided Optimization' (2 papers), and 'Multi-Objective and Constraint-Based Optimization' (5 papers). While Shapley values appear in adjacent work for feature prioritization, and optimal transport exists in theoretical frameworks, the taxonomy structure indicates that coupling-based refinement as a distinct methodological approach has not been extensively explored. The scope note clarifies this leaf excludes methods using optimal transport only for post-hoc refinement without coupling theory, suggesting a narrow definitional boundary.
Among 29 candidates examined across three contributions, the 'p-SHAP' component shows one refutable candidate from 10 examined, indicating some overlap with prior Shapley-based attribution methods. The 'COLA framework' contribution examined 10 candidates with zero refutations, suggesting the overall refinement architecture appears distinct within the limited search scope. The 'theoretical guarantees' contribution also shows no refutations across 9 candidates. These statistics reflect a top-K semantic search, not exhaustive coverage, meaning the apparent novelty of COLA's coupling-driven refinement may stem partly from the nascent state of this specific methodological intersection rather than comprehensive field saturation.
Given the limited search scope of 29 candidates and the taxonomy's structural sparsity in this leaf, COLA appears to occupy a relatively unexplored niche combining optimal transport coupling with Shapley attribution for counterfactual refinement. The single refutation for p-SHAP suggests incremental overlap in attribution mechanics, while the framework's overall architecture shows distinctiveness within the examined literature. However, the analysis cannot rule out relevant work outside the top-K semantic matches or in adjacent optimization paradigms not captured by the current taxonomy boundaries.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors propose COLA (COunterfactuals with Limited Actions), a general post-hoc framework that refines counterfactual explanations across different models and CE generators. It uses optimal transport to compute a coupling between factual and counterfactual sets, which then guides Shapley-based attribution to select minimal feature edits while preserving target effects.
The authors introduce p-SHAP, a Shapley value method that integrates an algorithm returning joint probability between factual and counterfactual instances. This method unifies other commonly used Shapley methods under appropriate couplings and provides a modular interface for attribution and edit selection.
The authors provide theoretical results showing that optimal transport minimizes an upper bound on the 1-Wasserstein divergence between factual and counterfactual outcomes. They also prove that under mild conditions, refined counterfactuals remain no farther from factuals than the original counterfactuals.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
Contribution Analysis
Detailed comparisons for each claimed contribution
COLA framework for sparse counterfactual explanations
The authors propose COLA (COunterfactuals with Limited Actions), a general post-hoc framework that refines counterfactual explanations across different models and CE generators. It uses optimal transport to compute a coupling between factual and counterfactual sets, which then guides Shapley-based attribution to select minimal feature edits while preserving target effects.
[6] Refining Counterfactual Explanations With Joint-Distribution-Informed Shapley Towards Actionable Minimality PDF
[51] DISCOUNT: Distributional Counterfactual Explanation With Optimal Transport PDF
[52] Collective Counterfactual Explanations via Optimal Transport PDF
[53] Post-event modeling via causal optimal transport for ctr prediction PDF
[54] Conservative inference for counterfactuals PDF
[55] The matrix reloaded: Towards counterfactual group fairness in machine learning PDF
[56] Distributional Counterfactual Explanations With Optimal Transport PDF
[57] Relative Explanations for Contextual Problems with Endogenous Uncertainty: An Application to Competitive Facility Location PDF
[58] A consistent extension of discrete optimal transport maps for machine learning applications PDF
[59] When adversarial attacks become interpretable counterfactual explanations PDF
Joint distribution-informed Shapley values (p-SHAP)
The authors introduce p-SHAP, a Shapley value method that integrates an algorithm returning joint probability between factual and counterfactual instances. This method unifies other commonly used Shapley methods under appropriate couplings and provides a modular interface for attribution and edit selection.
[61] Counterfactual shapley additive explanations PDF
[6] Refining Counterfactual Explanations With Joint-Distribution-Informed Shapley Towards Actionable Minimality PDF
[60] Counterfactual Shapley Values for Explaining Reinforcement Learning PDF
[62] To Select or Not to Select? The Role of Meta-features Selection in Meta-learning Tasks with Tabular Data PDF
[63] Calculating and Visualizing Counterfactual Feature Importance Values PDF
[64] The Counterfactual-Shapley Value: Attributing Change in System Metrics PDF
[65] Group Shapley Value and Counterfactual Simulations in a Structural Model PDF
[66] Decomposition of inequality of opportunity in India: An application of data-driven machine learning approach PDF
[67] Explaining Reinforcement Learning: A Counterfactual Shapley Values Approach PDF
[68] -test: Global Feature Selection and Inference for Shapley Additive Explanations PDF
Theoretical guarantees for OT-based counterfactual refinement
The authors provide theoretical results showing that optimal transport minimizes an upper bound on the 1-Wasserstein divergence between factual and counterfactual outcomes. They also prove that under mild conditions, refined counterfactuals remain no farther from factuals than the original counterfactuals.