TabStruct: Measuring Structural Fidelity of Tabular Data

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 7.0 Download Report PDF

Tabular dataTabular data structureSynthetic data generation

Evaluating tabular generators remains a challenging problem, as the unique causal structural prior of heterogeneous tabular data does not lend itself to intuitive human inspection. Recent work has introduced structural fidelity as a tabular-specific evaluation dimension to assess whether synthetic data complies with the causal structures of real data. However, existing benchmarks often neglect the interplay between structural fidelity and conventional evaluation dimensions, thus failing to provide a holistic understanding of model performance. Moreover, they are typically limited to toy datasets, as quantifying existing structural fidelity metrics requires access to ground-truth causal structures, which are rarely available for real-world datasets. In this paper, we propose a novel evaluation framework that jointly considers structural fidelity and conventional evaluation dimensions. We introduce a new evaluation metric, global utility, which enables the assessment of structural fidelity even in the absence of ground-truth causal structures. In addition, we present TabStruct, a comprehensive evaluation benchmark offering large-scale quantitative analysis on 13 tabular generators from nine distinct categories, across 29 datasets. Our results demonstrate that global utility provides a task-independent, domain-agnostic lens for tabular generator performance. We release the TabStruct benchmark suite, including all datasets, evaluation pipelines, and raw results.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a multi-dimensional evaluation framework for synthetic tabular data that jointly assesses structural fidelity and conventional quality dimensions, introducing a 'global utility' metric that operates without ground-truth causal structures. It resides in the Multi-Dimensional Evaluation Frameworks leaf, which contains five papers including this one. This leaf sits within the broader Evaluation Frameworks and Methodologies branch, indicating a moderately populated research direction focused on holistic assessment approaches rather than isolated metrics or generation methods.

The taxonomy reveals neighboring work in Benchmark Suites and Comparative Studies (five papers) and Evaluation Tools and Platforms (five papers), both addressing systematic evaluation but with different emphases—standardized comparisons versus software implementation. The Structural and Relational Fidelity Assessment branch (two sub-leaves, eight papers total) focuses specifically on inter-column dependencies and heterogeneity, providing complementary depth to the multi-dimensional perspective. The paper bridges these areas by incorporating structural considerations into a comprehensive framework, distinguishing itself from purely statistical or utility-focused approaches in adjacent branches.

Among thirty candidates examined across three contributions, none yielded clear refutations. The global utility metric examined ten candidates with zero refutable overlaps, suggesting potential novelty in enabling structural assessment without ground-truth causal graphs. The joint evaluation framework and TabStruct benchmark each examined ten candidates with similar results. This limited search scope—thirty papers from semantic retrieval—cannot confirm absolute novelty but indicates that within the examined literature, no prior work explicitly combines these specific elements: causal-structure-agnostic structural metrics, multi-dimensional integration, and large-scale benchmarking across thirteen generators.

Based on top-thirty semantic matches and taxonomy positioning, the work appears to occupy a recognizable but not overcrowded niche. The Multi-Dimensional Evaluation Frameworks leaf contains four siblings, suggesting moderate prior activity in holistic assessment approaches. The absence of refutable candidates within this limited scope suggests the specific combination of contributions may be novel, though exhaustive search across the broader fifty-paper taxonomy and beyond would be necessary to confirm originality conclusively.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Evaluating structural fidelity of synthetic tabular data. The field has organized itself around several complementary perspectives. Evaluation Frameworks and Methodologies provide overarching systems for assessing synthetic data quality, often combining multiple dimensions such as statistical resemblance, utility, and privacy. Structural and Relational Fidelity Assessment focuses specifically on whether generated tables preserve inter-column dependencies, logical constraints, and relational integrity—issues that simple marginal or distributional checks may miss. Evaluation Metrics and Measurement Approaches develop concrete scoring functions and distance measures, while Privacy-Utility Tradeoff Analysis examines the tension between data protection and downstream usefulness. Generation Methods and Comparative Analysis benchmarks different synthesizers (GANs, diffusion models, large language models), Domain-Specific Applications tackle challenges in healthcare or finance, Data-Centric and Preprocessing Approaches address data quality before generation, and Survey and Review Studies synthesize the landscape. Together, these branches reflect a maturing discipline that balances theoretical rigor with practical deployment concerns. Recent work highlights the difficulty of capturing complex structural properties beyond univariate statistics. Multi-Dimensional Evaluation[31] and Critical Evaluation Challenges[18] emphasize that no single metric suffices; evaluators must consider fidelity, diversity, and privacy simultaneously. TabStruct[0] sits squarely within the Multi-Dimensional Evaluation Frameworks branch, proposing a structured approach to assess how well synthetic data preserves intricate dependencies and logical relationships. It shares common ground with Synthetic Tabular Quality[3], which also advocates for holistic quality measures, and with Complex Tabular Evaluation[5], which stresses the need to go beyond simple distributional tests. Meanwhile, works like Inter-Column Logical Relationships[1] and Benchmarking Relational Data[2] drill into specific structural aspects—foreign keys, functional dependencies—that TabStruct[0] aims to incorporate into a unified framework. The central open question remains how to balance computational cost, interpretability, and coverage when evaluating increasingly sophisticated generative models across diverse application domains.

Claimed Contributions

Global utility metric for structural fidelity assessment

10 retrieved papers

The authors propose global utility, a novel metric that allows evaluation of how well synthetic tabular data preserves causal structures without requiring access to ground-truth causal graphs, addressing a key limitation of existing structural fidelity metrics that only work on toy datasets with known causal structures.

10 retrieved papers

Evaluation framework jointly considering structural fidelity and conventional dimensions

10 retrieved papers

The authors develop a comprehensive evaluation framework that integrates structural fidelity assessment with traditional evaluation dimensions such as density estimation, ML efficacy, and privacy preservation, providing a more holistic understanding of tabular generator performance than prior work.

10 retrieved papers

TabStruct benchmark suite

10 retrieved papers

The authors introduce TabStruct, a large-scale benchmark that evaluates 13 tabular generators across 29 datasets with multiple evaluation dimensions, addressing the limited scope of existing benchmarks and providing datasets, evaluation pipelines, and raw results as open resources.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[18] Critical Challenges and Guidelines in Evaluating Synthetic Tabular Data: A Systematic Review PDF

Esnaola Inaki, Nazia Nafis, MartÃnez-PÃ©rez, Ãlvaro, IÃ±aki Esnaola, Villa-Uriol, Maria-Cruz, Alvaro Martinez-Perez, Osmani, Venet, M. Villa-Uriol, V. Osmani (2025)

[31] Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework PDF

Sidorenko Andrey, Platzer, Michael, Andrey Sidorenko, Scriminaci, Mario, Michael Platzer, Tiwald, Paul, Mario Scriminaci, P. Tiwald (2025)

[38] Synthetic tabular data evaluation in the health domain covering resemblance, utility, and privacy dimensions PDF

Mikel Hernadez, Gorka Epelde, Ane Alberdi, Rodrigo Cilla, Debbie Rankin (2023)

[39] FEST: A Unified Framework for Evaluating Synthetic Tabular Data PDF

Weijie Niu, Alberto Celdran, Karoline Siarsky, Alberto Huertas CeldrÃ¡n, Burkhard Stiller (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Global utility metric for structural fidelity assessment

[5] Evaluation of synthetic data generators on complex tabular data PDF

Cannot Refute

[8] LLM-TabLogic: Preserving Inter-Column Logical Relationships in Synthetic Tabular Data via Prompt-Guided Latent Diffusion PDF

Cannot Refute

[21] Structured Evaluation of Synthetic Tabular Data PDF

Cannot Refute

[22] A Comparative Study of Open-Source Libraries for Synthetic Tabular Data Generation: SDV vs. SynthCity PDF

Cannot Refute

[23] Preserving logical and functional dependencies in synthetic tabular data PDF

Cannot Refute

[30] Evaluating Fidelity and Machine Learning Utility of Synthetic Tabular Data Generated Using Generative Models PDF

Cannot Refute

[44] A Quantitative Comparison of Structural and Distributional Properties of Synthetic Tabular Data in Parkinson's Disease PDF

Cannot Refute

[51] Improving the generation and evaluation of synthetic data for downstream medical causal inference PDF

Cannot Refute

[52] Tabularargn: A flexible and efficient auto-regressive framework for generating high-fidelity synthetic data PDF

Cannot Refute

[53] Dependency-aware synthetic tabular data generation PDF

Cannot Refute

Contribution

Evaluation framework jointly considering structural fidelity and conventional dimensions

[61] Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges PDF

Cannot Refute

[62] Grid-Based Decompositions for Spatial Data under Local Differential Privacy PDF

Cannot Refute

[63] TLPP: Deep-Learning-Based Two-Layer Privacy Preserving Mechanism for Protecting Vehicle Trajectory Data PDF

Cannot Refute

[64] Integration Of Machine Learning and Advanced Computing For Optimizing Retail Customer Analytics PDF

Cannot Refute

[65] SynthVal: A Framework for Validating Synthetic Medical Images PDF

Cannot Refute

[66] Differentially Private Graph Data Publishing via Feature-Based Community Detection PDF

Cannot Refute

[67] Differentially private learning of structured discrete distributions PDF

Cannot Refute

[68] OTTER: Optimized Training with Trustworthy Enhanced Replication via Diffusion and Federated VMUNet for Privacy-Aware Medical Segmentation PDF

Cannot Refute

[69] Utilizing synthetic data for privacy-preserving AI modeling in radiomics: a case study * PDF

Cannot Refute

[70] Preserving privacy and fidelity via Ehrhart theory PDF

Cannot Refute

Contribution

TabStruct benchmark suite

[10] How Well Does Your Tabular Generator Learn the Structure of Tabular Data? PDF

Cannot Refute

[20] Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data Generation and Evaluation in Learning Analytics PDF

Cannot Refute

[29] A novel and fully automated platform for synthetic tabular data generation and validation PDF

Cannot Refute

[54] Comprehensive evaluation framework for synthetic tabular data in health: fidelity, utility and privacy analysis of generative models with and without privacy guarantees PDF

Cannot Refute

[55] Tabular and latent space synthetic data generation: a literature review PDF

Cannot Refute

[56] Deep neural networks and tabular data: A survey PDF

Cannot Refute

[57] A comprehensive evaluation framework for synthetic medical tabular data generation PDF

Cannot Refute

[58] Systematic assessment of tabular data synthesis PDF

Cannot Refute

[59] Modeling tabular data using conditional gan PDF

Cannot Refute

[60] Comparison of tabular synthetic data generation techniques using propensity and cluster log metric PDF

Cannot Refute

TabStruct: Measuring Structural Fidelity of Tabular Data

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[18] Critical Challenges and Guidelines in Evaluating Synthetic Tabular Data: A Systematic Review PDF

[31] Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework PDF

[38] Synthetic tabular data evaluation in the health domain covering resemblance, utility, and privacy dimensions PDF

[39] FEST: A Unified Framework for Evaluating Synthetic Tabular Data PDF

Contribution Analysis

Global utility metric for structural fidelity assessment

[5] Evaluation of synthetic data generators on complex tabular data PDF

[8] LLM-TabLogic: Preserving Inter-Column Logical Relationships in Synthetic Tabular Data via Prompt-Guided Latent Diffusion PDF

[21] Structured Evaluation of Synthetic Tabular Data PDF

[22] A Comparative Study of Open-Source Libraries for Synthetic Tabular Data Generation: SDV vs. SynthCity PDF

[23] Preserving logical and functional dependencies in synthetic tabular data PDF

[30] Evaluating Fidelity and Machine Learning Utility of Synthetic Tabular Data Generated Using Generative Models PDF

[44] A Quantitative Comparison of Structural and Distributional Properties of Synthetic Tabular Data in Parkinson's Disease PDF

[51] Improving the generation and evaluation of synthetic data for downstream medical causal inference PDF

[52] Tabularargn: A flexible and efficient auto-regressive framework for generating high-fidelity synthetic data PDF

[53] Dependency-aware synthetic tabular data generation PDF

Evaluation framework jointly considering structural fidelity and conventional dimensions

[61] Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges PDF

[62] Grid-Based Decompositions for Spatial Data under Local Differential Privacy PDF

[63] TLPP: Deep-Learning-Based Two-Layer Privacy Preserving Mechanism for Protecting Vehicle Trajectory Data PDF

[64] Integration Of Machine Learning and Advanced Computing For Optimizing Retail Customer Analytics PDF

[65] SynthVal: A Framework for Validating Synthetic Medical Images PDF

[66] Differentially Private Graph Data Publishing via Feature-Based Community Detection PDF

[67] Differentially private learning of structured discrete distributions PDF

[68] OTTER: Optimized Training with Trustworthy Enhanced Replication via Diffusion and Federated VMUNet for Privacy-Aware Medical Segmentation PDF

[69] Utilizing synthetic data for privacy-preserving AI modeling in radiomics: a case study * PDF

[70] Preserving privacy and fidelity via Ehrhart theory PDF

TabStruct benchmark suite

[10] How Well Does Your Tabular Generator Learn the Structure of Tabular Data? PDF

[20] Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data Generation and Evaluation in Learning Analytics PDF

[29] A novel and fully automated platform for synthetic tabular data generation and validation PDF

[54] Comprehensive evaluation framework for synthetic tabular data in health: fidelity, utility and privacy analysis of generative models with and without privacy guarantees PDF

[55] Tabular and latent space synthetic data generation: a literature review PDF

[56] Deep neural networks and tabular data: A survey PDF

[57] A comprehensive evaluation framework for synthetic medical tabular data generation PDF

[58] Systematic assessment of tabular data synthesis PDF

[59] Modeling tabular data using conditional gan PDF

[60] Comparison of tabular synthetic data generation techniques using propensity and cluster log metric PDF

Table of Contents