ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation
Overview
Overall Novelty Assessment
The paper introduces ChartGalaxy, a million-scale dataset for infographic chart understanding and generation, constructed through an inductive process identifying 75 chart types, 440 variations, and 68 layout templates. Within the taxonomy, it resides in the 'Large-Scale Dataset Construction' leaf under 'Multimodal Model Development for Chart Tasks', sharing this space with only one sibling paper (NovaChart). This leaf represents a relatively sparse but critical research direction focused on building comprehensive training resources for vision-language models, distinguishing itself from the more crowded branches of design automation and educational applications.
The taxonomy reveals that ChartGalaxy sits within a broader ecosystem of multimodal model development, adjacent to leaves addressing unified multi-task learning frameworks, code-guided synthesis, and specialized UI models. Neighboring branches include 'Automated Design and Authoring Systems' (template extraction, message-driven authoring) and 'Infographic Understanding and Interpretation' (content extraction, cognitive analysis). The scope note for its leaf emphasizes 'diverse chart types, annotations, and task coverage', explicitly excluding domain-specific or small-scale datasets, positioning ChartGalaxy as a general-purpose resource rather than a specialized benchmark.
Among 24 candidates examined across three contributions, the dataset itself (Contribution 1: 10 candidates, 0 refutable) appears novel within the limited search scope, with no prior work directly overlapping its million-scale programmatic construction approach. However, the pipeline for programmatic chart creation (Contribution 2: 10 candidates, 2 refutable) shows more substantial prior work, suggesting that code-based synthesis methods have been explored elsewhere. The three applications demonstrating utility (Contribution 3: 4 candidates, 0 refutable) appear less contested, though the small candidate pool limits confidence in this assessment.
Based on this limited top-24 semantic search, ChartGalaxy's primary novelty appears to lie in its scale and systematic taxonomy-driven construction rather than fundamentally new generation techniques. The analysis does not cover exhaustive literature on chart datasets or programmatic synthesis methods, leaving open the possibility of additional relevant prior work beyond the examined candidates. The sparse population of its taxonomy leaf suggests this dataset-centric direction remains relatively underexplored compared to design automation or educational applications.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors present ChartGalaxy, a large-scale dataset containing 1,701,356 synthetic and 61,833 real infographic charts paired with data tables. The dataset is constructed through an inductive process that identifies chart types, variations, and layout templates from real designs to programmatically create synthetic ones.
The authors develop a human-in-the-loop pipeline that extracts design patterns (75 chart types, 440 variations, 68 layout templates) from real infographic charts and uses them to automatically generate synthetic infographic charts at scale.
The authors demonstrate the value of ChartGalaxy through three distinct applications: improving infographic chart understanding via fine-tuning, benchmarking code generation for infographic charts, and enabling example-based infographic chart generation.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[1] NovaChart: A Large-scale Dataset towards Chart Understanding and Generation of Multimodal Large Language Models PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
ChartGalaxy dataset
The authors present ChartGalaxy, a large-scale dataset containing 1,701,356 synthetic and 61,833 real infographic charts paired with data tables. The dataset is constructed through an inductive process that identifies chart types, variations, and layout templates from real designs to programmatically create synthetic ones.
[20] ScreenAI: A Vision-Language Model for UI and Infographics Understanding PDF
[40] Chart-info 2024: A dataset for chart analysis and recognition PDF
[41] Infographicvqa PDF
[42] Infogen: Generating complex statistical infographics from documents PDF
[43] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation PDF
[44] Scaling text-rich image understanding via code-guided synthetic multimodal data generation PDF
[45] Effective Training Data Synthesis for Improving MLLM Chart Understanding PDF
[46] From pixels to insights: A survey on automatic chart understanding in the era of large foundation models PDF
[47] Evaluation and Analysis of Chart Reasoning Accuracy in Multimodal Large Language Models: An Empirical Study on Influencing Factors PDF
[48] Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering PDF
Pipeline for programmatic infographic chart creation
The authors develop a human-in-the-loop pipeline that extracts design patterns (75 chart types, 440 variations, 68 layout templates) from real infographic charts and uses them to automatically generate synthetic infographic charts at scale.
[52] Datashot: Automatic generation of fact sheets from tabular data PDF
[53] Text-to-viz: Automatic generation of infographics from proportion-related natural language statements PDF
[19] Infographics Generator: A Smart Application for Visual Summarization PDF
[51] Retrieve-Then-Adapt: Example-based Automatic Generation for Proportion-related Infographics PDF
[54] Exploring visual information flows in infographics PDF
[55] OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics PDF
[56] Propagating visual designs to numerous plots and dashboards PDF
[57] Multilingual Infographics Generator: A language-agnostic visual summarizer PDF
[58] MapCraft: Dissecting and Designing Custom Geo-Infographics PDF
[59] Infographics Wizard: Flexible Infographics Authoring and Design Exploration PDF
Three applications demonstrating dataset utility
The authors demonstrate the value of ChartGalaxy through three distinct applications: improving infographic chart understanding via fine-tuning, benchmarking code generation for infographic charts, and enabling example-based infographic chart generation.