CTBench: Cryptocurrency Time Series Generation Benchmark
Overview
Overall Novelty Assessment
The paper introduces CTBench, a comprehensive benchmark for cryptocurrency time series generation, positioned within the Benchmarking and Evaluation Frameworks leaf of the taxonomy. This leaf currently contains only this single paper, indicating a sparse research direction with no direct sibling works. The contribution addresses a recognized gap in systematic evaluation protocols for crypto-specific synthetic data generation, distinguishing itself from the more populated Forecasting Methods and Synthetic Data Generation branches that contain 8-15 papers each focused on model development rather than standardized assessment.
The taxonomy reveals CTBench sits at the intersection of multiple active research directions. The Synthetic Data Generation branch (4 GAN-focused papers) and Forecasting Methods branch (20+ papers across RNNs, classical models, and hybrid approaches) represent neighboring work that CTBench aims to evaluate. The scope_note for Benchmarking explicitly excludes 'individual models without systematic multi-metric benchmarking,' positioning this work as complementary infrastructure rather than competing methodology. The dual-task framework bridges evaluation gaps between generation quality and downstream forecasting utility, connecting to both the generation and forecasting branches.
Among 20 candidates examined across three contributions, no clearly refuting prior work emerged. The CTBench benchmark itself examined 6 candidates with 0 refutable matches, suggesting novelty in providing crypto-specific evaluation infrastructure. The dual-task framework similarly showed 6 candidates examined, 0 refutable, indicating the predictive utility plus statistical arbitrage pairing appears distinctive. The comprehensive financial metric suite examined 8 candidates with no refutations, though this may reflect the limited search scope rather than absolute novelty in individual metrics. The absence of sibling papers in the same taxonomy leaf reinforces that systematic crypto generation benchmarking remains underexplored.
Based on top-20 semantic matches, the work appears to occupy genuinely sparse territory within cryptocurrency time series research. The taxonomy structure shows active development in forecasting architectures and GAN-based generation, but minimal prior effort toward standardized evaluation frameworks combining financial metrics with crypto-specific characteristics. The analysis covers methodological positioning but cannot assess whether individual evaluation metrics or dataset curation practices overlap with broader financial benchmarking literature outside the examined candidate set.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors introduce CTBench, the first benchmark specifically designed for evaluating time series generation methods in cryptocurrency markets. It provides an open-source dataset of 452 tokens and evaluates models across 13 metrics spanning forecasting accuracy, rank fidelity, trading performance, risk assessment, and computational efficiency.
The authors propose a dual-task evaluation framework that assesses both predictive realism and tradable structure. The Predictive Utility task measures whether synthetic data preserves forecasting signals, while the Statistical Arbitrage task evaluates whether reconstructed series enable market-neutral trading strategies.
The authors develop a comprehensive suite of 13 financial metrics grouped into six categories (error-based, rank-based, trading performance, risk assessment, efficiency, and visualization) specifically designed to evaluate TSG models in cryptocurrency contexts, addressing limitations of existing benchmarks that focus on traditional markets.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
Contribution Analysis
Detailed comparisons for each claimed contribution
CTBench: Cryptocurrency Time Series Generation Benchmark
The authors introduce CTBench, the first benchmark specifically designed for evaluating time series generation methods in cryptocurrency markets. It provides an open-source dataset of 452 tokens and evaluates models across 13 metrics spanning forecasting accuracy, rank fidelity, trading performance, risk assessment, and computational efficiency.
[32] Real-time detection of anomalous trading patterns in financial markets using generative adversarial networks PDF
[33] Ethereum Price Prediction Employing Large Language Models for Short-term and Few-shot Forecasting PDF
[34] Quantifying Cryptocurrency Unpredictability: A Comprehensive Study of Complexity and Forecasting PDF
[35] Benchmarking Cryptocurrency Forecasting Models in the Context of Data Properties and Market Factors PDF
[36] CMS-VAE: A Strategy-aware Variational AutoEncoder for High-Fidelity Crypto Market Simulation PDF
[37] A Novel Cryptocurrency Trend Prediction Framework Powered by Innovative Feature Engineering PDF
Dual-task evaluation framework
The authors propose a dual-task evaluation framework that assesses both predictive realism and tradable structure. The Predictive Utility task measures whether synthetic data preserves forecasting signals, while the Statistical Arbitrage task evaluates whether reconstructed series enable market-neutral trading strategies.
[45] Forecasting of virtual power plant generating and energy arbitrage economics in the electricity market using machine learning approach PDF
[46] Machine learning in portfolio decisions PDF
[47] From algorithms to efficiency: generative AI's role in reshaping market efficiency PDF
[48] Machine Learning for Photovoltaic Power Forecasting Integrated with Energy Storage Systems: A Scientometric Analysis, Systematic Review, and Meta ⦠PDF
[49] An artificial neural network framework for dual interest rate parity PDF
[50] Enhanced Electric Vehicle-Aggregator Energy Arbitrage Through Dual-Side Optimization: Integrating Spatial Probability Analysis and Machine Learning Forecasting PDF
Comprehensive financial metric suite for crypto-specific evaluation
The authors develop a comprehensive suite of 13 financial metrics grouped into six categories (error-based, rank-based, trading performance, risk assessment, efficiency, and visualization) specifically designed to evaluate TSG models in cryptocurrency contexts, addressing limitations of existing benchmarks that focus on traditional markets.