Abstract:

Deepfake detection remains a formidable challenge due to the evolving nature of fake content in real-world scenarios. Existing benchmarks, however, diverge sharply from industrial practice: they typically feature homogeneous training sources and low-quality testing images, which hinders the practical deployment of current detectors. To close this gap, we introduce HydraFake, a dataset that contains diversified deepfake techniques and in-the-wild forgeries, along with a rigorous training and evaluation protocol covering unseen model architectures, emerging forgery techniques, and novel data domains. Building on this resource, we propose Veritas, a deepfake detector based on a multi-modal large language model (MLLM). Unlike vanilla chain-of-thought (CoT) prompting, we introduce pattern-aware reasoning that incorporates critical patterns such as "planning" and "self-reflection" to emulate the human forensic process. We further propose a two-stage training pipeline to seamlessly internalize these deepfake-reasoning capabilities into current MLLMs. Experiments on the HydraFake dataset reveal that although previous detectors generalize well in cross-model scenarios, they fall short on unseen forgeries and data domains. Our Veritas achieves significant gains across different out-of-domain (OOD) scenarios and delivers transparent, faithful detection outputs.

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND ITS JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Taxonomy

Core-task Taxonomy Papers: 43
Claimed Contributions: 3
Contribution Candidate Papers Compared: 18
Refutable Papers: 0

Research Landscape Overview

Core task: generalizable deepfake detection via pattern-aware reasoning. The field has evolved into a rich landscape of complementary strategies, each addressing a different facet of the generalization challenge. At the highest level, the taxonomy reveals several major branches:

- Frequency and spectral domain analysis (e.g., Frequency-aware Detection[1], Synthetic Frequency Patterns[3]) exploits artifacts in the frequency spectrum that persist across generation methods.
- Temporal and spatiotemporal reasoning captures inconsistencies over time.
- Feature disentanglement and decomposition (e.g., Texture Artifact Decomposition[8]) separates content from manipulation traces.
- Large pre-trained model adaptation leverages foundation models such as CLIP or large language models.
- Data augmentation and synthetic training strategies create diverse training signals.
- Domain adaptation and generalization strategies (e.g., Invariant Risk Minimization[25]) explicitly optimize for cross-domain robustness.
- Meta-learning and few-shot detection (e.g., Meta-Learning Relation Embedding[22]) enable rapid adaptation to novel forgeries.
- Local and patch-level analysis (e.g., Patch-Discontinuity Mining[34]) focuses on fine-grained spatial cues.
- Identity and semantic consistency analysis checks for logical coherence.
- Noise pattern and forensic trace analysis mines low-level statistical signatures.
- Attention mechanisms and architectural innovations introduce novel inductive biases.
- Baseline and specialized architectures provide reference points.
- Surveys (e.g., Robust Detection Survey[6]) synthesize the state of the art.

A particularly active line of work centers on adapting large pre-trained models to deepfake detection, where methods such as C2P-CLIP[5] and DeepFake-Adapter[23] fine-tune vision-language or vision-only backbones to capture generalizable forgery patterns.
Within this branch, a small but growing cluster explores multimodal large language model reasoning, combining visual and textual modalities to perform more interpretable, context-aware detection. Veritas[0] sits squarely in this cluster, alongside Skyra[15] and EDVD-LLaMA[32], all of which harness the reasoning capabilities of large language models to identify subtle inconsistencies that simpler architectures might miss. Compared to Skyra[15], which emphasizes cross-modal alignment, and EDVD-LLaMA[32], which integrates video-level temporal cues, Veritas[0] focuses on pattern-aware reasoning that bridges low-level forensic traces with high-level semantic understanding. This direction reflects a broader trend toward interpretable, reasoning-driven detection, contrasting with purely data-driven approaches in frequency analysis or meta-learning branches, and highlights ongoing questions about how best to combine domain-specific inductive biases with the flexibility of foundation models.

Claimed Contributions

HydraFake dataset with hierarchical evaluation protocol

The authors construct a new deepfake detection dataset featuring diverse forgery techniques and in-the-wild samples. They establish a hierarchical evaluation protocol with four testing levels (in-domain, cross-model, cross-forgery, cross-domain) to simulate real-world challenges and comprehensively measure detector generalization.

7 retrieved papers
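The four-level protocol above can be pictured as a small evaluation harness that scores a detector separately per level. This is a minimal illustrative sketch, not HydraFake's actual tooling: the detector interface, the data layout, and the choice of a 0.5-threshold accuracy metric (real protocols often report AUC) are all assumptions.

```python
from statistics import mean

# Hypothetical four-level harness mirroring the protocol described above:
# each level groups test sets by how far they drift from training data.
LEVELS = ["in-domain", "cross-model", "cross-forgery", "cross-domain"]

def evaluate_by_level(detector, test_sets):
    """detector: callable mapping an image to a fake-probability in [0, 1].
    test_sets: list of dicts with 'level', 'image', and 'is_fake' keys.
    Returns per-level accuracy for the levels that have samples."""
    results = {}
    for level in LEVELS:
        subset = [s for s in test_sets if s["level"] == level]
        if not subset:
            continue
        # Accuracy at a 0.5 threshold; thresholds/metrics are assumptions.
        correct = [(detector(s["image"]) >= 0.5) == s["is_fake"]
                   for s in subset]
        results[level] = mean(correct)
    return results
```

The point of the per-level split is that a single aggregate score would hide exactly the failure mode the report highlights: strong cross-model accuracy masking weak cross-forgery and cross-domain accuracy.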
Pattern-aware reasoning framework for deepfake detection

The authors propose a reasoning framework that incorporates five thinking patterns (fast judgement, planning, reasoning, self-reflection, conclusion) inspired by human forensic analysis. This pattern-aware approach enables logical and holistic reasoning for deepfake detection, outperforming vanilla chain-of-thought methods.

1 retrieved paper
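One way to make the five-pattern structure concrete is to validate that a model's response actually contains all five patterns in order. The sketch below is a guess at such a checker: the tag-based serialization and tag names are hypothetical, since the report does not specify how Veritas marks up its reasoning traces.

```python
import re

# The five thinking patterns named above, in the order a response is
# expected to traverse them. Tag names are illustrative only.
PATTERNS = ["fast_judgement", "planning", "reasoning",
            "self_reflection", "conclusion"]

def check_pattern_order(response: str) -> bool:
    """Return True iff every pattern tag appears and in the given order."""
    positions = []
    for tag in PATTERNS:
        m = re.search(rf"<{tag}>.*?</{tag}>", response, re.DOTALL)
        if m is None:
            return False          # a pattern is missing entirely
        positions.append(m.start())
    # All patterns present; verify they occur in the prescribed order.
    return positions == sorted(positions)
```

A check like this is useful both for filtering cold-start training data and as a structural reward signal during reinforcement learning.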
Two-stage training pipeline with MiPO and P-GRPO

The authors develop a two-stage training pipeline: a pattern-guided cold start (with SFT and Mixed Preference Optimization, MiPO) followed by Pattern-aware Group Relative Policy Optimization (P-GRPO). This pipeline internalizes reasoning abilities into MLLMs, enabling adaptive planning and self-reflection while delivering transparent and faithful detection outputs.

10 retrieved papers
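The second stage builds on GRPO, whose core idea is to normalize each sampled response's reward against its own group rather than a learned value baseline. The sketch below shows that group-relative advantage plus an illustrative "pattern-aware" composite reward; the reward weights and decomposition are assumptions, not the paper's actual P-GRPO reward.

```python
from statistics import mean, pstdev

def group_relative_advantages(rewards):
    """GRPO-style advantage: standardize each reward within its group
    of responses sampled for the same prompt."""
    mu, sigma = mean(rewards), pstdev(rewards)
    if sigma == 0:
        return [0.0 for _ in rewards]  # no signal if all rewards tie
    return [(r - mu) / sigma for r in rewards]

def pattern_aware_reward(correct, has_all_patterns,
                         w_acc=1.0, w_fmt=0.5):
    """Illustrative composite reward: task correctness plus a bonus for
    emitting the full reasoning-pattern structure. Weights are made up."""
    return w_acc * float(correct) + w_fmt * float(has_all_patterns)
```

Under this scheme, a response that is both correct and well-structured outscores a correct but unstructured one within the same group, pushing the policy toward the planning and self-reflection behavior the cold-start stage seeded.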

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

HydraFake dataset with hierarchical evaluation protocol

The authors construct a new deepfake detection dataset featuring diverse forgery techniques and in-the-wild samples. They establish a hierarchical evaluation protocol with four testing levels (in-domain, cross-model, cross-forgery, cross-domain) to simulate real-world challenges and comprehensively measure detector generalization.

Contribution

Pattern-aware reasoning framework for deepfake detection

The authors propose a reasoning framework that incorporates five thinking patterns (fast judgement, planning, reasoning, self-reflection, conclusion) inspired by human forensic analysis. This pattern-aware approach enables logical and holistic reasoning for deepfake detection, outperforming vanilla chain-of-thought methods.

Contribution

Two-stage training pipeline with MiPO and P-GRPO

The authors develop a two-stage training pipeline: a pattern-guided cold start (with SFT and Mixed Preference Optimization, MiPO) followed by Pattern-aware Group Relative Policy Optimization (P-GRPO). This pipeline internalizes reasoning abilities into MLLMs, enabling adaptive planning and self-reflection while delivering transparent and faithful detection outputs.

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning | Novelty Validation