Learning with Dual-level Noisy Correspondence for Multi-modal Entity Alignment
Overview
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors identify and formalize a new problem in multi-modal entity alignment where misalignments occur at two levels: within entities (entity-attribute pairs) and across knowledge graphs (entity-entity and attribute-attribute pairs). They demonstrate empirically that this dual-level noise undermines both attribute fusion and inter-graph alignment.
The authors propose RULE, a robust framework that estimates correspondence reliability using uncertainty and consensus principles. This reliability estimation enables the method to reduce the impact of noisy correspondences during both attribute fusion within entities and alignment across knowledge graphs.
The authors introduce a correspondence reasoning module that operates during inference to discover latent semantic connections between attributes across graphs. This module uses multi-modal large language models with chain-of-thought reasoning to improve entity identification accuracy at test time, representing a novel contribution to test-time robustness in MMEA.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
Contribution Analysis
Detailed comparisons for each claimed contribution
Dual-level Noisy Correspondence (DNC) problem formulation
The authors identify and formalize a new problem in multi-modal entity alignment where misalignments occur at two levels: within entities (entity-attribute pairs) and across knowledge graphs (entity-entity and attribute-attribute pairs). They demonstrate empirically that this dual-level noise undermines both attribute fusion and inter-graph alignment.
[32] Learning with noisy correspondence for cross-modal matching PDF
[39] Tackling Uncertain Correspondences for Multi-Modal Entity Alignment PDF
[51] Noise-powered multi-modal knowledge graph representation framework PDF
[52] APKGC: Noise-enhanced Multi-Modal Knowledge Graph Completion with Attention Penalty PDF
[53] MAGIC: Noise Mitigation and Knowledge Alignment for Knowledge Graph-Based Multi-modal Recommendation PDF
[54] Multimodal knowledge graph embedding with missing data integration PDF
[55] MDSEA: Knowledge Graph Entity Alignment Based on Multimodal Data Supervision PDF
[56] Multi-modal Entity in One Word: Aligning Multi-level Semantics for Multi-modal Knowledge Graph Completion PDF
[57] Learning versatile multimodal representation for knowledge extraction and reasoning PDF
[58] Neighborhood Matching Entity Alignment Model for Vulnerability Knowledge Graphs PDF
RULE framework with two-fold reliability estimation principle
The authors propose RULE, a robust framework that estimates correspondence reliability using uncertainty and consensus principles. This reliability estimation enables the method to reduce the impact of noisy correspondences during both attribute fusion within entities and alignment across knowledge graphs.
[59] Dynamic Uncertainty Learning with Noisy Correspondence for Text-Based Person Search PDF
[60] Entity alignment with noisy annotations from large language models PDF
[61] An analysis of one-to-one matching algorithms for entity resolution PDF
[62] Joint extraction of entities and relations using multi-label tagging and relational alignment PDF
[63] Entity Matching with Large Language Models as Weak and Strong Labellers PDF
[64] Multilayer network alignment based on topological assessment via embeddings PDF
[65] Matching web tables with knowledge base entities: from entity lookups to entity embeddings PDF
[66] Unsupervised bootstrapping of active learning for entity resolution PDF
[67] Enhance Well Mastering with Machine Learning-Based Entity Matching PDF
[68] Adaptive Pseudo Text Augmentation for Noise-Robust Text-to-Image Person Re-Identification PDF
Test-time correspondence reasoning module
The authors introduce a correspondence reasoning module that operates during inference to discover latent semantic connections between attributes across graphs. This module uses multi-modal large language models with chain-of-thought reasoning to improve entity identification accuracy at test time, representing a novel contribution to test-time robustness in MMEA.