Exchangeability of GNN Representations with Applications to Graph Retrieval

ICLR 2026 Conference Submission

Anonymous Authors

OpenReview Score: 6.0

GNN, Locality sensitive hashing

Abstract:

In this work, we discover a probabilistic symmetry, called exchangeability, in graph neural networks (GNNs). Specifically, we show that the trained node embeddings computed by a large family of GNNs, learned under standard optimization tools, are exchangeable random variables. This implies that the probability density of the node embeddings remains invariant with respect to a permutation applied on their dimension axis. As a result, the elements of the graph representations are identically distributed. This property enables approximation of transportation-based graph similarities by Euclidean similarities between order statistics. Leveraging this reduction, we propose a unified locality-sensitive hashing (LSH) framework that supports diverse relevance measures, including subgraph matching and graph edit distance. Experiments show that our method performs LSH more effectively than baselines.

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Research Landscape Overview

Core task: Locality-sensitive hashing for graph retrieval using transportation-based similarity measures. The field reflects a convergence of optimal transport theory, graph neural networks, and scalable retrieval systems. The taxonomy organizes work into four main branches: Optimal Transport Approximation Methods focus on efficient computation of Wasserstein and related distances, often trading exactness for speed (e.g., Scalable optimal transport in [3], Warpspeed Computation of Optimal [8]); Graph Neural Network Architectures with Transport-Based Distances explore how to embed graphs in ways that respect or approximate transport metrics; Neural Graph Retrieval Systems address end-to-end pipelines for indexing and querying large graph databases (e.g., Scalable Neural Graph Retrieval [6], Scalable and Multi-modal Neural [7]); and Domain-Specific Applications of Transport-Based Retrieval demonstrate these techniques in areas like molecular search or shape matching (e.g., Fast contour matching using [4]). Together, these branches illustrate a progression from foundational distance computation to learned representations and practical retrieval architectures.

A particularly active theme concerns the interplay between approximation quality and computational scalability: many studies seek hashing schemes or neural embeddings that preserve transport-based distances while enabling sublinear query times. Within this landscape, Exchangeability of GNN Representations [0] sits in the Graph Neural Network Architectures branch, emphasizing the theoretical property of exchangeability to ensure that learned node or graph embeddings remain compatible with locality-sensitive hashing under permutation-invariant transport metrics. This contrasts with works like SLoSH [1] or HIGH-PERFORMANCE SEMANTIC SIMILARITY ANALYSIS [2], which may prioritize empirical speedups or domain-specific tuning over formal invariance guarantees. Meanwhile, Template based Graph Neural [5] explores structured message-passing that could complement exchangeable representations.

The central tension across these lines is whether to rely on hand-crafted transport approximations, end-to-end learned hashing, or hybrid approaches that blend neural architectures with classical LSH theory.

Claimed Contributions

Exchangeability of GNN node embeddings

10 retrieved papers

The authors establish that node embeddings produced by trained GNNs exhibit exchangeability across embedding dimensions, meaning the joint probability density of embedding elements remains invariant under permutations of the dimension axis. This property holds for a broad class of GNN architectures, loss functions, and optimizers.
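The exchangeability claim above can be written out as a short formula. This is only a sketch in assumed notation: the symbols h, D, p, and S_D do not come from the report and are introduced here for illustration.

```latex
% Hypothetical notation: h = (h_1, ..., h_D) is a D-dimensional node embedding
% treated as a random vector, p is its joint density, and S_D is the set of
% permutations of {1, ..., D}.
\[
  p(h_1, \dots, h_D) = p\bigl(h_{\pi(1)}, \dots, h_{\pi(D)}\bigr)
  \quad \text{for all } \pi \in S_D .
\]
% Consequence: every coordinate has the same marginal distribution.
\[
  p_{h_i}(t) = p_{h_j}(t) \quad \text{for all } i, j \in \{1, \dots, D\}.
\]
```

The second line is the "identical distribution across the elements" property mentioned in the abstract. It follows from the first by marginalizing out all but one coordinate.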


Approximation of transportation-based graph similarity using Euclidean similarity

10 retrieved papers

The authors leverage exchangeability to approximate computationally expensive transportation-based graph similarities with simpler Euclidean similarities computed over sorted embedding elements in individual dimensions, reducing complexity from O(n³) to dimension-wise operations.
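A well-known one-dimensional fact may help make the role of sorting concrete: for two equal-size sets of scalars, the cheapest one-to-one matching under squared cost pairs the values in sorted order. The sketch below checks this by brute force on a small random example. It is not the authors' method; the function names are hypothetical, and how the paper uses exchangeability to extend the idea to multi-dimensional embeddings is not shown here.

```python
import itertools
import random


def sorted_matching_cost(x, y):
    """Squared Euclidean distance between the sorted values of x and y."""
    return sum((a - b) ** 2 for a, b in zip(sorted(x), sorted(y)))


def brute_force_matching_cost(x, y):
    """Cheapest one-to-one matching, found by trying every permutation of y."""
    return min(
        sum((a - b) ** 2 for a, b in zip(x, perm))
        for perm in itertools.permutations(y)
    )


# Small random example (6 values per side keeps the brute force at 720 permutations).
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(6)]
y = [rng.gauss(0.0, 1.0) for _ in range(6)]

fast = sorted_matching_cost(x, y)
exact = brute_force_matching_cost(x, y)
print(abs(fast - exact) < 1e-9)
```

Sorting costs O(n log n) per dimension, whereas a general assignment problem over n items is typically solved in cubic time, which is the kind of gap the contribution describes.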


Unified locality-sensitive hashing framework for graph retrieval

Can Refute

10 retrieved papers

The authors develop GRAPH HASH, a unified LSH framework that supports multiple asymmetric graph relevance measures including subgraph matching and graph edit distance by combining their exchangeability-based approximation with Fourier-based hashing techniques.
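To illustrate how hashing can be combined with sorted embedding values, here is a minimal sketch that uses sign random projections (SimHash), a generic LSH family for angular similarity. This is a swapped-in technique, not the Fourier-based hashing the report attributes to the authors, and it does not cover asymmetric relevance measures. All names (`make_hyperplanes`, `hash_code`) are hypothetical.

```python
import random


def make_hyperplanes(dim, num_bits, seed=0):
    """Draw num_bits random Gaussian hyperplanes in dim dimensions."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def hash_code(values, hyperplanes):
    """Sort the values first, then keep the sign of each random projection.

    Sorting makes the code independent of the order in which values arrive.
    """
    ordered = sorted(values)
    return tuple(
        1 if sum(w * v for w, v in zip(plane, ordered)) >= 0 else 0
        for plane in hyperplanes
    )


planes = make_hyperplanes(dim=5, num_bits=8)
embedding = [0.3, -1.2, 0.8, 2.1, -0.4]
shuffled = [2.1, 0.3, -0.4, -1.2, 0.8]  # same values, different order

code_a = hash_code(embedding, planes)
code_b = hash_code(shuffled, planes)
print(code_a == code_b)
```

In a retrieval setting, items whose codes collide would land in the same bucket and be compared first. Using several independent hash tables is the usual way to trade recall against query time.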


Core Task Comparisons

Comparisons with papers in the same taxonomy category

Within the taxonomy built over the current top-K core-task papers, the original paper is assigned to a leaf with no direct siblings and no cousin branches under the same grandparent topic. In this retrieved landscape, it appears structurally isolated, which is one partial signal of novelty, but one still constrained by search coverage and taxonomy granularity.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Exchangeability of GNN node embeddings

[29] Sdgnn: Symmetry-preserving dual-stream graph neural networks

Cannot Refute

[30] E(n) equivariant graph neural networks

Cannot Refute

[31] Reconstruction for powerful graph representations

Cannot Refute

[32] Non-exchangeable conformal prediction for temporal graph neural networks

Cannot Refute

[33] Similarity-navigated conformal prediction for graph neural networks

Cannot Refute

[34] Conformal inductive graph neural networks

Cannot Refute

[35] Learning invariant graph representations for out-of-distribution generalization

Cannot Refute

[36] Non-Euclidean Spatial Graph Neural Network

Cannot Refute

[37] Representing long-range context for graph neural networks with global attention

Cannot Refute

[38] Universal Representation of Permutation-Invariant Functions on Vectors and Tensors

Cannot Refute

Contribution

Approximation of transportation-based graph similarity using Euclidean similarity

[9] Wasserstein task embedding for measuring task similarities

Cannot Refute

[10] A Wasserstein graph distance based on distributions of probabilistic node embeddings

Cannot Refute

[11] An optimal transport based embedding to quantify the distance between playing styles in collective sports

Cannot Refute

[12] Applications of no-collision transportation maps in manifold learning

Cannot Refute

[13] Unified route representation learning for multi-modal transportation recommendation with spatiotemporal pre-training

Cannot Refute

[14] An optimal transport-embedded similarity measure for diagnostic knowledge transferability analytics across machines

Cannot Refute

[15] Learning with similarity functions on graphs using matchings of geometric embeddings

Cannot Refute

[16] A squared-Euclidean distance location-allocation problem

Cannot Refute

[17] A Path Recommendation Method Based on the Siamese Graph Convolutional Network for the Holographic Counterpart of Consumer Electronics Logistics of the …

Cannot Refute

[18] Enabling location-based services–multi-graph representation of transportation networks

Cannot Refute

Contribution

Unified locality-sensitive hashing framework for graph retrieval

[25] Fast graph similarity search via locality sensitive hashing

Can Refute

[19] Locality sensitive hashing in fourier frequency domain for soft set containment search

Cannot Refute

[20] A tree locality-sensitive hash for secure software testing

Cannot Refute

[21] Locality Sensitive Hashing for Optimizing Subgraph Query Processing in Parallel Computing Systems

Cannot Refute

[22] Graph edit distance with general costs using neural set divergence

Cannot Refute

[23] Fast graph similarity search via hashing and its application on image retrieval

Cannot Refute

[24] Kam1n0: Mapreduce-based assembly clone search for reverse engineering

Cannot Refute

[26] Locality Sensitive Hashing for Data Placement to Optimize Parallel Subgraph Query Evaluation

Cannot Refute

[27] A symbol spotting approach in graphical documents by hashing serialized graphs

Cannot Refute

[28] Symbol spotting in graphical documents with serialized subgraph hashing

Cannot Refute
