Estimating Dimensionality of Neural Representations from Finite Samples

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: dimensionality, estimator, neuroscience
Abstract:

The global dimensionality of a neural representation manifold provides rich insight into the computations underlying both artificial and biological neural networks. However, all existing measures of global dimensionality are sensitive to the number of samples, i.e., the number of rows and columns of the sample matrix. We show that, in particular, the participation ratio of eigenvalues, a popular measure of global dimensionality, is highly biased at small sample sizes, and we propose a bias-corrected estimator that remains accurate with finite samples and under noise. On synthetic data, we demonstrate that our estimator recovers the true, known dimensionality. We apply our estimator to neural recordings, including calcium imaging, electrophysiological recordings, and fMRI data, as well as to the activations of a large language model, and show that it is invariant to sample size. Finally, our estimators can also be used to measure the local dimensionality of curved neural manifolds by weighting the finite samples appropriately.
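For reference, the participation ratio discussed throughout this report is the standard eigenvalue-based dimensionality measure; the formula below is the textbook definition, not a quotation from the submission. Writing \( \lambda_1, \dots, \lambda_N \) for the eigenvalues of the covariance matrix \( \Sigma \) of the representation,

\[
\mathrm{PR}(\Sigma) \;=\; \frac{\big(\operatorname{tr}\Sigma\big)^{2}}{\operatorname{tr}\!\big(\Sigma^{2}\big)} \;=\; \frac{\big(\sum_{i=1}^{N}\lambda_i\big)^{2}}{\sum_{i=1}^{N}\lambda_i^{2}},
\]

which equals \( N \) for an isotropic spectrum and approaches 1 when variance concentrates along a single direction. Plug-in estimates of this quantity from a finite sample covariance are what the submission's correction targets.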

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a bias-corrected estimator for the participation ratio of eigenvalues to measure global dimensionality of neural representation manifolds from finite samples. It resides in the 'Finite-Sample Bias Correction Techniques' leaf, which contains only two papers total (including this one). This places the work in a relatively sparse research direction within the broader taxonomy of 16 papers across 13 leaf nodes. The sibling paper in this leaf also addresses finite-sample correction, suggesting this specific methodological niche—correcting bias in dimensionality measures under limited sampling—is not yet crowded but represents a recognized gap in the field.

The taxonomy tree reveals that neighboring leaves focus on general intrinsic dimension estimation approaches (three papers using nearest-neighbor and correlation-based techniques) and Bayesian nonparametric methods (one paper). These adjacent directions do not explicitly emphasize finite-sample bias correction, instead offering broader algorithmic frameworks. The paper's position bridges methodological development (Intrinsic Dimensionality Estimation Methods branch) with applications to both biological neural recordings and artificial neural networks, connecting to separate branches that examine dimensionality in biological systems (three papers across cortical, hippocampal, and multi-electrode studies) and artificial systems (two papers on deep network representations). This cross-branch applicability distinguishes the work from purely algorithmic or purely empirical studies.

Among the 23 candidates examined across the three contributions, none were found to clearly refute any contribution. For the bias-corrected participation ratio estimator, 3 candidates were examined with 0 refutable; for the noise correction method, 10 candidates with 0 refutable; and for the weighted local-dimensionality framework, 10 candidates with 0 refutable. This suggests that, within the limited search scope (top-K semantic matches plus citation expansion), no prior work was identified that directly anticipates the specific combination of bias correction, noise handling, and weighted local dimensionality estimation proposed here. The noise correction and weighted framework contributions, each examined against 10 candidates, appear particularly distinct from existing approaches in the sampled literature.

Based on the limited search of 23 candidates, the work appears to occupy a methodologically focused niche with modest prior coverage. The taxonomy structure confirms that finite-sample bias correction is an emerging rather than saturated direction, and the contribution-level statistics indicate no substantial overlap with examined prior work. However, this assessment reflects the scope of semantic search and citation expansion, not an exhaustive survey of all dimensionality estimation literature. The cross-applicability to both biological and artificial neural systems, demonstrated empirically, may represent a practical contribution beyond the core methodological novelty.

Taxonomy

Core-task taxonomy papers: 16
Claimed contributions: 3
Contribution candidate papers compared: 23
Refutable papers: 0

Research Landscape Overview

Core task: estimating global dimensionality of neural representation manifolds from finite samples. This field addresses a fundamental challenge in neuroscience and machine learning—determining the intrinsic dimensionality of high-dimensional neural activity or learned representations when only limited observations are available. The taxonomy reveals several complementary perspectives: one branch focuses on intrinsic dimensionality estimation methods themselves, developing algorithms that can handle finite-sample biases and adapt to local manifold structure (e.g., Intrinsic Dimension Undersampled Data[4], Manifold Adaptive Dimension Estimation[8]). Other branches examine neural representation dimensionality in biological systems (Dimensionality Manifold Neural Recordings[2], Dimensionality Inferotemporal Cortex[6]) and artificial systems (Intrinsic Dimensionality Image Representations[1], CNN Compression Intrinsic Dimension[13]), while theoretical frameworks (Multineuronal Dimensionality Theory[5], Statistical Neural Representations[9]) provide formal grounding. Additional branches cover manifold learning techniques for reconstruction (Manifold Flattening Reconstruction[7]) and domain-specific applications ranging from cognitive maps to chemical reaction coordinates.

A central tension across these branches concerns how to reliably estimate dimensionality when sample sizes are modest relative to ambient dimensionality—a ubiquitous constraint in neural recordings and representation analysis. Many studies grapple with finite-sample biases that can systematically overestimate or underestimate true dimensionality, particularly when data lie on curved or variable-density manifolds.

Estimating Dimensionality Finite Samples[0] sits squarely within the methodological branch addressing finite-sample bias correction techniques, closely aligned with work like Intrinsic Dimension Undersampled Data[4] that explicitly tackles undersampling challenges. Compared to broader manifold learning approaches (Geometric Nonlinear Manifold Clustering[3]) or domain-specific applications, this work emphasizes rigorous statistical correction to yield accurate global dimensionality estimates despite sampling limitations—a critical step for interpreting neural coding capacity and representational geometry across both biological and artificial systems.

Claimed Contributions

Bias-corrected estimator for participation ratio of eigenvalues

The authors derive an unbiased estimator of the participation ratio (PR) by correcting the finite-sample bias in both its numerator and denominator. The correction averages only over unequal sample indices, which removes the systematic bias that arises when computing global dimensionality from finite data matrices and makes the estimate robust to variations in sample size (see the sketch after this item).

3 retrieved papers
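As a rough illustration of the kind of correction described above, the following is a minimal sketch, not the authors' exact estimator: for i.i.d. samples, both the numerator (tr Σ)² and the denominator tr(Σ²) of the PR can be estimated by averaging products over pairs of distinct samples, avoiding the plug-in bias of the naive sample-covariance spectrum. The function names and the centering step are illustrative assumptions.

```python
import numpy as np

def naive_pr(X):
    """Plug-in participation ratio from the sample covariance (biased for small sample counts)."""
    C = np.cov(X, rowvar=False)               # X: (P samples, N features)
    evals = np.linalg.eigvalsh(C)
    return evals.sum() ** 2 / (evals ** 2).sum()

def unequal_index_pr(X):
    """Sketch of a bias-reduced PR: estimate (tr S)^2 and tr(S^2) by averaging
    only over pairs of distinct samples i != j (assumes roughly centered,
    i.i.d. samples; the submission's exact correction may differ)."""
    Xc = X - X.mean(axis=0, keepdims=True)     # crude centering, itself a small-sample approximation
    G = Xc @ Xc.T                              # pairwise inner products <x_i, x_j>, shape (P, P)
    P = Xc.shape[0]
    off = ~np.eye(P, dtype=bool)               # mask selecting unequal index pairs i != j
    tr_S2_hat = (G[off] ** 2).mean()           # E[<x_i, x_j>^2] = tr(S^2) for i != j
    trS_sq_hat = np.outer(np.diag(G), np.diag(G))[off].mean()  # E[<x_i,x_i><x_j,x_j>] = (tr S)^2
    return trS_sq_hat / tr_S2_hat
```

Note that a ratio of two unbiased estimates is not itself exactly unbiased, and subtracting the sample mean reintroduces a small bias; the submission's estimator presumably treats these terms more carefully.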
Noise correction method for dimensionality estimation

The authors present a method to correct the bias introduced by additive or multiplicative noise in dimensionality estimation, using two independent trials of the same stimuli and neurons. The approach requires only two trials and achieves a bias reduction of O(1/P + 1/Q), making it more efficient than naive averaging methods (see the sketch after this item).

10 retrieved papers
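A minimal sketch of the general two-repeat idea, assuming noise that is independent across the two presentations of the same stimuli (a standard cross-trial construction, not necessarily the authors' estimator): the cross-trial covariance has the signal covariance as its expectation, so independent noise drops out of the spectrum used for the PR.

```python
import numpy as np

def cross_trial_pr(X1, X2):
    """Sketch: participation ratio of the signal covariance estimated from two
    repeats X1, X2, each of shape (P stimuli, N neurons). Noise that is
    independent across repeats cancels from the cross-covariance in expectation."""
    X1c = X1 - X1.mean(axis=0, keepdims=True)
    X2c = X2 - X2.mean(axis=0, keepdims=True)
    P = X1c.shape[0]
    C_cross = X1c.T @ X2c / (P - 1)            # cross-trial covariance estimate, (N, N)
    C_sym = 0.5 * (C_cross + C_cross.T)        # symmetrize before taking the spectrum
    evals = np.linalg.eigvalsh(C_sym)
    return evals.sum() ** 2 / (evals ** 2).sum()
```

This plug-in version removes the noise contribution only in expectation and retains the finite-sample bias of the spectrum; per the abstract, the submission's estimator combines noise handling with the finite-sample correction, which this sketch does not attempt.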
Weighted dimensionality framework for local dimensionality estimation

The authors extend their framework to measure local (intrinsic) dimensionality by introducing sample-weighting schemes. This weighted approach enables estimation of dimensionality in local neighborhoods of a manifold and is robust to noise, unlike popular existing local dimensionality estimators such as TwoNN (see the sketch after this item).

10 retrieved papers
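A minimal sketch of sample weighting for a local estimate, assuming a Gaussian kernel centered on a reference point (the kernel choice and bandwidth parameter are illustrative, not the authors' weighting scheme): weight the samples by proximity to the reference point and compute the PR of the resulting weighted covariance, which probes dimensionality only within that neighborhood.

```python
import numpy as np

def local_weighted_pr(X, x0, bandwidth):
    """Sketch: local participation ratio around a reference point x0 using
    Gaussian sample weights (plug-in version; a bias-corrected variant would
    apply the unequal-index averaging to the weighted sums)."""
    d2 = ((X - x0) ** 2).sum(axis=1)           # squared distances of samples to x0
    w = np.exp(-0.5 * d2 / bandwidth ** 2)     # Gaussian kernel weights
    w /= w.sum()                               # normalize weights to sum to one
    mu = w @ X                                 # weighted mean, shape (N,)
    Xc = X - mu
    C = (Xc * w[:, None]).T @ Xc               # weighted covariance, shape (N, N)
    evals = np.linalg.eigvalsh(C)
    return evals.sum() ** 2 / (evals ** 2).sum()
```

Sweeping the reference point over the data while shrinking the bandwidth traces out how local dimensionality varies along a curved manifold.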

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Bias-corrected estimator for participation ratio of eigenvalues

The authors derive an unbiased estimator of the participation ratio (PR) by correcting the finite-sample bias in both its numerator and denominator. The correction averages only over unequal sample indices, which removes the systematic bias that arises when computing global dimensionality from finite data matrices and makes the estimate robust to variations in sample size.

Contribution

Noise correction method for dimensionality estimation

The authors present a method to correct the bias introduced by additive or multiplicative noise in dimensionality estimation, using two independent trials of the same stimuli and neurons. The approach requires only two trials and achieves a bias reduction of O(1/P + 1/Q), making it more efficient than naive averaging methods.

Contribution

Weighted dimensionality framework for local dimensionality estimation

The authors extend their framework to measure local (intrinsic) dimensionality by introducing sample-weighting schemes. This weighted approach enables estimation of dimensionality in local neighborhoods of a manifold and is robust to noise, unlike popular existing local dimensionality estimators such as TwoNN.