The Expressive Limits of Diagonal SSMs for State-Tracking
Overview
Overall Novelty Assessment
The paper characterizes the expressivity of diagonal complex-valued state-space models (DCD SSMs) for group state-tracking tasks, proving that single-layer models can track Abelian groups but not non-Abelian groups at finite precision. It resides in the 'Diagonal SSM Expressivity Bounds' leaf under 'Theoretical Expressivity Analysis', sharing this leaf with one sibling paper. This represents a relatively sparse research direction within the taxonomy, which contains only seven total papers across six leaf nodes, suggesting the paper addresses a focused theoretical question in an emerging subfield.
The taxonomy reveals that neighboring work diverges into architectural enhancements (dense parameterizations, structured sparsity) and domain-specific applications (geometric SSMs, temporal graphs). The sibling paper in the same leaf likely explores related diagonal expressivity questions, while the adjacent 'Eigenvalue-Based Expressivity Mechanisms' leaf examines spectral properties as a complementary theoretical lens. The paper's focus on group-theoretic characterizations distinguishes it from architectural modifications that relax diagonal constraints, positioning it as foundational theory rather than applied methodology.
Among fourteen candidates examined, the first contribution (single-layer expressivity) showed no refutable prior work across four candidates, suggesting novelty in the Abelian/non-Abelian dichotomy result. The second contribution (multi-layer subnormal series characterization) had zero candidates examined, indicating limited direct precedent. The third contribution (learnability gap) examined ten candidates with three appearing to provide overlapping empirical observations, suggesting this aspect has more substantial prior exploration within the limited search scope. The theoretical contributions appear more distinctive than the empirical learnability findings.
Based on the limited search of fourteen semantically similar papers, the theoretical characterizations appear relatively novel within the examined scope, particularly the group-theoretic framework for multi-layer models. However, the learnability gap observation aligns with existing work on expressivity-trainability mismatches. The analysis does not cover exhaustive literature review or broader SSM theory beyond the top-K semantic matches and their citations.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors prove that a single-layer input-dependent complex-valued diagonal SSM can track a group at finite precision if and only if that group is Abelian. This establishes a fundamental limitation of single-layer diagonal SSMs on non-commutative group operations.
The authors establish that k-layer diagonal SSMs can track exactly those groups admitting a subnormal series with Abelian factors of length at most k. This precisely identifies the expressive capacity of multi-layer diagonal SSMs within the class of solvable groups.
The authors empirically show that even when multi-layer diagonal SSMs are theoretically capable of tracking non-Abelian groups, standard gradient-based training often fails to discover generalizable solutions. This reveals a practical limitation beyond theoretical expressivity.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[1] Theoretical foundations of deep selective state-space models PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
Expressivity characterization of single-layer diagonal SSMs for group state-tracking
The authors prove that a single-layer input-dependent complex-valued diagonal SSM can track a group at finite precision if and only if that group is Abelian. This establishes a fundamental limitation of single-layer diagonal SSMs on non-commutative group operations.
[2] Fixed-point rnns: From diagonal to dense in a few iterations PDF
[5] Fixed-Point RNNs: Interpolating from Diagonal to Dense PDF
[7] Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models PDF
[8] State space grids PDF
Expressivity characterization of multi-layer diagonal SSMs via subnormal series
The authors establish that k-layer diagonal SSMs can track exactly those groups admitting a subnormal series with Abelian factors of length at most k. This precisely identifies the expressive capacity of multi-layer diagonal SSMs within the class of solvable groups.
Demonstration of learnability gap between expressivity and trainability
The authors empirically show that even when multi-layer diagonal SSMs are theoretically capable of tracking non-Abelian groups, standard gradient-based training often fails to discover generalizable solutions. This reveals a practical limitation beyond theoretical expressivity.