Pretraining with Re-parametrized Self-Attention: Unlocking Generalization in SNN-Based Neural Decoding Across Time, Brains, and Tasks

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: Brain-Machine Interface, Neural Spike Decoding, Spiking Neural Network, Foundation Model
Abstract:

The emergence of large-scale neural activity datasets provides new opportunities to enhance the generalization of neural decoding models. However, it remains a practical challenge to design neural decoders for fully implantable brain-machine interfaces (iBMIs) that achieve high accuracy, strong generalization, and low computational cost, all of which are essential for reliable, long-term deployment under strict power and hardware constraints. To address this, we propose the Re-parametrized self-Attention Spiking Neural Network (RAT SNN) with a cross-condition pretraining framework that integrates neural variability and adapts to stringent computational constraints. Specifically, our approach introduces multi-timescale dynamic spiking neurons to capture the complex temporal variability of neural activity. We also refine spike-driven attention within a lightweight, re-parameterized architecture that enables accumulate-only operations between spiking neurons without sacrificing decoding accuracy. Furthermore, we develop a stepwise training pipeline that systematically integrates neural variability across conditions, including neural temporal drift, subjects, and tasks. Building on these advances, we construct a pretrained model capable of rapid, high-performance generalization to unseen conditions. We demonstrate that RAT SNN consistently outperforms leading SNN baselines and matches the decoding accuracy of state-of-the-art artificial neural network (ANN) models at much lower computational cost, under both seen and unseen conditions across various datasets. Collectively, Pretrained-RAT SNN represents a high-performance, highly generalizable, and energy-efficient prototype of an SNN foundation model for fully implantable BMIs. Code is available at RAT SNN GitHub.

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces a Re-parametrized self-Attention Spiking Neural Network (RAT SNN) with cross-condition pretraining for neural decoding in implantable brain-machine interfaces. It resides in the Spiking Neural Network Decoders leaf, which contains nine papers—a moderately populated category within the broader Decoding Algorithms and Computational Methods branch. This positions the work in an active but not overcrowded research direction, where spiking architectures are explored for their event-driven efficiency and biological plausibility in BMI applications.

The taxonomy reveals that spiking decoders sit alongside Classical and Statistical Decoding Methods (four papers using Kalman filters and Bayesian approaches) and Deep Learning and Artificial Neural Network Decoders (five papers employing transformers and recurrent networks). Neighboring branches address Hardware Implementation and System Integration, including FPGA-Based Real-Time Decoding Systems and Low-Power Decoding ASICs, which share the paper's concern for computational constraints. The scope notes clarify that spiking methods emphasize event-driven computation, distinguishing them from standard backpropagation-trained networks and classical statistical models.

Among the twenty-two candidates examined across the three contributions, none were flagged as clearly refuting the proposed work. The Re-parametrized self-Attention SNN contribution was compared against ten candidates with zero refutable overlaps, as was the multi-timescale dynamic spiking neurons component; the cross-condition pretraining framework was reviewed against two candidates, also without refutation. This limited search scope, focused on top semantic matches rather than exhaustive coverage, suggests that within the examined literature the specific combination of re-parameterized attention, multi-timescale dynamics, and cross-condition pretraining appears distinct, though the analysis does not rule out relevant prior work beyond these twenty-two papers.

Based on the top twenty-two semantic matches, the work appears to occupy a recognizable niche within spiking neural network decoders, combining architectural innovations with a training pipeline tailored to neural variability. The absence of refutable candidates in this limited sample indicates that the specific technical choices may be novel, but the search scope leaves open the possibility of related approaches in the broader literature. The taxonomy context shows that spiking decoders remain an active area with ongoing exploration of efficiency-accuracy trade-offs.

Taxonomy

Core-task Taxonomy Papers: 50
Claimed Contributions: 3
Contribution Candidate Papers Compared: 22
Refutable Papers: 0

Research Landscape Overview

Core task: neural decoding from cortical spike trains for brain-machine interfaces.

The field organizes around several major branches that reflect both methodological diversity and application scope. Decoding Algorithms and Computational Methods encompasses a spectrum from classical statistical approaches to modern deep learning architectures and biologically inspired spiking neural networks, with works like Deep Neural Decoder[14] and motorSRNN[18] exemplifying data-driven strategies. Neural Signal Sources and Recording Modalities addresses the variety of input signals, ranging from single-unit spikes to local field potentials and multimodal recordings, while Hardware Implementation and System Integration focuses on real-time, power-efficient deployment using FPGAs and custom processors, as seen in Hardware Decoding Benchmarking[13] and RISC-V SNN Decoder[39]. Meanwhile, Decoding Targets and Application Domains spans motor control, speech synthesis, and cognitive state estimation, and Learning, Adaptation, and Generalization tackles challenges like calibration-free operation and continual learning across sessions. Clinical Translation and System Evaluation emphasizes long-term reliability and user-centered performance metrics.

Within the algorithmic landscape, a particularly active line of work explores spiking neural network decoders that leverage event-driven computation for efficiency and biological plausibility. Reparametrized SNN Decoding[0] sits squarely in this cluster, proposing novel training strategies to improve gradient flow and learning stability in SNNs applied to BMI tasks. This contrasts with earlier efforts like SNN Decoder BMI[30], which established foundational architectures, and complements recent hybrid approaches such as Hybrid Spiking Networks[42] that blend spiking and non-spiking components. Compared to purely deep learning methods like Neural Data Transformer[49], spiking decoders trade off representational flexibility for lower power consumption and closer alignment with neural dynamics. Open questions remain around how to best balance biological realism, computational efficiency, and decoding accuracy, especially as hardware platforms like FPGA Spiking Networks[33] enable increasingly sophisticated real-time implementations.

Claimed Contributions

Re-parametrized self-Attention Spiking Neural Network (RAT SNN)

The authors introduce RAT SNN, a lightweight spiking neural network architecture that integrates re-parameterized spike-driven self-attention with multi-timescale dynamics. The architecture maintains accumulate-only operations between spiking neurons while achieving high decoding accuracy for brain-machine interfaces (see the sketch below).

10 retrieved papers
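The accumulate-only claim can be made concrete with a minimal PyTorch sketch. This is illustrative only, not the authors' implementation: the class names, the threshold of 1.0, and the rectangular surrogate gradient are assumptions, and the training-time multi-branch structure that re-parameterization would merge at inference is omitted. The key point it demonstrates is that when Q, K, and V are binarized to {0, 1}, every matrix product reduces to a masked accumulation.

```python
import torch
import torch.nn as nn


class HeavisideSpike(torch.autograd.Function):
    """Binary spike activation with a rectangular surrogate gradient."""

    @staticmethod
    def forward(ctx, v):
        ctx.save_for_backward(v)
        return (v > 1.0).float()  # fire once the drive exceeds threshold 1.0

    @staticmethod
    def backward(ctx, grad_out):
        (v,) = ctx.saved_tensors
        # Pass gradients only in a window around the threshold.
        return grad_out * ((v - 1.0).abs() < 0.5).float()


class SpikeDrivenSelfAttention(nn.Module):
    """Single-head spike-driven self-attention (illustrative sketch).

    With binary {0, 1} inputs, every matrix product below is a masked
    accumulation: weights (or values) are summed wherever a spike is
    present, so no multiply-accumulate units are needed on hardware.
    """

    def __init__(self, dim: int, scale: float = 0.125):
        super().__init__()
        self.q_proj = nn.Linear(dim, dim, bias=False)
        self.k_proj = nn.Linear(dim, dim, bias=False)
        self.v_proj = nn.Linear(dim, dim, bias=False)
        self.scale = scale  # a scalar scale can fold into the spike threshold

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time_steps, dim) binary spikes from the previous layer.
        q = HeavisideSpike.apply(self.q_proj(x))
        k = HeavisideSpike.apply(self.k_proj(x))
        v = HeavisideSpike.apply(self.v_proj(x))
        scores = q @ k.transpose(-2, -1) * self.scale  # counts co-active channels
        return HeavisideSpike.apply(scores) @ v        # binary mask @ binary values
```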
Cross-condition pretraining framework with subject-specific batch normalization

The authors develop a stepwise training pipeline that systematically integrates neural variability across conditions, including temporal drift, subjects, and tasks. The framework uses subject-specific batch normalization to enable rapid generalization to unseen conditions (see the sketch below).

2 retrieved papers
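A minimal sketch follows, assuming the subject-specific batch normalization takes the common domain-specific-BN form: one BatchNorm layer per subject in front of a shared backbone. The class and method names below (including add_subject) are hypothetical, not the paper's API.

```python
import torch
import torch.nn as nn


class SubjectSpecificBatchNorm(nn.Module):
    """One BatchNorm1d per subject; backbone weights are shared elsewhere.

    Each subject keeps private normalization statistics, so the shared
    backbone can integrate variability across sessions, subjects, and
    tasks during pretraining without mixing per-subject scaling.
    """

    def __init__(self, num_features: int, num_subjects: int):
        super().__init__()
        self.norms = nn.ModuleList(
            nn.BatchNorm1d(num_features) for _ in range(num_subjects)
        )

    def forward(self, x: torch.Tensor, subject_id: int) -> torch.Tensor:
        # x: (batch, channels, time); route through the subject's own BN.
        return self.norms[subject_id](x)

    def add_subject(self) -> int:
        """Attach a fresh BN layer for an unseen subject; return its id."""
        self.norms.append(nn.BatchNorm1d(self.norms[0].num_features))
        return len(self.norms) - 1
```

Under this assumption, adapting to an unseen subject amounts to calling add_subject() and fitting only the new layer's statistics and affine parameters while the shared backbone stays frozen, which is consistent with the rapid-generalization claim.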
Multi-timescale dynamic spiking neurons with recurrent connections

The authors propose recurrently connected leaky integrate-and-fire neurons with dynamic synapses to capture multi-timescale temporal dynamics in neural activity, mimicking biological neural systems with both long-range projections and local microcircuits (see the sketch below).

10 retrieved papers
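A minimal sketch of a neuron model matching this description: recurrently connected leaky integrate-and-fire units with a current-based "dynamic" synapse state and per-neuron learnable decay constants, so different units integrate over different timescales. The sigmoid squashing, the soft reset, and all names are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn


def spike_fn(v: torch.Tensor) -> torch.Tensor:
    # Hard threshold; a surrogate gradient would replace this in training.
    return (v > 1.0).float()


class MultiTimescaleRecurrentLIF(nn.Module):
    """Recurrent LIF layer with per-neuron learnable decay constants."""

    def __init__(self, n_in: int, n_hidden: int):
        super().__init__()
        self.w_in = nn.Linear(n_in, n_hidden, bias=False)      # long-range input
        self.w_rec = nn.Linear(n_hidden, n_hidden, bias=False)  # local microcircuit
        # Unconstrained parameters squashed to (0, 1) per neuron.
        self.alpha = nn.Parameter(torch.randn(n_hidden))  # synaptic timescale
        self.beta = nn.Parameter(torch.randn(n_hidden))   # membrane timescale

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, n_in) binned input spike counts.
        batch, steps, _ = x.shape
        n = self.w_rec.in_features
        i = torch.zeros(batch, n, device=x.device)  # synaptic current state
        v = torch.zeros(batch, n, device=x.device)  # membrane potential
        s = torch.zeros(batch, n, device=x.device)  # output spikes
        out = []
        for t in range(steps):
            a, b = torch.sigmoid(self.alpha), torch.sigmoid(self.beta)
            # Dynamic synapse: decaying current collects feedforward + recurrent drive.
            i = a * i + self.w_in(x[:, t]) + self.w_rec(s)
            # Leaky membrane with soft reset after each emitted spike.
            v = b * v + i - s
            s = spike_fn(v)
            out.append(s)
        return torch.stack(out, dim=1)
```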

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Re-parametrized self-Attention Spiking Neural Network (RAT SNN)

Contribution

Cross-condition pretraining framework with subject-specific batch normalization

Contribution

Multi-timescale dynamic spiking neurons with recurrent connections