arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

脑机接口 / BCI

脑机接口、EEG、神经信号解码、神经假体和脑控交互。

今日/当前日期收录 8 信号源:q-bio.NC, eess.SP, cs.LG, cs.HC, cs.RO

1. EEG解码 3 篇

2606.18816 2026-06-18 cs.HC cs.AI cs.ET 新提交 专题 95

SwitchBraidNet: Quantisation-Aware Lightweight Architecture for Hybrid Brain-Computer Interface

SwitchBraidNet: 面向混合脑机接口的量化感知轻量级架构

Gourav Siddhad, Yogesh Kumar Meena

发表机构 * Human-AI Interaction (HAIx) Lab, Indian Institute of Technology Gandhinagar(人类-人工智能交互实验室,印度理工学院甘地纳格尔)

专题命中 EEG解码 :混合BCI架构,解码MI和SSVEP信号

AI总结 提出SwitchBraidNet紧凑型EEG分类架构,采用双路径时间辫、自适应挤压激励空间开关和对数方差读出层,通过量化感知训练在OpenBMI数据集上实现高精度低功耗混合脑机接口解码,INT8模型仅3.03 KB。

Comments 6 pages, 5 figures, Preprint accepted at IEEE SMC 2026

详情
AI中文摘要

混合脑机接口(BCI)结合运动想象(MI)和稳态视觉诱发电位(SSVEP),提供高维神经解码,但通常超出嵌入式硬件的计算限制。为解决此问题,我们提出SwitchBraidNet,一种专为低功耗部署设计的紧凑型EEG分类架构。该模型采用双路径时间辫提取多尺度振荡特征,自适应挤压激励空间开关进行电极门控,以及对数方差读出层直接编码频带功率。此外,通过在OpenBMI数据集上进行系统量化感知训练,我们将SwitchBraidNet与四种基线方法在FP32、FP16和INT8精度下进行比较。实验结果表明其优越的效率和性能,在FP16下MI准确率达到69.49%,FP32下SSVEP准确率达到93.48%,FP16下混合信息传输率为64.82 bits/min。INT8模型仅占用3.03 KB,SwitchBraidNet在不同数值精度下保持高准确率,证明了其适用于低功耗嵌入式BCI部署。

英文摘要

Hybrid brain-computer interfaces (BCIs) that integrate motor imagery (MI) and steady-state visual evoked potentials (SSVEP) provide high-dimensional neural decoding but typically exceed the computational limits of embedded hardware. To address this, we propose SwitchBraidNet, a compact EEG classification architecture designed for low-power deployment. The model employs a dual-path temporal braid to extract multiscale oscillatory features, an adaptive squeeze-and-excitation spatial switch for electrode gating, and a log-variance readout layer for direct band-power encoding. Furthermore, through systematic quantisation-aware training on the OpenBMI dataset, we compared SwitchBraidNet against four established baselines across FP32, FP16, and INT8 precisions. Experimental results demonstrate superior efficiency and performance, achieving MI accuracy of 69.49% (FP16), SSVEP accuracy of 93.48% (FP32), and a hybrid information transfer rate of 64.82 bits/min (FP16). With an INT8 footprint of only 3.03 KB, SwitchBraidNet maintains high accuracy across varying numerical precisions, demonstrating its suitability for low-power embedded BCI deployment.

2606.19312 2026-06-18 cs.PL 新提交 专题 75

QDSV: A Semantic Problem Representation and Multi-Backend Execution Framework for Quantum-Oriented Computation

QDSV:面向量子计算的语义问题表示与多后端执行框架

Jaime Alexander Jimenez Lozano, Sebastian Jimenez Giraldo

专题命中 EEG解码 :量子计算框架用于EEG分类

AI总结 提出QDSV框架,通过语义表示分离问题规范与后端实现,支持非电路形式的执行模式,并在EEG分类案例中验证了跨模拟器和硬件的稳定执行。

Comments 12 pages, 1 figure, 6 tables

详情
AI中文摘要

基于状态空间的谓词计算将问题规范与实现它的后端分离。基于arXiv:2606.15027中引入的模型,本文研究QDSV作为面向量子计算的语义多后端执行框架。我们描述了QDSV、QIntent和Qruba如何将声明式问题意图连接到结构化语义表示,在异构后端约束下实现该表示,并报告分离模型级语义输出与后端特定观测的执行轨迹输出。该框架支持不需要原始问题以电路形式编写的执行模式,同时在需要时仍允许生成电路兼容的工件。作为案例研究,我们使用来自Bonn和Delhi数据集的预处理信号特征评估EEG发作期/发作间期分类。该研究比较了经典机器学习参考、电路优先变分量子分类器基线、QDSV模拟器执行以及受控的IBM量子硬件运行。本文不声称通用量子优势或优于经典机器学习。其贡献在于一种语义执行验证,展示了问题优先表示如何在模拟器和硬件实现中保持稳定,同时保留可解释的执行轨迹输出。

英文摘要

Predicate-based computation over state spaces separates a problem specification from the backend that realizes it. Building on the model introduced in arXiv:2606.15027, this paper studies QDSV as a semantic, multi-backend execution framework for quantum-oriented computation. We describe how QDSV, QIntent, and Qruba connect declarative problem intent to a structured semantic representation, realize that representation under heterogeneous backend constraints, and report execution trace outputs that separate model-level semantic outputs from backend-specific observations. The framework supports execution modes that do not require the original problem to be authored as a circuit, while still allowing circuit-compatible artifacts when required. As a case study, we evaluate EEG ictal/interictal classification using prepared signal features from the Bonn and Delhi datasets. The study compares classical machine-learning references, a circuit-first variational quantum classifier baseline, QDSV simulator executions, and controlled IBM Quantum hardware runs. The paper does not claim general quantum advantage or superiority over classical machine learning. Its contribution is a semantic execution validation showing how a problem-first representation can remain stable across simulator and hardware realizations while retaining interpretable execution trace outputs.

2606.19039 2026-06-18 cs.NE cs.LG cs.SD 新提交 专题 60

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

自适应语音到脉冲编码用于脉冲神经网络

Taharim Rahman Anon, Jakaria Islam Emon

发表机构 * PI LLC(1 PI LLC)

专题命中 EEG解码 :语音到脉冲编码,SNN,但非直接BCI。

AI总结 提出一种可学习的残差语音到脉冲编码器,与R-LIF骨干网络联合训练,在GSC-v2上达94.97%准确率,参数高效且学习任务对齐的脉冲表示。

Comments Accepted at Interspeech 2026. This version is a preprint

详情
AI中文摘要

连续声学信号与离散事件驱动处理之间的不匹配仍然是神经形态语音处理的基本瓶颈。当前系统通常依赖固定的脉冲编码器,迫使下游脉冲神经网络(SNN)补偿非自适应的输入表示。为了解决这个问题,我们提出了一种可学习的残差语音到脉冲编码器,与循环漏积分点火(R-LIF)骨干网络进行端到端联合训练。我们在Google Speech Commands v2(GSC-v2)基准上验证了该方法,达到了高达94.97%的准确率。值得注意的是,学习到的编码器仍然高度参数高效,其紧凑的35k参数变体达到了89.8%,匹配或超过了需要多一个数量级参数的先前基线。我们以编码器为中心的分析,包括线性探测和梯度残差检查,表明编码器并不追求忠实的信号重建,而是学习任务对齐的脉冲表示,增强了类别可分性。最后,我们通过比较直接反馈对齐(DFA)和替代梯度BPTT在相同架构和训练条件下的表现,对生物启发、硬件友好的信用分配进行了基准测试。我们发现DFA达到了91.5%的准确率,量化了生物启发学习规则在现代神经形态音频中的性能权衡。

英文摘要

The mismatch between continuous acoustic signals and discrete event-driven processing remains a fundamental bottleneck for neuromorphic speech processing. Current systems typically rely on fixed spike encoders, forcing downstream Spiking Neural Networks (SNNs) to compensate for non-adaptive input representations. To address this, we present a learnable residual speech-to-spike encoder jointly trained end-to-end with a Recurrent Leaky Integrate-and-Fire (R-LIF) backbone. We validate this approach on the Google Speech Commands v2 (GSC-v2) benchmark, achieving up to 94.97% accuracy. Notably, the learned encoder remains highly parameter-efficient with a compact 35k-parameter variant that reaches 89.8%, matching or exceeding prior baselines that require an order of magnitude more parameters. Our encoder-focused analysis, including linear probing and gradient-residual inspection, indicates that the encoder does not target faithful signal reconstruction but instead learns task-aligned spike representations that enhance class separability. Finally, we benchmark bio-inspired, hardware-friendly credit assignment by comparing Direct Feedback Alignment (DFA) with surrogate-gradient BPTT under identical architectures and training conditions. We find that DFA reaches 91.5% accuracy, quantifying the performance trade-off of bio-inspired learning rules for modern neuromorphic audio.

2. 神经假体 1 篇

2606.15091 2026-06-18 cs.HC cs.AI 新提交 专题 95

Sensory Restoration via Brain-Computer Interfaces: A Unified 2 x 2 Framework and Convergence Roadmap

通过脑机接口的感觉恢复:统一的2×2框架与融合路线图

Xuan-The Tran

发表机构 * School of Mechanical Engineering, Vietnam Maritime University(机械工程学院,越南海防大学)

专题命中 神经假体 :提出BCI框架用于感觉恢复

AI总结 本文提出一个统一的2×2框架,按侵入性和信号方向分类脑机接口,并定义恢复、替代和增强范式,同时给出近中长期的融合路线图。

详情
AI中文摘要

全球数百万个体因神经退行性疾病、中风或创伤而遭受感觉和沟通缺陷。脑机接口(BCI)为感觉和运动恢复提供了有希望的途径。然而,科学文献在侵入性神经假体和非侵入性电生理解码器之间高度碎片化,缺乏一致的术语和比较指标。本章提出了一个统一的2×2框架,沿两个轴对BCI进行分类:侵入性程度(侵入性与非侵入性)和信号方向(传入感觉-IN与传出感觉-OUT)。我们定义并区分了恢复、替代和增强的范式。此外,我们概述了一个结构化的路线图,用于在近期、中期和长期内这些模态的融合,重点关注物理限制和机器学习基础模型的整合作用。

英文摘要

Millions of individuals worldwide suffer from sensory and communication deficits caused by neurodegenerative diseases, stroke, or trauma. Brain-computer interfaces (BCIs) offer a promising avenue for sensory and motor restoration. However, the scientific literature remains highly fragmented between invasive neuroprosthetics and non-invasive electrophysiological decoders, with a lack of consistent terminology and comparison metrics. This chapter proposes a unified 2 x 2 framework categorizing BCIs along two axes: degree of invasiveness (invasive vs. non-invasive) and signal direction (afferent sensory-IN vs. efferent sensory-OUT). We define and distinguish the paradigms of restoration, substitution, and augmentation. Furthermore, we outline a structural roadmap for the convergence of these modalities over near-, medium-, and long-term horizons, focusing on physical limits and the integrative role of machine learning foundation models.

3. 神经信号处理 4 篇

2406.15537 2026-06-18 q-bio.NC cs.AI cs.SD eess.AS 专题 90

R&B -- Rhythm and Brain: Cross-subject Decoding of Music from Human Brain Activity

R&B -- 音乐与大脑:从人类脑活动交叉解码音乐

Matteo Ferrante, Matteo Ciferri, Nicola Toschi

发表机构 * Department of Biomedicine and Prevention University of Rome Tor Vergata(生物医学与预防系罗马大学托尔维加塔分校) A.A. Martinos Center for Biomedical Imaging Harvard Medical School/MGH, Boston (US)(A.A. Martinos生物医学成像中心哈佛医学院/马萨诸塞总医院,波士顿(美国))

专题命中 神经信号处理 :从fMRI脑活动解码音乐

AI总结 研究通过fMRI数据解码音乐,利用CLAP模型和voxel编码模型,实现跨被试音乐识别,提升音乐感知与情绪的神经基础理解。

Comments The first two authors contributed equally to this work

Journal ref Neural Networks, 203, 109195 (2026)

详情
AI中文摘要

音乐是一种普遍现象,深刻影响人类经验。本研究探讨是否能从功能性磁共振成像(fMRI)数据中解码音乐。利用最新大规模数据集和预训练计算模型,构建神经数据与音乐刺激潜在表示之间的映射。我们的方法整合功能和解剖对齐技术,以解决fMRI数据低时间分辨率和信噪比的问题。从GTZan fMRI数据集出发,五名受试者在听10种不同音乐流派的540个音乐刺激时记录脑活动。利用CLAP模型提取音乐刺激的潜在表示,并开发voxel编码模型以识别对这些刺激有反应的脑区。通过设置预测与实际脑活动之间的阈值,确定特定感兴趣区域(ROIs),这些区域可解释为音乐处理的关键参与者。我们的解码流程主要基于检索,使用线性映射将脑活动投影到对应的CLAP特征。这使我们能够预测并检索与fMRI数据起源最相似的音乐刺激。我们的结果展示了最先进的识别精度,方法显著优于现有方法。我们的发现表明,基于神经的音乐检索系统可能实现个性化推荐和治疗应用。未来工作可利用更高时间分辨率的神经成像和生成模型来提高解码精度,并探索音乐感知和情绪的神经基础。

英文摘要

Music is a universal phenomenon that profoundly influences human experiences across cultures. This study investigates whether music can be decoded from human brain activity measured with functional MRI (fMRI) during its perception. Leveraging recent advancements in extensive datasets and pre-trained computational models, we construct mappings between neural data and latent representations of musical stimuli. Our approach integrates functional and anatomical alignment techniques to facilitate cross-subject decoding, addressing the challenges posed by the low temporal resolution and signal-to-noise ratio (SNR) in fMRI data. Starting from the GTZan fMRI dataset, where five participants listened to 540 musical stimuli from 10 different genres while their brain activity was recorded, we used the CLAP (Contrastive Language-Audio Pretraining) model to extract latent representations of the musical stimuli and developed voxel-wise encoding models to identify brain regions responsive to these stimuli. By applying a threshold to the association between predicted and actual brain activity, we identified specific regions of interest (ROIs) which can be interpreted as key players in music processing. Our decoding pipeline, primarily retrieval-based, employs a linear map to project brain activity to the corresponding CLAP features. This enables us to predict and retrieve the musical stimuli most similar to those that originated the fMRI data. Our results demonstrate state-of-the-art identification accuracy, with our methods significantly outperforming existing approaches. Our findings suggest that neural-based music retrieval systems could enable personalized recommendations and therapeutic applications. Future work could use higher temporal resolution neuroimaging and generative models to improve decoding accuracy and explore the neural underpinnings of music perception and emotion.

2211.01960 2026-06-18 q-bio.NC cs.HC cs.LG 专题 85

FingerFlex: Inferring Finger Trajectories from ECoG signals

FingerFlex:从ECoG信号推断手指轨迹

Vladislav Lomtev, Alexander Kovalev, Alexey Timchenko

发表机构 * Bauman Moscow State Technical University(巴乌曼莫斯科国立技术大学) ALVI Labs(ALVI实验室) Brain Dynamics Group, Higher School of Economics(高等经济学院脑动力组) University of Tuebingen(图宾根大学)

专题命中 神经信号处理 :从ECoG信号推断手指轨迹,运动BCI。

AI总结 本文提出FingerFlex模型,通过卷积编码器-解码器架构实现对电极皮层脑数据中手指运动回归,达到0.74的相关系数,推动高精度运动皮层脑机接口发展。

Comments 6 pages, 3 figures, 4 tables. Preprint. Under review

Journal ref 10.1109/IEEECONF58974.2023.10405112

详情
AI中文摘要

运动脑机接口(BCI)的发展严重依赖于神经时间序列解码算法。近年来深度学习架构的进步使得自动特征选择能够近似数据中的高阶依赖关系。本文提出了FingerFlex模型——一种针对电极皮层脑数据中手指运动回归的卷积编码器-解码器架构。在公开的BCI竞赛IV数据集4上,取得了最先进的性能,真值与预测轨迹之间的相关系数高达0.74。所提出的方法为开发完全功能的高精度运动皮层脑机接口提供了机会。

英文摘要

Motor brain-computer interface (BCI) development relies critically on neural time series decoding algorithms. Recent advances in deep learning architectures allow for automatic feature selection to approximate higher-order dependencies in data. This article presents the FingerFlex model - a convolutional encoder-decoder architecture adapted for finger movement regression on electrocorticographic (ECoG) brain data. State-of-the-art performance was achieved on a publicly available BCI competition IV dataset 4 with a correlation coefficient between true and predicted trajectories up to 0.74. The presented method provides the opportunity for developing fully-functional high-precision cortical motor brain-computer interfaces.

2606.18667 2026-06-18 q-bio.NC q-bio.QM 新提交 专题 80

Can neurons speak? Semantic narration of vision at single-cell resolution

神经元能说话吗?单细胞分辨率的视觉语义叙述

Arnau Marin-Llobet, Richard Hakim, Sara Matias, Venkatesh N. Murthy, Na Li, Demba Ba

专题命中 神经信号处理 :解码神经元活动为自然语言,属于神经信号处理

AI总结 提出NEURRATOR框架,通过将神经元活动解码为自然语言描述,实现单细胞分辨率的视觉语义叙述,并用于量化解码保真度及解析单个神经元和特定细胞类型的功能贡献。

详情
AI中文摘要

识别高级视觉皮层中单个神经元编码的内容是一个开放问题。响应难以直观参数化,而用于替代的深度网络嵌入是黑箱。这里,我们介绍NEURRATOR,一个将尖峰活动解码为单神经元分辨率的自由形式自然语言叙述的框架。一个学习编码器将来自任意子集的同步记录神经元的尖峰序列映射到冻结CLIP的补丁嵌入空间,多模态语言模型和稀疏自编码器生成并验证描述,无需语言侧训练。应用于自然电影观看期间小鼠视觉皮层的Neuropixel记录,NEURRATOR从数千个神经元、单个皮层区域、局部群体或分子定义的细胞类型进行叙述。我们利用这一特性来(i)量化解码保真度如何随群体大小和皮层区域变化,以及(ii)用平实的语言“叙述”单个神经元和基因标记的抑制性细胞类型对视觉表征的贡献。这将细胞身份从分类目标重新定义为视觉系统的功能探针,为神经系统提供了一种新的生物学见解单位。

英文摘要

Identifying what individual neurons encode in higher-order visual cortex is an open problem. Responses resist intuitive parameterization, and the deep-network embeddings used in their place are black boxes. Here, we introduce NEURRATOR, a framework that decodes spiking activity into free-form natural-language narration of the viewed scene at single-neuron resolution. A learned encoder maps spike trains from arbitrary subsets of simultaneously-recorded neurons into the patch-embedding space of a frozen CLIP, from which a multimodal language model and sparse autoencoder generates and validates a description with no language-side training. Applied to Neuropixel recordings of mouse visual cortex during natural-movie viewing, NEURRATOR narrates from thousands of neurons, singular cortical regions, local populations, or from a molecularly-defined cell-types. We use this property to (i) quantify how decoding fidelity scales with population size and cortical region, and (ii) "neurrate", in plain language, what individual neurons and genetically-tagged inhibitory cell-types contribute to visual representation. This recasts cell identity from a classification target into a functional probe of the visual system, providing a new unit of biological insights in neural systems.

2606.19081 2026-06-18 q-bio.NC cs.HC 新提交 专题 80

Retrieval-Based Brain Decoding by Alignment, not Complexity

基于对齐而非复杂性的检索式脑解码

Matteo Ciferri, Matteo Ferrante, Nicola Toschi

专题命中 神经信号处理 :从fMRI活动解码脑信号

AI总结 本文通过跨多数据集实验证明,线性对比解码器在脑解码中优于岭回归和标准非线性方法,表明解码增益更多来自训练目标而非架构复杂性。

详情
AI中文摘要

认知科学中的一个著名理论认为,大脑中的概念被组织为高维向量,语义含义由该空间中的方向和相对角度捕获。脑解码是从神经活动中重建或检索刺激(或其表示)的努力,涉及找到一个近似大脑如何表示概念的函数。这激发了对对比目标作为逆转脑损失函数的生物合理候选者的研究。在这项工作中,我们研究了如何将功能磁共振成像(fMRI)活动与视觉、语言和音频基础模型的嵌入空间进行一般性映射。尽管神经计算在微观尺度上是高度非线性的,但fMRI测量平均了跨空间和时间的信号,并进一步被噪声平滑,从而有效地线性化了可观察的表示。与这些观点一致,我们在多个数据集上的实验表明,线性对比解码器始终优于岭回归和标准非线性替代方案,并且这些结果在图像、文本和声音中普遍适用。这些发现表明,解码增益更多地来自训练目标的选择而非架构复杂性,指向对比线性模型作为脑解码的原则性策略。

英文摘要

A prominent theory in cognitive science suggests that concepts in the brain are organized as high-dimensional vectors, with semantic meaning captured by directions and relative angles in this space. Brain decoding is the effort of reconstructing or retrieving stimuli (or their representations) from neural activity and involves finding a function that approximates how the brain represents concepts. This motivates the investigation of contrastive objectives as biologically plausible candidates to reverse the brain loss function. In this work, we study how functional MRI (fMRI) activity can generally be mapped with the embedding spaces of foundation models in vision, language, and audio. Although neural computations are highly non-linear at the microscale, fMRI measurements average signals across space and time, further smoothed by noise, effectively linearizing the observable representation. Consistent with these views, our experiments across multiple datasets demonstrate that linear contrastive decoders consistently outperform ridge regression and standard non-linear alternatives, and that these results generalize across images, text, and sound. These findings indicate that decoding gains arise more from the choice of training objective than from architectural complexity, pointing to contrastive-linear models as a principled strategy for brain decoding.