arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

脑机接口 / BCI

脑机接口、EEG、神经信号解码、神经假体和脑控交互。

今日/当前日期收录 4 信号源:q-bio.NC, eess.SP, cs.LG, cs.HC, cs.RO
2406.15537 2026-06-18 q-bio.NC cs.AI cs.SD eess.AS 90%

R&B -- Rhythm and Brain: Cross-subject Decoding of Music from Human Brain Activity

R&B -- 音乐与大脑:从人类脑活动交叉解码音乐

Matteo Ferrante, Matteo Ciferri, Nicola Toschi

发表机构 * Department of Biomedicine and Prevention University of Rome Tor Vergata(生物医学与预防系罗马大学托尔维加塔分校) A.A. Martinos Center for Biomedical Imaging Harvard Medical School/MGH, Boston (US)(A.A. Martinos生物医学成像中心哈佛医学院/马萨诸塞总医院,波士顿(美国))

专题命中 神经信号处理 :从fMRI脑活动解码音乐

AI总结 研究通过fMRI数据解码音乐,利用CLAP模型和voxel编码模型,实现跨被试音乐识别,提升音乐感知与情绪的神经基础理解。

Comments The first two authors contributed equally to this work

Journal ref Neural Networks, 203, 109195 (2026)

详情
AI中文摘要

音乐是一种普遍现象,深刻影响人类经验。本研究探讨是否能从功能性磁共振成像(fMRI)数据中解码音乐。利用最新大规模数据集和预训练计算模型,构建神经数据与音乐刺激潜在表示之间的映射。我们的方法整合功能和解剖对齐技术,以解决fMRI数据低时间分辨率和信噪比的问题。从GTZan fMRI数据集出发,五名受试者在听10种不同音乐流派的540个音乐刺激时记录脑活动。利用CLAP模型提取音乐刺激的潜在表示,并开发voxel编码模型以识别对这些刺激有反应的脑区。通过设置预测与实际脑活动之间的阈值,确定特定感兴趣区域(ROIs),这些区域可解释为音乐处理的关键参与者。我们的解码流程主要基于检索,使用线性映射将脑活动投影到对应的CLAP特征。这使我们能够预测并检索与fMRI数据起源最相似的音乐刺激。我们的结果展示了最先进的识别精度,方法显著优于现有方法。我们的发现表明,基于神经的音乐检索系统可能实现个性化推荐和治疗应用。未来工作可利用更高时间分辨率的神经成像和生成模型来提高解码精度,并探索音乐感知和情绪的神经基础。

英文摘要

Music is a universal phenomenon that profoundly influences human experiences across cultures. This study investigates whether music can be decoded from human brain activity measured with functional MRI (fMRI) during its perception. Leveraging recent advancements in extensive datasets and pre-trained computational models, we construct mappings between neural data and latent representations of musical stimuli. Our approach integrates functional and anatomical alignment techniques to facilitate cross-subject decoding, addressing the challenges posed by the low temporal resolution and signal-to-noise ratio (SNR) in fMRI data. Starting from the GTZan fMRI dataset, where five participants listened to 540 musical stimuli from 10 different genres while their brain activity was recorded, we used the CLAP (Contrastive Language-Audio Pretraining) model to extract latent representations of the musical stimuli and developed voxel-wise encoding models to identify brain regions responsive to these stimuli. By applying a threshold to the association between predicted and actual brain activity, we identified specific regions of interest (ROIs) which can be interpreted as key players in music processing. Our decoding pipeline, primarily retrieval-based, employs a linear map to project brain activity to the corresponding CLAP features. This enables us to predict and retrieve the musical stimuli most similar to those that originated the fMRI data. Our results demonstrate state-of-the-art identification accuracy, with our methods significantly outperforming existing approaches. Our findings suggest that neural-based music retrieval systems could enable personalized recommendations and therapeutic applications. Future work could use higher temporal resolution neuroimaging and generative models to improve decoding accuracy and explore the neural underpinnings of music perception and emotion.

2211.01960 2026-06-18 q-bio.NC cs.HC cs.LG 85%

FingerFlex: Inferring Finger Trajectories from ECoG signals

FingerFlex:从ECoG信号推断手指轨迹

Vladislav Lomtev, Alexander Kovalev, Alexey Timchenko

发表机构 * Bauman Moscow State Technical University(巴乌曼莫斯科国立技术大学) ALVI Labs(ALVI实验室) Brain Dynamics Group, Higher School of Economics(高等经济学院脑动力组) University of Tuebingen(图宾根大学)

专题命中 神经信号处理 :从ECoG信号推断手指轨迹,运动BCI。

AI总结 本文提出FingerFlex模型,通过卷积编码器-解码器架构实现对电极皮层脑数据中手指运动回归,达到0.74的相关系数,推动高精度运动皮层脑机接口发展。

Comments 6 pages, 3 figures, 4 tables. Preprint. Under review

Journal ref 10.1109/IEEECONF58974.2023.10405112

详情
AI中文摘要

运动脑机接口(BCI)的发展严重依赖于神经时间序列解码算法。近年来深度学习架构的进步使得自动特征选择能够近似数据中的高阶依赖关系。本文提出了FingerFlex模型——一种针对电极皮层脑数据中手指运动回归的卷积编码器-解码器架构。在公开的BCI竞赛IV数据集4上,取得了最先进的性能,真值与预测轨迹之间的相关系数高达0.74。所提出的方法为开发完全功能的高精度运动皮层脑机接口提供了机会。

英文摘要

Motor brain-computer interface (BCI) development relies critically on neural time series decoding algorithms. Recent advances in deep learning architectures allow for automatic feature selection to approximate higher-order dependencies in data. This article presents the FingerFlex model - a convolutional encoder-decoder architecture adapted for finger movement regression on electrocorticographic (ECoG) brain data. State-of-the-art performance was achieved on a publicly available BCI competition IV dataset 4 with a correlation coefficient between true and predicted trajectories up to 0.74. The presented method provides the opportunity for developing fully-functional high-precision cortical motor brain-computer interfaces.

2606.18667 2026-06-18 q-bio.NC q-bio.QM 新提交 80%

Can neurons speak? Semantic narration of vision at single-cell resolution

神经元能说话吗?单细胞分辨率的视觉语义叙述

Arnau Marin-Llobet, Richard Hakim, Sara Matias, Venkatesh N. Murthy, Na Li, Demba Ba

专题命中 神经信号处理 :解码神经元活动为自然语言,属于神经信号处理

AI总结 提出NEURRATOR框架,通过将神经元活动解码为自然语言描述,实现单细胞分辨率的视觉语义叙述,并用于量化解码保真度及解析单个神经元和特定细胞类型的功能贡献。

详情
AI中文摘要

识别高级视觉皮层中单个神经元编码的内容是一个开放问题。响应难以直观参数化,而用于替代的深度网络嵌入是黑箱。这里,我们介绍NEURRATOR,一个将尖峰活动解码为单神经元分辨率的自由形式自然语言叙述的框架。一个学习编码器将来自任意子集的同步记录神经元的尖峰序列映射到冻结CLIP的补丁嵌入空间,多模态语言模型和稀疏自编码器生成并验证描述,无需语言侧训练。应用于自然电影观看期间小鼠视觉皮层的Neuropixel记录,NEURRATOR从数千个神经元、单个皮层区域、局部群体或分子定义的细胞类型进行叙述。我们利用这一特性来(i)量化解码保真度如何随群体大小和皮层区域变化,以及(ii)用平实的语言“叙述”单个神经元和基因标记的抑制性细胞类型对视觉表征的贡献。这将细胞身份从分类目标重新定义为视觉系统的功能探针,为神经系统提供了一种新的生物学见解单位。

英文摘要

Identifying what individual neurons encode in higher-order visual cortex is an open problem. Responses resist intuitive parameterization, and the deep-network embeddings used in their place are black boxes. Here, we introduce NEURRATOR, a framework that decodes spiking activity into free-form natural-language narration of the viewed scene at single-neuron resolution. A learned encoder maps spike trains from arbitrary subsets of simultaneously-recorded neurons into the patch-embedding space of a frozen CLIP, from which a multimodal language model and sparse autoencoder generates and validates a description with no language-side training. Applied to Neuropixel recordings of mouse visual cortex during natural-movie viewing, NEURRATOR narrates from thousands of neurons, singular cortical regions, local populations, or from a molecularly-defined cell-types. We use this property to (i) quantify how decoding fidelity scales with population size and cortical region, and (ii) "neurrate", in plain language, what individual neurons and genetically-tagged inhibitory cell-types contribute to visual representation. This recasts cell identity from a classification target into a functional probe of the visual system, providing a new unit of biological insights in neural systems.

2606.19081 2026-06-18 q-bio.NC cs.HC 新提交 80%

Retrieval-Based Brain Decoding by Alignment, not Complexity

基于对齐而非复杂性的检索式脑解码

Matteo Ciferri, Matteo Ferrante, Nicola Toschi

专题命中 神经信号处理 :从fMRI活动解码脑信号

AI总结 本文通过跨多数据集实验证明,线性对比解码器在脑解码中优于岭回归和标准非线性方法,表明解码增益更多来自训练目标而非架构复杂性。

详情
AI中文摘要

认知科学中的一个著名理论认为,大脑中的概念被组织为高维向量,语义含义由该空间中的方向和相对角度捕获。脑解码是从神经活动中重建或检索刺激(或其表示)的努力,涉及找到一个近似大脑如何表示概念的函数。这激发了对对比目标作为逆转脑损失函数的生物合理候选者的研究。在这项工作中,我们研究了如何将功能磁共振成像(fMRI)活动与视觉、语言和音频基础模型的嵌入空间进行一般性映射。尽管神经计算在微观尺度上是高度非线性的,但fMRI测量平均了跨空间和时间的信号,并进一步被噪声平滑,从而有效地线性化了可观察的表示。与这些观点一致,我们在多个数据集上的实验表明,线性对比解码器始终优于岭回归和标准非线性替代方案,并且这些结果在图像、文本和声音中普遍适用。这些发现表明,解码增益更多地来自训练目标的选择而非架构复杂性,指向对比线性模型作为脑解码的原则性策略。

英文摘要

A prominent theory in cognitive science suggests that concepts in the brain are organized as high-dimensional vectors, with semantic meaning captured by directions and relative angles in this space. Brain decoding is the effort of reconstructing or retrieving stimuli (or their representations) from neural activity and involves finding a function that approximates how the brain represents concepts. This motivates the investigation of contrastive objectives as biologically plausible candidates to reverse the brain loss function. In this work, we study how functional MRI (fMRI) activity can generally be mapped with the embedding spaces of foundation models in vision, language, and audio. Although neural computations are highly non-linear at the microscale, fMRI measurements average signals across space and time, further smoothed by noise, effectively linearizing the observable representation. Consistent with these views, our experiments across multiple datasets demonstrate that linear contrastive decoders consistently outperform ridge regression and standard non-linear alternatives, and that these results generalize across images, text, and sound. These findings indicate that decoding gains arise more from the choice of training objective than from architectural complexity, pointing to contrastive-linear models as a principled strategy for brain decoding.