arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.02408 2026-06-02 cs.CC q-bio.QM

Structure-Informed Multiple Sequence Alignment: A Formal Model and Hardness Results

结构信息引导的多序列比对:形式化模型与困难性结果

Yoshiki Kanazawa, Naphan Benchasattabuse, Michal Hajdušek, Rodney Van Meter

AI总结 本文提出一种结构信息引导的多序列比对问题MSA-S,通过形式化模型证明其判定问题NP完全,且优化问题无多项式时间近似方案。

详情
AI中文摘要

我们形式化了一个结构信息引导的多序列比对问题,记为MSA-S。该模型将生物序列抽象为字符串,结构信息抽象为指定的位置对。它通过一个固定的非空位符号对评分规则和固定的仿射空位罚分定义了一个固定的成对字符串得分,并在指定的位置对上增加了一个二元重叠得分,在结构应用中可解释为接触图重叠得分。这产生了一个固定得分、整数值的优化模型,适合复杂性理论分析。在此形式化下,我们证明对于一大类固定的成对字符串评分方案,判定问题MSA-S-DEC是NP完全的。我们还证明,即使在每个指定位置对集合非空且位置对重叠阈值严格为正的限制下,NP困难性仍然存在。对于关联的标量优化问题MSA-S-OPT(λ),其中λ≥1为任意固定有理常数,我们进一步证明,在非空位符号对评分规则的规范单位方案下,即使对于两个输入字符串(k=2),MSA-S-OPT(λ)也不存在多项式时间近似方案(PTAS),除非P=NP。这些结果为结构信息引导的多序列比对建立了形式化的复杂性理论基线。

英文摘要

We formulate a structure-informed multiple sequence alignment problem, denoted MSA-S. The model abstracts biological sequences as strings and structural information as designated position-pairs. It augments a fixed pairwise string score, defined by a fixed non-gap symbol-pair scoring rule and fixed affine gap penalties, with a binary overlap score on designated position-pairs, which can be interpreted as a contact-map overlap score in structural applications. This yields a fixed-score, integer-valued optimization model suitable for complexity-theoretic analysis. Under this formulation, we show that the decision problem MSA-S-DEC is NP-complete for a broad class of fixed pairwise string scoring schemes. We also show that NP-hardness persists even under the restriction that every designated position-pair set is nonempty and the pair-overlap threshold is strictly positive. For the associated scalarized optimization problem MSA-S-OPT(lambda) with any fixed rational constant lambda >= 1, we further show that, under the canonical unit scheme for the non-gap symbol-pair scoring rule, MSA-S-OPT(lambda) admits no polynomial-time approximation scheme (PTAS) even for two input strings (k = 2), unless P = NP. These results establish a formal complexity-theoretic baseline for structure-informed multiple sequence alignment.

2606.02392 2026-06-02 cs.SI cs.LO q-bio.NC

Topology as Logic: Structural Role Geometry Across Formal, Software, Biological, and Prebiotic Systems

拓扑即逻辑:跨越形式系统、软件、生物和生命起源系统的结构角色几何

Vladi Ivanov

AI总结 通过多层网络分析,研究依赖拓扑是否作为可恢复的几何结构(功能邻近律下的枢纽持久性和秩散度)与功能承载组织相关,并在七个独立基底上验证了该假设。

详情
Comments
7 pages, 1 table. Version 1. Seven pre-registered experiments across digital circuits, formal mathematics (Lean 4 and Coq), legacy COBOL, neural connectomics (C. elegans to Drosophila, ~600 Myr), and prebiotic chemistry. Companion paper: arXiv:2604.23639. Pre-registration: https://github.com/vladi160/preregistrations. Zenodo: https://doi.org/10.5281/zenodo.20489745
AI中文摘要

我们探究依赖拓扑是否与功能承载组织相关,作为可恢复的几何结构——不是隐喻,而是通过多层网络分析可检测的可测量结构属性。在七个独立基底上,我们展示了功能邻近律下的枢纽持久性和秩散度恢复了领域专家描述为逻辑的操作组织:形式数学中的公理承载结构、遗留软件中的控制和契约结构、约6亿年神经进化中保守的枢纽语法、已发表的生命起源前自催化网络中的催化角色组织、4位数字电路中的进位路径主导性、ISCAS85 c432标准基准(n=196)中的介数持久性,以及Coq Corelib(n=17)中的定向形式系统复制。一个关键方法论发现:基于度的枢纽持久性在物理布线和模拟状态相关层之间较弱(c432中r=0.21),而基于介数的持久性更强(4位ALU事后分析中r=0.77;c432中r=0.34)。ISCAS85预注册的主要假设被确认(度r=0.426,p=0.002,Spearman r=0.551)。形式系统的声明由两个证明助手语料库支持:Lean 4 mathlib4(确认,r=0.777,p=0.004)和Coq Corelib(部分,方向确认,r=0.288,p=0.287,n=17,统计功效不足)。所有七个实验在分析前均已预注册。

英文摘要

We ask whether dependency topology correlates with functional load-bearing organization as recoverable geometry -- not as a metaphor, but as a measurable structural property detectable by multilayer network analysis. Across seven independent substrates, we show that hub persistence and rank divergence under the Functional Proximity Law recover operational organization that domain experts describe as logic: axiomatic load-bearing structure in formal mathematics, control and contract structure in legacy software, conserved hub grammar across approx. 600 million years of neural evolution, catalytic role organization in a published prebiotic autocatalytic network, carry-path dominance in a 4-bit digital circuit, betweenness persistence in the ISCAS85 c432 standard benchmark (n=196), and a directional formal-systems replication in the Coq Corelib (n=17). A key methodological finding: degree-based hub persistence is weak between physical wiring and simulation state-correlation layers (r=0.21 in c432), while betweenness-based persistence is stronger (r=0.77 in the 4-bit ALU post-hoc; r=0.34 in c432). The ISCAS85 pre-registered primary hypothesis was CONFIRMED (degree r=0.426, p=0.002, Spearman r=0.551). The formal-systems claim is supported by two proof-assistant corpora: Lean 4 mathlib4 (CONFIRMED, r=0.777, p=0.004) and Coq Corelib (PARTIAL, direction confirmed, r=0.288, p=0.287, n=17, underpowered). All seven experiments were pre-registered before analysis.

2606.02385 2026-06-02 q-bio.NC cs.LG

How Optimality Structures Sparse Dictionaries: A Theory for Understanding SAE Representations

最优性如何结构化稀疏字典:理解SAE表示的理论

William Dorrell

AI总结 本文通过扩展局部最优性分析到非负联合优化问题,推导出稀疏自编码器(SAE)最优特征与数据分布之间的约束,解释了层级分裂与吸收、残差结构和密集对映特征等行为,并构建了新型大字典凸问题以探索宽原子-数据点极限。

详情
Comments
27 pages, 5 figures
AI中文摘要

稀疏自编码器(SAE)已成功将神经表示解析为可解释的概念,为理解和控制提供了基础。然而,SAE究竟提取了什么,以及我们据此能得出哪些科学结论,并不明显。经验上,证据在于结果:SAE学习了可解释的特征。理论上,我们缺乏一个清晰的解释,说明一个“概念”必须满足什么属性才能被SAE提取。已有大量可识别性工作研究稀疏编码恢复真实特征的条件,但这些方法往往关注简单的数据生成模型(如稀疏独立特征),这些模型难以近似SAE所训练的、吞噬互联网的语言模型表示。在此,我们避免数据生成模型,仅询问任何字典学习最优解必须满足什么属性。具体地,我们将局部最优性分析(Gribonval & Schnass, 2010)扩展到普通SAE近似的非负联合优化问题,并推导出最优SAE特征与其分布之间的约束。我们利用这些约束解释了一系列观察到的SAE行为——层级分裂与吸收、残差结构以及密集对映特征——每个都反映了L1+非负性如何与数据交互以结构化最优字典。最后,我们构建了一个新颖的大字典凸问题,并探索了宽原子-数据点极限。总之,我们希望将模型假设与意外观察区分开,从而从SAE的成功中学到更多,并为设计其继任者提供原则。

英文摘要

Sparse Autoencoders (SAEs) have found success parsing neural representations into interpretable concepts, providing a basis for understanding and control. However, what exactly SAEs extract, and, correspondingly, the scientific conclusions we can draw from them, are not obvious. Empirically, the proof is in the pudding: SAEs learn interpretable features. Theoretically, we lack a clear account of what properties a 'concept' must satisfy for an SAE to extract it. There has been extensive identifiability work studying the conditions under which sparse coding recovers ground-truth features; however, these approaches tends to focus on simple data-generating models (e.g. sparse independent features) which poorly approximate the internet-swallowing language-model representations on which SAEs are trained. Here, avoiding data-generating models, we ask simply what properties any dictionary learning optimum must satisfy. Concretely, we extend local optimality analyses (Gribonval & Schnass, 2010) to the nonnegative joint-optimisation problem that vanilla SAEs approximate, and derive constraints relating optimal SAE features to their distributions. We use these constraints to explain a range of observed SAE behaviours - hierarchical splitting & absorption, the structure of residuals, and dense antipodal features - each reflecting how L1+nonnegativity interact with data to structure optimal dictionaries. Finally, we construct a novel large-dictionary convex problem and explore the wide atom-per-datapoint limit. In sum, we hope to tease model assumptions from unexpected observations, letting us learn more from SAEs' successes and provide principles for designing their successors.

2606.02305 2026-06-02 q-bio.NC cs.HC

Mapping Whisper Representations to Human ECoG Responses with Interpretable Time-Resolved Neural Encoding

将Whisper表示映射到人类ECoG响应:可解释的时间分辨神经编码

Matteo Ciferri, Tommaso Boccato, Michal Olak, Matteo Ferrante, Nicola Toschi

AI总结 通过时间分辨神经编码器结合语音嵌入与循环时间模型及软注意力,研究Whisper内部表示如何预测颅内ECoG响应,发现中间层与神经活动对应最强,且高分辨率ECoG受益于时间结构化建模。

详情
Comments
Presented at ICLR 2026 Workshop on Representational Alignment (Re-Align)
AI中文摘要

理解语音基础模型如何与人类皮层活动相关是计算神经科学的一个关键挑战。在这里,我们研究了Whisper的内部表示如何在自然语音感知过程中预测颅内ECoG响应。我们引入了一个时间分辨神经编码器,将语音嵌入与循环时间模型和软注意力相结合,从而能够检查逐层的大脑对齐。中间Whisper层与神经活动提供了最强的对应关系,支持模型表示与皮层语音处理之间的层次匹配。与基线的比较表明,高分辨率ECoG响应受益于超出相同语音表示的线性映射的时间结构化建模。此外,注意力图揭示了语音嵌入与神经响应之间的时间局部对齐,而音位可解释性分析在编码信息电极中识别出解剖学上一致的音位类别组织。这些结果共同表明,语音基础模型为研究时间分辨的皮层语音表示提供了一个有用的框架。

英文摘要

Understanding how speech foundation models relate to human cortical activity is a key challenge for computational neuroscience. Here, we investigate how internal representations from Whisper predict intracranial ECoG responses during naturalistic speech perception. We introduce a time-resolved neural encoder that combines speech embeddings with a recurrent temporal model and soft attention, allowing us to examine layer-wise brain alignment. Intermediate Whisper layers provide the strongest correspondence with neural activity, supporting a hierarchical match between model representations and cortical speech processing. Comparisons with baselines show that high-resolution ECoG responses benefit from temporally structured modelling beyond linear mappings from the same speech representations. In addition, attention maps reveal temporally local alignment between speech embeddings and neural responses, while a phonemic interpretability analysis identifies anatomically coherent phoneme-category organization among encoding-informative electrodes. Together, these results suggest that speech foundation models offer a useful framework for studying time-resolved cortical speech representations.

2606.02121 2026-06-02 q-bio.NC

What biology can, and cannot, tell us about conscious AI

生物学能告诉我们关于有意识的人工智能什么,以及不能告诉我们什么

Ulysse Klatzmann, Adrien Doerig

AI总结 本文区分了两种类型的生物自然主义,指出类型A不可检验,类型B可检验且与计算功能主义兼容,强调生物学可指导但无法解决意识与信息处理的关系。

详情
AI中文摘要

人工智能的进展正在将机器意识从哲学好奇心转变为社会问题,并引发了对广泛的计算功能主义框架的批评。生物自然主义声称生物学而非计算对意识至关重要。我们讨论哪些形式的生物自然主义在经验上是可检验的。对于类型A生物自然主义,生物学本质上对意识重要,但不提供独特的信息处理能力。我们认为,类似于展开论证,这将意识与行为分离,使得类型A生物自然主义不可检验。对于类型B生物自然主义,生物学之所以重要,是因为它提供了独特的信息处理能力。类型B生物自然主义是可检验的,并且与计算功能主义不矛盾。两者面临相同的任务:将意识与信息处理联系起来。生物学可以在这项探索中作为指导,但不能作为解决方案。

英文摘要

Progress in AI is turning machine consciousness from a philosophical curiosity into a societal issue, and has led to criticism of the widespread computational functionalism framework. Biological Naturalism (BN) claims that biology, not computation, is crucial for consciousness. We discuss which forms of BN are empirically testable. For Type-A-BN, biology intrinsically matters for consciousness, without affording unique information processing capabilities. We argue, similarly to the unfolding argument, that this dissociates consciousness from behaviour, making Type-A-BN untestable. For Type-B-BN, biology matters because it affords unique information processing capabilities. Type-B-BN is testable, and not incompatible with computational functionalism. Both face the same task: relating consciousness to information processing. Biology can act as a guide on this quest, but not as a solution.

2606.02099 2026-06-02 q-bio.NC

Unveiling the shared grey matter signature between Alzheimer's and Parkinson's Disease

揭示阿尔茨海默病与帕金森病共享的灰质特征

Vishaak Gangasandra, Finn Blain, Elise Delzant, Michelle Lupton, Miguel Rentería, Sarah Medland, Baptiste Couvy-duchesne

AI总结 本研究利用大型MRI数据集和高分辨率脑图,首次量化了阿尔茨海默病与帕金森病在顶点水平的灰质关联,发现两者存在显著的正相关(rGM=0.24),并定位了9个贡献显著的顶点簇,提示共享的神经解剖特征在神经退行性早期出现。

详情
AI中文摘要

引言:本研究首次利用从大型MRI数据集聚合的高分辨率脑图,量化了阿尔茨海默病(AD)与帕金森病(PD)在顶点水平的灰质关联,旨在识别两种疾病之间的共享神经解剖特征。方法:利用一种从遗传相关性分析改编的新颖统计框架(SumR2回归),我们估计了AD与PD之间的共享神经解剖特征(灰质相关性:rGM)。结果:在AD与PD之间观察到显著的全脑正向灰质相关性(rGM=0.24,95%CI 0.20-0.28)。该相关性在疾病各阶段均被观察到,并利用英国生物银行数据进行了重复。我们定位了9个顶点簇(106个顶点)对显著的rGM有贡献,突出显示双侧壳核和右侧伏隔核的厚度减少与AD和PD均相关。讨论:我们的发现表明,共享的神经解剖特征在神经退行性早期出现,对早期筛查、疾病监测和靶向干预具有重要意义。数据来自帕金森病进展标志物倡议(PPMI)数据库(www.ppmi-info.org/access-)。

英文摘要

INTRODUCTION. This study presents the first quantification of vertex-level grey-matter associations between Alzheimer's disease (AD) and Parkinson's disease (PD) using highresolution brain maps aggregated from large MRI datasets. The aim is to identify shared neuroanatomical signatures between the two diseases. METHODS. Leveraging a novel statistical framework (SumR2 regression), adapted from genetic correlation analysis, we estimated the shared neuroanatomical signature (grey-matter correlation: rGM) between AD and PD. RESULTS. A significant positive brain-wide grey-matter correlation (rGM=0.24, 95%CI 0.20-0.28) was observed between AD and PD. This correlation was further observed across disease stages and replicated using UK Biobank data. We located 9 vertex-wise clusters (106 vertices) that contribute to the significant rGM, highlighting reduced thickness in the bilateral putamen and right accumbens as associated with both AD and PD. DISCUSSION. Our findings suggest that shared neuroanatomical features emerge early in neurodegeneration and have implications for early screening, disease monitoring, and targeted interventions. from the Parkinson's Progression Markers Initiative (PPMI) database (www.ppmi-info.org/access-

2606.01841 2026-06-02 q-bio.NC

The Neuromorphic Supremacy

神经形态至上

Yuliya Tsybina, Ivan Y. Tyukin, Alexander N. Gorban, Victor Kazantsev, Dianhui Wang, Susanna Gordleeva

AI总结 通过在传统人工神经网络中嵌入星形胶质细胞调节和脉冲动力学等神经形态电路,实现少样本高精度学习并在噪声下保持高性能,提出“神经形态至上”概念。

详情
AI中文摘要

活体神经系统展现出从极少量样本中学习新行为和模式的卓越能力,并且已知在严重感官噪声下也能稳健运行。然而,这些能力对于现代人工神经网络(包括深度学习模型)来说仍然遥不可及。我们证明,通过将新颖的、真正的神经形态电路嵌入传统人工神经网络架构中,可以弥合这一差距。这些电路包含生物神经结构固有的星形胶质细胞调节和脉冲动力学。在代表不同复杂度任务的标准基准测试中,混合模型从每类少量训练样本中实现了高精度,并在导致标准模型性能崩溃的遮挡和脉冲噪声下保持高性能。我们将这种现象称为神经形态至上——一种基于神经生物学的架构决定性超越经典深度学习的机制,为在噪声、数据稀缺环境中运行的具身AI系统的感知提供了原则性基础。

英文摘要

Live neural systems demonstrate remarkable capabilities to learn new behavior and patterns from mere few examples and are known to operate robustly under severe sensory noise. These capabilities, however, remain largely out of reach for modern artificial neural networks, including deep learning models. We show that this gap can be bridged by embedding novel genuine neuromorphic circuits into conventional artificial neural network architectures. These circuits comprise astrocytic modulation and spiking dynamics inherent to biological neural structures. Tested across standard benchmarks representing tasks of varying complexity, the hybrid models achieve high accuracy from few training examples per class and sustain high performance under occlusion and impulse noise that cause performance collapse in standard models without neuromorphic adaptation. We term this phenomenon neuromorphic supremacy - a regime in which architectures grounded in neurobiology decisively outperform classical deep learning, pointing toward a principled foundation for perception in embodied AI systems operating in noisy, data-scarce environments.

2606.01816 2026-06-02 q-bio.BM cs.LG

Site4Drug: Predicting Drug-Binding Target Sites with an AI Agent

Site4Drug: 利用AI智能体预测药物结合靶点

Taehan Kim, Sarrah Rose Mikhail Leung, Bharat Mekala, Jeongbin Park

AI总结 提出Site4Drug,一种模态感知的靶点发现智能体,通过整合拓扑、亲水性、翻译后修饰等证据,输出带约束、风险标记和决策日志的可靶向区域排名列表,并自动推荐结合模态。

详情
Comments
Accepted to the ICML 2026 Workshop on Generative and Agentic AI for Biology (GenBio)
AI中文摘要

选择在蛋白质上的干预位置(即选择可靶向位点)通常比选择结合物更模糊且更容易失败,尤其是对于膜蛋白,其可及性、拓扑和翻译后修饰(PTMs)限制了可作用区域。我们提出Site4Drug,一种模态感知的位点发现智能体,输出带有显式约束、证据摘要、风险标记和可追溯决策日志的可靶向区域排名列表。Site4Drug无需用户预先指定药物模态,而是利用与位点发现相同的证据(包括拓扑、亲水性、PTM倾向、二硫键、结构域背景和序列)推荐结合模态(例如抗体/肽类 vs 小分子)。重要的是,这些证据一致地应用于所有模态,包括小分子口袋发现,以避免选择化学上可行但生物学上被遮蔽的位点。

英文摘要

Selecting where to intervene on a protein (i.e., choosing a targetable site) is often a more ambiguous and failure-prone bottleneck than selecting what binds, especially for membrane proteins where accessibility, topology, and post-translational modifications (PTMs) constrain actionable regions. We present Site4Drug, a modality-aware site-finding agent that outputs a ranked list of targetable regions with explicit constraints, evidence summaries, risk flags, and a traceable decision log. Rather than requiring users to specify the drug modality upfront, Site4Drug can recommend a binding modality (e.g., antibody/peptide-like vs small-molecule) from the same evidence used for site discovery, including topology, hydropathy, PTM propensity, disulfides, domain context, and sequence. Importantly, this evidence is applied consistently across modalities, including small-molecule pocket discovery, to avoid selecting chemically plausible but biologically occluded sites.

2606.01661 2026-06-02 q-bio.NC q-bio.QM stat.ME

Feature leakage and the identifiability of direct-dependency entropy models of neural activity

特征泄露与神经活动直接依赖熵模型的可辨识性

Houman Safaai, Bernardo L. Sabatini

AI总结 本文研究直接依赖熵模型在预测神经活动时的局限性,指出其成功预测仅反映输入分布下的预测能力而非机制辨识,并提出诊断方法区分分布内预测与响应规则恢复。

详情
AI中文摘要

生物神经元在分支、电兴奋的树突上接收数千个突触输入,但群体活动通常用直接输入-输出规则建模,其中每个输入独立贡献于标量驱动。我们研究此类模型的成功预测能揭示和不能揭示关于神经计算的什么。对于匹配输出率和成对输出-输入共活性的条件最大熵模型,直接模型解释的熵是采样输入分布下的预测度量,而非机制识别测试。受限最大熵拟合是信息投影:省略的交互、时间或隐藏状态项,只要与包含的充分统计量相关,就可以吸收到拟合的一阶参数中。对于稀疏相关二进制输入,这种吸收具有明确的协偏度形式。我们引入诊断方法,将分布内预测与响应规则恢复分开:在保持P(y|x)固定而改变P(x)的状态重加权、局部可加性的条件对数几率对比以及时间泄露控制。在真实模拟中,纯高阶响应可以在易泄露采样下通过一阶熵和原始共活性测试,但在重加权后被正确分类。应用于从CA1海马记录中选择的、泄露丰富的局部表格时,大约一半在经验权重下表现为一阶的表格在平衡重加权下变得分布敏感,远高于匹配加性替代零假设。因此,直接熵解释分数和原始共活性预测应解释为在观测状态分布下的预测,而非直接模型之外的机制不存在或微弱的证据。

英文摘要

Biological neurons receive thousands of synaptic inputs on branching, electrically excitable dendrites, yet population activity is often modeled with direct input-output rules in which each input contributes independently to a scalar drive. We study what successful prediction by such models does, and does not, reveal about neural computation. For conditional maximum-entropy models that match output rates and pairwise output-input coactivities, the entropy explained by a direct model is a prediction measure under the sampled input distribution, not a mechanism-identification test. A restricted MaxEnt fit is an information projection: omitted interaction, temporal, or hidden-state terms can be absorbed into fitted first-order parameters whenever they are correlated with the included sufficient statistics. For sparse correlated binary inputs, this absorption has an explicit coskewness form. We introduce diagnostics that separate in-distribution prediction from recovery of the response rule: state reweighting that holds P(y|x) fixed while changing P(x), conditional log-odds contrasts for local additivity, and temporal leakage controls. In ground-truth simulations, purely higher-order responses can pass first-order entropy and raw coactivity tests under leakage-prone sampling, but are correctly classified after reweighting. Applied to selected, leakage-enriched local tables from CA1 hippocampal recordings, approximately half of tables that appear first-order under empirical weights become distribution-sensitive under balanced reweighting, far above a matched additive-surrogate null. Thus direct entropy-explained fractions and raw coactivity predictions should be interpreted as predictions under the observed state distribution, not as evidence that mechanisms outside the direct model are absent or small.

2606.01642 2026-06-02 q-bio.SC

An agent-based model of outer membrane biogenesis in Gram-negative bacteria

革兰氏阴性菌外膜生物发生的基于智能体的模型

Thomas Williams, James M. Osborne, Kwok Jian Goh, Trevor Lithgow, Jennifer Flegg

AI总结 通过半定量基于智能体模型,研究革兰氏阴性菌外膜生长过程中的分子尺度动力学,发现BAM复合体介导的蛋白插入易停滞且呈爆发式,并暗示BAM与Lpt复合体协同工作。

详情
AI中文摘要

外膜是革兰氏阴性菌(包括大肠杆菌和多种致命病原体)与外界环境相互作用的界面。过去二十年对外膜生物发生过程的研究揭示了介导该过程的组分,并表征了这些细菌细胞机器组件的结构和功能。然而,目前的实验方法和传统的分子动力学模拟方法都无法在细胞分裂周期的时间尺度上研究这种膜机器。这留下了关键问题未解,例如这种脂质贫乏、相对静态的环境如何组织以允许持续的膜生长。在这里,我们引入了一个半定量基于智能体模型,以探索革兰氏阴性菌外膜生长过程中的分子尺度动力学。在广泛参数空间内的模型模拟表明,由$β$-桶组装机器(BAM复合体)介导的蛋白掺入膜是一个容易停滞的过程,可能仅以短爆发形式发生。我们还发现,BAM复合体彼此之间以及与邻近的脂多糖插入Lpt复合体存在协作。我们引入的基于智能体框架提供了一种手段,用于在以前无法达到的时间尺度上评估和生成关于外膜生物发生的假设。

英文摘要

The outer membrane is the interface through which Gram-negative bacteria - a broad classification of organisms including \textit{Escherichia coli} and a number of deadly pathogens - interact with the environment. Two decades of work on the process of outer membrane biogenesis have led to the discovery of the components that mediate this process, and the characterisation of structure and function of these component parts of the bacterial cell machinery. However, neither current experimental methods, nor conventional molecular dynamics (MD) simulation approaches are capable of investigating this membrane machinery on the time scale of the cell division cycle. This leaves crucial questions unanswered, such as how this lipid-poor, largely static environment is organised to permit ongoing membrane growth. Here, we introduce a semi-quantitative agent-based model to explore the molecular-scale dynamics of Gram-negative outer membrane as it grows. Model simulations across a broad region of parameter space suggest that protein incorporation into the membrane by the $β$-barrel assembly machinery (BAM complex) is a process which is prone to stalling, and may take place only in short bursts. We also find suggestions that BAM complexes work collaboratively with each other, and with the lipopolysaccharide-inserting Lpt complex when in close proximity. The agent-based framework we introduce provides a means to assess and generate hypotheses on outer membrane biogenesis on previously inaccessible time scales.

2606.01628 2026-06-02 q-bio.BM cs.AI

Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling

揭示具有内在测地耦合的多模态生物分子协同设计

Keyue Qiu, Xintong Wang, Zhilong Zhang, Hao Zhou, Wei-Ying Ma

AI总结 针对生物分子协同设计中模态间时间耦合被忽视的问题,提出GeoCoupling框架优化异构模态的时间耦合,在基于结构的药物设计和无条件蛋白质设计中提升物理有效性和多样性。

详情
Comments
Accepted to ICML 2026
AI中文摘要

蛋白质和小分子配体等生物分子在生物系统中发挥核心作用,这源于序列与三维结构之间的紧密相互作用。最近的生物分子协同设计生成模型旨在通过联合建模耦合模态来捕捉这种相互作用。然而,现有方法大多采用并行执行边际生成过程,隐式地强制固定同步耦合。我们认为,一个关键但被忽视的自由度在于这些边际过程在训练和生成过程中如何时间耦合,不恰当的耦合会引入高方差监督和不一致的中间状态,影响模态一致性。为了解决这个问题,我们引入了GeoCoupling,一个优化异构模态之间时间耦合的系统框架。在基于结构的药物设计和无条件蛋白质设计上的实证结果表明,学习到的耦合始终优于同步和随机耦合基线,产生了具有改进的物理有效性和多样性的生物分子。

英文摘要

Biomolecules such as proteins and small-molecule ligands play a central role in biological systems, arising from the tight interplay between sequence and three-dimensional structure. Recent generative models for biomolecular co-design aim to capture this interplay by jointly modeling coupled modalities. However, existing approaches largely adopt a parallel execution of marginal generative processes, implicitly enforcing fixed synchronous coupling. We argue that a critical but overlooked degree of freedom lies in how these marginal processes are temporally coupled during training and generation, where inappropriate coupling can introduce high-variance supervision and inconsistent intermediate states, affecting modality consistency. To address this, we introduce GeoCoupling, a systematic framework that optimizes for temporal couplings between heterogeneous modalities. Empirical results across structure-based drug design and unconditional protein design demonstrate the learned couplings consistently outperform synchronous and randomly coupled baselines, yielding biomolecules with improved physical validity and diversity.

2606.01611 2026-06-02 q-bio.BM

Peptide Structure Prediction Using Counter-Diabatic Quantum Approximate Optimization Algorithm (CD-QAOA)

使用反绝热量子近似优化算法(CD-QAOA)进行肽结构预测

Sung Won Yun, Yeon Gyo Seo, Seong Hun Jang, Suhyun Park, Joonwoo Bae, Sangwook Wu

AI总结 本研究采用反绝热量子近似优化算法(CD-QAOA)在四面体格点上预测七肽APRLRFY的结构,通过引入反绝热驱动项加速收敛,并结合经典方法验证,展示了量子-经典混合框架在短肽结构预测中的高效性和准确性。

详情
AI中文摘要

在这项研究中,我们使用量子近似优化算法(QAOA)在四面体格点上预测了七肽APRLRFY(一种神经肽序列)的结构。QAOA基于绝热近似,已成功应用于广泛的优化问题。然而,在基态搜索过程中经常报告收敛速度相对较慢。为了克服这一限制,我们采用了反绝热量子近似优化算法(CD-QAOA),该算法在绝热框架中引入了一个额外的反绝热驱动项,以加速肽结构预测中向基态的收敛。在七肽结构预测中,分子间相互作用通过两种不同方法建模。第一种方法仅将第二个残基脯氨酸(P)和第七个残基酪氨酸(Y)之间的相互作用纳入优化。第二种方法使用Miyazawa-Jernigan(MJ)相互作用矩阵对七肽内所有残基-残基相互作用进行建模。为了验证使用CD-QAOA预测的肽结构,我们还采用了多种经典计算方法,包括基于量子化学的Hartree-Fock(HF)计算和密度泛函理论(DFT)计算、常规分子动力学(MD)模拟以及哈密顿副本交换分子动力学(H-REMD)模拟。系统分析了从这些不同方法获得的构象之间的结构相似性。CD-QAOA对于预测短肽结构非常有效。特别是,我们证明了量子-经典混合框架可以显著提高肽结构预测的效率和准确性。

英文摘要

In this study, we predicted the structure of the heptapeptide APRLRFY, a neuropeptide sequence, on a tetrahedral lattice using a Quantum Approximate Optimization Algorithm (QAOA). QAOA is based on the adiabatic approximation and has been successfully applied to a wide range of optimization problems. However, relatively slow convergence during ground-state searches has frequently been reported. To overcome this limitation, we employed the Counter-Diabatic Quantum Approximate Optimization Algorithm (CD-QAOA), which introduces an additional counter-diabatic driving term into the adiabatic framework to accelerate convergence toward the ground state during peptide structure prediction. In the heptapeptide structure prediction, intermolecular interactions were modeled using two different approaches. In the first approach, only the interaction between the second residue, proline (P), and the seventh residue, tyrosine (Y), was included in the optimization. In the second approach, all residue-residue interactions within the heptapeptide were modeled using the Miyazawa-Jernigan (MJ) interaction matrix. To validate the peptide structures predicted using CD-QAOA, we additionally employed several classical computational methods, including quantum chemistry-based Hartree-Fock (HF) calculation and Density Functional Theory (DFT) calculation, conventional molecular dynamics (MD) simulation, and Hamiltonian replica exchange molecular dynamics (H-REMD) simulation. The structural similarities among the conformations obtained from these different approaches were systematically analyzed. CD-QAOA is highly effective for predicting the structures of short peptides. In particular, we demonstrate that a quantum-classical hybrid framework can significantly improve both the efficiency and accuracy of peptide structure prediction.

2606.01357 2026-06-02 q-bio.QM physics.data-an q-bio.NC

Hypergraphs from multivariate connectivity: caCoh-based EEG/MEG representation

基于多元连通性的超图:caCoh 的 EEG/MEG 表示

Daniil Vlasenko, Irina Saranskaia, Denis Zakharov

AI总结 本文提出基于典型相干(caCoh)从 EEG/MEG 数据构建超图的方法,通过一对多和空间对空间两种表示,在模拟中相比幅度平方相干(MSC)图更有效地恢复耦合频率和空间模式,并大幅减少边数。

详情
AI中文摘要

超图为表示分布在传感器集合上的神经生理交互提供了自然框架。一个关键的方法论问题是如何从频率分辨的脑电图/脑磁图(EEG/MEG)数据定义超边。我们展示了一种构建策略,其中超边从典型相干(caCoh)获得,caCoh 是相干性的扩展,用于估计多维信号空间之间的耦合。据我们所知,这是第一个直接从专门为频率分辨神经生理分析设计的多元连通性度量构建超图的工作。我们提出了两种基于 caCoh 的表示:一对空间超图,其中每个外部信号在 EEG/MEG 传感器空间上定义一个超边;以及空间对空间超图,其中两个多维信号空间由一个超边表示。我们在已知耦合频率和不同信噪比(SNR)的受控模拟中评估了该方法。与基于幅度平方相干(MSC)的图相比,基于 caCoh 的超图在几乎所有 SNR 水平下显示出统计上更高的目标-基线对比度,表明耦合频率的恢复更强。它们还恢复了与模拟源相关的传感器级空间模式。此外,一对空间和空间对空间表示将每个频率的 610 条 MSC 边分别减少到 10 条和 1 条超边。这些结果确立了多元频谱连通性作为 EEG/MEG 超图的自然方法论基础。

英文摘要

Hypergraphs provide a natural framework for representing neurophysiological interactions distributed across sets of sensors. A key methodological question is how hyperedges should be defined from frequency-resolved electroencephalography/magnetoencephalography (EEG/MEG) data. We demonstrate a construction strategy in which hyperedges are obtained from canonical coherence (caCoh), an extension of coherence that estimates coupling between multidimensional signal spaces. To our knowledge, this is the first work to construct hypergraphs directly from a multivariate connectivity measure specifically designed for frequency-resolved neurophysiological analysis. We propose two caCoh-based representations: a one-to-space hypergraph, where each external signal defines a hyperedge over the EEG/MEG sensor space, and a space-to-space hypergraph, where two multidimensional signal spaces are represented by a single hyperedge. We evaluate the approach in controlled simulations with known coupling frequencies and varying signal-to-noise ratio (SNR). Compared with graphs based on magnitude-squared coherence (MSC), caCoh-based hypergraphs showed statistically higher target-baseline contrasts at almost all SNR levels, indicating stronger recovery of coupling frequencies. They also recovered sensor-level spatial patterns associated with the simulated sources. In addition, one-to-space and space-to-space representations reduced 610 MSC edges per frequency to 10 and 1 hyperedges, respectively. These results establish multivariate spectral connectivity as a natural methodological basis for EEG/MEG hypergraphs.

2606.01329 2026-06-02 cs.LG q-bio.BM

Conditioned free-energy density of proteins using unbalanced solutions to constraint satisfaction problems

使用约束满足问题的不平衡解的条件化蛋白质自由能密度

Pratik Worah, Subhash Khot, Srinivasa Varadhan

AI总结 本文通过将条件化非均匀Curie-Weiss自旋哈密顿量的对数配分函数(自由能)简化为不平衡$2 \to 1$范数计算,并设计多项式时间SDP算法,应用于泛素蛋白以探索自由能景观并识别柔性区域。

详情
AI中文摘要

我们证明,计算条件化非均匀Curie-Weiss自旋哈密顿量的对数配分函数(自由能)简化为不平衡的$2 \to 1$范数计算,并为此问题设计了一个多项式时间的SDP算法,同时给出了所实现不平衡量的下界证明。应用于蛋白质泛素,该框架从已知晶体结构出发,探索自由能景观中的替代骨架构象,并在保留天然二级结构的同时识别蛋白质的柔性区域。

英文摘要

We show that computing the log-partition function (free-energy) of conditioned inhomogeneous Curie--Weiss spin Hamiltonians reduces to an unbalanced $2 \to 1$ norm computation, and design a polynomial-time SDP algorithm for this problem with a lower bound proof for the amount of unbalance achieved. Applied to the protein Ubiquitin, the framework starts from a known crystal structure, explores alternative backbone conformations across the free-energy landscape, and identifies flexible regions of the protein while preserving its native secondary structure.

2606.01264 2026-06-02 q-bio.NC cs.HC cs.SD eess.AS eess.SP

A 1000-hour EEG-EMG-audio dataset of Japanese speech production

1000小时日语语音产生的EEG-EMG-音频数据集

Motoshige Sato, Ilya Horiguchi, Masakazu Inoue, Kenichi Tomeoka, Eri Hatakeyama, Yuya Kita, Atsushi Yamamoto, Ippei Fujisawa, Shuntaro Sasai

AI总结 本研究构建了一个包含1020小时同步头皮脑电图、面部肌电图和语音音频的多模态数据集,来自三名健康日语母语者在开放词汇有声语音过程中的记录,旨在支持语音解码、多模态信号处理及脑电图表示学习等研究。

详情
AI中文摘要

我们提出了一个多模态数据集,包含来自三名健康日语母语者在开放词汇有声语音过程中同步记录的1020小时头皮脑电图(EEG)、面部肌电图(EMG)和语音音频。记录使用三种EEG系统——超高密度系统(g.Pangolin)和两种帽式系统(g.SCARABEO和eegosports),通道数从62到128不等——在数月内跨多个会话采集。每个会话提供时间同步的EEG、面部EMG和音频,以及语音事件注释和转录。尽管数据集的主要动机是语音解码,但它也支持多模态信号处理、伪影建模、纵向和跨设备适应以及EEG表示学习等工作。技术验证包括跨参与者、设备和任务的功率谱密度和事件相关电位分析,显示了预期的1/f频谱轮廓、任务相关的alpha频段衰减和时间锁定的诱发响应。该数据集以脑成像数据结构(BIDS)格式通过OpenNeuro在CC0豁免下发布,以支持语音相关及更广泛的EEG研究。

英文摘要

We present a multimodal dataset of 1020 hours of simultaneously recorded scalp electroencephalography (EEG), facial electromyography (EMG), and speech audio from three healthy native Japanese speakers during open-vocabulary overt speech. Recordings were acquired with three EEG systems-an ultra-high-density system (g.Pangolin) and two cap-type systems (g.SCARABEO and eegosports), spanning 62-128 channels-across many sessions over several months. Each session provides time-synchronized EEG, facial EMG, and audio, together with speech-event annotations and transcriptions. Although collected with speech decoding as a primary motivation, the dataset also supports work on multimodal signal processing, artifact modeling, longitudinal and cross-device adaptation, and EEG representation learning. Technical validation included power spectral density and event-related potential analyses across participants, devices, and tasks, which showed the expected 1/f spectral profile, task-related alpha-band attenuation, and time-locked evoked responses. The dataset is released in Brain Imaging Data Structure (BIDS) format via OpenNeuro under a CC0 waiver to support both speech-related and broader EEG research.

2606.01227 2026-06-02 cs.LG q-bio.NC

DAGGER: Gradient-Free Construction of Transiently Amplifying Networks under Hard Connectivity Constraints

DAGGER: 硬连接约束下瞬态放大网络的无梯度构造

James C. Ferguson

AI总结 提出无梯度单遍算法DAGGER,在硬符号/稀疏/对角约束下构造瞬态放大网络,通过单一标量β控制Wasserstein-2预算实现放大与多重集保留的平滑权衡。

详情
Comments
12 pages, 7 figures
AI中文摘要

许多网络不仅支持而且依赖于瞬态非正态放大,即稳定系统的活动增加数个数量级。在硬符号/稀疏/对角约束(与生物连接组和结构化RNN初始化相关的区域)下构造此类网络,迄今为止需要基于梯度的局部搜索(包含数千次内循环特征分解)或基于Schur形式的直接构造(在抽象基中,投影后破坏约束)。 本文提出DAGGER(有向无环图引导边重加权),一种无梯度单遍算法。给定稳定的有符号稀疏矩阵,DAGGER产生具有相同符号、稀疏性和对角的输出。单一标量β控制Wasserstein-2预算,平滑地权衡精确多重集保留(β=0)与放大;峰值放大随β几乎无界增长,经验上在数值溢出前达到10^10。 在单次前向传递中,DAGGER在多重集保留方面匹配或超过基于梯度的方法(比典型梯度内循环少30-100倍特征分解),并且在中等β下,在精确保持连接性的同时,超过它们数个数量级。我们开发了该算法,将其与现有方法以及下游信号检测任务进行比较,并检查了显示DAGGER在结构上与其他放大网络不同的诊断结果。

英文摘要

Many networks not only support but also rely on transient non-normal amplification, an orders-of-magnitude increase in the activity of an otherwise stable system. Constructing such networks under hard sign/sparsity/diagonal constraints -- the regime relevant for biological connectomes and structured RNN initializations -- has so far required either gradient-based local search with thousands of inner-loop eigendecompositions or Schur-form direct construction in an abstract basis that breaks the constraints under projection. Here we introduce DAGGER (Directed Acyclic Graph Guided Edge Reweighting), a gradient-free single-pass algorithm. Given a stable signed sparse matrix, DAGGER produces an output with the same sign, sparsity, and diagonal. A single scalar $β$ controls a Wasserstein-2 budget that smoothly trades exact multiset preservation ($β= 0$) for amplification; peak amplification grows essentially without bound with $β$, empirically reaching $10^{10}$ before numerical overflow. DAGGER matches or exceeds gradient-based methods at multiset preservation in a single forward pass -- 30-100$\times$ fewer eigendecompositions than a typical gradient inner loop -- and at moderate $β$ beats them by orders of magnitude with connectivity exactly preserved. We develop the algorithm, compare it to the existing methods and on a downstream signal-detection task, and examine the diagnostics that show why DAGGER is structurally different from other amplifying networks.

2606.01193 2026-06-02 cs.LO q-bio.MN q-bio.QM

Modulation-Reaction Networks

调制-反应网络

Leo Lobski, Yoàv Montacute

AI总结 本文提出调制-反应网络(MR-networks)框架,统一处理生物化学系统中的物质流和信息流,并开发调制-反应逻辑(MRL)以推理其同步布尔语义,通过模型检验和互模拟关系验证生物相关性质。

详情
Comments
To appear in the proceedings of Computational Methods in Systems Biology 2026
AI中文摘要

生物化学系统既涉及物质流(实体通过反应相互转化),也涉及信息流(实体调控哪些反应可以发生)。布尔网络捕捉后者;反应网络捕捉前者。然而,尽管在诸如系统生物学图形符号过程描述(SBGN-PD)语言等标准中,受调控的反应作为主要研究对象非常突出,但目前尚无统一的定性形式化方法将其作为主要研究对象。我们引入调制-反应网络(MR-networks),这是一个数学框架,其中实体通过激活和抑制来调制反应,并研究其同步布尔语义。为了推理MR-networks,我们开发了调制-反应逻辑(MRL),这是一种混合模态$μ$-演算,其模态推理网络结构,不动点算子捕捉计算的时间演化。我们建立了一系列有效性,包括一步更新规则的完整刻画,并通过形式化生物学感兴趣的性质(如可达性、持续生产和吸引子的存在)展示了MRL的表达能力。我们证明MRL通过评估博弈允许模型检验,并引入了MR-networks的互模拟关系,该关系被证明对所有MRL公式保持不变。作为迈向生物学上更真实计算模型的一步,我们概述了MR-networks的异步语义,并概述了同步情况下的发展如何转移到异步情况的研究中。

英文摘要

Biochemical systems involve both the flow of matter, in which entities transform into one another via reactions, and the flow of information, in which entities regulate which reactions may occur. Boolean networks capture the latter; reaction networks capture the former. Yet no unified qualitative formalism treats regulated reactions as its principal objects of study, despite their prominence in standards such as the Systems Biology Graphical Notation Process Description (SBGN-PD) language. We introduce modulation-reaction networks (MR-networks), a mathematical framework in which entities modulate reactions through activations and inhibitions, and study their synchronous Boolean semantics. To reason about MR-networks we develop Modulation-Reaction Logic (MRL), a hybrid modal $μ$-calculus whose modalities reason about the structure of the network and whose fixed-point operators capture temporal evolution of the computation. We establish a collection of validities, including a complete characterisation of the one-step update rule, and demonstrate the expressive power of MRL by formalising properties of biological interest such as reachability, sustained production, and presence of attractors. We show that MRL admits model-checking via an evaluation game, and introduce a bisimulation relation for MR-networks, which is proved to be invariant for all MRL-formulas. As a step towards a biologically more realistic computational model, we sketch the asynchronous semantics of MR-networks, and outline how the developments for the synchronous case transfer to the study of the asynchronous one.

2606.00955 2026-06-02 cs.LG q-bio.QM

CryoProt: A Protein Pretraining Framework with Cross-Box Interactions on Cryo-EM Density Maps

CryoProt: 一种基于冷冻电镜密度图跨盒交互的蛋白质预训练框架

Dan Luo, Xuan Lin, Peng Zhou, Junwen Zhu, Tengfei Ma, Xiangxiang Zeng, Yiping Liu

AI总结 提出 CryoProt 框架,通过多头潜在注意力机制实现密度图跨盒交互建模,并采用多任务预训练策略,在蛋白质柔性预测等下游任务中取得最高12%的性能提升。

详情
AI中文摘要

尽管冷冻电镜(cryo-EM)密度图的数据日益增多,但有效利用它们进行蛋白质表示仍具挑战。首先,当前方法缺乏专门针对cryo-EM密度图设计的通用蛋白质预训练框架,用于蛋白质相关属性预测。其次,现有方法通常将密度图划分为局部盒区域并独立建模,忽略了跨盒交互,而这对捕获cryo-EM密度图中的全局结构上下文至关重要。为解决这些挑战,我们提出CryoProt,一种专为cryo-EM密度图设计的蛋白质预训练框架。CryoProt引入了基于多头潜在注意力(MLA)的图编码器,其中盒级表示通过共享潜在空间进行交互,从而显式建模密度图内的跨盒依赖关系。此外,我们采用多任务预训练策略来学习可泛化的表示,这些表示可以有效地迁移到各种下游任务,例如蛋白质柔性预测,其中不需要cryo-EM密度图,而可以由预训练模型隐式推断。实验结果表明,CryoProt在多个基准测试中持续优于现有最先进方法,相比最佳基线实现了高达12%的提升,突显了在cryo-EM数据中建模跨盒交互的重要性。源代码公开于https://anonymous.4open.science/r/CryoProt。

英文摘要

Despite the growing availability of cryo-electron microscopy (cryo-EM) density maps, effectively leveraging them for protein representation remains challenging. First, current methods lack a general-purpose protein pretraining framework tailored for cryo-EM density maps, designed for protein-related property prediction. Second, existing approaches typically partition density maps into local box regions and model them independently, overlooking interactions across boxes which are essential for capturing global structural context in cryo-EM density map. To address these challenges, we propose CryoProt, a protein pretraining framework designed for cryo-EM density maps. CryoProt introduces a Map Encoder based on multi-head latent attention (MLA), where box-level representations interact through a shared latent space, enabling explicit modeling of cross-box dependencies within the density map. Furthermore, we adopt a multi-task pretraining strategy to learn generalizable representations that can be effectively transferred to diverse downstream tasks, such as protein flexibility prediction, where cryo-EM density maps are not required and can be inferred implicitly by the pretrained model. Experimental results demonstrate that CryoProt consistently outperforms existing state-of-the-art methods across multiple benchmarks, achieving up to 12% improvement over the best-performing baselines, highlighting the importance of modeling cross-box interactions in cryo-EM data. The source code is publicly available at https://anonymous.4open.science/r/CryoProt.

2606.00667 2026-06-02 q-bio.NC cs.LG

Cortex and subcortex play distinct roles over learning when cortical memory is limited

皮层与皮层下在学习中扮演不同角色:当皮层记忆受限时

Matthew Farrell, Taro Toyoizumi

AI总结 通过约束模型基模块的记忆资源,研究皮层与皮层下系统在学习中的功能分离,发现皮层支持一般结构学习而皮层下专攻奖励学习。

详情
Comments
Preprint. 19 pages, 4 figures
AI中文摘要

已有研究表明,大脑将灵活但计算成本高的皮层处理与更简单、成本更低的皮层下机制相结合,以实现比任一系统单独运行更高效的资源利用。尽管这一观点具有吸引力,但探索该假设的理论框架仍然有限。我们扩展了现有框架,其中模型基模块和模型无关模块并行学习,通过显式约束模型基模块的记忆资源,并在一个简单的决策设置中研究该约束的影响。记忆约束自然引发了分配记忆资源的策略。我们评估了不同策略在不同情境下的表现,并证明当奖励状态频繁变化时,模型基模块将记忆资源用于捕捉环境的通用结构而非利用当前奖励可能更有利。这项工作为学习过程中皮层和皮层下系统的功能分离提供了理论基础:皮层支持通用结构学习,而皮层下回路专门负责基于奖励的学习。我们进一步详细说明了如何在实验数据上检验这些假设。

英文摘要

It has been proposed that the brain integrates flexible, computationally expensive cortical processing with simpler, lower-cost subcortical mechanisms to achieve resource-efficient performance greater than that of either system alone. Despite the allure of this perspective, satisfying theoretical frameworks that explore this hypothesis are still limited. We extend existing frameworks in which a model-based module and model-free module learn in tandem by explicitly constraining the memory resources of the model-based module, and investigate the impact of this constraint in a simple decision-making setting. Memory constraints naturally give rise to strategies for allocating memory resources. We evaluate the performance of different strategies in different situations and demonstrate that when the rewarded states change often, it can be advantageous for the model-based module to focus its memory resources not on exploiting the current reward, but on capturing general structure of the environment. This work provides a theoretical foundation for a functional dissociation between cortical and subcortical systems during learning: the cortex supports general structure learning, while subcortical circuits specialize in reward-based learning. We further detail how these hypotheses can be tested on experimental data.

2606.00483 2026-06-02 q-bio.GN cs.LG

Annotation-Informed Block-Sparse Bayesian Modeling for cis-Expression Prediction

基于注释信息的块稀疏贝叶斯建模用于顺式表达预测

Lei Huang, Hui Shen, Kuan-Jui Su, Chuan Qiu, Martha Isabel Gonzalez-Ramirez, Anqi Liu, Zhe Luo, Yun Gong, Yipu Zhang, Dawei Li, Chaoyang Zhang, Hong-Wen Deng

AI总结 提出块稀疏贝叶斯稀疏线性混合模型(bsBSLMM),通过整合LD块尖峰-板稀疏性和TSS先验,提高了顺式表达预测性能及下游TWAS发现能力。

详情
Comments
16 pages manuscript; 38 pages supplementary
AI中文摘要

基于基因型的顺式表达预测依赖于对局部调控架构的精确建模。我们提出了块稀疏贝叶斯稀疏线性混合模型(bsBSLMM),这是贝叶斯稀疏线性混合模型(BSLMM)的扩展,它整合了连锁不平衡(LD)块的尖峰-板稀疏性和转录起始位点(TSS)先验的SNP包含。在来自GEUVADIS欧洲血统淋巴母细胞系系的23,098个基因中,在匹配的评估标准下,bsBSLMM保留了比BSLMM、LASSO、BLUP、TIGAR弹性网和TIGAR狄利克雷过程回归更多可预测的基因。与BSLMM相比,bsBSLMM提高了大多数共享基因的留出预测性能,其增益主要由LD块稀疏性驱动,并通过TSS先验进一步增强。bsBSLMM选择的变异在GM12878 DNase和H3K27ac调控区域中显示出比BSLMM选择的变异更强的富集性。在全转录组关联研究(TWAS)分析中,bsBSLMM恢复了已建立的炎症性肠病信号,包括IL23R,并识别了BSLMM未检测到的其他全基因组显著基因。在路易斯安那州骨质疏松症研究中的独立验证重现了跨祖先的预测产量增加,并在下游TWAS和基因集富集分析中恢复了生物学相关的骨矿物质密度通路。这些结果表明,整合LD块结构和生物学先验的SNP改进了顺式表达预测并增强了下游TWAS发现。

英文摘要

Genotype-based cis-expression prediction depends on accurately modeling local regulatory architecture. We present block-sparse Bayesian sparse linear mixed model (bsBSLMM), an extension of Bayesian sparse linear mixed model (BSLMM) that incorporates linkage disequilibrium (LD)-block spike-and-slab sparsity and a transcription start site (TSS)-informed SNP inclusion prior. Across 23,098 genes from GEUVADIS European-ancestry lymphoblastoid cell lines, bsBSLMM retained more predictable genes than BSLMM, LASSO, BLUP, TIGAR elastic net, and TIGAR Dirichlet-process regression under matched evaluation criteria. Compared with BSLMM, bsBSLMM improved held-out prediction performance for most shared genes, with gains driven primarily by LD-block sparsity and further enhanced by the TSS-informed prior. Variants selected by bsBSLMM showed stronger enrichment in GM12878 DNase and H3K27ac regulatory regions than variants selected by BSLMM. In transcriptome-wide association study (TWAS) analysis, bsBSLMM recovered established inflammatory bowel disease signals, including IL23R, and identified additional genome-wide significant genes not detected by BSLMM. Independent validation in the Louisiana Osteoporosis Study reproduced the increased prediction yield across ancestries and recovered biologically relevant bone mineral density pathways in downstream TWAS and gene set enrichment analyses. These results demonstrate that incorporating LD-block structure and biologically informed SNP priors improves cis-expression prediction and enhances downstream TWAS discovery.

2606.00373 2026-06-02 q-bio.NC

Sequential chaotic oscillations in excitatory-inhibitory threshold-linear networks

兴奋-抑制阈值线性网络中的序列混沌振荡

Jie Zang, Carina Curto

AI总结 本文提出兴奋-抑制阈值线性网络中的序列混沌振荡作为序列亚稳态的候选动力学机制,通过发展新的图规则分析路径和环上的不动点结构,揭示了序列混沌振荡需要不稳定的单例不动点和足够强的抑制。

详情
Comments
37 pages, 12 figures
AI中文摘要

亚稳态是大脑动力学和许多其他系统中观察到的一种现象,被认为是健康大脑功能的关键特征,反映了整合与分离之间的平衡。然而,如何在动力系统框架内捕捉这种行为仍不清楚。在本文中,我们提出兴奋-抑制阈值线性网络(E-I TLNs)中产生的序列混沌振荡(SCOs)作为序列亚稳态的候选动力学机制。作为混沌巡游的一种简单形式,SCOs在恒定输入下发生,由一系列亚稳态组成,其转换顺序可由底层图预测。为了确定SCOs的参数区域,我们为E-I TLNs开发了新的图规则,并用它们来表征路径和环上E-I TLNs的不动点结构。我们的结果表明,SCOs的出现需要不稳定的单例不动点和足够强的抑制。除了SCOs,我们还发现E-I振荡不必是同步的。受此启发,我们引入了z模式和平均模式的分解,分别捕捉兴奋性差异和整体网络活动。然后,这些模式被用来区分与环上E-I TLNs的全支持不动点相关的吸引子。

英文摘要

Metastable states, a phenomenon observed in brain dynamics and many other systems, have been proposed as a key feature of healthy brain function, reflecting a balance between integration and segregation. However, it remains unclear how to capture this behavior within a dynamical-systems framework. In this paper, we propose sequential chaotic oscillations (SCOs), arising in excitatory-inhibitory threshold-linear networks (E-I TLNs), as a candidate dynamical mechanism for sequential metastability. As a simple form of chaotic itinerancy, SCOs occur under constant input and consist of a sequence of metastable states whose transition order can be predicted by the underlying graph. To identify the parameter regime for SCOs, we develop new graph rules for E-I TLNs and use them to characterize the fixed point structure of E-I TLNs on paths and cycles. Our results show that the emergence of SCOs requires unstable singleton fixed points and sufficiently strong inhibition. In addition to SCOs, we find that E-I oscillations need not be synchronized. Motivated by this, we introduce a decomposition into the z-mode and the mean mode, which capture excitatory differences and overall network activity, respectively. These modes are then used to distinguish attractors associated with the full-support fixed point of E-I TLNs on cycles.

2606.00326 2026-06-02 q-bio.NC cond-mat.dis-nn

On the synaptic matrix eigenvalues of sparsely connected neural networks

稀疏连接神经网络的突触矩阵特征值

Mohd. Gayas Ansari, Pragya Shukla

AI总结 本文通过谱分析研究不同稀疏类型突触矩阵模型的特征值分布,探讨稀疏性对网络动力学和稳定性的影响,为诱导特定脑功能或瞬态机制提供理论依据。

详情
Comments
46 pages, 12 figures
AI中文摘要

突触矩阵(代表神经元连接强度)的谱行为是分析典型大脑稳定性、瞬态动力学以及其学习过程和记忆容量的重要工具。由于大量神经元以及潜在的瞬态机制(如稳态、癫痫发作或突触可塑性)导致的脑复杂性,可能产生具有时变稀疏度和类型的网络。这使得突触矩阵的精确确定不仅在技术上困难,而且毫无意义,因此其统计分析成为最佳的理论方法。这促使我们对具有不同稀疏类型的突触矩阵模型进行谱分析,从而分析稀疏性对网络动力学和稳定性各个方面的影响。我们的结果对于确定诱导特定脑功能或所需瞬态机制(例如药理效应或生理调节剂)所需的突触稀疏类型具有潜在意义。

英文摘要

The spectral behaviour of the synaptic matrix, representing the neuronal connection strengths, is an important tool to analyze the stability and transient dynamics of a typical brain as well as its learning process and memory capacity. The complexity of the brain due to large number of neurons as well as underlying transient mechanisms e.g. homeostasis, seizure or synaptic plasticity can lead to networks with time-varying degree and type of sparsity. This renders an exact determination of the synaptic matrix not only technically difficult but also meaningless, leaving its statistical analysis as the best available theoretical approach. This motivates us to pursue a spectral analysis of the synaptic matrix models with different type of sparsity and thereby analyze latter's role on various aspects of network dynamics and stability. Our results have potential relevance for detemining the type of synaptic sparsity required to induce a specific brain function or desired transient mechanism e.g for pharmacological effects or physiological modulators.

2606.00286 2026-06-02 cond-mat.dis-nn cond-mat.stat-mech physics.bio-ph q-bio.SC

Localization of Active Particles on Random Arrays of Parallel Filaments

活性粒子在平行丝状体随机阵列上的局域化

Owen Santoso, Elena Koslover

AI总结 研究无序丝状体阵列如何导致间歇运动粒子在汇聚极性区域局域化,通过快速附着-脱离极限下的有效能量景观和随机游走近似,揭示中间行程长度下最强的局域化效应。

详情
AI中文摘要

环境中的淬火无序可以根本性地改变主动和被动系统中的输运动力学。我们探索无序的丝状体阵列如何控制间歇运动粒子的分布,这些粒子在扩散和定向输运之间切换。受哺乳动物树突中观察到的平行微管混合极性排列的启发,我们表明这种阵列倾向于导致粒子在丝状体汇聚取向区域局域化。在快速附着-脱离极限下,无序系统可以用一个含噪声的一维有效能量景观来描述,其结构近似为随机游走。该景观中势阱的深度和宽度表示为输运动力学和系统几何的函数。局域化在中间行程长度处最强,此时偏向输运持续足够长的时间以感知淬火的丝状体极性,但又不至于长到促进从局部陷阱中逃逸。这些结果证明了粒子在随机丝状体网络上运动的鲁棒局域化,突出了由主动输运和淬火无序相互作用产生的涌现空间组织。

英文摘要

Quenched disorder in the environment can fundamentally alter transport dynamics in both active and passive systems. We explore how disordered arrays of filaments govern the distribution of intermittently moving particles which switch between diffusive and processive transport. Motivated by the mixed-polarity arrangements of parallel microtubules observed in mammalian dendrites, we show that such arrays tend to result in localization of particles at regions of convergent filament orientation. In the rapid attachment-detachment limit, the disordered system can be described by a noisy one-dimensional effective energy landscape, whose structure is approximated by a random walk. The depth and width of wells on this landscape are expressed as a function of the transport kinetics and system geometry. Localization is shown to be strongest at intermediate run-lengths, where biased transport persists long enough to sense the quenched filament polarity but not so long as to facilitate escape from local traps. These results demonstrate robust localization of particles moving on random filament networks, highlighting the emergent spatial organization that arises from an interplay of active transport and quenched disorder.

2606.00243 2026-06-02 cs.NE q-bio.NC stat.ML

Dynamics and Representation Structure of Local Approximations to Gradient-Based Learning in Linear Recurrent Neural Networks

线性递归神经网络中基于梯度学习的局部近似的动力学与表示结构

Ezekiel Williams, Alexandre Payeur, Guillaume Lajoie

AI总结 本文应用动力系统理论分析线性RNN中局部学习算法(RFLO和tBPTT)与BPTT的差异,发现RFLO的解被限制在初始参数的低秩扰动上。

详情
Comments
accepted to ICML 2026 as poster. Current version is camera-ready submission
AI中文摘要

生物和神经形态递归神经网络(RNN)在学习过程中可合理使用的信息受到空间和时间局部性约束。满足这些约束的常见策略是通过不同程度地忽略非局部项来修改梯度下降,如随机反馈局部在线学习(RFLO)和截断时间反向传播(tBPTT)。然而,这些算法的学习动力学及其与BPTT的比较仍不清楚。我们将动力系统理论应用于数据对齐的线性RNN——其动力学可分解为正交模式——以比较平稳解、稳定性性质和收敛速度,发现RFLO与BPTT及一步tBPTT在行为上存在质的差异。我们进一步观察到,RFLO学习的解被限制在初始参数的低秩扰动上,这一结果在数据对齐设置之外也成立。我们的工作为局部性约束如何塑造学习动力学提供了分析性见解,对神经科学的学习模型和RNN的替代优化方法具有启示意义。

英文摘要

Biological and neuromorphic recurrent neural networks (RNNs) are subject to spatial and temporal locality constraints on the information that can plausibly be used during learning. A common strategy to satisfy these constraints is to modify gradient descent by neglecting non-local terms to varying degrees, as in random feedback local online (RFLO) learning and truncated backpropagation through time (tBPTT). However, the learning dynamics of these algorithms, and how they compare with BPTT, remain poorly understood. We apply dynamical systems theory to data-aligned linear RNNs -- whose dynamics can be separated into orthogonal modes -- to compare stationary solutions, stability properties, and convergence rates, finding qualitatively distinct behaviour for RFLO versus BPTT and one-step tBPTT. We further observe that the solutions learned by RFLO are restricted to low-rank perturbations of initial parameters, a result which holds beyond the data-aligned setting. Our work provides analytical insight into how locality constraints shape learning dynamics, with implications for neuroscientific models of learning and alternative optimization approaches for RNNs.

2606.00226 2026-06-02 q-bio.NC

Consciousness, AI, and the Limits of Scientific Explanation

意识、人工智能与科学解释的局限

Bradley C. Love

AI总结 本文论证意识难题不是科学问题,而是范畴错误,并指出机器意识同样无法科学裁决。

详情
Comments
18 pages, no figures
AI中文摘要

科学本质上是第三人称的:其发现在原则上可由任何观察者重现,独立于视角,并可测量。这是科学力量的源泉,也是其在面对第一人称现象时的局限。虽然显而易见的是,关于生命意义的科学无法实现,但研究人员尚未对意识得出同样的结论——在其现象维度上,即看到红色、感到疼痛、成为任何事物的感受质。我认为他们应该得出这一结论。意识的困难问题不是一个等待更好工具或更雄心勃勃理论来解决的科学问题,而是一个范畴错误。同样的结构性问题也适用于机器意识:无论是归属还是否认都无法科学裁决。我将科学置于更广泛的理解生态中,并论证一个同时解决客观和主观的统一框架可能无法实现。

英文摘要

Science is constitutively third-personal: its findings are in principle reproducible by any observer, independent of perspective, and answerable to measurement. This is the source of its power and also its limit when it comes to phenomena that are first-personal. While it is obvious that a science of the Meaning of Life is unattainable, researchers have not drawn the same conclusion for consciousness -- in its phenomenal dimension, the qualia of seeing red, of feeling pain, of being anything at all. I argue they should. The hard problem of consciousness is not a scientific problem awaiting better tools or a more ambitious theory, but a category error. The same structural problem applies to machine consciousness: neither attribution nor denial is scientifically adjudicable. I situate science within a broader ecology of understanding and argue that a unified framework that addresses both the objective and the subjective may be unattainable.

2606.00192 2026-06-02 q-bio.PE

Mechanics of Pandemics

大流行病的力学

Seba Contreras, Philipp Dönges, Laura Müller, Piklu Mallick, Sydney Paltra, Ulrik Hvid, Robyn J. N. Kettlitz, Andreas Reitenbach, Rodrigo Amaral Lind, Maíra Aguiar, Philip Bechtle, André Calero Valdez, Ronja Gronemeyer, Manuela Harries, Veronika K. Jaeger, André Karch, Carolina J. Klett-Tammen, Peter Klimek, Mirjam E. Kretzschmar, Kai Nagel, Bjarke Frost Nielsen, Barbara Prainsack, Isabella M. Radhuber, Lone Simonsen, Kim Sneppen, Janik Suer, Viola Priesemann

AI总结 本文由多学科团队总结出十项基本机制,揭示疾病传播的普遍规律,以帮助社会更好地理解和应对未来大流行病。

详情
AI中文摘要

COVID-19 及以往的大流行病已表明疾病如何能够扰乱、威胁并改变日常生活。由于病原体和社会不断演变,每次大流行病都各不相同。然而,疾病传播的某些基本原则在不同疫情中似乎都成立。这些“机制”基于自然法则或我们生物与社会的基本结构。本文汇编了十项基本机制,由来自公共卫生、医学、流行病学、政治学、数学、物理学和心理学等多学科背景的团队精心挑选。这些机制尽管可能未被充分重视,但极大地塑造了大流行病的演变和控制方式。我们越能成功理解这些机制并将这些知识确立于社会之中,就越能为未来的大流行病做好准备,并在它们发生时做出适当应对。

英文摘要

COVID-19 and previous pandemics have shown how diseases can disrupt, threaten, and transform daily life. Since pathogens and societies are continuously evolving, every pandemic is different. However, certain fundamental principles of disease transmission appear to hold true across different outbreaks. These ``mechanisms'' are grounded in natural laws or the very structure of our biology and societies. This paper compiles ten fundamental mechanisms, curated by a multidisciplinary team with backgrounds spanning public health, medicine, epidemiology, political science, mathematics, physics, and psychology. These mechanisms, although perhaps underappreciated, substantially shape how pandemics unfold and are controlled. The better we succeed in understanding these mechanisms and establishing this knowledge in our societies, the better we will be able to prepare for future pandemics and respond appropriately when they occur.

2602.23179 2026-06-02 cs.LG q-bio.BM

Induction Meets Biology: Mechanisms of Repeat Detection in Protein Language Models

归纳遇见生物学:蛋白质语言模型中重复检测的机制

Gal Pomerants, Yaniv Nikankin, Anja Reusch, Tomer Tsaban, Ora Schueler-Furman, Yonatan Belinkov

AI总结 通过分析蛋白质语言模型在掩码预测中的行为,揭示了其检测精确和近似重复序列的两阶段机制:先构建特征表示,再利用归纳头关注重复片段中的对齐标记。

详情
AI中文摘要

蛋白质序列中存在大量重复片段,既有精确拷贝,也有带有突变的近似片段。这些重复对蛋白质结构和功能至关重要,推动了数十年来关于重复识别的算法研究。最近的研究表明,蛋白质语言模型(PLMs)通过掩码标记预测中的行为能够识别重复。为了阐明其内部机制,我们研究了PLMs如何检测精确和近似重复。我们发现,近似重复的机制在功能上包含了精确重复的机制。然后,我们描述了这一机制,揭示了两个主要阶段:首先,PLMs使用通用位置注意力头和生物学特化组件(如编码氨基酸相似性的神经元)构建特征表示;然后,归纳头关注重复片段中的对齐标记,促进正确答案的产生。我们的结果揭示了PLMs如何通过将基于语言的模式匹配与特化的生物学知识相结合来解决这一生物学任务,从而为研究PLMs中更复杂的进化过程奠定了基础。

英文摘要

Protein sequences are abundant in repeating segments, both as exact copies and as approximate segments with mutations. These repeats are important for protein structure and function, motivating decades of algorithmic work on repeat identification. Recent work has shown that protein language models (PLMs) identify repeats, by examining their behavior in masked-token prediction. To elucidate their internal mechanisms, we investigate how PLMs detect both exact and approximate repeats. We find that the mechanism for approximate repeats functionally subsumes that of exact repeats. We then characterize this mechanism, revealing two main stages: PLMs first build feature representations using both general positional attention heads and biologically specialized components, such as neurons that encode amino-acid similarity. Then, induction heads attend to aligned tokens across repeated segments, promoting the correct answer. Our results reveal how PLMs solve this biological task by combining language-based pattern matching with specialized biological knowledge, thereby establishing a basis for studying more complex evolutionary processes in PLMs.

2511.04047 2026-06-02 q-bio.NC cs.NE

Considering a generative mechanism of consciousness from the perspective of inter-level causation

从层间因果关系的角度考虑意识的生成机制

Yoshiyuki Ohmura, Yasuo Kuniyoshi

AI总结 本文从层间因果关系出发,提出意识的生成由系统内部因果机制决定,而非功能,并通过双定律模型解释意识生成及其因果性。

详情
AI中文摘要

为什么有些物理系统拥有意识,而其他系统却没有?我们不将意识视为主观体验,而是将其视为伴随体验的物理事件。这是一个物理学问题吗?还是因果关系理论的问题?物理学和因果关系理论服务于不同的描述目的。为了描述因果模型,我们引入了因果之间的不对称关系,这对于描述因果关系是必要的,但对于物理定律则不是。我们提出意识的生成由系统的内部因果机制决定,而不是由系统的功能(即物理决定的输入-输出关系)决定。为了解释这些内在原因,我们关注整体到部分的因果关系。传统上,整体到部分的因果关系被视为涌现现象而非机制。我们设计了一种方法,通过检查源自较高层次的原因如何在系统内传递到较低层次,来在因果模型中显式实现这些机制。然后,我们提出了一个双定律模型(DLM),该模型在较高和较低层次具有不同的动力学定律。最后,我们基于DLM讨论了功能性意识的生成及其因果关系。

英文摘要

Why do some physical systems possess consciousness, while others do not? We view consciousness not as a subjective experience, but rather as a physical event accompanying experience. Is this a question of physics? Or is it a question of the theory of causation? Physics and the theory of causation serve different descriptive purposes. To describe a causal model, we introduce an asymmetric relation between cause and effect that is necessary for describing causality, but not physical laws. We propose that the generation of consciousness is determined by a system's internal causal mechanisms, rather than by a system's functions (i.e., physically determined input-output relations). To explain these intrinsic causes, we focus on whole-to-parts causality. Traditionally, whole-to-parts causality is considered an emergent phenomenon rather than a mechanism. We devise a method for explicitly implementing these mechanisms in a causal model by examining how causes originating at higher levels are transmitted to lower levels within a system. We then propose a dual-laws model (DLM), which features distinct dynamical laws at higher and lower levels. Finally, we discuss the generation of functional consciousness and its causality based on the DLM.

2605.01430 2026-06-02 q-bio.NC

Measuring Understanding Through Discrete Compositional Knowledge Structures in Hierarchical Automata

通过层次自动机中的离散组合知识结构测量理解

Igor Balaz

AI总结 提出基于有限状态机和层次自动机的框架,通过离散可检查的结构签名(如表示形成、结构知识、泛化能力、组合意识和元认知访问)来测量人工认知系统中的理解,并在简单几何域中验证了该方法与统计相关性的区分能力。

详情
Comments
AGI 2026 Conference
AI中文摘要

我们如何测量人工认知系统中的真正理解?当前方法面临测量差距:概率系统逐步细化置信度,基于实践的系统通过重复执行编译知识,神经系统在不透明的嵌入空间中分布理解。我们提出,使理解可测量需要架构中理解形成产生离散、可检查的结构签名。本文提出了由表示模式的有限状态机和表示组合的高阶自动机构建的层次自动机。约束推理从单个观测构建自动机。相似性检测聚类相关自动机,使概念鲁棒性可量化。图记忆使组合知识直接可检查。元认知机制实现可观察的重配置。我们在一个简单的几何域中演示了理解测量。图演化跟踪揭示了五个可测量的签名:即时表示形成、结构知识、泛化能力、组合意识和元认知访问。这些测量将结构理解与统计相关性区分开来。我们的贡献是一个通过离散组合知识结构使理解可测量的框架。该测量能力补充了神经系统中的感知学习和神经符号架构中的任务执行。

英文摘要

How do we measure genuine understanding in artificial cognitive systems? Current approaches face a measurement gap: probabilistic systems refine confidence gradually, practice-based systems compile knowledge through repeated execution, and neural systems distribute understanding across opaque embedding spaces. We propose that making understanding measurable requires architectures where understanding formation produces discrete, inspectable structural signatures. This paper presents hierarchical automata built from finite state machines representing patterns and higher-order automata representing compositions. Constrained inference constructs automata from single observations. Similarity detection clusters related automata, making concept robustness quantifiable. Graph memory makes compositional knowledge directly inspectable. Metacognitive mechanisms enable observable reconfiguration. We demonstrate understanding measurement in a simple geometric domain. Graph evolution tracking reveals five measurable signatures: immediate representation formation, structural knowledge, generalization capacity, compositional awareness, and metacognitive access. These measurements distinguish structural understanding from statistical correlation. Our contribution is a framework for making understanding measurable through discrete compositional knowledge structures. This measurement capability complements perceptual learning in neural systems and task execution in neurosymbolic architectures.

2604.20615 2026-06-02 q-bio.QM

Semi supervised GAN for smart microscopy, fast and data efficient cell cycle classification

用于智能显微镜的半监督GAN:快速且数据高效的细胞周期分类

Rajeev Manick, Youssef El Habouz, Maëlle Guillout, Celia Martin, Julia Bonnet, Louis Ruel, Sylvain Pastezeur, Olivier Chanteux, Otmane Bouchareb, Marc Tramier, Jacques Pécréaux

AI总结 提出半监督生成对抗网络(SGAN),结合未标记显微镜图像与合成样本,在标注数据有限且类别不平衡条件下实现鲁棒的细胞周期阶段分类,在Mitocheck数据集上仅用每类80个标注和600个未标注图像达到93±2%准确率。

详情
AI中文摘要

现代光学显微镜已完全电动化;然而,将其转变为真正的智能系统需要根据检测到的对象和动态生物事件实时调整采集设置。核心是分类算法,这些算法通常依赖于定制化软件,并且通常针对狭窄定义的生物学应用设计。此外,它们通常需要大量标注数据集才能有效训练。我们引入了一种半监督生成对抗网络(SGAN),用于在低资源条件下进行稳健的细胞周期阶段分类,并适应多种细胞结构。该框架将未标记的显微镜图像与合成生成的样本相结合,以缓解标注有限的问题,同时即使在未标记子集类别不平衡时也能保持稳定的性能。在包含五个有丝分裂类别的Mitocheck数据集上测试,该模型仅使用每类80个标注图像和600个未标注图像就达到了93±2%的准确率。所提出的算法是通用的,可以通过迁移学习轻松适应新的标注方案、分类目标、细胞系或显微镜模式。SGAN非常适合集成到自动显微镜中,能够在各种生物学和显微镜应用中实现高效且适应性强的图像分析。

英文摘要

Modern optical microscopes are fully motorised; however, transforming them into truly smart systems requires real-time adjustment of acquisition settings in response to detected objects and dynamic biological events. At the core are classification algorithms that commonly depend on customised software and are generally designed for narrowly-defined biological applications. In addition, they often require substantial annotated datasets for effective training. We introduce a semi-supervised generative adversarial network (SGAN) for robust cell-cycle stage classification under low-resource conditions, adaptable to diverse cellular structures. The framework combines unlabelled microscopy images with synthetically generated samples to mitigate limited annotation, while preserving stable performance even when the unlabelled subset is class-imbalanced. Tested on the Mitocheck dataset, which features five mitosis classes, the model achieved $93 \pm 2\%$ accuracy using only 80 labelled per class and 600 unlabelled images. The proposed algorithm is generic and can be readily adapted to new labeling schemes, classification targets, cell lines, or microscopy modalities through transfer learning. SGAN is well suited for integration into automated microscopes, enabling efficient and adaptable image analysis across diverse biological and microscopy applications.