arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.27189 2026-05-27 cs.CL cs.LG cs.SD eess.AS q-bio.NC

Beyond Binary: Speech Representations Across the Cognitive Score Hierarchy

超越二元:认知评分层级中的语音表征

Serli Kopar, Roshan Prakash Rane, Christian Mychajliw, Lydia Federmann, Gerhard Eschweiler, Daniela Berg, Sam Gijsen, Paula Andrea Perez-Toro, Kerstin Ritter

AI总结 本研究利用5,754份德语神经心理学评估录音,比较手工声学特征与自监督学习嵌入在轻度认知障碍认知评估层级(任务、领域、全局)中的表现,发现任务约束与评估层级之间的关联。

详情
AI中文摘要

本研究考察了轻度认知障碍中语音表征与认知评估层级结构之间的关系。利用5,754份德语神经心理学评估录音,我们在三个评分层级(任务、领域和全局)上评估了六项认知任务。我们比较了手工声学特征与自监督学习(SSL)嵌入。结果表明,尽管SSL表示在较低层级通常优于手工特征,但这种趋势在MCI分类中发生逆转。此外,任务特定约束影响性能:响应自由度较大的任务随着层级增加表现出性能稀释,表明“专家”表示,而高度结构化任务的性能向更高层级增加,表明“通才”表示。这些发现揭示了自动临床语音分析中任务约束与评估层级之间的联系。

英文摘要

This study examines the relationship between speech representations and the hierarchical structure of cognitive assessment in mild cognitive impairment. Utilizing 5,754 German neuropsychological assessment recordings, we evaluate six cognitive tasks across three score levels: task, domain, and global levels. We compare hand-crafted acoustic features with self-supervised learning (SSL) embeddings. Results show that although SSL representations generally outperform hand-crafted features at lower levels, this trend reverses for MCI classification. Furthermore, task-specific constraints influence performance: tasks with greater response freedom exhibit performance dilution as hierarchical levels increase, suggesting ``specialist'' representations, whereas the performance of highly structured tasks increases toward higher levels, suggesting ``generalist'' representations. These findings show links between task constraints and assessment hierarchy in automated clinical speech analysis.

2605.27100 2026-05-27 q-bio.PE

Logistic dynamics of small populations with demographic stochasticity

具有人口统计学随机性的小种群逻辑斯蒂动力学

Lucas M. Brugevin, Damián H. Zanette

AI总结 本研究通过随机出生、死亡和移民事件建模有限种群,利用解析和数值方法刻画活跃期和空缺期的统计特性,发现活跃期呈现双峰分布,并探讨了网络结构对种群聚类和空缺期频率的影响。

详情
Journal ref
Phys. Rev. E 113 (2026) 054315
AI中文摘要

我们研究了一个受生态学启发的有界种群模型,其动力学由随机出生、死亡和移民事件支配。个体数量的随机波动导致活跃期和空缺期交替出现,其中种群分别存在和灭绝。利用解析和数值技术,我们刻画了这两种时期的统计特性,量化了它们的持续时间和频率,以及活跃期中的典型种群规模。与由逻辑斯蒂动力学支配的确定性平均场行为形成鲜明对比的是,活跃期可能表现出明显的双峰性:要么持续时间短且种群规模非常小,要么持续时间长且种群规模接近最大值。我们还研究了当种群在三种随机网络(Erdős-Rényi、正则和地理网络)上演化时这些结果如何变化。网络结构的主要作用是诱导种群聚类,个体聚集为局部群体,这反过来限制了种群增长并增加了空缺期的频率。

英文摘要

We study an ecology-inspired model for a population of bounded size, whose dynamics is governed by random birth, death, and immigration events. Stochastic fluctuations in the number of individuals give rise to a succession of alternating active and vacant periods, where the population is respectively extant and extinct. Using both analytical and numerical techniques, we characterize the statistics of the two kinds of period, quantifying their duration and frequency, and the typical population sizes in active periods. In sharp contrast to the deterministic mean-field behavior, governed by logistic dynamics, active periods may exhibit pronounced bimodality: either short durations with very small populations, or much longer durations with population sizes close to the maximum. We also investigate how these results change when the population evolves on random networks of three classes: Erdős-Rényi, regular, and geographic. The main effect of the network structure is to induce population clustering, with individuals aggregated into localized groups. This, in turn, limits population growth and increases the frequency of vacant periods.

2605.26998 2026-05-27 cs.LG q-bio.NC

Probabilistic Recurrent Intention Switching Model

概率递归意图切换模型

Wenyuan Sheng, Hao Zhu, Joschka Boedecker

AI总结 提出PRISM模型,利用轻量级递归网络建模非平稳意图切换,实现精确EM分解和闭式求解,在网格世界、小鼠迷宫和机器人操作任务中取得最优似然并恢复可解释意图。

详情
AI中文摘要

逆强化学习(IRL)从观察到的行为中恢复奖励函数,但传统方法假设单一固定奖励,无法捕捉一个回合内的目标切换。最近的多意图IRL方法通过分割轨迹来解决这一问题,但将意图转换建模为无记忆马尔可夫链或通过固定历史窗口的手动状态增强。我们提出概率递归意图切换模型(PRISM),该模型用轻量级递归网络替代这两种机制,将观察历史映射到每步意图分布。我们证明由此产生的EM目标可以精确分解为独立的每意图奖励子问题,每个子问题可闭式求解,从而得到$\mathcal{O}(nK)$的E步,无需变分近似。我们在非马尔可夫网格世界、小鼠迷宫和BridgeData~V2机器人操作(首个大规模多意图IRL机器人应用)上评估PRISM。在所有设置中,PRISM在保持最高留出对数似然的同时,从未标记的演示中恢复出可命名、时间上连贯的意图,表明离散目标切换存在于生物和人工智能体中。

英文摘要

Inverse reinforcement learning (IRL) recovers reward functions from observed behavior, yet traditional methods assume a single stationary reward that cannot capture goal switching within an episode. Recent multi-intention IRL methods address this by segmenting trajectories, but model intention transitions as either a memoryless Markov chain or via manual state augmentation with a fixed history window. We propose the Probabilistic Recurrent Intention Switching Model (PRISM), which replaces both mechanisms with a lightweight recurrent network that maps observation history to a per-step intention distribution. We prove that the resulting EM objective decomposes exactly into independent per-intention reward subproblems, each solvable in closed form, yielding an $\mathcal{O}(nK)$ E-step with no variational approximation. We evaluate PRISM on a non-Markovian gridworld, a mouse labyrinth, and BridgeData~V2 robotic manipulation, the first large-scale robotic application of multi-intention IRL. Across all settings PRISM achieves the highest held-out log-likelihood while recovering nameable, temporally coherent intentions from unlabeled demonstrations, suggesting that discrete goal switching is present in both biological and artificial agents.

2605.26973 2026-05-27 stat.ML cond-mat.dis-nn cs.LG cs.NE q-bio.NC

Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

信噪比与样本量控制神经网络中的表征对齐

Ali Hussaini Umar, Alessandro Laio

AI总结 通过理论和实验证明,信噪比和训练样本量以单调和非单调方式分别影响神经网络表征对齐,且对齐程度在插值阈值附近最小,与泛化误差解耦。

详情
AI中文摘要

已知神经网络会发展出潜在表征,这些表征是$对齐$的,即在不同架构、训练协议或训练数据集训练的网络之间结构相似。我们在一个受控环境中研究这一现象,使用被噪声过程的独立实现扰动的训练集,训练一组网络执行回归和分类任务。我们表明,信噪比(SNR)和训练样本量以定性相似的方式影响对齐,无论是在真实世界数据集上训练的网络,还是在极其简单的具有单个隐藏层的$线性$网络中(其对齐可以解析估计)。在线性和非线性网络、回归和分类任务以及合成和真实数据中,我们一致观察到,对齐随SNR单调变化,但随训练样本量非单调变化。特别地,对齐在插值阈值附近最小,且更强的对齐不一定对应更好的泛化误差。这些发现揭示了数据质量和数量对对齐的非平凡依赖关系,且与泛化性能解耦。

英文摘要

Neural networks are known to develop latent representations that are $aligned$, namely structurally similar across networks trained with different architectures, training protocols, or training datasets. We study this phenomenon in a controlled setting, where we train an ensemble of networks on regression and classification tasks using training sets perturbed by independent realizations of a noise process. We show that the signal-to-noise ratio (SNR) and the training sample size influence the alignment in qualitatively similar ways in networks trained on real-world datasets and in an extremely simple $linear$ network with a single hidden layer, for which the alignment can be estimated analytically. Across linear and nonlinear networks, regression and classification tasks, and both synthetic and real-world data, we consistently observe that alignment varies monotonically with SNR but non-monotonically with training sample size. In particular, the alignment is minimized near the interpolation threshold, and a stronger alignment does not necessarily correspond to better generalization error. These findings reveal a non-trivial dependence of alignment on data quality and quantity, decoupled from generalization performance.

2605.26904 2026-05-27 q-bio.CB

SpCAST: Decoding spatial transcriptomics at single-cell resolution with fast and interpretable analysis

SpCAST: 通过快速且可解释的分析解码单细胞分辨率空间转录组学

Yiyang Zhang, Bokai Zhao, Xiaoru Zhang, Zongchang Du, Xiaojuan Sun, Tianzi Jiang

AI总结 提出基于Kolmogorov-Arnold网络的SpCAST框架,通过非线性映射和特征归因实现细胞类型标签转移、空间基因表达重建和标记基因优先排序,在53个数据集上验证了其高效性和可解释性。

详情
AI中文摘要

单细胞分辨率的空间转录组学在原生组织中细胞位置处描绘基因表达,但准确的细胞类型注释仍具挑战性:基于成像的平台受限于靶向基因面板,而基于测序的平台常因分子捕获稀疏和缺失而受影响。因此,从单细胞RNA测序参考可靠地转移细胞类型标签对于解释靶向和稀疏的空间数据集至关重要。在此,我们提出SpCAST,一种基于Kolmogorov-Arnold网络的参考引导空间转录组学分析框架。SpCAST捕获参考与空间表达谱之间的非线性映射,并使用特征归因来优先排序支持细胞类型预测的基因。在统一框架内,SpCAST执行细胞类型标签转移、空间基因表达重建和标记基因候选优先排序。我们在涵盖五种技术和多种组织背景的53个数据集(包含413,376个空间细胞)上对SpCAST进行了基准测试。与现有代表性方法相比,SpCAST在减少运行时间的同时实现了具有竞争力的注释性能。案例研究表明,SpCAST支持跨物种标签转移和最初未标记细胞的候选分配。它还能以改善的空间一致性重建标记基因表达模式,并优先排序与细胞类型相关的标记基因。总之,这些结果支持SpCAST作为一种高效且可解释的框架,用于从靶向和稀疏的单细胞分辨率空间转录组学数据中提取细胞类型和基因水平信息。

英文摘要

Single-cell-resolution spatial transcriptomics profiles gene expression at cellular locations in native tissues, yet accurate cell-type annotation remains challenging: imaging-based platforms are constrained by targeted gene panels, whereas sequencing-based platforms often suffer from sparse molecular capture and dropout. Reliable transfer of cell-type labels from single-cell RNA sequencing references is therefore critical for interpreting targeted and sparse spatial datasets. Here, we present SpCAST, a Kolmogorov--Arnold network-based framework for reference-guided spatial transcriptomics analysis. SpCAST captures nonlinear mappings between reference and spatial expression profiles and uses feature attribution to prioritize genes supporting cell-type predictions. Within a unified framework, SpCAST performs cell-type label transfer, spatial gene-expression reconstruction and marker-gene candidate prioritization. We benchmarked SpCAST on 53 datasets comprising 413,376 spatial cells across five technologies and diverse tissue contexts. SpCAST achieved competitive annotation performance with reduced runtime relative to representative existing methods. Case studies demonstrated that SpCAST supports cross-species label transfer and candidate assignment of originally unlabeled cells. It also reconstructs marker-gene expression patterns with improved spatial concordance and prioritizes cell-type-associated marker genes. Together, these results support SpCAST as an efficient and interpretable framework for extracting cell-type and gene-level information from targeted and sparse single-cell-resolution spatial transcriptomics data.

2605.26856 2026-05-27 q-bio.NC cs.AI cs.RO

The Sensation Modulating Network:Haltability as the architectural ground for object-directed phenomenology

感觉调节网络:可停性作为对象导向现象学的架构基础

G. Nagarjuna, Durgaprasad Karnam

AI总结 本文提出感觉调节网络(SMN)作为具身认知的架构,通过对手动力学和可停性机制,将对象导向现象学(胡塞尔意义)的意向性建立在身体组织的结构特征上,从而调和认知主义与4E认知的争论。

详情
Comments
64 pages, main body 38 pages + References 6, Appendices 20 pages, Tables 3, and Figures 21
AI中文摘要

认知科学仍然分裂为认知主义——它解释了递归和语言,但无法将形式符号扎根于意义——和4E方法——它将认知扎根于身体,但很少详细说明身体的架构以支持生成性。我们认为这一僵局源于对具身代理架构的不完整描述,并提出一个架构:感觉调节网络(SMN),即认知代理被构想为整个身体,在每个解剖尺度上由对手动力学组织,由感觉调节器构建,这些调节器通过一个基底感知和行动,配对成协调动作区,由全身广播网络路由。三个承诺赋予了SMN其效力。可停性——将对抗性可供性招募到共激活平衡中——提供了对象导向现象学(在胡塞尔意义上)所需的架构位置:对手性使得共激活成为可能,共激活使得停止成为可能,停止使得注意成为可能,注意使得意向指向成为可能,而无需在顶层添加任何模块。可自我调节动作模式(SMAP)的双信号特性使得自我/世界区分成为布线的结构特征,而非代理应用的范畴。四级动作模式层级——基础、可停、可协商、交易——提供了从自主规律性到公共惯例化的单一轨迹,将基于语法的生成性条件定位为架构转变。SMN调和了认知主义与4E的争论:递归存在于可协商动作模式的可修改动力学中,具身性存在于支持它们的对手基底中。附录中给出了一个初步的形式化方法和八个预测寄存器(七个可测试,一个假设性),以及参考模拟。

英文摘要

Cognitive science remains split between cognitivism - which accounts for recursion and language but cannot ground formal symbols in meaning - and 4E approaches - which ground cognition in the body but rarely specify the body's architecture in enough detail to support generativity. We argue the impasse stems from an incomplete account of the embodied agent's architecture, and propose one: the Sensation Modulating Network (SMN), the cognitive agent conceived as the whole body, organized at every anatomical scale by opponent dynamics, built from Sensation Modulators that sense and act through one substrate, paired into Coordinated Action Zones routed by a body-wide broadcast network. Three commitments give the SMN its purchase. Haltability - the recruitment of antagonistic affordance into co-activated equilibrium - provides the architectural locus that object-directed phenomenology, in Husserl's sense, requires: opponency enables co-activation, co-activation enables halt, halt enables attention, attention enables intentional directedness, with no module added on top. The dual-signal property of self-modulatable action patterns (SMAPs) makes the self/world distinction a structural feature of the wiring rather than a category the agent applies. And a four-level action-pattern hierarchy - Basal, Haltable, Negotiable, Transactional - gives a single trajectory from autonomic regularity to public conventionalization, locating the conditions for grammar-grounded generativity as architectural transitions. The SMN reconciles the cognitivism-4E debate: recursion lives in the modifiable dynamics of Negotiable Action Patterns, embodiment in the opponent substrate that supports them. A tentative formalism and eight predicted registers (seven testable, one hypothetical), with reference simulations, are given in an appendix.

2605.26852 2026-05-27 q-bio.PE cs.DM

Recognizing Level-k-Based Phylogenetic Networks is NP-Complete

识别基于Level-k的系统发育网络是NP完全的

Takatora Suzuki

AI总结 本文证明对于每个固定整数k≥1,判断给定有根几乎二元系统发育网络的最小level是否≤k是NP完全的,从而证实了Suzuki和Hayamizu关于该问题NP难的猜想。

详情
Comments
19 pages, 7 figures
AI中文摘要

系统发育网络通过表示网状进化来推广系统发育树。基于树的网络及其支持树已被广泛研究,但并非所有网络都是基于树的。为了衡量此类网络与基于树的网络之间的距离,Suzuki和Hayamizu(2025)提出了寻找给定有根几乎二元系统发育网络的最小level的支持网络的问题。他们猜想该问题是NP难的,并提供了指数时间算法。在本文中,我们通过证明对于每个固定整数$k \geq 1$,判断最小level是否至多为$k$是NP完全的,从而证实了这一猜想。

英文摘要

Phylogenetic networks generalize phylogenetic trees by representing reticulate evolution. Tree-based networks and their support trees have been extensively studied, but not all networks are tree-based. To measure how far such networks are from being tree-based, Suzuki and Hayamizu (2025) formulated the problem of finding the support network with minimum level of a given rooted almost-binary phylogenetic network. They conjectured that this problem is NP-hard and provided exponential-time algorithms. In this paper, we prove this conjecture by showing that, for every fixed integer $k \geq 1$, it is NP-complete to decide whether the minimum level is at most $k$.

2605.26758 2026-05-27 physics.bio-ph q-bio.OT

Biophoton Emission from Palm during Meditation: A Multi-Method Complexity Analysis

冥想期间手掌的生物光子发射:多方法复杂性分析

E. Pace, L. De Paolis, G. Felici, I. Vaglini, M. Sandrini, M. Pettini, A. Gemignani, M. Benfatto

AI总结 采用多方法复杂性分析框架,研究人类手掌在冥想期间的超弱光子发射,发现冥想导致发射间歇性系统性降低,并与心脏和脑电复杂性变化一致。

详情
Comments
49 pages, 8 Figures, it will be submitted to Frontiers in Photobiology
AI中文摘要

生物光子是生物体在可见光谱中产生的超弱光子发射。虽然已在植物、发芽种子和细胞培养物中得到广泛研究,但尚未有对人类在生理调节下的超弱光子发射(UPE)进行系统性的多方法复杂性分析。我们通过将综合分析框架应用于人类受试者右手掌的UPE测量来填补这一空白。在不同日期进行了三次独立实验,每次包括四个连续的15分钟阶段:暗参考、冥想前静息状态(Pre)、基于Sama Vritti箱式呼吸协议的结构化冥想、以及冥想后恢复(Post)。使用四种互补方法分析光子计数序列:分布统计(Fano因子、偏度、尾部预期短缺);多尺度Fano因子和Allan偏差;条纹滤波扩散熵分析(DEA);以及带时间反转检验的Renyi熵。这些方法显示出互补的敏感性,汇聚成一个连贯的图像:冥想期间发射间歇性系统性降低,在三次实验中一致检测到。条纹滤波DEA将发射置于非遍历更新状态,并显示从Pre到冥想的标度指数下降。Renyi分析揭示了两种效应:边际幅度突发性(Tdir)降低和序列模式结构(Tseq)增加,解释为对Sama Vritti节奏的同步。这些发现与Tuladhar等人报告的冥想期间心脏复杂性转变以及Zaccaro等人报告的Sama Vritti呼吸期间脑电图重组一致,表明存在协调的多通道生理反应。结果建立了在生理调节下人类UPE复杂性分析的概念验证框架。

英文摘要

Biophotons are ultra-weak photon emissions in the visible spectrum produced by living organisms. While extensively studied in plants, germinating seeds, and cell cultures, no systematic multi-method complexity analysis of human ultraweak photon emission (UPE) under physiological modulation has been reported. We address this gap by applying a comprehensive analytical framework to UPE measurements from the right palm of a human subject. Three independent sessions were conducted on different days, each comprising four consecutive 15-minute phases: Dark reference, pre-meditation resting state (Pre), structured meditation based on the Sama Vritti box-breathing protocol, and post-meditation recovery (Post). Photon count series are analysed with four complementary methods: distributional statistics (Fano factor, skewness, tail Expected Shortfall); multiscale Fano factor and Allan deviation; stripe-filtered Diffusion Entropy Analysis (DEA); and Renyi entropy with a Time Reversal test. The methods show complementary sensitivities, converging on a coherent picture: a systematic reduction of emission intermittency during meditation, consistently detected across all three sessions. Stripe-filtered DEA places the emission in the non-ergodic renewal regime with a Pre-to-Meditation decrease of the scaling exponent. Renyi analysis reveals two effects: reduced marginal amplitude burstiness (Tdir) and increased sequential pattern structure (Tseq), interpreted as entrainment to the Sama Vritti rhythm. These findings are consistent with cardiac complexity transitions during meditation reported by Tuladhar et al. and with EEG reorganization during Sama Vritti breathing by Zaccaro et al., suggesting a coordinated multi-channel physiological response. The results establish a proof-of-concept framework for complexity analysis of human UPE under physiological modulation.

2605.21617 2026-05-27 cs.LG q-bio.QM

$\textit{BlockFormer}$ : Transformer-based inference from interaction maps

$ extit{BlockFormer}$:基于交互图的Transformer推理

Eloïse Touron, Pedro L. C. Rodrigues, Julyan Arbel, Nelle Varoquaux, Michael Arbel

AI总结 提出BlockFormer,一种基于Transformer架构的数据驱动方法,通过模拟器生成合成数据训练,解决从交互图中推断可变数量和大小实体参数的反问题,并成功应用于多种物种的着丝粒定位。

详情
AI中文摘要

从交互图中进行推理,例如从全基因组染色体构象捕获技术(特别是Hi-C)中识别着丝粒,可以表述为一个通用的反问题:给定一个通过可变数量和大小的块总结实体间成对相互作用的图,推断一组参数。在这项工作中,我们引入了一种数据驱动的方法,利用这些图之间的共享结构(例如局部模式的全局对齐),同时处理真实数据中实体数量和大小可变性。我们的方法依赖于能够处理这种可变性的Transformer架构,以及一个自定义模拟器,用于生成丰富且计算成本低廉的合成数据进行训练。应用于着丝粒定位问题,该方法能够准确恢复各种基因组大小的多种物种的着丝粒基因组位置。

英文摘要

Inference from interaction maps, such as centromere identification from genome-wide chromosome conformation capture techniques -- notably Hi-C -- can be formulated as a generic inverse problem: infer a set of parameters given a map summarizing pairwise interactions between entities through blocks of variable numbers and sizes. In this work, we introduce a data-driven approach that leverages shared structure between these maps, such as global alignment between localized patterns, while handling the variability in number and size of entities arising in real-world data. Our approach relies on a transformer architecture capable of handling such variability and a custom simulator to generate abundant, yet computationally cheap synthetic data for training. Applied to the problem of centromere localization, the method accurately recovers their genomic positions across a wide range of species of various genome sizes.

2605.26690 2026-05-27 cs.LG cs.AI q-bio.QM

Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets

SILO:基于生物引导搜索的自改进模仿用于预算约束下的蛋白质设计

Ashima Khanna, Dominik Grimm

AI总结 提出SILO框架,通过层次化编辑策略、增量随机束搜索和UCB代理集成,在有限oracle预算下实现蛋白质序列优化,在8个蛋白质适应度景观上达到最优性能。

详情
AI中文摘要

在严格的oracle预算下进行蛋白质序列优化需要探索巨大的组合空间,同时使每次评估都具有信息量。现有的强化学习和离策略生成方法在代理噪声下性能下降,且位置无关的突变提议可能破坏功能关键残基。我们提出了SILO,一个用于oracle预算蛋白质设计的轨迹级自改进模仿框架。SILO使用层次化编辑策略,将每个突变分解为位置选择后跟残基选择。在每个主动学习轮次中,策略通过增量随机无放回束搜索(SBS)采样候选轨迹,结合基于UCB的代理集成和丙氨酸扫描适应度分数(AFS),选择具有功能相关编辑的候选进行计算机oracle评估。然后,通过在轮次中最佳oracle标记轨迹上的下一动作交叉熵模仿来更新策略,避免值函数估计。在八个复现的蛋白质适应度景观和来自先前工作的五个强基线上,SILO在我们的评估中在8/8的景观上实现了最高的最大和top-100平均适应度,通常表现出更快的早期改进。在每种设置两个景观的低数据和噪声代理压力测试中,当多个基线退化时,SILO保持竞争力或最佳。消融实验表明,SBS与AFS贡献了大部分增益,迭代模仿提供了额外改进。代码可在:https://github.com/grimmlab/SILO.git 获取。

英文摘要

Protein sequence optimization under tight oracle budgets requires methods that explore vast combinatorial spaces while making each evaluation informative. Existing reinforcement learning and off-policy generative approaches often degrade under surrogate noise, and position-agnostic mutation proposals risk disrupting functionally critical residues. We introduce SILO, a trajectory-level self-improvement imitation framework for oracle-budgeted protein design. SILO uses a hierarchical edit policy that decomposes each mutation into a position choice followed by a residue choice. In each active-learning round, the policy samples candidate trajectories via incremental stochastic beam search without replacement (SBS), and a UCB-based proxy ensemble, combined with an alanine-scan fitness score (AFS), selects candidates with functionally relevant edits for in silico oracle evaluation. The policy is then updated by next-action cross-entropy imitation on the round's best oracle-labeled trajectories, avoiding value-function estimation. Across eight reproduced protein fitness landscapes and five strong baselines from prior work, SILO achieves the highest maximum and top-100 mean fitness on 8 of 8 landscapes within our evaluations, often exhibiting faster early-stage improvement. In low-data and noisy-proxy stress tests on two landscapes per setting, SILO remains competitive or best when several baselines degrade. Ablations show that SBS with AFS account for much of the gains, with iterative imitation providing additional improvement. Code is available at: https://github.com/grimmlab/SILO.git

2605.26551 2026-05-27 q-bio.NC cond-mat.dis-nn physics.bio-ph

Random neural networks match observed dimensionality of neural population recordings and motivate stronger experimental tests

随机神经网络匹配神经群体记录的观测维度并激励更强的实验检验

Zehui Zhao, Michael J Pasek, Ilya M Nemenman

AI总结 通过将有限测量时间和行为上下文变异性纳入动态平均场理论,证明随机神经网络预测的维度与大规模记录一致,并指出流形方向相似性比维度对网络结构更敏感。

详情
AI中文摘要

随机连接的神经网络长期以来作为研究神经群体集体动力学的理论工具,但与实验的定量比较仍然有限。最近的技术进步使得解析跨神经元的群体相关性成为可能,而随机神经网络等最小模型预测了它们的通用结构。两者是否定量一致尚未得到检验。在这项工作中,我们通过建立在动态平均场理论的最新进展基础上,并将两个额外的实验相关特征纳入模型:有限测量时间和跨行为上下文的变异性,检验了最小结构的随机神经网络能否解释神经群体记录中活动的低维度。我们表明,当包含这些因素时,从大规模记录中测量的维度与随机模型预测的值一致。然而,当前的记录持续时间使得难以利用维度来区分连接结构。我们进一步表明,分析预测的维度随外部输入强度非单调变化,并且在不同行为背景下记录的神经流形之间的方向相似性可能比维度对网络结构更敏感。总之,这些结果为实验设计提供了定量指导,以推断群体活动背后的连接结构。

英文摘要

Randomly connected neural networks have long served as a theoretical tool for studying collective dynamics in neural populations, yet quantitative comparisons to experiments remain limited. Recent technological advances have made it possible to resolve population-wide correlations across neurons, and minimal models such as random neural networks predict their generic structure. Whether the two agree quantitatively remains untested. In this work, we examine whether a minimally structured random neural network can account for the low dimensionality of activity in neural population recordings by building on recent developments in Dynamical Mean-Field Theory and incorporating two additional experimentally relevant features into the model: finite measurement time and variability across behavioral contexts. We show that, when these factors are included, the dimensionality measured from large-scale recordings is consistent with the values predicted by random models. However, current recording durations make it difficult to use dimensionality to discriminate among connectivity structures. We further show that analytically predicted dimensionality varies non-monotonically with external input strength, and that the orientation similarity between neural manifolds recorded under different behavioral contexts can be more sensitive to network structure than dimensionality is. Together, these results provide quantitative guidance for experimental design to infer the connectivity structure underlying population activity.

2605.26192 2026-05-27 cs.LG cs.AI q-bio.BM

Co-folding model guided by structural proteomics

结构蛋白质组学引导的共折叠模型

Alon Shtrikman, Nitzan Simchi, Michal Ran Shchory, Sagie Brodsky, Eran Seger, Kirill Pevzner

AI总结 提出AIMS-Fold框架,通过整合XL-MS和HDX-MS实验数据与扩散模型,在推理时引导蛋白质复合物构象生成,提升诱导接近靶标的预测准确性。

详情
AI中文摘要

蛋白质结构生成模型擅长从序列预测单个蛋白质的静态结构,但通常无法捕捉蛋白质复合物的正确构象状态,这对蛋白质设计和诱导接近模式(如抗体和PROTACs)至关重要。虽然交联质谱(XL-MS)和氢氘交换质谱(HDX-MS)等结构蛋白质组学技术提供了有价值的空间和动态信息,但将这些稀疏、异质的测量整合到这些模型中仍然是一个开放的挑战。在这里,我们通过将结构蛋白质组学数据与预训练扩散模型学到的丰富生物物理先验相结合来弥合这一差距。我们引入了AIMS-Fold,一个推理时引导扩散框架,它使用源自XL-MS空间约束和HDX-MS溶剂可及性轮廓的可微物理势能主动引导生成采样轨迹。我们证明这些结构方法各自提高了预测准确性,并且它们的整合产生了协同改进。关键的是,通过利用这些实验约束,AIMS-Fold在具有挑战性的诱导接近靶标上比纯计算、无引导的最先进模型(如Boltz-2)实现了更高的准确性。这确立了我们的框架作为诱导接近药物基于结构的药物设计的强大整合计算方法。评估代码将在发表后公开。

英文摘要

Protein structure generative models excel at predicting single protein static structures from sequence, but routinely fail to capture the correct conformational state of protein complexes, critical for protein design and induced proximity modalities such as antibodies and PROTACs. While structural proteomics techniques like Cross-Linking Mass Spectrometry (XL-MS) and Hydrogen-Deuterium Exchange (HDX-MS) offer valuable spatial and dynamic insights, integrating these sparse, heterogeneous measurements into these models remains an open challenge. Here, we bridge this gap by combining structural proteomics data with the rich biophysical priors learned by pretrained diffusion models. We introduce AIMS-Fold, an inference-time guided-diffusion framework that actively steers the generative sampling trajectory using differentiable physical potentials derived from XL-MS spatial restraints and HDX-MS solvent accessibility profiles. We demonstrate that these structural methods individually enhance predictive accuracy, and their integration yields synergistic improvement. Crucially, by leveraging these experimental restraints, AIMS-Fold achieves higher accuracy on challenging induced proximity targets than purely computational, unguided state-of-the-art models like Boltz-2. This establishes our framework as a powerful, integrative computational approach for the structure based drug design of induced proximity drugs. Evaluation code will be made publicly available upon publication.

2605.22133 2026-05-27 q-bio.BM cs.AI

Atom-level Protein Representation Learning Improves Protein Structure Prediction

原子级蛋白质表示学习改进蛋白质结构预测

Taewon Kim, Hyosoon Jang, Hyunjin Seo, Seonghwan Seo, Hyeongwoo Kim, Wonho Zhung, Mingyeong Shin, Wooyoun Kim, Sungsoo Ahn

AI总结 提出结构感知预训练方法TriProRep,通过VQ-VAE联合建模三种对齐的残基级视图,在结构预测任务中优于仅序列和先前结构感知表示模型。

详情
Comments
Project Page: https://holymollyhao.github.io/TriProRep/
AI中文摘要

生成建模的最新进展表明,预训练表示可以作为条件特征或对齐目标来改进生成。受此启发,我们研究用于预测结构(超越常规功能注释)的蛋白质表示。我们提出TriProRep,一种结构感知预训练方法,它联合建模三种对齐的残基级视图:氨基酸身份、主链几何和局部全原子几何,通过VQ-VAE分词器进行离散编码。通过预训练从生成器损坏的视图中恢复原始标记,TriProRep学会区分合理但不正确的跨视图增强与原始蛋白质。我们进一步引入RepSP,一个用于在结构预测设置中评估蛋白质表示的基准。RepSP测试表示的三种用途:从脱辅基链表示进行同源二聚体共折叠、同源二聚体衍生相互作用属性的残基级预测,以及表示对齐的单体结构预测。在这些任务中,TriProRep优于仅序列和先前的结构感知表示模型,同时在常规基准上保持竞争性能。

英文摘要

Recent advances in generative modeling show that pretrained representations can improve generation as conditioning features or alignment targets. Motivated by this, we study protein representations for predicting structures beyond conventional function annotation. We propose TriProRep, a structure-aware pretraining method that jointly models three aligned residue-level views: amino-acid identity, backbone geometry, and local full-atom geometry, discretely encoded via VQ-VAE tokenizers. By pretraining to recover original tokens from generator-corrupted views, TriProRep learns to distinguish plausible but incorrect cross-view augmentations from the original protein. We further introduce RepSP, a benchmark for evaluating protein representations in structure-predictive settings. RepSP tests three uses of representations: homodimer co-folding from apo-chain representations, residue-level prediction of homodimer-derived interaction properties, and representation-aligned monomer structure prediction. Across these tasks, TriProRep improves over sequence-only and prior structure-aware representation models, while maintaining competitive performance on conventional benchmarks.

2512.05794 2026-05-27 cs.LG cs.AI q-bio.QM

Mechanistic Interpretability of Antibody Language Models Using SAEs

使用 SAE 对抗体语言模型的机制可解释性研究

Rebonto Haque, Oliver M. Turnbull, Anisha Parsan, Nithin Parsan, John J. Yang, Anna L. Beukenhorst, Charlotte M. Deane

AI总结 本研究采用 TopK 和 Ordered 稀疏自编码器(SAE)对抗体语言模型进行机制可解释性分析,发现 TopK SAE 能揭示有意义的生物学潜在特征但无法保证生成控制,而 Ordered SAE 通过层次结构可靠识别可操控特征但激活模式更复杂。

详情
Comments
v3: 15 pages; corrected author list and affiliations in the main text; minor text changes; updated steering results following minor code changes; conclusions and findings remain unchanged; included link to data and code in the Data Availability section
AI中文摘要

稀疏自编码器(SAE)是一种机制可解释性技术,已被用于揭示大型蛋白质语言模型中学到的概念。在此,我们采用 TopK 和 Ordered SAE 来研究自回归抗体语言模型,并引导其生成。我们表明,TopK SAE 可以揭示有生物学意义的潜在特征,但高特征-概念相关性并不能保证对生成的因果控制。相比之下,Ordered SAE 施加了层次结构,能够可靠地识别可操控特征,但代价是激活模式更复杂且可解释性较低。这些发现推进了领域特异性蛋白质语言模型的机制可解释性,并表明,虽然 TopK SAE 足以将潜在特征映射到概念,但在需要精确生成引导时,Ordered SAE 更可取。

英文摘要

Sparse autoencoders (SAEs) are a mechanistic interpretability technique that have been used to provide insight into learned concepts within large protein language models. Here, we employ TopK and Ordered SAEs to investigate autoregressive antibody language models, and steer their generation. We show that TopK SAEs can reveal biologically meaningful latent features, but high feature-concept correlation does not guarantee causal control over generation. In contrast, Ordered SAEs impose a hierarchical structure that reliably identifies steerable features, but at the expense of more complex and less interpretable activation patterns. These findings advance the mechanistic interpretability of domain-specific protein language models and suggest that, while TopK SAEs suffice for mapping latent features to concepts, Ordered SAEs are preferable when precise generative steering is required.

2512.15534 2026-05-27 q-bio.PE

Characterizing Open-Ended Evolution Through Undecidability Mechanisms in Random Boolean Networks

通过随机布尔网络中的不可判定性机制刻画开放式进化

Amahury J. López-Díaz, Pedro Juan Rivera Torres, Gerardo L. Febres, Carlos Gershenson

AI总结 提出一个与模型无关的度量Ω,通过吸引子周期长度的驻留时间加权贡献来量化有限观测窗口内的动态特征,并在随机布尔网络中比较经典与非经典机制,发现不可判定性相关的状态依赖机制是持续新颖性的使能条件。

详情
Comments
16 pages, 2 figures; code and SM available at GitHub repo; submitted for publication to npj Systems Biology and Applications
AI中文摘要

离散动力学模型支撑着系统生物学,但我们仍然缺乏与基质无关的诊断方法,用于识别可能与开放式进化(OEE)相关的有限时间动态特征,例如反复产生新的表型状态,而不是快速稳定或非结构化噪声。我们引入了一个简单、与模型无关的度量Ω,它总结了在有限观测窗口内实现的反复发作序列中吸引子周期长度的驻留时间加权贡献。Ω对于单吸引子动力学为零,对于没有重复的纯新颖性也为零,而当轨迹反复进入多个持久循环表型时,Ω增加。使用随机布尔网络(RBN)作为受控测试平台,我们在同质和异质更新方案下比较了经典布尔动力学与生物启发的非经典机制(概率上下文切换、退火规则突变、超协调逻辑、模态必要/可能门控以及量子启发的叠加/配对态耦合)。我们的结果支持以下观点:不可判定性相邻的状态依赖机制——实现为概率上下文切换、模态必要/可能门控、超协调逻辑或量子启发的相关分支——是持续新颖性的使能条件。在手稿末尾,我们概述了Ω到连续/混合状态空间的实用扩展,将Ω定位为生物建模中OEE的可移植代理,并作为设计可进化合成电路的指南。

英文摘要

Discrete dynamical models underpin systems biology, but we still lack substrate-agnostic diagnostics for identifying finite-horizon dynamical signatures that may be relevant to open-ended evolution (OEE), such as the recurrent production of novel phenotypic states rather than rapid settling or unstructured noise. We introduce a simple, model-independent metric, Ω, that summarizes the residence-time-weighted contribution of attractor cycle lengths across the sequence of recurrent episodes realized within a finite observation window. Ω is zero for single-attractor dynamics and also vanishes for pure novelty without recurrence, while increasing when trajectories repeatedly enter multiple persistent cyclic phenotypes. Using Random Boolean Networks (RBNs) as a controlled testbed, we compare classical Boolean dynamics with biologically motivated non-classical mechanisms (probabilistic context switching, annealed rule mutation, paraconsistent logic, modal necessary/possible gating, and quantum-inspired superposition/paired-state coupling) under homogeneous and heterogeneous updating schemes. Our results support the view that undecidability-adjacent, state-dependent mechanisms -- implemented as probabilistic context switching, modal necessity/possibility gating, paraconsistent logic, or quantum-inspired correlated branching -- are enabling conditions for sustained novelty. At the end of our manuscript we outline a practical extension of Ω to continuous/hybrid state spaces, positioning Ω as a portable proxy for OEE in biological modeling and a guide for engineering evolvable synthetic circuits.

2503.21450 2026-05-27 cs.CE q-bio.BM

CMADiff: Cross-Modal Aligned Diffusion for Controllable Protein Generation

CMADiff: 用于可控蛋白质生成的跨模态对齐扩散

Changjian Zhou, Yuexi Qiu, Jia Song

AI总结 提出CMADiff框架,通过条件变分自编码器整合理化特征,并利用对比学习模块BioAligner对齐文本描述与蛋白质特征,实现基于文本驱动的可控蛋白质序列生成。

详情
AI中文摘要

AI辅助的蛋白质设计已成为推动生物技术发展的关键工具,深度生成模型在该领域已展现出可靠性。然而,现有模型主要利用蛋白质序列或结构数据进行训练,忽略了蛋白质的理化性质,且缺乏在直观条件下控制蛋白质生成的能力。为解决这些局限,我们提出CMADiff,一种新颖的框架,通过潜在扩散过程将蛋白质序列的理化性质与基于文本的描述对齐,实现可控蛋白质生成。具体而言,CMADiff采用条件变分自编码器(CVAE)将理化特征作为条件输入整合,形成捕获生物学特征的稳健潜在空间。在该潜在空间中,我们应用条件扩散过程,由BioAligner(一种基于对比学习的模块)引导,该模块将文本描述与蛋白质特征对齐,实现对蛋白质序列生成的文本驱动控制。通过包括AlphaFold3在内的一系列评估验证,实验结果表明CMADiff优于蛋白质序列生成基准,并具有未来应用的强大潜力。实现和代码可在 https://github.com/HPC-NEAU/PhysChemDiff 获取。

英文摘要

AI-assisted protein design has emerged as a critical tool for advancing biotechnology, as deep generative models have demonstrated their reliability in this domain. However, most existing models primarily utilize protein sequence or structural data for training, neglecting the physicochemical properties of proteins.Moreover, they are deficient to control the generation of proteins in intuitive conditions. To address these limitations,we propose CMADiff here, a novel framework that enables controllable protein generation by aligning the physicochemical properties of protein sequences with text-based descriptions through a latent diffusion process. Specifically, CMADiff employs a Conditional Variational Autoencoder (CVAE) to integrate physicochemical features as conditional input, forming a robust latent space that captures biological traits. In this latent space, we apply a conditional diffusion process, which is guided by BioAligner, a contrastive learning-based module that aligns text descriptions with protein features, enabling text-driven control over protein sequence generation. Validated by a series of evaluations including AlphaFold3, the experimental results indicate that CMADiff outperforms protein sequence generation benchmarks and holds strong potential for future applications. The implementation and code are available at https://github.com/HPC-NEAU/PhysChemDiff.

2603.16801 2026-05-27 cs.GR q-bio.TO

TAMP-OS: An Open-Source Workflow for Tactile 3D-Printable Lithographs

TAMP-OS:触觉3D可打印光刻胶的开源工作流

Robert Faulkner, Natalia Gonzalez-Vazquez, Victoria Gamez, Karly E. Cohen, Gunther Richter, Abigale Stangl, Andrew K. Schulz

AI总结 提出一个低成本、开源的工作流,用于从显微镜图像生成触觉光刻文件,通过3D打印实现科学图像的可触达性。

详情
Comments
3 figures, Abigale Stangl and Andrew K. Schulz are co-corresponding authors
AI中文摘要

描述一种动物而不使用“看”这个动词。你能有效地提供一种替代方法,在保留长度尺度的同时解释复杂的显微镜图像吗?世界上充满了我们肉眼无法看到的特征:壁虎脚上的刚毛、覆盖老鼠胡须的角质层,或蝙蝠翅膀的绒毛。此外,这些结构是非均匀的,常常从硬变软。我们提供了一个工作流,用于生成低数据、低成本、开源的光刻文件,使显微镜图像具有触觉可访问性。使用此工作流制作的光刻版可以在350美元的3D打印机上打印,使用小于100 Mb的3D文件,每次打印的总成本为0.75美元。这项工作旨在利用先进的3D打印技术创建触觉图形和艺术,使科学更易获取,并实现对生物结构的触觉探索。本文中的框架与一个将持续更新的GitHub仓库保持一致,使得随着3D打印和光刻技术在未来的发展,触觉媒体能够被创建。

英文摘要

Describe an animal without using the verb look. Can you effectively provide an alternative method for interpreting complex microscopy images while preserving the length scale? The world is filled with features too small for our eyes to see: the setae on a gecko's feet, the cuticles covering a rat's whisker, or the fuzziness of a bat's wing. Furthermore, these structures are non-homogeneous, often shifting from stiff to soft. We provide a workflow for producing low-data, low-cost, and open-source lithograph files, allowing tactile accessibility in microscopy images. The lithographs made with this workflow can be printed on a 350 USD 3D printer using 3D files under 100 Mb, for a total cost per print of 0.75 USD. This work seeks to leverage advanced 3D printing to create tactile graphics and art that make science more accessible and enable tactile exploration of biological structures. This framework in this text is aligned with a GitHub repository that will be constantly updated, allowing tactile media to be created as 3D printing and lithography become more streamlined in the years to come.

2601.20670 2026-05-27 q-bio.PE cond-mat.stat-mech nlin.AO

Noise-induced excitability: bloom, bust and extirpation in autotoxic population dynamics

噪声诱导的兴奋性:自体毒性种群动态中的繁荣、崩溃与灭绝

Pablo Moreno-Spiegelberg, Javier Aguilar

AI总结 本研究通过随机框架(源于个体模型)描述种群在环境反馈滞后下的繁荣-崩溃-灭绝动力学,识别噪声驱动的阈值行为,并刻画从兴奋态到持久态的转变,为理解不可逆转变提供理论基础。

详情
AI中文摘要

物种种群在生长过程中常常会改变其环境。当环境反馈的运作速度慢于种群增长时,系统可能经历繁荣-崩溃动力学,即种群数量超过其承载能力,随后崩溃。在极端情况下,这种崩溃会导致完全灭绝。虽然确定性模型通常无法捕捉这些有限时间的灭绝事件,但我们提出了一个随机框架,源自基于个体的模型,来描述繁荣-崩溃-灭绝动力学。我们识别出一种噪声驱动的阈值行为,其中根据初始条件,种群要么经历“繁荣”,要么在扩张发生之前灭绝。此外,我们刻画了从兴奋态(其中大多数轨迹在第一次崩溃后立即被吸收态捕获)到持久态(其中大多数种群达到亚稳态)的转变。我们证明这一转变由噪声强度和环境-种群时间尺度比控制。该框架为理解入侵物种、植物演替、微生物动力学以及癌症肿瘤消除中的不可逆转变提供了理论基础。

英文摘要

Species populations often modify their environment as they grow. When environmental feedback operates more slowly than population growth, the system can undergo boom-bust dynamics, where the population overshoots its carrying capacity and subsequently collapses. In extreme cases, this collapse leads to total extinction. While deterministic models typically fail to capture these finite-time extinction events, we propose a stochastic framework, derived from an individual-based model, to describe boom-bust-extirpation dynamics. We identify a noise-driven, threshold-like behavior where, depending on initial conditions, the population either undergoes a ``boom'' or is extirpated before the expansion occurs. Furthermore, we characterize a transition between an excitable regime, where most trajectories are captured by the absorbing state immediately after the first bust, and a persistent regime, where most populations reach a metastable state. We show that this transition is governed by the noise strength and the ratio of environmental-to-population timescales. This framework provides a theoretical basis for understanding irreversible transitions in invasive species, plant succession, microbial dynamics, and the elimination of cancerous tumors.

2502.19063 2026-05-27 q-bio.PE cond-mat.dis-nn nlin.CD physics.soc-ph

Global population crisis scenarios predicted by a general nonlinear dynamical model

一般非线性动力学模型预测的全球人口危机情景

Alessio Zaccone, Kostya Trachenko

AI总结 本文通过一个简单的非线性微分方程(最初在无序系统物理学中研究)数学描述了全球过去12000年的人口增长,并预测了未来情景,包括在承载力限制突然生效的保守最坏情况下,全球人口最早可能在2064年减半。

详情
Journal ref
Chaos, Solitons & Fractals 209, 118542 (2026)
AI中文摘要

我们证明了一个简单的非线性微分方程(最初在无序系统物理学中研究)能够数学描述过去12000年的全球人口增长。从新石器时代早期至今的不同人口增长阶段都被证明是同一非线性微分方程在其各种极限下的所有解。这些还包括著名的马尔萨斯(指数)和Verhulst(逻辑斯蒂)增长模型,以及von Foerster的“世界末日”公式。所有这些极限对应于忽略所提出的非线性微分方程描述的更一般非线性动态模型中的高阶项。虽然较旧的模型可能为全球人口增长曲线在有限时间间隔内提供有效的拟合,但它们明显的近似性质阻止了它们在更长时间内的预测能力。相反,所提出的模型的综合解决方案非常适合提供未来情景的预测。这些情景包括:在故意保守的最坏情况下,即承载力限制从今天突然生效,全球人口最早可能在2064年减半。

英文摘要

We show that a simple nonlinear differential equation (originally studied in the physics of disordered systems) is able to mathematically describe the global population growth over the past 12000 years. Different regimes of population growth since the early Neolithic until today are shown to be all solutions to the same nonlinear differential equation in its various limits. These also include the well-known Malthus (exponential) and Verhulst (logistic) growth regimes, as well as von Foerster's ``doomsday'' formula. All these limits correspond to neglecting higher-order terms in a more general nonlinear dynamic model described by the proposed nonlinear differential equation. While the older models may provide valid fittings to limited time intervals in the global population growth curve in time, their clearly approximate nature prevents them from being predictive over longer periods of time. The proposed comprehensive solution of the proposed model is instead well suited to provide predictions for future scenarios. These include a scenario where the global population could halve as early as 2064 under a deliberately conservative, worst-case assumption that carrying-capacity constraints become abruptly active today.

2512.08355 2026-05-27 q-bio.PE physics.soc-ph

Joint economic and epidemiological modelling of alternative pandemic response strategies

联合经济与流行病学建模:替代性大流行应对策略

M J Plank, M Sushames, T Fisher-Taylor, A Afshari, R N Thompson, A Hurford, S C Hendy

AI总结 提出一个结合健康和经济成本的模型框架,比较缓解、抑制和消除策略在不同流行病学和经济参数下的总成本,发现疾病严重程度和基本再生数R0影响最优策略选择,并以新西兰2020年新冠应对为例验证。

详情
AI中文摘要

在新出现的大流行中,政策制定者需要在信息有限的情况下做出重要决策,例如在缓解、抑制或消除策略之间进行选择。这些策略可能需要在流行病对健康的影响与应对措施的经济成本之间进行权衡。数学模型是一种有用的工具,可以帮助理解不同政策选择对未来疫情动态和影响的后果。大多数模型侧重于直接健康影响,忽略了控制措施的经济成本。在此,我们引入一个同时考虑健康和经济成本的模型框架。我们使用该框架比较缓解、抑制和消除策略在不同流行病学和经济参数下的预期总成本。我们发现,对于低严重性的疾病,缓解往往是最具成本效益的选择。对于更严重的疾病,如果基本再生数$R_0$相对较低,抑制往往最具成本效益;而如果$R_0$较高,消除则更具成本效益。我们以新西兰2020年对Covid-19大流行的消除应对为例,将我们的框架锚定于一个真实世界案例研究。我们发现,新西兰Covid-19的参数估计值接近或高于消除比缓解更具成本效益的阈值。我们得出结论,我们提出的框架作为未来大流行威胁的决策支持工具具有前景,尽管还需要进一步工作来考虑人群异质性及其他与决策相关的因素。

英文摘要

In an emerging pandemic, policymakers need to make important decisions with limited information, for example choosing between a mitigation, suppression or elimination strategy. These strategies may require trade-offs to be made between the health impact of the pandemic and the economic costs of the interventions introduced in response. Mathematical models are a useful tool that can help understand the consequences of alternative policy options on the future dynamics and impact of the epidemic. Most models have focused on direct health impacts, neglecting the economic costs of control measures. Here, we introduce a model framework that captures both health and economic costs. We use this framework to compare the expected aggregate costs of mitigation, suppression and elimination strategies, across a range of different epidemiological and economic parameters. We find that for diseases with low severity, mitigation tends to be the most cost-effective option. For more severe diseases, suppression tends to be most cost effective if the basic reproduction number $R_0$ is relatively low, while elimination tends to be more cost-effective if $R_0$ is high. We use the example of New Zealand's elimination response to the Covid-19 pandemic in 2020 to anchor our framework to a real-world case study. We find that parameter estimates for Covid-19 in New Zealand put it close to or above the threshold at which elimination becomes more cost-effective than mitigation. We conclude that our proposed framework holds promise as a decision-support tool for future pandemic threats, although further work is needed to account for population heterogeneity and other factors relevant to decision-making.

2511.13611 2026-05-27 cs.SE q-bio.QM

BIOMERO 2.0: end-to-end FAIR infrastructure for bioimaging data import, analysis, and provenance

BIOMERO 2.0:用于生物成像数据导入、分析和溯源的端到端FAIR基础设施

Torec T. Luik, Joost de Folter, Rodrigo Rosas-Bertolini, Eric A. J. Reits, Ron A. Hoebe, Przemek M. Krawczyk

AI总结 本文提出BIOMERO 2.0框架,通过集成数据导入、预处理、分析和工作流监控的OMERO.web插件及容器化组件,将OMERO转化为符合FAIR原则且具有溯源能力的生物成像平台。

详情
Comments
16 pages, 2 figures, 25 pages supplemental information; for software, see https://github.com/Cellular-Imaging-Amsterdam-UMC/NL-BIOMERO
AI中文摘要

我们提出BIOMERO 2.0,这是BIOMERO框架的一次重大演进,它将OMERO转变为一个符合FAIR(可发现、可访问、可互操作、可重用)原则、具有溯源能力的生物成像平台。BIOMERO 2.0通过OMERO.web插件和容器化组件集成了数据导入、预处理、分析和工作流监控。导入子系统通过容器化预处理和表单元数据丰富促进原位导入,而分析子系统通过BIOMERO Python库协调和跟踪高性能计算系统上的容器化分析。所有导入和分析都记录参数、版本和结果,确保通过集成仪表盘实时访问溯源信息。这种双重方法将OMERO置于生物成像分析过程的核心:导入器确保从图像采集到预处理及导入OMERO的溯源,而分析器则记录下游处理的溯源。这些集成层增强了OMERO的FAIR化,支持可追溯、可重用的图像分析工作流,弥合了数据导入、分析和共享之间的差距。

英文摘要

We present BIOMERO 2.0, a major evolution of the BIOMERO framework that transforms OMERO into a FAIR-compliant (findable, accessible, interoperable, and reusable), provenance-aware bioimaging platform. BIOMERO 2.0 integrates data import, preprocessing, analysis, and workflow monitoring through an OMERO.web plugin and containerized components. The importer subsystem facilitates in-place import using containerized preprocessing and metadata enrichment via forms, while the analyzer subsystem coordinates and tracks containerized analyses on high-performance computing systems via the BIOMERO Python library. All imports and analyses are recorded with parameters, versions, and results, ensuring real-time provenance accessible through integrated dashboards. This dual approach places OMERO at the heart of the bioimaging analysis process: the importer ensures provenance from image acquisition through preprocessing and import into OMERO, while the analyzer records it for downstream processing. These integrated layers enhance OMEROs FAIRification, supporting traceable, reusable workflows for image analysis that bridge the gap between data import, analysis, and sharing.

2507.13762 2026-05-27 cs.LG q-bio.BM

MolPIF: A Parameter Interpolation Flow Model for Molecule Generation

MolPIF: 一种用于分子生成的参数插值流模型

Yaowei Jin, Junjie Wang, Yufan Tang, Wenkai Xiang, Duanhua Cao, Dan Teng, Zhehuan Fan, Jiacheng Xiong, Xia Sheng, Chuanlong Zeng, Duo An, Mingyue Zheng, Shuangjia Zheng, Qian Shi

AI总结 提出参数插值流模型MolPIF,通过参数空间分布插值统一连续坐标与离散原子类型的生成,在CrossDocked2020数据集上优于基线方法。

详情
Comments
Accepted to Bioinformatics
AI中文摘要

动机:基于结构的药物设计(SBDD)随着深度生成模型的发展而进步,但弥合连续原子坐标与离散原子类型之间的差距仍然是一个挑战。当前的方法,如扩散和流匹配模型,通常未能统一这些异质模态,依赖于分离的策略或对离散变量不合适的欧几里得度量。缺乏一致的框架限制了生成模型捕捉蛋白质-配体复合物的几何和化学结构的能力。结果:我们提出了MolPIF,一种参数插值流机制,旨在统一连续和离散分子变量的生成。与在样本空间中运行的传统流模型不同,MolPIF在参数空间中对分布进行插值,理论上恢复了连续坐标的Wasserstein-2最优传输,并建立了离散原子类型的Fisher-Rao测地线。我们进一步整合了几何增强学习策略,以改善原子上下文的捕捉。在CrossDocked2020数据集上的广泛评估表明,MolPIF在结合亲和力、化学有效性、几何保真度和化学空间覆盖方面优于基线。此外,MolPIF在先导优化中表现出多功能性,并提供灵活的先验分布选择(如Laplace),为SBDD建立了一个稳健的范式。可用性:源代码可在https://github.com/BLEACH366/MolPIF免费获取。补充信息:补充数据可在Bioinformatics上获取。

英文摘要

Motivation: Structure-based drug design (SBDD) has advanced with deep generative models, but bridging the gap between continuous atomic coordinates and discrete atom types remains a challenge. Current approaches, such as diffusion and flow matching models, often fail to unify these heterogeneous modalities, relying on separate strategies or ill-fitting Euclidean metrics for discrete variables. This lack of a consistent framework limits generative models' ability to capture the geometric and chemical structure of protein-ligand complexes. Results: We present MolPIF, a parameter interpolation flow mechanism designed to unify the generation of continuous and discrete molecular variables. Unlike traditional flow models that operate in sample space, MolPIF interpolates between distributions in the parameter space, theoretically recovering Wasserstein-2 optimal transport for continuous coordinates and establishing Fisher-Rao geodesics for discrete atom types. We further incorporate a geometry-enhanced learning strategy to improve the capture of atomic contexts. Extensive evaluations on the CrossDocked2020 dataset demonstrate that MolPIF outperforms baselines in binding affinity, chemical validity, geometric fidelity and chemical space coverage. Additionally, MolPIF exhibits versatility in lead optimization and offers flexible prior distribution selection (such as Laplace), establishing a robust paradigm for SBDD. Availability: Source code is freely available at https://github.com/BLEACH366/MolPIF. Supplementary information: Supplementary data are available at Bioinformatics.

2507.07295 2026-05-27 cond-mat.stat-mech physics.bio-ph q-bio.MN

Local imperfect feedback control in non-equilibrium biophysical systems enabled by thermodynamic constraints

非平衡生物物理系统中由热力学约束实现的局部不完美反馈控制

Carlos Floyd, Aaron R. Dinner, Suriyanarayanan Vaikuntanathan

AI总结 本文通过马尔可夫跳变过程框架推导非平衡响应约束,发现热力学约束可确保稳态响应符号固定,使得简单的局部反馈规则无需全局信息即可实现全局稳定控制,并解释了趋化、转录调控等生物现象。

详情
Comments
28 pages
AI中文摘要

生物网络如何在依赖不完美的局部信息的情况下实现稳健控制仍然是一个重要的开放问题。在这里,我们识别出热力学约束可以严重限制非平衡稳态响应,以至于即使粗糙的局部反馈规则也能实现全局稳定的控制,而无需精确的网络设计或全局信息。具体地,使用马尔可夫跳变过程作为生物物理动力学的一般框架,我们推导了通用的非平衡响应约束,表明对于许多类速率扰动,稳态响应在所有驱动强度下具有固定符号,因此近平衡响应可以预测远离平衡的行为,无论系统复杂度如何。这些约束澄清了几个生物现象:当扰动作用于单个跃迁速率时,单调性在热力学上得到保证;而非单调响应(例如在转录因子调控中观察到的)仅在输入同时调节多个速率时出现。即使在这种情况下,我们识别出一个称为“相干性”的图论概念,它允许恢复单调性。我们展示了相干性如何自然且普遍地出现在经典的适应生物物理模型中,包括大肠杆菌趋化性和转录因子调控(当包含网络参数化的生物约束时)。接下来,我们证明,在控制理论框架内,这些约束保证了对小部分动力学速率的简单线性反馈能够实现全局稳定的跟踪和适应,而无需协调多个变量的操纵。对于具有一个调节器的系统,局部稳定性意味着任意网络拓扑下的全局稳定性,无需精细调谐,揭示了非平衡热力学从根本上约束了生化网络响应。

英文摘要

How biological networks achieve robust control despite relying on imperfect, local information remains an important open question. Here, we identify thermodynamic constraints that can curtail non-equilibrium steady-state responses so severely that even crude, local feedback rules can achieve globally stable control without requiring precise network design or global information. Specifically, using Markov jump processes as a general framework for biophysical dynamics, we derive general non-equilibrium response constraints showing that for many classes of rate perturbations, steady-state responses have fixed signs across all driving strengths, so that near-equilibrium responses predict far-from-equilibrium behavior regardless of system complexity. These constraints clarify several biological phenomena: monotonicity is thermodynamically guaranteed whenever a perturbation acts on a single transition rate, and non-monotonic responses, as observed for example in transcription factor regulation, arise only when an input simultaneously modulates multiple rates. Even in this case, we identify a graph-theoretic concept termed ``coherence'' that allows for a restoration of monotonicity. We show how coherence naturally and generally emerges in classic biophysical models of adaptation, including E. coli chemotaxis, and transcription factor regulation when biological constraints on network parameterization are included. We next show that, within a control-theoretic framework, these constraints guarantee that simple linear feedback on small subsets of kinetic rates achieves globally stable tracking and adaptation without coordinated manipulation of many variables. For systems with one regulator, local stability implies global stability for arbitrary network topologies without fine tuning, revealing that non-equilibrium thermodynamics fundamentally constrains biochemical network responses.