arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.30275 2026-05-29 cs.LG q-bio.QM

Digitally enriching a screening population for pancreatic cancer using routine blood-based measures and clinical histories

利用常规血液检测指标和临床病史对胰腺癌筛查人群进行数字富集

Chris Varghese, Leo Y. Li-Han, Richa Bisht, Ellen Larson, Frank Lee, Ryan M. Carr, Tanios S. Bekaii-Saab, Shounak Majumder, John D. Halamka, Mark Truty, Ajit H. Goenka, Hojjat Salehinejad, Cornelius A. Thiels

AI总结 提出基于Transformer的多头注意力神经网络,利用纵向诊断编码和血液检测序列预测胰腺癌风险,实现提前1-3年风险分层,为人群级数字富集筛查奠定基础。

详情
AI中文摘要

早期检测胰腺癌是扩大治愈性治疗可及性和减少癌症死亡的关键;然而,目前筛查并不可行。病理的潜在指标体现在个体的疾病和血液检测轨迹中,可能预测胰腺癌的发展。利用患者在临床互动过程中积累的纵向诊断编码和血液检测值序列,训练了一个基于Transformer的定制神经网络,采用多头注意力机制,以提前多年预测胰腺癌风险,并对人群进行风险分层以进行靶向筛查。该队列包括6,017名胰腺癌成人患者和177,081名对照(总体中位年龄75岁,45%女性),在胰腺癌诊断前拥有中位12年(四分位距6.9-16.2)的病史。通过留一站点法进行外部验证,在诊断前1年、2年和3年预测胰腺癌,受试者工作特征曲线下面积均值分别为0.837(95%置信区间0.827-0.848)、0.797(95%置信区间0.782-0.813)和0.760(95%置信区间0.745-0.776)。估计的胰腺癌风险校准良好(校准图斜率1.08,截距-0.077;Brier评分0.025),贝叶斯人群胰腺癌患病率更新使得估计的癌症风险输出可跨环境迁移。在测试中,1年内胰腺癌风险>3.3%的筛查阈值提供了18.2的诊断优势比。因此,我们的工作为第一个人群级数字富集工具奠定了基础,以扩大胰腺癌治愈性管理的可及性。

英文摘要

Earlier detection of pancreatic cancer is key to enabling wider access to curative treatment and reducing cancer deaths; however, screening is presently not viable. Latent indicators of pathology are evident in an individual's disease and blood test trajectories and may predict the development of pancreatic cancer. Longitudinal sequences of coded diagnoses and blood test values accrued by patients throughout their clinical interactions were used to train a custom Transformer-based neural network with a multi-head attention mechanism to predict risk of pancreatic cancer with a multi-year lead time and risk-stratify populations for targeted screening. The cohort comprised 6,017 adults with pancreatic cancer and 177,081 controls (overall median age 75, 45% female) with median 12 years (interquartile range 6.9-16.2) of medical history prior to pancreatic cancer diagnosis. External validation via leave-one-site-out, out-of-sample testing predicting pancreatic cancer 1-, 2-, and 3-years prior to diagnosis demonstrated mean area under the receiver operating characteristic of 0.837 (95% confidence interval 0.827-0.848), 0.797 (95% confidence interval 0.782-0.813), and 0.760 (95% confidence interval 0.745-0.776), respectively. Estimated pancreatic cancer risks were well-calibrated (calibration plot slope 1.08, intercept of -0.077; Brier score 0.025), and a Bayesian population pancreatic cancer prevalence update allows estimated cancer risk outputs to be transportable across settings. At testing, a screening threshold of >3.3% risk of pancreatic cancer in 1-year offered a diagnostic odds ratio of 18.2. Our work therefore lays the foundation for a first population-level digital enrichment tool to widen access to curative-intent management of pancreatic cancer.

2605.30109 2026-05-29 q-bio.PE q-bio.QM

Training Ecosystems: A Computational Approach to Uncovering Learning Behavior in Unconventional Contexts

训练生态系统:一种揭示非传统背景下学习行为的计算方法

Adrita Samanta, Hananel Hazan, Michael Levin

AI总结 通过模拟经典捕食者-猎物模型并探索超过22万种参数组合,发现生态动力学足以实现习惯化、敏感化和离散数字学习,且学习能力主要由生态交互强度决定。

详情
Comments
26 pages, 14 figures
AI中文摘要

近期在多样化智能方面的进展表明,在生物体水平以下(单细胞甚至分子网络)存在简单的学习能力。然而,关于生物体水平以上的学习能力,以及完全由动态相互作用实现而无显式记忆介质的记忆,仍存在许多知识空白。我们证明,最小的生态动力学(在计算机中)足以实现几种类型的学习,通过响应幅度和恢复时间的变化进行测定。对模拟经典捕食者-猎物模型中超过22万种参数组合的系统探索揭示,当受到刺激扰动时,恢复时间表现出习惯化、敏感化以及一种尺度不变形式的离散数字学习。鲁棒性分析表明,习惯化和敏感化在随机扰动下持续存在,而离散数字学习即使在低噪声水平下也会被破坏。降维分析显示,学习能力的发生主要由生态交互强度决定。参数空间中清晰、独特的聚类模式使得对能够实现学习的新参数组合具有高预测准确性。响应幅度显示出显著的不对称性:90.6%的参数组合表现出恢复时间敏感化与响应幅度习惯化配对,而相反模式极为罕见。这些发现突显了生态学、基础认知和数学交叉领域的一系列现象,对许多可由类似方程描述的系统具有广泛意义。这些特性为生物学和工程学中的众多努力提供了一个具有相当大预模式学习倾向的基质,这种倾向最终源于数学,而不依赖于物理或生物学的细节。

英文摘要

Recent progress in diverse intelligence has shown simple learning capacities below the organism level - single cells and even molecular networks. However, there are still many knowledge gaps around learning capacity above the organism level, and about memory implemented purely by dynamical interactions without explicit memory media. We demonstrate that minimal ecological dynamics (in silico) are sufficient for several kinds of learning, assayed as changes in both, magnitude of response, and of recovery time. Systematic exploration of over 220,000 parameter combinations in a simulated classic predator-prey model revealed that, when perturbed by stimuli, recovery time exhibits habituation, sensitization, and a form of discrete number learning in a scale-invariant manner. Robustness analysis revealed that habituation and sensitization persist under stochastic perturbations, while discrete number learning is disrupted even at low noise levels. Dimensionality reduction revealed that the incidence of learning capacity is primarily determined by ecological interaction strengths. Clear, unique clustering patterns in parameter space allow high prediction accuracy for novel parameter combinations that enable learning. Response magnitude revealed a striking asymmetry: 90.6% of parameter combinations exhibited recovery time sensitization paired with habituation of response magnitude, while the opposite pattern was extremely rare. These findings highlight a set of phenomena at the intersection of ecology, basal cognition, and mathematics with many implications for a wide range of systems describable by similar kinds of equations. These properties provide numerous efforts in biology and engineering with a substrate that has considerable, pre-patterned, propensity for learning, which ultimately arises from mathematics, not depending on the details of physics or biology.

2605.29958 2026-05-29 q-bio.PE cond-mat.stat-mech math.PR nlin.PS

Lattice Brownian bees with cooperative reproduction: steady states, collapse, and spreading

具有合作繁殖的格子布朗蜜蜂:稳态、坍缩与扩散

Ohad Vilk, Baruch Meerson

AI总结 通过流体动力学自由边界问题研究具有合作繁殖的格子布朗蜜蜂模型,揭示了不同繁殖阶数下的稳态、线性稳定性、有限时间坍缩和扩散行为。

详情
Comments
23 one-column pages, 6 figures
AI中文摘要

我们将Berestycki等人(2021, 2022)的“布朗蜜蜂”模型扩展到合作繁殖,即$kA\to(k{+}1)A$,其中种群由$N$个对称随机游走者组成,每次出生事件中移除离原点最远的粒子。在$N\to\infty$极限下,我们为该模型制定了一个流体动力学自由边界问题。利用这一形式,我们确定了所有$k$的稳态种群密度,并证明了$k\le 2$时的线性稳定性和$k\ge 4$时的不稳定性。在临界情况$k=3$下,存在一个完整的连续稳态族,对应于繁殖速率与扩散速率的单一临界比值。高于临界值时,种群经历渐近自相似的有限时间坍缩至原点。低于临界值时,种群扩散性地传播,但繁殖在定量上仍然重要。对于$k\ge 4$,不稳定稳态将有限时间坍缩与扩散传播区域分开。这里的坍缩动力学是渐近自相似的,种群密度表现出尺度分离,需要匹配渐近描述。我们的分析预测通过流体动力学自由边界问题的数值解和原始微观模型的蒙特卡洛模拟得到证实。

英文摘要

We extend the ``Brownian bees'' model of Berestycki et al. (2021, 2022) to cooperative reproduction, $kA\to(k{+}1)A$, of a population of $N$ symmetric random walkers with removal, at each birth event, of the particle farthest from the origin. Working in the limit $N\to\infty$, we formulate a hydrodynamic free-boundary problem for this model. Using this formalism, we determine steady state population densities for all~$k$ and prove their linear stability for $k\le 2$ and instability for $k\ge 4$. In the marginal case $k=3$, there is a whole continuous family of steady states at a single, critical ratio of the reproduction and diffusion rates. Above criticality the population undergoes an asymptotically self-similar finite-time collapse to the origin. Below the criticality the population spreads diffusively, but the reproduction remains quantitatively relevant. For $k\ge 4$, the unstable steady state separates regimes of a finite-time collapse and a diffusive spreading. Here the collapse dynamics is asymptotically self-similar, and the population density exhibits a scale separation requiring a matched-asymptotic description. Our analytical predictions are confirmed by numerical solutions of the hydrodynamic free-boundary problem and by Monte Carlo simulations of the original microscopic model.

2605.29907 2026-05-29 q-bio.QM

Stochastic network epidemic model and particle filter: General framework and application to influenza in Japan

随机网络流行病模型与粒子滤波:通用框架及其在日本流感中的应用

Ihtisham Ul Haq, Serge Richard

AI总结 提出基于粒子滤波的数据同化框架,用于图基随机流行病系统的状态和参数估计,并通过日本流感数据验证其有效性。

详情
AI中文摘要

在随机且部分观测的生物系统中,参数推断和状态估计仍然是数学生物学中的主要问题。本文引入了一个二维晶格图模型来模拟传染病的传播。由于随机性和不完全观测,基于图的随机流行病系统中的状态和参数估计尤其具有挑战性。为了解决这些问题,我们提出了一种基于粒子滤波的数据同化框架,用于顺序估计模型状态和未知参数。发展了两种方法:一种基于感染个体数量,另一种基于二维晶格上感染个体的部分空间位置信息。首先使用合成数据分析和验证了两种方法的性能,然后将第一种方法应用于2024年7月至2025年12月期间日本不同县收集的流感数据。还使用当前周数据进行了为期一周的预测模拟。研究结果突显了所提出的粒子滤波框架在实时流行病监测、预测和适应性公共卫生决策中的有效性。

英文摘要

Parameter inference and state estimation in stochastic and partially observed biological systems remain major problems in mathematical biology. In this work, we introduce a two-dimensional lattice graph model for the spread of infectious diseases. Estimating states and parameters in graph-based stochastic epidemic systems is particularly challenging because of randomness and incomplete observations. To address these issues, we propose a particle filter based data assimilation framework for the sequential estimation of both model states and unknown parameters. Two methodologies are developed: one based on the number of infected agents and another based on partial spatial location's information of infected agents on a two-dimensional lattice. The performance of the two methods are firstly analyzed and validated using synthetic data, and the first method is then applied to influenza data collected from different prefectures in Japan between July 2024 and December 2025. One-week-ahead forecasting simulations are also performed using current weekly data. The findings highlight the effectiveness of the proposed PF framework for real-time epidemic monitoring, forecasting, and adaptive public health decision-making.

2605.29736 2026-05-29 q-bio.PE math.PR

Phylogenetic dynamics of MRCA ages and empirical moments of a Brownian trait

MRCA年龄的系统发育动力学及布朗性状的经验矩

Gilles Didier

AI总结 研究布朗性状在系统发育树上前两个经验矩的时间动态,通过MRCA年龄分布推导方差表达式,并在广义生灭过程中给出显式公式。

详情
AI中文摘要

我们研究了系统发育树上布朗性状前两个经验矩的时间动态。对于固定树,我们刻画了在任何给定时间所有现存谱系中经验均值和经验方差的分布。特别地,我们证明了经验均值的方差和经验期望方差在多样化事件之间是分段线性的。对于谱系齐次的随机树,经验均值的方差和经验期望方差都可以用均匀抽样的一对现存谱系的最远共同祖先(MRCA)的期望年龄来表示。在这种表示中,期望MRCA年龄以相反符号进入这两个量,表明经验均值的方差与经验期望方差之间存在结构性对立。对于具有时间依赖性物种形成和灭绝速率的广义生灭过程,我们推导了均匀抽样的一对现存谱系的MRCA年龄分布的显式公式。这给出了在任何时刻经验均值的方差和经验期望方差的积分表达式。在恒定速率生灭情况下,我们进一步获得了经验期望方差的闭式表达式,并描述了其在超临界、临界和次临界状态下的渐近行为。

英文摘要

We study the temporal dynamics of the first two empirical moments of Brownian traits on phylogenetic trees. For a fixed tree, we characterize the distributions of their empirical mean and empirical variance across all lineages extant at any given time. In particular, we show that the variance of the empirical mean and the expected empirical variance are piecewise linear between diversification events. For lineage-homogeneous random trees, both the variance of the empirical mean and the expected empirical variance can be expressed in terms of the expected age of the most recent common ancestor (MRCA) of a uniformly sampled pair of extant lineages. In this representation, the expected MRCA age enters the two quantities with opposite signs, pointing to a structural opposition between the variance of the empirical mean and the expected empirical variance. For generalized birth-death processes with time-dependent speciation and extinction rates, we derive an explicit formula for the distribution of the MRCA age of a uniformly sampled pair of extant lineages. This yields integral expressions, at any time, for both the variance of the empirical mean and the expected empirical variance. In the constant-rate birth-death case, we further obtain closed-form expressions for the expected empirical variance and describe its asymptotic behavior in the supercritical, critical and subcritical regimes.

2605.29703 2026-05-29 q-bio.NC cs.CV q-bio.TO

Subcortical Shape Variations and Their Associations with Cognition Across the 8th Decade of Life. A Study in the Lothian Birth Cohort 1936

皮层下形状变化及其与第八个十年生命期认知的关联:洛锡安出生队列1936研究

Maria del C. Valdes-Hernandez, Wonjung Park, Joanna Moodie, Susana Muñoz Maniega, Janie Corley, Fraser N. Sneden, Mark E. Bastin, Joanna M. Wardlaw, Simon R. Cox, Jinah Park

AI总结 利用洛锡安出生队列1936的纵向数据,通过ANCOVA和混合线性模型分析,研究第八个十年中皮层下结构的形状变化及其与认知老化的关联。

详情
Comments
34 pages
AI中文摘要

对正常个体脑形态变化的研究可能捕捉到与功能相关的脑老化方面,而这些方面不一定完全由总体积测量所指示。尽管皮层下脑结构在认知中起重要作用,但其形态轨迹与认知老化之间的关联尚未被记录。我们利用来自一项大型认知老化纵向研究——洛锡安出生队列1936——的神经影像、人口统计学和认知数据,探索社区居住个体在第八个十年生命期中皮层下脑结构的形状变化。我们使用ANCOVA和混合线性模型分析研究这些变化与认知老化的关联。皮层下形状变化是异质性的,在整个时期呈现不同的萎缩模式。海马体和腹侧DC经历了不同的形态变形(相对于其基线点),左右半球不同,而丘脑和苍白球形状则经历了更均匀的体积收缩,几乎在不同时间线上对称。一般认知的变化主要与时间点之间的向内和向外顶点位移相关。

英文摘要

The study of brain morphology changes in normal individuals may capture aspects of functionally-relevant brain aging not fully indicated by gross volumetry. Despite the important role of subcortical brain structures in cognition, the associations between their morphological trajectories and cognitive changes in aging have not been documented. We use neuroimaging, demographic, and cognitive data from a large longitudinal study of cognitive aging, the Lothian Birth Cohort 1936, to explore shape changes in subcortical brain structures of community-dwelling individuals across their 8th decade of life. We investigate the association of these changes with cognitive aging using ANCOVA and mixed linear model analyses. Subcortical shape changes were heterogeneous, with varied atrophy patterns across whole period. The hippocampus and the ventral DC experienced varied morphological deformations (from its baseline point) different in left and right hemispheres, while the thalami and globus pallidi shapes, for example, experienced a more uniform volume contraction, nearly symmetrical throughout different timelines. Changes in general cognition were mainly associated with inwards and outwards vertex displacements between the time-points.

2605.29677 2026-05-29 cs.HC eess.SP q-bio.NC

Embodied Virtual Reality Feedback Reshapes Neural Representations to Support Continuous Three-Dimensional Motor Imagery Decoding

具身虚拟现实反馈重塑神经表征以支持连续三维运动想象解码

Niall McShane, Attila Korik, Karl McCreadie, Naomi Du Bois, Darryl Charles, Damien Coyle

AI总结 本研究通过十名参与者的纵向实验,首次系统探究了具身虚拟现实反馈在实时三维虚拟肢体运动想象控制中的作用,发现VR反馈显著优于屏幕反馈,能提升解码性能并诱发更可解码和泛化的神经表征。

详情
Comments
28 pages, 7 figures, 3 tables. Submitted to Nature Biomedical Engineering. Data to be made available via Zenodo (DOI: 10.5281/zenodo.16047021)
AI中文摘要

连续脑机接口(BCI)通过解码想象运动中的运动轨迹提供直观的运动控制,然而反馈模态和纵向训练如何塑造神经表征和解码性能仍知之甚少。我们首次系统研究了在由运动想象驱动的实时三维虚拟肢体控制过程中,具身虚拟现实(VR)反馈的作用,涉及十名参与者的十个纵向会话。使用三种策略评估性能:实际在线性能(固定解码器泛化,FDG)、周期性再训练(顺序自适应训练,SAT)和会话内上限估计(会话内重建,WSR)。CNN-LSTM解码器在VR下实现了会话内想象运动相关性r = 0.762,在屏幕反馈下为r = 0.672。VR在所有策略和运动维度上均显著优于屏幕反馈(改进8.9-13.0%,所有p <= 0.002,d = 1.42-2.05)。这种优势在无需再训练的固定解码器下持续存在,表明具身VR反馈能诱发本质上更可解码和泛化的神经表征。线性混合效应模型证实了反馈模态和运动轴的主效应显著,且无交互作用。在神经生理学上,VR产生了更强的感觉运动-顶叶去同步和增强的运动-额叶功能连接,所有频带均涉及广泛的前岛叶活动,并增加了上顶叶小叶耦合,这与真实运动执行相关的模式相似。这些发现确立了具身空间反馈作为下一代面向直观运动控制和神经康复的连续BCI的关键设计原则。

英文摘要

Continuous brain-computer interfaces (BCIs) that decode motion trajectories from imagined movement offer intuitive motor control, yet how feedback modality and longitudinal training shape neural representations and decoding performance remains poorly understood. We present the first systematic investigation of embodied virtual reality (VR) feedback during real-time 3D virtual limb control driven by motor imagery, across ten longitudinal sessions in ten participants. Performance was evaluated using three strategies: actual online performance (Fixed Decoder Generalisation, FDG), periodic retraining (Sequential Adaptive Training, SAT), and within-session upper-bound estimation (Within-Session Reconstruction, WSR). A CNN-LSTM decoder achieved within-session imagined movement correlations of r = 0.762 under VR and r = 0.672 under screen feedback. VR significantly outperformed screen feedback across all strategies and movement dimensions (improvements of 8.9-13.0%, all p <= 0.002, d = 1.42-2.05). This advantage persisted under fixed decoders without retraining, demonstrating that embodied VR feedback elicits inherently more decodable and generalisable neural representations. Linear mixed-effects modelling confirmed robust main effects of feedback modality and movement axis with no interaction. Neurophysiologically, VR produced stronger sensorimotor-parietal desynchronisation and enhanced motor-frontal functional connectivity, with pervasive anterior insula engagement across all frequency bands and increased superior parietal lobule coupling, paralleling patterns associated with real movement execution. These findings establish embodied spatial feedback as a key design principle for next-generation continuous BCIs targeting intuitive motor control and neurorehabilitation.

2605.29587 2026-05-29 q-bio.QM cs.LG

FPLIER: Federated Pathway-Level Information Extractor

FPLIER:联邦通路级信息提取器

Daniele Malpetti, Christian Berchtold, Francesco Gualdi, Marco Scutari, Laura Azzimonti, Francesca Mangili

AI总结 提出联邦学习框架FPLIER,通过安全聚合实现分布式基因表达数据上的通路级因子分解,并证明隐私风险由训练表达矩阵的秩决定。

详情
Comments
Accepted for publication at the ACM BCB '26 conference
AI中文摘要

在转录组学中,通路级信息提取器(PLIER)等基因集感知因子分解方法在大型异质性表达数据集上训练时效果最佳。然而,由于隐私和治理限制,许多临床相关队列无法合并为单个数据集。我们提出FPLIER,这是PLIER的联邦扩展,能够在多个数据持有者之间进行分布式训练,同时整合公开可用数据集。通过安全聚合,FPLIER产生的训练更新在代数上等价于集中式池化数据方法,同时保持表达数据的本地性。我们在两个模拟联盟(来自K-CLIER和MultiPLIER研究)的多个场景中评估FPLIER,并展示其稳定收敛。我们进一步对针对中间训练统计量和发布模型的成员推断攻击进行了系统分析。结果表明,隐私风险由训练表达矩阵的秩决定。整合公开数据或降低数据维度会增加该秩,使系统趋向满秩状态,在此状态下训练样本与非训练样本对攻击者而言难以区分,成员推断性能接近随机猜测。

英文摘要

In transcriptomics, gene-set-aware factorization methods such as the Pathway Level Information Extractor (PLIER) are most effective when trained on large, heterogeneous expression compendia. Yet, many clinically relevant cohorts cannot be pooled into a single dataset due to privacy and governance constraints. We present FPLIER, a federated extension of PLIER that enables distributed training across multiple data holders while incorporating publicly available datasets. Through secure aggregation, FPLIER produces training updates algebraically equivalent to those of a centralized pooled-data approach while keeping expression data local. We evaluate FPLIER across multiple scenarios in two simulated consortia (from the K-CLIER and MultiPLIER studies) and demonstrate stable convergence. We further conduct a systematic analysis of membership inference attacks targeting both intermediate training statistics and the released model. Our results show that privacy risk is governed by the rank of the training expression matrix. Incorporating public data or reducing data dimensionality increases this rank, moving the system toward a full-rank regime in which training and non-training samples become indistinguishable to the attacker, and membership-inference performance approaches random guessing.

2605.27580 2026-05-29 cs.AI q-bio.NC

You Are in Control of Your State: Why Human Outcomes Are Controllable Through Causal State Intervention

你掌控自己的状态:为什么人类结果可以通过因果状态干预来控制

Suraj Biswas, Saurav Gupta, Pritam Mukherjee

AI总结 本文提出人类行为的变异性源于动态潜在状态,并通过因果状态干预实现对结果的可控性,结合六类证据和超过20万用户的数据验证了该框架。

详情
Comments
20 pages, 12 figures, 37 references. Companion to a prior SSRN preprint on causal architecture for human modelling
AI中文摘要

行为科学和面向人类的人工智能的一个核心谜题是个体内变异性的持续存在。同一个体在相同的可观察输入下,在不同场合产生不同的结果,而不同个体产生不同的结果,且没有可观察的协变量能完全预测。我们认为,这种变异性属于个体的动态潜在状态,并且通过针对决策形成时刻的状态及其权重的干预,人类结果在精确且可操作的意义上是可控的。我们将状态定义为随时间索引的权重向量,其维度决定个体的生物学、生理学和神经心理学如何将下一个事件处理为决策和结果。状态、决策和结果之间的关系是因果性的而非相关性的。权重向量在亚日时间尺度上是动态的。结果可报告的意识通道是一个狭窄的注意瓶颈,其内容本身依赖于状态。综合这些主张,意味着给定事件的结果在干预时的状态轨迹条件下是可控的。我们通过六条已建立的证据链(因果推断、预测处理、稳态应变、注意瓶颈、时间生物学、计算精神病学)以及一个部署的行为平台(涵盖2023年至2026年研究期间超过20万同意用户,跨越四种职业角色)的24个月观察基础来推动该框架。我们推导出七个可检验的预测,列出了六个状态感知系统的操作要求,并讨论了对数字健康、教育、人工智能个性化和个人能动性的影响。

英文摘要

A central puzzle for the behavioural sciences and for human-facing artificial intelligence is the persistence of within-person variability. The same individual, presented with the same observable input, produces different outcomes on different occasions, and different individuals produce divergent outcomes that no observable covariate fully predicts. We argue that this variability belongs in the dynamic latent state of the person, and that human outcomes are controllable in a precise and operational sense through interventions that target the state and its weighting at the moment a decision is being formed. We define a state as the time-indexed weighting vector over the dimensions that govern how an individual's biology, physiology, and neuropsychology process the next event into a decision and an outcome. The relationship between state, decision, and outcome is causal rather than correlational. The weighting vector is dynamic at sub-daily timescales. The conscious channel through which outcomes are reportable is a narrow attentional bottleneck whose contents are themselves state-dependent. Taken together, these claims imply that the outcome of a given event is controllable, conditionally, on the state-trajectory at the time of intervention. We motivate the framework with six strands of established evidence (causal inference, predictive processing, allostasis, attentional bottleneck, chronobiology, computational psychiatry) and a 24-month observational base from a deployed behavioural platform spanning more than 200,000 consented users across four occupational personas (research period 2023 to 2026). We derive seven testable predictions, list six operational requirements for state-aware systems, and discuss implications for digital health, education, AI personalisation, and personal agency.

2512.18652 2026-05-29 cond-mat.stat-mech q-bio.PE

Impact of temporary lockdown on disease extinction in assortative networks

临时封锁对同配网络中疾病灭绝的影响

Elad Korngut, Michael Assaf

AI总结 通过半经典近似和数值模拟,研究随机SIS模型中临时封锁的时长、强度及网络拓扑对异质同配网络中疾病灭绝风险的影响。

详情
Comments
10 pages, 7 figures; to appear in Phys. Rev. E (2026)
AI中文摘要

环境条件的变化会显著影响疾病传播的动态。这些变化可能自然发生,也可能源于人类干预;在后一种情况下,封锁措施会导致传播率突然但暂时地降低,用于对抗疾病传播。然而,这些措施对异质群体中罕见事件的影响仍未得到充分研究。在这里,我们分析了随机环境下的易感-感染-易感(SIS)模型,其中疾病灭绝——感染的突然清除——通过罕见的、大的涨落发生。我们使用半经典近似和在异质同配网络(具有邻居节点间度-度相关性)上的数值模拟,展示了疾病的灭绝风险如何取决于封锁的持续时间和强度,以及网络拓扑。

英文摘要

Changing environmental conditions can significantly affect the dynamics of disease spread. These changes may arise naturally or result from human interventions; in the latter case, lockdown measures that lead to abrupt but temporary reductions in transmission rates are used to combat disease spread. Yet, the impact of these measures on rare events in heterogeneous populations remains understudied. Here, we analyze the susceptible-infected-susceptible (SIS) model in a stochastic setting where disease extinction -- a sudden clearance of the infection -- occurs via a rare, large fluctuation. We use a semiclassical approximation and numerical simulations on heterogeneous assortative networks, with degree-degree correlations between neighboring nodes, to show how the extinction risk of the disease depends on the lockdown's duration and magnitude, and on the network topology.

2512.13517 2026-05-29 q-bio.NC cs.LG

A Deep Learning Model of Mental Rotation Informed by Interactive VR Experiments

基于交互式VR实验的心理旋转深度学习模型

Raymond Khazoum, Daniela Fernandes, Aleksandr Krylov, Qin Li, Stephane Deny

AI总结 提出一个由等变编码器、神经符号对象编码器和神经决策代理组成的深度学习模型,通过VR实验验证,准确模拟人类心理旋转的性能、响应时间和行为。

详情
Comments
Version accepted at ICML 2026
AI中文摘要

心理旋转——比较从不同视角观察到的物体的能力——是人类心理模拟和空间世界建模的基本示例。在这里,我们利用深度、等变和神经符号学习的最新进展,提出了一个人类心理旋转的机制模型。我们的模型由三个堆叠的组件组成:(1) 等变神经编码器,从图像中生成物体的3D空间表示;(2) 神经符号对象编码器,从这些空间表示中推导出符号对象描述;(3) 神经决策代理,通过循环路径比较这些符号描述,以在3D潜在空间中规定旋转模拟。我们的模型设计受到现有心理旋转实验文献的指导,并辅以VR实验,其中参与者有时可以操作物体进行比较。我们的模型很好地捕捉了参与者在我们和其他人的实验中的表现、反应时间和行为,并通过消融研究证明了每个组件的必要性。我们的工作为最近一系列人类空间推理的深度神经模型增添了新的内容,进一步证明了整合深度、等变和符号表示来模拟人类思维的效力。

英文摘要

Mental rotation -- the ability to compare objects seen from different viewpoints -- is a fundamental example of mental simulation and spatial world modeling in humans. Here we propose a mechanistic model of human mental rotation, leveraging recent advances in deep, equivariant, and neuro-symbolic learning. Our model consists of three stacked components: (1) an equivariant neural encoder, producing 3D spatial representations of objects from images, (2) a neuro-symbolic object encoder, deriving symbolic objects descriptions from these spatial representations, and (3) a neural decision agent, comparing these symbolic descriptions to prescribe rotation simulations in 3D latent space via a recurrent pathway. Our model design is guided by the existing experimental literature on mental rotation, which we complemented with experiments in VR where participants could at times manipulate the objects to compare. Our model captures well the performance, response times and behavior of participants in our and others' experiments, and through ablation studies we demonstrate the necessity of each component. Our work adds to a recent collection of deep neural models of human spatial reasoning, further demonstrating the potency of integrating deep, equivariant, and symbolic representations to model the human mind.

2504.13727 2026-05-29 math.DS math-ph math.MP q-bio.NC

High-dimensional dynamics in low-dimensional networks

低维网络中的高维动力学

Yue Wan, Robert Rosenbaum

AI总结 研究低秩结构递归网络在高维输入或扰动下产生高维或低维动力学的条件,发现“低秩抑制”现象并推导线性化动力学高维的数学条件。

详情
AI中文摘要

自然界和许多应用中的网络具有近似低秩结构,即其连接结构由少数维度主导。人们自然期望此类网络上的动力学也是低维的。实际上,理论结果表明,当网络与外部扰动或输入隔离时,低秩网络会产生低维动力学。然而,自然界中的网络很少是孤立的。本文研究了由高维输入或扰动驱动的低维结构递归网络中动力学的维数。我们发现,此类网络中的动力学可以是高维或低维的,并推导了线性化动力学为高维时网络结构的数学条件。在许多低秩网络中,动力学在与网络低秩结构对齐的方向上被抑制,我们将这一现象称为“低秩抑制”。我们表明,自然界中出现的几种低秩网络结构满足产生高维动力学和低秩抑制的条件。我们的结果阐明了递归网络的连接结构与其对外部输入响应结构之间重要但反直觉的关系。

英文摘要

Many networks in nature and applications have an approximate low-rank structure in the sense that their connectivity structure is dominated by a few dimensions. It is natural to expect that dynamics on such networks would also be low-dimensional. Indeed, theoretical results show that low-rank networks produce low-dimensional dynamics whenever the network is isolated from external perturbations or input. However, networks in nature are rarely isolated. Here, we study the dimensionality of dynamics in recurrent networks with low-dimensional structure driven by high-dimensional inputs or perturbations. We find that dynamics in such networks can be high- or low-dimensional and we derive mathematical conditions on the network structure under which linearized dynamics are high-dimensional. In many low-rank networks, dynamics are suppressed in directions aligned with the network's low-rank structure, a phenomenon we term ``low-rank suppression.'' We show that several low-rank network structures arising in nature satisfy the conditions for generating high-dimensional dynamics and low-rank suppression. Our results clarify important, but counterintuitive relationships between a recurrent network's connectivity structure and the structure of its response to external input.

2502.01360 2026-05-29 cs.LG math.AT q-bio.NC

A Quotient Homology Theory of Representation in Neural Networks

神经网络表示的商同调理论

Kosio Beshkov

AI总结 利用ReLU神经网络的分片线性性质,定义输入数据集上的等价关系并构造商空间,证明在凸性条件下神经表示的同调群与商同调群同构,从而无需外部度量即可计算Betti数。

详情
Journal ref
Transactions on Machine Learning Research, 05/2026, https://openreview.net/forum?id=RluspxztzS
AI中文摘要

先前的研究已经证明,使用ReLU激活函数的神经网络所实现的映射集合与分片线性连续映射的集合相同。此外,这类网络诱导一个超平面排列,将网络的输入域分割成凸多面体$G_J$,网络$Φ$在这些多面体上以仿射方式运行。在本文中,我们利用这些性质在输入数据集上定义一个等价关系$\sim_Φ$,该关系定义了一个商空间,该商空间可被分割成两个集合,分别与$Φ_J$的局部秩以及交集$\cap ext{Im}Φ_{J_i}$相关。我们将后者称为 extit{重叠分解}$\mathcal{O}_Φ$,并证明如果每个多面体与输入流形之间的交集是凸的,则神经表示的同调群与商同调群$H_k(Φ(\mathcal{M})) \simeq H_k(\mathcal{M}/\mathcal{O}_Φ)$同构。这使我们能够在不选择外部度量的情况下内在地计算神经表示的Betti数。我们开发了通过线性规划和并查集算法数值计算重叠分解的方法。利用这一框架,我们在玩具数据集上进行了若干实验,表明与标准持续同调相比,基于重叠同调的Betti数计算追踪的是纯拓扑特征而非几何特征。最后,我们研究了几个分类问题中训练过程中重叠分解的演化,并讨论了该方法的一些缺点。

英文摘要

Previous research has proven that the set of maps implemented by neural networks with a ReLU activation function is identical to the set of piecewise linear continuous maps. Furthermore, such networks induce a hyperplane arrangement splitting the input domain of the network into convex polyhedra $G_J$ over which a network $Φ$ operates in an affine manner. In this work, we leverage these properties to define an equivalence relation $\sim_Φ$ on top of an input dataset, which defines a quotient space that can be split into two sets related to the local rank of $Φ_J$ and the intersections $\cap \text{Im}Φ_{J_i}$. We refer to the latter as the \textit{overlap decomposition} $\mathcal{O}_Φ$ and prove that if the intersections between each polyhedron and an input manifold are convex, the homology groups of neural representations are isomorphic to quotient homology groups $H_k(Φ(\mathcal{M})) \simeq H_k(\mathcal{M}/\mathcal{O}_Φ)$. This lets us intrinsically calculate the Betti numbers of neural representations without the choice of an external metric. We develop methods to numerically compute the overlap decomposition through linear programming and a union-find algorithm. Using this framework, we perform several experiments on toy datasets showing that, compared to standard persistent homology, our overlap homology-based computation of Betti numbers tracks purely topological rather than geometric features. Finally, we study the evolution of the overlap decomposition during training on several classification problems and discuss some shortcomings of our method.

2411.14107 2026-05-29 q-bio.NC

Inward rectifier potassium channels interact with calcium channels to promote robust and physiological bistability

内向整流钾通道与钙通道相互作用促进稳健且生理性的双稳态

Anaëlle De Worm, Guillaume Drion, Pierre Sacré

AI总结 本研究通过最小电导模型揭示,内向整流钾通道(Kir)与L型钙通道(CaL)结合可扩大双稳态窗口,产生稳健且生理性的双稳态,为中枢敏化提供候选内在机制。

详情
AI中文摘要

背角投射神经元将伤害性输入传递到脊髓上中枢。在中枢敏化过程中,一部分神经元从紧张性放电转变为具有持续后放电的平台电位,这种变化需要静息态和放电态之间的内在双稳态。电压门控L型钙通道(CaL)可以产生双稳态,但只有与电压门控钾通道配对时才能达到生理性静息态,而大多数钾通道会同时缩小双稳态窗口。因此,稳健、生理性的双稳态如何产生尚不清楚。使用最小电导模型,我们显示内向整流钾通道(Kir)与CaL通道结合时扩大双稳态窗口,而M型钾通道(KM)则略微缩小。在双稳态既稳健又生理的参数区域内,两种通道类型都能维持双稳态,但CaL+Kir组合产生更大的窗口,并且对噪声和内在变异性更稳健。这种窗口扩大效应可追溯到外向Kir稳态电流的形状特征:与CaL电流类似,它在放电阈值附近具有负微分电导区域,而KM和大多数其他电压门控钾电流缺乏此特征。分岔分析进一步表明,这两对组合支持性质不同的兴奋性:CaL+Kir产生平台电位生成性双稳态,CaL+KM产生谐振样动力学。这些结论在具有真实离子通道补足的深层投射神经元双房室模型中成立,并确定CaL+Kir对是中枢敏化的候选内在机制。

英文摘要

Projection neurons in the dorsal horn relay nociceptive input to supraspinal centers. During central sensitization, a subset of them switches from tonic firing to plateau potentials with sustained afterdischarges, a change that requires intrinsic bistability between a resting and a spiking state. Voltage-gated L-type calcium (CaL) channels can produce bistability, but reach physiological resting states only when paired with voltage-gated potassium channels, most of which simultaneously shrink the bistability window. How robust, physiological bistability arises has therefore remained unclear. Using a minimal conductance-based model, we show that inward rectifier potassium (Kir) channels enlarge the bistability window when combined with CaL channels, while M-type potassium (KM) channels slightly reduce it. Within the parameter region where bistability is both robust and physiological, both channel types can sustain bistability, but the CaL+Kir combination produces a substantially larger window and is more robust to noise and intrinsic variability. This window-enlarging effect traces to a shape feature of the outward Kir steady-state current: like the CaL current, it has a region of negative differential conductance around the spike threshold, a feature absent from KM and from most other voltage-gated potassium currents. Bifurcation analysis further shows that the two pairs support qualitatively distinct excitability: plateau-generating bistability for CaL+Kir and resonator-like dynamics for CaL+KM. These conclusions hold in a two-compartment model of deep projection neurons with realistic ion channel complements, and identify the CaL+Kir pair as a candidate intrinsic mechanism for central sensitization.

2605.29529 2026-05-29 nlin.AO cond-mat.stat-mech q-bio.NC

Common Noise-Induced Group-Level Synchronization Between Uncoupled Groups of Oscillators

常见噪声引起的未耦合振荡器群之间的群级同步

Tae-Wook Ko

AI总结 研究在无群间耦合下,由共同噪声诱导的振荡器群之间的群级同步,通过数值模拟和相位密度演化映射分析其机制。

详情
Comments
22 pages, 15 figures
AI中文摘要

我们研究了在无群间耦合的情况下,由共同噪声诱导的振荡器群之间的群级同步。每个群接收其所有振荡器共享的共同噪声以及独立作用于单个振荡器的局部噪声输入。所有群接收相同的共同噪声。该系统在相同和非相同振荡器的情况下进行研究,并且考虑有和没有群内耦合。在非相同情况下,两个群的自然频率来自相同的分布,使它们在统计上等价。通过对该系统的数值模拟,我们发现每个群内的同步程度(由复Kuramoto序参数的绝对值衡量)通常表现出显著的时间波动。重要的是,当群由相同的共同噪声驱动时,代表群集体振荡的复序参数会同步。通过推导相位密度演化映射,我们从分析上解释了在没有群内耦合的情况下如何实现这种群级同步。

英文摘要

We investigate group-level synchronization between oscillator groups induced by common noise in the absence of inter-group coupling. Each group receives a common noise shared by all its oscillators and independent local noise inputs to individual oscillators. The same common noise is applied to all groups. The system is studied with both identical and nonidentical oscillators, and with and without intra-group coupling. In the nonidentical case, natural frequencies are drawn from the same distribution for both groups, making them statistically equivalent. Through numerical simulations of this system, we find that the degree of synchronization within each group, measured by the absolute value of a complex Kuramoto order parameter, typically shows significant temporal fluctuations. Importantly, the complex order parameters representing the collective oscillations of the groups synchronize when the groups are driven by the same common noise. By deriving a phase density evolution mapping, we analytically explain how this group-level synchronization is achieved in the absence of intra-group coupling.

2605.29329 2026-05-29 q-bio.QM cs.LG

Mixing Vector Model for Copolymer Inference via Mixed Integer Linear Programming

基于混合整数线性规划的共聚物推断的混合向量模型

Jianshen Zhu, Raveena Rai, Taiyo Sohkawa, Naveed Ahmed Azam, Kazuya Haraguchi, Liang Zhao, Tatsuya Akutsu

AI总结 提出混合向量模型,通过混合整数线性规划实现共聚物的逆设计,在多个物化数据集上取得高预测精度并保持可解性。

详情
AI中文摘要

最近开发了一种新颖的两阶段分子推断框架mol-infer,通过两层模型下的混合整数线性规划(MILP),在给定学习预测函数和结构约束的条件下,以最优性和精确性推断具有规定抽象结构和期望性质值的化学图。在本研究中,我们通过引入一种称为混合向量(MV)模型的简单特征表示,将该框架扩展到共聚物。在所提出的模型中,共聚物特征向量表示为MILT可处理单体描述符的凸组合,加权系数为组成单体的混合比例。这种表示不需要明确的序列类别信息,因此自然兼容基于MILP的逆设计。在该模型下,我们使用人工神经网络、简化二次多元线性回归和随机森林为多个共聚物性质数据集构建预测函数。所提出的表示在多个物理化学性质数据集上实现了实际有用的预测性能;特别地,十个数据集中有九个的最佳测试R²分数超过0.7,六个数据集超过0.9。我们还制定了在MV表示下具有规定混合比例的多单体逆设计问题,并表明即使在三单体设置下,生成的MILP实例仍然可解。最后,我们通过重新评估推断的候选物并将重新计算的性质值与学习模型预测的值进行比较,进行外部一致性检查。总体而言,所提出的框架为在两层模型下实现共聚物的模型级精确逆设计提供了可处理的第一步。

英文摘要

A novel two-phase molecule inference framework, mol-infer, has recently been developed to infer chemical graphs with prescribed abstract structures and desired property values through mixed integer linear programming (MILP) under the two-layered model, with guaranteed optimality and exactness relative to the given learned prediction function and structural constraints. In this study, we extend this framework to copolymers by introducing a simple feature representation, called the mixing vector (MV) model. In the proposed model, a copolymer feature vector is represented as a convex combination of MILP-tractable monomer descriptors weighted by the mixing ratio of the constituent monomers. This representation does not require explicit sequence-class information and is therefore naturally compatible with MILP-based inverse design. Under this model, we construct prediction functions for several copolymer property datasets using artificial neural networks, reduced quadratic multiple linear regression, and random forests. The proposed representation achieves practically useful predictive performance across multiple physicochemical property datasets; in particular, the best test R^2 score exceeds 0.7 for nine of the ten datasets and exceeds 0.9 for six datasets. We also formulate a multi-monomer inverse-design problem under the MV representation with a prescribed mixing ratio and show that the resulting MILP instances remain tractable, even for three-monomer settings. Finally, we perform an external consistency check by re-evaluating the inferred candidates and comparing the re-computed property values with those predicted by the learned model. Overall, the proposed framework gives a tractable first step toward model-level exact inverse design of copolymers under the two-layered model.

2605.29228 2026-05-29 cs.LG q-bio.MN

Traditional machine learning vs. deep learning from dynamic graph representations of proteins' 3D folds in the task of protein structure classification

传统机器学习 vs. 深度学习在蛋白质三维折叠动态图表示中的蛋白质结构分类任务

Aydin Wells, Francis A. Gatsi, Aaron Striegel, Tijana Milenković

AI总结 本研究比较了传统机器学习与深度学习在基于动态蛋白质结构网络进行蛋白质结构分类时的准确性和效率,发现两者准确性相近但深度学习慢10倍以上。

详情
Comments
Main paper: 16 pages, 4 figures, and 1 table; Supplementary information: 13 pages, 9 figures
AI中文摘要

蛋白质结构分类(PSC)使用监督学习从蛋白质序列或三维结构特征预测其CATH/SCOP(e)类别。我们之前将三维结构建模为(静态)蛋白质结构网络(PSN),证明了基于PSN的特征在PSC任务中与序列或直接(即非网络)三维结构特征相比具有竞争力。最近,我们展示了从动态PSN中提取的特征在相同任务中优于从静态PSN中提取的特征(从而通过传递性优于序列和直接三维结构特征)。该动态PSN方法使用传统机器学习(ML),结合手动(预设计)特征与现成分类器。在此,我们评估从动态PSN进行自动深度学习(DL)是否能带来改进。我们对涵盖约44,000个CATH或SCOPe标记的动态PSN的72个数据集进行的评估显示,就PSC准确性而言,传统ML和DL在绝大多数数据集上(接近)持平,而DL平均慢10倍以上。我们是首个在基于动态PSN的PSC任务中评估传统ML与DL的研究。

英文摘要

Protein structure classification (PSC) uses supervised learning to predict a protein's CATH/SCOP(e) class from the protein's sequence or 3D structural feature(s). We already modeled 3D structures as (static) protein structure networks (PSNs), demonstrating the competitiveness of PSN-based features to sequence or direct (i.e. non-network) 3D structural features in the PSC task. More recently, we demonstrated the power of features extracted from dynamic PSNs over features extracted from static PSNs (and thus by transitivity over sequence and direct 3D structural features) in the same task. That dynamic PSN approach used traditional machine learning (ML), combining manual (pre-engineered) features with an off-the-shelf classifier. Here, we evaluate whether automatic deep learning (DL) from the dynamic PSNs yields improvements. Our evaluation on 72 datasets spanning ~44,000 CATH- or SCOPe-labeled dynamic PSNs reveals that in terms of PSC accuracy, traditional ML and DL are (close to) tied for a large majority of the datasets, while DL is on average 10+ times slower. We are the first to evaluate traditional ML vs. DL in the dynamic PSN-based PSC task.

2605.29158 2026-05-29 cs.LG cs.IR q-bio.BM

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

PROTOCOL: 用于蛋白质同源搜索的延迟交互检索

Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman

AI总结 提出ProtoCol模型,利用ColBERT风格的延迟交互机制对残基嵌入进行最大相似度评分,以提升远程同源搜索的灵敏度,在SCOPe超家族和Pfam clan基准上优于多种基线方法。

详情
AI中文摘要

蛋白质同源搜索是功能注释、结构预测和进化分析的基础,但在全局序列相似性较弱且经典比对方法灵敏度下降的“模糊区”中仍然具有挑战性。蛋白质语言模型提供了上下文感知的表示,可以在此范围内提高比对灵敏度。然而,先前的基于蛋白质嵌入的检索流程通常将这些表示池化为单个向量,可能掩盖揭示远程同源性的局部基序、结构域或保守残基。我们引入了ProtoCol,该模型将蛋白质表示为残基嵌入的集合,并使用ColBERT风格的延迟交互来测试残基级比较是否改善同源检索。ProtoCol独立编码蛋白质,保持候选表示可预计算,并通过残基嵌入上的MaxSim对候选进行评分。在SCOPe超家族和Pfam clan基准上,ProtoCol优于基于序列组成、比对、池化PLM和训练的单向量基线,支持延迟交互作为远程同源搜索的有效检索层。

英文摘要

Protein homology search underlies function annotation, structure prediction, and evolutionary analysis, but remains challenging in the "twilight zone," where global sequence similarity is weak and classical alignment methods lose sensitivity. Protein language models provide context-aware representations that could improve alignment sensitivity in this regime. However, prior protein embedding-based retrieval pipelines often pool these representations into a single vector, potentially obscuring local motifs, domains, or conserved residues that reveal remote homology. We introduce ProtoCol, a model which represents proteins as sets of residue embeddings and uses ColBERT-style late interaction to test whether residue-level comparison improves homolog retrieval. ProtoCol encodes proteins independently, keeps candidate representations pre-computable, and scores candidates with MaxSim over residue embeddings. On SCOPe superfamily and Pfam clan benchmarks, ProtoCol outperforms sequence-composition, alignment-based, pooled PLM, and trained single-vector baselines, supporting late interaction as an effective retrieval layer for remote homology search.

2605.28976 2026-05-29 q-bio.PE

On a phenotype-structured Shigesada--Kawasaki--Teramoto model: Turing instability and pattern selection under fast phenotype switching

关于表型结构化的Shigesada-Kawasaki-Teramoto模型:快速表型切换下的图灵不稳定性和模式选择

Davide Cusseddu, Gaetana Gambino, Tommaso Lorenzi

AI总结 本文提出一个表型结构化的SKT模型,通过快速表型切换下的线性与弱非线性分析,研究表型分布如何作为有效控制参数影响图灵不稳定性和空间模式选择。

详情
Comments
24 pages, 6 figures
AI中文摘要

Shigesada-Kawasaki-Teramoto (SKT) 模型已成为研究竞争种群中空间分离和交叉扩散驱动模式形成的经典建模框架。该模型假设表型同质性,但任何种群内部都存在表型变异,并可能强烈影响生态和进化动态。在本文中,我们提出了一个考虑表型变异的SKT模型的广义表型结构化公式。在该公式中,竞争种群在某些表型状态空间上连续结构化。种群成员以表型依赖的方式移动和竞争,并且可以在不同表型状态之间切换。首先,我们展示了如何通过快速表型切换的准不变机制,得到经典SKT模型的一种形式,其中参数以广义结构化模型的表型依赖函数的连续加权平均值表示,权重由两个种群的表型分布给出。然后,仍然假设快速表型切换,并扩展经典的图灵型线性和弱非线性分析,我们探索了空间模式出现的条件,识别了导致模式形成的图灵型分岔阈值,并研究了这种分岔的性质(超临界或亚临界)以及模式状态的稳定性。所得结果使得在表型依赖模型函数与种群尺度聚合空间动态的出现之间建立联系成为可能,特别展示了表型分布如何作为图灵不稳定性和模式选择的有效控制参数。这些发现通过数值模拟得到补充,验证了形式渐近性并确认了模式形成分析的预测。

英文摘要

The Shigesada-Kawasaki-Teramoto (SKT) model has become a classical modelling framework for studying spatial segregation and cross-diffusion-driven pattern formation in competing populations. This model assumes phenotypic homogeneity, but phenotypic variability persists within any population and can strongly influence both ecological and evolutionary dynamics. In this paper, we present a generalised phenotype-structured formulation of the SKT model that accounts for phenotypic variability. In this formulation, the competing populations are continuously structured across some phenotype state spaces. Population members move and compete in phenotype-dependent ways, and can also switch between different phenotype states. First we show how a form of the classical SKT model, wherein parameters are written in terms of continuous weighted averages of the phenotype-dependent functions of the generalised structured model, with weights given by the phenotype distributions of the two populations, can be obtained in the quasi-invariant regime of fast phenotype switching. Then, still assuming fast phenotype switching and extending classical Turing-like linear and weakly nonlinear analyses, we explore the conditions for the emergence of spatial patterns, identify a Turing-type bifurcation threshold leading to pattern formation, and investigate the nature of such a bifurcation (super- or sub-critical) as well as the stability of the patterned state. The results obtained make it possible to draw connections between phenotype-dependent model functions and the emergence of population-scale aggregate spatial dynamics, showing in particular how phenotype distributions can act as effective control parameters for Turing instability and pattern selection. These findings are complemented by numerical simulations, which validate the formal asymptotics and confirm the predictions of the pattern formation analyses.

2605.28886 2026-05-29 q-bio.QM cs.LG

Computational Modeling of Antibody-Antigen Complexes: PLM-Based and MSA-Based Approaches

抗体-抗原复合物的计算建模:基于PLM和基于MSA的方法

Xiao Luo

AI总结 本研究探讨抗体相关任务计算困难的原因,并提出基于蛋白质语言模型(PLM)和多重序列比对(MSA)的两种互补改进方法,以提升抗体-抗原结构预测精度。

详情
Comments
PhD thesis
AI中文摘要

抗体通过特异性识别和中和抗原在免疫反应中发挥核心作用,治疗性抗体已成为癌症和自身免疫疾病的主要药物。然而,其发现仍依赖大量体外筛选,而抗体结构和抗体-抗原相互作用的准确计算建模可以优先候选、减少实验负担并加速理性设计。尽管近年来高精度蛋白质和复合物预测取得了进展,但与一般蛋白质-蛋白质相互作用相比,抗体相关任务仍存在持续的性能差距,限制了下游设计。 本论文研究了为何抗体相关任务更困难,并沿两个互补方向提出改进。首先,我们研究了基于蛋白质语言模型(PLM)的抗体及抗体-抗原结构预测方法。利用多个PLM的嵌入,我们的方法在抗体单体预测中达到了所比较的PLM方法中最高的CDR-H3精度。将其扩展到复合物预测时未能泛化:由于缺乏抗体和抗原之间的共进化信号,单序列PLM表示无法可靠识别结合界面。 其次,我们针对抗体-抗原复合物预测开发了两种基于MSA的干预措施:MSA精炼,结合了CDR聚焦过滤和从更大序列数据库恢复深度;以及收敛感知循环,选择稳定的中间循环状态用于最终扩散采样。这些干预措施在保留的抗体-抗原测试集上相对于AlphaFold3基线提供了一致的增益。由于这些方法修改了MSA构建和循环行为而非模型参数,它们无需重新训练或权重访问即可应用。

英文摘要

Antibodies play a central role in the immune response by specifically recognizing and neutralizing antigens, and therapeutic antibodies have become major drugs for cancer and autoimmune diseases. However, their discovery still relies on extensive in vitro screening, and accurate computational modeling of antibody structures and antibody-antigen interactions can prioritize candidates, reduce experimental burden, and accelerate rational design. Despite recent advances in high-accuracy protein and complex prediction, a persistent performance gap remains for antibody-related tasks compared with general protein-protein interactions, limiting downstream design. This thesis investigates why antibody-related tasks are harder and proposes improvements along two complementary directions. First, we investigate protein language model (PLM)-based methods for antibody and antibody-antigen structure prediction. Using embeddings from multiple PLMs, our approach achieves the best CDR-H3 accuracy among compared PLM-based methods on antibody monomer prediction. Extending it to complex prediction does not generalize: without co-evolutionary signals between antibody and antigen, single-sequence PLM representations do not reliably identify binding interfaces. Second, we develop two MSA-based interventions for antibody-antigen complex prediction: MSA refinement, which combines CDR-focused filtering with depth recovery from a larger sequence database, and convergence-aware recycling, which selects a stable intermediate recycle state for final diffusion sampling. Together, these interventions provide consistent gains over the AlphaFold3 baseline on a held-out antibody-antigen test set. Because the methods modify MSA construction and recycling behavior rather than model parameters, they apply without retraining or weight access.

2605.28862 2026-05-29 cs.LG q-bio.QM

Molecular Lead Optimization via Agentic Tool Planning

通过智能体工具规划进行分子先导优化

Lingxiao Li, Haobo Zhang, Ruohao Fan, Bin Chen, Jiayu Zhou

AI总结 提出TRACE,一种轨迹感知的LLM推理智能体,将先导优化建模为序列决策问题,通过工具选择实现结构约束下的前瞻性分子优化,在ADMET任务中优于基线。

详情
Comments
12 pages
AI中文摘要

药物发现是一个漫长且资源密集的过程,由多个阶段组成。其中,先导优化在将早期命中化合物转化为可行的候选药物中起着关键作用。这一阶段需要通过细微的结构修饰来改善ADMET相关性质,同时保留负责与疾病靶点结合亲和力的关键分子子结构。人工智能的最新进展在加速药物发现的各个方面显示出前景;然而,大多数现有的先导优化方法依赖于一步式分子优化,未能考虑序列设计决策的长期后果。为了解决这一限制,我们提出了TRACE,一种用于分子先导优化的轨迹感知、LLM推理智能体,它将工具选择形式化为一个关于动作轨迹的序列决策问题。给定一个先导分子和一个优化目标,TRACE在分子优化工具上做出轨迹感知的决策,从而在结构约束下实现前瞻性优化。在多个ADMET优化任务上的实验表明,与基线模型相比,我们的智能体实现了更高的优化成功率、更大的性质改进和更高的有效性,同时保持了分子相似性。

英文摘要

Drug discovery is a lengthy and resource-intensive process composed of multiple stages. Among these stages, lead optimization plays a critical role in transforming early hit compounds into viable drug candidates. This stage requires improving ADMET-related properties through subtle structural refinement while preserving key molecular substructures responsible for binding affinity to disease targets. Recent advances in artificial intelligence have shown promise in accelerating various aspects of drug discovery; however, most existing approaches to lead optimization rely on one-step molecular optimization, which fail to account for the long-term consequences of sequential design decisions. To address this limitation, we propose TRACE, a trajectory-aware, LLM-reasoning agent for molecular lead optimization that formulates tool selection as a sequential decision-making problem over action trajectories. Given a lead molecule and an optimization objective, TRACE makes trajectory-aware decisions over molecular optimization tools, enabling forward-looking refinement under structural constraints. Experiments on multiple ADMET optimization tasks show that our agent achieves higher optimization success, larger property improvements, and higher validity, while preserving molecular similarity compared to baseline models.

2605.28854 2026-05-29 cs.CL cs.LG q-bio.NC

Large language models reorganize representational geometry during in-context learning

大型语言模型在上下文学习中重组表征几何结构

Hua-Dong Xiong, Li Ji-An, Robert C. Wilson, Kwonjoon Lee, Xue-Xin Wei

AI总结 研究大型语言模型在上下文学习中的表征几何重组,发现其性能与任务表征结构相关,并通过原型算法动态调整表征以提高可分性。

详情
AI中文摘要

大型语言模型(LLMs)表现出显著的灵活性:它们可以从上下文示例中适应新任务,而无需任何参数更新,这种能力被称为上下文学习(ICL)。先前关于合成任务的研究表明,ICL可以实现特定算法,展示了架构能力,并且机制分析已经识别出支持这种行为的关键回路。然而,由于上下文计算——无论其算法形式如何——依赖于高维表征空间中的变换,该空间的几何结构如何塑造ICL的有效性仍不清楚。受神经科学中将分类视为神经表征解缠的观点启发,我们假设ICL依赖于任务相关表征的成功在线解缠。为了验证这一想法,我们研究了LLMs如何对上下文示例进行分类,这些示例的标签由模型自身具有已知结构的内部表征定义。我们表明,ICL性能与底层分类任务的表征结构系统性相关,并且成功的ICL伴随着几何重组,增加了在线可分性。我们进一步发现,LLM的行为可以通过一种原型类算法很好地描述,该算法在重塑表征以支持分类的同时整合证据。这些发现为预训练LLMs中的ICL提供了几何解释,将表征几何结构确立为ICL的机制约束,并量化了预训练表征所能提供的与上下文学习所能利用之间的差距。

英文摘要

Large language models (LLMs) exhibit remarkable flexibility: they can adapt to novel tasks from in-context examples without any parameter updates, a capability known as in-context learning (ICL). Prior work on synthetic tasks has shown that ICL can implement specific algorithms, demonstrating architectural competence, and mechanistic analyses have identified key circuits that support this behavior. However, because in-context computation -- regardless of its algorithmic form -- relies on transformations in high-dimensional representation space, it remains unclear how the geometry of that space shapes ICL effectiveness. Motivated by the neuroscience view of classification as the untangling of neural representations, we hypothesize that ICL depends on the successful online untangling of task-relevant representations. To test this idea, we study how LLMs classify in-context examples whose labels are defined by the model's own internal representations with known structure. We show that ICL performance correlates systematically with the representational structure of the underlying classification task and that successful ICL is accompanied by geometric reorganization that increases online separability. We further find that LLM behavior is well described by a prototype-like algorithm that integrates evidence while reshaping representations to support classification. These findings offer a geometric account of ICL in pretrained LLMs, establish representational geometry as a mechanistic constraint on ICL, and quantify the gap between what pretrained representations afford and what in-context learning can exploit.

2605.27928 2026-05-29 physics.bio-ph q-bio.BM

Experimental Collapse in Virophysics: Protocol-Resolved Observation, Inference, and Plaque-Assay Blindness

病毒物理学中的实验坍缩:协议分辨的观测、推断与噬斑测定盲区

Lillian St. Kleess

AI总结 本文提出病毒物理学中的实验坍缩框架,通过协议分辨的观测算子形式化病毒测量中协议条件对潜在病毒-环境系综的投影,并以噬斑测定为例揭示测量偏差与互补推断方法。

详情
Comments
174 Pages, 8 Figures
AI中文摘要

病毒学测量常被视为对病毒粒子结构、力学、介电响应、感染性或滴度的报告。实际上,实验观测的是更丰富的潜在病毒-环境系综的协议条件投影。本文将这一过程定义为协议分辨病毒物理学中的实验坍缩。其核心对象是包含空结果的观测算子 $P_{\mathrm{obs},t}^{\varnothing}(\,\cdot\mid E\,) = \mathcal{M}_{E,t}^{\varnothing}P_{\mathrm{ref},t}$,它将参考潜在系综映射为由协议 $E$ 生成的观测系综,包括空结果。该公式分离了潜在状态变换、检测加权、读出和非观测,使协议效应成为显式成分而非偏差项。该框架引入了协议条件潜在系综、坍缩泛函、协议盲区、观测等价性、Fisher信息可观测性、逆推断和多协议一致性。它识别了坍缩机制,包括制备、表面固定、机械加载、场导向、介质过滤、扩增、删失和检测阈值。作为实例,噬斑测定估计的是有效协议条件感染浓度 $\Lambda_{\mathrm{PFU}}=\int_\Psi\pi_{\mathrm{PFU}}(x;E_{\mathrm{PFU}})n_{\mathrm{ref}}(x)\,dx$,而非总粒子浓度。这恢复了稀释条件下的泊松噬斑计数模型和PFU滴度公式;扩展到过度分散、零膨胀、噬斑合并、终点稀释、中和和形态增强读出,将偏差重新解释为协议条件信息。因此,病毒学数据是显式协议核的输出,阐明了测量报告了什么、遗漏了什么,以及互补测定如何推断隐藏的潜在病毒粒子结构。

英文摘要

Virological measurements are often treated as reports of virion structure, mechanics, dielectric response, infectivity, or titer. In practice, an experiment observes a protocol-conditioned projection of a richer latent virion--environment ensemble. This paper defines this process as experimental collapse within protocol-resolved virophysics. Its central object is the null-inclusive observation operator $P_{\mathrm{obs},t}^{\varnothing}(\,\cdot\mid E\,) = \mathcal{M}_{E,t}^{\varnothing}P_{\mathrm{ref},t}$, which maps a reference latent ensemble to the observed ensemble generated by protocol $E$, including null outcomes. The formulation separates latent-state transformation, detection weighting, readout, and non-observation, making protocol effects explicit components rather than bias terms. The framework introduces protocol-conditioned latent ensembles, collapse functionals, protocol blindness, observation equivalence, Fisher-information observability, inverse inference, and multi-protocol consistency. It identifies collapse mechanisms including preparation, surface immobilization, mechanical loading, field steering, medium filtering, amplification, censoring, and detection thresholds. As a worked example, the plaque assay estimates an effective protocol-conditioned infectious concentration $Λ_{\mathrm{PFU}}=\int_Ψπ_{\mathrm{PFU}}(x;E_{\mathrm{PFU}})n_{\mathrm{ref}}(x),dx$, rather than total particle concentration. This recovers the Poisson plaque-count model and PFU titer formula in the dilute regime; extensions to overdispersion, zero inflation, plaque merging, endpoint dilution, neutralization, and morphology-augmented readouts recast deviations as protocol-conditioned information. Thus, virological data are outputs of explicit protocol kernels, clarifying what measurements report, miss, and how complementary assays can infer hidden latent virion structures.

2605.27480 2026-05-29 q-bio.OT cs.AI cs.CY

BIRDS: Characterizing and Understanding Biodiversity Impact of Large Language Model Serving

BIRDS:表征与理解大语言模型服务对生物多样性的影响

Tianyao Shi, Yi Ding

AI总结 提出BIRDS框架,通过定义请求级功能单元、量化运营与隐含生物多样性影响,并引入质量归一化生物多样性影响(QNBI),揭示大规模LLM服务对生态系统的累积影响及质量感知的服务权衡。

详情
Comments
21 pages, 27 figures, 9 tables
AI中文摘要

大语言模型(LLM)服务产生的环境影响不仅限于碳和水,还包括通过生物多样性相关途径造成的生态系统破坏。我们提出了BIRDS,一个用于请求驱动型LLM服务的生物多样性影响框架。BIRDS定义了请求级功能单元,量化了运营和隐含的生物多样性影响,并引入了质量归一化生物多样性影响(QNBI)来联合分析生态影响和响应质量。在不同的工作负载、模型、GPU和区域中,BIRDS揭示了生物多样性影响在大规模下累积,并暴露了可操作的质量感知服务权衡。

英文摘要

Large language model (LLM) serving creates environmental impacts beyond carbon and water, including ecosystem damage through biodiversity-related pathways. We present BIRDS, a framework for Biodiversity Impact of Request-Driven LLM Serving. BIRDS defines request-level functional units, quantifies operational and embodied biodiversity impact, and introduces Quality-Normalized Biodiversity Impact (QNBI) to jointly analyze ecological impact and response quality. Across diverse workloads, models, GPUs, and regions, BIRDS reveals that biodiversity impact accumulates at scale and exposes actionable quality-aware serving tradeoffs.

2605.18587 2026-05-29 q-bio.GN cs.LG

PACE: Geometry-Aware Bridge Transport for Single-Cell Trajectory Inference

PACE: 几何感知的桥梁传输用于单细胞轨迹推断

Chenglei Yu, Chuanrui Wang, Bangyan Liao, Tailin Wu

AI总结 针对单细胞轨迹推断中异步发育导致的错位问题,提出PACE框架,通过构建各向异性黎曼度量、交替优化跨时间耦合与神经桥梁、蒸馏全局速度场,在七个数据集上平均降低MMD、Wasserstein-1和Wasserstein-2距离23.7%。

详情
Comments
31 pages, 12 figures
AI中文摘要

基于破坏性时间序列快照的单细胞轨迹推断本质上是病态的:既未观察到跨时间细胞对应关系,也未观察到连续轨迹,因此仅凭快照分布无法唯一确定底层动力学。现有的最优传输和基于流的方法通常根据观察到的时钟时间通过欧几里得邻近性耦合细胞,当发育异步且在同一实验时间采样的细胞处于不同潜在伪时间阶段时,这可能导致轨迹错位。我们提出PACE,一个轨迹推断框架,通过三个耦合组件从破坏性时间序列快照中恢复几何一致的连续传输动力学。首先,PACE构建一个状态和时间依赖的各向异性黎曼度量,沿局部支持的切向方向分配低传输成本,同时惩罚法向速度分量。其次,它在诱导路径作用成本下交替优化跨时间耦合,并拟合相邻快照之间保持端点的神经桥梁。第三,它将学习到的桥梁动力学蒸馏为细胞状态上的全局连续时间速度场。在涵盖九个保留重建实验的七个受控和生物数据集上,PACE实现了最强的整体重建性能,相对于最强竞争基线,平均降低了MMD、Wasserstein-1距离和Wasserstein-2距离23.7%。在胚状体分化基准上,PACE还将RNA速度对齐提高了15.4%,且在训练过程中不需要显式的细胞配对、谱系追踪或RNA速度监督。代码可在https://github.com/AI4Science-WestlakeU/PACE获取。

英文摘要

Single-cell trajectory inference from destructive time-course snapshots is fundamentally ill-posed: neither cross-time cell correspondences nor continuous trajectories are observed, so the snapshot distributions alone do not uniquely determine the underlying dynamics. Existing optimal transport and flow-based methods typically couple cells by Euclidean proximity at observed clock times, which can misalign trajectories when development is asynchronous and cells sampled at the same experimental time occupy different latent pseudotime stages. We propose PACE, a trajectory inference framework that recovers geometry-consistent continuous transport dynamics from destructive time-course snapshots through three coupled components. First, PACE constructs a state- and time-dependent anisotropic Riemannian metric that assigns low transport cost along locally supported tangent directions while penalizing normal velocity components. Second, it alternates between refining cross-time couplings under the induced path-action cost and fitting endpoint-preserving neural bridges between adjacent snapshots. Third, it distills the learned bridge dynamics into a global continuous-time velocity field over cellular states. Across seven controlled and biological datasets covering nine held-out reconstruction experiments, PACE achieves the strongest overall reconstruction performance, reducing MMD, Wasserstein-1 distance, and Wasserstein-2 distance by 23.7% on average relative to the strongest competing baseline. PACE also improves RNA-velocity alignment by 15.4% on an embryoid body differentiation benchmark, without requiring explicit cell pairing, lineage tracing, or RNA-velocity supervision during training. Code is available at https://github.com/AI4Science-WestlakeU/PACE.

2605.03219 2026-05-29 physics.bio-ph q-bio.TO

The Incommensurability Principle in Biological Transport

生物输运中的不可公度性原理

Riccardo Marchesi

AI总结 本文提出一个网络级最小最大原理,证明哺乳动物血管树的分支指数普适性源于结构异质性和拓扑刚性,并通过粘性-惯性能量划分的双阈值框架解释了其发育稳定性。

详情
Comments
46 pages, 2 figures. Paper IV in a series on branching transport networks, Supplemental Material available; see also arXiv:2603.13687 (Paper I), arXiv:2603.14691 (Paper II), and arXiv:2604.10476 (Paper III). Zenodo: https://doi.org/10.5281/zenodo.20030962
AI中文摘要

为什么哺乳动物血管树在体质量跨越$10^7$倍范围内,尽管从粘性主导输运到波动主导输运存在根本性转变,却仍保持一个保守的分支指数$α^* \approx 2.72$?我们证明这种普适性不能源于局部优化:任何不可公度成本的连接级耦合都需要在整个层级中变化$O(10^2$--$10^3)$倍的尺度依赖微调。真实网络通过结构异质性解决这一问题,而血管几何作为网络级最小最大原理的无标度吸引子出现。将适应度代价基于ATP化学计量,我们证明了一个拓扑刚性定理:最优分支指数仅依赖于无量纲结构参数$(G, N, p, α_w)$,与所有代谢量无关。粘性-惯性能量划分的自洽条件产生了一个双阈值框架,其中$\mathrm{Wo}_c^{\mathrm{fluid}} = \sqrt{3}$和$\mathrm{Wo}_c^{\mathrm{wave}} = 3/\sqrt{2}$。对称模型给出$α^*_{\mathrm{model}} \approx 2.626$,与异速生长转变附近的哺乳动物一致;形态测量异质性将大型哺乳动物的值移向$2.72$。该框架解释了心血管网络的发育稳定性,作为架构与生物化学解耦的结果。

英文摘要

Why does the mammalian vascular tree maintain a conserved branching exponent $α^* \approx 2.72$ across a $10^7$-fold range in body mass, despite a fundamental shift from viscous to wave-dominated transport? We prove this universality cannot emerge from local optimization: any junction-level coupling of incommensurable costs requires scale-dependent fine-tuning varying by $O(10^2$--$10^3)$ across the hierarchy. Real networks resolve this through structural heterogeneity, and vascular geometry emerges as a scale-free attractor of a network-level minimax principle. Grounding the fitness penalty in ATP stoichiometry, we prove a Topological Rigidity theorem: the optimal branching exponent depends only on dimensionless structural parameters $(G, N, p, α_w)$, independent of all metabolic quantities. A self-consistency condition on the viscous--inertial energy partition yields a dual-threshold framework with $\mathrm{Wo}_c^{\mathrm{fluid}} = \sqrt{3}$ and $\mathrm{Wo}_c^{\mathrm{wave}} = 3/\sqrt{2}$. The symmetric model yields $α^*_{\mathrm{model}} \approx 2.626$, in agreement with mammals near the allometric transition; morphometric heterogeneities shift large-mammal values toward $2.72$. The framework explains developmental stability of cardiovascular networks as a consequence of architecture being decoupled from biochemistry.

2604.01187 2026-05-29 q-bio.PE physics.bio-ph

Competition at the front of expanding populations

扩张种群前沿的竞争

Sergio Eraso, Mehran Kardar

AI总结 通过耦合Fisher方程和KPZ方程,研究扩张种群前沿的竞争机制,发现空间扩张能力可超越生殖优势,且适应度变异可用Tracy-Widom分布描述。

详情
Comments
17 pages, 8 figures
AI中文摘要

当竞争物种进入新领地时,种群由扩张前沿成功祖先的后代主导。成功的祖先既取决于生殖优势(适应度),也取决于殖民新领域的能力和机会。我们提出了一个模型,通过将经典的一维竞争描述(Fisher方程)与前沿形状的最小模型(KPZ方程)耦合,整合了这两个要素。这些方程的宏观表现是由扩张速率、竞争能力或空间各向异性控制的不同生长形态。在某些情况下,空间扩张能力可能克服生殖优势,在殖民新领地中占据上风。当新性状伴随累积突变出现时,我们发现范围扩张中适应度的变异可以用Tracy-Widom分布描述。

英文摘要

When competing species grow into new territory, the population is dominated by descendants of successful ancestors at the expansion front. Successful ancestry depends on both the reproductive advantage (fitness), as well as ability and opportunity to colonize new domains. We present a model that integrates both elements by coupling the classic description of one-dimensional competition (Fisher equation) to the minimal model of front shape (KPZ equation). Macroscopic manifestations of these equations are distinct growth morphologies controlled by expansion rates, competitive abilities, or spatial anisotropy. In some cases the ability to expand in space may overcome reproductive advantage in colonizing new territory. When new traits appear with accumulating mutations, we find that variations in fitness in range expansion may be described by the Tracy--Widom distribution.

2602.06606 2026-05-29 q-bio.PE math.DS

Multiple timescales in collective motion: daily and intraday upstream fish migration focusing on Feller condition

集体运动中的多时间尺度:基于Feller条件的日间和日内鱼类上游迁移

Hidekazu Yoshioka

AI总结 利用扩散桥(非线性随机微分方程)统一建模日间和日内鱼类迁移,通过Feller条件揭示不同时间尺度下迁移现象的随机性与间歇性差异。

详情
Comments
Updated on May 6 2026
AI中文摘要

鱼类迁移是一种具有多时间尺度的集体现象,范围从日间到日内(每小时甚至更细)。我们提出了一种统一的数学方法,使用扩散桥(具有固定初始和终端条件的非线性随机微分方程)来建模日间和日内鱼类迁移现象。这些桥的漂移和扩散系数根据对鱼类计数数据拟合的时变参数化平均值和方差曲线确定,并严格保证了其解的唯一存在性。我们表明,扩散桥的样本路径根据Feller条件(即扩散与漂移大小之比)具有定性不同的性质。我们对日本香鱼(Plecoglossus altivelis altivelis)幼鱼上游迁移的应用研究阐明了日间和日内迁移现象的异同。特别是,我们讨论了日间和日内鱼类计数数据对应于不同的Feller指数,表明前者在定性上随机性更小且更具间歇性。本研究获得的结果表明,Feller条件可能作为评估不同时间尺度下香鱼迁移现象的有效工具。

英文摘要

Fish migration is a collective phenomenon that has multiple timescales, ranging from daily to intraday (hourly or even finer). We propose a unified mathematical approach using diffusion bridges, nonlinear stochastic differential equations with pinned initial and terminal conditions, to model both daily and intraday fish migration phenomena. Drift and diffusion coefficients of these bridges are determined based on time-dependent parameterized average and variance curves fitted against fish count data, with which the unique existence of their solutions is rigorously guaranteed. We show that sample paths of the diffusion bridges have qualitatively distinctive properties depending on the Feller condition, namely, the ratio between the sizes of diffusion and drift. Our application study about the juvenile upstream migration of Plecoglossus altivelis altivelis (Ayu) in Japan clarifies similarities and differences between daily and intraday migration phenomena. Particularly, we discuss that the daily and intraday fish count data correspond to distinctive Feller indices, showing that the former is qualitatively less randomized and intermittent. The results obtained in this study suggest that the Feller condition potentially serves as an effective tool for evaluating fish migration phenomena of Ayu across different timescales.

2602.06282 2026-05-29 cs.CV q-bio.QM

An Interpretable Vision Transformer as a Fingerprint-Based Diagnostic Aid for Kabuki and Wiedemann-Steiner Syndromes

一种可解释的基于指纹的视觉Transformer辅助诊断Kabuki和Wiedemann-Steiner综合征

Marilyn Lionts, Arnhildur Tomasdottir, Viktor I. Agustsson, Yuankai Huo, Hans T. Bjornsson, Lotta M. Ellingsen

AI总结 本研究提出一种基于视觉Transformer的深度学习模型,利用指纹图像区分Kabuki综合征(KS)和Wiedemann-Steiner综合征(WSS)患者与健康对照,并通过注意力可视化增强可解释性,为罕见遗传病的非侵入性诊断提供新工具。

详情
AI中文摘要

Kabuki综合征(KS)和Wiedemann-Steiner综合征(WSS)是罕见但不同的发育障碍,具有重叠的临床特征,包括神经发育迟缓、生长受限和持续性胎儿指尖垫。尽管基因检测仍是诊断的金标准,但由于基因检测和专业知识获取的障碍,许多KS或WSS患者仍未得到诊断。皮纹异常虽然是几种遗传综合征的既定标志,但在分子检测时代仍是一种未被充分利用的诊断信号。本研究提出一种基于视觉Transformer的深度学习模型,利用指纹图像区分KS和WSS患者与未受影响的对照组以及彼此。我们在三个二分类任务中评估模型性能。在三个分类任务中,模型在对照组vs. KS、对照组vs. WSS和KS vs. WSS上分别达到了0.80、0.73和0.85的AUC分数,相应的F1分数分别为0.71、0.72和0.83。除了分类,我们应用基于注意力的可视化来识别对模型预测最显著的指纹区域,增强了可解释性。总之,这些发现表明存在综合征特异性的指纹特征,证明了基于指纹的人工智能(AI)工具作为一种非侵入性、可解释且可获取的未来诊断辅助手段,用于早期诊断未充分诊断的遗传综合征的可行性。

英文摘要

Kabuki syndrome (KS) and Wiedemann-Steiner syndrome (WSS) are rare but distinct developmental disorders that share overlapping clinical features, including neurodevelopmental delay, growth restriction, and persistent fetal fingertip pads. While genetic testing remains the diagnostic gold standard, many individuals with KS or WSS remain undiagnosed due to barriers in access to both genetic testing and expertise. Dermatoglyphic anomalies, despite being established hallmarks of several genetic syndromes, remain an underutilized diagnostic signal in the era of molecular testing. This study presents a vision transformer-based deep learning model that leverages fingerprint images to distinguish individuals with KS and WSS from unaffected controls and from one another. We evaluate model performance across three binary classification tasks. Across the three classification tasks, the model achieved AUC scores of 0.80 (control vs. KS), 0.73 (control vs. WSS), and 0.85 (KS vs. WSS), with corresponding F1 scores of 0.71, 0.72, and 0.83, respectively. Beyond classification, we apply attention-based visualizations to identify fingerprint regions most salient to model predictions, enhancing interpretability. Together, these findings suggest the presence of syndrome-specific fingerprint features, demonstrating the feasibility of a fingerprint-based artificial intelligence (AI) tool as a noninvasive, interpretable, and accessible future diagnostic aid for the early diagnosis of underdiagnosed genetic syndromes.

2601.10912 2026-05-29 q-bio.NC eess.IV q-bio.QM

Graph Neural Network Reveals the Cortical Morphology of Local Brain Aging in Normal Cognition and Alzheimer's Disease

图神经网络揭示正常认知与阿尔茨海默病中局部脑老化的皮层形态学

Samuel D. Anderson, Jordan Jomsky, Nikhil N. Chaudhari, Nahian F. Chowdhury, Xiaoyu, Zheng, Andrei Irimia, Alzheimers Disease Neuroimaging Initiative

AI总结 提出图神经网络(GNN)利用皮层形态学特征(厚度、面积、曲率、灰白质强度比、沟深)估计局部脑龄(LBA),在ADNI数据集上比现有方法误差更低,并发现与阿尔茨海默病病理相关的区域老化模式。

详情
Comments
Code and supplementary tables are available at https://github.com/irimia-laboratory/Graph_UNet
AI中文摘要

从T1加权磁共振图像(MRI)估计脑龄(BA)为量化解剖学脑老化提供了强大框架。全局BA(GBA)总结整体脑健康,而局部BA(LBA)在个体水平提供皮层特异的老化模式。尽管以往研究探讨了GBA的解剖学贡献,但据我们所知,尚无框架利用皮层形态学估计LBA。为填补这一空白,我们引入图神经网络(GNN),使用形态学特征——皮层厚度、表面积、曲率、灰/白质强度比(GWR)、沟深——以高空间分辨率(平均顶点间距离=1.37 mm)估计整个皮层表面的LBA。该模型在认知正常(CN)成人(N=14,423)的MRI提取的皮层表面网格上训练,在ADNI数据集上实现了比现有最先进方法更低的平均绝对误差(MAE),同时识别出阿尔茨海默病(AD)中更生物学合理的老化模式。联合皮层是CN中形态学老化的主要部位,而轻度认知障碍的特征是广泛老化,在海马旁回尤为显著。AD受试者表现出整个皮层显著老化,尤其在内侧颞叶区域及相关皮层网络。特征消融突出显示曲率和GWR对AD病理特别敏感。区域LBA差距与AD相关认知障碍的神经心理学测量显著相关,将皮层老化模式与临床结果联系起来。这些结果表明,基于GNN的皮层形态学建模能够实现局部脑老化的生物学可解释映射,且可解释性优于先前工作。

英文摘要

Estimating brain age (BA) from T1-weighted magnetic resonance images (MRIs) provides a powerful framework for quantifying anatomical brain aging. Whereas global BA (GBA) summarizes overall brain health, local BA (LBA) provides cortically specific patterns of aging at the subject level. Although previous studies have examined anatomical contributors to GBA, to our knowledge, no framework has been established to estimate LBA using cortical morphology. To address this gap, we introduce a graph neural network (GNN) that uses morphometric features$\unicode{x2013}$cortical thickness, surface area, curvature, gray/white matter intensity ratio (GWR), sulcal depth$\unicode{x2013}$to estimate LBA across the cortical surface at high spatial resolution (mean inter-vertex distance = 1.37 mm). Trained on cortical surface meshes extracted from the MRIs of cognitively normal (CN) adults (N = 14,423), our model achieves lower mean absolute error (MAE) than the existing state-of-the-art while identifying more biologically plausible patterns of aging in Alzheimer's disease (AD) on the ADNI dataset. Association cortices emerge as primary sites of morphometric aging in CNs, whereas mild cognitive impairment is characterized by widespread aging that is pronounced in the parahippocampal gyrus. AD subjects demonstrate significant aging across the entire cortex, particularly within medial temporal regions and associated cortical networks. Feature ablation highlights curvature and GWR as preferentially sensitive to AD pathology. Regional LBA gaps are significantly associated with neuropsychological measures of AD-related cognitive impairment, linking cortical aging patterns to clinical outcomes. These results demonstrate that GNN-based modeling of cortical morphometry enables biologically interpretable mapping of local brain aging with greater interpretability than prior work.