arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.13750 2026-05-14 physics.soc-ph cs.GT q-bio.PE

The Co-evolution of Costly Signaling and Cooperation in Social Dilemmas

Mahdi Abolhasani, Saman Moghimi-Araghi, Mohammad Salahshour

AI总结 该研究探讨了在社会困境中,昂贵合作行为与昂贵信号机制如何共同演化的问题。通过构建一个信号与行为相互影响的模型,研究发现信号的演化更多取决于其引发的合作反应,而非单纯的信号成本。研究显示,在不同博弈场景和群体结构下,信号机制能够促进合作的维持,尤其在空间结构中合作更易增强,而在囚徒困境中则需引入动态关联因素来解释其演化机制。

Comments 25 pages, 9 figures

详情
英文摘要

Costly cooperation and costly signaling are both difficult to reconcile with simple fitness maximization, yet both are common in biological and social systems. We study a model in which agents emit costly signals and condition their actions on the signals they observe. Across the Prisoner's Dilemma (PD), Snowdrift (SD), and Stag Hunt (SH) games, we ask when this coevolutionary process can sustain cooperation and how it changes across well-mixed populations, spatial lattices, and fluctuating strategic environments. The simulations show that signals are selected less by their raw production costs than by the cooperative responses they currently elicit. In well-mixed populations, the mechanism sustains partial cooperation in PD and SD and drives near-complete cooperation in SH. On lattices, cooperation is strengthened further by local assortment. A reduced mean-field analysis explains why average population feedback is already sufficient in SD and SH, but not in the PD. To account for the PD dynamics, the reduced theory must include transient correlations associated with rare signals, inheritance, or spatial clustering. Our results therefore delineate a class of settings in which costly signals persist because they transiently organize cooperative responses and thereby reshape the effective strategic environment.

2605.13675 2026-05-14 cs.CV cs.LG q-bio.NC

Characterizing Universal Object Representations Across Vision Models

Florian P. Mahner, Johannes Roth, Ka Chun Lam, Michael F. Bonner, Francisco Pereira, Martin N. Hebart

AI总结 本研究探讨了不同架构、目标函数和数据集训练的深度神经网络在视觉表征上的收敛现象,旨在揭示模型实际收敛于哪些视觉属性以及影响这一收敛的因素。通过将162个多样化视觉模型的对象相似性结构分解为少量非负维度,并分析这些维度在模型间的重复出现情况,研究发现部分维度具有跨模型的普遍性,且更易解释、更受图像语义属性驱动。研究还表明,模型的普遍性维度与灵长类动物视觉皮层活动和人类相似性判断的预测能力更强,暗示了这种普遍性可能反映了与生物视觉相关的表征特性。

详情
英文摘要

Deep neural networks trained with different architectures, objectives, and datasets have been reported to converge on similar visual representations. However, what remains unknown is which visual properties models actually converge on and which factors may underlie this convergence. To address this, we decompose the object similarity structure of 162 diverse vision models into a small set of non-negative dimensions. To determine universal versus model-specific dimensions, we then estimate how often each dimension reappears across models. In contrast to model-specific dimensions, universal dimensions are more interpretable and more strongly driven by conceptual image properties, indicating the relevance of interpretability and semantic content as implicit factors driving universality across models. Differences in architecture, objective function, training data, model size, and model performance do not explain the emergence of universal dimensions. However, models with more universal dimensions also better predict macaque IT activity and human similarity judgments, suggesting that universality reflects representations relevant to biological vision. These findings have important implications for understanding the emergent representations underlying deep neural network models and their alignment with biological vision.

2605.13535 2026-05-14 q-bio.PE

Shared quasispecies architecture in experimental and natural RNA virus populations

Samuel Martínez-Alcalá, Iker Atienza-Diez, Pilar Somovilla, Brenda Martínez-González, Celia Perales, Luis F. Seoane, Ester Lázaro, Susanna Manrubia

AI总结 该研究探讨了RNA病毒群体在基因组序列空间中的结构组织,比较了实验条件下进化的大肠杆菌噬菌体Qβ和自然宿主中进化的SARS-CoV-2的基因型网络架构。通过深度测序数据重建病毒群体中突变耦合变异体的网络结构,发现尽管两者在基因组大小、突变率和生态环境上差异显著,但均表现出相似的层次化结构:一个高度丰富的中心基因型被不同丰度的变异体环绕。这一共同的基因型网络架构表明,RNA病毒可能共享由序列空间基本性质和复制突变机制决定的普遍组织模式,为理解病毒进化可预测性提供了新的视角。

Comments 19 pages (main ms, 5 figures, 1 table) + 4 pages (supplementary information, 3 figures)

详情
英文摘要

RNA viruses form genetically diverse populations structured as mutant spectra, or quasispecies, whose internal organization influences their evolutionary and adaptive dynamics. While genetic diversity has been extensively characterized, the structural organization of viral populations in sequence space remains less explored. Here, we compare genotype network architectures in two RNA viruses with markedly different evolutionary contexts: bacteriophage $Qβ$ evolving in controlled laboratory conditions and SARS-CoV-2 evolving within infected human hosts. Using deep sequencing data, we reconstruct the genotype network of mutationally coupled variants within viral populations and analyze their topological properties. Despite large differences in genome size, mutation rate, and ecological setting, both viruses exhibit a common organization: a highly abundant central haplotype surrounded by layers of variants of diminishing abundance as Hamming distance to the central haplotype increases. All reconstructed networks share qualitative and quantitative topological features, displaying a hierarchical structure. The robust organization of both populations under multiple conditions suggests that RNA viruses may share a common genotype network architecture governed by fundamental properties of sequence space and the generic mechanisms of replication and mutation. Genotype networks provide a unifying framework to describe viral population structure beyond conventional diversity measures and, by revealing how local constraints shape mutational search, offers insights into the predictability of viral evolution.

2605.13504 2026-05-14 stat.ME math.AP math.DS math.PR q-bio.QM

Structural identifiability of partially-observed stochastic processes: from single-particle trajectories to total particle density data

Arianna Ceccarelli, Alexander P. Browning, Ruth E. Baker

AI总结 本文研究了部分观测随机过程的结构可识别性问题,探讨了在单粒子轨迹数据和总粒子密度数据下参数能否唯一确定。作者提出了一种适用于时空随机过程的方法,针对轨迹数据采用个体模型描述,针对密度数据建立偏微分方程模型并结合微分代数方法进行分析,同时引入基于特征方程的初始条件分析方法,揭示了初始条件对可识别性的重要影响,并通过实例展示了该方法在识别参数组合上的有效性。

Comments Main: 26 pages, 4 figures Supplementary Information: 20 pages, 5 figures

详情
英文摘要

The increasing availability of experimental data has intensified interest in calibrating stochastic models, raising fundamental questions about parameter identifiability. Structural identifiability determines whether parameters can be uniquely recovered from idealised, noise-free data, a prerequisite to allow for parameter estimation. However, existing methods to assess structural identifiability are not generally applicable to stochastic processes. We develop a methodology to analyse structural identifiability for a class of spatio-temporal stochastic processes. We investigate how identifiability depends on the type of available data, distinguishing between single-particle trajectories and total particle density measurements. For trajectory data, we use the individual-based model description that explicitly represents single-particle dynamics. For population-level data, we derive a partial differential equation model representation, that describes the evolution of total particle density, and apply a differential algebra approach, common to ordinary differential equations analysis. We further introduce a novel method to study the initial condition, based on characteristic equations to construct a Taylor expansion of the density evolution, enabling identification of additional identifiable parameter combinations. We apply our methodology to a model, and show it is identifiable with trajectory data but only locally identifiable with density data, and demonstrate the critical role of initial conditions in the identifiability analysis.

2605.13364 2026-05-14 q-bio.BM

Predicting Endocrine Disruptors: A Deep Learning QSAR Model for Estrogen Receptor Activity

Belaguppa Manjunath Ashwin Desai, Shreyas Murthy, Bhoomika Sridhar, Anirudh Belaguppa Manjunath, Vivien Humtsoe, Pronama Biswas

AI总结 该研究针对内分泌干扰物(EDCs)对健康和生态系统的潜在威胁,开发了一种基于深度学习的QSAR模型,用于预测化合物对雌激素受体的结合活性。研究利用包含224种化合物和2,944个分子描述符与指纹的数据库,训练了一个结合Dropout和批归一化的深度神经网络,模型在测试集上表现出较高的准确率和良好的分类性能。通过分子对接验证,部分预测化合物被证实具有与雌二醇相似的受体结合能力,该模型为快速筛选潜在EDCs提供了有效工具,有助于提升化学品风险评估效率和生物多样性保护。

Comments Copyright IEEE 2026. Permission from IEEE must be obtained for all other uses, including reprinting/republishing for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work. DOI: 10.1109/B-HTC67770.2026.11502197

详情
Journal ref
2026 IEEE Bangalore Humanitarian Technology Conference (B-HTC)
英文摘要

Endocrine-disrupting chemicals (EDCs) threaten human health, ecosystems, and biodiversity by interfering with hormonal signaling pathways conserved across vertebrates. Traditional in vivo assays are costly and time-consuming, limiting their capacity to screen the growing number of chemicals. To address this, we developed a deep learning-based QSAR model to predict estrogen receptor (ER) binding molecules. Using a curated dataset of 224 compounds and 2,944 molecular descriptors and fingerprints, a deep neural network (DNN) incorporating dropout and batch normalization was trained and validated. The model achieved training and test accuracies of 96.65% and 91.30%, respectively, with an ROC-AUC of 0.81, a precision of 0.82, and a recall of 0.88 for the active class. Molecular docking against estrogen receptor (PDB ID: 5TOA) confirmed that several predicted compounds exhibited binding comparable to Estradiol, sharing key interactions. This model enables rapid screening of potential EDCs, supporting efficient chemical risk assessment and contributing to biodiversity conservation by identifying compounds that may disrupt reproduction and population stability in humans and wildlife.

2605.09597 2026-05-14 cs.SI q-bio.QM

Interactively visualizing biological multilayer networks using MiRA

Shir Miryam Nehoray, Yuval Bloch, Shai Pilosof

AI总结 本文介绍了一种名为 MiRA 的浏览器端交互式可视化工具,用于展示生物多层网络。该工具提供了七种互补的可视化模式和交互功能,帮助研究人员和教育者更直观地探索复杂多层网络的结构与特性。MiRA 的提出填补了生物多层网络可视化工具的空白,为相关研究提供了有效的支持。

Comments 7 pages, 2 figures. For the application page, see https://mira.ecomplab.com/

详情
英文摘要

Multilayer networks are widely used across biology to represent systems in which complex networks vary across space, time, or interaction types. However, interactive visualization tools remain limited. We present MiRA (Multilayer Interactive Rendering Application), a browser-based, installation-free web application for visualizing biological multilayer networks. MiRA offers seven complementary visualization modes and interactive features that enable researchers to visually navigate the high complexity of multilayer networks for research and education.

2605.07433 2026-05-14 q-bio.MN cs.LG cs.LO

Inference of Qualitative Models from Steady-State Data via Weighted MaxSMT

Ondřej Huvar, Nikola Beneš, Martin Jonáš, David Šafránek, Samuel Pastva

AI总结 该研究提出了一种基于加权MaxSMT的鲁棒方法,用于从稳态数据中推断定性生物模型。该方法通过将不确定的生物观测表示为带权重的软约束,能够在存在冲突约束的情况下,识别出最符合观测的模型。研究支持布尔和多值变量域,并结合离散化和差异表达约束,成功应用于从先验知识网络中推断神经细胞分化模型。

详情
英文摘要

Qualitative models provide crucial instruments for modelling complex biological systems. While advances in automated reasoning and symbolic encodings have enabled rigorous inference of these models from data, the process remains highly fragile. First, biological measurement errors inevitably propagate into formal model specifications. Second, when a specification becomes unsatisfiable, distinguishing between fundamental design flaws and minor technical errors is notoriously difficult. This uncertainty often leads to under-specification, as it is unclear which observations are still ``safe'' to incorporate. To overcome these challenges, we introduce a robust inference method based on weighted MaxSMT. By encoding uncertain biological observations as weighted soft constraints, our approach enables the solver to identify a model best reflecting the observations, even with some conflicting constraints. Our method allows for Boolean and multi-valued variable domains, alongside observations derived from discretisation (level constraints) and differential expression (ordering constraints). We show our approach can be used to successfully infer neural cell differentiation models from prior-knowledge networks with 200--1300 genes using ordering constraints on all included genes.

2605.05778 2026-05-14 q-bio.QM cs.CG cs.NA math.NA

Planar morphometry via functional shape data analysis and quasi-conformal mappings

Hangyu Li, Gary P. T. Choi

AI总结 本文提出了一种新的平面形态度量方法FDA-QC,结合功能形状数据分析(FDA)和拟共形映射(QC),同时考虑平面形状的边界和内部特征,克服了传统方法仅关注单一特征或无法量化形状变化的局限。该方法通过平方根速度函数对闭合曲线进行表示与配准,并利用拟共形映射将边界对应关系扩展到整个平面区域,可实现形状变形与变化量化的统一框架。实验表明,该方法在叶片和昆虫翅膀等数据集上能更有效地捕捉形态变化,为理解平面生物形状的生长与演化提供了新途径。

详情
英文摘要

The study of shapes is one of the most fundamental problems in life sciences. Although numerous methods have been developed for the morphometry of planar biological shapes over the past several decades, most of them focus solely on either the outer silhouettes or the interior features of the shapes without capturing the coupling between them. Moreover, many existing shape mapping techniques are limited to establishing correspondence between planar structures without further allowing for the quantitative analysis or modelling of shape changes. In this work, we introduce FDA-QC, a novel planar morphometry method that combines functional shape data analysis (FDA) techniques and quasi-conformal (QC) mappings, taking both the boundary and interior of the planar shapes into consideration. Specifically, closed planar curves are represented by their square-root velocity functions and registered by elastic matching in the function space. The induced boundary correspondence is then extended to the entire planar domains by a quasi-conformal map, optionally with landmark constraints. Moreover, the proposed FDA-QC method can naturally lead to a unified framework for shape morphing and shape variation quantification. We apply the FDA-QC method to various leaf and insect wing datasets, and the experimental results show that the proposed combined approach captures morphological variation more effectively than purely boundary-based or interior-based descriptions. Altogether, our work paves a new way for understanding the growth and form of planar biological shapes.

2604.26070 2026-05-14 cs.LG math.OC math.ST q-bio.QM stat.TH

Observable Neural ODEs for Identifiable Causal Forecasting in Continuous Time

Jennifer Wendland, Nicolas Freitag, Maik Kschischo

AI总结 该论文研究了连续时间因果推理中的可识别性问题,针对存在隐藏混杂因素的动态决策场景,提出了可观测神经ODE(ObsNODE)模型。通过将控制理论中的可观测性概念与因果可识别性联系起来,论文推导出一种连续时间调整公式,并设计了能够从观测数据中重构潜在状态的神经ODE模型,从而实现对不同干预路径下结果的预测。实验表明,该方法在合成癌症数据、基于MIMIC-IV的半合成数据和真实脓毒症数据上均表现出优越的性能。

Comments 20 pages, 5 figures

详情
英文摘要

Causal inference in continuous-time sequential decision problems is challenged by hidden confounders. We show that, in latent state-space models with time-varying interventions, observability of the latent dynamics from observed data is necessary for identifying dynamic treatment effects, linking control-theoretic observability to causal identifiability, even when hidden confounders affect both treatments and outcomes. We derive a continuous-time adjustment formula expressing potential outcome distributions under treatment trajectories via the measurement model, latent dynamics, and the filtering distribution over latent states given observed histories. We propose Observable Neural ODEs (ObsNODEs), Neural ODE models in observable normal form for causal forecasting. ObsNODEs learn continuous-time dynamics with states reconstructible from observations, enabling outcome prediction under alternative treatment paths. Experiments on synthetic cancer data, semi-synthetic data based on MIMIC-IV, and real-world sepsis data show strong performance over recent sequence models.

2603.03337 2026-05-14 q-bio.NC

Does the motor cortex draw on a wire plane?

Patrick Iglesias-Zemmour

AI总结 本文探讨了人类运动控制中三分之二幂律的几何本质,指出其等价于等仿射速度恒定。传统微分几何中,等仿射度量并非张量,需限制对称群或引入额外结构才能恢复张量性质。本文提出一种新的几何框架——“线性微分学结构”,使等仿射度量在全微分同胚群下成为真正的协变三阶张量,无需限制对称性或附加结构。该方法基于运动皮层追踪曲线而非二维区域的特性,为运动原语的几何形式化提供了新视角。

Comments 7.33 pages. This note applies the framework of Diffeology (specifically the Wire Plane) to resolve the non-tensorial nature of the equi-affine metric in motor control

详情
英文摘要

The two-thirds power law of human motor control ($v \propto κ^{-1/3}$) is geometrically equivalent to constant equi-affine speed. In classical differential geometry, however, the equi-affine metric is not a tensor: it depends on acceleration, which does not transform covariantly under arbitrary coordinate changes. To recover tensorial behavior, one must either restrict the symmetry group to the affine group or introduce an affine connection -- sacrificing full diffeomorphism covariance. This article proposes a different geometric setting. We equip the Euclidean plane with the "wire diffeology', the smooth structure generated by all smooth curves. In this diffeological space, the equi-affine metric becomes a true covariant $3$-tensor under the **full** diffeomorphism group -- no restriction of symmetries, no additional structure required. The construction is motivated by a simple fact: the motor cortex traces curves, not two-dimensional patches. Accordingly, curves are taken as primitive, echoing the motor control literature in which movements are built from a repertoire of elementary building blocks -- motor primitives. The wire plane offers a geometric formalization of this idea in which the two-thirds power law emerges as a fully covariant invariant.

2602.00586 2026-05-14 q-bio.MN cs.AI cs.LG

RAG-GNN: Integrating Retrieved Knowledge with Graph Neural Networks for Precision Medicine

Hasi Hays, William J. Richardson

AI总结 该研究提出了一种名为 RAG-GNN 的端到端可训练框架,将图神经网络(GNN)与动态检索的生物医学文献知识相结合,以提升精准医学中的功能聚类性能。通过联合优化的检索投影、门控融合机制和对比对齐方法,RAG-GNN 在癌症信号通路案例中显著提升了功能聚类效果,并验证了检索信息对聚类一致性和内部紧密性的积极影响。实验表明,该方法在功能聚类任务上优于仅依赖图结构的传统方法,为精准医学中的知识整合提供了新思路。

详情
英文摘要

Network topology excels at structural predictions but fails to capture functional semantics encoded in biomedical literature. We present RAG-GNN, an end-to-end trainable retrieval-augmented graph neural network framework that integrates GNN representations with dynamically retrieved literature-derived knowledge through a jointly optimized retrieval projection, gated fusion mechanism, and contrastive alignment. In a cancer signaling case study (379 proteins, 3,498 interactions, 14 functional categories), RAG-GNN improves functional clustering from silhouette $= -0.237 \pm 0.065$ (GNN-only) to $-0.144 \pm 0.066$, a consistent improvement of $+0.093 \pm 0.022$ across 10 random seeds, while the learned retrieval achieves mean precision@10 $= 0.242$, a 152\% improvement over the random baseline ($0.096$). Heuristic information decomposition with bootstrap confidence intervals reveals that topology and retrieval encode overwhelmingly shared information (95.6\%), with retrieval improving both intra-cluster cohesion (silhouette) and cluster agreement (ARI $+0.021 \pm 0.015$). Counterfactual experiments confirm that adversarial, absent, and random retrieval all degrade performance, validating that the gated fusion mechanism depends on document content. Benchmarking against eight established embedding methods demonstrates task-specific complementarity: topology-focused methods achieve strong link prediction, while retrieval augmentation consistently improves functional clustering within the controlled GNN-only ablation. DDR1 subnetwork analysis provides confirmatory validation consistent with established synthetic lethality relationships. These results establish that topology-only and retrieval-augmented approaches serve complementary purposes for precision medicine applications.

2510.04698 2026-05-14 q-bio.NC cs.AI econ.TH

The Bayesian Origin of the Probability Weighting Function in Human Representation of Probabilities

Xin Tong, Thi Thu Uyen Hoang, Xue-Xin Wei, Michael Hahn

AI总结 人类在感知概率时普遍存在系统性的扭曲,表现为典型的反S型权重模式,但其成因长期未明。本文提出一种基于贝叶斯编码-解码的解释框架,认为概率通过带有噪声的内部信号进行编码,并通过最小化贝叶斯风险进行解码。研究发现,这种编码过程中的扭曲可分解为边界回归、似然排斥和先验吸引,从而预测出反S型权重模式源于编码精度的U型分布,即在概率接近0和1时更为敏感。实验结果表明,该框架能够从数据中自然恢复出U型编码结构,并在多个任务中优于传统确定性权重函数和其它模型。

详情
英文摘要

Humans systematically misrepresent probability in a stereotyped inverse-S pattern. It has been documented for decades, but its origin remains unexplained. We propose a Bayesian encoding-decoding account in which probabilities are represented by noisy internal signals and decoded by Bayes-risk minimization. For bounded probability stimuli, we show that distortion decomposes into boundary regression, likelihood repulsion, and prior attraction, yielding a key prediction: the classic inverse-S-shaped weighting pattern implies a U-shaped allocation of encoding precision with greater sensitivity near 0 and 1. Across judgment of relative frequency, lottery pricing, and risky choice, this U-shape is recovered from data without imposing any functional form on the encoding, and our framework outperforms deterministic weighting functions, bounded log-odds models, uniform-encoding Bayesian accounts, and matched efficient-coding models on held-out data. In a new dot probability estimation experiment with bimodal stimulus statistics, the recovered prior tracks the new distribution while the recovered encoding remains U-shaped. Together, these results identify the inverse-S-shaped probability weighting function as the joint product of a stable U-shaped encoding and a flexible prior, integrated by optimal Bayesian decoding.

2501.07738 2026-05-14 math.PR math-ph math.MP math.ST q-bio.PE stat.TH

Mixing time for an epidemic model on graphs with external sources of infection

Wasiur R. KhudaBukhsh, Yangrui Xiang

AI总结 本文研究了带有外部感染源的易感-感染-易感(SIS)传染病模型在图上的混合时间问题。作者在参数适当假设下,证明了该模型的混合时间与顶点数 $n$ 的数量级为 $Θ(n\log n)$。进一步地,他们在随机图家族(如 Erdős–Rényi 图、随机正则多重图和 Galton–Watson 树)上分析了该模型,证明在高概率下混合时间仍保持 $Θ(n\log n)$ 的数量级。

Comments improved results, minor typos fixed, 19 pages, no figures

详情
英文摘要

We study the mixing time of a Susceptible--Infected--Susceptible (SIS) model on graphs with external sources of infection, which we refer to as the noisy SIS model. Under suitable assumptions on the parameters of the dynamics, we show that the mixing time is of the order $Θ(n\log n)$ with respect to the number of vertices $n$. We further investigate the model on random graph families, including Erd{ö}s--R{é}nyi graphs, random regular multigraphs, and Galton--Watson trees. By identifying high-probability structural properties of these graphs and conditioning on typical realizations, we prove that the mixing time remains of order $Θ(n\log n)$ with high probability.

2412.20570 2026-05-14 cond-mat.soft math.AP q-bio.SC

Voltage laws in nanodomains revealed by asymptotics and simulations of electro-diffusion equations

Frédéric Paquin-Lefebvre, Alejandro Barea Moreno, David Holcman

AI总结 本研究旨在揭示纳米生理域内由离子电流驱动的局部电压分布特性,这对理解细胞活动具有重要意义。研究通过建立一种求解包含离子电流输入输出的泊松-能斯特-普兰克方程的方法,结合渐近展开和格林函数表示,推导出不同电荷条件下的离子分布和电压变化规律。研究还分析了表面曲率和窗口通道尺寸对电压动态的影响,并通过数值模拟验证了理论结果,为纳米域生理行为的表征提供了新的理论依据。

Comments 30 pages, 7 figures, 3 tables

详情
Journal ref
Multiscale Modeling and Simulation, 24(1), 365-397 (2026)
英文摘要

Characterizing the local voltage distribution within nanophysiological domains, driven by ionic currents through membrane channels, is crucial for studying cellular activity in modern biophysics, yet it presents significant experimental and theoretical challenges. Theoretically, the complexity arises from the difficulty of solving electro-diffusion equations in three-dimensional domains. Currently, there are no methods available for obtaining asymptotic computations or approximated solutions of nonlinear equations, and numerically, it is challenging to explore solutions across both small and large spatial scales. In this work, we develop a method to solve the Poisson-Nernst-Planck equations with ionic currents entering and exiting through two narrow, circular window channels located on the boundary. The inflow through the first window is composed of a single cation, while the outflow maintains a constant ionic density satisfying local electro-neutrality conditions. Employing regular expansions and Green's function representations, we derive the ionic profiles and voltage drops in both small and large charge regimes. We explore how local surface curvature and window channels size influence voltage dynamics and validate our theoretical predictions through numerical simulations, assessing the accuracy of our asymptotic computations. These novel relationships between current, voltage, concentrations and geometry can enhance the characterization of physiological behaviors of nanodomains.

2605.13315 2026-05-14 cs.ET cs.LG cs.NE cs.SY eess.SY q-bio.NC

Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven Validation

Johnson Zhou, Daniel Tanneberg, Forough Habibollahi, Alon Loeffler, Kiaran Lawson, Valentina Baccetti, Kwaku Dad Abu-Bonsrah, Candice Desouza, Finn Doensen, Bradley Watmuff, Daria Kornienko, Azin Azadi, Justin Leigh Bourke, Bernhard Sendhoff, Brett J. Kagan

AI总结 该研究提出了一种“具身神经计算”框架,旨在解决生物神经网络与传统硅基计算接口之间的最优编码与解码问题。通过在模拟环境中对生物神经网络代理进行闭环导航任务的参数优化,研究发现了12种能够稳定学习的配置,其任务表现优于相同交互预算下的硅基深度Q网络代理。该工作为基于生物神经网络的目标导向学习提供了基础,并推动了任务驱动神经计算和跨领域基准的建立。

详情
英文摘要

Biological neural networks (BNNs) have been established as a powerful and adaptive substrate that offer the potential for incredibly energy and data efficient information processing with distinct learning mechanisms. Yet a core challenge to utilizing BNN for neurocomputation is determining the optimal encoding and decoding mechanisms between the traditional silicon computing interface and the living biology. Here, we propose an Embodied Neurocomputation framework as a systems-level approach to this multi-variable optimization encoding/decoding problem. We operationalize this approach through the first large-scale parameter optimization of encoding configurations for a BNN agent performing closed-loop navigation along an odor-style gradient in a simulated grid-world. Despite the relative simplicity of the task, the biological interactions gave rise to a massive multi-combinatorial search space for optimal parameters. By considering how the components of the system are interconnected and parameterized, we evaluated approximately 1,300 parameter combinations, over 4,000 hours of real-time agent-environment interactions, to identify 12 configurations that consistently demonstrated learning across multiple episodes. These configurations achieved significantly higher task performances than optimized silicon-based DQN agents under the same interaction budget. These findings represent an initial step toward robust and scalable goal-oriented learning using BNNs. Our framework establishes a foundation for applying task-driven neurocomputing and supports the development of field-wide benchmarks. In the long term, this work supports the development of hybrid bio-silicon architectures capable of efficient, adaptive and real-time computation, including the potential for robotic control applications.

2605.13262 2026-05-14 cs.LG q-bio.QM

Chem-GMNet: A Sphere-Native Geometric Transformer for Molecular Property Prediction

Deepak Warrier, Raja Sekhar Pappala

AI总结 本文提出了一种名为Chem-GMNet的球面原生几何变换器,用于分子属性预测任务。该模型通过将传统变换器中的各个模块替换为基于球面几何的结构,充分利用了化学结构中的几何先验信息。实验表明,Chem-GMNet在参数更少的情况下取得了优于现有方法如ChemBERTa的性能,尤其在无需预训练的情况下也表现出色。

详情
英文摘要

Modern SMILES-based chemical language models obtain strong MoleculeNet performance by treating SMILES as generic text and compensating with multi-million-molecule self-supervised pretraining. We ask: when a domain carries structural priors as rich as chemistry's, does it warrant a domain-native transformer rather than a generic one rescued by scale? We answer affirmatively with \textbf{GM-Net} (Geometric Measure Network), a transformer family in which every module is replaced by a sphere-native counterpart, and instantiate it as \textbf{Chem-GMNet}. Three blocks follow: SH-Embedding (tokens as learnable directions on $S^{k-1}$ lifted through a Gegenbauer feature map); DualSKA (a per-head fusion of a linear-time gated Sphere-Flow recurrence whose persistent state we prove is the truncated multipole expansion of the input distribution, and a softmax Sphere-Kernel branch over the same Schoenberg-valid kernel); and SH-FFN (sphere projection $\to$ Gegenbauer lift $\to$ moment readout). On canonical DeepChem scaffold splits, against same-shape ChemBERTa-2 baselines under the chemberta3-faithful protocol: (i) random-initialised, Chem-GMNet wins on 7 of 10 MoleculeNet endpoints at $\sim\!35\%$ fewer parameters; (ii) pretrained on the same 10M-SMILES ZINC corpus as ChemBERTa-2 MLM-10M, it matches or beats the public release on 6 of 8 shared endpoints (5/7 excluding a known ClinTox release anomaly). A $(k,L)$ ablation shows that increasing the sphere dimension from $k\!=\!8$ to $k\!=\!10$ at fixed $L\!=\!3$ lowers ESOL RMSE to $0.938$ at scratch, beating pretrained ChemBERTa-2 MLM-10M on this endpoint without any pretraining at all.

2605.13234 2026-05-14 physics.bio-ph cond-mat.stat-mech q-bio.TO

A hyperbolic cell cycle law for early embryonic developmental timing

Adrián Aguirre-Tamaral, Johanna Royer, Magdalena Schindler-Johnson, Jun-Ru Lee, Daniel R. Amor, Nicoletta I. Petridou, Bernat Corominas-Murtra

AI总结 本研究揭示了早期胚胎发育中细胞周期长度随时间变化的普遍规律,发现尽管不同物种的发育速度和分子机制各异,但其细胞复制速度均呈现一致的减缓趋势。研究提出,这种现象源于母源资源有限消耗与生化反应动力学之间的耦合,导致细胞周期长度呈双曲增长,趋近于数学上的奇点,即发育停滞。通过多物种数据验证,该模型不仅定量描述了细胞数量随时间的变化、细胞大小对周期长度的影响,还准确预测了原肠形成的时间点,揭示了资源消耗速率是调控早期发育时序的关键机制。

Comments 10 pages, 3 figures

详情
英文摘要

Across metazoans, early embryos exhibit a strikingly conserved slowing down of their cell duplication speed, despite widely varying developmental paces and underlying molecular mechanisms. Here we show that this common behavior arises because early development unfolds along a biochemical rather than a chronological timescale, resulting from the coupling of finite maternal resource consumption to the Michaelis-Menten-like kinetics governing the rates of the biochemical reactions involved in cell duplication. This leads to a hyperbolic growth of the Cell Cycle Length (CCL), approaching a mathematical singularity, which would correspond to developmental arrest. Data from a wide range of organisms -- cnidarians, nematodes, arthropods, molluscs, echinoderms, tunicates, amphibians, and fish -- collapse on a single curve, quantitatively capturing not only a universal CCL dynamical behaviour, but also key hallmarks of early metazoan development, including cell-number temporal evolution, the dependency of CCL on cell size, and, remarkably, gastrulation timing at the predicted singularity. Crucially, experimental modulation of resource availability and consumption rates validate the model and further demonstrate that a source of heterochrony in early development is an altered biochemical timescale of resource depletion. Overall, this work reveals resource consumption rates as a fundamental mechanism driving developmental timing in early embryogenesis across species.

2605.13070 2026-05-14 q-bio.CB physics.bio-ph q-bio.TO

3D mechano-geometric multicellular model of apical stem cell-driven plant morphogenesis

Naoya Kamamoto, Koichi Fujimoto

AI总结 本文提出了一种基于三维力学与几何的多细胞模型,用于研究顶端干细胞驱动的植物形态建成过程。该模型结合了细胞力学、不可逆的细胞壁生长以及可变形组织结构,能够模拟细胞分裂方向对三维植物形态形成的影响。文章详细介绍了模型的构建方法,包括细胞的三角化薄壳表示、膨压处理、细胞壁弹性和应变驱动生长机制,以及支持多种分裂规则的细胞分裂算法和保持网格质量的重网格化操作,旨在为实验植物学家提供一个可访问且可定制的建模工具。

Comments 14 pages, 2 figures

详情
英文摘要

The orientation of cell division is a major determinant of three-dimensional plant morphogenesis. Whether and how a simple division orientation rule explains the establishment of symmetric body plans is a fundamental question. Testing such hypotheses is facilitated by a modeling framework that combines realistic three-dimensional cell mechanics, irreversible cell-wall growth, and a deformable tissue geometry. We recently introduced such a framework, a 3D mechano-geometric multicellular model of apical stem cell-driven morphogenesis. Here we document how the model is built from physiological and computational perspectives. We describe the triangulated thin-shell representation of cells, the treatment of turgor pressure, cell-wall elasticity and strain-driven wall growth, the cell-division algorithm together with its two pluggable division-rule implementations, and the remeshing operations that keep the triangulation well-conditioned as cells grow, divide, and deform. The aim of this paper is to make the present model accessible and customizable to experimental plant biologists.

2605.13009 2026-05-14 q-bio.PE physics.bio-ph

Conditioning as a route to stereotyped behavior in growing populations

Riccardo Ravasio, Kabir Husain, Constantine G. Evans, Rob Phillips, Marco Ribezzi-Crivellari, Jack W. Szostak, Arvind Murugan

AI总结 本文研究了在生物系统中,如何通过条件选择(conditioning)策略实现复杂多步骤过程的可重复行为。作者提出了一种不同于传统分子机制纠错的策略,即通过淘汰未在限定时间内完成任务的个体,使得存活下来的群体行为更加有序。研究表明,这种条件选择机制可以在无需微观控制的情况下,自发地形成时间层级结构,并在某些情况下自动促进最有序群体的增长,为实现可靠行为提供了一种简单而高效的途径。

详情
英文摘要

Biological systems perform complex multi-step processes in a reproducible way despite underlying stochasticity. The standard explanation is micromanagement by molecular machinery that recognizes and corrects specific errors. Here we study conditioning, a qualitatively different strategy in which attempts failing a coarse criterion are destroyed and do not leave a physical record. The surviving, i.e., conditioned, ensemble is narrower and therefore more ordered. We model conditioning through stochastic resets in a ''socks-before-shoes'' model of a growing population, where $n$ actions must be completed in any order to replicate and any replication attempt not finished by a threshold time is discarded. We find that resets impose hierarchical temporal ordering of the $n$ actions without microscopic control over which action happens when. When disorder carries a sufficient time penalty, this ordering is free: the fastest-growing population is automatically the most ordered, with no direct selection for order required. Save points, at which verified progress is preserved across resets, allow conditioning to scale to complex multi-step processes. Conditioning provides a minimal route to reliable behavior, requiring only a clock rather than molecular machinery that recognizes specific errors. For the right class of processes, it pays for itself.

2605.12999 2026-05-14 q-bio.NC cs.LG

Implicit Behavioral Decoding from Next-Step Spike Forecasts at Population Scale

John R. Minnick, Jesus Gonzalez-Ferrer, Kamran Hussain, Jinghui Geng, Ash Robbins, Mohammed A. Mostajo-Radji, David Haussler, Jason Eshraghian, Mircea Teodorescu

AI总结 该研究提出了一种基于单个Mamba模型的闭环脑机接口方法,能够在一次前向传播中同时预测神经群体活动和解码动物行为状态。通过在Neuropixels尺度的下一步尖峰计数上训练模型,并使用轻量级线性分类器读取模型预测的发放率,该方法在行为解码任务中表现优于直接使用原始尖峰数据的线性分类器。实验表明,该方法在视觉辨别任务中能够准确解码小鼠的选择和刺激侧,且在计算效率和解码性能上均优于传统方法。

Comments 21 pages, 6 figures, 5 tables; submitted to NeurIPS 2026 Neuroscience & Cognitive Science Track

详情
英文摘要

Closed-loop brain-computer interfaces often require both a forecast of upcoming neural population activity and a readout of the animal's behavioral state. A single Mamba forecaster, trained only on next-step spike counts at Neuropixels scale, can deliver both in one forward pass. A lightweight per-session linear head reading the model's predicted rates decodes behavior better than the same linear classifier reading the raw spike counts, under matched temporal context. We test on the Steinmetz visual-discrimination benchmark, which spans 39 sessions, roughly 27,000 neurons, and 1,994 held-out trials. Across three training seeds, Mamba's predicted rates decode mouse choice at 75.7$\pm$0.2% trial vote, roughly 2.3 times chance level, and stimulus side at 66.1$\pm$0.6%, about twice chance. Compared to a matched 500 ms-context linear decoder on the raw spike counts, Mamba wins at trial vote by 4-6 pp on response and 4-6 pp on stimulus side. A session-start calibration block of about 100-150 trials brings the readout within 1-2 pp of asymptote, and the full pipeline fits inside the 50 ms bin budget on workstation-class GPUs typical of tethered chronic Neuropixels recordings.

2605.12992 2026-05-14 q-bio.NC cs.LG

SpikeProphecy: A Large-Scale Benchmark for Autoregressive Neural Population Forecasting

John R. Minnick, Jinghui Geng, Kamran Hussain, Jesus Gonzalez-Ferrer, Ash Robbins, Mohammed A. Mostajo-Radji, David Haussler, Jason K. Eshraghian, Mircea Teodorescu

AI总结 本文提出SpikeProphecy,首个用于真实电生理记录的因果自回归神经元群体预测的大规模基准。研究通过分解群体预测性能指标,分别评估时间保真度、空间模式准确性和幅度不变对齐,揭示了传统单一相关系数所掩盖的关键信息。实验基于105个Neuropixels数据集,对比了七种不同结构的预测模型,发现不同脑区的预测能力存在显著差异,并揭示了在泊松计数域中ANN到SNN迁移的负结果。

Comments 26 pages, 4 figures, 12 tables; submitted to NeurIPS 2026 Datasets and Benchmarks Track; processed dataset at https://huggingface.co/datasets/mysteriousauthor/spikeprophecy-steinmetz (CC-BY-4.0); code at https://github.com/JohnMinnick/SpikeProphecy-A-Large-Scale-Benchmark-for-Autoregressive-Neural-Population-Forecasting

详情
英文摘要

Neural population models, which predict the joint firing of many simultaneously recorded neurons forward in time, are typically evaluated by a single aggregate Pearson correlation $r$ between predicted and actual spike counts, a number that masks critical structure. We argue that how we evaluate spike forecasting matters as much as what we build, and introduce SpikeProphecy, the first large-scale benchmark for causal, autoregressive spike-count forecasting on real electrophysiology recordings. Our core contribution is a population metric decomposition that separates aggregate performance into temporal fidelity, spatial pattern accuracy, and magnitude-invariant alignment. The decomposition surfaces aspects of the underlying data that an aggregate scalar collapses together. We apply the protocol to 105 Neuropixels sessions (Steinmetz 2019 + IBL Repeated Site; ~89,800 neurons) with seven architecture baselines spanning four structural families: four SSMs (three diagonal and one non-diagonal), a Transformer, an LSTM, and a spiking network. The decomposition surfaces a brain-region predictability ranking that reproduces across all seven baselines and survives ANCOVA correction for firing-statistics constraints (region $ΔR^2 = 0.018$ above the firing-statistics covariates). It also exposes a sub-Poisson evaluation floor where rigorous metrics combine with genuine biophysical constraints on regular spike trains, and yields a negative result on KL-on-output-rates distillation for ANN-to-SNN transfer in this Poisson count domain.

2605.12852 2026-05-14 cs.LG q-bio.QM

Multitask Multimodal Fusion with Tabular Foundation Models for Peak and Durability Prediction of Pertussis Booster Response

Divya Sitani

AI总结 该研究旨在同时预测百日咳加强疫苗接种后的免疫反应峰值和持续时间,这两个过程由不同的生物学机制驱动。研究提出了一种多任务多模态融合模型,结合冻结的TabPFN-v2编码器、双标签对比损失、缺失校准的模态丢弃和注意力融合机制,以应对数据模态异质性、缺失值和任务间关联弱的挑战。实验表明,该模型在两个预测任务上均优于传统方法,且结果与免疫学机制一致,揭示了不同模态对峰值和持续时间预测的特异性贡献。

Comments 22 pages, 8 figures, 4 tables. Code available at https://github.com/Divya1205/cmi-pb-multitask

详情
英文摘要

Pertussis booster vaccination produces immune responses that vary widely across individuals in both peak magnitude and long-term durability. These two phases are governed by partly distinct biological compartments:peak reflects acute B-cell activation and antibody secretion, while durability reflects the establishment of long-term humoral memory. Yet most computational models target only one, missing the full boost-and-wane trajectory. Jointly predicting both is non-trivial because the two endpoints are biologically dissociated rather than redundant; samples are small, modalities are heterogeneous with structured missingness, and the two tasks rely on different measurement windows. We propose a multi-task contrastive multimodal fusion architecture combining frozen TabPFN-v2 per-modality encoders, a dual-label supervised contrastive loss that treats two subjects as a positive pair if they agree on the Task 1 label or the Task 2 label, modality dropout calibrated to empirical missingness, and missingness-masked attention fusion. Applied to a curated subset of the CMI-PB pertussis booster dataset (n = 158 subjects, four modalities, 44.9% with at least one modality missing; Spearman r = -0.58 between peak and durability, n = 96), the model achieves test AUROC 0.797 (95% CI [0.621, 0.948]) for peak response and 0.755 (95% CI [0.519, 0.945]) for durability, with both significant under joint label permutation (N = 1000; p = 0.002 and p = 0.045). Across logistic regression, XGBoost, and MLP baselines on raw features and on TabPFN embeddings, the proposed model is the only one whose 95% CIs lie above chance on both tasks simultaneously. Per-modality contribution analyses recover task-specific modality contributions consistent with the underlying immunology: peak prediction is carried by cytokine signatures, while durability is carried by baseline antibody features.

2605.12823 2026-05-14 cs.LG physics.chem-ph physics.comp-ph q-bio.BM

Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics

Sanya Murdeshwar, Sanjit Shashi, Kevin Bachelor, William Noid, Ashwin Lokapally, Razvan Marinescu

AI总结 该研究提出了一种基于Hessian向量积匹配的机器学习粗粒化分子动力学方法,旨在提升粗粒化势能函数对自由能曲率的建模能力。通过引入随机探针向量,该方法在不显式构造Hessian矩阵的情况下,将二阶曲率信息融入粗粒化势能函数中,从而提高了模拟的准确性。实验表明,该方法在多个蛋白质体系中显著优于传统的梯度匹配方法,尤其在慢模动力学指标上表现出更优的性能。

Comments 15 pages, 4 figures, 1 table

详情
英文摘要

Coarse-grained (CG) molecular dynamics enables simulations of atomic systems such as biomolecules at timescales inaccessible to all-atom (AA) methods, but existing CG neural potentials trained via force matching capture only the gradient of the free-energy surface, leaving its curvature unconstrained. We introduce a framework that augments force matching with stochastic Hessian-vector product (HVP) matching, instilling second-order curvature information into CG potentials without constructing the full Hessian. We derive a decomposition of the target CG Hessian into a model-independent projected AA Hessian, precomputed once before training, and a model-dependent covariance correction computed online at negligible cost. We construct an unbiased stochastic estimator of the Hessian-matching objective by using random probe vectors. We evaluate our method by comparing against force matching on a benchmark of nine fast-folding proteins unseen during training. HVP matching outperforms plain force matching on 8 of 9 proteins on slow-mode metrics, with reductions of up to 85% in the Kullback--Leibler divergence between the CG and reference distributions along the slowest collective mode of the largest protein. Our results demonstrate that higher-order physical supervision is a practical path to more accurate and transferable CG potentials for biomolecular simulation.

2605.12773 2026-05-14 q-bio.PE physics.soc-ph

The interplay of network structure and correlated infectious traits in epidemic models

Abhay Gupta, Nicholas W. Landry

AI总结 该研究探讨了网络结构与个体易感性和传播能力相关性在传染病模型中的相互作用。作者构建了一个包含不同子群体的数学模型,每个子群体具有特定的易感性和传播能力联合分布,用于分析社区结构和节点度异质性对疫情传播的影响。通过推导基本再生数的解析表达式并结合数值模拟,验证了模型的有效性,并进一步探讨了其在社会干预策略中的应用意义。

Comments 14 pages, 6 figures

详情
英文摘要

Individual contributions to the spread of an epidemic vary widely due to an individual's location in a social network and their intrinsic ability to spread or contract diseases. While the effect of heterogeneous population structure and infection rates is well-understood, less studied is the impact of population-level covariance between susceptibility and transmissibility, despite empirical evidence showing that both susceptibility and transmission vary across individuals. We introduce a mathematical modeling framework incorporating population subgroups, each with its own joint distribution of susceptibility and transmissibility. We apply this framework to the susceptible-infected-recovered (SIR) model to examine the effect of community structure and degree heterogeneity. We derive analytical expressions for the basic reproduction number, which, when reduced, corroborates prior results and validate these results with numerical simulations. We pair these estimates with simulations exploring first, the temporal dynamics of this model with the homogeneous SIR model, and second, implications for effective social intervention. This analysis provides a foundation for future studies exploring the interplay between structural and dynamical heterogeneity in infectious disease transmission.

2605.12763 2026-05-14 cs.LG math.DS math.OC q-bio.NC

State-Space NTK Collapse Near Bifurcations

James Hazelden, Eric Shea-Brown

AI总结 本文研究了在时间展开任务中,模型通过分岔点时的特征学习问题,提出了基于经验状态空间神经切线核(sNTK)的局部梯度下降理论。研究发现,分岔点不仅主导了学习动态,还简化了学习过程,使得sNTK可近似为一个秩一算子,从而提供了对高维递归系统局部学习几何的解析描述。通过将sNTK分解为与分岔相关的通道和残差通道,论文展示了分岔通道在常见分岔点附近的显著放大效应,并指出低秩自然梯度方法能有效解决分岔附近的学习不稳定性问题。

详情
英文摘要

Rich feature learning in tasks that unfold over time often requires the model to pass through bifurcations, constituting qualitative changes in the underlying model dynamics. We develop a local theory of gradient descent near these transitions through the empirical state-space neural tangent kernel (sNTK). Our central finding is that bifurcations both dominate and simplify learning dynamics: near bifurcations, we can reduce sNTK to a rank-one operator corresponding to learning in a classical normal form system, providing an analytically tractable description of the local learning geometry, even for high-dimensional recurrent systems. Concretely, we give a procedure for decomposing sNTK into bifurcation-relevant and residual channels, showing that near commonly codimension-1 bifurcations the relevant channel is a rank-one operator that is highly amplified. This amplification causes the bifurcation channel to dominate the full sNTK. Thus, bifurcations locally warp the learning landscape, funneling gradient descent into a few critical dynamical directions and making the nearby kernel and loss geometry predictable from classical normal forms. We illustrate this in a student-teacher recurrent neural network: the first learned bifurcation coincides with a sharp collapse in sNTK effective rank and the emergence of a dominant parameter direction whose restricted sNTK closely matches the landscape predicted by the scalar pitchfork normal form. Finally, we show that low-rank natural gradient methods resolve the resulting learning instability near bifurcations with very little overhead over SGD.

2605.12732 2026-05-14 q-bio.NC

Predictive Coding Light+: learning to predict visual sequences with spike timing-dependent plasticity and synaptic delays

Antony W. N'dri, Thomas Barbier, Céline Teulière, Jochen Triesch

AI总结 本文提出了一种名为Predictive Coding Light+(PCL+)的脉冲神经网络架构,用于无监督处理视觉序列。该网络通过学习具有延迟的递归兴奋性连接,实现了对近期信息的短期记忆,从而能够预测未来输入。研究显示,PCL+不仅能重现视觉皮层在序列学习中的经典结果,还能在复杂的手势识别任务中填补缺失的输入,展示了脉冲神经网络在学习和预测未来方面的潜力。

Comments 13 pages, 7 figures, 2 tables, preprint

详情
英文摘要

The ability to predict the future is of great value for biological and artificial cognitive systems alike. However, successfully predicting the future typically requires maintaining a memory of the recent past. It is currently unclear how biological or artificial spiking neural networks can learn to maintain past sensory information to help predict the future. Here we propose Predictive Coding Light+ (PCL+), a spiking neural network architecture for unsupervised sequence processing that learns recurrent excitatory connections with delays to enable short-term retention of information. We show that the PCL+ network reproduces classic findings on sequence learning in visual cortex. Furthermore, it learns to ``fill in'' missing input in a challenging gesture recognition task. Overall, our work shows how spiking neural networks can learn recurrent excitatory connections with delays to maintain a record of the recent past and successfully predict the future.

2605.12706 2026-05-14 cs.LG q-bio.GN

A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data

Ziwei Huang, Zeyuan Song, Paola Sebastiani, Stefano Monti

AI总结 RSNet 是一个开源的 R 软件包,提供了一种基于重采样的框架,用于在高维数据中进行稳健且可解释的网络结构学习,旨在解决小样本量带来的挑战。该框架支持连续和离散混合数据类型的条件高斯贝叶斯网络及部分相关网络的估计,并结合多种重采样策略以适应独立或相关观测。RSNet 通过引入基于图元的拓扑分析,增强了网络结构的可解释性,并首次实现了在稀疏网络中高效构建带符号的图元度向量矩阵,从而支持对高阶网络结构的可扩展分析。

Comments 7 pages, 1 figure

详情
英文摘要

RSNet is an open-source R package that provides a resampling-based framework for robust and interpretable network inference, designed to address the limited-sample-size challenges common in high-dimensional data. It supports both the estimation of partial correlation networks modeled as Gaussian networks and conditional Gaussian Bayesian networks for mixed data types that combine continuous and discrete variables. The framework incorporates multiple resampling strategies, including bootstrap, subsampling, and cluster-based approaches, to accommodate both independent and correlated observations. To enhance interpretability, RSNet integrates graphlet-based topology analysis that captures higher-order connectivity and edge sign information, enabling single-node and subnetwork-level insights. Notably, RSNet is the first R package to efficiently construct signed graphlet degree vector matrices (GDVMs) in near-constant time for sparse networks, providing scalable analysis of higher-order network structure. Collectively, RSNet offers a versatile tool for statistically reliable and interpretable network inference in high-dimensional data.

2605.12662 2026-05-14 cs.LG q-bio.GN

scShapeBench: Discovering geometry from high dimensional scRNAseq data

Andrew J Steindl, João Felipe Rocha, Brian Tshilengi Di Bassinga, Zachary Warren, Matthew Scicluna, César Miguel Valdez Córdova, Shabarni Gupta, Leire Torices, Daniel Neumann, Timothy J. Mann, Ihuan Gunawan, Dhananjay Bhaskar, John G Lock, Christine L Chaffer, Guy Wolf, Smita Krishnaswamy

AI总结 scShapeBench 是一个用于单细胞转录组数据形状检测的基准数据集,旨在自动识别数据中的几何结构,如聚类、轨迹和典型模式,从而辅助选择合适的下游分析流程。该研究引入了 scReebTower 方法,基于扩散几何提取 Reeb 图,实现了可视化与分析流程的自动匹配,并提供了拓扑感知的评估指标。实验表明,scReebTower 在合成和真实数据上均优于现有方法,为单细胞数据的自动化分析提供了重要工具。

详情
英文摘要

High-dimensional point cloud data arise across many scientific domains, especially single-cell biology. The shapes or topologies of these datasets determine the types of information that can be extracted. For example, clustered data supports cell-type identification, trajectory structures support transition analysis, and archetypal structures capture continua of cellular behaviors. Existing analysis pipelines often assume a specific shape. The standard Seurat pipeline combines UMAP visualization with Louvain clustering and therefore assumes clustered data, while tools such as Monocle and SPADE assume tree-like structures, and flow-based models such as MIOFlow and Conditional Flow Matching target trajectories. Choosing which pipeline to apply is therefore often left to bioinformaticians who visually inspect datasets before selecting an analysis strategy. With the rise of agentic AI scientists, automating shape detection is increasingly important for selecting downstream analysis pipelines. To address this problem, we introduce scShapeBench, a benchmark dataset for shape detection containing both synthetic and expert-annotated single-cell datasets. Synthetic datasets are sampled from ground-truth skeleton graphs with controlled variance. Real single-cell datasets are curated from diverse sources and annotated by experts into four categories: clusters, single trajectory, multi-branching, and archetypal. We additionally introduce scReebTower, a baseline method that uses diffusion geometry to extract Reeb graphs and connect visualization with pipeline selection. We provide topology-aware evaluation metrics and compare scReebTower against PAGA and Mapper on synthetic and real data. Our results indicate that scReebTower outperforms existing baselines. Overall, our contributions span benchmarks, evaluation metrics, and a baseline for automated shape detection in single-cell data.

2602.12026 2026-05-14 cs.LG q-bio.QM

Protein Circuit Tracing via Cross-layer Transcoders

Darin Tsui, Kunal Talreja, Daniel Saeedi, Amirali Aghazadeh

AI总结 该研究提出了一种名为ProtoMech的框架,用于揭示蛋白质语言模型(pLMs)中的计算电路,通过跨层转码器学习各层之间的稀疏潜在表示,从而捕捉模型的整体计算流程。该方法应用于ESM2模型后,在蛋白质家族分类和功能预测任务中恢复了82-89%的原始性能,并识别出仅使用不到1%潜在空间却保留高达79%模型精度的压缩电路,揭示了与结构和功能模体的对应关系。该成果为蛋白质功能设计提供了高效且精准的指导,显著优于现有方法。

Comments Accepted into ICML 2026. 32 pages, 17 figures

详情
英文摘要

Protein language models (pLMs) have emerged as powerful predictors of protein structure and function. However, the computational circuits underlying their predictions remain poorly understood. Recent mechanistic interpretability methods decompose pLM representations into interpretable features, but they treat each layer independently and thus fail to capture cross-layer computation, limiting their ability to approximate the full model. We introduce ProtoMech, a framework for discovering computational circuits in pLMs using cross-layer transcoders that learn sparse latent representations jointly across layers to capture the model's full computational circuitry. Applied to the pLM ESM2, ProtoMech recovers 82-89% of the original performance on protein family classification and function prediction tasks. ProtoMech then identifies compressed circuits that use <1% of the latent space while retaining up to 79% of model accuracy, revealing correspondence with structural and functional motifs, including binding, signaling, and stability. Steering along these circuits enables high-fitness protein design, surpassing baseline methods in more than 70% of cases. These results establish ProtoMech as a principled framework for protein circuit tracing.

2602.11618 2026-05-14 cs.LG q-bio.QM

How Well Do Large-Scale Chemical Language Models Transfer to Downstream Tasks?

Tatsuya Sagawa, Ryosuke Kojima

AI总结 本文研究了大规模化学语言模型(CLMs)在下游分子属性预测任务中的迁移性能。通过扩展训练资源(如模型规模、数据集大小和计算量),作者系统评估了预训练损失与下游任务表现之间的关系,发现尽管预训练损失持续下降,下游任务性能提升有限。研究还揭示了预训练指标与实际任务表现之间的差距,并分析了影响迁移效果的任务依赖性失效模式,强调了在模型选择和评估中需考虑下游任务特性的必要性。

详情
英文摘要

Chemical Language Models (CLMs) pre-trained on large scale molecular data are widely used for molecular property prediction. However, the common belief that increasing training resources such as model size, dataset size, and training compute improves both pretraining loss and downstream task performance has not been systematically validated in the chemical domain. In this work, we evaluate this assumption by pretraining CLMs while scaling training resources and measuring transfer performance across diverse molecular property prediction (MPP) tasks. We find that while pretraining loss consistently decreases with increased training resources, downstream task performance shows limited improvement. Moreover, alternative metrics based on the Hessian or loss landscape also fail to estimate downstream performance in CLMs. We further identify conditions under which downstream performance saturates or degrades despite continued improvements in pretraining metrics, and analyze the underlying task dependent failure modes through parameter space visualizations. These results expose a gap between pretraining based evaluation and downstream performance, and emphasize the need for model selection and evaluation strategies that explicitly account for downstream task characteristics.