arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.04525 2026-06-12 cs.CL cs.LG q-bio.GN 版本更新

GENEB: Why Genomic Models Are Hard to Compare

GENEB:为什么基因组模型难以比较

Daria Ledneva, Mikhail Nuridinov, Denis Kuznetsov

AI总结 针对基因组基础模型评估碎片化的问题,提出GENEB基准,通过统一探测协议在100项任务上比较40个模型,揭示模型排名不稳定、规模收益有限等关键发现。

详情
Comments
change first page figure, fix model sizes, add more consistency
AI中文摘要

由于基准碎片化、评估协议不兼容以及任务特定报告,基因组基础模型的进展难以评估。因此,关于模型优越性或通用性的声明往往无法直接比较。我们引入GENEB,这是一个大规模诊断基准,在统一的基于探测的协议下(包括少样本场景),评估来自40个基因组基础模型的冻结表示,涵盖100个任务,跨越13个功能类别。GENEB能够在明确暴露任务级权衡的同时,对模型规模、架构、分词和预训练数据进行受控比较。我们的分析表明,整体排行榜不稳定:模型排名在不同任务类别间变化剧烈,规模仅带来适度且不一致的收益,而架构和预训练对齐常常超过参数数量的影响。这些结果凸显了当前评估实践的局限性,并将GENEB定位为基因组机器学习中原则性比较和类别感知模型选择的参考框架。

英文摘要

Progress in genomic foundation models is difficult to assess due to fragmented benchmarks, incompatible evaluation protocols, and task-specific reporting. As a result, claims of superiority or generality across models are often not directly comparable. We introduce GENEB, a large-scale diagnostic benchmark that evaluates frozen representations from 40 genomic foundation models across 100 tasks spanning 13 functional categories under a unified probing-based protocol, including few-shot regimes. GENEB enables controlled comparison across model scale, architecture, tokenization, and pretraining data while explicitly exposing task-level trade-offs. Our analysis shows that aggregate leaderboards are unstable: model rankings vary sharply across task categories, scale provides only modest and inconsistent gains, and architectural and pretraining alignment frequently outweigh parameter count. These results highlight limitations of current evaluation practices and position GENEB as a reference framework for principled comparison and category-aware model selection in genomic machine learning.

2605.23111 2026-06-12 q-bio.NC 版本更新

Contextual Role Modulates Object Representational Geometry in the Human Brain

情境角色调节人脑中物体的表征几何结构

Julien Dirani, Shankar Chawla, Leila Wehbe, Bradford Z. Mahon

AI总结 本研究结合fMRI与自然电影观看,发现物体作为动作目标时激活顶叶动作网络,其表征按动作可供性组织;作为被动元素时激活枕颞网络,按语义维度组织,表明大脑根据情境角色动态重映射物体表征几何结构。

详情
AI中文摘要

人脑表征物体时既保持跨实例的不变性,又足够灵活以支持不同情境和任务。然而,当同一物体在情境角色间转换时,其表征如何被动态重映射仍不清楚。本研究结合fMRI与自然电影观看,探究同一物体作为场景中的被动元素与作为目标导向动作的目标时,其表征方式。当物体是动作目标时,它们激活了以缘上回和中央后回为中心的顶叶动作网络;而被动物体则招募了参与视觉物体识别的分布式枕颞网络。在各自情境中最强编码物体的网络内,表征几何结构表现出双重分离:目标物体表征按动作可供性和手姿势可供性维度组织,而被动物体表征则与语义维度对齐。此外,视觉表征结构在不同情境下保持不变。在这些情境特异性脑网络之外,表征内容保持情境不变性,表明灵活性和不变性在同一表征系统的不同层次上运作。总之,这些发现展示了物体表征几何结构的神经重映射,其方式依赖于自然场景中物体情境相关性的实时变化。

英文摘要

The human brain represents objects in a way that is both invariant across instances and flexible enough to support different contexts and tasks. Yet it remains unknown how object representations are dynamically remapped as the same object shifts across contextual roles. Using fMRI during naturalistic movie viewing we investigated how the same objects are represented when they are passive scene elements versus targets of goal-directed actions. Action targets engaged a parietal action network centered in the supramarginal and postcentral gyri, while passive objects recruited a distributed occipito-temporal network involved in visual object recognition. Within context-selective networks, representational geometry showed a double dissociation: target objects were organized by action affordance and hand posture affordance dimensions, while passive objects aligned with semantic dimensions. Visual representational structure was invariant to context. Outside these networks, representational content retained invariance, indicating that flexibility and invariance operate at different levels of the same representational system. These findings demonstrate neural remapping of object representations depending on moment-to-moment changes in contextual roles during a naturalistic scene.

2603.02274 2026-06-12 q-bio.QM cs.AI 版本更新

Contextual Invertible World Models: A Neuro-Symbolic Agentic Framework for Colorectal Cancer Drug Response

上下文可逆世界模型:用于结直肠癌药物反应的神经符号智能框架

Christopher Baker, Tianyu Ren, Karen Rafferty, Hui Wang

AI总结 提出上下文可逆世界模型(CIWM),结合机器学习模拟器与大语言模型推理层,通过逆推理进行CRISPR扰动,揭示KRAS突变在5-氟尿嘧啶耐药中的主导作用及PIK3CA修复的意外效应。

详情
AI中文摘要

精准肿瘤学目前受到小N大P悖论的限制,即高维基因组数据丰富但药理学反应样本稀疏。虽然深度学习实现了预测准确性,但它常常无法提供临床采用所需的机制清晰度。我们提出了上下文可逆世界模型(CIWM),这是一个神经符号智能框架,通过将定量机器学习模拟器与大语言模型推理层集成来弥合这一差距。利用在Sanger GDSC数据集(\\( N=83 \\))上严格筛选的高保真数据工程流程,我们从体外伪影中分离出真正的生物信号,为复杂转录组学建立了严格的基线预测相关性(\\( r=0.268 \\))。通过逆推理,我们在结直肠癌景观中进行了计算机CRISPR扰动。该框架自主推翻了经典机制假设,识别出突变KRAS在驱动5-氟尿嘧啶耐药(\\( \Delta=-0.0469 \\))中相对于APC/Wnt轴具有层级优势,并通过映射到MAPK/PI3K网络的“KRAS盾牌”实现。此外,智能层识别出“PIK3CA悖论”,揭示修复PIK3CA通过触发补偿性反馈环过度激活主导的MAPK生存通路,无意中增加了化疗耐药性(\\( \Delta=+0.0085 \\))。

英文摘要

Precision oncology is currently limited by the small-N, large-P paradox, where high-dimensional genomic data is abundant but pharmacological response samples are sparse. While deep learning achieves predictive accuracy, it frequently fails to provide the mechanistic clarity required for clinical adoption. We present the Contextual Invertible World Model (CIWM), a Neuro-Symbolic Agentic Framework that bridges this gap by integrating a quantitative machine learning emulator with a Large Language Model reasoning layer. Utilising a stringently curated, high-fidelity data engineering pipeline on the Sanger GDSC dataset (\( N=83 \)), we isolate true biological signals from in vitro artifacts to establish a rigorous baseline predictive correlation for complex transcriptomics (\( r=0.268 \)). Through Inverse Reasoning, we perform in silico CRISPR perturbations across the colorectal landscape. The framework autonomously overturns classical mechanistic assumptions, identifying a hierarchical dominance of mutant KRAS over the APC/Wnt-axis in driving 5-fluorouracil resistance (\( \Delta=-0.0469 \)) via a "KRAS Shield" mapped to MAPK/PI3K networks. Furthermore, the agentic layer identified a "PIK3CA Paradox", revealing that repairing PIK3CA inadvertently increases chemoresistance (\( \Delta=+0.0085 \)) by triggering a compensatory feedback loop that hyperactivates the dominant MAPK survival pathway.

2604.20782 2026-06-12 q-bio.QM q-bio.BM 版本更新

LAFA: A Framework for Reproducible Longitudinal Assessment of Protein Function Annotation Models

LAFA:可重复的蛋白质功能注释模型纵向评估框架

An Phan, Yanli Wang, Frimpong Boadu, Jianlin Cheng, Predrag Radivojac, Iddo Friedberg

AI总结 提出LAFA服务器,作为蛋白质功能预测方法的持续基准测试系统,通过容器化方法实现动态、可重复的评估,加速方法迭代并支持可重复性。

详情
AI中文摘要

动机:蛋白质功能预测是计算生物学中一项具有挑战性的任务和一个开放性问题。蛋白质功能注释的关键评估(CAFA)是一项三年一次的、社区驱动的倡议,通过延时基准测试实验,为蛋白质功能预测的计算方法提供独立的大规模评估。CAFA在突出高性能方法、促进详细分析和思想交流方面发挥了关键作用。然而,在定期的CAFA挑战之外,没有平台可以持续评估新开发的方法并跟踪随着功能注释积累的性能变化。结果:本文介绍了蛋白质功能注释模型的纵向评估服务器(LAFA),作为蛋白质功能预测方法的持久基准测试系统。LAFA提供对容器化功能预测方法的持续评估,能够在不断演变的真实标签下进行最新且稳健的方法性能比较评估。LAFA加速了方法迭代,支持可重复性,并提供了蛋白质功能预测进展的更动态和细粒度的视图。代码和数据可用性:LAFA可在以下网址获取:此 https URL。详细评估结果可在以下网址找到:此 https URL。

英文摘要

Motivation: Protein function prediction is a challenging task and an open problem in computational biology. The Critical Assessment of protein Function Annotation (CAFA) is a triennial, community-driven initiative that provides an independent, large-scale evaluation of computational methods for protein function prediction through time-delayed benchmarking experiments. CAFA has played a key role in highlighting high-performing methodologies and fostering detailed analysis and exchange of ideas. However, outside the periodic CAFA challenges, there is no platform for the continuous evaluation of newly developed methods and tracking performance as function annotations accumulate. Results: Here we introduce the Longitudinal Assessment of Protein Function Annotation Models server (LAFA) as a persistent benchmarking system for protein function prediction methods. LAFA provides a continuous evaluation of containerized function prediction methods, enabling up-to-date and robust comparative assessment of method performance under evolving ground truth. LAFA accelerates methodological iteration, supports reproducibility, and offers a more dynamic and fine-grained view of progress in protein function prediction. Code and Data Availability: LAFA is available at this https URL. Detailed evaluation results can be found at this https URL

2512.02528 2026-06-12 q-bio.QM 版本更新

Assessment of Simulation-based Inference Methods for Stochastic Compartmental Models in Epidemiological Research

基于模拟的推断方法在流行病学随机房室模型中的评估

Vincent Wieland, Nils Wassmuth, Lorenzo Contento, Martin Kühn, Jan Hasenauer

AI总结 比较伪边际粒子马尔可夫链蒙特卡洛和条件归一化流两种贝叶斯推断方法在三种随机房室模型上的性能,展示其准确鲁棒的推断能力和预测能力。

详情
AI中文摘要

全球大流行,如最近的COVID-19危机,凸显了对能够捕捉疾病传播固有随机性的随机流行病模型的需求。此类模型必须配备参数估计方法,以便生成快速的即时预测和短期预测,为公共卫生决策提供信息。本文比较了两种先进的贝叶斯推断方法:1) 伪边际粒子马尔可夫链蒙特卡洛,使用粒子滤波器获得的无偏似然估计;2) 条件归一化流。我们研究了它们在三种常用房室模型上的性能:经典易感-感染-易感模型、易感-感染-康复模型和双变种易感-暴露-感染-康复模型,并辅以将潜在轨迹映射到经验数据的观测模型。针对随机设置中参数推断的难处理似然挑战,我们的分析强调了这些无似然方法如何提供准确且鲁棒的推断能力。我们的模拟研究结果进一步强调了这些方法在捕捉流行病随机动态方面的有效性,为疫情爆发的控制提供了预测能力。在埃塞俄比亚队列研究上的结果展示了在真实世界噪声和不规则数据采样下的操作鲁棒性。为了促进重用并支持构建最终有助于公共卫生更好决策的流程,我们公开提供了代码和合成数据集。

英文摘要

Global pandemics, such as the recent COVID-19 crisis, highlight the need for stochastic epidemic models that can capture the randomness inherent in the spread of disease. Such models must be accompanied by methods for estimating parameters in order to generate fast nowcasts and short-term forecasts that can inform public health decisions. This paper presents a comparison of two advanced Bayesian inference methods: 1) pseudo-marginal particle Markov chain Monte Carlo, using an unbiased likelihood estimate obtained by Particle Filter (PF), and 2) Conditional Normalizing Flows (CNF). We investigate their performance on three commonly used compartmental models: A classical Susceptible-Infected-Susceptible (SIS), a Susceptible-Infected-Recovered (SIR) model and a two-variant Susceptible-Exposed-Infected-Recovered (SEIR) model, complemented by an observation model that maps latent trajectories to empirical data. Addressing the challenges of intractable likelihoods for parameter inference in stochastic settings, our analysis highlights how these likelihood-free methods provide accurate and robust inference capabilities. The results of our simulation study further underscore the effectiveness of these approaches in capturing the stochastic dynamics of epidemics, providing prediction capabilities for the control of epidemic outbreaks. Results on an Ethiopian cohort study demonstrate operational robustness under real-world noise and irregular data sampling. To facilitate reuse and to enable building pipelines that ultimately contribute to better informed decision making in public health, we make code and synthetic datasets publicly available.

2603.24603 2026-06-12 q-bio.NC cs.AI 版本更新

Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders

融合动态功能连接:结合fMRI信号的幅度和相位识别脑疾病

Jinlong Hu, Jiatong Huang, Zijian Cai

AI总结 提出多尺度融合学习框架MSFL,结合滑动窗口相关和相位同步两种互补的动态功能连接特征,在自闭症和抑郁症数据集上显著优于现有模型。

详情
AI中文摘要

基于静息态功能磁共振成像(fMRI)的动态功能连接(dFC)已广泛应用于脑科学研究。滑动窗口相关(SWC)方法通过计算脑区对信号幅度时间序列之间的相关系数,是构建dFC的常用方法。在本研究中,我们提出了一种集成方法,结合fMRI信号的幅度和相位信息,以提高脑疾病的检测能力。具体而言,我们引入了一个多尺度融合学习框架MSFL,该框架利用来自SWC和相位同步(PS)的两种互补dFC特征。其中,SWC捕获幅度相关性,而PS测量dFC内的相位相干性。我们使用两个公开数据集(ABIDE I和REST-meta-MDD)评估了MSFL在分类自闭症谱系障碍和重度抑郁症方面的有效性。结果表明,MSFL显著优于现有比较模型。此外,我们使用SHAP框架进行了模型解释分析,表明来自SWC和PS的两种dFC特征均有助于检测脑疾病。

英文摘要

Dynamic functional connectivity (dFC) derived from resting-state functional magnetic resonance imaging (fMRI) has been extensively utilized in brain science research. The sliding window correlation (SWC) method is a widely used approach for constructing dFC by computing correlation coefficients between amplitude time series of signals from pairs of brain regions. In this study, we propose an integrated approach that incorporates both amplitude and phase information of fMRI signals to improve the detection of brain disorders. Specifically, we introduce a multi-scale fusion learning framework, namely MSFL, which leverages two complementary dFC features derived from SWC and phase synchronization (PS). Here, SWC captures amplitude correlations, while PS measures phase coherence within dFC. We evaluated the efficacy of MSFL in classifying autism spectrum disorder and major depressive disorder using two publicly available datasets: ABIDE I and REST-meta-MDD, respectively. The results indicate that MSFL significantly outperforms existing comparative models. Moreover, we performed model explanation analysis using the SHAP framework, which showed that both types of dFC features from SWC and PS contribute to detecting brain disorders.

2602.22281 2026-06-12 math.CO q-bio.PE 版本更新

A kernel for the maximum agreement forest problem on multiple binary phylogenetic trees

多个二叉树的最大一致森林问题的核

Steven Kelk, Ruben Meuwese, Leo van Iersel

AI总结 针对多个二叉树的最大一致森林问题,通过改进链约简规则,得到每个树的叶子数为O(t * r * k)的核,其中k为参数,r=min{max{k,3},t+1},这是t>2时的首个核。

详情
Comments
Under revision at journal. Compared to V2: new figures in Lemma 2, extended conclusion, fixed an error in the tightness constructions, several small typos fixed
AI中文摘要

系统发育学中的最大一致森林(MAF)问题输入为同一分类集X上的t≥2个二叉树T,要求将X划分为最少数量的块,使得这些块诱导的子树在所有树中不相交且具有共同拓扑。我们修改了著名的链约简规则,证明在穷举应用约简规则后,每个树的叶子数为O(t * r * k),其中k是自然参数(块数),r=min{max{k,3},t+1}。我们证明了该界适用于无根和有根版本的问题,并证明了公共链被截断的长度r是紧的。我们的结果是t>2情况下MAF的首个核。

英文摘要

The maximum agreement forest (MAF) problem in phylogenetics takes as input a set t >= 2 of binary phylogenetic trees T on the same set of taxa X. It asks for a partition of X into the smallest number of blocks such that the subtrees induced by these blocks are disjoint and have common topology across all the trees in T. We produce a modified version of the well-known chain reduction rule in order to prove that after exhaustive application of reduction rules each tree has O( t * r * k ) leaves, where k is the natural parameter (the number of blocks) and r=min{max{k,3},t+1}}. We prove this bound for both the unrooted and rooted version of the problem, and demonstrate that the bound r, the length to which common chains are truncated, is tight. Our results constitute the first kernels for MAF in the t>2 regime.

2601.10221 2026-06-12 q-bio.NC 版本更新

Cognitive Field Theory of Learning, Inference, and Emergence

学习、推理与涌现的认知场论

Byung Gyu Chae

AI总结 提出一种认知场论,将认知视为由自适应动力学时间尺度的红外组织调控的非平衡集体现象,通过引入时间尺度态密度(TDOS)描述推理、记忆和涌现智能的层级集体动力学。

详情
Comments
41 pages, 3 figures
AI中文摘要

生物和人工系统中的学习、推理、记忆和涌现通常使用不同的理论框架描述,从神经场模型到循环和注意力架构。这里我们发展了一种认知场论,其中认知作为由自适应动力学时间尺度的红外组织调控的集体非平衡现象出现。从具有稳态稳定化和自适应流形几何的随机认知场方程出发,我们表明集体认知动力学由嵌入在高维认知流形中的缓慢弛豫红外模式组织。整合潜在慢记忆区产生延迟自能反馈和非局域记忆核,控制长期上下文持久性和集体认知相干性。我们引入时间尺度态密度(TDOS)作为基本描述符,表征构成推理、记忆和自适应推理基础的集体弛豫模式的分布。学习和自适应持续重组红外TDOS,选择性地稳定支持上下文组织和递归集体动力学的弱阻尼集体区。在临界点附近,红外TDOS通常发展出与缓慢弛豫集体模式积累相关的宽而平坦的结构,产生无标度时间组织和增强的集体相干性。在此框架内,记忆形成、自适应推理和涌现智能作为集体红外动力学组织的层级阶段出现。

英文摘要

Learning, inference, memory, and emergence in biological and artificial systems are often described using disparate theoretical frameworks. Here we develop a cognitive field theory in which cognition is described as a collective nonequilibrium phenomenon governed by the geometry and relaxation spectrum of a learned cognitive manifold. Starting from a stochastic cognitive-field equation on an adaptive Riemannian cognitive manifold, we derive a memory-dressed cognitive field equation incorporating nonlocal memory kernels and retarded self-energy feedback. We show that the local stability structure of learned cognitive geometry generates a spectrum of collective relaxation modes whose distribution is characterized by a time-scale density of states (TDOS). The TDOS provides a fundamental dynamical descriptor of cognition and determines the emergent memory kernel, collective response, and infrared temporal organization of the cognitive field. The accumulation of weakly damped collective modes suppresses the cognitive forgetting gap, enhances collective susceptibility, and drives the system toward a protected near-critical regime characterized by long-time contextual persistence and collective cognitive coherence. The resulting framework provides a unified dynamical description of learning, memory, inference, selfhood, and emergent cognition in terms of the collective organization of a memory-dressed cognitive field.

2511.04417 2026-06-12 q-bio.PE 版本更新

The evolutionary advantage of replacers in the Moran process

Moran过程中替代者的进化优势

Michal Pecho, Josef Tkadlec, Martin A. Nowak

AI总结 研究在Moran过程中引入“替代者”表型,通过分析其固定概率和突变选择平衡,发现替代者能显著提高中性及有害突变体的固定概率,并在固定后有效抵御入侵。

详情
AI中文摘要

进化发生在繁殖个体的群体中。在进化动力学的随机描述中,如Moran过程,个体被随机选择进行出生和死亡。如果同一类型被选中进行两个步骤,那么繁殖事件就被浪费了,因为群体的组成保持不变。这里我们引入一种新的表型,我们称之为替代者。替代者是高效的竞争者。当一个替代者被选择繁殖时,后代将总是替换另一个类型的个体(如果有的话)。我们确定了替代者在充分混合群体和一维格子上的选择优势。我们发现,作为替代者显著提高了中性和有害突变体的固定概率。特别地,一个入侵大小为$N$的充分混合群体的单一中性替代者的固定概率的量级为$1/\sqrt N$,而不是标准的$1/N$。更重要的是,替代者在固定后能更好地抵御入侵。因此,即使作为替代者的表型带来相当大的代价,替代者也能主导突变选择平衡:奇怪的是,对于大群体大小和小突变率,一个成功的替代者的相对繁殖率可以低至$1/e$。

英文摘要

Evolution occurs in populations of reproducing individuals. In stochastic descriptions of evolutionary dynamics, such as the Moran process, individuals are chosen randomly for birth and for death. If the same type is chosen for both steps, then the reproductive event is wasted, because the composition of the population remains unchanged. Here we introduce a new phenotype, which we call a replacer. Replacers are efficient competitors. When a replacer is chosen for reproduction, the offspring will always replace an individual of another type (if available). We determine the selective advantage of replacers in well-mixed populations and on one-dimensional lattices. We find that being a replacer substantially boosts the fixation probability of neutral and deleterious mutants. In particular, fixation probability of a single neutral replacer who invades a well-mixed population of size $N$ is of the order of $1/\sqrt N$ rather than the standard $1/N$. Even more importantly, replacers are much better protected against invasions once they have reached fixation. Therefore, replacers dominate the mutation selection equilibrium even if the phenotype of being a replacer comes at a substantial cost: curiously, for large population size and small mutation rate the relative reproductive rate of a successful replacer can be as low as $1/e$.

2410.01082 2026-06-12 q-bio.TO 版本更新

Comparative CFD modelling of drug and nanocarrier transport in the eye's anterior segment for glaucoma drug delivery via intracameral injection, drug-eluting implant and contact lens

青光眼药物经前房注射、药物洗脱植入物和隐形眼镜递送的眼前段药物和纳米载体输运的CFD比较建模

Nazifa Begum, Robert Dinita, Naman Patel, Umar Pitafi, Cynthia Yu-Wai-Man, Daniel Sebastia-Saez

AI总结 本研究开发CFD框架,比较前房注射、药物洗脱植入物和隐形眼镜三种递送策略的药物输运、滞留和空间分布,以评估青光眼治疗效果。

详情
AI中文摘要

青光眼治疗依赖于将治疗药物有效递送至前房;然而,传统方法如局部给药和前房注射受限于快速清除和低眼内生物利用度。在本研究中,开发了一个计算流体动力学(CFD)框架,用于比较评估三种递送策略的药物输运、滞留和空间分布:前房注射、药物洗脱植入物和通过隐形眼镜的局部递送。

英文摘要

Glaucoma treatment relies on effective delivery of therapeutics to the anterior chamber; however, conventional approaches such as topical administration and intracameral injection are limited by rapid clearance and low intraocular bioavailability. In this study, a Computational Fluid Dynamics (CFD) framework was developed to comparatively evaluate drug transport, retention, and spatial distribution across three delivery strategies: intracameral injection, drug-eluting implants, and topical delivery via contact lens.

2510.00011 2026-06-12 q-bio.NC 版本更新

Robust State-space Reconstruction of Brain Dynamics via Bootstrap Monte Carlo SSA

通过Bootstrap Monte Carlo SSA实现脑动力学的鲁棒状态空间重构

Sir-Lord Wiafe, Carter Hinsley, Vince D. Calhoun

AI总结 针对短时、噪声和粗采样数据,提出Bootstrap Monte Carlo SSA方法,通过统计检验和重采样保留可靠振荡模式,增强确定性并稳定嵌入,在fMRI中改善功能测量可靠性并揭示状态空间差异。

详情
Comments
6 pages, 2 figures, conference
AI中文摘要

从时间序列重构潜在状态空间几何为研究复杂系统的非线性动力学提供了强大途径。延迟坐标嵌入提供了理论基础,但假设记录长且无噪声,这在许多领域不成立。在许多实际领域中,记录短、有噪声且采样粗糙;例如在神经影像中,fMRI还包含自相关背景结构,可能掩盖振荡成分并破坏嵌入的稳定性。我们提出Bootstrap Monte Carlo奇异谱分析(BMC-SSA),将Monte Carlo SSA与Bootstrap稳定性相结合,以保留在重采样数据中具有统计支持且可复现的振荡模式。这产生了强调可靠振荡结构的重构,增强了确定性并稳定了后续嵌入。我们的结果表明,BMC-SSA提高了功能测量的可靠性,并揭示了fMRI中状态空间动力学的差异,为噪声有限信号的鲁棒嵌入提供了一个通用框架。

英文摘要

Reconstructing latent state-space geometry from time series provides a powerful route to studying nonlinear dynamics across complex systems. Delay-coordinate embedding provides the theoretical basis but assumes long, noise-free recordings, which many domains violate. In many real-world domains, recordings are short, noisy, and coarsely sampled; in neuroimaging, for example, fMRI additionally contains autocorrelated background structure that can obscure oscillatory components and destabilize embeddings. We propose bootstrap Monte Carlo singular spectrum analysis (BMC-SSA), which combines Monte Carlo SSA with bootstrap stability to retain oscillatory modes that are statistically supported and reproducible across resampled data. This produces reconstructions that emphasize reliable oscillatory structure, enhancing determinism and stabilizing subsequent embeddings. Our results show that BMC-SSA improves the reliability of functional measures and uncovers differences in state-space dynamics in fMRI, offering a general framework for robust embedding of noisy, finite signals.

2508.14143 2026-06-12 cs.LG q-bio.NC 版本更新

The Urysohn Machine: A Metric-Topological Model of Computation

Urysohn机器:一种度量-拓扑计算模型

Xin Li

AI总结 提出Urysohn机器,一种基于度量分离、前沿结构和收缩的分类计算模型,通过Urysohn三元组和分层构造实现分类复杂度度量与可重用推理。

详情
AI中文摘要

我们引入Urysohn机器,一种面向分类计算的有效模型,其中度量分离、前沿结构和收缩是计算状态的显式部分。其基本对象是Urysohn三元组:一个支撑区域、一个目标划分以及一个存储在可重用度量库中的分离分类器。拓扑基础是有限单纯形设置下的构造性Urysohn实现定理。它通过嵌套多面体区域的二进阶梯构建分离器,并为其前沿配备链级微积分:前沿是循环,层级之间的壳层边界由前沿之差给出。该构造产生两种相关的复杂度度量:决策边界宽度(单个分类器边界的几何度量)和Urysohn宽度(库或实现所表示的总前沿质量)。我们证明了摊销分离定理,该定理表明在显式边界足迹假设下,逼近宽度为的边界达到精度所需的简单基三元组数量与边界宽度成正比,与分辨率成反比。我们还引入了一种对比分离算子,其图割泛函能从采样度量数据中一致地估计决策边界宽度,而其拉普拉斯谱则能证明类组件结构和电导率。最后,我们分析了动态Urysohn阶梯,并证明了四个保证:商塌缩下的可分离性、已提交前沿的稳定性、收缩下的有界容量以及商距离下的可扩展性。这些结果共同给出了分类复杂度、摊销推理和组合重用的度量-拓扑解释,在保留经典可计算性的同时,揭示了纯符号描述所隐藏的几何结构。

英文摘要

We introduce the Urysohn Machine, an effective model of classification-oriented computation in which metric separation, frontier structure, and contraction are explicit parts of the computational state. Its basic object is a \emph{Urysohn Triple}: a support region, a target partition, and a separating classifier stored in a reusable Metric Library. The topological foundation is a constructive Urysohn Realization theorem for finite simplicial settings. It builds separators from dyadic ladders of nested polyhedral regions and equips their frontiers with a chain-level calculus: frontiers are cycles, and shells between levels have boundaries given by differences of frontiers. This construction yields two related complexity measures: decision-boundary width, the geometric measure of a single classifier's boundary, and Urysohn width, the total frontier mass represented by a library or realization. We prove an Amortized Separation Theorem showing that approximating a boundary of width to accuracy requires a number of simple basis triples proportional to boundary width and inversely proportional to resolution, under explicit boundary-footprint assumptions. We also introduce a contrastive separation operator whose graph-cut functional consistently estimates decision-boundary width from sampled metric data, while its Laplacian spectrum certifies class-component structure and conductance. Finally, we analyze the dynamic Urysohn ladder and prove four guarantees: separability under quotient collapse, stability of committed frontiers, bounded capacity under contraction, and scalability with quotient distance. Together, these results give a metric-topological account of classification complexity, amortized inference, and compositional reuse that preserves classical computability while exposing geometric structure hidden by purely symbolic descriptions.

2505.23289 2026-06-12 quant-ph cond-mat.soft physics.bio-ph q-bio.GN 版本更新

Intermediate State Formation of Topologically Associated Chromatin Domains using Quantum Annealing

拓扑关联染色质结构域的中间态形成:基于量子退火的方法

Tobias Kempe, S.M. Ali Tabei, Mohammad H. Ansari

AI总结 利用量子退火模拟表观遗传伊辛模型,高效生成具有拓扑关联结构域特征的染色质构象,揭示一维表观遗传标记与三维折叠的关联机制。

详情
Comments
16 pages, 19 Figs, Scientific Reports, 2026
AI中文摘要

拓扑关联染色质结构域是空间上分离的染色质区域,通过分隔活性和非活性基因组元件来调控转录。实验研究表明,它们的形成与表观遗传标记的局部模式相关,但将一维表观遗传景观与三维染色质折叠联系起来的精确机制仍不清楚。最近的模型将染色质表示为自旋系统,其中核小体被视为离散状态变量,其耦合强度源自基因组和表观遗传数据。由于高度阻挫和密集耦合,经典采样器难以处理这些模型。本文提出了一种量子退火方法,用于高效采样染色质状态,将表观遗传伊辛模型嵌入到D-Wave量子处理器的拓扑结构中。我们的方法不是重建精确的TAD大小分布或绝缘分数,而是再现统计特征,如平均标记发生率和核小体内/间相关性,同时生成表现出TAD样结构基序的构型。这些结果证明了量子退火作为探索染色质结构的一种替代方法,并为表观遗传建模提供了基础。

英文摘要

Topologically Associating Chromatin Domains are spatially distinct chromatin regions that regulate transcription by segregating active and inactive genomic elements. Empirical studies show that their formation correlates with local patterns of epigenetic markers, yet the precise mechanisms linking 1D epigenetic landscapes to 3D chromatin folding remain unclear. Recent models represent chromatin as a spin system, where nucleosomes are treated as discrete-state variables coupled by interaction strengths derived from genomic and epigenetic data. Classical samplers struggle with these models due to high frustration and dense couplings. Here, we present a quantum annealing (QA) approach to efficiently sample chromatin states, embedding an epigenetic Ising model into the topology of D-Wave quantum processors. Rather than reconstructing exact TAD size distributions or insulation scores, our method reproduces statistical features, such as mean marker incidences and intra-/inter-nucleosome correlations, while generating configurations that exhibit TAD-like structural motifs. These results demonstrate QA as an alternative to explore the chromatin architecture and provide a foundation in epigenetic modeling.

2504.00872 2026-06-12 q-bio.QM physics.bio-ph q-bio.CB 版本更新

Bioelectrical interfaces beyond excitable cells: cancer, aging, and gene expression modulation

超越可兴奋细胞的生物电接口:癌症、衰老与基因表达调控

Paolo Cadinu, Matthew Burgess, Catarina Franco Jones, Marzia Iarossi, Manuel Schröter, Nako Nakatsuka, Mustafa B. A. Djamgoz, Gil Gonçalves, Sidahmed Abayzeed, Paola Sanjuán-Alberte, Paula M. Mendes, Frankie J. Rawson, Malavika Nair, Michael Levin, Rosalia Moreddu

AI总结 本文综述了非兴奋细胞中生物电信号在癌症、衰老和基因表达调控中的作用,并探讨了先进生物电接口技术如何为这些领域提供新见解和干预手段。

详情
Comments
19 pages, 3 figures, 1 table
AI中文摘要

生物电导性的研究已从其基于离子通量支撑心脏和神经元兴奋性的经典基础,演变为细胞生理学的多层面调节因子。传统上,探测生命物质中电事件的方法主要集中于动作电位记录。然而,非兴奋细胞中的生物电控制着关键现象,包括发育模式、组织稳态和疾病进展。开创性研究表明内源性生物电在形态发生、伤口愈合、再生和癌症的许多方面发挥作用。早期发现为将生物电视为影响细胞命运、细胞周期进程、分化和衰老的手段奠定了基础。最近,发现肿瘤微环境中膜电位的空间变化与转移潜能相关。同时,在设计用于研究神经网络和心脏功能的先进生物电接口方面取得了重大突破。本视角通过考察这些技术如何在不同操作尺度上促进对非兴奋细胞电事件的新认识,从而最终操纵癌症重编程、抗衰老干预和基因表达调控中的细胞通路,桥接了工程和生物学领域。

英文摘要

The investigation of biological conductivity has evolved from its classical foundation based on ionic fluxes underpinning cardiac and neuronal excitability to a multifaceted regulator of cellular physiology. Traditional approaches for probing electrical events in living matter focused largely on action potentials recording. However, bioelectricity in non-excitable cells governs key phenomena, including developmental patterning, tissue homeostasis, and disease progression. Pioneering studies implicated endogenous bioelectrics in many aspects of morphogenesis, wound healing, regeneration, and cancer. Early findings laid the groundwork for viewing bioelectricity as a means to influence cell fate, cell cycle progression, differentiation, and senescence. More recently, spatial variations in membrane potential within tumor microenvironments were found to correlate with metastatic potential. In parallel, substantial breakthroughs have been achieved in designing advanced bioelectrical interfaces for the study of neuronal networks and cardiac function. This perspective bridges the engineering and biological domains by examining how such technologies might enable new insights into non-excitable cell electrical events at different scales of operation to ultimately manipulate cellular pathways in cancer reprogramming, anti-aging interventions, and gene expression modulation.