arXivDaily arXiv每日学术速递 周一至周五更新
重置
Q-BIO定量生物26
2606.06509 2026-06-08 eess.IV cs.AI cs.LG q-bio.TO 新提交

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

在有限标签下哪些解剖结构重要?用于心脏病理预测的数据高效解剖感知基准

Himanshu Singh

AI总结 针对有限标签和计算资源下的医学影像问题,提出解剖感知基准,通过比较不同解剖结构表示和分类器,发现表示质量比模型复杂度更重要。

详情
Comments
ACCEPTED at ICML 2026 Workshop GlobalSouthML (Seoul, South Korea; PMLR 306, 2026)
AI中文摘要

许多医学影像问题必须在有限标签和受限计算条件下解决,然而性能提升主要来自更具表达力的模型还是对临床有意义解剖结构的更好表示,目前尚不清楚。我们通过一个低数据解剖感知基准来研究这个问题,该基准用于在公共ACDC MRI数据集上进行5类心脏病理预测。利用来自右心室、心肌和左心室的分割衍生患者描述符,我们在线性、核和基于树的分类器上比较了特定解剖结构和多结构表示。我们发现,在有限标签设置下,表示主导复杂度。这些结果表明,在资源受限的医疗环境中,识别和表示最具信息量的解剖结构可能比单纯增加模型复杂度更重要。

英文摘要

Numerous medical imaging problems must be solved under limited labels and constrained compute, yet it remains unclear whether performance gains are driven mainly by more expressive models or by better representation of clinically meaningful anatomy. We study this question through a low-data anatomy-aware benchmark for 5-class cardiac pathology prediction on the public ACDC MRI dataset. Using segmentation-derived patient descriptors from the right ventricle, myocardium, and left ventricle, we compare anatomy-specific and multi-structure representations across linear, kernel, and tree-based classifiers. We find that under limited label settings, representation dominates complexity. These results suggest that in resource-constrained healthcare settings, identifying and representing the most informative anatomy may matter more than the increasing complexity of the model alone.

2606.07372 2026-06-08 q-bio.PE math.DS 新提交

Nullclines, Subnullclines and the Asymptotic and Transient Attractors in Eco-Evolutionary Dynamics

生态进化动力学中的零线、子零线以及渐近和瞬态吸引子

Krzysztof Argasinski, Manjyot Singh Bedi, Mark Broom

AI总结 本文通过分析经典鹰鸽博弈的生态进化动力学,发现频率和密度零线交点决定的稳定与不稳定平衡点由异宿轨道连接,并引入子零线概念,进而考虑环境季节性导致复杂循环行为,子零线作为扰动传播的屏障。

详情
AI中文摘要

在人口统计学框架中,死亡率支付函数描述交互的成本,而生育率支付函数描述其回报。因此,虽然死亡率成本取决于对手的策略,但生育率奖励可能受到密度依赖的幼体补充存活率的影响。这激发了对经典鹰鸽博弈的生态进化动力学的分析。结果表明,由频率和密度零线的交点决定的稳定和不稳定平衡点通过异宿轨道连接,这些轨道吸引附近的轨迹。由此产生的轨迹束导致发现了所谓的子零线(位于频率和密度零线之间的流形),然后它们收敛到稳定不动点。然后通过添加环境季节性(周期性背景死亡率)作为外部因素来扩展初始孤立系统。这导致复杂的循环行为,子零线作为扰动传播的屏障(弹性/抵抗阈值)。因此,从某种意义上说,本文完成并扩展了先前关于具有人口统计支付的博弈的生态进化动力学的工作。

英文摘要

In the demographic framework, mortality payoff function describes the cost of an interaction and fertility payoff function describes its reward. So while mortality cost depends on opponent's strategy, fertility reward can be affected by the density-dependent juvenile recruitment survival. This motivates an analysis of the eco-evolutionary dynamics of the classical Hawk-Dove game. It is shown that the stable and unstable equilibria (determined by the intersections of frequency and density nullclines) are connected by heteroclinic orbits, which attract nearby trajectories. The resulting bundle of trajectories leads to the discovery of the so-called subnullcines (manifolds placed between frequency and density nullcline) before they converge to the stable rest point. The initial isolated system is then extended by adding environmental seasonality (periodic background mortality), which acts as an external factor. This leads to complex cycling behavior and the subnullclines act as barriers to the propagation of the perturbation (resilience/resistance threshold). Thus, in a way, this paper completes, yet extends, previous works on the eco-evolutionary dynamics of games with demographic payoffs.

2606.07336 2026-06-08 q-bio.NC 新提交

Fixed point compositionality via low-rank gluing rules in inhibition-dominated threshold-linear networks

抑制主导阈值线性网络中基于低秩粘合规则的定点组合性

Juliana Londono Alvarez

AI总结 本文研究抑制主导阈值线性网络中结构模块性如何支持功能组合性,通过引入低秩粘合规则,证明全局定点是局部定点的组合,并应用于图网络以扩展定点分解规则。

详情
Comments
39 pages, 18 figures
AI中文摘要

大脑在相对稳定的结构和有限资源上常规地产生高度灵活和复杂的行为。这种能力的一个关键机制是组合性,它允许大脑有效地将复杂任务分解为更简单、可重用的基元。虽然网络模块性在生物和人工网络中常与组合性相关联,但在非线性网络中这种关系的严格数学表征仍然缺乏。在这项工作中,我们正式研究了结构模块性如何支持抑制主导阈值线性网络(TLNs)中的功能组合性。我们引入了一类新颖的模块化网络组装,称为低秩粘合,其中具有任意内部连接的组件子网络通过特定的低秩耦合连接。我们证明了这些网络的全局定点被限制为其组成模块的局部定点的组合。对于更结构化的子类,称为秩-1粘合,我们提供了完整的表征,确定哪些局部定点的组合产生全局定点。我们将这些结果应用于基于图的网络,将定点分解规则从组合阈值线性网络(CTLNs)扩展到更灵活的广义CTLNs(gCTLNs)家族,从而证明这些结构规则比最初假设的更鲁棒。最后,我们展示了这些粘合规则为工程化组合动力学提供了数学上易处理的配方,使得能够构建具有组合大量可预测吸引子库的网络,这些吸引子可以从更简单的组件基元理解,范围从定点组合到组合极限环。

英文摘要

Brains routinely generate highly flexible and complex behaviors on a relatively stable structure and limited resources. A key mechanism underlying this ability is compositionality, which allows the brain to efficiently decompose complex tasks into simpler, reusable primitives. While network modularity has often been linked to compositionality in biological and artificial networks, a rigorous mathematical characterization of this relationship in nonlinear networks is still lacking. In this work, we formally investigate how structural modularity supports functional compositionality in inhibition-dominated threshold-linear networks (TLNs). We introduce a novel class of modular network assembly called low-rank gluings, where component subnetworks with arbitrary internal connectivity are connected via specific low-rank couplings. We prove that the global fixed points of these networks are constrained to be combinations of the local fixed points of their constituent modules. For a more structured subclass, called rank-1 gluings, we provide a complete characterization that determines which combinations of local fixed points yield global ones. We apply these results to graph-based networks, extending fixed point decomposition rules from combinatorial threshold-linear networks (CTLNs) to the more flexible family of generalized CTLNs (gCTLNs), thereby proving that these structural rules are more robust than initially posited. Finally, we demonstrate that these gluing rules provide a mathematically tractable recipe for engineering compositional dynamics, enabling the construction of networks with a combinatorially large repertoire of predictable attractors that can be understood from simpler component motifs, ranging from compositions of fixed points to compositional limit cycles.

2606.07301 2026-06-08 q-bio.QM 新提交

Structure-guided taxonomic placement of divergent RNA viruses with ViraClass

基于结构的RNA病毒分类定位:ViraClass

Sheng Xu, Wenxuan Huang, Shutong Yue, Weiqiang Bai, Shiyang Feng, Xiaohan He, Bo Zhang, Qiantai Feng, Edward C. Holmes, Weifeng Shi, Siqi Sun

AI总结 针对RNA病毒分类中RdRp序列相似性低的问题,提出基于蛋白质结构的ViraClass框架,实现从门到属的层级分类,在深度进化距离上优于序列方法。

详情
AI中文摘要

宏转录组测序扩展了我们对RNA病毒圈的认识,其速度远超新病毒的分类学鉴定。科级以上的分类尤为困难,因为RNA依赖的RNA聚合酶(RdRp)通常是RNA病毒中唯一保留的基因,但在高度分化的病毒中序列相似性极低。这里我们证明,在RdRp一级序列相似性基本消失的进化深度上,RdRp蛋白质结构保留了分类信号,且这些信号的组织方式与当前ICTV层级一致。基于此,我们开发了ViraClass,一个用于RNA病毒分类定位的层级框架,它利用RdRp结构进行从门到属的逐级分类,在置信阈值支持的最深等级停止,并对仍处于现有参考空间之外的病毒进行校准的结构聚类。在随机分割、前瞻性和分类学保留基准测试中,ViraClass优于基于序列和基因组内容的基线方法。最大的提升出现在深度进化距离上,在从参考中保留整个科、目或纲的基准测试中,基于序列的方法失去了大部分信号。在诸如黄病毒科等具有挑战性的边界案例中,ViraClass基于结构的分类定位捕捉到了近期系统发育研究强调的分类边界张力。当应用于大量先前未分类的RdRp序列时,ViraClass将高置信度查询归入现有门,并将剩余序列组织成紧凑的结构组。因此,ViraClass提供了一种可扩展的方法,从大规模病毒发现到层级分类解释,特别是在当前基于序列的流程无法达到的深度进化范围。

英文摘要

Metatranscriptomic sequencing has expanded our knowledge of the RNA virosphere far more rapidly than novel viruses can be taxonomically classified. Taxonomic assignment above the family level is particularly difficult because the RNA-dependent RNA polymerase (RdRp) is often the only gene retained across RNA viruses yet exhibits little sequence similarity among highly divergent viruses. Here we show that RdRp protein structure retains taxonomic signal at evolutionary depths where RdRp primary sequence similarity has largely collapsed, and that the organization of this signal is consistent with the current ICTV hierarchy. Based on this, we developed ViraClass, a hierarchical framework for RNA virus taxonomic placement that uses RdRp structure for rank-by-rank assignment from phylum to genus, stopping at the deepest rank supported by confidence thresholds, and calibrated structural clustering for viruses that remain outside existing reference space. Across random-split, prospective and taxonomic hold-out benchmarks, ViraClass outperforms sequence-based and genome-content baselines. The largest gains emerge at deep evolutionary distances, in benchmarks that withhold entire families, orders or classes from the reference, where sequence-based methods lose most of their signal. In challenging boundary cases such as the Flaviviridae, ViraClass's structure-based placements capture the taxonomic boundary tensions highlighted by recent phylogenetic studies. When applied to a large collection of previously unclassified RdRp sequences, ViraClass places high-confidence queries into existing phyla and organizes the remainder into compact structural groups. ViraClass therefore provides a scalable approach from large-scale virus discovery to hierarchical taxonomic interpretation, particularly at the deep evolutionary ranges that current sequence-based pipelines cannot reach.

2606.06889 2026-06-08 q-bio.GN 新提交

From Genomes to Algorithms: Neural Network Applications for Palimpsest Detection in Medieval Manuscripts

从基因组到算法:中世纪手稿中重写本检测的神经网络应用

James B. Harr, Madelin E. Blong, Tessa Gadomski, Kelly A. Meiklejohn, William E. Gundling

AI总结 本研究通过非破坏性采样和测序,结合机器学习分类器(逻辑回归和神经网络),评估重写本制备对DNA完整性的影响,并探索计算方法在识别重写本中的应用。

详情
AI中文摘要

生物密码学(Biocodicology)研究手稿中保存的生物信息,为将羊皮纸视为文本和生物制品提供了新机会。本研究采用非破坏性采样,从14世纪手稿Ms. Codex 1629(包含单次使用和重写本页)中分离并测序线粒体基因组(mtGenomes)。我们旨在评估重写本制备(包括化学清洗)是否损害DNA完整性,以及计算方法是否有助于识别重复使用的羊皮纸。DNA测序显示,单次使用和重写本羊皮纸均保留了足够的mtGenomes用于分析,基因组覆盖度和深度无显著差异。为了评估计算生物学在手稿研究中的潜力,我们实施了机器学习分类器,包括逻辑回归和神经网络,以区分重写本和单次使用页。模型实现了高精度,但对少数类重写本的召回率较低,反映了数据集不平衡。虽然需要更多来自重写本的古代mtGenome样本并进行进一步测试,但本研究证明了整合分子生物学和神经网络如何为重写本检测提供新方法,并强调了数据科学在生物密码学中不断演变的作用。

英文摘要

Biocodicology, the study of biological information preserved in manuscripts, offers new opportunities to examine parchment as both a textual and biological artefact. This study applies non-destructive sampling to isolate and sequence mitochondrial genomes (mtGenomes) from a 14th-century manuscript, Ms. Codex 1629, which contains both single-use and palimpsested folios. We sought to evaluate whether palimpsest preparation, including chemical washing, compromised DNA integrity and whether computational methods could aid in identifying reused parchment. DNA sequencing revealed that both single-use and palimpsested parchments retained sufficient mtGenomes for analysis, with no significant differences in genome coverage or depth. To assess the potential of computational biology in manuscript studies, we implemented machine learning classifiers, including logistic regression and neural networks, to distinguish palimpsests from single-use folios. Models achieved high precision but exhibited reduced recall for the minority palimpsest class, reflecting dataset imbalance. While additional ancient mtGenome samples from palimpsest are required and further testing is needed, this study demonstrates how integrating molecular biology and neural networks highlights new approaches for palimpsest detection and underscores the evolving role of data science in biocodicology.

2606.06749 2026-06-08 q-bio.QM 新提交

Deterministic access to global viral sequence data enables robust agentic scientific discovery

确定性访问全球病毒序列数据实现稳健的自主科学发现

Ferdous Nasri, Sarah Gurev, Patrick Varilly, Krithik Ramesh, Nuala A. O'Leary, Jonah Cool, Bernhard Y. Renard, Pardis C. Sabeti, Laura Luebbert

AI总结 针对基于大语言模型的科学代理在病毒数据检索中的高错误率问题,提出确定性查询框架gget virus,通过形式化NCBI Virus过滤流程、元数据约束和结构化记录检索,将检索准确率提升至90%以上,并减少98%数据传输。

详情
AI中文摘要

公共病毒基因组资源,如美国国家生物技术信息中心(NCBI)病毒数据库,是疫情应对、进化分析、疫苗设计和基因组监测的核心。然而,许多高价值检索工作流程仍针对交互式使用而非确定性、可重复的程序化接口进行优化。这给基于大语言模型(LLM)的科学代理带来了挑战,其中元数据解释、过滤逻辑或检索中的错误可能传播到不正确的数据集中。为了评估自主病毒数据检索,我们构建了VirBench,这是一个手动策划的基准测试,包含120个查询,涵盖多种病原体、分类级别和元数据过滤器。当包括Biomni、Claude、GPT和Edison Analysis在内的自主AI系统在没有专用检索层的情况下执行这些查询时,性能差异很大:平均准确率从Claude Sonnet 4的16.9%到GPT-5.5的91.3%,较新的前沿模型虽有进步,但残留错误仍会产生严重后果。为了解决这个问题,我们构建了gget virus,一个确定性查询框架,将NCBI Virus风格的过滤形式化为可重复的程序化系统。通过分阶段检索、在序列下载前应用元数据约束以及检索结构化的GenBank记录,gget virus在高容量查询中减少了超过98%的数据传输,同时保持了精确匹配语义。指示自主AI系统使用gget virus后,所有评估系统的准确率至少提高到90.0%,GPT-5.5最高达到99.7%,响应稳定性提高到0.92-1.00,错误幅度减小,并且通常减少了运行时间和工具调用。总之,这项工作确立了确定性数据访问作为可靠自主科学的关键基础设施,并为稳健的人类和AI驱动的病毒基因组学工作流程提供了可重复的检索层。

英文摘要

Public viral genome resources such as the National Center for Biotechnology Information (NCBI) Virus database are central to outbreak response, evolutionary analysis, vaccine design, and genomic surveillance. Yet many high-value retrieval workflows remain optimized for interactive use rather than deterministic, reproducible programmatic interfaces. This creates a challenge for Large Language Model (LLM)-based scientific agents, where errors in metadata interpretation, filtering logic, or retrieval can propagate into incorrect datasets. To evaluate agentic viral data retrieval, we built VirBench, a manually curated benchmark of 120 queries spanning diverse pathogens, taxonomic levels, and metadata filters. When autonomous AI systems, including Biomni, Claude, GPT, and Edison Analysis, were tasked with these queries without a dedicated retrieval layer, performance varied widely: mean accuracy ranged from 16.9% for Claude Sonnet 4 to 91.3% for GPT-5.5, with newer frontier models showing progress but residual errors remaining consequential. To address this, we built gget virus, a deterministic query framework that formalizes NCBI Virus-style filtering as a reproducible programmatic system. By staging retrieval, applying metadata constraints before sequence download, and retrieving structured GenBank records, gget virus reduces data transfer by more than 98% for high-volume queries while preserving exact-match semantics. Instructing autonomous AI systems to use gget virus increased accuracy to at least 90.0% across all evaluated systems and up to 99.7% for GPT-5.5, improved response stability to 0.92-1.00, reduced error magnitude, and generally decreased runtime and tool calls. Together, this work establishes deterministic data access as critical infrastructure for reliable agentic science and provides a reproducible retrieval layer for robust human- and AI-driven viral genomics workflows.

2606.06562 2026-06-08 q-bio.QM 新提交

Iterative AI-guided optimisation of selective triple-drug combinations for breast cancer

AI引导的选择性三联药物组合用于乳腺癌的迭代优化

Oghenejokpeme Orhobor, Abbi Abdel-Rehim, Emma Tate, Holly X. Smith, Elizabeth Bourne, Ross J. Collins, Larisa N. Soldatova, Ross D. King

AI总结 提出AI引导的QSAR驱动迭代优化框架,结合机器学习与自动化实验筛选,闭环发现选择性三联药物组合,在MCF7乳腺癌细胞中快速富集高效且选择性高的方案。

详情
Comments
4 figures, 3 tables
AI中文摘要

个性化癌症治疗旨在根据个体肿瘤特征定制治疗方案,然而肿瘤异质性和适应性耐药性持续限制临床疗效。药物组合通过同时靶向多条通路提供克服耐药性的策略,但其合理设计受限于巨大的组合搜索空间和实验成本。本文提出一个AI引导的、QSAR驱动的迭代优化框架,将机器学习与自动化实验筛选相结合,实现选择性多药疗法的闭环发现。从初始随机筛选开始,系统迭代预测、测试和优化针对MCF7乳腺癌细胞的三药组合。引入非致瘤性MCF10A细胞使得能够显式优化肿瘤选择性疗效,优先选择最大化杀伤癌细胞同时保护健康细胞的方案。经过连续迭代,该框架快速富集高选择性、高效能的组合,同时保持化学和机制多样性,避免收敛于狭窄解空间。通过持续从实验反馈中学习,该方法高效探索数百万种组合,识别出一小组经过验证的、肿瘤选择性方案。这些结果建立了AI驱动的闭环优化高阶药物组合的可扩展概念验证,展示了计算与实验的迭代整合如何实现精准肿瘤学中自适应且可能个性化的治疗设计。

英文摘要

Personalised cancer therapy aims to tailor treatment to individual tumour profiles, yet tumour heterogeneity and adaptive resistance continue to limit clinical efficacy. Drug combinations offer a strategy to overcome resistance by simultaneously targeting multiple pathways, but their rational design is constrained by the vast combinatorial search space and experimental cost. Here, we present an AI-guided, QSAR-driven iterative optimisation framework that integrates machine learning with automated experimental screening to enable closed-loop discovery of selective multi-drug therapies. Starting from an initial random screen, the system iteratively predicts, tests, and refines three-drug combinations targeting MCF7 breast cancer cells. Incorporation of non-tumorigenic MCF10A cells enables explicit optimisation of tumour-selective efficacy, prioritising regimens that maximise cancer cell killing while sparing healthy cells. Across successive iterations, the framework rapidly enriched for highly selective, high-efficacy combinations, while maintaining chemical and mechanistic diversity and avoiding convergence on a narrow solution space. By continuously learning from experimental feedback, the approach efficiently navigates millions of combinations to identify a small set of validated, tumour-selective regimens. These results establish a scalable proof-of-concept for AI-driven, closed-loop optimisation of higher-order drug combinations, demonstrating how iterative integration of computation and experimentation can enable adaptive and potentially personalised therapeutic design in precision oncology.

2606.06537 2026-06-08 q-bio.QM cs.CV eess.IV 新提交

DSU-Net: An Attention-Enhanced Dense Skip U-Net for Breast Lesion Segmentation in Mammographic Images

DSU-Net:用于乳腺X线图像中乳腺病变分割的注意力增强密集跳跃U-Net

Reza Bozorgpour, Mohammadreza Soltany Sadrabadi

AI总结 提出DSU-Net,通过密集跳跃连接和注意力机制改进特征传播与边界描绘,在CBIS-DDSM数据集上实现高精度乳腺病变分割。

详情
AI中文摘要

乳腺癌仍然是全球女性癌症相关死亡的主要原因之一,因此早期检测对于有效治疗至关重要。乳腺X线摄影是主要的筛查方式;然而,可疑病变的准确勾画仍然具有挑战性,且存在观察者间差异。自动分割方法可以通过提供一致且高效的病变定位来辅助放射科医生。本研究提出了DSU-Net,一种用于乳腺X线图像中自动乳腺病变分割的注意力增强密集跳跃U-Net架构。该框架集成了密集跳跃连接和注意力机制,以改进特征传播、保留空间信息并增强病变边界描绘。实验使用了乳腺摄影筛查数字数据库的精选乳腺成像子集(CBIS-DDSM)。为了解决严重的前景-背景不平衡问题,训练中采用了结合Dice损失、焦点损失和二元交叉熵损失的复合损失函数。所提模型在验证数据集上实现了0.9421的Dice相似系数、0.8905的交并比、0.9711的准确率和0.9878的AUC-ROC。定性评估显示了对不同大小和形态病变的准确勾画,而定量结果证实了病变与背景区域之间的稳健区分。这些发现表明,DSU-Net在乳腺X线图像中提供了准确可靠的乳腺病变分割,并突出了注意力引导深度学习在计算机辅助乳腺癌筛查和诊断中的潜力。

英文摘要

Breast cancer remains one of the leading causes of cancer-related mortality among women worldwide, making early detection essential for effective treatment. Mammography is the primary screening modality; however, accurate delineation of suspicious lesions remains challenging and subject to inter-observer variability. Automated segmentation methods can assist radiologists by providing consistent and efficient lesion localization. This study presents DSU-Net, an attention-enhanced Dense Skip U-Net architecture for automated breast lesion segmentation in mammographic images. The proposed framework integrates dense skip connections and attention mechanisms to improve feature propagation, preserve spatial information, and enhance lesion boundary delineation. Experiments were conducted using the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM). To address severe foreground-background imbalance, a composite loss function combining Dice loss, focal loss, and binary cross-entropy loss was employed during training. The proposed model achieved a Dice Similarity Coefficient of 0.9421, an Intersection over Union of 0.8905, an accuracy of 0.9711, and an AUC-ROC of 0.9878 on the validation dataset. Qualitative evaluation demonstrated accurate delineation of lesions with varying sizes and morphologies, while quantitative results confirmed robust discrimination between lesion and background regions. These findings demonstrate that DSU-Net provides accurate and reliable breast lesion segmentation in mammographic images and highlights the potential of attention-guided deep learning for computer-aided breast cancer screening and diagnosis.

2606.06516 2026-06-08 q-bio.QM cs.LG 新提交

Probabilistic learning to perform pre-onset individualised prediction of disease severity: application to Veno Occlusive Disease

概率学习用于疾病严重程度的发病前个体化预测:在静脉闭塞性疾病中的应用

Dalia Chakrabarty, Kane Warrior, Chuqiao Zhang, Akash Bhojgaria, Joydeep Chakrabartty

AI总结 提出一种新的概率监督学习方法,利用数字孪生和概率逆学习,在骨髓移植前自动预测静脉闭塞性疾病(VOD)的严重程度评分,辅助医生制定治疗方案。

详情
AI中文摘要

我们提出了一种新的概率监督学习方法,能够对预期患者疾病发展的严重程度进行可靠、自动且早期的个体化预测。通过考虑预期患者的数字孪生(DT),在移植前预测静脉闭塞性疾病(VOD)的严重程度评分来展示预测能力,该评分参数化了患者在接受骨髓移植后VOD发展的严重程度。通过将移植前变量与严重程度评分变量之间的关系建模为(随机)函数,该函数被视为适当选择的随机过程的样本函数,从而学习这种关系。该基础过程的参数使用训练数据集学习,该数据集由回顾性患者队列的实时演变生成,随后通过预期患者评分的概率逆学习来扩充该训练数据集的大小。扩充后的训练集允许学习在移植前阶段自动预测VOD严重程度评分的函数,该评分表征了物理患者在其独特移植前状态下的DT。该评分随后反馈给真实预期患者,作为其移植后VOD发展的严重程度。这样的评分允许治疗血液肿瘤学家决定治疗方案,在本例中简化为决定是否使用去纤维蛋白多核苷酸治疗患者。开发了一个AI工具来执行这种自动预测,医生输入表征预期患者DT的移植前状态数据。

英文摘要

We advance a new probabilistic supervised learning approach that permits reliable, automated, and early individualised prediction of the severity with which a disease will develop in a prospective patient. The prediction capacity is illustrated via the pre-transplant prediction of the score of severity of Veno Occlusive Disease (or VOD) in the digital twin (DT) of the considered prospective patient, where this score parametrises the severity with which VOD will develop in this patient, after they undergo their Bone Marrow Transplant. The learning of the relationship between the pre-transplant variables, and a severity score variable is undertaken by modelling this relationship as a (random) function that is treated as a sample function of an adequately-chosen stochastic process. The parameters of this underlying process are learnt using a training dataset that is generated using the real-time evolution of retrospective patients in a cohort, with this training dataset subsequently augmented in size by a probabilistic inverse learning of the score of prospective patients. The augmented training set, then permits the learning of the function that capacitates - at the pre-transplant stage - automated prediction of the score of the severity of VOD that characterises the DT of a physical patient in their unique pre-transplant state. This score is subsequently fed back to the real prospective patient as the severity with which VOD will develop in them, after this patient undergoes their transplant. Such a score then permits the treating Haematologist-Oncologists to decide on the treatment regimen, which in this illustration reduces to deciding on treating the patient with Defibrotide. An AI facility is developed to undertake such automated prediction, with the physician inputting the data on the pre-transplant state that characterises the DT of the prospective patient under consideration.

2606.07258 2026-06-08 cs.CE q-bio.QM 新提交

CaliPPer: quantifying, predicting and improving AI model performance for binding prediction

CaliPPer:量化、预测和改进AI模型在结合预测中的性能

Jian-Qing Zheng, Hantao Lou, Zinan Yin, Sam Farrar, Yuze Zhou, Elie Antoun, Xiangxi Wang, Xuetao Cao, Tao Dong

AI总结 提出CaliPPer框架,通过多链样本到域距离和距离感知贝叶斯重校准,在三个分辨率上量化、预测和改进AI模型在结合预测中的性能,显著提升新表位、抗原变体和化学骨架上的发现率。

详情
AI中文摘要

结合预测模型加速了治疗性抗体和TCR的发现,但其在新数据集上的性能不可预测,常导致低发现率。密度比方法(PAPE, M-CBPE)为二分类提供无标签性能估计,但其假设和仅聚合输出限制了在新表位、抗原变体和化学骨架上的结合预测。这里我们提出CaliPPer(性能校准与预测),一个事后框架,将多链样本到域距离(S2DD)与距离感知贝叶斯重校准配对,在三个分辨率上运行:泛化性分数、聚合性能预测和每个样本置信度。在十个模型、八个架构和两个免疫受体域上,CaliPPer达到了距离-性能相关性$|r|=0.80\text{--}0.92$,预测AUROC/AP/F1的平均绝对误差为$0.008\text{--}0.070$,并在未见表位/变体上将AUROC提升高达$+0.20$。回顾性地应用于五个已发表的TCR、BCR、MHC-肽和小分子研究,CaliPPer在所有五个研究中提高了真实发现率(例如,$0/5 \to 3/5$确认的新抗原),在计算预测和实验验证之间提供了一个分诊层。

英文摘要

Binding prediction models accelerate therapeutic antibody and TCR discovery, but their performance on new datasets is unpredictable, often leading to low discovery rates. Density-ratio methods (PAPE, M-CBPE) provide label-free performance estimation for binary classification, but their assumptions and aggregate-only outputs limit binding prediction on neoepitopes, antigen variants and chemical scaffolds. Here we present CaliPPer (Calibration and Prediction of Performance), a post-hoc framework pairing a multi-chain Sample-to-Domain Distance (S2DD) with distance-aware Bayesian recalibration, operating at three resolutions: generalisability score, aggregate performance prediction, and per-sample confidence. Across ten models, eight architectures and two immune-receptor domains, CaliPPer attains distance--performance correlations $|r|=0.80\text{--}0.92$, predicts AUROC/AP/F1 with mean absolute errors $0.008\text{--}0.070$, and improves AUROC by up to $+0.20$ on unseen epitopes/variants. Applied retrospectively to five published TCR, BCR, MHC--peptide and small-molecule studies, CaliPPer raises true discovery rates in all five (e.g.\ $0/5 \to 3/5$ confirmed neoantigens), providing a triage layer between computational prediction and experimental validation.

2606.07181 2026-06-08 cs.LG cs.AI q-bio.MN 新提交

RETROSPECT: RETROsynthesis via Sequential Prediction, and Chemically Transformed-ranking

RETROSPECT: 通过序列预测和化学变换排序的逆合成

Raja Sekhar Pappala, Shreyas Vinaya Sathyanarayana, Ronit Kumar Choudhary, Arjun Verma, Deepak Warrier

AI总结 提出RETROSPECT系统,将单步逆合成分解为候选生成和重排序,结合ChemAlign Transformer生成器和LambdaMART重排序器,在USPTO-50K上实现55.00% top-1准确率。

详情
Comments
Accepted at the AI for Science workshop (ICML 2026)
AI中文摘要

单步逆合成既需要准确的首位建议,也需要足够丰富的候选列表以供下游选择。我们将其研究为提议-选择分解。我们的系统RETROSPECT结合了一个单一的Transformer提议模型(我们称之为ChemAlign Transformer)和一个基于结构、反应模板、上游分数以及可选的DFT衍生描述符的LambdaMART重排序器。生成器使用混合根对齐和随机SMILES增强、预层归一化、绑定嵌入、指数移动平均权重以及可微的原子平衡辅助损失进行训练。在包含5,007个反应的完整USPTO-50K测试集上,生成器达到55.00%的top-1和86.18%的top-10精确匹配准确率,top-1有效率为99.86%。在用于重排序的合并候选池基准上(包含5,007个测试产物,每个产物约111个候选),基于结构特征集训练的LambdaMART模型达到59.4%的top-1和0.7171的平均倒数排名。特征消融实验表明,上游提议分数和模板频率统计提供了大部分重排序信号,而DFT和反应中心DFT特征提供的增益较小且不一致。这些结果支持逆合成的模块化观点:更强的单模型提议和学习候选选择是互补的,并且提议模型可以作为集成系统(如RetroChimera (Maziarz et al., 2024))的即插即用组件。

英文摘要

Single-step retrosynthesis needs both accurate first-ranked suggestions and candidate lists that are rich enough for downstream selection. We study this as a proposal-selection decomposition. Our system, RETROSPECT, combines a single Transformer proposal model, which we call the ChemAlign Transformer, with a LambdaMART reranker over structural, reaction-template, upstream-score, and optional DFT-derived descriptors. The generator is trained with hybrid root-aligned and random SMILES augmentation, Pre-LayerNorm, tied embeddings, exponential moving average weights, and a differentiable atom-balance auxiliary loss. On the full USPTO-50K test set of 5,007 reactions, the generator reaches 55.00% top-1 and 86.18% top-10 exact-match accuracy with 99.86% top-1 validity. On the merged candidate-pool benchmark used for reranking, which contains 5,007 test products and about 111 candidates per product, a LambdaMART model trained on the structural feature set reaches 59.4% top-1 with 0.7171 mean reciprocal rank. Feature ablations show that upstream proposal score and template-frequency statistics provide most of the reranking signal, while DFT and reaction-center DFT features provide smaller and less consistent gains. These results support a modular view of retrosynthesis: stronger single-model proposal and learned candidate selection are complementary, and the proposal model can serve as a drop-in component for ensemble systems such as RetroChimera (Maziarz et al., 2024)

2606.06834 2026-06-08 cs.CL q-bio.GN 新提交

The Dark Regulome: Disentangling Predictability from Regulation in Genomic Foundation Models

暗调控组:从基因组基础模型中分离可预测性与调控性

Chahat Baranwal, Aadtya Baranwal, Lakshya Nitin Tandon

发表机构 * IIT Jodhpur University of Central Florida Northeastern University

AI总结 本研究提出残差化-置换诊断方法,从基因组基础模型的计算机诱变评分中分离序列可预测性与调控信号,揭示10kb近端调控边界,并验证跨架构分解可区分可预测性层与调控输出层,为暗基因组调控研究提供通用工具。

详情
AI中文摘要

高级别胶质瘤通过与神经元的突触整合到神经回路中,这引发了一个问题:哪些非编码元件塑造了肿瘤细胞中的突触形成基因表达。写在暗基因组上的调控程序,我们称之为$\textit{暗调控组}$,是探索的自然底物,而序列基础模型通过计算机诱变(ISM)提供了一条零样本路径;然而,基于似然的评分与局部序列可预测性存在同义反复的耦合,使得调控解释不充分。在三个架构不同的基础模型(Caduceus-Ph、HyenaDNA、Enformer)和92个胶质瘤相关位点的30,448个暗基因组元件上,我们引入了一种残差化-置换诊断方法,以分离由可预测性驱动和由调控驱动的RIS方差。一个尖锐的10kb近端调控边界在我们应用的所有控制中仍然存在,但LM衍生的元件类别层次结构则不然:一个六特征线性基线在AUC=0.985时匹配Caduceus的十分位数成员。跨架构分解清晰地分离了序列可预测性层(两个语言模型共同对长且可预测的转座元件进行排序)和调控输出层(只有Enformer保留了区分cCRE的信号),两个前100列表之间完全没有重叠。然后,保守性、脑cis-eQTL和STRING-PPI交叉检查锚定了哪些生物学信息得以保留:所有三个模型的前100个元件在匹配脑eQTL方面每个模型富集了3.3倍($p_\mathrm{emp} < 5\times 10^{-3}$),而一个诱人的转座元件调控层和一个显著的NRXN1+NLGN1蛋白对收敛在构建适当的置换检验后均未通过。我们将该诊断方法作为任何基于ISM的调控研究的通用方法工具提供。

英文摘要

High-grade gliomas integrate into neural circuits through functional synapses with neurons, raising the question of which noncoding elements shape synaptogenic gene expression in tumor cells. The regulatory program written across the dark genome, what we call the $\textit{dark regulome}$, is the natural substrate to probe, and sequence foundation models offer a zero-shot route through in-silico mutagenesis (ISM); yet likelihood-based scoring is tautologically coupled to local sequence predictability, leaving the regulatory interpretation underdetermined. Across three architecturally distinct foundation models (Caduceus-Ph, HyenaDNA, Enformer) and 30,448 dark genome elements at 92 glioma-relevant loci, we introduce a residualization-and-permutation diagnostic that separates predictability-driven from regulation-driven RIS variance. A sharp 10kb proximal-regulatory horizon survives every control we apply, but the LM-derived element-class hierarchy does not: a six-feature linear baseline matches Caduceus top-decile membership at AUC $= 0.985$. Cross-architecture decomposition cleanly separates a sequence-predictability layer (the two language models co-rank long well-predicted transposable elements) from a regulatory-output layer (Enformer alone retains residual cCRE-discriminative signal), with literally zero overlap between the two top-100 lists. Conservation, brain cis-eQTL, and STRING-PPI cross-checks then anchor what biology survives: top-100 elements across all three models are $3.3\times$ enriched per model for matching brain eQTLs ($p_\mathrm{emp} < 5\times 10^{-3}$), while a tempting transposable-element regulatory layer and a striking NRXN1+NLGN1 protein-pair convergence both fail proper permutation tests once those tests are constructed. We deliver the diagnostic as a general methodological tool for any ISM-based regulatory study.

2606.06811 2026-06-08 cs.PF q-bio.GN 新提交

Dependencies and Dataflow in Seed-Filter-Extend Pipelines

种子-过滤-扩展流水线中的依赖关系与数据流

Shiv Sundram

AI总结 针对基因组比对中种子-过滤-扩展流水线的串行依赖和局部对齐不规则性,通过综合LASTZ等四种方法,优化跨区域全局流水线以加速端到端比对。

详情
AI中文摘要

比较基因组对于发现突变、追踪进化谱系和推进跨物种基因组学至关重要。从根本上讲,这归结为一个O(n^2)的字符串匹配动态规划(DP)问题,这一挑战推动了数十年的性能研究。然而,对于跨越数百万到数十亿碱基对的基因组,执行严格的O(n^2) DP算法在计算上是不可行的。因此,现代比对器依赖全局启发式方法来识别物种间数千个候选相似区域。不幸的是,这些方法受到复杂串行依赖关系的困扰。一旦识别出候选区域,流水线执行局部DP比对,这引入了其自身的非平凡启发式和不规则数据依赖。虽然并行化密集的二维DP是一个研究充分的问题,但加速这种端到端流水线更具挑战性。跨候选区域并行化以及将不规则、充满启发式的局部比对卸载到现代硬件(如GPU)仍然是一个主要障碍。在这项工作中,我们通过优化跨区域的全局流水线来克服这些串行瓶颈。我们从四篇论文中汲取灵感:LASTZ、SegAlign、Darwin-WGA和SNAP,综合每篇论文的发现以指导优化,我们在LASTZ中要么原型化要么直接实现这些优化。

英文摘要

Comparing genomes is critical for discovering mutations, tracking evolutionary lineages, and advancing cross-species genomics. Fundamentally, this reduces to an O(n^2) string-matching dynamic programming (DP) problem, a challenge that has driven decades of performance research. However, executing a strict O(n^2) DP algorithm is computationally intractable for genomes spanning millions to billions of base pairs. Consequently, modern aligners rely on global heuristics to identify thousands of candidate similarity regions between species. Unfortunately, these methods are burdened by complex serial dependencies. Once candidate regions are identified, the pipeline executes localized DP alignments, which introduce their own non-trivial heuristics and irregular data dependencies. While parallelizing dense, two-dimensional DP is a well-studied problem, accelerating this end-to-end pipeline is significantly more challenging. Parallelizing across candidate regions and offloading irregular, heuristic-laden local alignments to modern hardware (such as GPUs) remains a major hurdle. In this work, we address the challenge of overcoming these serial bottlenecks by optimizing the global pipeline across regions. We take inspiration from four papers: LASTZ, SegAlign, Darwin-WGA, and SNAP, synthesizing findings across each to inform optimizations, which we either prototype or implement directly in LASTZ.

2606.06717 2026-06-08 cs.LG cs.AI q-bio.BM q-bio.QM 新提交

ShallowBench: Benchmarking Generative Drug Design Models on Shallow-Pocket Targets

ShallowBench: 浅口袋靶标上的生成式药物设计模型基准测试

Saket Reddy, Shiwei Liu

发表机构 * University of Illinois - Urbana-Champaign

AI总结 提出ShallowBench基准,包含5780个浅口袋靶标,用于评估生成式药物设计模型在低凹度界面上的性能,揭示现有模型预测结合亲和力较弱的问题。

详情
AI中文摘要

虽然生成式AI模型在基于结构的药物设计中已展现出显著成功,但它们主要依赖深结合口袋,难以对具有挑战性的低口袋性靶标(如历史上“不可成药”的肿瘤靶标KRAS和MYC)采样有效配体。为弥补这一空白,我们引入了ShallowBench,这是一个从CrossDocked2020中提取的包含5780个浅口袋靶标的严格精选基准。通过计算Alpha Shape“盖子”体积与底层蛋白质原子体素体积之间的差异,我们成功分离出低凹度靶标,同时确保足够的结合表面积。评估多种最先进的生成模型显示,在这些低凹度界面上预测的结合亲和力较弱。因此,ShallowBench为生成生物学模型提供了一个严格的基准,并强调了需要能够应对这些具有挑战性靶标的新型架构创新或损失函数。

英文摘要

While generative AI models have demonstrated remarkable success in structure-based drug design, they predominantly rely on deep binding pockets and struggle to sample effective ligands for challenging low-pocketability targets, such as the historically "undruggable" oncology targets KRAS and MYC. To address this gap, we introduce ShallowBench, a strictly curated benchmark of 5,780 shallow-pocket targets extracted from CrossDocked2020. By computing the difference between an Alpha Shape "lid" volume and the underlying protein atom voxel volume, we successfully isolated targets with low concavity while ensuring sufficient surface area for binding. Evaluating various state-of-the-art generative models reveals weaker predicted binding affinity on these low-concavity interfaces. ShallowBench therefore provides a rigorous benchmark for generative biology models and highlights the necessity of new architectural innovations or loss functions capable of navigating these challenging targets.

2606.06647 2026-06-08 cs.LG q-bio.NC 新提交

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

脑电图基础模型中的身份陷阱:一项诊断性审计

Jun-You Lin, Ying Choon Wu, Tzyy-Ping Jung

发表机构 * National Yang Ming Chiao Tung University University of California, San Diego

AI总结 提出FMScope协议,通过方差分解、主题轴擦除等五种诊断方法,揭示EEG基础模型在受试者分离交叉验证中可能依赖受试者身份特征而非临床生物标志物,并验证了该陷阱的普遍性及可移除性。

详情
Comments
28 pages, 6 figures, 8 tables. Code available at https://github.com/Jimmy110101013/fmscope
AI中文摘要

目标。EEG基础模型(FMs)在临床静息态EEG上报告了强准确性。然而,在受试者分离交叉验证下的高准确性仍然模棱两可:它可能反映真实的临床生物标志物,也可能反映与标签相关的受试者身份特征。我们将其命名为身份陷阱,并询问是否可以在微调之前从表示层面进行诊断。方法。我们提出FMScope,一种冻结表示协议,包含五种诊断方法:方差分解、受试者轴擦除、非周期性1/f消融、逐层标签探测和受试者内方向一致性。我们将其应用于三个预训练FM(LaBraM、CBraMod、REVE),在四个数据集上采用2x2布局:标签的受试者关系 x 是否存在共识的跨受试者EEG标志物。主要结果。(i) 身份陷阱是普遍存在的:在12/12对中,冻结的受试者方差是随机零假设的13-89倍,在微调下所有12对均上升(+10至+63个百分点)。这种主导性是一个可移除的线性轴:在标签在受试者内变化的情况下,擦除它可改善标签解码(主要单元中+6至+12个百分点;外部队列中+4至+27个百分点)。(ii) 非周期性1/f是受试者身份的一个载体:移除它会使LaBraM和CBraMod上的受试者探测下降9-19个百分点。REVE在无可测量的非周期性依赖下饱和了受试者身份。(iii) 微调仅在具有文献确立的跨受试者标志物的单元中放大标签方差。意义。身份陷阱是捷径学习的一个物理基础实例:偏好线索具有可测量的生理成分,仅靠受试者分离分割无法排除它。FMScope将反映生物标志物的增益与反映受试者身份的增益分开。

英文摘要

Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or subject-identity features that correlate with the label. We name this the Identity Trap and ask whether it can be diagnosed at the representation level before fine-tuning. Approach. We propose FMScope, a frozen-representation protocol packaging five diagnostics: variance decomposition, subject-axis erasure, aperiodic 1/f ablation, layer-wise label probing, and within-subject direction consistency. We apply it to three pretrained FMs (LaBraM, CBraMod, REVE) across four datasets in a 2x2 layout: subject relation of label x presence of a consensus cross-subject EEG marker. Main results. (i) The Identity Trap is universal: frozen subject-variance is 13-89x a random null in 12/12 pairs, rising in all 12 under fine-tuning (+10 to +63 pp). This dominance is a removable linear axis: erasing it improves label decoding where the label varies within subject (+6 to +12 pp in primary cells; +4 to +27 pp across external cohorts). (ii) Aperiodic 1/f is one subject carrier: removing it drops the subject probe by 9-19 pp on LaBraM and CBraMod. REVE saturates subject identity without measurable aperiodic dependence. (iii) Fine-tuning amplifies label-variance only in cells with a literature-established cross-subject marker. Significance. The Identity Trap is a physically-grounded instance of shortcut learning: the preferred cue has a measurable physiological component, and subject-disjoint splitting alone cannot rule it out. FMScope separates gains reflecting a biological marker from those reflecting subject identity.

2606.07413 2026-06-08 math.OC q-bio.PE 新提交

A Nine-Compartment Nonlinear Epidemic Model with Spline-Based Identification of Time-Varying Transmission and Vaccination Dynamics: Application to the COVID-19 Third Wave in Italy

具有基于样条的时间变化传播和疫苗接种动力学的九室非线性流行病模型:应用于意大利第三波COVID-19疫情

Lokman Rachid Melhani, Antonino Sferlazza, Lars Grüne, Dominique Persano Adorno, Filippo D'Ippolito, Omar Enzo Santangelo, Ivan Marchese, Antonino Lo Burgio, Alberto Firenze

AI总结 提出九室非线性流行病模型,包含两种病毒株、超级传播者、部分疫苗免疫和住院动态,使用PCHIP参数化识别时变传播和接种率,校准后拟合优度高,并分析了模型适定性、基本再生数和稳定性。

详情
Comments
23 pages, 4 figures. Submitted to the SIAM Journal on Applied Mathematics
AI中文摘要

我们开发了一个九室非线性流行病模型,包含两种共循环病毒株(原始株I1和Alpha变种B.1.1.7 I2,其传播性高出43-90%,c2=1.5)、超级传播者亚群、具有衰减的部分疫苗诱导免疫,以及具有差异化死亡率的明确住院动态。传播率和疫苗接种率被视为时变控制输入,并通过分段三次埃尔米特插值多项式(PCHIP)控制节点参数化从意大利COVID-19数据(2021年1月至5月)中识别,将校准简化为具有单调性和箱约束的十四变量序列二次规划(SQP)问题。参数自举(n=1000)量化了参数不确定性。校准模型对活跃住院人数达到R^2=0.966,累计死亡人数R^2=0.987,累计疫苗接种人数R^2=0.999。分析建立了适定性、闭式基本再生数以及无病平衡点的局部和全局稳定性。L无穷逼近误差界表明,随着节点间距趋近于零,PCHIP控制节点参数化以O(h^2)的速率收敛到真实时变参数。通过Fisher信息矩阵建立了局部可辨识性和噪声稳定性界。一个充分的阈值条件证明,当有效再生数持续低于1时,在时变抑制下流行病会衰减。敏感性分析一致地将医院吞吐量参数排在传播率之上,为反应性遏制措施无法阻止已经由预先存在的潜伏病毒载量驱动的住院高峰这一观察提供了数学基础。

英文摘要

We develop a nine-compartment nonlinear epidemic model incorporating two co-circulating viral strains (ancestral I1 and the Alpha variant B.1.1.7 I2, which is 43-90% more transmissible, c2=1.5), a super-spreader subpopulation, partial vaccine-induced immunity with waning, and explicit hospitalization dynamics with differentiated mortality. Transmission and vaccination rates are treated as time-varying control inputs and identified from Italian COVID-19 data (January-May 2021) via a Piecewise Cubic Hermite Interpolating Polynomial (PCHIP) control-node parameterization, reducing calibration to a fourteen-variable Sequential Quadratic Programming (SQP) problem with monotonicity and box constraints. A parametric bootstrap (n=1000) quantifies parameter uncertainty. The calibrated model achieves R^2=0.966 for active hospitalizations, R^2=0.987 for cumulative fatalities, and R^2=0.999 for cumulative vaccinations. Well-posedness, the basic reproduction number in closed form, and local and global stability of the disease-free equilibrium are established analytically. An L-infinity approximation error bound shows that the PCHIP control-node parameterization converges to the true time-varying parameters at rate O(h^2) as the node spacing vanishes. Local identifiability and a noise stability bound are established via the Fisher information matrix. A sufficient threshold condition proves epidemic decay under time-varying suppression whenever the effective reproduction number remains persistently below one. Sensitivity analyses consistently rank hospital throughput parameters above the transmission rate, providing a mathematical basis for the observation that reactive containment measures cannot prevent a hospitalization peak already driven by the pre-existing latent viral load.

2606.04525 2026-06-08 cs.CL cs.LG q-bio.GN 版本更新

GENEB: Why Genomic Models Are Hard to Compare

GENEB:为什么基因组模型难以比较

Daria Ledneva, Mikhail Nuridinov, Denis Kuznetsov

AI总结 针对基因组基础模型评估碎片化的问题,提出GENEB基准,通过统一探测协议在100项任务上比较40个模型,揭示模型排名不稳定、规模收益有限等关键发现。

详情
Comments
make some figures bigger in appendix; fix caduceus metadata
AI中文摘要

由于基准碎片化、评估协议不兼容以及任务特定报告,基因组基础模型的进展难以评估。因此,关于模型优越性或通用性的声明往往无法直接比较。我们引入GENEB,这是一个大规模诊断基准,在统一的基于探测的协议下(包括少样本场景),评估来自40个基因组基础模型的冻结表示,涵盖100个任务,跨越13个功能类别。GENEB能够在明确暴露任务级权衡的同时,对模型规模、架构、分词和预训练数据进行受控比较。我们的分析表明,整体排行榜不稳定:模型排名在不同任务类别间变化剧烈,规模仅带来适度且不一致的收益,而架构和预训练对齐常常超过参数数量的影响。这些结果凸显了当前评估实践的局限性,并将GENEB定位为基因组机器学习中原则性比较和类别感知模型选择的参考框架。

英文摘要

Progress in genomic foundation models is difficult to assess due to fragmented benchmarks, incompatible evaluation protocols, and task-specific reporting. As a result, claims of superiority or generality across models are often not directly comparable. We introduce GENEB, a large-scale diagnostic benchmark that evaluates frozen representations from 40 genomic foundation models across 100 tasks spanning 13 functional categories under a unified probing-based protocol, including few-shot regimes. GENEB enables controlled comparison across model scale, architecture, tokenization, and pretraining data while explicitly exposing task-level trade-offs. Our analysis shows that aggregate leaderboards are unstable: model rankings vary sharply across task categories, scale provides only modest and inconsistent gains, and architectural and pretraining alignment frequently outweigh parameter count. These results highlight limitations of current evaluation practices and position GENEB as a reference framework for principled comparison and category-aware model selection in genomic machine learning.

2606.02462 2026-06-08 q-bio.BM 版本更新

APLSuite: An Integrated Suite for CD4+ T Cell Epitope Prediction via Antigen Processing Likelihood

APLSuite:通过抗原加工可能性进行CD4+ T细胞表位预测的集成套件

Jiarui Li, Marco K. Carbullido, Jai Bansal, Samuel J. Landry, Ramgopal R. Mettu

AI总结 提出APLSuite,一个集成抗原加工可能性(APL)算法的轻量级软件套件,通过GPU加速实现快速CD4+ T细胞表位预测,弥补现有方法忽视抗原加工作用的不足。

详情
Comments
Application Note; The source code for this work is available at: https://github.com/Jiarui0923/APL The project page for this work is available at: https://tulane-mettu-landry-lab.github.io/blogs/APLSuite/
AI中文摘要

计算表位预测是探索和理解CD4+ T细胞介导的免疫反应(适应性免疫的关键方面)的重要工具。虽然现有计算方法主要关注监督学习方法,但它们常常忽视抗原加工在决定结合特异性中的关键作用。为了解决这一局限性,我们团队开发了抗原加工可能性(APL)算法,该算法整合了晶体学B因子、溶剂可及表面积(SASA)、氢交换保护因子(COREX)和序列熵。在本文中,我们介绍了APLSuite,一个全面且轻量级的软件套件,旨在简化基于APL的表位预测。APLSuite集成了分布式RESTful API服务、用于数据聚合和处理的Python客户端、用于高效表位计算的数据科学工具,以及面向非编码用户的用户友好型图形用户界面。它提供了一个无缝且高效的APL计算和表位预测流程,可在几分钟内通过GPU加速完成,这是现有工具尚未实现的。这个灵活且可扩展的软件套件可部署在桌面和云环境中,提供引导式和可定制的工作流程,以满足免疫学研究和免疫疗法开发中的多样化研究需求。

英文摘要

Computational epitope prediction is a critical tool for exploring and understanding CD4+ T cell-mediated immune responses, a key aspect of adaptive immunity. While existing computational methods primarily focus on supervised learning approaches, they often overlook the essential role of antigen processing in determining binding specificity. To address this limitation, our group developed Antigen Processing Likelihood (APL), an algorithm that integrates crystallographic B-factor, solvent accessible surface area (SASA), hydrogen exchange protection factors (COREX), and sequence entropy. In this paper we introduce APLSuite, a comprehensive and lightweight software suite designed to streamline APL-based epitope prediction. APLSuite integrates distributed RESTful API services, a Python client for data aggregation and processing, a data science tool for efficient epitope computation, and a user-friendly graphical user interface for non-coding users. It provides a seamless and efficient pipeline for APL calculation and epitope prediction that can be finished in minutes with GPU-acceleration, which has not been implemented by existed tools. This flexible and extensible software suite is deployable on desktop and cloud environments, offering both guided and customizable workflows to meet diverse research needs in immunology research and immunotherapy development. (The project page for this work is available at: https://tulane-mettu-landry-lab.github.io/blogs/APLSuite/)

2605.31071 2026-06-08 cs.DS cs.CC q-bio.PE 版本更新

Tree Containment Parameterized by Scanwidth

参数化扫描宽度的树包含问题

Leo van Iersel, Mark Jones, Mathias Weller

AI总结 本文研究树包含问题在扫描宽度参数下的算法复杂度,提出时间复杂度为 $O(4^{k + k\log{k}} n + nm^2)$ 的参数化算法,并证明在指数时间假设下不存在 $2^{o(c\log{c})} n^{O(1)}$ 时间的算法。

详情
AI中文摘要

树包含是数学系统发育学中的一个核心决策问题,询问给定的有根系统发育树是否可以嵌入(“显示于”)给定的有根系统发育网络中。虽然该问题对于一般网络是NP完全的,但许多算法进展依赖于捕捉网络“树状”程度的结构参数。在本文中,我们研究了结构参数扫描宽度下的树包含问题,扫描宽度是一种有向宽度度量,推广了衡量系统发育网络树状性的流行参数。我们首先提出一个参数化算法,该算法在 $O(4^{k + k\log{k}} n + nm^2)$ 时间内解决问题,其中 $n$ 和 $m$ 是网络中的节点数和弧数,$k$ 是给定树扩展的宽度。作为该上界的补充,我们在指数时间假设(ETH)下证明了匹配的下界,表明即使在二元输入上,也不存在运行时间为 $2^{o(c\log{c})} n^{O(1)}$ 的树包含算法,其中 $c$ 是输入网络的有向割宽,它上界于扫描宽度 $k$。

英文摘要

TREE CONTAINMENT is a central decision problem in mathematical phylogenetics, asking whether a given rooted phylogenetic tree is embeddable in ("displayed by") a given rooted phylogenetic network. While the problem is NP-complete for general networks, many algorithmic advances have relied on structural parameters that capture how "tree-like" a network is. In this paper we investigate TREE CONTAINMENT under the structural parameter scanwidth, a directed width measure generalizing popular parameters measuring tree-likeness of phylogenetic networks. We first present a parameterized algorithm that solves the problem in $O(4^{k + k\log{k}} n + nm^2)$ time, where $n$ and $m$ are the numbers of nodes and arcs in the network and $k$ is the width of a given tree-extension. Complementing this upper bound, we prove a matching lower bound under the Exponential-Time Hypothesis (ETH), showing that there is no algorithm for TREE CONTAINMENT that runs in $2^{o(c\log{c})} n^{O(1)}$ time, even on binary inputs, where $c$ is the directed cutwidth of the input network, which upper-bounds the scanwidth $k$.

2602.09997 2026-06-08 cs.SI q-bio.NC q-bio.PE 版本更新

Popularity Feedback Constrains Innovation in Cultural Markets

流行反馈制约文化市场中的创新

Lucas Gautheron, Raja Marjieh, Dalton C. Conley, Seth Frey, Hannah Rubin, Mike D. Schneider, Ofer Tchernichovski, Nori Jacoby

AI总结 研究探讨了流行反馈如何通过影响选择和创造阶段,减少文化多样性并减缓创新速度,揭示了流行反馈对文化创新方向的双重影响。

详情
AI中文摘要

现实中的创造性过程,从艺术到科学,都依赖于选择与创造之间的社会反馈循环。然而,流行反馈对集体创造力的影响仍不明确。我们通过大规模在线实验研究了流行评级如何影响文化动态,参与者(N=1,008)反复从演变市场中选择图像并制作自身修改。结果显示,暴露图像的流行度会减少文化多样性并减缓创新,延迟审美改进。流行反馈改变了选择和创造阶段。在选择阶段,流行信息触发累积优势,参与者更倾向于基于流行图像进行创作,减少多样性。在创造阶段,参与者做出的改变更不具颠覆性,更可能扩展现有视觉模式。文化市场的反馈循环不仅塑造了选择,还直接或间接地影响了文化创新的形式和方向。

英文摘要

Real-world creative processes ranging from art to science rely on social feedback-loops between selection and creation. Yet, the effects of popularity feedback on collective creativity remain poorly understood. We investigate how popularity ratings influence cultural dynamics in a large-scale online experiment where participants ($N = 1\,008$) iteratively \textit{select} images from evolving markets and \textit{produce} their own modifications. Results show that exposing the popularity of images reduces cultural diversity and slows innovation, delaying aesthetic improvements. Popularity feedback is associated with changes to both selection and creative stages. During selection, popularity information triggers cumulative advantage, with participants preferentially building upon popular images, reducing diversity. During creation, participants make less disruptive changes, and are more likely to expand existing visual patterns. Feedback loops in cultural markets thus not only shape selection, but also, directly or indirectly, the form and direction of cultural innovation.

2602.00163 2026-06-08 cs.CV q-bio.NC 版本更新

Deep Learning Pose Estimation for Multi-Label Recognition of Combined Hyperkinetic Movement Disorders

基于深度学习姿态估计的联合多动性运动障碍多标签识别

Laura Cif, Diane Demailly, Gabriella A. Horvàth, Juan Dario Ortigoza Escobar, Nathalie Dorison, Mayté Castro Jiménez, Cécile A. Hubsch, Thomas Wirth, Gun-Marie Hariz, Sophie Huby, Morgan Dornadic, Zohra Souei, Muhammad Mushhood Ur Rehman, Simone Hemm, Mehdi Boulayme, Eduardo M. Moraud, Jocelyne Bloch, Xavier Vasques

AI总结 针对多动性运动障碍(HMD)临床识别主观性强、表型重叠的问题,提出基于姿态的机器学习框架,从常规临床视频提取关键点时间序列并计算多维度运动学特征,实现多标签分类。

详情
AI中文摘要

多动性运动障碍(HMD),如肌张力障碍、震颤、舞蹈症、肌阵挛和抽动症,是儿童和成人中致残的运动表现。其波动性、间歇性和频繁共存的表达阻碍了临床识别和纵向监测,这些在很大程度上仍然是主观的且易受评估者间变异影响。目前仍缺乏客观且可扩展的方法来从常规临床视频中区分重叠的HMD表型。在此,我们开发了一个基于姿态的机器学习框架,将常规门诊视频转化为解剖学上有意义的关键点时间序列,并计算涵盖统计、时间、频谱以及高阶不规则性-复杂性特征的运动学描述符。

英文摘要

Hyperkinetic movement disorders (HMDs) such as dystonia, tremor, chorea, myoclonus, and tics are disabling motor manifestations across childhood and adulthood. Their fluctuating, intermittent, and frequently co-occurring expressions hinder clinical recognition and longitudinal monitoring, which remain largely subjective and vulnerable to inter-rater variability. Objective and scalable methods to distinguish overlapping HMD phenotypes from routine clinical videos are still lacking. Here, we developed a pose-based machine-learning framework that converts standard outpatient videos into anatomically meaningful keypoint time series and computes kinematic descriptors spanning statistical, temporal, spectral, and higher-order irregularity-complexity features.

2507.23146 2026-06-08 q-bio.QM 版本更新

Lightweight Language Models are Prone to Reasoning Errors for Complex Computational Phenotyping Tasks

轻量语言模型在复杂计算表型任务中易出现推理错误

Sarah Pungitore, Shashank Yadav, David Maughan, Vignesh Subbian

AI总结 研究探讨了轻量语言模型在复杂计算表型任务中的推理错误问题,通过扩展PHEONA框架评估了不同模型的推理准确性,并发现轻量推理模型在提示修改后表现显著提升。

详情
AI中文摘要

尽管计算表型是核心信息学活动,但因其手动数据审查而耗时。我们先前评估了LLM在计算表型任务中的能力,发现其在单疗法表型分类上表现良好,但在多疗法表型上表现不佳。为理解这些复杂任务的问题,我们扩展了PHEONA框架,以包含评估错误推理的方法。我们评估了两种轻量非推理LLM(Mistral Small 24 billion和Phi-4 14 billion)和一种轻量推理LLM(Qwen-distilled DeepSeek-r1 32 billion)在有无提示修改情况下的响应准确性及忠实性错误。在无提示修改实验中,所有模型均存在两种错误。在评估提示修改后准确性影响的实验中,Mistral的总体准确性提升最高,相比DeepSeek和Phi。由于推理错误在所有模型中普遍存在,我们对PHEONA的增强提供了关键支持,为LLM评估和复杂任务的推理错误提供了证据。尽管推理错误的见解有助于提示优化,但深入理解LLM推理错误发生原因可能需要进一步发展和细化可解释性方法。

英文摘要

Although computational phenotyping is a central informatics activity with resulting cohorts supporting a wide variety of applications, it is time-intensive because of manual data review. We previously assessed the ability of LLMs to perform computational phenotyping tasks using computable phenotypes for ARF respiratory support therapies. They successfully performed concept classification and classification of single-therapy phenotypes but underperformed on multi-therapy phenotypes. To understand issues with these complex tasks, we expanded PHEONA, a generalizable framework for evaluation of LLMs, to include methods specifically for evaluating faulty reasoning. We assessed the responses of two lightweight non-reasoning LLMs (Mistral Small 24 billion and Phi-4 14 billion) and one lightweight reasoning LLM (Qwen-distilled DeepSeek-r1 32 billion) both with and without prompt modifications to identify explanation correctness and unfaithfulness errors for phenotyping. For experiments without prompt modifications, both errors were present across all models. For experiments assessing accuracy impact after prompt modifications, Mistral had the highest overall accuracy impact when compared to DeepSeek and Phi. Since reasoning errors were ubiquitous across models, our enhancement of PHEONA to include a component for assessing faulty reasoning provides critical support for LLM evaluation and evidence for reasoning errors for complex tasks. While insights from reasoning errors can help prompt refinement, a deeper understanding of why LLM reasoning errors occur will likely require further development and refinement of interpretability methods.

2505.06718 2026-06-08 q-bio.OT 版本更新

Understanding nature's selection of genetic languages

理解自然界选择的遗传语言

Apoorva D. Patel

AI总结 本文探讨了生物体分子生物学中两种通用遗传语言的最优编码,通过Grover算法验证其在进化中的最优性,挑战在于证明其在活体中的执行方式。

详情
Journal ref
BioSystems 250 (2025), Article 105428
Comments
7 pages. Resubmitted in this category after arXiv Moderation Support accepted the appeal (v2) Final published version
AI中文摘要

所有生物体在其分子生物学机器中使用两种通用遗传语言,一种包含四个核苷酸碱基,另一种包含二十种氨基酸。它们可以被视为完成特定任务(如DNA/RNA的复制/转录和多肽链的翻译)的最优编码。这些任务通过互补的核苷酸碱基配对选择所需的字母,从细胞中的分子集合中进行。计算机科学范式为此过程是数据库搜索;各种算法可根据所需查询次数进行构造和比较。基于振荡波动力学的Grover搜索算法完美匹配搜索遗传字母所需的查询次数,并比最佳布尔搜索算法(即二进制树搜索)更高效。这一结果强烈表明,通用遗传语言通过进化被选择为完成其任务的最优字母,而非历史偶然。突出的挑战是证明Grover搜索算法如何在活体中被生物体执行。

英文摘要

All living organisms use two universal genetic languages in their molecular biology machinery, one containing four nucleotide bases in its alphabet, and the other containing twenty amino acids in its alphabet. They can be understood as the optimal encodings of genetic information for the tasks they carry out, i.e. replication/transcription for DNA/RNA and translation for polypeptide chains. These tasks select needed letters of the alphabet by complementary nucleotide base-pairing, from a collection of molecules in the cell. The computer science paradigm for this process is database search; various algorithms for it can be constructed and compared according to number of attempts (or queries) they need to make to find the correct nucleotide base-pairing. Grover's search algorithm based on oscillatory wave dynamics perfectly fits the number of queries needed to search the genetic alphabets, and it is more efficient than the best Boolean search algorithm (i.e. binary tree search) that needs a larger number of queries. This result strongly suggests that the universal genetic languages have been selected by evolution as the optimal alphabets for the tasks they carry out, and are not an accident of history. The outstanding challenge is to demonstrate how Grover's search algorithm would be executed in vivo by the living organisms.

2505.12286 2026-06-08 physics.soc-ph q-bio.OT 版本更新

Modeling hepatitis D virus kinetics during bulevirtide monotherapy: challenges and solutions

在单用布卢维瑞德治疗期间对乙型肝炎病毒动力学建模:挑战与解决方案

Adquate Mhlanga, Louis Shekhtman, Ashish Goyal, Elisabetta Degasperi, Maria Paola Anolli, Sara Colonia Uceda Renteria, Dana Sambarino, Marta Borghi, Riccardo Perbellini, Floriana Facchetti, Annapaola Callegaro, Scott J. Cotler, Pietro Lampertico, Harel Dahari

AI总结 本文探讨了在单用布卢维瑞德治疗期间对乙型肝炎病毒动力学建模的挑战与解决方案,发现现有模型无法准确预测病毒动态,需引入目标细胞动力学以提高预测准确性。

详情
AI中文摘要

布卢维瑞德(BLV)最近被批准用于治疗慢性乙型肝炎病毒(HDV)感染,被视为最严重的病毒性肝炎感染。理论表明,考虑自由病毒和感染细胞但不包括目标细胞动态(历史上称为两方程模型)的模型,仅能预测抗病毒药物单相病毒下降。本文研究了最近发表的两方程类型模型,利用非线性混合效应建模(NLME)分析接受BLV单药治疗的HDV患者在96周内的临床数据。发现(i)尽管模型参数的相对标准误差(RSE)<50%表明参数估计精确,但拟合未能再现大多数患者观察到的非单相HDV动力学模式,导致错误预测达到理论治愈边界(即整个患者体外细胞液中少于1个病毒颗粒)所需治疗时间。(ii)该模型无法解释病毒突破。(iii)该模型错误预测治疗停止后病毒载量保持不变。最后,我们证明,引入目标细胞动态可解释治疗期间的单相病毒下降以及非单相HDV下降模式,如双相、平坦部分反应和病毒突破。引入目标细胞动态还预测BLV停止后病毒反弹,如临床研究中观察到的。

英文摘要

The entry inhibitor Bulevirtide (BLV) was recently approved in Europe for treatment of chronic hepatitis D virus (HDV) infection, which is considered the most severe viral hepatitis infection. Theory indicates that models that account for free virus and infected cells, but do not include target cell dynamics (historically called the two-equation model) are limited to predicting a monophasic viral decline for antiviral agents that act only to block viral entry/infection. We investigated herein a recently published two-equation type model against clinical data obtained from patients with HDV treated with BLV monotherapy for up to 96 weeks using non-linear mixed effects modelling (NLME). We found that (i) although the model parameters had a relative standard error (RSE) <50\% suggesting that they were 'precisely estimated', the fits failed to reproduce the non-monophasic HDV kinetic patterns observed in most patients leading to incorrect predictions of the duration of treatment needed to reach a theoretical cure boundary, defined as less than 1 virion in the entire patient extracellular body fluid. (ii) The model cannot explain viral breakthrough, and (iii) the model wrongly predicts that viral load will remain at the same level once treatment is stopped. Lastly, we showed that including target cell dynamics in the model can explain not only monophasic viral decline during treatment but also non-monophasic HDV decline patterns such as biphasic, flat-partial response and viral breakthrough. Including target cell dynamics also predicts a viral rebound once BLV is stopped as observed in clinical studies.

2411.09092 2026-06-08 q-bio.PE cond-mat.stat-mech physics.bio-ph physics.comp-ph

Anomalous Diffusion and Emergent Universality in Coupled Memory-Driven Systems

异常扩散与耦合记忆驱动系统中的涌现普遍性

Nick Dashti, M. N. Najafi, Debra J. Searles

AI总结 研究探讨了简单局部相互作用如何产生涌现探索模式,提出耦合记忆驱动系统模型,揭示了异常扩散、非高斯分布及压缩指数相遇统计等新普遍性类别。

详情
Journal ref
J. Stat. Mech. (2026) 023215
Comments
24 pages including 7 figures and 2 tables; and 7 additional pages of supplementary information. In this revision the content and discussion has been extended. The title and abstract have been changed to be more aligned to the new content
AI中文摘要

理解简单局部相互作用如何产生涌现探索模式是统计物理的基本问题。我们引入了一个最小模型,描述两个耦合代理在避免重走自身路径的同时被彼此留下的痕迹吸引。该系统受启发但不局限于信息素引导的昆虫导航。自我回避与吸引的耦合产生了丰富的涌现行为,包括不同的异常扩散 regime、非高斯位置分布以及压缩的指数相遇统计。最值得注意的是,我们识别了耦合随机游走的新普遍性类别,其特征由独特的标度律和分布属性定义,这些属性据我们所知此前未被报道。这些发现推进了具有记忆和交互反馈的耦合随机过程的理论理解,为探索多代理系统中的传输现象提供了框架,超越了生物背景。

英文摘要

Understanding how simple local interactions give rise to emergent exploration patterns is a fundamental question in statistical physics. We introduce a minimal model of two coupled agents that avoid retracing their own paths while being attracted to the trails left by one another. This system is inspired by, but not limited to, pheromone-guided insect navigation. The coupling of self-avoidance and attraction generates rich emergent behavior, including distinct anomalous diffusion regimes, non-Gaussian position distributions, and compressed exponential encounter statistics. Most notably, we identify new universality classes for coupled random walks, characterized by unique scaling laws and distributional properties that, to our knowledge, have not been previously reported. These findings advance the theoretical understanding of coupled stochastic processes with memory and interaction feedback, providing a framework for exploring transport phenomena in a broad range of multi-agent systems beyond biological contexts.

2507.13310 2026-06-08 physics.soc-ph cs.SI math.DS nlin.AO q-bio.PE

Modelling the spillover from online engagement to offline protest: stochastic dynamics and mean-field approximations on networks

在线参与对线下抗议活动的溢出效应建模:网络上的随机动态与均场近似

Moyi Tian, P. Jeffrey Brantingham, Nancy Rodríguez

AI总结 本文提出一个耦合模型框架,分析在线参与如何影响线下抗议活动,通过随机模型和不同复杂度的均场模型估计基本再生数并预测活动激增时间,发现网络结构影响近似精度。

详情
Journal ref
Journal of Complex Networks, Volume 14, Issue 2, April 2026, cnaf057
Comments
44 pages, 33 figures
AI中文摘要

社交媒体正在改变离线生活的各个方面,从日常决定到冲突发展。本文提出一个耦合建模框架,通过在线社交网络层分析特定主题的参与如何溢出到线下抗议活动。我们开发了一个随机模型,并推导出几种不同复杂度的均场模型。这些模型使我们能够估计基本再生数并预测活动激增的时间。关键因素是在线与离线领域之间的传播率;为了产生线下爆发,该速率必须处于临界范围,既不高也不低。此外,使用合成网络,我们研究了网络结构如何影响这些近似的准确性。我们的发现表明,低密度网络需要更复杂的近似,而更简单的模型可以有效表示高密度网络。然而,当在两个真实网络上测试时,增加的复杂性并未提高准确性。

英文摘要

Social media is transforming various aspects of offline life, from everyday decisions such as dining choices to the progression of conflicts. In this study, we propose a coupled modelling framework with an online social network layer to analyse how engagement on a specific topic spills over into offline protest activities. We develop a stochastic model and derive several mean-field models of varying complexity. These models allow us to estimate the reproductive number and anticipate when surges in activity are likely to occur. A key factor is the transmission rate between the online and offline domains; for offline outbursts to emerge, this rate must fall within a critical range, neither too low nor too high. Additionally, using synthetic networks, we examine how network structure influences the accuracy of these approximations. Our findings indicate that low-density networks need more complex approximations, whereas simpler models can effectively represent higher-density networks. When tested on two real-world networks, however, increased complexity did not enhance accuracy.