arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.19280 2026-06-18 q-bio.QM 新提交

CollaboratoR: A scalable workflow for collaborative data entry and management

CollaboratoR:一种用于协作数据录入和管理的可扩展工作流程

Patrick Bills, Ashwini Ramesh, Lais Petri, Alejandra Martinez Blancas, Kelly Kapsar, Amar Deep Tiwari, Phoebe L. Zarnetske

AI总结 针对协作数据录入中不一致和效率低下的问题,开发了CollaboratoR R包,通过自动化验证和聚合,结合Google Sheets和GitHub,实现透明、可重复的数据管理,提升数据合成质量。

Comments 16 pages, 1 table, 1 figure

详情
AI中文摘要

有效的协作数据录入和透明度是构建稳健数据库和高质量数据综合的基础。然而,研究人员经常面临不一致的数据录入,无意中引入错误、误读和不一致,损害数据完整性。尽管开源工具的使用日益增多,许多人仍依赖低效的格式或昂贵的商业平台,而较少采用复杂的开源解决方案。这些低效率拖慢了工作流程,阻碍了研究人员构建用于综合研究(包括元分析)的基础数据库。为了解决这个问题,我们开发了CollaboratoR,一个可定制的R包,它自动化数据验证和聚合,确保一致性和透明度,并遵循FAIR数据原则,同时可选地使用Google Sheets进行协作数据录入和GitHub进行版本控制。CollaboratoR填补了临时电子表格和用于元分析数据提取的复杂系统之间的空白。数据被录入共享的Google Sheets,经过验证,推送到GitHub进行版本控制,然后在最终确定前再次验证以确保准确性。在两个案例研究(植物竞争和鸟类互动数据库)中测试,CollaboratoR在管理大型协作数据集方面证明是有效的。在这两个案例中,自动化验证及早标记了常见的录入和格式问题,提高了可追溯性,并减少了事后清理所花费的时间。该框架适用于数据综合为数据驱动决策提供信息的学科,如社会科学、生态学以及医学和药学研究。最终,CollaboratoR为高效、透明和可重复的协作数据管理提供了指导,增强了跨领域和行业的研究综合。

英文摘要

Effective collaborative data entry and transparency are foundational for building robust databases and high-quality data synthesis. Yet researchers often face inconsistent data entries, inadvertently introducing errors, misreadings, and inconsistencies that compromise data integrity. Despite the growing use of open-source tools, many still rely on inefficient formats or costly commercial platforms, while fewer adopt complex open-source solutions. These inefficiencies slow workflows and hinder researchers' ability to build foundational databases for synthesis research, including meta-analyses. To address this, we developed CollaboratoR, a customizable R package that automates data validation and aggregation, ensuring consistency and transparency and adhering to FAIR data principles, while optionally using Google Sheets for collaborative data entry and GitHub for version control. CollaboratoR fills the gap between ad-hoc spreadsheets and complex systems for data extraction in meta-analyses. Data are entered into shared Google Sheets, validated, and pushed to GitHub for version control, then re-validated after verification to ensure accuracy before finalizing. Tested in two case studies, plant competition and avian interaction databases, CollaboratoR proved effective at managing large collaborative datasets. In both, automated validation flagged common entry and formatting issues early, improving traceability and reducing time spent on post-hoc cleaning. This framework applies across disciplines where data synthesis informs data-driven decision-making, such as social science, ecology, and medical and pharmaceutical research. Ultimately, CollaboratoR offers guidance for efficient, transparent, and reproducible collaborative data management, enhancing research synthesis across fields and industries alike.

2606.18667 2026-06-18 q-bio.NC q-bio.QM 新提交

Can neurons speak? Semantic narration of vision at single-cell resolution

神经元能说话吗?单细胞分辨率的视觉语义叙述

Arnau Marin-Llobet, Richard Hakim, Sara Matias, Venkatesh N. Murthy, Na Li, Demba Ba

AI总结 提出NEURRATOR框架,通过将神经元活动解码为自然语言描述,实现单细胞分辨率的视觉语义叙述,并用于量化解码保真度及解析单个神经元和特定细胞类型的功能贡献。

详情
AI中文摘要

识别高级视觉皮层中单个神经元编码的内容是一个开放问题。响应难以直观参数化,而用于替代的深度网络嵌入是黑箱。这里,我们介绍NEURRATOR,一个将尖峰活动解码为单神经元分辨率的自由形式自然语言叙述的框架。一个学习编码器将来自任意子集的同步记录神经元的尖峰序列映射到冻结CLIP的补丁嵌入空间,多模态语言模型和稀疏自编码器生成并验证描述,无需语言侧训练。应用于自然电影观看期间小鼠视觉皮层的Neuropixel记录,NEURRATOR从数千个神经元、单个皮层区域、局部群体或分子定义的细胞类型进行叙述。我们利用这一特性来(i)量化解码保真度如何随群体大小和皮层区域变化,以及(ii)用平实的语言“叙述”单个神经元和基因标记的抑制性细胞类型对视觉表征的贡献。这将细胞身份从分类目标重新定义为视觉系统的功能探针,为神经系统提供了一种新的生物学见解单位。

英文摘要

Identifying what individual neurons encode in higher-order visual cortex is an open problem. Responses resist intuitive parameterization, and the deep-network embeddings used in their place are black boxes. Here, we introduce NEURRATOR, a framework that decodes spiking activity into free-form natural-language narration of the viewed scene at single-neuron resolution. A learned encoder maps spike trains from arbitrary subsets of simultaneously-recorded neurons into the patch-embedding space of a frozen CLIP, from which a multimodal language model and sparse autoencoder generates and validates a description with no language-side training. Applied to Neuropixel recordings of mouse visual cortex during natural-movie viewing, NEURRATOR narrates from thousands of neurons, singular cortical regions, local populations, or from a molecularly-defined cell-types. We use this property to (i) quantify how decoding fidelity scales with population size and cortical region, and (ii) "neurrate", in plain language, what individual neurons and genetically-tagged inhibitory cell-types contribute to visual representation. This recasts cell identity from a classification target into a functional probe of the visual system, providing a new unit of biological insights in neural systems.

2606.18575 2026-06-18 q-bio.QM 新提交

Adaptive COVID-19 Trajectory Forecasting Using MAB-Inspired Ensemble Weighting

基于MAB启发式集成加权的自适应COVID-19轨迹预测

Hamed Karami, Javier Redondo Anton, Geunsoo Jang, K. Selcuk Candan, Gerardo Chowell

AI总结 针对疫情预测中单一模型可靠性不足的问题,提出MAB启发式自适应加权策略,在三个美国COVID-19疫情波次中评估UCB、EXP3和epsilon-greedy等加权规则,发现EXP3和EPSStoch在概率预测质量上表现最优。

详情
AI中文摘要

预测疫情轨迹对公共卫生决策至关重要,但没有任何单一模型能在不同疫情阶段和预测场景中持续可靠。我们评估了多臂老虎机(MAB)启发的自适应加权策略,用于在组件模型性能随时间变化时组合疫情预测模型。利用来自三个疫情波次的美国COVID-19发病率数据,我们在固定短窗口和增长校准窗口下比较了UCB、EXP3和epsilon-greedy加权规则,包括确定性和随机集成变体。模型池包括SIR、SEIR、GLM、Gompertz、Richards、ARIMA、带漂移的随机游走、简单指数平滑、Holt线性趋势方法和指数增长。自适应集成与单个模型以及朴素、未加权和逆WIS加权集成基准进行比较。使用RMSE、加权区间分数(WIS)、95%预测区间覆盖率和平均95%预测区间宽度评估预测性能。在不同波次、校准窗口和预测时间跨度上,EXP3Stoch、EXP3Det和EPSStoch实现了最低的平均预测WIS。主要收益在于概率预测质量,特别是WIS和区间覆盖率,而非一致更低的点预测误差。简单基准(包括未加权和逆WIS集成)在若干场景中仍具竞争力。这些结果表明,MAB启发的自适应加权是疫情预测中有用的补充工具,尤其当模型技能随时间变化且预测不确定性较大时。

英文摘要

Forecasting epidemic trajectories is important for public health decision-making, but no single model is consistently reliable across epidemic phases and forecasting settings. We evaluate Multi-Armed Bandit (MAB)-inspired adaptive weighting strategies for combining epidemic forecasting models when component-model performance changes over time. Using U.S. COVID-19 incidence data from three epidemic waves, we compare UCB, EXP3, and epsilon-greedy weighting rules under fixed short-window and growing calibration windows, with both deterministic and stochastic ensemble variants. The model pool includes SIR, SEIR, GLM, Gompertz, Richards, ARIMA, random walk with drift, simple exponential smoothing, Holt's linear trend method, and exponential growth. Adaptive ensembles are compared with individual models and with naive, unweighted, and inverse-WIS weighted ensemble benchmarks. Forecast performance is assessed using RMSE, weighted interval score (WIS), 95% prediction-interval coverage, and mean 95% prediction-interval width. Across waves, calibration windows, and forecast horizons, EXP3Stoch, EXP3Det, and EPSStoch achieved the lowest mean forecast WIS. The main gains were in probabilistic forecast quality, especially WIS and interval coverage, rather than uniformly lower point forecast error. Simple benchmarks, including the unweighted and inverse-WIS ensembles, remained competitive in several settings. These results suggest that MAB-inspired adaptive weighting is a useful complementary tool for epidemic forecasting, especially when model skill is time-varying and forecast uncertainty is substantial.

2606.18295 2026-06-18 q-bio.QM 新提交

Archetypal Microbiome Profiles as Indicators of Nitrous Oxide Emission States in Activated Sludge

活性污泥中一氧化二氮排放状态的原型微生物组特征指标

Cheng Chen, Marcelo Seppi, Samir Suweis, Andreas Froemelt, Eberhard Morgenroth, Andreas Scheidegger, Carlo Albert

AI总结 本研究利用原型分析(AA)将活性污泥微生物组降维为可解释的低维状态空间,发现三个原型可解释63%-73%的群落变异,且高N2O排放样本集中在特定原型附近,为全尺度污水处理厂监测N2O排放状态提供了可解释框架。

详情
AI中文摘要

水资源回收设施(WRRFs)的一氧化二氮(N2O)排放随时间波动,可能源于多种微生物途径,使得源归因和全尺度预测困难。活性污泥微生物组的高维度进一步加剧了难度,其复杂动态的群落结构可能掩盖与N2O排放模式的关系。本研究评估了活性污泥微生物组的可解释低维表示是否与N2O排放状态相关。从瑞士两个全尺度WRRFs收集了时间序列16S rRNA基因扩增子谱和N2O排放指标。使用原型分析(AA)汇总属级相对丰度谱,将每个样本表示为少量可解释群落原型的凸组合。在两个WRRFs中,三个原型捕获了群落组成中大部分可解释变异(63%-73%),并定义了一个单纯形状态空间,其中样本聚集在顶点和边缘附近,表明群落组成围绕不同的原型状态及其混合组织。在训练时不使用排放标签的情况下,原型状态空间与二元N2O排放状态强烈对齐:两个工厂的高排放观测集中在特定原型周围,时间轨迹显示在高排放期间该原型的权重持续较高。功能总结表明高N2O原型具有位点特异性但途径相关的解释。温度进一步结构化原型状态空间,表明与N2O升高相关的微生物组配置的季节性驱动。总体而言,AA提供了一个可解释的框架来追踪微生物组状态转变,并可能支持全尺度WRRFs中高N2O排放状态的运行追踪。

英文摘要

Nitrous oxide (N2O) emissions from water resource recovery facilities (WRRFs) fluctuate over time and can arise from multiple microbial pathways, making source attribution and full-scale prediction difficult. The difficulty is compounded by the high dimensionality of activated sludge microbiomes, whose complex and dynamic community structure can obscure relationships with N2O emission patterns. This study evaluated whether interpretable, low-dimensional representations of activated sludge microbiomes can be correlated with N2O emission states. Temporal 16S rRNA gene amplicon profiles and N2O emission metrics were collected from two full-scale WRRFs in Switzerland. Genus-level relative-abundance profiles were summarized using archetypal analysis (AA), which represents each sample as a convex combination of a small number of interpretable community profiles. In both WRRFs, three archetypes captured most explainable variation in community composition (63%--73%) and defined a simplex state space in which samples clustered near vertices and edges, indicating that community compositions were organized around distinct archetypal states and their mixtures. Without using emission labels while training, the archetypal state space aligned strongly with binary N2O emission states: high-emission observations in both plants concentrated around a specific archetype, and temporal trajectories showed consistent high weights of this archetype during high-emission periods. Functional summaries suggested site-specific but pathway-relevant interpretations of the high-N2O archetype. Temperature further structured the archetypal state space, indicating seasonal forcing of microbiome configurations associated with elevated N2O. Overall, AA provides an interpretable framework to track microbiome regime shifts and may support operational tracking of high-N2O emission states in full-scale WRRFs.

2606.19081 2026-06-18 q-bio.NC cs.HC 新提交

Retrieval-Based Brain Decoding by Alignment, not Complexity

基于对齐而非复杂性的检索式脑解码

Matteo Ciferri, Matteo Ferrante, Nicola Toschi

AI总结 本文通过跨多数据集实验证明,线性对比解码器在脑解码中优于岭回归和标准非线性方法,表明解码增益更多来自训练目标而非架构复杂性。

详情
AI中文摘要

认知科学中的一个著名理论认为,大脑中的概念被组织为高维向量,语义含义由该空间中的方向和相对角度捕获。脑解码是从神经活动中重建或检索刺激(或其表示)的努力,涉及找到一个近似大脑如何表示概念的函数。这激发了对对比目标作为逆转脑损失函数的生物合理候选者的研究。在这项工作中,我们研究了如何将功能磁共振成像(fMRI)活动与视觉、语言和音频基础模型的嵌入空间进行一般性映射。尽管神经计算在微观尺度上是高度非线性的,但fMRI测量平均了跨空间和时间的信号,并进一步被噪声平滑,从而有效地线性化了可观察的表示。与这些观点一致,我们在多个数据集上的实验表明,线性对比解码器始终优于岭回归和标准非线性替代方案,并且这些结果在图像、文本和声音中普遍适用。这些发现表明,解码增益更多地来自训练目标的选择而非架构复杂性,指向对比线性模型作为脑解码的原则性策略。

英文摘要

A prominent theory in cognitive science suggests that concepts in the brain are organized as high-dimensional vectors, with semantic meaning captured by directions and relative angles in this space. Brain decoding is the effort of reconstructing or retrieving stimuli (or their representations) from neural activity and involves finding a function that approximates how the brain represents concepts. This motivates the investigation of contrastive objectives as biologically plausible candidates to reverse the brain loss function. In this work, we study how functional MRI (fMRI) activity can generally be mapped with the embedding spaces of foundation models in vision, language, and audio. Although neural computations are highly non-linear at the microscale, fMRI measurements average signals across space and time, further smoothed by noise, effectively linearizing the observable representation. Consistent with these views, our experiments across multiple datasets demonstrate that linear contrastive decoders consistently outperform ridge regression and standard non-linear alternatives, and that these results generalize across images, text, and sound. These findings indicate that decoding gains arise more from the choice of training objective than from architectural complexity, pointing to contrastive-linear models as a principled strategy for brain decoding.

2606.18523 2026-06-18 q-bio.QM cs.CV 新提交

DART: A design-aware microfluidic chip paradigm for real-time live-cell image analysis

DART: 一种设计感知的微流控芯片范式用于实时活细胞图像分析

Johannes Seiffarth, Matthias Pesch, Lukas Scholtes, Dietrich Kohlheyer, Hanno Scharr, Katharina Nöh

发表机构 * Institute for Bio- and Geosciences, IBG-1: Biotechnology(生物与地质科学研究所,IBG-1:生物技术) Computational Systems Biotechnology (AVT.CSB), RWTH Aachen University(计算系统生物技术(AVT.CSB),亚琛工业大学) Institute for Advanced Simulation, IAS-8: Data Analytics and Machine Learning(先进模拟研究所,IAS-8:数据分析与机器学习)

AI总结 提出DART范式,通过嵌入式标记和深度学习检测对齐CAD蓝图与物理芯片,实现高通量微流控芯片中所有感兴趣区域的快速定位和全自动图像处理,支持实时分析。

详情
AI中文摘要

高通量微流控活细胞成像产生丰富的单细胞数据。然而,用于定位每个包含一个细胞群体的感兴趣区域(RoI)并从记录图像中移除周围微流控结构的半自动化流程随RoI数量扩展,这阻碍了实时图像分析并将洞察时间延迟数小时至数天。我们提出了用于微流控培养芯片的设计感知和实时能力(DART)范式,该范式将CAD蓝图与物理芯片对齐,从而实现了对所有RoI的通量无关定位以及跨不同RoI几何形状和芯片布局的全自动图像处理。DART通过嵌入式基准标记和基于深度学习的标记检测建立这种对齐。我们使用瑞士军刀芯片验证DART,该芯片在1164个RoI位置上组合了八种结构不同的RoI设计。DART在五分钟内定位所有RoI,在40毫秒内从原始显微镜图像中移除微流控结构,并在每张图像1.1秒内执行全自动图像分析,包括细胞分割。这些能力共同使DART成为一个端到端的硬件-软件范式,具有实时分析能力,为闭环和结果驱动的智能显微镜铺平了道路。

英文摘要

High-throughput microfluidic live-cell imaging generates rich single-cell data. Yet semi-automated procedures for locating regions of interest (RoIs), each containing one cell population, and removing surrounding microfluidic structures from recorded images, scale with the number of RoIs. This prevents real-time image analysis and delays time-to-insight by hours to days. We introduce the Design-Aware and Real-Time capable (DART) paradigm for microfluidic cultivation chips, which aligns the CAD blueprint with the physical chip and thereby enables throughput-independent localization of all RoIs and fully automated image processing across diverse RoI geometries and chip layouts. DART establishes this alignment through embedded fiducial markers and deep-learning-based marker detection. We validate DART using the Swiss Army Knife chip, which combines eight structurally distinct RoI designs across 1164 RoI locations. DART localizes all RoIs in five minutes, removes microfluidic structures from raw microscopy images in 40 ms, and performs fully automated image analysis, including cell segmentation, in under 1.1 s per image. Together, these capabilities establish DART as an end-to-end hardware-software paradigm with real-time-capable analysis that paves the way toward closed-loop and outcome-driven smart microscopy.

2606.18302 2026-06-18 q-bio.OT cs.LG 新提交

Protein-Based Fish Species Identification: Dataset, Models, and Insights from Native Bangladeshi Fish

基于蛋白质的鱼类物种识别:孟加拉本土鱼类的数据集、模型与见解

Md Nasiat Hasan Fahim, Md. Abid Ullah Muhib, Mohammad Shahidur Rahman

发表机构 * Shahjalal University of Science

AI总结 本研究构建了首个孟加拉本土鱼类蛋白质序列数据集,并系统评估了七种架构,提出了一种轻量级混合模型MotifCNN-Transformer+TA-PE,在资源受限场景下优于大型蛋白质语言模型ProtBERT。

Comments Published in 2026 IEEE 2nd International Conference on Quantum Photonics, Artificial Intelligence & Networking (QPAIN). \c{opyright} 2026 IEEE. Personal use of this material is permitted

详情
Journal ref
2026 IEEE 2nd International Conference on Quantum Photonics, Artificial Intelligence & Networking (QPAIN)
AI中文摘要

在孟加拉国,正确识别鱼类物种对于粮食安全、经济发展和气候适应性至关重要。蛋白质序列直接反映功能和进化约束,对物种认证和生物多样性监测具有重要意义。然而,目前尚无针对孟加拉本土鱼类物种的蛋白质序列识别基准。本研究通过引入首个包含9种孟加拉本土鱼类2845条高质量蛋白质序列的精选数据集来填补这一空白。我们还通过对七种架构范式进行系统基准测试,建立了该领域首个蛋白质序列分类基线。此外,我们提出了一种实用的新型混合架构——MotifCNN与具有末端感知位置编码的Transformer(MotifCNN-Transformer+TA-PE)。该新架构实现了79.80%的准确率和0.80的宏F1分数。最高准确率83.04%由微调的蛋白质语言模型ProtBERT取得,该模型有4.2亿参数,需要双16GB GPU进行推理。根据McNemar检验,ProtBERT相比我们的MotifCNN-Transformer+TA-PE的3.24%准确率提升在统计上不显著(p = 0.1120)。在九类中的六类上,我们的新架构在每类识别中优于ProtBERT。此外,我们的MotifCNN-Transformer+TA-PE比ProtBERT快约5倍,小42倍,支持16倍更大的批处理大小,且无需GPU推理,使其在资源受限地区(如孟加拉农村)部署更为实用。除此之外,我们的基础性工作展示了系统发育关系对序列相似性的影响,并为南亚蛋白质依赖型经济中的渔业管理、食品认证和生物多样性保护建立了途径。

英文摘要

Correct identification of fish species is highly significant for food security, economic development, and climate resilience in Bangladesh. Protein sequences directly reflect functional and evolutionary constraints which are important for species authentication and biodiversity monitoring. Yet there exists no benchmark for native Bangladeshi fish species identification from protein sequence. In this study, we addressed this gap by introducing the first curated dataset for nine native Bangladeshi fish species of 2845 high quality protein sequences. We also established the first protein sequence classification baseline for this domain through a systematic benchmarking of seven architectural paradigms. Moreover, we propose a realistic deployable novel hybrid architecture of MotifCNN and Transformer with Terminal-Aware Positional-Encoding (MotifCNN-Transformer+TA-PE). Our novel architecture achieves 79.80% accuracy with macro-F1 of 0.80. The highest 83.04% accuracy is achieved by finetuned protein language model ProtBERT that has 420M parameters and requires dual 16GB GPUs for inference. According to McNemar's test, ProtBERT's 3.24% accuracy gain over our MotifCNN-Transformer+TA-PE is statistically insignificant (p = 0.1120). Our novel architecture beats it among six of the nine classes in per class identification. Also our MotifCNN-Transformer+TA-PE is approximately 5x faster, 42x smaller, and supports 16x larger batch size than ProtBERT and has GPU free inference, making it more practical for deployment in resources constrained areas such as rural Bangladesh. Beyond this, our foundational work shows effects of phylogenetic relationships on sequence similarity and establishes pathways for fisheries management, food authentication and biodiversity conservation in South Asia's protein dependent economy.

2606.18703 2026-06-18 cs.LG q-bio.QM 新提交

Contextualizing Biological Language Models across Modalities via Logit-Space Contrastive Alignment

跨模态生物学语言模型的逻辑空间对比对齐

Yanjun Shao, Yundi Chen, Yashvi Patel, Aurelien Pelissier, María Rodríguez Martínez

发表机构 * Biomedical Informatics and Data Science, Yale School of Medicine(耶鲁医学院生物医学信息学与数据科学)

AI总结 提出LOGICA框架,在输出逻辑空间进行对比学习,通过门控跨模态适配器保留预训练似然接口,实现跨不同词汇表模型的上下文条件预测,在蛋白质-配体结合、TCR-肽活性和药物耐药性预测任务上超越现有方法。

详情
AI中文摘要

预训练的生物学语言模型通过掩码标记预测暴露每个标记的概率分布,提供序列设计、变异评分和机制解释所依赖的似然接口。然而,这些分布是从广泛的无标注语料中学习得到的,并未自然地以任务特定的生物学上下文(如相互作用伙伴、细胞环境或治疗干预)为条件。现有的上下文匹配方法通常通过池化嵌入、对比潜在空间或任务特定的预测头来扭曲这一接口。我们提出了LOGICA(逻辑空间对比对齐),一种用于上下文条件预测的框架,直接在输出逻辑空间中进行对比学习。通过与每个模型的原生标记头兼容的门控跨模态适配器,LOGICA保留了预训练的似然接口,并将上下文化的标记对数似然转换为匹配分数。对齐是通过上下文敏感的标记概率来定义的,而不是共享嵌入空间中的邻近性,从而能够从具有不同词汇表的模型之间的稀疏配对数据中学习,无需共享分词器或解码器。LOGICA特别适用于突变局部变异排序,其中比较简化为扰动位点上突变标记的上下文条件似然。在蛋白质-配体结合、TCR-肽活性和药物条件耐药性预测中,LOGICA优于先前的最先进方法,包括匹配的潜在对比和条件MLM基线,同时保留了用于解释和生成的标记级接口。在保留基因的单突变药物耐药性预测中,LOGICA将AUC从接近随机的潜在空间基线约0.55提高到约0.65。

英文摘要

Pretrained biological language models expose per-token probability distributions through masked-token prediction, providing the likelihood interface central to sequence design, variant scoring, and mechanistic interpretation. Yet these distributions are learned from broad unlabeled corpora and are not naturally conditioned on task-specific biological contexts such as interaction partners, cellular environments, or therapeutic interventions. Existing contextual matching methods often distort this interface through pooled embeddings, contrastive latent spaces, or task-specific prediction heads. We introduce LOGICA (Logit-space Contrastive Alignment), a framework for context-conditioned prediction that performs contrastive learning directly in output-logit space. Using gated cross-modal adapters compatible with each model's native token head, LOGICA preserves the pretrained likelihood interface and converts contextualized token log-likelihoods into matching scores. Alignment is defined through context-sensitive token probabilities rather than proximity in a shared embedding space, enabling learning from sparse paired data across models with distinct vocabularies, without a shared tokenizer or decoder. LOGICA is particularly effective for mutation-local variant ranking, where comparisons reduce to context-conditioned likelihoods of mutant tokens at perturbed sites. Across protein--ligand binding, TCR--peptide activity, and drug-conditioned resistance prediction, LOGICA improves over prior state-of-the-art methods, including matched latent-contrastive and conditional MLM baselines, while retaining a token-level interface for interpretation and generation. On held-out-gene single-mutation drug-resistance prediction, LOGICA improves AUC from near-random latent-space baselines of $\sim$0.55 to $\sim$0.65.

2606.18672 2026-06-18 cs.LG cs.AI q-bio.GN 新提交

scGTN: Deep Siamese Graph Transformer Network for Single-cell RNA Sequencing Clustering

scGTN:用于单细胞RNA测序聚类的深度孪生图变换网络

Jinke Wu, Yifan Wang, Siyu Yi, Caiyang Yu, Ziyue Qiao, Nan Yin, Jiancheng Lv, Wei Ju

发表机构 * Sichuan University(四川大学) University of International Business and Economics(对外经济贸易大学) Great Bay University(大湾区大学) The Education University of Hong Kong(香港教育大学)

AI总结 提出scGTN框架,通过孪生图变换网络整合基因表达与细胞间结构信息,利用最优传输策略进行自监督聚类,在多个数据集上优于现有方法。

Comments Accepted by Proceedings of the Thirty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2026)

详情
AI中文摘要

单细胞RNA测序(scRNA-seq)在表征细胞水平基因表达、识别细胞类型以及促进对细胞异质性的理解中起着关键作用。尽管scRNA-seq数据聚类取得了显著进展,但我们认为当前方法常常忽略scRNA-seq数据固有的稀疏性和噪声,以及复杂的细胞间结构信息。为此,本文提出了一种基于深度孪生图变换网络(称为scGTN)的新型单细胞RNA-seq聚类框架,该框架明确整合了基因表达谱和细胞间结构依赖关系以进行细胞聚类。具体而言,我们将scRNA-seq数据建模为图,并构建两个增强图视图作为双视图以捕获互补的细胞间信息。然后,采用孪生图变换网络显式整合最短路径信息和节点间距离,以捕获细胞间更丰富的结构关系。最后,我们采用最优传输策略以自监督方式指导细胞聚类。在多个基准scRNA-seq数据集上的大量实验表明,我们的scGTN始终优于现有方法。我们的代码可在以下网址获取:https://github.com/...(原文链接)。

英文摘要

Single-cell RNA sequencing (scRNA-seq) serves a pivotal role in characterizing gene expression at the cellular level, enabling the identification of cell types and advancing the understanding of cellular heterogeneity. Despite the significant progress in scRNA-seq data clustering, we argue that current methods always ignore the sparsity and noise, as well as the complex intercellular structural information inherent in scRNA-seq data. Toward this end, in this paper, we propose a novel single-cell RNA-seq clustering framework via deep Siamese Graph Transformer Network (termed scGTN), which explicitly integrates gene expression profile and intercellular structural dependencies for cell clustering. In particular, we formulate scRNA-seq data as a graph and construct two augmented graph views that serve as dual views to capture complementary intercellular information. Then, a Siamese graph transformer network is employed to explicitly incorporate shortest-path information and node-wise distances for capturing richer structural relationships between cells. Finally, we employ an optimal transport strategy to guide the cell clustering in a self-supervised manner. Extensive experiments on multiple benchmark scRNA-seq datasets demonstrate that our scGTN consistently outperforms existing methods. Our code is available at https://github.com/W-RMSL/scGTN.

2606.18640 2026-06-18 cs.LG q-bio.QM 新提交

MetaboNet-Bench: A Multi-modal Benchmark for Glucose Forecasting in Type 1 Diabetes

MetaboNet-Bench:1型糖尿病血糖预测的多模态基准

Nathaniel Jeffries, Miriam Wolff, Sam Royston, Elizabeth Healey, Caleb Mayer, David Klonoff, Michael Snyder, Tao Wang

发表机构 * Department of Genetics, Stanford University School of Medicine(斯坦福大学医学院遗传学系) Replica Health Boston Children’s Hospital, Harvard Medical School(哈佛医学院波士顿儿童医院) Diabetes Research Institute, Mills-Peninsula Medical Center(米尔斯半岛医学中心糖尿病研究所)

AI总结 针对1型糖尿病血糖预测算法缺乏标准化评估基准的问题,提出MetaboNet-Bench多模态基准,集成血糖、胰岛素和碳水化合物数据,通过多个模型对比验证多模态数据对模型性能的影响。

Comments main content in 10 pages with 5 figures; supplementary section with 11 more pages and 5 more figures

详情
AI中文摘要

血糖预测算法是1型糖尿病血糖控制管理的重要方面。迄今为止,研究社区已经开发了大量预测算法和模型。然而,公认的是,缺乏标准化的模型性能评估基准使得公平比较变得困难,并阻碍了进一步的创新,因此基准标准化迫在眉睫。此外,许多已发表的血糖预测算法仅限于CGM数据,忽略了其他多模态信号,如胰岛素剂量和碳水化合物摄入。在此,我们介绍MetaboNet-Bench,这是一个针对1型糖尿病患者的多模态血糖预测基准,它提供了一个可扩展的开源评估框架,用于比较利用血糖、胰岛素和碳水化合物数据的血糖预测算法。然后,我们通过基准测试几个最近发布的血糖预测模型和一个自定义的多模态时间序列模型(代表不同的模型架构)来展示其实用性。结果表明,添加数据模态的好处取决于模型的复杂性,并且纳入更多临床指标有助于识别未来研究中有意义的空白。

英文摘要

Glucose forecasting algorithms are an important aspect of glycemic control management in type 1 diabetes. So far, the research community has developed numerous algorithms and models for forecasting. However, it is well-recognized that the lack of standardized model performance evaluation benchmarks makes fair comparison difficult and hinders further innovation, and thus benchmark standardization is in urgent need. Furthermore, many published glucose forecasting algorithms are limited to CGM data alone, ignoring other multimodal signals such as insulin dosing and carbohydrate intake. Here, we introduce MetaboNet-Bench, a benchmark for multimodal glucose forecasting for patients with type 1 diabetes that provides an extensible open-source evaluation framework for comparison of glucose forecasting algorithms that leverage glucose, insulin, and carbohydrate data. We then demonstrate its utility by benchmarking several recently published glucose forecasting models and a custom multimodal time-series model, representing different model architectures. The results show that the benefit of adding data modalities is conditioned on the complexity of the model and that incorporating more clinical metrics helps identify meaningful gaps to fill for future research.

2606.18420 2026-06-18 cs.LG q-bio.QM stat.ML 新提交

Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

测量噪声限制了非线性模型在生物医学预测中相对于线性模型的优势

Marc-Andre Schulz, Kerstin Ritter

发表机构 * Hertie Institute for AI in Brain Health, University of Tübingen(赫蒂人工智能脑健康研究所,图宾根大学) Tübingen AI Center, University of Tübingen(图宾根人工智能中心,图宾根大学) Department of Psychiatry and Neurosciences, Charité – Universitätsmedizin Berlin(精神病学与神经科学系,柏林夏里特医学院) Bernstein Center for Computational Neuroscience, Berlin(伯恩斯坦计算神经科学中心,柏林) German Center for Mental Health (DZPG), partner site Tübingen(德国心理健康中心(DZPG),图宾根合作站点)

AI总结 本文指出,在生物医学表格数据中,测量噪声会削弱非线性结构,导致非线性模型与线性模型性能相当,并提出了一个精确的超额风险恒等式,揭示了测量可靠性、样本量和特征表示三个条件必须同时满足才能体现非线性优势。

详情
AI中文摘要

在生物医学表格数据上,诸如深度网络、梯度提升树和核方法等灵活模型,在给定相同特征的情况下,反复被线性回归和逻辑回归匹配或击败。通常的反应是将其视为模型方面的不足,需要通过更多数据、更好的架构或调参来修复,假设非线性结构存在而模型未能捕捉到。我们认为,当限制因素是测量而非模型时(这在生物医学中经常发生),这些修复无法奏效。加性噪声模糊了群体最优预测器,并且由于模糊在去除函数的广泛形状之前先去除精细、快速变化的细节,它比线性结构更快地抹去非线性结构。一个k阶交互作用被特征可靠性的k次幂衰减,而线性部分只衰减一次。在生物医学测量典型的可靠性下,即使底层生物学是强非线性的,非线性优势也可能消失,并且噪声所移除的部分无法通过更大的队列或更灵活的模型恢复,只能通过更好的测量。非线性是隐藏的,而非缺失,线性模型与灵活模型之间的平局本身并不能对生物学做出定论。这些片段是经典的,来自测量误差统计、心理测量学和高斯分析,我们将它们组合成一个精确的超额风险恒等式。测量可靠性是与样本量和特征表示并列的三个条件之一,必须对齐才能使灵活模型发挥作用,而它们共同只留下一个狭窄的窗口,大多数生物医学任务落在此窗口之外。在140个英国生物银行任务中,灵活模型与线性模型之间的差距(如果存在)带有预测的噪声特征,并且这三个条件可以通过干预而非仅通过基准测试来分离。

英文摘要

On biomedical tabular data, flexible models such as deep networks, gradient-boosted trees, and kernel methods are repeatedly matched or beaten by linear and logistic regression given the same features. The usual reaction is to treat this as a model-side shortfall, to be fixed with more data, a better architecture, or tuning, on the assumption that the nonlinear structure is there and the model has failed to capture it. We argue that these fixes cannot help when the binding limit is the measurement rather than the model, as it frequently is in biomedicine. Additive noise blurs the population-optimal predictor, and because blurring removes a function's fine, rapidly varying detail before its broad shape, it erases nonlinear structure faster than linear structure. A degree-$k$ interaction is attenuated by the $k$-th power of feature reliability, while the linear part is attenuated only once. At the reliabilities typical of biomedical measurement, the nonlinear advantage can vanish even when the underlying biology is strongly nonlinear, and what the noise removes cannot be recovered by a larger cohort or a more flexible model, only by better measurement. The nonlinearity is hidden, not absent, and a tie between linear and flexible models is not by itself a verdict on the biology. These pieces are classical, drawn from measurement-error statistics, psychometrics, and Gaussian analysis, and we assemble them into an exact excess-risk identity. Measurement reliability is one of three conditions, alongside sample size and feature representation, that must align for a flexible model to help, and together they leave only a narrow window that most biomedical tasks fall outside. Across 140 UK Biobank tasks, the gap between flexible and linear models, where it exists, carries the predicted noise signature, and the three conditions can be separated by intervention but not by a benchmark alone.

2606.18390 2026-06-18 cs.LG q-bio.QM 新提交

MOLAR: Learning Multimodal Molecular Representations from Noisy Labels

MOLAR: 从噪声标签中学习多模态分子表示

Yingxu Wang, Kunyu Zhang, Nan Yin, Yu Li, Eran Segal

发表机构 * Mohamed bin Zayed University of Artificial Intelligence(穆罕默德·本·扎耶德人工智能大学) Zhengzhou University(郑州大学) The Education University of Hong Kong(香港教育大学) The Chinese University of Hong Kong(香港中文大学) Weizmann Institute of Science(魏茨曼科学研究所)

AI总结 提出MOLAR框架,通过分离干净属性推断与标签观测,利用图与文本模态的残差证据,从噪声标签中学习多模态分子表示,在自然噪声和标签翻转基准上优于基线方法。

详情
AI中文摘要

动机:噪声标签是分子属性预测中的常见挑战,因为分子注释通常来自实验分析、 curated数据库或弱注释流程,而非直接观测到的干净生物状态。将记录标签视为可靠监督会导致模型记忆损坏的观测并学习误导性的分子证据。在多模态分子表示学习中,图-文本融合或对齐可能放大此问题,从而跨模态传播标签引起的错误。结果:我们提出MOLAR,一个从噪声标签中学习多模态分子表示的噪声感知框架。MOLAR将潜在干净属性推断与记录标签观测分离:图和文本视图为干净属性分布贡献残差证据,一个分类标签观测通道将此分布映射到记录标签用于训练。该公式从模型中推导出后验标签可靠性和模态特定的分子证据。在自然噪声分子基准和受控标签翻转基准上的实验表明,MOLAR始终优于代表性基线。可视化分析进一步表明MOLAR提供了可解释的可靠性和模态证据诊断。

英文摘要

Motivation: Noisy labels are a common challenge in molecular property prediction because molecular annotations are often obtained from assays, curated databases, or weak annotation pipelines rather than directly observed clean biological states. Treating recorded labels as reliable supervision can cause models to memorize corrupted observations and learn misleading molecular evidence. In multimodal molecular representation learning, this issue can be amplified by graph-text fusion or alignment, which may propagate label-induced errors across modalities. Results: We propose MOLAR, a noise-aware framework for learning multimodal molecular representations from noisy labels. MOLAR separates latent clean-property inference from recorded-label observation: graph and text views contribute residual evidence to a clean-property distribution, and a categorical label-observation channel maps this distribution to recorded labels for training. This formulation derives posterior label reliability and modality-specific molecular evidence from the model. Experiments on naturally noisy molecular benchmarks and controlled label-flipping benchmarks show that MOLAR consistently outperforms representative baselines. Visualization analyses further show that MOLAR provides interpretable reliability and modality-evidence diagnostics.

2606.18660 2026-06-18 q-bio.PE physics.soc-ph 新提交

Effects of spatial environmental noise on evolution of cooperation

空间环境噪声对合作演化的影响

Janguk Kim, Seung-Woo Son, Hye Jin Park

AI总结 通过添加退火和淬火噪声到空间演化博弈模型,发现退火噪声扩大合作区域和灭绝区域,而淬火噪声影响微弱,表明时间波动是噪声诱导合作相变的主要驱动力。

详情
AI中文摘要

我们研究了环境噪声对具有可变种群规模的空间演化博弈模型中合作的影响。基于一维晶格模型(其中空位通过空间选择促进合作),我们向环境质量参数添加随机噪声,并考虑两种不同类型:退火噪声(每个位置和时间步的环境质量独立波动)和淬火噪声(每个位置被分配一个永久固定的随机值)。对于退火噪声,我们通过用分布平均值替换依赖噪声的死亡概率来发展平均场理论,并发现增加噪声强度会使合作者-背叛者相边界和吸收边界在参数空间中向上移动,同时扩大合作区域和灭绝区域。这些预测得到了数值模拟的证实。相比之下,淬火噪声在所有噪声水平下几乎不改变相边界,对合作者频率只有微弱影响。这些结果共同表明,时间波动(而非静态空间异质性)是噪声诱导合作相结构变化的主要驱动因素。

英文摘要

We investigate the effects of environmental noise on cooperation in a spatial evolutionary game model with variable population size. Building on a one-dimensional lattice model in which vacancies promote cooperation through spatial selection, we add random noise to the environmental quality parameter and consider two distinct types: annealed noise, where the environmental quality fluctu ates independently at each site and each time step, and quenched noise, where each site is assigned a permanently fixed random value. For annealed noise, we develop a mean-field theory by replacing the noise-dependent death probabilities with their distribution averages, and find that increasing the noise intensity shifts both the cooperator-defector phase boundary and the absorbing boundary upward in the parameter space, simultaneously expanding the cooperative regime and the extinc tion region. These predictions are confirmed by numerical simulations. In contrast, quenched noise leaves the phase boundary nearly unchanged across all noise levels, exerting only a weak effect on cooperator frequency. Together, these results demonstrate that temporal fluctuations, rather than static spatial heterogeneity, are the primary driver of noise-induced shifts in the cooperative phase structure.

2606.18495 2026-06-18 physics.chem-ph physics.bio-ph physics.comp-ph q-bio.BM 新提交

Bayesian Sampling of Structural Ensembles: The Role of Ensemble-Counting Measures

结构系综的贝叶斯采样:系综计数测度的作用

Ivan Gilardoni, Giovanni Bussi

AI总结 本文提出Jeffreys测度作为系综计数测度,解决BELT框架中拉格朗日乘子空间平直测度导致的有限参考轨迹下后验分布不可归一化问题,并在RNA寡聚体模拟中验证了测度选择对贝叶斯估计的影响。

详情
AI中文摘要

结构系综精修被广泛用于将分子模拟与实验测量相结合。虽然大多数应用关注最大后验(MAP)系综,但后验分布的贝叶斯采样可以为任意可观测量提供不确定性估计和后验平均值。贝叶斯能量景观倾斜(BELT)框架引入了这一方向的一个显著步骤,其中对由拉格朗日乘子参数化的最大熵系综族进行采样。这里,我们表明在这种设置下,贝叶斯采样需要显式选择系综计数测度。特别是,原始BELT公式中使用的拉格朗日乘子空间的平直测度导致后验分布对于有限参考轨迹在形式上不可归一化。我们提出Jeffreys测度作为一种不变的系综计数处方,恢复了此处考虑的有限样本情况下的可归一化性,并为后验平均值提供了一致的定义。使用解析可处理的高斯模型和RNA寡聚体模拟的最大熵精修,我们比较了不同的系综计数测度,并表明它们可以显著影响贝叶斯估计。所得方法已在\ exttt{MDRefine}软件包中实现。

英文摘要

Structural ensemble refinement is widely used to integrate molecular simulations with experimental measurements. While most applications focus on the maximum-a-posteriori (MAP) ensemble, Bayesian sampling of the posterior distribution can provide uncertainty estimates and posterior averages for arbitrary observables. A notable step in this direction was introduced by the Bayesian Energy Landscape Tilting (BELT) framework, where sampling is performed on a family of maximum-entropy ensembles parametrized by Lagrange multipliers. Here, we show that Bayesian sampling in this setting requires an explicit choice of ensemble-counting measure. In particular, the flat measure in Lagrange-multiplier space used in the original BELT formulation leads to a posterior distribution that is formally non-normalizable for finite reference trajectories. We propose the Jeffreys measure as an invariant ensemble-counting prescription, restoring normalizability in the finite-sample situations considered here, and providing a consistent definition of posterior averages. Using both an analytically tractable Gaussian model and maximum-entropy refinement of RNA oligomer simulations, we compare different ensemble-counting measures and show that they can significantly affect Bayesian estimates. The resulting methodology has been implemented in the \texttt{MDRefine} software package.

2606.18277 2026-06-18 physics.soc-ph q-bio.PE 新提交

Multi-network comparison of between-farm contacts for infectious disease surveillance in swine production

猪生产中用于传染病监测的场间接触的多网络比较

Jason A. Galvis, Nicolas C. Cardenas, Gustavo Machado

AI总结 通过比较11种网络类型(车辆移动、动物移动和基于距离的场间接触),发现车辆移动网络(尤其是饲料运输)连接最密集,育肥场在多个网络中充当超级传播者,不同网络识别的高风险农场集合不同,支持将多种传播途径纳入疾病监测。

详情
AI中文摘要

了解猪场如何直接和间接地相互连接对于描述传染病传播至关重要。本研究旨在描述11种网络类型中猪场的连通性,包括车辆移动(即卡车和拖车)、动物移动和基于距离的场间接触,以识别生产类型之间以及可能一致被表征为超级传播者的场之间的联系。卡车和拖车移动网络连接最为密集,尤其是饲料运输,其连接水平比猪移动和基于距离的网络高98.7%至99.7%。这些网络还表现出农场之间最高程度和频率的连接,而聚合卡车网络(包括所有卡车类型)显示出作为连接农场的桥梁的最大潜力。育肥场在所有网络中都与其他农场类型高度互联。母猪场经常被其他农场类型访问,尤其是通过饲料卡车移动,占这些连接的8.7%。我们证明,在车辆移动和邻近网络中,育肥场作为超级传播者发挥了主要作用。当比较每个网络中按超级传播者得分排名前50的农场时,基于车辆的网络显示出最高的相似性,车辆网络之间共享高达89%的排名靠前的农场。相比之下,猪移动和基于距离的网络识别出大部分不同的排名靠前的农场集合,与其他接触网络分别最多共享4%和8%。总体而言,每个网络都表现出独特的连接结构,导致不同的高风险农场集合,特别是在向种猪场潜在传播方面。这些发现支持将多种传播途径整合到疾病监测中。

英文摘要

Understanding how swine farms are interconnected, directly and indirectly, is essential to characterizing infectious disease transmission. This study aimed to describe the connectivity of swine farms across 11 network types, including vehicle movements (i.e., trucks and trailers), animal movements, and distance-based farm-to-farm contacts, to identify links among production types and farms likely to be consistently characterized as super-spreaders. Truck and trailer movement networks were the most densely connected, particularly for feed transport, showing connectivity levels between 98.7% and 99.7% higher than those of pig movement and distance-based networks. These networks also exhibited the highest degree and frequency of connections between farms, while the aggregated truck network, which included all truck types, showed the greatest potential to act as a bridge connecting farms. Finisher farms were highly interconnected with other farm types across all networks. Sow farms were frequently reached by other farm types, especially through feed truck movements, representing up to 8.7% of these links. We demonstrated that in vehicle movements and proximity networks, finisher farms played a major role as super-spreaders. When comparing the top 50 farms ranked by super-spreader score in each network, vehicle-based networks showed the highest similarity, with up to 89% of top-ranked farms shared between vehicle networks. In contrast, pig movement and distance-based networks identified largely distinct sets of top-ranked farms, sharing at most 4% and 8%, respectively, with other contact networks. Overall, each network exhibited a distinct connectivity structure, resulting in different sets of high-risk farms, particularly regarding potential transmission to breeding farms. These findings support the integration of multiple transmission pathways into disease surveillance.

2601.12805 2026-06-18 q-bio.GN cs.AI cs.CL 版本更新

SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding

SciHorizon-GENE:从基因知识到功能理解的生命科学推理基准测试

Xiaohan Huang, Meng Xiao, Chuan Qin, Qingqing Long, Jinmiao Chen, Yuanchun Zhou, Hengshu Zhu

发表机构 * Computer Network Information Center, Chinese Academy of Sciences(中国科学院计算机网络信息中心) University of the Chinese Academy of Sciences(中国科学院大学) DUKE-NUS Medical School, National University of Singapore(新加坡国立大学杜克-新加坡医学学校) Singapore Immunology Network, Agency for Science, Technology and Research(新加坡免疫网络,科技研究局)

AI总结 针对大语言模型在基因级推理能力上的不足,构建了包含超过19万个人类基因和54万问题的基准SciHorizon-GENE,从研究关注敏感性、幻觉倾向、答案完整性和文献影响力四个生物学关键维度评估模型,揭示了模型在生成忠实、完整且基于文献的功能解释方面的持续挑战。

Comments Accepted by SIGKDD 2026. 12 pages

详情
AI中文摘要

大型语言模型(LLMs)在生物医学研究中展现出日益增长的潜力,尤其是在知识驱动的解释任务中。然而,它们从基因知识到功能理解的可靠推理能力——这是知识增强型细胞图谱解释的核心要求——仍然在很大程度上未被探索。为了填补这一空白,我们引入了SciHorizon-GENE,这是一个基于权威生物数据库构建的大规模基因中心基准。该基准整合了超过19万个人类基因的 curated 知识,包含超过54万个问题,涵盖了与细胞类型注释、功能解释和机制导向分析相关的多种基因到功能推理场景。受初步检查中观察到的行为模式启发,SciHorizon-GENE从四个生物学关键角度评估LLMs:研究关注敏感性、幻觉倾向、答案完整性和文献影响力,明确针对限制LLMs在生物解释管道中安全采用的失败模式。我们系统评估了多种最先进的通用和生物医学LLMs,揭示了基因级推理能力的显著异质性,以及在生成忠实、完整且基于文献的功能解释方面的持续挑战。我们的基准为在基因尺度上分析LLM行为建立了系统基础,并为模型选择和发展提供了见解,与知识增强型生物解释直接相关。

英文摘要

Large language models (LLMs) have shown growing promise in biomedical research, particularly for knowledge-driven interpretation tasks. However, their ability to reliably reason from gene-level knowledge to functional understanding, a core requirement for knowledge-enhanced cell atlas interpretation, remains largely underexplored. To address this gap, we introduce SciHorizon-GENE, a large-scale gene-centric benchmark constructed from authoritative biological databases. The benchmark integrates curated knowledge for over 190K human genes and comprises more than 540K questions covering diverse gene-to-function reasoning scenarios relevant to cell type annotation, functional interpretation, and mechanism-oriented analysis. Motivated by behavioral patterns observed in preliminary examinations, SciHorizon-GENE evaluates LLMs along four biologically critical perspectives: research attention sensitivity, hallucination tendency, answer completeness, and literature influence, explicitly targeting failure modes that limit the safe adoption of LLMs in biological interpretation pipelines. We systematically evaluate a wide range of state-of-the-art general-purpose and biomedical LLMs, revealing substantial heterogeneity in gene-level reasoning capabilities and persistent challenges in generating faithful, complete, and literature-grounded functional interpretations. Our benchmark establishes a systematic foundation for analyzing LLM behavior at the gene scale and offers insights for model selection and development, with direct relevance to knowledge-enhanced biological interpretation.

2605.10840 2026-06-18 cs.LG cs.AI q-bio.QM 版本更新

Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories

Clin-JEPA:一种多阶段协同训练框架,用于EHR患者轨迹的联合嵌入预测预训练

Yixuan Yang, Mehak Arora, Ryan Zhang, Baraa Abed, Junseob Kim, Tilendra Choudhary, Md Hassanuzzaman, Kevin Zhu, Ayman Ali, Chengkun Yang, Alasdair Edward Gent, Victor Moas, Rishikesan Kamaleswaran

发表机构 * Duke University(杜克大学)

AI总结 本文提出Clin-JEPA框架,通过多阶段预训练稳定协同训练编码器和预测器,解决EHR数据中联合嵌入预测的挑战,实现多任务下游任务的高性能表现。

Comments 16 pages, 4 figures, 8 tables. Code: https://github.com/YeungYathin/Clin-JEPA

详情
AI中文摘要

我们介绍了Clin-JEPA,一种用于EHR患者轨迹的联合嵌入预测(JEPA)预训练的多阶段协同训练框架。JEPA架构已在机器人领域实现了潜在空间规划,并在视觉领域实现了高质量的表示学习,但将其扩展到EHR数据以获得一个能够同时预测患者轨迹并服务于多种下游风险预测任务的单一主干,仍是一个开放性挑战。现有的JEPA框架要么在预训练后丢弃预测器(I-JEPA,V-JEPA),要么在冻结的预训练编码器上训练预测器(V-JEPA 2-AC),导致编码器在推理时无法感知预测器必须使用的滚动信号;在共享JEPA预测目标下协同训练编码器和预测器将提供这种基础,但朴素的协同训练不稳定,代表性崩溃和在线/目标漂移导致自回归滚动发散。Clin-JEPA的五阶段预训练课程——预测器预热、联合细化、EMA目标对齐、硬同步和预测器最终化——通过阶段解决每个失败模式,稳定地协同训练基于Qwen3-8B的编码器和一个具有9200万参数的潜在轨迹预测器。在MIMIC-IV ICU数据上,三个独立评估支持该框架:(1)潜在ℓ1滚动漂移唯一收敛(-15.7%)在48小时范围内,而基线和消融测试发散(+3%至+4951%);(2)编码器学习了临床可区分的潜在几何结构(衰变患者群体在潜在空间中偏离4.83×,而稳定患者仅偏离≤2.62×);(3)单一主干在多任务下游评估中优于强大的表格和序列基线。Clin-JEPA在ICareFM EEP上达到平均AUROC 0.851,在8个二元风险任务上达到0.883(比基线平均高0.038和0.041)

英文摘要

We present Clin-JEPA, a multi-phase co-training framework for joint-embedding predictive (JEPA) pretraining on EHR patient trajectories. JEPA architectures have enabled latent-space planning in robotics and high-quality representation learning in vision, but extending the paradigm to EHR data -- to obtain a single backbone that simultaneously forecasts patient trajectories and serves diverse downstream risk-prediction tasks without per-task fine-tuning -- remains an open challenge. Existing JEPA frameworks either discard the predictor after pretraining (I-JEPA, V-JEPA) or train it on a frozen pretrained encoder (V-JEPA 2-AC), leaving the encoder unaware of the rollout signal that the retained predictor must use at inference; co-training the encoder and predictor under a shared JEPA prediction objective would supply this grounding, but naïve co-training is unstable, with representation collapse and online/target drift causing autoregressive rollout to diverge. Clin-JEPA's five-phase pretraining curriculum -- predictor warmup, joint refinement, EMA target alignment, hard sync, and predictor finalization -- addresses each failure mode by phase, stably co-training a Qwen3-8B-based encoder and a 92M-parameter latent trajectory predictor. On MIMIC-IV ICU data, three independent evaluations support the framework: (1) latent $\ell_1$ rollout drift uniquely converges ($-$15.7%) over 48-hour horizons while baselines and ablations diverge (+3% to +4951%); (2) the encoder learns a clinically discriminative latent geometry (deteriorating-patient cohorts displace 4.83$\times$ further than stable patients in latent space, vs $\leq$2.62$\times$ for baseline encoders); (3) a single backbone outperforms strong tabular and sequence baselines on multi-task downstream evaluation. Clin-JEPA achieves mean AUROC 0.851 on ICareFM EEP and 0.883 on 8 binary risk tasks (+0.038 and +0.041 vs baseline average).

2605.01056 2026-06-18 q-bio.MN math.DS 版本更新

Numerical Reliability of Logistic Gene Regulatory Network Models: Preventing Expression Shutdown and Robust Integration of Boolean-Derived ODE Systems

逻辑基因调控网络模型的数值可靠性:防止表达关闭与布尔衍生常微分方程系统的鲁棒集成

Ismail Belgacem

AI总结 本研究证明Hill函数作为基因调控网络ODE模型中的调控核函数普遍不可靠,会导致表达关闭和复数污染;而逻辑函数作为替代,具有严格正的基础速率和全局Lipschitz性质,能提供鲁棒的数值积分和先验误差界。

Comments arXiv admin note: text overlap with arXiv:2512.14325

详情
AI中文摘要

基因调控网络通常从布尔更新规则转换为大型连续常微分方程系统,并通过数值积分进行吸引子识别、敏感性分析和控制设计。该积分的可靠性关键取决于代表调控的S形核函数。本仿真研究表明,Hill函数——近乎通用的选择——是一种普遍不可靠的核函数,而逻辑函数则是一种鲁棒的替代方案。展示了两种失效模式。首先,由于Hill函数在零输入时为零,双稳态电路会获得一个吸收的关闭状态:使用实验验证的大肠杆菌半乳糖操纵子自调控参数,Hill模型被困在不稳定分界线以下,而逻辑模型——其基础速率通过构造严格为正——仅通过基础产生在大约44分钟内逃逸,与约58分钟的分析估计相符。通过显式超越方程进行鞍结点分析表征双稳态窗口,并识别出阈值λθ=2,该阈值将单稳态和双稳态区域分开。其次,当Hill指数为非整数时(如在剂量-响应拟合中),幂律x^n=e^{nln x}在求解器过冲进入负浓度时会变为复数值。在一个80基因的布尔衍生基准测试中(n≈3.509),Hill求解器从t≈52.64开始被复数值无声污染,产生平滑但虚假的轨迹,而逻辑公式在t∈[0,200]内完成,没有出现任何警告。由于逻辑向量场是全局Lipschitz的且具有显式常数,我们进一步证明了经典阶的先验全局误差界——这是Hill公式在结构上无法获得的保证。

英文摘要

Gene regulatory networks are routinely translated from Boolean update rules into large continuous ODE systems integrated numerically for attractor identification, sensitivity analysis, and control design. The reliability of that integration depends critically on the sigmoidal kernel representing regulation. This simulation study shows that the Hill function -- the near-universal choice -- is a generically unreliable kernel, while the logistic function is a robust replacement. Two failure modes are demonstrated. First, because the Hill function vanishes at zero input, bistable circuits acquire an absorbing off-state: with experimentally grounded \textit{E. coli} galactose-operon autoregulation parameters, a Hill model stays trapped below the unstable separatrix, whereas the logistic model -- whose basal rate is strictly positive by construction -- escapes in about $44$~minutes through basal production alone, matching an analytical estimate of ${\approx}58$~min. A saddle-node analysis characterises the bistable window via an explicit transcendental equation and identifies the threshold $λθ=2$ separating monostable from bistable regimes. Second, when the Hill exponent is non-integer -- as in dose-response fits -- the power law $x^n=e^{n\ln x}$ turns complex-valued whenever a solver overshoots into negative concentrations. On an $80$-gene Boolean-derived benchmark with $n\approx3.509$, the Hill solver is silently contaminated by complex values from $t\approx52.64$, yielding smooth but spurious trajectories, whereas the logistic formulation completes $t\in[0,200]$ without a single warning. Because the logistic vector field is globally Lipschitz with explicit constant, we further prove an a priori global-error bound of classical order -- a guarantee structurally unavailable to the Hill formulation.

2603.27465 2026-06-18 q-bio.GN 版本更新

Poisoning the Genome: Targeted Backdoor Attacks on DNA Foundation Models

基因组投毒:针对DNA基础模型的目标后门攻击

Charalampos Koilakos, Ioannis Mouratidis, Ilias Georgakopoulos-Soares

AI总结 本研究首次系统研究基因组语言模型的训练数据投毒,通过在预训练和微调阶段注入少于1%的对抗序列,可选择性破坏目标基因组上下文的生成性能,并实现条件后门攻击和下游任务分类破坏。

Comments 23 pages, double column format

详情
AI中文摘要

基于DNA序列训练的基础模型在变异效应预测和基因组设计等生物学任务中取得了强劲性能。这些模型依赖于包含数万亿核苷酸标记的大规模公共基因组数据集。与自然语言不同,DNA序列缺乏语义透明性,使得在数据整理过程中难以检测被破坏或对抗性构造的条目。我们首次系统研究了基因组语言模型中的训练数据投毒,针对预训练和微调阶段。在预训练中,使用Evo 2和GENERator架构,我们表明训练语料中少于1%的对抗性构造序列可以选择性地降低目标基因组上下文上的生成性能,同时不影响无关序列。我们评估了三种场景:TATA-box启动子基序的破坏、CTCF结合位点的干扰以及插入所有训练基因组中不存在的合成序列。在微调中,我们展示了另外两种攻击。首先,在ClinVar衍生语料库中投毒一部分CTCF位点,在LoRA适配模型中安装一个条件后门,该后门几乎仅在触发序列存在时激活。其次,使用冻结的Evo 2 7B嵌入,对下游训练数据进行目标标签破坏,选择性地损害临床相关的变异分类任务,在BRCA1变异效应预测上进行了演示。这些结果表明基因组基础模型容易受到最小足迹的目标数据投毒。我们敦促该领域将数据来源追踪、完整性验证和对抗鲁棒性评估作为基因组模型开发管道的标准组成部分。

英文摘要

Foundation models trained on DNA sequences have achieved strong performance across biological tasks including variant effect prediction and genome design. These models rely on massive public genomic datasets comprising trillions of nucleotide tokens. Unlike natural language, DNA sequences lack semantic transparency, making corrupted or adversarially crafted entries difficult to detect during data curation. We present the first systematic study of training data poisoning in genomic language models, targeting both pre-training and fine-tuning stages. At pre-training, using Evo 2 and GENERator architectures, we show that less than 1% adversarially crafted sequences in the training corpus can selectively degrade generative performance on targeted genomic contexts while leaving unrelated sequences unaffected. We evaluate three scenarios: corruption of TATA-box promoter motifs, disruption of CTCF binding sites, and insertion of synthetic sequences absent from all training genomes. At fine-tuning, we demonstrate two additional attacks. First, poisoning a subset of CTCF sites in a ClinVar-derived corpus installs a conditional backdoor in a LoRA-adapted model that activates almost exclusively when the trigger sequence is present. Second, using frozen Evo 2 7B embeddings, targeted label corruption of downstream training data selectively compromises a clinically relevant variant classification task, demonstrated on BRCA1 variant effect prediction. These results show genomic foundation models are susceptible to targeted data poisoning with minimal footprint. We urge the field to adopt data provenance tracking, integrity verification, and adversarial robustness evaluation as standard components of the genomic model development pipeline.

2603.04939 2026-06-18 physics.soc-ph q-bio.PE 版本更新

When minor issues matter: symmetries, pluralism, and polarization in similarity-based opinion dynamics

当小问题重要时:基于相似性观点动力学中的对称性、多元性与极化

Brian Mintz, Daniel Simonson, Dominik Wodarz, Feng Fu, Natalia L. Komarova

AI总结 通过随机主体模型研究议题权重异质性对观点演化的影响,发现极小权重议题可破坏稳定态,增加收敛时间,并基于对称性分类揭示极化与议题重要性分布的关系。

Comments The supplement is provided as a pdf

详情
AI中文摘要

理解观点如何通过社会互动演化对于缓解极化至关重要。现有的观点动力学模型同时包含吸引和排斥相互作用,但通常假设所有议题同等重要。我们开发并分析了一个随机主体模型,其中议题具有异质性权重,这些权重既影响社会亲和力也影响观点改变的可能性。令人惊讶的是,即使引入一个权重任意小的单一议题,也能破坏原本稳定的状态,使收敛时间增加数个数量级。为了解释这些动力学,我们推导了一个平均场方法,并刻画了支配共识、极化与持续多元性的平衡对称性。对最多五个议题的这些对称性的完整分类表明,当重要性集中在少数议题上时,极化增加。相反,将重要性更广泛地分布在议题上会促进观点多样性并减少极化。我们的基于对称性的框架突显了议题显著性和社会容忍度如何共同塑造集体观点演化。

英文摘要

Understanding how opinions evolve through social interactions is crucial for mitigating polarization. Existing opinion-dynamics models incorporate both attractive and repulsive interactions but typically assume that all issues are equally important. We develop and analyze a stochastic agent-based model where issues carry heterogeneous weights that influence both social affinity and the likelihood of opinion change. Surprisingly, introducing even a single issue with arbitrarily small weight can destabilize otherwise stable states, increasing convergence times by orders of magnitude. To explain these dynamics, we derive a mean-field approach and characterize the equilibrium symmetries governing consensus, polarization, and persistent pluralism. A complete classification of these symmetries for up to five issues reveals that polarization increases when importance is concentrated on a small number of issues. Conversely, distributing importance more broadly across issues promotes diversity of opinions and reduces polarization. Our symmetry-based framework highlights how issue salience and social tolerance jointly shape collective opinion evolution.

2511.14555 2026-06-18 q-bio.NC cs.AI 版本更新

DecNefSimulator: A Modular, Interpretable Framework for Decoded Neurofeedback Simulation Using Generative Models

DecNefSimulator:一个用于解码神经反馈模拟的模块化、可解释框架

Alexander Olza, Roberto Santana, David Soto

发表机构 * Intelligent Systems Group, University of the Basque Country (UPV/EHU)(巴斯克国家大学智能系统组) Consciousness Group, Basque Center on Cognition, Brain and Language (BCBL)(巴斯克认知、大脑与语言中心意识组) Ikerbasque, Basque Foundation for Science(巴斯克科学基金会)

AI总结 提出DecNefSimulator,一个模块化可解释的模拟框架,将解码神经反馈形式化为机器学习问题,通过潜变量生成模型模拟参与者,直接观察内部状态并评估协议设计对学习的影响,可复现经验现象、识别失败条件并指导协议设计。

详情
AI中文摘要

解码神经反馈(DecNef)是一种有前景的非侵入性脑调控方法,在神经医学和认知神经科学中具有广泛应用。然而,DecNef研究的进展仍受限于受试者依赖的学习变异性、依赖间接测量来量化进展,以及实验的高成本和时间消耗。我们提出DecNefSimulator,一个模块化且可解释的模拟框架,将DecNef形式化为一个机器学习问题。除了提供虚拟实验室,DecNefSimulator使研究人员能够建模、分析和理解神经反馈动态。通过使用潜变量生成模型作为模拟参与者,DecNefSimulator允许直接观察内部认知状态,并系统评估不同协议设计和受试者特征如何影响学习。我们展示了这种方法如何(i)复现DecNef学习的经验现象,(ii)识别DecNef反馈未能诱导学习的条件,以及(iii)在人体实施之前,在计算机中指导设计更稳健可靠的DecNef协议。总之,DecNefSimulator连接了计算建模和认知神经科学,为方法创新、稳健协议设计以及最终更深入地理解基于DecNef的脑调控提供了原则性基础。

英文摘要

Decoded Neurofeedback (DecNef) is a promising non-invasive approach to brain modulation with wide-ranging applications in neuromedicine and cognitive neuroscience. However, progress in DecNef research remains constrained by subject-dependent learning variability, reliance on indirect measures to quantify progress, and the high cost and time demands of experimentation. We present DecNefSimulator, a modular and interpretable simulation framework that formalizes DecNef as a machine learning problem. Beyond providing a virtual laboratory, DecNefSimulator enables researchers to model, analyze and understand neurofeedback dynamics. Using latent variable generative models as simulated participants, DecNefSimulator allows direct observation of internal cognitive states and systematic evaluation of how different protocol designs and subject characteristics influence learning. We demonstrate how this approach can (i) reproduce empirical phenomena of DecNef learning, (ii) identify conditions under which DecNef feedback fails to induce learning, and (iii) guide the design of more robust and reliable DecNef protocols in silico before human implementation. In summary, DecNefSimulator bridges computational modeling and cognitive neuroscience, offering a principled foundation for methodological innovation, robust protocol design, and ultimately, a deeper understanding of DecNef-based brain modulation.

2511.05221 2026-06-18 cs.LG q-bio.NC 版本更新

ActiTect: A Generalizable Machine Learning Pipeline for REM Sleep Behavior Disorder Screening through Standardized Actigraphy

ActiTect:通过标准化体动记录进行REM睡眠行为障碍筛查的通用机器学习流程

David Bertram, Anja Ophey, Sinah Röttgen, Konstantin Kufer, Gereon R. Fink, Elke Kalbe, Clint Hansen, Walter Maetzler, Maximilian Kapsecker, Lara M. Reimer, Stephan Jonas, Andreas T. Damgaard, Natasha B. Bertelsen, Casper Skjaerbaek, Per Borghammer, Karolien Groenewald, Pietro-Luca Ratti, Michele T. Hu, Noémie Moreau, Michael Sommerauer, Katarzyna Bozek

发表机构 * Faculty of Mathematics and Natural Sciences, University of Cologne, Germany(科隆大学数学与自然科学学院,德国) Institute for Biomedical Informatics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Germany(科隆大学医学院与科隆大学医院生物医学信息学研究所,德国) Center for Molecular Medicine Cologne (CMMC), Faculty of Medicine and University Hospital Cologne, University of Cologne, Germany(科隆分子医学中心(CMMC),科隆大学医学院与科隆大学医院,德国) Medical Psychology | Neuropsychology and Gender Studies, Faculty of Medicine and University Hospital Cologne, University of Cologne, Germany(科隆大学医学院与科隆大学医院医学心理学 | 神经心理学与性别研究,德国) Cognitive Neuroscience, Insitute for Neuroscience and Medicine, INM-3, Research Center Juelich, Germany(认知神经科学,神经科学与医学研究所,Juelich研究中心,德国) Department of Neurology, Faculty of Medicine and University Hospital Cologne, University of Cologne, Germany(科隆大学医学院与科隆大学医院神经科,德国) Center of Neurology, Department of Parkinson, Sleep and Movement Disorders, University Hospital Bonn, University of Bonn, Germany(神经科中心,帕金森、睡眠与运动障碍部门,波恩大学医院,德国) German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany(德国神经退行性疾病研究中心(DZNE),波恩,德国) Cluster of Excellence for Aging and Aging-Associated Diseases (CECAD), University of Cologne, Germany(老龄化与相关疾病卓越中心(CECAD),科隆大学,德国) Department of Neurology, University Medical Center Schleswig-Holstein, Campus Kiel and Kiel University, Germany(神经科,施普伦德-霍斯特大学医院,基尔校区和基尔大学,德国) Department of Informatics, Technical University of Munich, Germany(信息学院,慕尼黑技术大学,德国) Institute for Digital Medicine, University Hospital Bonn, Germany(数字医学研究所,波恩大学医院,德国) Lundbeck Foundation Parkinson’s Disease Research Center (PACE), Aarhus University, Denmark(路德维希基金会帕金森病研究中心(PACE),奥胡斯大学,丹麦) Department of Nuclear Medicine, Aarhus University Hospital, Denmark(核医学部,奥胡斯大学医院,丹麦) Department of Electrical and Computer Engineering, Aarhus University, Denmark(电气与计算机工程系,奥胡斯大学,丹麦) Oxford Parkinson’s Disease Centre and Division of Neurology, Nuffield Department of Clinical Neurosciences, University of Oxford, UK(牛津帕金森病中心与神经科,牛津大学临床神经科学系,英国)

AI总结 提出ActiTect,一个全自动开源机器学习工具,通过标准化预处理和睡眠-觉醒检测,从体动记录中识别RBD,在多个独立队列中验证了泛化能力(AUROC 0.84-0.94)。

Comments 37 pages including Supplementary Information, 4 core figures, 1 supplementary figure. (v2: fixed a typo in Table 3 and made minor text edits; v3: post review)

详情
Journal ref
npj Digital Medicine (2026)
AI中文摘要

孤立性快速眼动睡眠行为障碍(iRBD)是α-突触核蛋白病的主要前驱标志,通常先于帕金森病、路易体痴呆或多系统萎缩的临床发作。虽然腕戴式体动记录仪通过捕捉异常夜间运动在大规模筛查中具有检测RBD的巨大潜力,但缺乏可靠高效的分析流程则无法使用。本研究提出了ActiTect,一个全自动开源机器学习工具,用于从体动记录中识别RBD。为确保跨异构采集设置的泛化能力,我们的流程包括稳健的预处理和自动睡眠-觉醒检测,以协调多设备数据并提取表征活动模式的生理可解释运动特征。模型开发基于78名个体的队列,在嵌套交叉验证下表现出强大的区分能力(AUROC = 0.95)。在盲法本地测试集(n = 31,AUROC = 0.86)和两个独立外部队列(n = 113,AUROC = 0.84;n = 57,AUROC = 0.94)上验证了泛化性。为评估现实世界鲁棒性,跨内部和外部队列的留一数据集交叉验证显示出一致的性能(AUROC范围 = 0.84-0.89)。补充稳定性分析表明,关键预测特征在数据集中保持可重复性,支持最终合并的多中心模型作为更广泛部署的稳健预训练资源。通过开源且易于使用,我们的工具促进了广泛采用,并促进了独立验证和协作改进,从而推动该领域向使用可穿戴设备的统一且可泛化的RBD检测模型发展。

英文摘要

Isolated rapid eye movement sleep behavior disorder (iRBD) is a major prodromal marker of $α$-synucleinopathies, often preceding the clinical onset of Parkinson's disease, dementia with Lewy bodies, or multiple system atrophy. While wrist-worn actimeters hold significant potential for detecting RBD in large-scale screening efforts by capturing abnormal nocturnal movements, they become inoperable without a reliable and efficient analysis pipeline. This study presents ActiTect, a fully automated, open-source machine learning tool to identify RBD from actigraphy recordings. To ensure generalizability across heterogeneous acquisition settings, our pipeline includes robust preprocessing and automated sleep-wake detection to harmonize multi-device data and extract physiologically interpretable motion features characterizing activity patterns. Model development was conducted on a cohort of 78 individuals, yielding strong discrimination under nested cross-validation (AUROC = 0.95). Generalization was confirmed on a blinded local test set (n = 31, AUROC = 0.86) and on two independent external cohorts (n = 113, AUROC = 0.84; n = 57, AUROC = 0.94). To assess real-world robustness, leave-one-dataset-out cross-validation across the internal and external cohorts demonstrated consistent performance (AUROC range = 0.84-0.89). A complementary stability analysis showed that key predictive features remained reproducible across datasets, supporting the final pooled multi-center model as a robust pre-trained resource for broader deployment. By being open-source and easy to use, our tool promotes widespread adoption and facilitates independent validation and collaborative improvements, thereby advancing the field toward a unified and generalizable RBD detection model using wearable devices.

2502.02904 2026-06-18 cs.HC cs.CL q-bio.NC 版本更新

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

ScholaWrite: 端到端学术写作过程数据集

Khanh Chi Le, Linghe Wang, Minhwa Lee, Ross Volkov, Luan Tuyen Chau, Dongyeop Kang

发表机构 * University of Minnesota(明尼苏达大学)

AI总结 提出ScholaWrite数据集,通过Chrome扩展记录Overleaf上的按键,捕捉从初稿到终稿的多月写作过程,包含5篇计算机科学预印本的近6.2万次文本修改及认知写作意图标注,揭示人类写作与LLM辅助之间的差距。

Comments Equal contribution: Khanh Chi Le, Linghe Wang, Minhwa Lee | project page: https://minnesotanlp.github.io/scholawrite/

详情
AI中文摘要

写作是一项认知要求高的活动,需要持续决策、高度依赖工作记忆,并在不同目标的任务之间频繁切换。为了构建与作者认知真正一致的写作助手,我们必须捕捉并解码作者将想法转化为最终文本背后的完整思维过程。我们提出了ScholaWrite,这是第一个端到端学术写作数据集,追踪从初稿到最终手稿的多月历程。我们贡献了三个关键进展:(1)一个Chrome扩展,可无干扰地记录Overleaf上的按键,从而能够收集真实、现场写作数据;(2)一个新颖的完整学术手稿语料库,附有认知写作意图的细粒度标注。该数据集包含基于LaTeX的五篇计算机科学预印本的编辑,捕捉了四个月内近6.2万次文本更改;(3)对学术写作微观动态的分析和见解,突出了人类写作过程与大型语言模型(LLM)在提供有意义帮助方面的当前能力之间的差距。ScholaWrite强调了捕获端到端写作数据以开发未来写作助手的重要性,这些助手支持而非取代科学家的认知工作。

英文摘要

Writing is a cognitively demanding activity that requires constant decision-making, heavy reliance on working memory, and frequent shifts between tasks of different goals. To build writing assistants that truly align with writers' cognition, we must capture and decode the complete thought process behind how writers transform ideas into final texts. We present ScholaWrite, the first dataset of end-to-end scholarly writing, tracing the multi-month journey from initial drafts to final manuscripts. We contribute three key advances: (1) a Chrome extension that unobtrusively records keystrokes on Overleaf, enabling the collection of realistic, in-situ writing data; (2) a novel corpus of full scholarly manuscripts, enriched with fine-grained annotations of cognitive writing intentions. The dataset includes \LaTeX-based edits from five computer science preprints, capturing nearly 62K text changes over four months; and (3) analyses and insights into the micro-dynamics of scholarly writing, highlighting gaps between human writing processes and the current capabilities of large language models (LLMs) in providing meaningful assistance. ScholaWrite underscores the value of capturing end-to-end writing data to develop future writing assistants that support, not replace, the cognitive work of scientists.

2510.12614 2026-06-18 physics.soc-ph cond-mat.stat-mech nlin.AO q-bio.PE 版本更新

Modeling Epidemics on Multiplex Networks: Epidemic Threshold and Basic Reproduction Number

多重网络上的流行病建模:流行阈值与基本再生数

Eric Alejandro Rozan, Mario Ignacio Simoy, Sebastian Bouzat, Marcelo Nestor Kuperman

AI总结 针对多重网络提出基本再生数R0的解析表达式,基于度均值场SIR模型和下一代矩阵方法,并通过数值模拟和随机仿真验证其作为流行阈值的作用。

Comments 22 pages, 7 figures

详情
AI中文摘要

准确的流行病预测需要考虑到真实社交互动的分层和异质性。基于同质混合或单层接触结构模型计算的基本再生数$\mathcal R_0$在复杂社会系统中的适用性有限。在此,我们推导了多重网络背景下$\mathcal R_0$的表达式,从而能够分析跨多个社会层的疾病传播。我们将单层复杂网络的基于度的平均场(DBMF)SIR模型推广到多重设置,其中每一层由其自身的度分布和感染率刻画。利用下一代矩阵方法,我们推导出基本再生数$\mathcal R_0$的解析表达式。多重DBMF方程的数值积分表明,$\mathcal R_0=1$标志着流行阈值,并如预期那样控制着关键爆发指标的行为。除了$\mathcal R_0$的精确表达式外,我们还引入了一个近似值$\tau$,它更易于计算,并且在系统的流行病学和拓扑参数方面具有更清晰的解释。基于随机智能体的模拟支持了这些发现,表明$\tau$与爆发早期阶段产生的平均继发感染数量之间存在直接对应关系,这与$\mathcal R_0$的流行病学解释一致。这项工作为分层接触结构提供了$\mathcal R_0$的稳健推广,为流行病预测和干预策略设计提供了更现实的基础。

英文摘要

Accurate epidemic forecasting requires models that account for the layered and heterogeneous nature of real social interactions. The basic reproduction number $\mathcal R_0$, as calculated from models that assume homogeneous mixing or single-layer contact structures, has limited applicability to complex social systems. Here, we derive an expression for $\mathcal R_0$ in the context of multiplex networks, enabling the analysis of disease transmission across multiple social layers. We adapt the Degree-Based Mean-Field (DBMF) SIR model for single-layer complex networks to the multiplex setting, where each layer is characterized by its own degree distribution and infection rate. Using the Next Generation Matrix method, we derive an analytical expression for the basic reproduction number $\mathcal R_0$. Numerical integration of the multiplex DBMF equations shows that $\mathcal R_0=1$ marks the epidemic threshold and governs the behavior of key outbreak indicators as expected. In addition to the exact expression for $\mathcal R_0$, we introduce an approximation, denoted by $τ$, which is simpler to compute and admits a more transparent interpretation in terms of the epidemiological and topological parameters of the system. Stochastic agent-based simulations support these findings, demonstrating a direct correspondence between $τ$ and the average number of secondary infections generated during the early stages of an outbreak, consistent with the epidemiological interpretation of $\mathcal R_0$. This work provides a robust generalization of $\mathcal R_0$ for layered contact structures, offering a more realistic basis for epidemic forecasting and the design of intervention strategies.

2508.10178 2026-06-18 q-bio.QM cs.LG 版本更新

Estimating carbon pools in the European Shelf sea environment: replacing reanalysis by model-informed machine learning?

估算欧洲陆架海环境中的碳库:用模型指导的机器学习替代再分析?

Jozef Skakala

发表机构 * Plymouth Marine Laboratory(普利茅斯海洋实验室) National Centre for Earth Observation(国家地球观测中心)

AI总结 提出用深度集成神经网络学习可观测变量与海洋碳库的关系,以低成本替代昂贵再分析,在西北欧陆架海实现高效碳库预测并提供不确定性。

Comments 37 pages, 9 figures (+ 3 in the appendix), v3 - published version

详情
Journal ref
JGR - Machine Learning and Computation 3 (2026)
AI中文摘要

陆架海对经济和碳循环至关重要,但碳库观测往往稀疏或高度不确定。碳再分析(无论是同化叶绿素a等代理变量还是直接同化碳)可提供替代方案,但运行成本高昂。我们提出使用计算成本低的神经网络集成(即深度集成)来学习直接可观测(大气、河流和海洋)变量与海洋碳库之间的关系,该关系来自一个物理-生物地球化学耦合模型。深度集成在西北欧陆架海(NWES)物理-生物地球化学模型自由运行模拟上训练。训练后,使用来自NWES再分析的输入而非自由运行来运行深度集成,证明它能高效预测多个NWES碳库(如碎屑、浮游动物、异养细菌),且与再分析的一致性远优于自由运行,同时提供不确定性信息。我们进一步表明,当深度集成直接由同化到再分析中的观测驱动时,其表现同样良好,但碳库只能预测在观测位置和时间。我们关注结果的可解释性,并展示了深度集成在未来气候假设情景中的潜在应用。我们认为,模型指导的机器学习为昂贵的再分析提供了可行的替代方案,并可在观测缺失和/或高度不确定的地方补充观测。

英文摘要

Shelf seas are important for the economy and the carbon cycle, but shelf sea observations for carbon pools are often sparse, or highly uncertain. An alternative can be provided by carbon reanalyses (whether assimilating proxy variables, such as chlorophyll-$a$, or directly carbon), but these are often expensive to run. We propose to use a computationally cheap ensemble of neural networks (i.e. deep ensemble) to learn the relationship between the directly observable (atmospheric, riverine and ocean) variables and marine carbon pools from a coupled physics-biogeochemistry model. The deep ensemble was trained on a North-West European Shelf (NWES) physical-biogeochemistry model free run simulation. After training, the deep ensemble was run using inputs from the NWES reanalysis instead of the free run, demonstrating that it can efficiently predict several NWES carbon pools (e.g., detritus, zooplankton, heterotrophic bacteria) in much better agreement with the reanalysis than the free run, while also providing uncertainty information. We further show that the deep ensemble performs similarly well when it is driven directly by the observations assimilated into the reanalysis, with the limitation that carbon pools can then be predicted only at the observed locations and times. We focus on explainability of the results and demonstrate potential use of the deep ensembles for future climate what-if scenarios. We suggest that model-informed machine learning presents a viable alternative to expensive reanalyses and could complement observations, wherever they are missing and/or highly uncertain.

2508.02400 2026-06-18 q-bio.QM 版本更新

Assimilation of machine learning-predicted nitrate to improve the quality of phytoplankton forecasting in the shelf sea environment

同化机器学习预测的硝酸盐以提高陆架海环境中浮游植物预报的质量

Deep S Banerjee, Jozef Skakala, David Ford

AI总结 本研究通过同化神经网络预测的表层硝酸盐浓度,显著提升了西北欧陆架海域浮游植物短期(1-5天)动力模型预报的准确性,最高改进达30%。

Comments 23 pages, 7 figures, v2 - published version

详情
Journal ref
Q.J.R.Meteorol.Soc. 152 (2026),
AI中文摘要

我们证明,同化神经网络(NN)预测的表层硝酸盐可显著改善西北欧陆架(NWES)海域浮游植物短期(1-5天)动力模型预报。我们表明,在当前英国气象局NWES业务系统中仅同化海洋水色叶绿素-$a$会导致春季水华后表层硝酸盐浓度过高,这是晚春和夏季NWES浮游植物预报中一些已知快速增长偏差的主要原因。同化硝酸盐观测数据可能有助于解决这一问题,但NWES硝酸盐数据通常不足以有效同化。因此,我们使用了一个最近开发并验证的神经网络(NN)模型,该模型从一系列可观测变量预测表层硝酸盐浓度,并将NN预测的硝酸盐同化到英国气象局NWES业务预报系统的研发版本中。由于硝酸盐同化,浮游植物5天预报技能提高了30%。我们表明,尽管通过使用NN模型预测的每周硝酸盐气候学数据可以实现大部分改进,但使用流依赖的硝酸盐数据具有明显优势。我们讨论了这一改进对一系列其他富营养化指标(如溶解无机磷和海底氧)的影响。我们认为,在近实时NWES业务预报系统中,将这种方法升级为完全混合机器学习-数据同化是可行的。

英文摘要

We demonstrate that assimilating Neural Network (NN)-predicted surface nitrate leads to a major improvement in phytoplankton short-range (1-5 day) dynamical model forecasts for the North-West European Shelf (NWES) seas. We show that assimilation of only ocean color chlorophyll-$a$ in the current Met Office NWES operational system can lead to excess surface nitrate concentrations in the post-Spring bloom period and these are a major reason behind some known, fast-growing biases in NWES phytoplankton forecasts during late Spring and Summer. Assimilating observations of nitrate would potentially help address this, but NWES nitrate data are typically not available in sufficient abundance to be effectively assimilated. We have therefore used a recently developed and validated neural network (NN) model predicting surface nitrate concentrations from a range of observable variables and assimilated the NN-predicted nitrate within a research and development version of the Met Office's NWES operational forecasting system. As a result of nitrate assimilation the phytoplankton 5-day forecast skill improves by up to 30%. We show that although much of this improvement can be achieved by using a weekly nitrate climatology predicted by the NN model, there is a clear advantage in using flow-dependent nitrate data. We discuss the impacts of this improvement on a range of additional eutrophication indicators, such as dissolved inorganic phosphorus and sea bottom oxygen. We argue that it should be feasible to upgrade this approach to a fully hybrid machine learning - data assimilation within the near-real time NWES operational forecasting system.

2506.13506 2026-06-18 cs.CV q-bio.NC 版本更新

Stimulus Motion Perception Studies Imply Specific Neural Computations in Human Visual Stabilization

刺激运动知觉研究暗示人类视觉稳定中的特定神经计算

David W Arathorn, Josephine C. D'Angelo, Austin Roorda

发表机构 * Montana State University, Dept of Electrical and Computer Engineering(蒙塔那州立大学电气与计算机工程系) University of California, Berkeley, Herbert Wertheim School of Optometry and Vision Science(加州大学伯克利分校赫伯特·韦特海姆视觉科学与眼科学学院)

AI总结 通过分析人类注视时眼球的微小抖动,发现视觉稳定机制比相机稳定或简单进化方案更复杂,提出了基于视网膜信号特定操作的功能模型和可能的神经回路实现。

详情
AI中文摘要

即使在注视期间,人眼也持续进行低幅度运动,以高达100Hz的频率在随机方向上小角度抖动。这种运动导致视网膜上图像的所有特征不断穿过多个视锥细胞,然而世界中稳定的物体被感知为稳定,而任何运动的物体被感知为运动。一系列持续十多年的实验揭示了视觉稳定的心理物理学比可能假设的(例如,从相机图像稳定的机制,或从进化角度可能假设的最简单解决方案)更为微妙。实验揭示的心理物理学强烈暗示了视网膜信号上的一组特定操作,导致了观察到的稳定行为。报告分为两个层次。首先是对很可能负责实验观察行为的机制的功能描述。其次是对可能实现功能行为的电路级神经元的更推测性提议。

英文摘要

Even during fixation the human eye is constantly in low amplitude motion, jittering over small angles in random directions at up to 100Hz. This motion results in all features of the image on the retina constantly traversing a number of cones, yet objects which are stable in the world are perceived to be stable, and any object which is moving in the world is perceived to be moving. A series of experiments carried out over a dozen years revealed the psychophysics of visual stabilization to be more nuanced than might be assumed, say, from the mechanics of stabilization of camera images, or what might be assumed to be the simplest solution from an evolutionary perspective. The psychophysics revealed by the experiments strongly implies a specific set of operations on retinal signals resulting in the observed stabilization behavior. The presentation is in two levels. First is a functional description of the action of the mechanism that is very likely responsible for the experimentally observed behavior. Second is a more speculative proposal of circuit-level neural elements that might implement the functional behavior.

2505.13373 2026-06-18 q-bio.MN 版本更新

State- versus Reaction-Based Information Processing in Biochemical Networks

生化网络中基于状态与基于反应的信息处理

Anne-Lena Moor, Age Tjalma, Manuel Reinhardt, Pieter Rein ten Wolde, Christoph Zechner

AI总结 本文引入基于反应与基于状态的轨迹描述概念,解释了线性噪声近似下互信息与精确马尔可夫跳变过程结果之间的差异,并提出基于反应的互信息变体以避免信息损失。

Comments Appendix is included as a PDF in the source files

详情
AI中文摘要

轨迹互信息常用于量化生化系统中的信息传递。通过广泛使用的线性噪声近似(LNA)结合高斯信道理论,可以获得轨迹互信息的可处理解。该方法预期对足够大的系统是准确的。然而,最近的观察表明,在某些情况下,通过这种方式获得的互信息与使用精确马尔可夫跳变过程形式主义推导的结果存在定性差异,并且即使在大拷贝数范围内,差异仍然存在。在本文中,我们表明这些差异可以通过引入基于反应与基于状态的轨迹描述概念来解释。在化学系统中,信息编码在反应事件序列中,马尔可夫跳变过程的基于反应的轨迹捕获了这些信息。我们证明,在高斯形式主义中,轨迹可以基于单个反应通道定义,也可以基于状态水平定义,其中不同反应通道被归纳为单个噪声项。虽然两种定义在拷贝数涨落方面一致,但基于状态的轨迹通常包含比基于反应的轨迹更少的信息。通过线性噪声近似常用的高斯互信息与基于状态的轨迹概念一致,这导致了与系统大小无关的系统性信息损失。我们证明,基于反应的高斯互信息变体可以防止这种信息损失。我们通过两个常见的细胞反应基序说明了不同轨迹描述的后果,并讨论了它们与Berg-Purcell和最大似然感知的联系。

英文摘要

Trajectory mutual information is frequently used to quantify information transfer in biochemical systems. Tractable solutions of the trajectory mutual information can be obtained via the widely used Linear-Noise Approximation (LNA) using Gaussian channel theory. This approach is expected to be accurate for sufficiently large systems. However, recent observations show that there are cases, where the mutual information obtained this way differs qualitatively from results derived using an exact Markov jump process formalism, and that the differences remain even in the large copy number regime. In this letter, we show that these differences can be explained by introducing the notion of reaction- versus state-based descriptions of trajectories. In chemical systems, the information is encoded in the sequence of reaction events, and the reaction-based trajectories of Markov jump processes capture this information. We show that within the Gaussian formalism, trajectories can be defined either based on individual reaction channels, or on a state-based level, where different reaction channels are summarised into a single noise term. While both definitions agree in terms of copy number fluctuations, state-based trajectories contain in general less information than reaction-based trajectories. The commonly used Gaussian mutual information via the Linear-Noise Approximation is consistent with a state-based trajectory notion, which causes a systematic loss of information independent of system size. We show that an alternative, reaction-based variant of the Gaussian mutual information prevents this loss of information. We illustrate the consequences of different trajectory descriptions for two common cellular reaction motifs and discuss their connection with Berg-Purcell and Maximum-Likelihood sensing.

2511.06140 2026-06-18 q-bio.QM

Non-invasive load measurement in the human tibia via spectral analysis of flexural waves

Ali Yawar, Daniel H. Aslan, Daniel E. Lieberman

Comments 23 pages, 23 figures, 1 table. Manuscript revised for clarity and consistency

详情
Journal ref
J. R. Soc. Interface (2026) 23 (239): 20251206
英文摘要

Forces transmitted by bones are routinely studied in human biomechanics, but it is challenging to measure them non-invasively, especially outside of laboratory settings. We introduce a technique for non-invasive, in vivo measurement of tibial compressive force using flexural waves propagating in the tibia. Modelling the tibia as an axially compressed Euler-Bernoulli beam, we show that tibial flexural waves have load-dependent frequency spectra. Specifically, under physiological conditions, peak locations in the wave acceleration spectra vary linearly with the compressive force on the tibia and may be used as proxies for the compressive force. We test the validity of this technique using a proof-of-concept wearable system that generates flexural waves via a skin-mounted mechanical transducer and measures the spectra of these waves using a skin-mounted accelerometer. In agreement with beam theory, data from 9 participants demonstrate linear relationships between tibial compressive force and spectral peak location, with Pearson correlation coefficients $r=0.82 - 0.99$ (mean $r=0.93$) for medial-lateral swaying and $r=0.81 - 0.98$ (mean $r=0.93$) for walking trials. This flexural wave-based technique could give rise to a new class of wearable sensors for non-invasive physiological bone load monitoring and measurement, impacting research in human locomotion and sports medicine.

2407.13037 2026-06-18 physics.bio-ph q-bio.QM

Dispersion Relations for Active Undulators in Overdamped Environments

Christopher J. Pierce, Daniel Irvine, Lucinda Peng, Xuefei Lu, Hang Lu, Daniel I. Goldman

详情
Journal ref
Phys. Rev. E 113, (2026) 065413
英文摘要

Organisms that locomote by propagating waves of body bending can maintain performance across heterogeneous environments by modifying their gait frequency $ω$ or wavenumber $k$. We identify a unifying relationship between these parameters for overdamped undulatory swimmers (including nematodes, spermatozoa, and mm-scale fish) moving in diverse environmental rheologies, in the form of an active `dispersion relation' $ω\propto k^{\pm2}$. A model treating the organisms as actively driven viscoelastic beams reproduces the experimentally observed scaling. The relative strength of rate-dependent dissipation in the body and the environment determines whether $k^2$ or $k^{-2}$ scaling is observed. The existence of these scaling regimes reflects the $k$ and $ω$ dependence of the various underlying force terms and how their relative importance changes with the external environment and the neuronally commanded gait.