The impact of behavioral homophily and conformity on epidemic spreading in networks with large groups
行为同质性和从众性对大规模群体网络中流行病传播的影响
AI总结 本研究通过近似主方程框架,探讨行为同质性和从众性如何共同影响大规模紧密连接群体(团)上的SIS流行病动态,发现同质性放大了从众性效应,使少数行为得以持续并显著改变流行阈值和传播模式。
行为同质性和从众性对大规模群体网络中流行病传播的影响
Olivier Ribordy, Clara Granell, Guillaume St-Onge, Laurent Hébert-Dufresne, Alex Arenas, Antoine Allard
AI总结 本研究通过近似主方程框架,探讨行为同质性和从众性如何共同影响大规模紧密连接群体(团)上的SIS流行病动态,发现同质性放大了从众性效应,使少数行为得以持续并显著改变流行阈值和传播模式。
理解社会行为如何影响流行病动态已成为数学流行病学的一个核心焦点。特别是, extit{行为同质性}(个体倾向于与相似他人交往)和 extit{从众性}(个体行为向群体规范调整)是塑造传播模式的关键机制。在这项工作中,我们研究了这些行为过程对大规模、紧密连接的群体(建模为团)网络上的易感-感染-易感(SIS)动态的综合影响。每个个体都有内在的行为偏好,但他们在群体中表达的行为受群体组成的调节,反映了从众动态。利用近似主方程(AME)框架,我们刻画了行为异质性、群体结构和流行病局域化之间的相互作用。我们的结果表明,行为同质性放大了从众性在大群体中的效应,使少数行为得以持续,并显著改变了流行阈值和传播模式。
Understanding how social behavior influences epidemic dynamics has become a central focus in mathematical epidemiology. In particular, \textit{behavioral homophily} (the tendency of individuals to associate with similar others) and \textit{conformity} (the adjustment of individual behavior to group norms) are key mechanisms in shaping transmission patterns. In this work, we investigate the combined impact of these behavioral processes on the susceptible-infected-susceptible (SIS) dynamics on networks with large, densely connected groups, modeled as cliques. Each individual has an intrinsic behavioral preference, but their expressed behavior within a group is modulated by its composition, reflecting conformity dynamics. Using the approximate master equations (AME) framework, we characterize the interplay between behavioral heterogeneity, group structure, and epidemic localization. Our results reveal that behavioral homophily amplifies the effects of conformity in large groups, enabling minority behaviors to persist as well as substantially shifting epidemic thresholds and spreading regimes.
平衡结构与随机性:用于上下文依赖计算的最大熵网络
Ludwig Hruza, Srdjan Ostojic
AI总结 提出基于最大熵原理的网络连接规范模型,通过将任务约束转化为概率分布上的条件,在无需特定学习算法的情况下,揭示了任务约束与熵最大化之间的权衡如何产生与梯度下降训练网络定性定量匹配的神经群体结构。
理解网络功能如何约束神经连接是神经科学的核心挑战。一种有影响力的方法是在认知任务上使用梯度下降训练神经网络,并表征由此产生的连接性。一个关键限制是,所得结构取决于训练过程的细节。这里我们提出一种基于最大熵原理的互补规范方法,用于网络连接性,独立于任何特定的学习算法。我们将连接性描述为单神经元权重的概率分布,将任务要求表达为该分布上的约束,并确定在这些约束下最大化香农熵的唯一分布。权重尺度参数控制随机性与任务诱导结构之间的平衡。我们将该框架应用于两层前馈网络中的上下文依赖输入选择任务,并表明通过将非线性网络映射到增益调制线性模型,最大熵推理变得解析可处理。从先验均匀分布开始,我们发现任务约束下的熵最大化导致神经元群体的出现,每个群体由其上下文增益调制模式定义。增加上下文数量驱动从上下文特化到非特化随机群体的转变。增加权重尺度驱动从结构化到随机刺激选择性的平行转变。引人注目的是,这种最大熵连接性在定性和定量上都与不同学习机制下梯度下降训练的网络结构相匹配。我们的结果表明,任务约束与熵最大化之间的相互作用为理解神经网络中结构与功能的关系提供了一个基本原理。
Understanding how network function constrains neural connectivity is a central challenge in neuroscience. An influential approach is to train neural networks with gradient descent on cognitive tasks and characterize the resulting connectivity. A key limitation is that the resulting structure depends on the details of the training procedure. Here we propose a complementary normative approach based on the maximum entropy principle for network connectivity, independent of any particular learning algorithm. We describe connectivity as a probability distribution over single-neuron weights, express task requirements as constraints on this distribution, and determine the unique distribution maximizing Shannon entropy subject to these constraints. A weight scale parameter controls the balance between randomness and task-induced structure. We apply this framework to context-dependent input-selection tasks in 2-layer feed-forward networks, and show that maximum entropy inference becomes analytically tractable by mapping nonlinear networks onto gain-modulated linear models. Starting from an a priori homogeneous distribution, we find that maximizing entropy under task constraints leads to the emergence of populations of neurons, each defined by its pattern of contextual gain modulation. Increasing the number of contexts drives a transition from context-specialized to unspecialized, random populations. Increasing the weight scale drives a parallel transition from structured to random stimulus selectivity. Strikingly, this maximum entropy connectivity matches both qualitatively and quantitatively the structure of networks trained with gradient descent across different learning regimes. Our results suggest that the interplay between task constraints and entropy maximization provides a fundamental principle for understanding the relationship between structure and function in neural networks.
桥接自催化的两个理论框架:RAF集和化学计量自催化
Richard Golnik, Thomas Gatter, Wim Hordijk, Peter F. Stadler, Nicola Vassena
AI总结 本文通过化学计量矩阵统一了RAF理论和化学计量自催化两种自催化框架,证明在温和条件下任何RAF都是化学计量自催化的。
自催化是许多(生物)化学过程的核心,也是导致生命起源过程的关键。出现了两种看似非常不同的形式化定义自催化。Kauffman引入了集体自催化来描述相互催化彼此形成的分子系统,强调自催化系统的自维持特性。这一观点在反射自催化与食物生成集(RAF)理论中得到了数学形式化。与此同时,化学计量自催化从化学反应网络(CRN)理论中产生,侧重于自催化子网络的净产出、自放大特性。这两个框架在文献中独立共存,因为RAF理论将每个反应视为显式催化,而CRN方法通常完全排除显式催化反应。然而,两个框架都描述反应网络,因此允许以化学计量矩阵的形式进行共同的数学表示。我们强调了这一联系,并表明这两个形式化并不像看起来那样截然不同。为了说明这一点,我们证明,在温和且一般的条件下,任何RAF都是化学计量自催化的。
Autocatalysis lies at the heart of many (bio)chemical processes and is key to processes leading up to the origin of life. Two seemingly very different formalisms have emerged that define autocatalysis. Kauffman introduced collective autocatalysis to describe systems of molecules that mutually catalyze each other's formation, emphasizing the self-sustaining character of autocatalytic systems. This view is mathematically formalized in the theory of Reflexively Autocatalytic and Food-generated sets (RAF). In parallel, stoichiometric autocatalysis emerged from the theory of Chemical Reaction Networks (CRN), focusing on the net-productive, self-amplifying character of autocatalytic subnetworks. These two frameworks have coexisted independently in the literature, since RAF theory considers each reaction as explicitly catalyzed, while the CRN approach often excludes explicitly catalyzed reactions altogether. Nevertheless, both frameworks describe reaction networks and thus admit a common mathematical representation in terms of stoichiometric matrices. We highlight this connection and show that the two formalisms are less disparate than they might appear. To illustrate this point we prove that, under mild and general conditions, any RAF is stoichiometrically autocatalytic.
AI驱动的无创无标记表面增强拉曼光谱检测泪液和汗液中不同细胞来源的细胞外囊泡
Yang Li, Xiaoming Lyu, Ling Xia, Kuo Zhan, Haoyu Ji, Lei Qin, Seppo J. Vainio, Jian-An Huang
AI总结 本研究开发了一种基于盐诱导纳米颗粒聚集的AI辅助SERS方法,实现了对泪液和汗液中不同细胞来源的细胞外囊泡的高准确度无标记检测,准确率超过92%。
可穿戴传感技术能够对泪液和汗液等生物流体中的外泌体进行即时、连续和无创分析,是未来个性化医疗的重要组成部分。细胞分泌的细胞外囊泡(EVs)的主要检测和识别方法通常需要标记且耗时,导致EV机制研究和疾病诊断效率低下。虽然无标记表面增强拉曼光谱(SERS)已与深度学习模型结合用于血液中EV的识别,但其在泪液和汗液中EV的无创检测应用尚属空白。在此,我们通过开发一种基于盐诱导纳米颗粒聚集的人工智能(AI)辅助表面增强拉曼光谱(SERS)方法,填补了这一空白,实现了对泪液和汗液中EV的快速高准确度识别。值得注意的是,我们对来自6种细胞系(HepG2、Hela、143B、LO-2、BMSC、H8)的EV进行无标记检测和AI区分,实现了对来自7种不同疾病来源的泪液EV的识别,准确率超过92%。我们的结果表明,该平台不仅能区分多种细胞来源的EV,还能在泪液中产生高度可重复和选择性的EV信号,无需化学标记或分离步骤。分子动力学模拟显示,银原子(Ag)与蛋白质中多个氨基酸残基的氧原子形成静电相互作用,表明具有高亲和力。该策略实现了EV的超灵敏和抗干扰检测,为临床疾病的快速诊断提供了新思路。
Wearable sensing technology capable of point-of-care, continuous and non-invasive analysis of exosomes in biofluid such as tears and sweat is an essential part for future personalized medicine. Major detection and identification methods of cell secreted Extracellular Vesicles (EVs) often require labeling and are time-consuming, resulting in low efficiency in EV mechanism research and disease diagnosis. While the label-free Surface-enhanced Raman spectroscopy (SERS) has been combined with deep learning model for EV identification in blood, their application to non-invasive detection of EVs in tears and sweat are missing. Here, we filled this gap by developing an artificial intelligence (AI)-assisted Surface-enhanced Raman spectroscopy (SERS) method based on salt-induced nanoparticle aggregation for fast EV identification in tears and sweat with high accuracy. Significantly, our label-free detection and AI differentiation of EVs from 6 cell lines (HepG2, Hela, 143B, LO-2, BMSC, H8) achieved the identification of EVs in tear fluids from 7 different disease sources with accuracies >92%. Our results showed that this platform can not only distinguish EVs from multiple cell sources but also generate highly reproducible and selective EV signals in tear fluids without a need for chemical labeling or separation steps. Molecular dynamics simulations revealed that silver atoms (Ag) form electrostatic interactions with oxygen atoms of multiple amino acid residues in proteins, suggesting a high affinity. This strategy realizes ultra-sensitive and anti-interference detection of EVs, providing a new idea for the rapid diagnosis of clinical diseases.
ViroBench:病毒基因组学任务中的核苷酸基础模型基准测试
Dongxin Ye, Fang Hu, Han Hu, Shu Hu, Yang Tan, Wanli Ouyang, Stan Z. Li, Jie Cui, Nanqing Dong
AI总结 提出首个针对病毒基因组学的综合基准ViroBench,评估66个核苷酸基础模型在生物学理解和潜在生物安全风险上的表现,发现模型在系统发育和时间偏移下性能下降,生成任务中统计似然与生物功能有效性脱钩,且预训练数据的分类多样性比参数规模更重要。
核苷酸序列构成了生物系统的基本遗传基础,使得病毒基因组分析对生物医学进步至关重要。尽管生物基础模型,特别是核苷酸基础模型(NFMs)取得了进展,但该领域缺乏一个统一的病毒基因组学标准来促进社区发展并实施生物安全约束。为了解决这个问题,我们引入了ViroBench,这是第一个专门为病毒场景中的NFMs设计的全面且大规模的基准测试。ViroBench在两个关键维度上评估模型:生物学理解和潜在生物安全风险,覆盖4种任务类型中的18个不同场景。对66个不同架构的NFMs的广泛评估得出了三个关键结论。首先,NFMs在系统发育和时间偏移下表现出生物学理解的性能下降,表明外推能力较弱。其次,生成任务揭示了统计似然与生物功能有效性之间的脱钩,构成了潜在的生物安全风险。第三,受控消融研究表明,预训练数据中的分类多样性比参数规模更重要。具体来说,一个在多样化数据上训练的轻量级基线相比其原始模型实现了67.5%的性能提升。总体而言,ViroBench为未来病毒核苷酸基础模型的研究提供了可解释的诊断评估和可重复的测量框架。数据集和代码公开于https://github.com/QIANJINYDX/ViroBench。
Nucleotide sequences constitute the fundamental genetic basis of biological systems, rendering viral genomic analysis critical for biomedical advancement. Despite progress in biological foundation models, specifically nucleotide foundation models (NFMs), the field lacks a unified standard for viral genomics to facilitate community development and enforce biosecurity constraints. To address this, we introduce ViroBench, the first comprehensive and large-scale benchmark specifically designed for NFMs in viral settings. ViroBench evaluates models across two critical dimensions: biological understanding and latent biosecurity risk, covering 18 diverse scenarios within 4 task types. Extensive evaluation of 66 NFMs across diverse architectures yields three critical conclusions. Firstly, NFMs exhibit a performance degradation in biological understanding under phylogenetic and temporal shifts, indicating weak extrapolation capabilities. Secondly, generation tasks reveal a decoupling between statistical likelihood and biological functional validity, posing latent biosecurity risks. Thirdly, controlled ablation studies reveal that taxonomic diversity in pretraining data outweighs parameter scale. Specifically, a lightweight baseline trained on diverse data achieves a 67.5% performance gain over its original model. Overall, ViroBench provides interpretable, diagnostic evaluations and a reproducible measurement framework for future research on viral nucleotide foundation models. The datasets and code are publicly available at https://github.com/QIANJINYDX/ViroBench.
具有一般发射分布的极端首次通过问题的加速模拟算法
Emmanuel Akame Mfoumou, David Holcman
AI总结 本文提出一种利用渐近首次通过分布的高效模拟框架,通过递归逆变换算法和迭代方法,无需追踪粒子轨迹即可模拟极端到达时间顺序统计量,适用于有界域中局部吸收目标的扩散过程。
最快到达事件,即众多扩散粒子中第一个到达目标的事件,是分子随机系统中触发信号启动的核心。经典模拟方法依赖于所有粒子的完整轨迹生成,导致在大粒子数情况下计算成本过高。本文提出一个通用模拟框架,通过利用渐近首次通过分布高效生成到达时间的顺序统计量。该框架适用于具有局部吸收目标的有界域中的扩散过程,这些过程具有短时间首次通过渐近性,例如一维、二维和三维布朗运动。从瞬时发射情况开始,我们推导并实现了一种递归逆变换算法,无需跟踪粒子轨迹即可模拟前$k$个到达。我们通过迭代方法将该算法扩展到时间依赖的发射分布,从而能够模拟具有时间注入(从快速到持续发射)的系统的极端统计量。此外,我们提供了平均最快到达时间的渐近估计。总之,本文提出的绕过布朗轨迹模拟的加速算法可用于空间反应网络、稀有事件检测或扩散控制激活。
Fastest arrival events, where the first among many diffusing particles reaches a target, are central in triggering signal initiation in molecular stochastic systems. Classical approaches to simulate such events rely on full trajectory generation of all particles, leading to prohibitive computational costs in the large particle number regime. In this work, we present a general simulation framework for efficiently generating order statistics of arrival times by exploiting asymptotic first-passage distributions. This framework applies to diffusion processes in bounded domains with localized absorbing targets, for which short-time first-passage asymptotics are available, such as Brownian motion in dimensions one, two, and three. Starting with the case of instantaneous emission, we derive and implement a recursive inverse transform algorithm to simulate the first $k$ arrivals without tracking particle trajectories. We extend this algorithm to time-dependent emission profiles via an iterative approach, enabling the simulation of extreme statistics in systems with temporal injection, ranging from rapid to prolonged emission. Additionally, we provide asymptotic estimates of the mean fastest arrival time. To conclude, the present acceleration algorithm which bypasses Brownian simulations of trajectories can be used for spatial reaction networks, rare event detection, or diffusion-controlled activation.
平稳高斯过程中任意水平穿越的精确方差和Fano因子
Shivang Rawat, Flaviano Morone, David J. Heeger, Stefano Martiniani
AI总结 本文推导了光滑平稳高斯过程中任意水平穿越的方差和Fano因子的精确解析公式,揭示了时间相关结构如何决定穿越事件的聚类或规则性,并展示了在振荡和非振荡系统中的不同统计行为。
理解随机过程中水平穿越的统计特性在许多科学学科中至关重要。传统的Kac-Rice公式给出了水平穿越的平均速率,并得到了广泛应用。然而,该平均速率仅捕捉了穿越过程的粗略总结,完全依赖于随机过程在给定时刻的局部性质,因此忽略了过程随时间变化的相关结构。为了理解穿越事件(如神经元尖峰)是否在时间上聚集、分散或表现出更复杂的时间组织,必须超越平均速率,研究高阶穿越统计。在这里,我们通过推导光滑平稳高斯过程中任意水平穿越的方差和Fano因子的精确解析公式,超越了均值。我们的精确解揭示了完整的时间相关结构如何决定穿越是聚集还是变得规则。在具有振荡相关性的系统中,如随机阻尼谐振子,一次穿越会抑制紧接着的下一次穿越,产生亚泊松统计。然而,随着阻尼增加和振荡消失,高于阈值的大而缓慢的偏移可能产生多个紧密间隔的穿越,产生超泊松统计。在纯弛豫、非振荡系统中,如由Ornstein-Uhlenbeck噪声驱动的均值回复过程,驱动噪声和系统弛豫的时间尺度之间的竞争产生了更丰富的景观,包括随着阈值水平变化在亚泊松和超泊松统计之间的重入转变。总之,这里推导的精确方差和Fano因子补充了Kac-Rice平均速率,使得在使用高斯过程的任何场景中能够进行更稳健的参数估计和模型选择。
Understanding the statistics of level crossings in stochastic processes is crucial across many scientific disciplines. The traditional Kac-Rice formula gives the mean rate of level crossings and has found broad use. However, that mean rate captures only a coarse summary of the crossing process. It depends entirely on local properties of the stochastic process at a given instant and is therefore blind to the correlation structure of the process over time. To understand whether crossing events, such as neuronal spikes, tend to cluster in time, spread apart, or exhibit more complex temporal organization, one must go beyond the mean rate and study higher-order crossing statistics. Here we go beyond the mean by deriving the exact analytical formulae for the variance and Fano factor of arbitrary level crossings in smooth stationary Gaussian processes. Our exact solution reveals how the full temporal correlation structure dictates whether crossings cluster or become regular. In systems with oscillatory correlations, such as a stochastic damped harmonic oscillator, a recent crossing suppresses an immediate subsequent one, producing sub-Poissonian statistics. However, as damping increases and oscillations disappear, a large and slow excursion above the threshold can produce multiple closely spaced crossings, yielding super-Poissonian statistics. In purely relaxational, non-oscillatory systems, such as a mean-reverting process driven by Ornstein-Uhlenbeck noise, the competition between the timescales of the driving noise and system relaxation produces a richer landscape, including reentrant transitions between sub- and super-Poissonian statistics as the threshold level is varied. Taken together, the exact variance and Fano factor derived here complement the Kac-Rice mean rate, enabling more robust parameter estimation and model selection across any setting where Gaussian processes are used.
基因通过自组织多细胞相互作用网络的控制
Kyle R. Allison
AI总结 本文从生物学一般第一性原理出发,提出基于动态图的多细胞自组织理论,旨在控制基因表达并推动实验与计算多细胞生物学发展。
多细胞自组织驱动生物体发育,但缺乏全面理论,因为细胞的基本性质使常见方法复杂化。通过动态图框架描述这些性质,为大肠杆菌中的多细胞自组织提出了新的理论命题。本文从生物学一般第一性原理出发发展相应观点。所得视角可能有助于实验和计算多细胞生物学方法,以及控制和工程化多细胞系统的努力。
Multicellular self-organization drives development in biological organisms, yet a comprehensive theory is lacking as basic properties of cells can complicate common approaches. Framing such properties by dynamic graphs led to new theoretical propositions for multicellular self-organization in Escherichia coli. Here, corresponding ideas are developed from biologically-general first principles. The resulting perspective could aid both experimental and computational approaches to multicellular biology as well as efforts to control and engineer it.
基于条件归一化流的从头皮脑电无创重建深颞叶颅内脑电
Dongyi He, Bin Jiang, Kecheng Feng, Luyin Zhang, Ling Liu, Yuxuan Li, Yun Zhao, He Yan
AI总结 提出NeuroFlowNet,一种基于条件归一化流的跨模态生成框架,首次从头皮脑电信号重建整个深颞叶区域的颅内脑电信号,解决了高保真重建的难题。
尽管从无创头皮脑电图(sEEG)获取深部脑活动对神经科学和临床诊断至关重要,但直接生成高保真颅内脑电图(iEEG)信号仍是一个基本未探索的领域,限制了对深部脑动力学的理解。当前研究主要集中于传统信号处理或源定位方法,这些方法难以捕捉iEEG的复杂波形和随机特性。为应对这一关键挑战,本文引入NeuroFlowNet,一种新颖的跨模态生成框架,其核心贡献在于首次利用sEEG信号重建整个深颞叶区域的iEEG信号。NeuroFlowNet基于条件归一化流(CNF),通过可逆变换直接建模复杂条件概率分布,从而显式捕捉脑信号的随机性,从根本上避免了现有生成模型中常见的模式崩溃问题。此外,该模型集成了多尺度架构和自注意力机制,以稳健地捕捉细粒度时间细节和长程依赖关系。在公开的同步sEEG-iEEG数据集上的验证结果表明,NeuroFlowNet在时间波形保真度、频谱特征再现和功能连接恢复方面具有有效性。本研究为深部脑动力学的无创分析建立了一种更可靠、可扩展的新范式。该研究的代码可在https://github.com/hdy6438/NeuroFlowNet获取。
Although obtaining deep brain activity from non-invasive scalp electroencephalography (sEEG) is crucial for neuroscience and clinical diagnosis, directly generating high-fidelity intracranial electroencephalography (iEEG) signals remains a largely unexplored field, limiting our understanding of deep brain dynamics. Current research primarily focuses on traditional signal processing or source localization methods, which struggle to capture the complex waveforms and random characteristics of iEEG. To address this critical challenge, this paper introduces NeuroFlowNet, a novel cross-modal generative framework whose core contribution lies in the first-ever reconstruction of iEEG signals from the entire deep temporal lobe region using sEEG signals. NeuroFlowNet is built on Conditional Normalizing Flow (CNF), which directly models complex conditional probability distributions through reversible transformations, thereby explicitly capturing the randomness of brain signals and fundamentally avoiding the pattern collapse issues common in existing generative models. Additionally, the model integrates a multi-scale architecture and self-attention mechanisms to robustly capture fine-grained temporal details and long-range dependencies. Validation results on a publicly available synchronized sEEG-iEEG dataset demonstrate NeuroFlowNet's effectiveness in terms of temporal waveform fidelity, spectral feature reproduction, and functional connectivity restoration. This study establishes a more reliable and scalable new paradigm for non-invasive analysis of deep brain dynamics. The code of this study is available in https://github.com/hdy6438/NeuroFlowNet
用于脚桥被盖核光遗传刺激的自动位置偏好范式揭示运动停止关联偏好行为
Guanghui Li, Xingfei Hou, Zhenxiang Zhao
AI总结 开发自动化闭环光遗传系统,通过刺激脚桥被盖核诱导运动停止并建立位置偏好,揭示运动抑制与奖赏回路的耦合。
理解大脑如何将运动抑制与动机过程整合仍然是神经科学的一个基本问题。头侧脚桥被盖核(rostral Pedunculopontine nucleus)是参与运动控制的脑干结构,光遗传或电刺激可诱导短暂的运动停止。然而,我们目前对其在连接运动抑制与动机或奖赏相关过程中的潜在作用仍了解不足。为了进一步探索PPN刺激引起的效应并推断其在运动和情绪调节中作用的潜在机制,我们开发了一个全自动、低成本的系统,结合实时动物追踪与闭环光遗传刺激,使用OpenMV Cam H7 Plus和嵌入式神经网络模型。该系统自主检测大鼠位置,并在进入预定义感兴趣区域时触发光刺激,实现无偏、无监督的行为分析。光遗传激活头侧PPN中表达CaMKIIa的神经元可靠地诱导了短暂的运动停止。当运动停止与定义的感兴趣区域空间配对时,大鼠在有限训练后形成了稳健的位置偏好。这些结果表明,头侧PPN的激活可以将运动抑制与奖赏相关的行为回路耦合。总之,我们的工作为可扩展的闭环神经科学实验提供了技术框架,并初步证明头侧PPN可能参与协调运动抑制与动机过程。
Understanding how the brain integrates motor suppression with motivational processes remains a fundamental question in neuroscience. The rostral Pedunculopontine nucleus, a brainstem structure involved in motor control, has been shown to induce transient motor arrest upon optogenetic or electrical stimulation. However, our current understanding of its potential role in linking motor suppression with motivational or reinforcement-related processes is still insufficient. To further explore the effects induced by PPN stimulations and infer the potential mechanism underlying its role involved in both motor and emotional regulation, we developed a fully automated, low-cost system combining real-time animal tracking with closed-loop optogenetic stimulation, using the OpenMV Cam H7 Plus and embedded neural network models. The system autonomously detects the rat's position and triggers optical stimulation upon entry into a predefined region of interest, enabling unbiased, unsupervised behavioral assays. Optogenetic activation of CaMKIIa-expressing neurons in the rostral PPN reliably induced transient motor arrest. When motor arrest was spatially paired with a defined region of interest, rats developed a robust place preference after limited training. These results suggest that rostral PPN activation can couple motor inhibition with reinforcement-related behavioral circuitry. Together, our work provides both a technical framework for scalable closed-loop neuroscience experiments and preliminary evidence that the rostral PPN may participate in coordinating motor suppression with motivational processes.
多对齐对比学习用于酶-反应检索
Gengmo Zhou, Feng Yu, Wenda Wang, Zhifeng Gao, Guolin Ke, Zhewei Wei, Zhen Wang
AI总结 提出多对齐对比学习框架,通过联合建模酶-反应跨域兼容性及功能注释驱动的域内关系,并引入Gromov-Wasserstein正则化项,提升酶虚拟筛选和双向检索性能。
识别催化目标生化反应的酶是计算酶发现和生物催化剂设计的关键步骤。最近的表示学习方法将这一问题表述为酶-反应匹配,其中配对的酶和反应被嵌入到共享空间中。然而,大多数现有方法主要依赖于成对的酶-反应监督,并且对反应集或酶家族内部关系的利用有限。本文介绍了一种用于生化检索的多对齐对比学习框架。该框架联合建模酶与反应之间的跨域兼容性以及由功能注释诱导的域内关系。此外,受Gromov-Wasserstein启发的正则化目标鼓励学习的酶和反应表示空间之间的几何一致性。通过将成对的催化监督与高阶关系对齐相结合,该模型捕获了直接的酶-反应关联以及更广泛的功能组织。我们在酶虚拟筛选和双向酶-反应检索任务上评估了该方法。在EnzymeMap上的实验表明,与强对比基线相比,在BEDROC和富集因子指标下,早期识别性能有所提高。在ReactZyme上,该方法在基于时间、酶相似性和反应相似性的划分中均取得了一致的增益,展示了对未见酶和未见反应的鲁棒性。消融研究进一步表明,域内对齐、功能监督和几何正则化项各自对观察到的改进有所贡献。这些结果表明,建模多种形式的对齐可以改进用于酶发现、反应注释及相关计算生物学应用的对比检索模型。
Identifying enzymes that catalyze target biochemical reactions is a key step in computational enzyme discovery and biocatalyst design. Recent representation-learning methods formulate this problem as enzyme--reaction matching, where paired enzymes and reactions are embedded into a shared space. However, most existing approaches primarily rely on pairwise enzyme--reaction supervision and make limited use of the relationships within reaction sets or enzyme families. This work introduces a multi-alignment contrastive learning framework for biochemical retrieval. The framework jointly models cross-domain compatibility between enzymes and reactions and within-domain relationships induced by functional annotations. In addition, a Gromov--Wasserstein-inspired regularization objective encourages geometric consistency between the learned enzyme and reaction representation spaces. By combining pairwise catalytic supervision with higher-order relational alignment, the model captures both direct enzyme--reaction associations and broader functional organization. We evaluate the approach on enzyme virtual screening and bidirectional enzyme--reaction retrieval tasks. Experiments on EnzymeMap show improved early-recognition performance under BEDROC and enrichment-factor metrics compared with strong contrastive baselines. On ReactZyme, the method achieves consistent gains across time-based, enzyme-similarity, and reaction-similarity splits, demonstrating robustness to unseen enzymes and unseen reactions. Ablation studies further indicate that within-domain alignment, functional supervision, and the geometric regularization term each contribute to the observed improvements. These results suggest that modeling multiple forms of alignment can improve contrastive retrieval models for enzyme discovery, reaction annotation, and related computational biology applications.
FragmentNet: 自适应图分片用于图到序列分子表示学习
Ankur Samanta, Rohan Gupta, Aditi Misra, Christian McIntosh Clarke, Jayakumar Rajadas
AI总结 提出FragmentNet,通过自适应学习的分词器将分子图分解为化学有效的片段,并利用化学感知的空间位置编码保持分子拓扑,在片段级别进行掩码预训练,在多个属性预测任务上提升了性能。
分子表示学习方法通常将分子标记为单个原子或使用刚性、基于规则的分片分解,限制了它们捕捉有意义化学子结构上下文的能力。我们引入了FragmentNet,一种围绕新颖的自适应学习分词器构建的图到序列模型,该分词器将分子图分解为可调整粒度的化学有效片段,并辅以化学感知的空间位置编码,在生成的序列中保留分子拓扑。将自然语言处理中的掩码预训练策略扩展到分子领域,我们在化学有意义的片段级别而非单个原子级别对分子进行掩码和重建。在多个属性预测基准上的评估发现,在片段粒度上进行预训练在大多数任务上提高了下游性能,表明标记化粒度是分子表示学习的重要设计选择。
Molecular representation learning methods typically tokenize molecules as individual atoms or use rigid, rule-based fragment decompositions, limiting their ability to capture meaningful chemical substructure context. We introduce FragmentNet, a graph-to-sequence model built around a novel adaptive, learned tokenizer that decomposes molecular graphs into chemically valid fragments of adjustable granularity, complemented by chemically aware spatial positional encodings that preserve molecular topology in the resulting sequence. Extending masked pre-training strategies from natural language processing to the molecular domain, we mask and reconstruct molecules at the level of chemically meaningful fragments rather than individual atoms. Evaluating across multiple property prediction benchmarks, we find that pre-training at fragment granularity leads to improved downstream performance on the majority of tasks, demonstrating that tokenization granularity is an important design choice for molecular representation learning.
PerSival:基于神经网络的肌肉骨骼生物力学中连续介质力学模拟的普适可视化
David Rosin, Johannes Kässinger, Xingyao Yu, Okan Avci, Christian Bleiler, Oliver Röhrle
AI总结 本文提出一种神经网络架构,通过稀疏网格代理捕捉肱二头肌表面变形,实现3D上肢肌肉骨骼系统模型在资源受限设备上的实时可视化,平均误差0.97 mm。
本文提出一种新颖的神经网络架构,用于3D人体上肢肌肉骨骼系统模型的普适可视化。将模拟能力扩展到移动设备等资源贫乏系统,在众多研究领域中日益受到关注,以拓宽方法和结果的适用性。直到最近,由于计算成本过高,这一目标被认为对于肌肉骨骼系统的真实连续介质力学模拟而言遥不可及。在本工作中,我们使用稀疏网格代理来捕捉肱二头肌的表面变形,以训练一个深度学习模型,用于同一肌肉的实时可视化。这些代理模型均以5个肌肉激活水平作为输入,并输出肌肉表面每个网格节点的笛卡尔坐标向量。因此,神经网络架构的输入维度显著低于输出维度。5个肌肉激活水平足以实现肱二头肌2809个网格节点位置的平均误差为0.97 ± 0.16 mm,即0.57 ± 0.10%。该模型在仅使用CPU时每个预测变形状态的评估时间为9.88 ms,在GPU支持下为3.48 ms,对应的理论帧率分别为101 fps和287 fps。因此,深度学习代理为连续介质力学模拟在视觉实时应用中的可访问性提供了一条途径。
This paper presents a novel neural network architecture for the purpose of pervasive visualisation of a 3D human upper limb musculoskeletal system model. Bringing simulation capabilities to resource-poor systems like mobile devices is of growing interest across many research fields, to widen applicability of methods and results. Until recently, this goal was thought to be out of reach for realistic continuum-mechanical simulations of musculoskeletal systems, due to prohibitive computational cost. Within this work we use a sparse grid surrogate to capture the surface deformation of the m.~biceps brachii in order to train a deep learning model, used for real-time visualisation of the same muscle. Both these surrogate models take 5 muscle activation levels as input and output Cartesian coordinate vectors for each mesh node on the muscle's surface. Thus, the neural network architecture features a significantly lower input than output dimension. 5 muscle activation levels were sufficient to achieve an average error of 0.97 +/- 0.16 mm, or 0.57 +/- 0.10 % for the 2809 mesh node positions of the biceps. The model achieved evaluation times of 9.88 ms per predicted deformation state on CPU only and 3.48 ms with GPU-support, leading to theoretical frame rates of 101 fps and 287 fps respectively. Deep learning surrogates thus provide a way to make continuum-mechanical simulations accessible for visual real-time applications.
Virchow:百万级数字病理学基础模型
Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun Lee, Julian Viret, Eric Robert, Yi Kan Wang, Jeremy D. Kunz, Matthew C. H. Lee, Jan Bernhard, Ran A. Godrich, Gerard Oakley, Ewan Millar, Matthew Hanna, Juan Retamero, William A. Moye, Razik Yousfi, Christopher Kanan, David Klimstra, Brandon Rothrock, Thomas J. Fuchs
AI总结 提出Virchow,一个基于DINOv2自监督学习、在150万张H&E染色全切片图像上训练的6.32亿参数视觉Transformer模型,用于计算病理学,在泛癌检测和生物标志物预测任务上达到最先进性能。
通过分析病理图像实现精准医疗和决策支持系统的人工智能应用,有潜力彻底改变癌症的诊断和治疗。这类应用将依赖于模型捕捉病理图像中观察到的多样化模式的能力。为应对这一挑战,我们提出了Virchow,一个用于计算病理学的基础模型。利用DINOv2算法支持的自监督学习,Virchow是一个拥有6.32亿参数的视觉Transformer模型,在来自不同组织和标本类型的150万张苏木精-伊红染色全切片图像上训练,数据量比以往工作高出数个数量级。Virchow模型使得开发一个泛癌检测系统成为可能,该系统在17种不同癌症类型上的整体标本级AUC达到0.949,同时在7种罕见癌症类型上达到0.937的AUC。Virchow模型在内部和外部图像块级基准测试以及切片级生物标志物预测任务上均达到了最先进水平。性能的提升凸显了在大型病理图像数据集上训练的重要性,表明扩展数据和网络架构可以提高许多高影响计算病理学应用的准确性,尤其是在训练数据有限的情况下。
The use of artificial intelligence to enable precision medicine and decision support systems through the analysis of pathology images has the potential to revolutionize the diagnosis and treatment of cancer. Such applications will depend on models' abilities to capture the diverse patterns observed in pathology images. To address this challenge, we present Virchow, a foundation model for computational pathology. Using self-supervised learning empowered by the DINOv2 algorithm, Virchow is a vision transformer model with 632 million parameters trained on 1.5 million hematoxylin and eosin stained whole slide images from diverse tissue and specimen types, which is orders of magnitude more data than previous works. The Virchow model enables the development of a pan-cancer detection system with 0.949 overall specimen-level AUC across 17 different cancer types, while also achieving 0.937 AUC on 7 rare cancer types. The Virchow model sets the state-of-the-art on the internal and external image tile level benchmarks and slide level biomarker prediction tasks. The gains in performance highlight the importance of training on massive pathology image datasets, suggesting scaling up the data and network architecture can improve the accuracy for many high-impact computational pathology applications where limited amounts of training data are available.
心身问题的理论解:一个操作化证明——没有纯粹物理系统能展现人类意识的所有属性
Catherine Reason
AI总结 本文通过严格定义的理论推理,证明自我确定性操作与物理系统属性不相容,从而操作化地解决了心身问题。
本文提出了一个心身问题的操作化解,该解依赖于严格定义的理论推理而非哲学论证。我们识别出一个特定操作,它是所有健康人类意识个体的必要属性——具体而言,自我确定性操作,即健康有意识的人类能够确定地“知道”自己有意识的能力。该操作被证明与任何有意义的物理系统定义中可能的属性不一致。这种不一致性通过证明一个“no-go”定理来展示,该定理适用于任何能够进行人类逻辑推理的物理系统,如果这种推理要求既可靠又一致。该定理的证明既是通用的——它适用于证据影响某个物理系统状态的任何函数——也是递归的,因为任何支持此类函数的物理过程都被证明会隐含另一个这样的函数。因此,对于人类意识的至少一个方面,心身问题现在得到了决定性的解决。
This article presents an operationalized solution to the mind-body problem which relies on rigorously defined theoretical reasoning rather than philosophical argument. We identify a specific operation which is a necessary property of all healthy human conscious individuals -- specifically the operation of self-certainty, or the capacity of healthy conscious humans to "know" with certainty that they are conscious. This operation is shown to be inconsistent with the properties possible in any meaningful definition of a physical system. This inconsistency is demonstrated by proving a "no-go" theorem for any physical system capable of human logical reasoning, if this reasoning is required to be both sound and consistent. The proof of this theorem is both general -- it applies to any function whereby evidence affects the state of some physical system -- and recursive, since any physical process subserving a function of this type is shown to imply another such function. Thus for at least one aspect of human consciousness, the mind-body problem is now conclusively resolved.
C3P: 对比启动子-蛋白质预训练产生捕捉细菌基因调控的表征
Cameron Dufault, Scott Xu, Alan M. Moses
AI总结 提出对比启动子-蛋白质预训练(C3P),通过对齐启动子与对应蛋白质来学习细菌调控序列表征,在调控注释推断和零样本共调控基因检索中显著优于基因组语言模型。
尽管基因组语言模型(gLMs)规模不断扩大,但其解码调控序列功能的能力仍不明确。gLM预训练依赖于序列重建,由于调控DNA的噪声和快速演化特性,这可能面临挑战。自监督对比方法提供了一种有前景的替代方案。受CLIP等语言-图像架构启发,我们引入了对比启动子-蛋白质预训练(C3P)。通过学习将启动子与其对应蛋白质对齐,我们利用蛋白质语言模型学习到的丰富蛋白质表征作为启动子表征学习的监督信号。在8800万细菌启动子-蛋白质对上进行训练后,我们评估了C3P学习到的启动子表征在推断精选调控注释中的预测能力,发现其相比领先的gLMs有数倍提升。我们还引入了零样本共调控基因检索,即无需实验数据即可在基因组中找到共调控基因的能力。我们发现,与随机初始化基线相比,C3P训练始终提供显著的零样本性能提升,而gLMs则不然。规模分析揭示了进一步改进的潜力以及C3P的效率,它以领先gLMs训练成本的一小部分实现了强劲性能。除了证明C3P训练对于学习细菌调控序列表征的有效性外,我们强大的零样本共调控基因检索性能表明,仅从基因组中解码数百万细菌的基因调控是可能的。
Despite the increasing scale of genome language models (gLMs), their ability to decode the function of regulatory sequences remains unclear. gLM pretraining relies on sequence reconstruction, which may struggle due to the noisy, rapidly evolving nature of regulatory DNA. Self-supervised contrastive approaches provide a promising alternative. Inspired by language-image architectures like CLIP, we introduce contrastive promoter-protein pretraining (C3P). By learning to align promoters to their corresponding proteins, we leverage the rich representations of proteins learned by protein language models as supervisory signal for the learning of promoter representations. After training on 88 million bacterial promoter-protein pairs, we evaluate the predictive power of C3P-learned promoter representations for inference of curated regulatory annotations, finding multi-fold improvement over leading gLMs. We also introduce zero-shot co-regulated gene retrieval, the ability to find co-regulated genes in a genome using no experimental data. We find that compared to a randomly initialized baseline, C3P training consistently provides significant zero-shot performance gains, unlike gLMs. Scaling analysis reveals the potential for further improvement as well as the efficiency of C3P, which achieved strong performance at a fraction of the training cost of leading gLMs. In addition to demonstrating that C3P training is effective for learning representations of bacterial regulatory sequences, our strong zero-shot co-regulated gene retrieval performance suggests the possibility of decoding gene regulation for millions of bacteria from their genomes alone.
自发与决策脉冲神经网络中具有振荡动力学的多目标优化
Divyansh Sethi, Muhammad Faraz, KongFatt Wong-Lin
AI总结 本研究扩展了遗传算法(NSGA-III)在基于Izhikevich神经元的递归脉冲神经网络上的应用,通过优化连接参数以同时匹配目标神经元群体发放率和网络振荡频率,并在自发活动模型、低激活脑类器官和决策模型中验证了其有效性。
脉冲神经网络(SNNs)可用于实现成本高效的人工智能计算或实验观察到的神经数据的机制建模。在后者中,用递归脉冲神经网络(RSNNs)拟合神经数据仍然是一个挑战。重要的是,鉴于已知神经元网络振荡在神经功能中发挥重要作用,用神经发放率拟合特定的RSNN振荡频率尚未得到充分探索。在这项工作中,我们扩展了先前对基于敏感Izhikevich神经元的RSNNs应用遗传算法(GA),特别是非支配排序GA(NSGA-III),通过优化其连接参数以目标涌现的神经元(子)群体发放率和网络振荡频率。我们通过帕累托前沿上的RMSE,在自发活跃的模拟RSNN模型和低激活脑类器官上进行了评估,随后是一个具有瞬态决策动力学的模拟RSNN模型。在所有情况下,模型均由自发发放的皮层兴奋性和抑制性神经元组成。我们表明,NSGA-III可以轻松优化多个网络发放率和主导网络振荡频率,对于决策模型,还可以优化不同时间时期的活动模式。值得注意的是,发现主导振荡频率对参数更敏感,而发放率则更稳健地满足。我们还确定了决策的低活动状态。总体而言,我们成功展示了多目标GA优化在RSNNs和脑类器官的神经发放率和振荡上的实现。
Spiking neural networks (SNNs) can be used for implementing cost-efficient artificial intelligence computing or mechanistic modelling of experimentally observed neural data. In the latter, fitting neural data with recurrent SNNs (RSNNs) remains a challenge. Importantly, given that neuronal network oscillations are known to play important roles in neural functions, fitting specific RSNN oscillation frequencies with neural firing rates has yet to be fully explored. In this work, we extended our previous application of genetic algorithm (GA), specifically non-dominated sorting GA (NSGA-III), on sensitive Izhikevich neuron-based RSNNs by optimising their connectivity parameters to target emergent neuronal (sub)population firing rates and network oscillation frequencies. We evaluated this, via RMSEs on a Pareto frontier, on spontaneously active simulated RSNN model and low-activation brain organoid, followed by a simulated RSNN model with transient decision dynamics. In all cases, the models comprised spontaneously firing cortical excitatory and inhibitory neurons. We showed that NSGA-III could readily optimise for multiple network firing rates and dominant network oscillation frequencies, and for the decision-making model, for activity patterns in different time epochs. Notably, dominant oscillation frequencies were found to be more parameter sensitive, but firing rates were more robustly met. We also identified low-activity regime for decision-making. Overall, we have successfully demonstrated the implementation of multi-objective GA optimisation on RSNNs' and brain organoid's neural firing rates and oscillations.
超阈限信息处理的量子类比形式体系
Vasily Lubashevskiy, Ihor Lubashevsky
AI总结 提出一种云函数形式体系,结合神经场理论与感知空间特征,模拟大规模脑网络中感官信息处理与心理表征之间的动态关系,并应用于决策中的改变主意现象。
我们发展了一种新颖的云函数形式体系,描述大规模脑网络中的感官信息处理(超阈限处理)与观察对象心理表征内容之间的动态关系。该形式体系结合了大规模神经活动的神经场理论与感知对象的空间特征及其从第一人称视角嵌入环境的方式。云函数具有两个关键特征:(i)其空间结构继承了感知物理对象的属性,(ii)其时间演化受反映大规模神经活动内在属性的规律支配。云函数的控制方程基于具有多项式非线性和神经模式振荡全局相移不变性的神经场模型。其结构可解释为带有非线性非厄米哈密顿量的薛定谔型方程,并补充了类似于Lotka-Volterra模型的项。所提出的方法应用于决策中的改变主意现象,即初始选择在执行过程中可能被修正。改变主意被解释为快速前意识感官处理与较慢的有意识比较备选方案之间的相互作用,这与持续决策后证据积累的神经生理学证据一致。还讨论了纳入云函数自相互作用的必要性。
We develop a novel cloud-function formalism describing the dynamical relationship between sensory-information processing in large-scale brain networks (supraliminal processing) and the content of the mental representation of an observed object. The formalism combines elements of neural field theory for large-scale neural activity with the spatial characteristics of perceived objects and their embedding in the environment from the first-person perspective. The cloud function is characterized by two key features: (i) its spatial structure inherits properties of the perceived physical object, and (ii) its temporal evolution is governed by regularities reflecting intrinsic properties of large-scale neural activity. The governing equation for the cloud function is based on a neural-field model with polynomial nonlinearities and global phase-shift invariance of neural-pattern oscillations. Its structure may be interpreted as a Schrodinger-type equation with a nonlinear non-Hermitian Hamiltonian supplemented by terms analogous to those of the Lotka-Volterra model. The proposed approach is applied to the change-of-mind phenomenon in decision-making, in which an initial choice may be revised during its execution. Changes of mind are explained as arising from the interplay between fast preconscious sensory processing and slower conscious comparison of alternatives, consistent with neurophysiological evidence for continuous post-decisional evidence accumulation. The necessity of incorporating cloud-function self-interaction is also discussed.
在广度、深度和时间上生长神经网络
Eivinas Butkus, Kedar Garzón Gupta, Nikolaus Kriegeskorte
AI总结 提出在循环卷积神经网络中定义广度、深度和时间的可微成本,通过反向传播联合优化任务误差和资源成本,发现三者可相互权衡,且模型使用的时间与人类反应时间相关。
空间和时间资源约束对生物和人工智能系统都至关重要。在这里,我们在一个被构想为无限格点有限子集的循环卷积神经网络中,定义了广度、深度和时间的可微成本项。我们通过反向传播将这些成本与任务误差联合优化。我们对广度、深度和时间施加不同的压力,导致通过训练有机地出现多样化的计算图。我们发现所有三种资源可以相互权衡以达到给定的准确度水平。网络在所有三个维度上随任务复杂性增长,并且在输入被遮挡时自发地采取更多的循环步骤。令人惊讶的是,模型使用的时间与人类在物体识别任务中的反应时间相关。我们的框架提供了资源约束如何塑造神经架构的规范性解释,与神经科学中关于大脑设计的问题相联系,并可能有助于阐明自然界中发现的神经解决方案的多样性。
Spatial and temporal resource constraints are critical for both biological and artificial intelligent systems. Here we define differentiable cost terms for breadth, depth, and time within a recurrent convolutional neural network conceived as a finite subset of an infinite lattice. We optimize these costs jointly with task errors via backpropagation. We set different pressures on breadth, depth, and time, which leads to diverse computational graphs emerging organically through training. We find that all three resources can be traded off against each other to achieve a given level of accuracy. Networks grow in all three dimensions with task complexity and spontaneously take more recurrent steps when inputs are occluded. Surprisingly, time used by the model correlates with human reaction times in an object recognition task. Our framework provides a normative account of how resource constraints shape neural architectures, connecting to questions about brain design in neuroscience, and may help illuminate the diversity of neural solutions found in nature.
自动化多数据集微升级INST $^{13}$C代谢通量分析揭示谷氨酸棒杆菌中稳健的通量但可变的代谢物池
Jochen Nießer, Anton Stratmann, Martin Beyß, Wolfgang Wiechert, Katharina Nöh, Stephan Noack
AI总结 本文提出一种微型化、自动化的多数据集INST $^{13}$C-MFA工作流程,在微升级别实现并行通量分析,揭示了谷氨酸棒杆菌在乙醇底物上生长时通量稳健但代谢物池大小可变。
同位素非稳态代谢通量分析(INST $^{13}$C-MFA)为细胞生理学提供了独特的见解,但通常受限于低通量和高实验成本。本文提出了一种微型化和自动化的工作流程,将瞬态同位素标记实验与先进的计算建模相结合,能够在微升级别实现并行INST $^{13}$C-MFA。该方法在一株进化的谷氨酸棒杆菌菌株上进行了验证,该菌株能够在乙醇上高效生长,而由于标记多样性低,同位素稳态$^{13}$C-MFA在此底物上固有地受限。利用机器人液体处理、快速热异丙醇淬灭和基于LC-QToF-MS的分析,从使用不同乙醇示踪剂的并行48孔实验中生成了高度信息化的数据集。多数据集INST $^{13}$C-MFA实现了细胞内通量和代谢物池大小的联合估计,并显著提高了通量精度,优于单数据集分析。虽然净通量在各数据集中稳健,但池大小估计表现出变异性,且在联合推断下不收敛,这突显了与单数据集INST $^{13}$C-MFA的根本性方法差异。由此产生的多数据集通量图揭示了在乙醇生长期间乙醛酸支路的核心作用,这与对C2底物利用的代谢适应一致。总体而言,本工作表明自动化多数据集INST $^{13}$C-MFA在技术上是可行的,并且以传统实验室规模生物反应器方法的一小部分成本提供了高质量的通量分析。所提出的工作流程为微生物生物技术中的高通量定量通量组学建立了一个可扩展的框架,并支持整合到迭代菌株工程和生物铸造厂流程中。
Isotopically non-stationary metabolic flux analysis (INST $^{13}$C-MFA) provides unique insights into cellular physiology but is typically limited by low throughput and high experimental costs. Here, we present a miniaturized and automated workflow that integrates transient isotope labeling experiments with advanced computational modeling to enable parallel INST $^{13}$C-MFA at microliter scale. The approach is demonstrated for an evolved $Corynebacterium~glutamicum$ strain capable of efficient growth on ethanol, a substrate for which isotopically stationary $^{13}$C-MFA is inherently limited due to low labeling diversity. Using robotic liquid handling, rapid hot isopropanol quenching, and LC-QToF-MS-based analytics, highly informative datasets were generated from parallel 48-well experiments with different ethanol tracers. Multi-dataset INST $^{13}$C-MFA unlocked joint estimation of intracellular fluxes and metabolite pool sizes and significantly improved flux precision compared to single-dataset analyses. While net fluxes were robust across datasets, pool size estimates exhibited variability and did not converge under joint inference, highlighting a fundamental methodological difference to single-dataset INST $^{13}$C-MFA. The resulting multi-dataset flux map reveals a central role of the glyoxylate shunt during growth on ethanol, consistent with metabolic adaption to C2-based substrate utilization. Overall, this work demonstrates that automated multi-dataset INST $^{13}$C-MFA is technically feasible and provides high-quality flux analysis at a fraction of the cost of conventional lab-scale bioreactor-based approaches. The presented workflow establishes a scalable framework for high-throughput quantitative fluxomics in microbial biotechnology and supports integration into iterative strain engineering and biofoundry pipelines.
具有分块缺失值的多模态堆叠及其在预测免疫治疗耐药性的PIONeeR生物标志物研究中的应用
Mohamed Boussena, Florence Monville, Jacques Fieschi-Meric, Frederic Vely, Pierre Milpied, Julien Mazieres, Maurice Perol, Eric Vivier, Laurent Greillier, Fabrice Barlesi, Sebastien Benzekry
AI总结 提出多模态堆叠框架MSB,通过独立建模各模态特征并利用交叉验证堆叠元学习器聚合预测,解决高维和分块缺失问题,在PIONeeR研究中预测非小细胞肺癌免疫治疗无进展生存期,性能优于基线算法。
在临床肿瘤学中,整合多模态数据集常受到高维性和分块缺失的阻碍,即特定患者子集无法获得完整数据源。标准生存模型通常难以处理这些缺失,导致结果偏倚或患者排除。我们提出具有分块缺失值的多模态堆叠(MSB),一种用于生存分析的晚期融合框架,它独立建模模态特定特征,然后通过交叉验证的堆叠元学习器聚合预测。MSB在PIONeeR研究(n=443名患者,来自八个异质来源的378个生物标志物)中进行了验证,以预测接受免疫治疗的晚期非小细胞肺癌患者的无进展生存期。MSB产生了比基线算法更高的预测性能(C-index)。改进幅度因基线强度而异:线性模型提高了15.9%(Wilcoxon符号秩检验p<0.001),随机生存森林提高了5.4%(p=0.002),梯度提升方法提高了2.1%(p=0.030)。除了区分能力外,MSB还缩小了泛化差距(5折交叉验证重复3次的训练-测试差异:0.055 vs 线性模型的0.380)。置换重要性分析确定了常规实验室标志物、临床特征和PD-L1表达为主要预测驱动因素。缺失块指示器的重要性可忽略,表明模型从生物标志物值而非数据可用性模式中学习。MSB为具有分块缺失的多模态生存预测提供了一个统计验证的框架。通过无需完整数据即可进行系统性生物标志物评估,MSB为生物医学研究中的预测建模提供了实用工具,有待外部验证。实现代码可在https://github.com/MohamedBoussena/MSB 根据Inria许可证获取。
Integrating multimodal datasets in clinical oncology is frequently hindered by high dimensionality and blockwise missingness, where entire data sources are unavailable for specific patient subsets. Standard survival models often struggle with these gaps, leading to biased results or patient exclusion. We introduce Multimodality Stacking with Blockwise missing values (MSB), a late-fusion framework for survival analysis that independently models modality-specific features before aggregating predictions via a cross-validated stacking meta-learner. MSB was validated on the PIONeeR study (n=443 patients, 378 biomarkers across eight heterogeneous sources) to predict progression-free survival in advanced non-small cell lung cancer patients receiving immunotherapy. MSB yielded higher predictive performance (C-index) than baseline algorithms. Improvements varied by baseline strength: linear models showed a 15.9% increase (p<0.001 for the Wilcoxon signed-rank test), random survival forests gained 5.4% (p=0.002), and gradient boosting methods improved by 2.1% (p=0.030). Beyond discrimination, MSB reduced the generalization gap (train-test difference in 5 folds cross-validation repeated 3 times: 0.055 vs 0.380 for linear models). Permutation importance analysis identified routine laboratory markers, clinical features, and PD-L1 expression as primary predictive drivers. Missing block indicators showed negligible importance, suggesting the model learned from biomarker values rather than data availability patterns. MSB provides a statistically validated framework for multimodal survival prediction with blockwise missingness. By enabling systematic biomarker evaluation without requiring complete data, MSB offers a practical tool for predictive modeling in biomedical research, pending external validation. Implementation is available at https://github.com/MohamedBoussena/MSB under Inria license.
解释、学习与共情作为单一约束:具有可问责弃权的残差充分性架构
Chainarong Amornbunchornvej
AI总结 提出一种认知架构,通过单一残差量统一处理解释、学习和共情,当情境超出表征能力时产生带类型和见证的弃权。
一个智能体必须对当前情境采取行动,学习它尚无法表征的内容,并充分建模其他智能体以进行协调。这些能力通常由独立的机制实现,但它们共享一种失败模式:情境可能超出智能体当前能表征的范围,此时诚实的回应是原则性的拒绝,并说明缺失了什么。我们开发了一个小型认知架构,其中这些限制源于单一量。一个解释-决策单元(IDU)通过一组体制(具有私有基的局部表征框架)解释内容向量,并决定其许可哪些行动;内容相对于活跃体制表征范围的标量残差驱动该单元。低残差且许可清晰时发出行动;否则单元重新解释、尝试描述长度合理的扩展,或停止并给出带类型和见证的终止。我们证明该单元是总且确定性的:对于任何内容和固定配置,它在有限有界步数内停止,并带有唯一终止见证,因此弃权由构造携带其原因。通过绑定架构的开放参数而不改变其机制,相同的残差-范围约束在三个范围上恢复了三个有记录的现象:不知的类型学(类型化弃权);智能体之间的强制误解,局限于一个共享概念且对犯错的智能体不可见(有界共情);以及学习中的先决条件依赖,源于有界关注窗口而非假设(发展先决条件)。每个实例化都针对自然智能体和人工智能体进行了阐述,并提出了可证伪的预测,因此一个约束可以模拟人类和机器认知中的限制。该工作提供了一种统一和一种可问责弃权的概念,通过构造带有类型和见证。
An agent must act on the situation before it, learn what it cannot yet represent, and model other agents well enough to coordinate. These faculties are usually realized by separate mechanisms, yet they share a failure mode: the situation can exceed what the agent can currently represent, and the honest response is then a principled refusal that says what was missing. We develop a small cognitive architecture in which these limits arise from a single quantity. An Interpretation-Decision Unit (IDU) interprets a content vector through a family of regimes - local representational frames with private bases - and decides which actions it licenses; a scalar residual of the content against the active regimes' representational scope drives the unit. Low residual with a clean licensing emits an action; otherwise the unit re-interprets, attempts a description-length-justified expansion, or halts with a typed, witnessed terminal. We prove the unit is total and deterministic: for any content and fixed configuration it halts in finitely many bounded-cost steps with a unique terminal witness, so abstention carries its cause by construction. By binding the architecture's open parameters without changing its mechanics, the same residual-against-scope constraint recovers three documented phenomena at three scopes: the typology of not-knowing (typed abstention); a forced misunderstanding between agents, localized to one shared concept and invisible to the agent committing it (bounded empathy); and prerequisite dependence in learning derived from a bounded focus window rather than posited (developmental prerequisites). Each instantiation is worked for a natural and an artificial agent and states a falsifiable prediction, so one constraint can model limits in both human and machine cognition. The account contributes a unification and a notion of accountable abstention, typed and witnessed by construction.
可解释多任务视网膜成像揭示2型糖尿病系统性风险分层的微血管信号:一项初步研究
Mini Han Wang, Liting Huang, Wei Hong, Boonthawan Wingwon
AI总结 本研究开发了一个可解释的多任务深度学习框架,通过分析视网膜微血管特征与系统性异常(如肾脏异常)的关联,验证了视网膜成像作为糖尿病系统性风险分层生物标志物的潜力。
视网膜成像提供了进入系统性微血管健康的非侵入性窗口,并已成为系统性疾病的潜在生物标志物。然而,视网膜特征是否编码了生物学上有意义的系统性信号,并且可以使用可解释人工智能(XAI)可靠地解释,仍不清楚。我们开发了一个可解释的多任务深度学习框架,以研究视网膜微血管特征与2型糖尿病系统性异常之间的关联。使用共享神经网络和针对血糖状态、肾脏异常和多系统参与的任务特定头部,分析了来自2,719名个体的11,011张眼底图像。使用梯度加权类激活映射(Grad-CAM)、解剖掩膜和血管对齐分析评估模型可解释性。该框架展示了任务依赖的预测性能,对肾脏异常的最佳区分度(AUC高达0.63),而血糖状态预测性能有限(AUC = 0.49-0.61)。可解释性分析一致地将模型注意力定位到视网膜血管和视盘周围区域。掩膜实验表明,遮挡血管区域导致性能下降最大,表明视网膜血管是主要的预测来源。不同架构表现出异质的注意力模式,提示存在多种系统性信号编码的表征路径。这项初步研究表明,视网膜微血管特征包含与系统性异常(尤其是微血管损伤)相关的可测量信号。通过将多任务学习与定量XAI验证相结合,该框架推动视网膜成像向用于糖尿病系统性风险分层的可解释数字生物标志物发展。
Retinal imaging provides a non-invasive window into systemic microvascular health and has emerged as a potential biomarker for systemic diseases. However, whether retinal features encode biologically meaningful systemic signals that can be reliably interpreted using explainable artificial intelligence (XAI) remains unclear. An explainable multi-task deep learning framework was developed to investigate associations between retinal microvascular features and systemic abnormalities in Type 2 Diabetes Mellitus. A total of 11,011 fundus images from 2,719 individuals were analysed using a shared neural network with task-specific heads for glycaemic status, kidney abnormality, and multi-system involvement. Model interpretability was evaluated using Gradient-weighted Class Activation Mapping (Grad-CAM), anatomical masking, and vessel alignment analysis. The framework demonstrated task-dependent predictive performance, with the best discrimination observed for kidney abnormality (AUC up to 0.63), whereas glycaemic status prediction showed limited performance (AUC = 0.49-0.61). Explainability analyses consistently localized model attention to retinal vessels and peripapillary regions. Masking experiments showed that occlusion of vascular regions caused the greatest performance decline, indicating that retinal vessels were the primary predictive source. Different architectures exhibited heterogeneous attention patterns, suggesting multiple representational pathways for systemic signal encoding. This pilot study demonstrates that retinal microvascular features contain measurable signals associated with systemic abnormalities, particularly microvascular damage. By integrating multi-task learning with quantitative XAI validation, this framework advances retinal imaging toward interpretable digital biomarkers for systemic risk stratification in diabetes.
可解释的视网膜成像用于预测2型糖尿病多器官功能障碍
Mini Han Wang, Liting Huang, Wei Hong, Boonthawan Wingwon
AI总结 本研究利用常规实验室生物标志物构建系统级异常指数,通过梯度提升模型预测2型糖尿病多系统失调,并采用SHAP实现可解释性,揭示了高血糖、肾功能障碍、血脂异常和炎症是主要驱动因素。
背景:2型糖尿病(T2DM)日益被认为是一种以代谢、肾脏、脂质和炎症通路协调功能障碍为特征的系统性疾病。现有的临床评估往往无法捕捉这种多维度负担。方法:我们对1,195名患者进行了回顾性研究,使用了常规收集的实验室生物标志物。构建了系统级异常指数以量化器官特异性功能障碍,并将多系统受累定义为两个或以上系统异常。训练了包括逻辑回归、随机森林和梯度提升在内的监督机器学习模型来预测多系统失调。使用SHapley Additive exPlanations(SHAP)实现模型可解释性。结果:梯度提升模型表现出近乎完美的区分能力(AUC = 1.000),显著优于逻辑回归(AUC = 0.925)。特征归因分析显示,高血糖、肾功能障碍、血脂异常和炎症是多系统风险的主要驱动因素。部分依赖分析中观察到的剂量-反应关系进一步支持了模型预测的生物学合理性。结论:本研究提出了一个可解释的、数据驱动的框架,用于量化T2DM的系统性疾病负担。通过将常规生物标志物与多器官功能障碍联系起来,我们的方法提供了预测准确性和机制洞察,为糖尿病护理中的风险分层和精准医学提供了潜力。本研究中使用的数据和代码可在GitHub上公开获取:https://github.com/MiniHanWang/Type-2-Diabetes-1.git
Background: Type 2 diabetes mellitus (T2DM) is increasingly recognised as a systemic disease characterised by coordinated dysfunction across metabolic, renal, lipid, and inflammatory pathways. Existing clinical assessments often fail to capture this multi-dimensional burden. Methods: We conducted a retrospective study of 1,195 patients using routinely collected laboratory biomarkers. System-level abnormality indices were constructed to quantify organ-specific dysfunction, and multi-system involvement was defined as abnormalities in two or more systems. Supervised machine learning models, including logistic regression, random forest, and gradient boosting, were trained to predict multi-system dysregulation. Model interpretability was achieved using SHapley Additive exPlanations (SHAP). Results: The gradient boosting model demonstrated near-perfect discrimination (AUC = 1.000), significantly outperforming logistic regression (AUC = 0.925). Feature attribution analysis revealed that hyperglycaemia, renal impairment, dyslipidaemia, and inflammation were the dominant drivers of multi-system risk. Dose-response relationships observed in partial dependence analyses further supported the biological plausibility of model predictions. Conclusion: This study presents an interpretable, data-driven framework for quantifying systemic disease burden in T2DM. By linking routine biomarkers to multi-organ dysfunction, our approach provides both predictive accuracy and mechanistic insight, offering potential for improved risk stratification and precision medicine in diabetes care. The data and code used in this study are openly available on GitHub at: https://github.com/MiniHanWang/Type-2-Diabetes-1.git
MetaboKG:面向非靶向代谢组学的分析中心知识图谱框架
Matthieu Féraud, Dina Boukhajou, Fabien Gandon, Louis-Félix Nothias
AI总结 提出MetaboKG框架,通过转换工作流、语义模型和通用注释标识符策略,整合公共存储库中的代谢组学数据,支持可追溯的注释复用和可重复的SPARQL查询。
非靶向代谢组学产生大量串联质谱(MS/MS)数据和计算注释,可揭示跨生物体和环境的分子机制。通过协调的存储库元数据和访问基础设施(如Pan-ReDU)以及代谢组学知识图谱(如ENPKG和METRIN-KG),公共复用得到了改善。然而,分析层仍然碎片化:谱图、特征、工作流输出、注释、置信度证据和上下文元数据仍然分散在存储库和表格制品中。我们提出MetaboKG,一个面向分析的知识图谱框架,用于从公共存储库、元数据和GNPS分子网络结果中工程化可复用的代谢组学知识。MetaboKG贡献了一个转换工作流,保留了存储库导出、分析文件、谱图、特征和注释结果之间的链接;一个基于PROV-O和SIO并与质谱本体(MS)、ChEBI、NCBITaxon、ENVO和NCIT对齐的语义模型,用于表示来源、分析证据、元数据属性和受控词汇术语;以及一个通用注释标识符策略,该策略扩展了通用谱标识符(USI),包含工作流特定组件,用于延迟绑定、增量摄取和分析间的后期链接。我们在680个GNPS分子网络结果的公共存储库规模上展示了MetaboKG,并通过涵盖生化富集、环境特异性和跨仪器分析变异的能力问题进行了评估。结果表明,基于图的集成支持可追溯的注释复用和可重复的SPARQL探索,这些生化关系在存储库原生资源中仍然是碎片化的。
Untargeted metabolomics generates large volumes of tandem mass spectrometry (MS/MS) data and computational annotations that can reveal molecular mechanisms across organisms and environments. Public reuse has improved through harmonized repository metadata and access infrastructures such as Pan-ReDU, and through metabolomics knowledge graphs such as ENPKG and METRIN-KG. Yet the analytical layer remains fragmented: spectra, features, workflow outputs, annotations, confidence evidence, and contextual metadata are still scattered across repositories and tabular artifacts. We present MetaboKG, an analysis-centric knowledge graph framework for engineering reusable metabolomics knowledge from public repositories, metadata, and GNPS molecular network results. MetaboKG contributes a transformation workflow that preserves links between repository exports, analytical files, spectra, features, and annotation results; a semantic model grounded in PROV-O and SIO and aligned with the Mass Spectrometry ontology (MS), ChEBI, NCBITaxon, ENVO, and NCIT to represent provenance, analytical evidence, metadata attributes, and controlled vocabulary terms; and a Universal Annotation Identifier strategy extending the Universal Spectrum Identifier (USI) with workflow-specific components for late binding, incremental ingestion, and post hoc linkage across analyses. We demonstrate MetaboKG at the public-repository scale on 680 GNPS molecular networking results and evaluate it through competency questions covering biochemical enrichment, environmental specificity, and cross instrument analytical variation. Results show that graph-based integration supports traceable annotation reuse and reproducible SPARQL exploration of biochemical relationships that remain fragmented across repository-native resources.
从自然语言训练的后继表示中自发涌现的词类表示
Mathis Immertreu, Achim Schilling, Thomas Kinfe, Patrick Krauss
AI总结 本研究将强化学习中的后继表示(SR)框架应用于自然语言,通过训练神经网络预测未来词分布,发现无监督下词类(如名词、动词、形容词)的几何结构自发涌现,且预测时域影响结构层次。
语言模型通常被训练来预测序列中的下一个词。这里,我们探索来自强化学习的另一种预测原则:后继表示(SR),它建模未来状态的期望折扣分布,而不是直接的下一个状态。我们将这一框架迁移到自然语言,并训练神经网络在多个时间视界上预测未来词分布,从而学习长程转移结构的表示。我们在WikiText-103(1.03亿词;2万词词汇)上训练深度残差神经网络,并使用KL散度将后继表示优化为概率分布。在没有显式语言监督的情况下,结构化语言表示自发涌现。训练后,学习到的空间相对于词性(POS)类别发展出清晰的几何组织:名词、动词和形容词变得可分离,并通过无监督聚类恢复。这种组织系统地依赖于预测视界:短视界产生最强的句法结构,而长视界逐渐整合更广泛的上下文和语义信息。在更精细的分辨率下,额外的可解释词汇子结构出现,揭示了主要词类内的连贯子类。这些发现表明,句法类别无需显式编码,而可能作为预测序列学习的结果出现。据我们所知,这项工作首次将后继表示系统应用于自然语言,并在强化学习、语言学和认知神经科学之间建立了概念桥梁。
Language models are typically trained to predict the next token in a sequence. Here, we explore an alternative predictive principle from reinforcement learning: Successor Representations (SRs), which model the expected discounted distribution of future states rather than the immediate next state. We transfer this framework to natural language and train neural networks to predict future word distributions across multiple temporal horizons, thereby learning representations of long-range transition structure. We train a deep residual neural network on WikiText-103 (103 million tokens; 20,000-word vocabulary) and optimize successor representations as probability distributions using KL divergence. Without explicit linguistic supervision, structured language representations emerge spontaneously. After training, the learned space develops a clear geometric organization with respect to part-of-speech (POS) categories: nouns, verbs, and adjectives become separable and recoverable through unsupervised clustering. This organization depends systematically on predictive horizon, with short horizons producing the strongest syntactic structure and longer horizons increasingly integrating broader contextual and semantic information. At finer resolutions, additional interpretable lexical substructure emerges, revealing coherent subclasses within major word categories. These findings suggest that syntactic categories need not be explicitly encoded but may arise as a consequence of predictive sequence learning. To our knowledge, this work provides the first systematic application of successor representations to natural language and establishes a conceptual bridge between reinforcement learning, linguistics, and cognitive neuroscience.
空间约束与边界条件对生物手性的影响:一项模拟研究
Arturo Tozzi
AI总结 通过反应扩散模拟,研究非线性立体化学放大、随机涨落和边界依赖空间约束对手性涌现的影响,发现空间耦合和有限边界可调控手性域形成与立体化学组织。
生物系统表现出显著的分子不对称性,蛋白质主要基于L-氨基酸,核酸和碳水化合物主要由D-糖组成。对手性纯度的解释包括不对称光化学、自催化放大、随机对称性破缺和矿物表面立体选择性,但这些机制仅部分解决了有限几何和集体空间相互作用对立体化学稳定性的影响。受凝聚态物理学最新进展的启发,我们研究了相干手性是否可以从非线性立体化学放大、随机涨落和边界依赖空间约束之间的相互作用中涌现。我们开发了一个反应扩散模拟,其中局部立体化学群体在有限二维域内,在空间耦合和弱几何偏置场下演化。我们的模型结合了双稳态自催化动力学、最近邻相互作用和抑制局部不一致的立体化学构型,以量化在不同相互作用强度和涨落幅度下对映体过量、同手性邻居一致性和径向立体化学组织的时间演化。我们的结果显示手性域逐渐形成、相反手性区域的分离以及几何依赖的局部立体化学组织调制。空间耦合增加了局部相干性并改变了混合立体化学态的持久性,而有限边界影响了分子群体的径向组织和各向异性稳定化。潜在应用包括几何控制的不对称合成、受限立体选择性催化系统、自适应手性材料以及微结构反应环境中异质立体化学分布的表征。
Biological systems exhibit marked molecular asymmetry, with proteins based predominantly on L-amino acids and nucleic acids and carbohydrates largely composed of D-sugars. Explanations for homochirality include asymmetric photochemistry, autocatalytic amplification, stochastic symmetry breaking and mineral-surface stereoselectivity, but these mechanisms only partially address the influence of finite geometry and collective spatial interactions on stereochemical stabilization. Inspired by recent developments in condensed-matter physics, we investigated whether coherent chirality could emerge from the interplay among nonlinear stereochemical amplification, stochastic fluctuations and boundary-dependent spatial constraints. We developed a reaction-diffusion simulation in which local stereochemical populations evolved within finite two-dimensional domains under spatial coupling and weak geometrical bias fields. Our model combined bistable autocatalytic dynamics, nearest-neighbor interactions and suppression of locally inconsistent stereochemical configurations in order to quantify temporal evolution of enantiomeric excess, same-handed neighbor agreement and radial stereochemical organization under varying interaction strengths and fluctuation amplitudes. Our results showed progressive formation of chiral domains, segregation of opposite-handed regions and geometry-dependent modulation of local stereochemical organization. Spatial coupling increased local coherence and modified persistence of mixed stereochemical states, while finite boundaries influenced radial organization and anisotropic stabilization of molecular populations. Potential applications include geometrically controlled asymmetric synthesis, confined stereoselective catalytic systems, adaptive chiral materials and characterization of heterogeneous stereochemical distributions in microstructured reaction environments.
多模态机器学习用于群体和个体特异性lncRNA-2型糖尿病关联分析
Ashwani Siwach, Sanjeev Narayan Sharma, Sunil Datt Sharma
AI总结 本研究通过整合表达、二级结构和序列特征的多模态机器学习框架,在独立队列中识别与2型糖尿病相关的lncRNA,并利用SHAP分析实现群体和个体水平的关联解释。
长链非编码RNA(lncRNA)是参与慢性疾病(包括2型糖尿病)发病机制的新兴调控分子。我们研究了文献中报道的与2型糖尿病相关的十种lncRNA:MALAT1、MEG3、MIAT、ANRIL、GAS5、KCNQ1OT1、H19、BCYRN1、XIST和HOTAIR,在两个独立的人群RNA-seq队列中进行了分析。单组学方法提供了疾病生物学的不完整视图,因此开发了一个整合多特征框架,提取每种lncRNA的表达、二级结构和序列特征。在分层k折交叉验证、留一法交叉验证和重复留出法方案下评估了八种机器学习分类器,以确保稳健的性能估计。应用SHAP分析进行个体水平的关联解释。在一个队列中,发现GAS5和XIST的表达特征以及GAS5、MEG3和ANRIL的序列特征与2型糖尿病相关,而在第二个队列中,发现MALAT1的表达特征以及KCNQ1OT1、ANRIL和MEG3的序列特征与2型糖尿病相关。SHAP将MEG3识别为两个队列中的主要lncRNA。机器学习结果与已建立的统计方法一致,同时额外提供了与特定分子特征类型相关的群体和个体水平疾病关联谱。所提出的框架增进了对2型糖尿病机制的理解,并支持基于lncRNA的精准医学。
Long non-coding RNAs (lncRNAs) are emerging regulatory molecules implicated in chronic disease pathogenesis, including Type 2 Diabetes Mellitus (T2D). We investigated ten literature reported lncRNAs associated with T2D: MALAT1, MEG3, MIAT, ANRIL, GAS5, KCNQ1OT1, H19, BCYRN1, XIST, and HOTAIR across two independent population-based RNA-seq cohorts. Single-omics approaches provide an incomplete view of disease biology, therefore, an integrative multi-feature framework was developed, extracting expression, secondary-structure, and sequence features for each lncRNA. Eight machine learning (ML) classifiers were evaluated under stratified k-fold, leave-one-out cross-validation (LOOCV), and repeated hold-out schemes to ensure robust performance estimation. SHAP analysis was applied for subject-level association interpretation. In one cohort, GAS5 and XIST expression features, along with GAS5, MEG3, and ANRIL sequence features, were found to be associated with T2D, while MALAT1 expression and KCNQ1OT1, ANRIL, and MEG3 sequence features were found to be associated in the second cohort. MEG3 was identified by SHAP as the dominant lncRNA in both cohorts. ML results were consistent with established statistical methods while additionally providing population- and subject-level disease association profiles linked to specific molecular feature types. The proposed framework advances mechanistic understanding of T2D and supports lncRNA-based precision medicine.
受生命启发的机器智能自举:从化学到认知与创造力的生物学路径
Giovanni Pezzulo, Michael Levin
AI总结 本文提出一种受生命启发的机器智能方法,通过从生物学中提炼出五个设计原则(多尺度自主性、通过活性组件自组装实现生长、能力的持续重建、利用物理和具身约束、以及实现自组织和目标导向自上而下控制的普遍信号传递),旨在构建更具鲁棒性、自主性和开放问题解决能力的人工系统。
实现高级机器智能仍然是人工智能研究中的一个核心挑战,通常通过扩展神经架构和生成模型来解决。然而,生物系统提供了更广泛的适应性、目标导向行为策略——这些策略在神经系统进化之前就已经出现。本文倡导一种真正受生命启发的机器智能方法,借鉴生物学中的原理,使系统能够跨尺度实现鲁棒性、自主性和开放式问题解决。我们遵循威廉·詹姆斯的思想,将智能定义为灵活的问题解决,并发展出“认知光锥”的概念来描述生命系统和机器中智能的连续体。我们认为,生物进化发现了一种可扩展的智能配方——以及生物体“认知光锥”、预测和控制能力的逐步扩展。为了解释这是如何可能的,我们提炼出五个设计原则——多尺度自主性、通过活性组件自组装实现生长、能力的持续重建、利用物理和具身约束、以及实现自组织和目标导向自上而下控制的普遍信号传递——这些原则支撑了生命在创造性多样的问题空间中导航的能力。我们讨论了这些原则与当前人工智能范式的对比,并概述了将它们整合到未来自主、具身和弹性人工系统中的路径。
Achieving advanced machine intelligence remains a central challenge in AI research, often approached through scaling neural architectures and generative models. However, biological systems offer a broader repertoire of strategies for adaptive, goal-directed behavior - strategies that emerged long before nervous systems evolved. This paper advocates a genuinely life-inspired approach to machine intelligence, drawing on principles from biology that enable robustness, autonomy, and open-ended problem-solving across scales. We frame intelligence as flexible problem-solving, following William James, and develop the concept of "cognitive light cones" to characterize the continuum of intelligence in living systems and machines. We argue that biological evolution has discovered a scalable recipe for intelligence - and the progressive expansion of organisms' "cognitive light cone", predictive and control capacities. To explain how this is possible, we distill five design principles - multiscale autonomy, growth through self-assemblage of active components, continuous reconstruction of capabilities, exploitation of physical and embodied constraints, and pervasive signaling enabling self-organization and top-down control from goals - that underpin life's ability to navigate creatively diverse problem spaces. We discuss how these principles contrast with current AI paradigms and outline pathways for integrating them into future autonomous, embodied, and resilient artificial systems.
微调语言模型使其了解自身所知
Sangjun Park, Elliot Meyerson, Xin Qiu, Risto Miikkulainen
AI总结 本文提出一种框架,通过进化策略对齐方法(ESMA)在控制偏差的同时提升大语言模型的元认知能力,并在未见数据集、语言和新知识上展现出鲁棒泛化性。
评估大语言模型(LLMs)的真实元认知能力因偏差和启发式方法而困难。本文提出一个框架,在控制这些偏差的同时测量和增强LLM的元认知能力。建立了使用$d'_{\rm type2}$指标的测量方法以隔离元认知能力。提出了元认知对齐进化策略(ESMA),在未见数据集、语言和新获取的知识上展现出鲁棒泛化性。最后,参数分析表明这些改进由一组稀疏参数驱动,为定向元认知优化提供了新途径。
Evaluating true metacognition in Large Language Models (LLMs) is difficult due to biases and heuristics. This paper presents a framework to measure and enhance LLM metacognition while controlling for these biases. A measurement method using the $d'_{\rm type2}$ metric is established to isolate metacognitive ability. The Evolution Strategy for Metacognitive Alignment (ESMA) is proposed, demonstrating robust generalization across unseen datasets, languages, and newly acquired knowledge. Finally, parameter analysis reveals that these improvements are driven by a sparse set of parameters, offering new pathways for targeted metacognitive optimization.