arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.10876 2026-05-12 cs.LG cs.AI q-bio.QM

AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Edward De Brouwer, Carl Edwards, Alexander Wu, Jenna Collier, Graham Heimberg, Xiner Li, Meena Subramaniam, Ehsan Hajiramezanali, David Richmond, Jan-Christian Hütter, Sara Mostafavi, Gabriele Scalia

AI总结 本文提出 AssayBench,一个用于评估大语言模型和智能体在虚拟细胞表型筛选任务中表现的基准数据集,涵盖1920个公开的CRISPR筛选实验,涉及五类细胞表型。研究将表型筛选任务转化为基因排序预测问题,并引入调整后的nDCG指标以衡量不同实验间的模型性能。实验表明,现有的方法与经验估计的性能上限仍有较大差距,零样本通用大语言模型在该任务中表现优于专门的生物语言模型和可训练基线模型。

Comments 22 pages

详情
英文摘要

Recent advances in machine learning and large-scale biological data collections have revived the prospect of building a virtual cell, a computational model of cellular behavior that could accelerate biological discovery. One of the most compelling promises of this vision is the ability to perform in silico phenotypic screens, in which a model predicts the effects of cellular perturbations in unseen biological contexts. This task combines heterogeneous textual inputs with diverse phenotypic outputs, making it particularly well-suited to LLMs and agentic systems. Yet, no standard benchmark currently exists for this task, as existing efforts focus on narrower molecular readouts that are only indirectly aligned with the phenotypic endpoints driving many real-world drug discovery workflows. In this work, we present AssayBench, a benchmark for phenotypic screen prediction, built from 1,920 publicly available CRISPR screens spanning five broad classes of cellular phenotypes. We formulate the screen prediction task as a gene rank prediction for each screen and introduce the adjusted nDCG, a continuous metric for comparing performance across heterogeneous assays. Our extensive evaluation shows that existing methods remain far from empirically estimated performance ceilings and zero-shot generalist LLMs outperform biology-specific LLMs and trainable baselines. Optimization techniques such as fine-tuning, ensembling, and prompt optimization can further improve LLM performance on this task. Overall, AssayBench offers a practical testbed for measuring progress toward in silico phenotypic screening and, more broadly, virtual cell models.

2605.10644 2026-05-12 cond-mat.stat-mech q-bio.PE

Susceptible-Infected-Susceptible Model with Mitigation on Scale-Free Networks

João Gabriel Simões Delboni, M. O. Hase

AI总结 本文研究了在无标度网络上具有缓解机制的易感-感染-易感疾病传播模型。通过引入异质平均场方法和缓解因子,考虑了个体异质性和行为响应对传播的影响,从而在传播过程中引入非线性饱和效应。研究发现,缓解机制改变了疾病传播的动态特性,使得网络中感染概率和流行程度对网络度指数的依赖关系发生反转,为理解复杂网络中的传染病控制提供了新的视角。

详情
英文摘要

We investigate infectious disease spreading on scale-free networks using a heterogeneous mean-field approach applied to the susceptible-infected-susceptible model, incorporating a mitigation factor. Individual heterogeneity is incorporated through a power-law distribution, while a mitigation factor accounts for behavioral responses and external effects that effectively reduce transmission from infected individuals. This mechanism, inspired by Malthus-Verhulst-type constraints, introduces a nonlinear saturation effect that encodes self-limiting dynamics in a tractable way. Analytical results are supported by stochastic simulations. We find that the mitigation factor induces a nontrivial behavior in the probability that a link points to an infected node, which develops a maximum at finite infection rates. In contrast, the overall prevalence remains a monotonically increasing function of the transmission rate. Additionally, the mitigation mechanism leads to an inversion in the dependence of epidemic observables on the degree exponent at sufficiently high transmission rates. While in the standard model smaller exponents yield higher endemic prevalence, in the modified model this trend reverses, with larger exponents producing higher prevalence and increased infection probability along network links.

2605.10617 2026-05-12 math.PR q-bio.PE

Logarithmic scaling of selective sweep curves: from tents to houses

Florin Boenkost, Felix Hermann, András Tóbiás, Anton Wakolbinger

AI总结 本文研究了有益突变在大种群中固定过程中频率变化曲线(选择扫除曲线)的尺度特性,发现对数尺度下这些曲线在强选择条件下呈现帐篷状形态,而在中等选择条件下则演化为房屋状形态。研究通过 Moran 模型证明了在大种群极限下,这种形态变化的收敛性具有特定的统一性和 Skorokhod $M_1$ 拓扑性质,为扩展中等选择下的克隆干扰描述提供了理论基础。

Comments 23 pages, 2 figures

详情
英文摘要

One of the classical results of mathematical population genetics states that the frequency of a beneficial mutant's offspring, on its way to fixation in a large population, looks like a logistic curve. A logarithmic scaling (both in height and time) of these selective sweep curves leads (in the case of strong selection) to a tent-like shape in the large population limit: First the logarithmic frequency of the mutant increases linearly from 0 to 1, then that of the former resident decreases from 1 to 0. For moderate selection the logarithmic frequencies develop (in the large population limit) a jump at the beginning/the end of the sweep, which takes the shape of the tent into that of a house. Our main result (proved for the Moran model) assesses the regularity of this convergence in the large population limit: It is uniform in the house's roof (phases of linear growth and decline) and "Skorokhod $M_1$" in the house's walls (closely around the jumps). Apart from interest in its own right, we anticipate that this result and the proof techniques will be instrumental for extending the description of clonal interference by Poissonian interacting trajectories (as it was done in Hermann et al. (2024) for strong selection) also to moderate selection.

2605.10420 2026-05-12 q-bio.PE physics.bio-ph

Travelling waves of invasion in microbial communities with phenotypic switching

Diego Manso Anda, Pierre A. Haas

AI总结 该研究探讨了在具有表型转换能力的微生物群落中,不同菌株之间的入侵波传播特性。研究构建了一个包含两种竞争物种的简化模型,其中一种菌株能够随机地或响应竞争者而切换为抗竞争的持久态表型。研究发现,表型转换对入侵者进入目标种群的波速无影响,但却能加速目标种群反向入侵的竞争者,表明细菌的持久性可能是一种进攻性的生态策略,而非单纯的防御机制。

Comments 12 pages, 5 figures

详情
英文摘要

Complex microbial habitats see the spatial competition of different clonal bacterial populations that switch between different phenotypes. Here, we determine the effect of this subpopulation structure on the invasion of one species by another in a minimal model of two competing species: one species switches, both stochastically and in response to its competitor, to a persister phenotype resilient to competition. Surprisingly, our combined analytical and numerical results show that this phenotypic switching has no effect on the speed of the travelling wave by which the competitors invade the first population. Conversely, we discover that phenotypic switching can speed up the wave by which this population invades their competitors. Our results thus suggest, counterintuitively, that bacterial persistence can be an offensive, rather than defensive ecological strategy.

2605.10356 2026-05-12 q-bio.NC

Cortico-cerebellar modularity as an architectural inductive bias for efficient temporal learning

Alexandra Voce, Emmanouil Giannakakis, Claudia Clopath

AI总结 该研究探讨了小脑与大脑皮层之间的模块化结构如何促进高效的时间学习,并将其应用于人工神经网络设计。研究通过在循环神经网络中引入小脑启发的前馈模块,构建了皮层-小脑混合网络(CB-RNN),在多种时间任务中表现出更快的学习速度和更高的性能。实验表明,小脑模块在提升学习效率方面起主导作用,而皮层网络可作为固定的信息处理单元,为人工系统提供了有力的结构归纳偏差。

详情
英文摘要

The cerebellum and cerebral cortex form tightly coupled circuits thought to support flexible and efficient temporal processing. How this interaction shapes cortical learning dynamics, and whether such heterogeneous modularity can benefit artificial systems, remains unclear. Here, we augment a recurrent neural network (RNN) with a cerebellar-inspired feedforward module and evaluate the resulting architecture on temporal tasks of varying difficulty. The cortico-cerebellar RNN (CB-RNN) learns faster and reaches higher maximum performance than parameter-matched fully recurrent baselines across a variety of regimes. Crucially, freezing the recurrent core after minimal training and delegating subsequent learning to the cerebellar module preserves superior learning efficiency, suggesting the cerebellar module is a primary driver of efficiency and that the cortical network can largely function as a fixed reservoir. Our results suggest that heterogeneous modular architectures can act as a powerful structural inductive bias in neural systems.

2605.10178 2026-05-12 q-bio.NC cs.LG cs.NE

Joint sparse coding and temporal dynamics support context reconfiguration

Qianqian Shi, Yue Che, Faqiang Liu, Hongyi Li, Mingkun Xu, Sandra Reinert, Pieter M. Goltstein, Rong Zhao, Luping Shi

AI总结 该研究探讨了大脑如何在切换不同情境时保持对先前经验的表征,从而实现适应性行为。研究发现,联合稀疏编码与时间动态特性在小鼠内侧前额叶皮层和计算网络中共同作用,有助于在情境转换过程中维持已有知识,减少跨情境干扰。这些机制不仅为理解大脑如何灵活适应新环境提供了理论框架,也为构建避免灾难性遗忘的持续学习系统提供了高效的架构原则。

Comments 37 pages, 6 figures, 6 extended data figures. Preprint version

详情
英文摘要

Adaptive behavior requires the brain to transition between distinct contexts while maintaining representations of prior experience. The ability to reconfigure neural representations without erasing previously acquired knowledge is central to learning in dynamic environments, yet the neural mechanisms that support this balance remain unclear. Understanding these mechanisms is also critical for addressing catastrophic forgetting in artificial systems designed for lifelong learning. Here, we identify joint sparse coding and temporal dynamics in both the mouse medial prefrontal cortex (mPFC) and computational networks as mechanisms that help preserve prior representations during context transitions. Specifically, sparsity in context-dependent representations reduces cross-context interference, whereas temporal dynamics within the network activity further enhance context separability across time. Strikingly, networks endowed with both properties, such as spiking neural networks, exhibit improved retention during lifelong learning without auxiliary heuristics. These findings establish joint sparse coding and temporal dynamics as a core mechanism supporting flexible context reconfiguration in lifelong learning and, through their activity constraining nature, as an energy-efficient architectural principle for stable adaptation. Together, they provide a mechanistic framework for understanding how the brain preserves prior knowledge while flexibly adapting to new contexts.

2605.03827 2026-05-12 cs.DM q-bio.PE

Inferring Phylogenetic Networks from Required and Forbidden LCA-Constraints

Patricia A. Ebert, Marc Hellmuth

AI总结 该论文研究了如何从给定的必要和禁止的最近公共祖先(LCA)约束中推断出系统发育网络。系统发育网络用于描述包含杂交或水平基因转移等网状进化事件的演化历史,而LCA约束指定了不同物种对之间的祖先关系。本文提出了三种避免禁止约束的网络构建方式,并针对每种方式给出了判断是否存在满足约束的网络的精确条件,同时设计了多项式时间算法用于判定和构造此类网络。

详情
英文摘要

Phylogenetic networks provide a framework for representing evolutionary histories involving reticulate events such as hybridization or horizontal gene transfer. A central problem is to infer such networks from local structural information. In this paper, we study network inference from least common ancestor (LCA) constraints, which specify relative ancestral relationships between pairs of taxa. While previous work has characterized when a set of required LCA constraints can be realized by a phylogenetic network, practical applications may also involve constraints that must be explicitly avoided, for example due to biological prior knowledge. We therefore consider the realization problem for pairs $(R,F)$, where $R$ is a set of required LCA-constraints and $F$ is a set of forbidden ones. Since there are several natural ways to formalize what it means for a network to avoid a forbidden LCA-constraint, we study three such variants. For each of them, we characterize exactly when there exists a phylogenetic network that realizes all constraints in $R$ while avoiding all constraints in $F$ in the respective sense. Based on these characterizations, we derive polynomial-time algorithms that decide the existence of such networks and construct one whenever it exists.

2603.20476 2026-05-12 q-bio.NC eess.SP

Transcranial Alternating Current Stimulation (tACS) for patients with Post-Stroke Anomia: Preliminary Data on Picture Naming Performance

Maria Martzoukou, Nefeli K. Dimitriou, Binbin Xu, Malo Renaud-D'Ambra, Anastasia Nousia, Anne Beuter, Grigorios Nasios

AI总结 本研究评估了经颅交流电刺激(tACS)对中风后失语症患者命名能力的干预效果,采用单被试实验设计和图片命名任务进行评估。两名中风患者在接受为期八周的tACS治疗后,其命名速度和准确性均显著提高,且改善效果在三个月后仍保持稳定。研究还发现,治疗后的脑电活动模式趋近于健康被试,表明tACS可能是一种具有长期疗效的失语症干预方法。

详情
Journal ref
PLOS ONE 21(2): e0342191 (2026)
英文摘要

The present study evaluated the effectiveness of transcranial alternating current stimulation (tACS) treating patients with post-stroke anomia using a picture-naming task and a Single-Case Experimental Design (SCED). A right-handed 38-year-old woman with a left-hemisphere stroke and a left-handed 54-year-old man with a right-hemisphere stroke underwent an eight-week treatment program. Specifically, they participated in a picture-naming task three times a week, alternating between sessions with and without tACS stimulation every two weeks. Electroencephalography (EEG) measurements were taken at the end of each two-week period, and behavioral data were collected before, during and after the treatment. EEG and behavioral assessments were also conducted at one- and three-month follow-ups. Picture-naming performance was significantly faster during tACS sessions compared to sessions without tACS. By the end of the intervention, both participants demonstrated improved accuracy and speed, with positive effects also observed in behavioral measures. EEG analysis showed that post-treatment brain activity resembled that of healthy individuals performing similar tasks. Patients' improvements in picture-naming and behavioral tests showed that the positive effects remained stable even after three months. Thus, preliminary data suggest that tACS might be a promising intervention for anomia, with lasting effects. Large-scale studies are needed to confirm these findings.

2508.18710 2026-05-12 q-bio.PE physics.bio-ph

Adaptation to extreme stress under the growth-survival fitness trade-off

Nandita Chaturvedi, Charuhansini Tvishamayi, Shashi Thutupalli

AI总结 该研究探讨了微生物在极端压力下(如冷冻-解冻循环)如何在生存与增殖之间进行权衡的问题。研究通过构建定量模型,分析酵母在生长与极端压力交替环境中的适应机制,重点探讨了静止状态在生存中的作用,并将生长速率、滞后期、静止概率和压力耐受性等关键性状联系到一个统一的生理表型。研究发现,生长-生存权衡的强度依赖于环境参数,如生长期的持续时间,并揭示了在无压力环境下,优化生长的种群仍可能维持生存能力,表明生理权衡并不一定导致种群层面的适应性权衡。

Comments 13 pages, 7 figures

详情
英文摘要

Microbial adaptation to extreme stress, such as starvation, antimicrobial exposure, or freezing often reveals fundamental trade-offs between survival and proliferation. Understanding how populations navigate these trade-offs in fluctuating environments remains a central challenge. We develop a quantitative model to investigate the adaptation of populations of yeast (Saccharomyces cerevisiae) subjected to cycles of growth and extreme freeze-thaw stress, focusing on the role of quiescence as a mediator of survival. Our model links key life-history traits: growth rate, lag time, quiescence probability, and stress survival, to a single underlying phenotype, motivated by the role of intracellular trehalose in the adaptation of yeast to freeze-thaw stress. Through stochastic population simulations and analytical calculation of the long-term growth rate, we identify the evolutionary attractors of the system. We find that the strength of the growth-survival trade-off depends critically on environmental parameters, such as the duration of the growth phase. Crucially, our analysis reveals that populations optimized for growth-stress cycles can often maintain viability alongside growth-optimized populations even in the absence of stress. This demonstrates that underlying physiological trade-offs do not necessarily translate into fitness trade-offs at the population level, providing general insights into the complex interplay between environmental fluctuations, physiological constraints, and evolutionary dynamics.

2506.12944 2026-05-12 cs.LG q-bio.TO

Unsupervised risk factor identification across cancer types and data modalities via explainable artificial intelligence

Maximilian Ferle, Jonas Ader, Thomas Wiemers, Nora Grieb, Adrian Lindenmeyer, Hans-Jonas Meyer, Thomas Neumuth, Markus Kreuz, Kristin Reiche, Maximilian Merz

AI总结 该研究提出了一种基于可解释人工智能的无监督学习方法,用于跨癌症类型和数据模态识别风险因素。该方法通过可微分的多变量logrank统计量优化患者群体的生存异质性,无需依赖代理指标,可适用于任何神经网络架构和数据类型。研究在模拟实验和两种不同癌症数据(多发性骨髓瘤实验室参数和非小细胞肺癌CT图像)中验证了方法的有效性,成功识别出具有显著不同生存结果的患者亚组,并揭示了与已知风险因素一致的临床相关特征,为临床风险分层提供了新的可解释工具。

详情
Journal ref
npj Digit. Med. 9, 363 (2026)
英文摘要

Risk stratification is a key tool in clinical decision-making, yet current approaches often fail to translate sophisticated survival analysis into actionable clinical criteria. We present a novel method for unsupervised machine learning that directly optimizes for survival heterogeneity across patient clusters through a differentiable adaptation of the multivariate logrank statistic. Unlike most existing methods that rely on proxy metrics, our approach represents novel methodology for training any neural network architecture on any data modality to identify prognostically distinct patient groups. We thoroughly evaluate the method in simulation experiments and demonstrate its utility in practice by applying it to two distinct cancer types: analyzing laboratory parameters from multiple myeloma patients and computed tomography images from non-small cell lung cancer patients, identifying prognostically distinct patient subgroups with significantly different survival outcomes in both cases. Post-hoc explainability analyses uncover clinically meaningful features determining the group assignments which align well with established risk factors and thus lend strong weight to the methods utility. This pan-cancer, model-agnostic approach represents a valuable advancement in clinical risk stratification, enabling the discovery of novel prognostic signatures across diverse data types while providing interpretable results that promise to complement treatment personalization and clinical decision-making in oncology and beyond.

2506.04289 2026-05-12 cs.LG q-bio.NC

Relational reasoning and inductive bias in transformers and large language models

Jesse Geerts, Andrew Liu, Stephanie Chan, Claudia Clopath, Kimberly Stachenfeld

AI总结 该研究探讨了基于Transformer的模型在关系推理,特别是传递推理任务中的表现机制。研究比较了权重内学习(IWL)和上下文内学习(ICL)两种方式在传递推理中的行为差异,发现IWL模型通过线性嵌入实现类似人类的传递推理,而ICL模型则仅在训练数据需要时才表现出传递推理能力。研究还表明,通过预训练使ICL模型获得线性表示后,其推理行为可接近IWL,并在大语言模型中验证了训练方式和表示结构对传递推理能力的关键影响。

Comments 15 pages, 10 figures

详情
英文摘要

Transformer-based models have demonstrated remarkable reasoning abilities, but the mechanisms underlying relational reasoning remain poorly understood. We investigate how transformers perform \textit{transitive inference}, a classic relational reasoning behavior from psychology which elicits inference about indirectly related items (e.g., if $A > B$ and $B > C$, then $A > C$). We compare in-weights learning (IWL) and in-context learning (ICL) behaviors and mechanisms on these tasks, and fine profoundly different patterns of generalization. IWL models learn a linear embedding, which leads to transitive inference as well as other behavioral effects present in humans and animals. ICL models, in contrast, are capable of learning to generalize transitively, but only do so when it is necessitated by the training data, otherwise learning a match-and-copy strategy. Interestingly, pre-training ICL models on in-context linear regression tasks that provide them with a latent linear representation is sufficient to make the ICL behaviors and internal representations qualitatively and quantitatively more like IWL. In order to test whether the same inference patterns are present across in large language models, we leverage a congruency paradigm which allows us to differentially probe IWL and ICL generalization patterns without access to their training data. We indeed see IWL reasoning leads to more transitive generalization than ICL. Moreover, we find that prompting the ICL models to use a linear mental map led to increased transitive inference over different geometric prompts. Together, these results reveal that both the training regime and the geometric structure of induced representations critically determine transformers capacity for transitive inference.

2406.12910 2026-05-12 cs.LG cs.AI cs.NE physics.chem-ph q-bio.BM

Human-level molecular optimization driven by mol-gene evolution

Jiebin Fang, Churu Mao, Yuchen Zhu, Xiaoming Chen, Chang-Yu Hsieh, Zhongjun Ma

AI总结 该研究提出了一种名为DGMM的深度遗传分子修饰算法,旨在解决药物分子优化中结构新颖性与药理性质平衡的问题。通过引入离散变分自编码器(D-VAE),将分子编码为量化代码“mol-gene”,从而将深度学习与遗传算法结合,实现类似药物化学家的分子结构优化。该方法能够发现药理性质相似但结构不同的化合物,并揭示药物发现中结构优化的权衡关系,展示了其在多个应用中的有效性。

详情
英文摘要

De novo molecule generation allows the search for more drug-like hits across a vast chemical space. However, lead optimization is still required, and the process of optimizing molecular structures faces the challenge of balancing structural novelty with pharmacological properties. This study introduces the Deep Genetic Molecular Modification Algorithm (DGMM), which brings structure modification to the level of medicinal chemists. A discrete variational autoencoder (D-VAE) is used in DGMM to encode molecules as quantization code, mol-gene, which incorporates deep learning into genetic algorithms for flexible structural optimization. The mol-gene allows for the discovery of pharmacologically similar but structurally distinct compounds, and reveals the trade-offs of structural optimization in drug discovery. We demonstrate the effectiveness of the DGMM in several applications.

2605.09981 2026-05-12 q-bio.BM cs.AI

Yeti: A compact protein structure tokenizer for reconstruction and multi-modal generation

Nabin Giri, Steven Farrell, Kristofer E. Bouchard

AI总结 该研究提出了一种名为Yeti的紧凑型蛋白质结构分词器,旨在解决多模态模型中蛋白质结构、序列和功能注释联合建模的问题。Yeti基于无查找量化方法,通过端到端的流匹配目标进行训练,能够在保持高重建精度的同时实现优异的生成能力。实验表明,Yeti在参数数量大幅减少的情况下,仍能实现与现有模型相当甚至更优的结构重建和多模态生成性能,为高效训练多模态蛋白质生成模型提供了有力工具。

详情
英文摘要

Multimodal models that jointly reason over protein sequences, structures, and function annotations within a unified representation hold immense potential for integrating multimodal data and generating new proteins with designed functional properties. To utilize transformer architectures, such models require a tokenizer that converts protein structure from continuous atomic coordinates into discrete representations suitable for scalable multimodal training. The quality of such models are fundamentally upper bounded by the fidelity and expressiveness of the underlying tokenized structure. However, existing tokenizers prioritize reconstruction over generative abilities. To address these gaps, we introduce Yeti, a simple and compact protein structure tokenizer based on lookup free quantization and trained end to end with a flow matching objective for multimodal learning. Compared to existing models, Yeti generally achieves the best codebook utilization and token diversity, and second best reconstruction accuracy (with 10x fewer parameters than ESM3) on diverse datasets. To validate Yeti's generative capability, we trained a compact multimodal model jointly over its structure tokens and amino acid sequence entirely from scratch, with no pretrained initialization. The resulting multimodal model generates plausible structures under unconditional cogeneration of protein sequence and structures, achieving comparable results to 10x larger models. Together, these results demonstrate that Yeti is a compact and expressive protein structure tokenizer suitable for training multimodal models that cogenerates highly plausible sequences and structures.

2605.09810 2026-05-12 q-bio.BM cs.LG

TD3B: Transition-Directed Discrete Diffusion for Allosteric Binder Generation

Hanqun Cao, Aastha Pal, Sophia Tang, Yinuo Zhang, Jingjie Zhang, Pheng Ann Heng, Pranam Chatterjee

AI总结 该研究提出了一种名为TD3B的基于序列的生成框架,用于设计具有特定激动剂或拮抗剂行为的变构配体。TD3B通过方向性过渡控制目标,结合目标感知的方向向导、软结合亲和力门控以及预训练离散扩散模型的 amortized 微调,实现了与结合亲和力解耦的定向配体生成。该方法能够有效区分激动剂与拮抗剂行为,弥补了传统结构优化方法在方向性功能调控方面的不足。

Comments Published as a Spotlight at ICML 2026 (Proceedings of the 43rd International Conference on Machine Learning, Seoul, South Korea)

详情
英文摘要

Protein function is often controlled by ligands that bias the direction of state transitions, such as agonists and antagonists, rather than stabilizing a single conformation. This is especially important for clinically relevant G protein-coupled receptors (GPCRs), where therapeutic efficacy depends on functional directionality. Structure-based design methods optimize binding to static conformations and cannot represent non-reversible, directional effects or systematically distinguish agonist from antagonist behavior. To address this gap, we introduce Transition-Directed Discrete Diffusion for Allosteric Binder Design (TD3B), a sequence-based generative framework that designs binders with specified agonist or antagonist behavior via a directional transition control objective. TD3B combines a target-aware Direction Oracle, a soft binding-affinity gate, and amortized fine-tuning of a pre-trained discrete diffusion model, enabling targeted agonist and antagonist generation decoupled from binding affinity and unattainable by equilibrium-based or inference-only guidance baselines. The code and checkpoints are available at https://huggingface.co/ChatterjeeLab/TD3B.

2605.09770 2026-05-12 cs.NE eess.SP q-bio.NC

Encoding and Decoding Temporal Signals with Spiking Bandpass Wavelets

Jens Egholm Pedersen, Tony Lindeberg, Peter Gerstoft

AI总结 本文研究了如何利用脉冲波形滤波器对时间信号进行编码与解码,提出了一种基于脉冲的带通小波框架,将脉冲编码与信号处理理论相结合。该方法在保持脉冲表示稀疏性和局部性的同时,提供了定量的带宽和重建误差界,能够在脉冲量化和时间离散化的条件下实现信号重建。实验表明,该方法在ECG和音频数据集上的重建效果接近连续小波变换,且适用于类脑神经形态硬件。

详情
英文摘要

Spike-based encodings are sparse and energy-efficient, but have largely been formulated probabilistically, disconnected from most signal processing literature. We recast spike encoders as time-causal wavelet frames with quantitative bandwidths and reconstruction error bounds. The proposed wavelets preserve the sparsity and locality of spiking representations, with reconstruction up to spike quantization and time discretization. We demonstrate reconstruction on ECG and audio datasets, achieving a normalized RMSE comparable to continuous wavelet transforms. The spiking wavelets map directly to neuromorphic hardware.

2605.09704 2026-05-12 physics.bio-ph q-bio.CB

Coexistence of trapped and flow-transported nuclei enables fast pigeon post communication across multinucleated cell

Johnny Tong, Kaspar Wachinger, Fabian K. Henn, Nico Schramma, Siyu Chen, Karen Alim

AI总结 该研究探讨了多核细胞中核之间的快速通讯机制,以黏菌 *Physarum polycephalum* 为模型,发现核可以处于流动或被胞质基质捕获两种动态状态。通过理论分析与实验验证,研究提出了一种类似“信鸽传递”的通讯方式,即流动的核作为信使在被捕获的核之间传递信号,从而实现远距离快速通讯。这种机制比传统的扩散信号传递方式快数十倍,可能广泛存在于其他多核细胞中,并具有更广泛的生物学意义。

详情
Journal ref
Proc. Natl. Acad. Sci. U.S.A. 122 (50) e2411101122 (2025)
英文摘要

Multi-nucleated cells exist in all domains of life, ranging from animals, plants and fungi to single-celled organisms such as the slime mold Physarum polycephalum. The large cell size, in the case of Physarum reaching centimeters and more, challenges the coordination of nuclei activity as signals need to cross large distances. In search for a mechanism for fast long-ranged communication among nuclei, we quantify nuclei dynamics and cytoplasmic flows in Physarum's tubular network. We observe nuclei in two interchangeable, dynamic states: mobile, flowing within the cytoplasmic shuttle flow, or trapped in the tube's porous cell cortex. As we find nuclei to accumulate at the tube's inner fluid-porous interface we theoretically explore and confirm, with physiological parameters, that slowing down of mobile nuclei during flow is sufficient for diffusible signal exchange between mobile and trapped nuclei. We analytically derive that communication akin to pigeon-post with mobile nuclei serving as pigeons shuttling between trapped nuclei acting as waypoints, gives rise to signaling velocities that account for the rapid intracellular reorganization observed in Physarum. Since signal transfer by flow-transported nuclei outcompetes the mere diffusion of signals encoded in cytosolic proteins, pigeon-post communication surpasses alternative signaling mechanisms, even diffusive relay signaling up to twenty-fold in velocity. The key ingredients of pigeon-post communication, namely alternating flows and waypoints, exist in other multi-nucleated cells and may also be generalized beyond intracellular signaling.

2605.09506 2026-05-12 stat.ME q-bio.QM stat.CO

Accelerating Bayesian Phylogenetic Inference via Delayed Acceptance Sequential Monte Carlo with Random Forest Surrogates

Wentao Yu, Shijia Wang

AI总结 在贝叶斯系统发育分析中,研究旨在估计系统发育树的后验分布。本文提出了一种基于随机森林的代理模型,用于预测标准MCMC方法中树结构变化(如eSPR、stNNI)对似然函数的影响,从而设计出一种延迟接受MCMC核,显著减少似然函数的计算次数。该方法进一步集成到序贯蒙特卡洛采样框架中,实验表明其在保持估计精度的同时大幅提升了计算效率。

详情
英文摘要

In Bayesian phylogenetics, our goal is to estimate the posterior distribution over phylogenetic trees. Markov chain Monte Carlo methods are widely used to approximate the phylogenetic posterior distributions. For large-scale sequence data, repeated evaluation of the likelihood function incurs a high computational cost. In this article, we propose a machine-learning algorithm with over 35 topological and branch-length features to predict the changes in the likelihood function caused by tree moves (\eg,~eSPR, stNNI) used in standard MCMC approaches. This algorithm is then used to design a delayed acceptance MCMC kernel, which utilized the predicted surrogate function for preliminary rejection, to accelerate tree space searches. Furthermore, we integrate our proposed MCMC kernel into the sequential Monte Carlo sampler framework. We validate the proposed delayed-acceptance sequential Monte Carlo approach (DA-SMC) on simulation and real data sets. Our delayed acceptance kernel can maintain robust estimation while reduces the number of likelihood evaluations significantly, yielding substantial computational time savings. We develop a Python package that is available at https://github.com/wentYu/DAphyloSMC.

2605.09409 2026-05-12 q-bio.NC

Predictive and feedback signals differently shape the formation of group-level and individualized language representations

Shuguang Yang, Shaoyun Yu, Xin Jiang, Suiping Wang, Gangyi Feng

AI总结 本研究探讨了预测信号和反馈信号在成人语言学习过程中对群体层面和个体化语言表征形成的不同影响。通过行为实验和fMRI数据,研究发现以预测为导向的模型在群体层面解释了最大的神经可变性,而反馈信号则更有效地预测个体在学习后期的表现差异。结果支持了一种多信号语言学习模型,表明预测信号塑造了学习者共有的神经学习架构,而反馈机制则更好地解释了个体间的差异。

详情
英文摘要

Adults vary greatly in how effectively they learn a new language, but the signals driving the learning processes and individual differences remain unclear. Over seven days, we tracked behavioral learning and collected fMRI data from 102 adults as they learned an artificial language with corrective feedback. We trained matched transformer models with prediction, feedback, or combined objectives and compared their internal representations to brain activity. Representations derived from the prediction-focused model accounted for the largest share of unique neural variance at the group level, despite the human task being feedback-based. Throughout model training, both objectives showed a shift in brain-model alignment from sensory to higher-order language and associative networks, indicating abstraction processing. Conversely, neural patterns related to the feedback model were most useful for predicting individual generalization outcomes on Day 7. These findings support a multi-signal model of adult language learning, in which prediction shapes a common neural learning architecture across learners, whereas feedback-related mechanisms better explain individual differences over time.

2605.09384 2026-05-12 cs.CV cs.AI q-bio.QM

LiteMedCoT-VL: Parameter-Efficient Adaptation for Medical Visual Question Answering

Runze Ma, Shunbo Jia, Haonan Lyu, Guo Liu, Caizhi Liao

AI总结 本文提出了一种名为LiteMedCoT-VL的参数高效的适配方法,旨在提升医疗视觉问答(VQA)模型在资源受限设备上的推理能力。该方法通过基于LoRA的微调,将大型教师模型的链式推理能力迁移至小型学生模型,且无需依赖图像字幕,更贴近实际临床场景。实验表明,LiteMedCoT-VL在PMC-VQA基准测试中取得了64.9%的准确率,显著优于现有基线模型,验证了小参数模型通过推理蒸馏可达到甚至超越更大模型的效果。

Comments 17 pages, 5 figures

详情
英文摘要

The reasoning gap between large and compact vision-language models (VLMs) limits the deployment of medical AI on portable clinical devices. Compact VLMs of 2--4B parameters can run on resource-constrained hardware but lack the multi-step reasoning capacity needed for interpretable clinical decision support. Existing knowledge distillation methods transfer answers without the reasoning process behind them. Medical visual question answering (VQA) serves as a testbed for this problem, as it requires models to integrate visual evidence with clinical knowledge through structured reasoning chains. We introduce LiteMedCoT-VL, a pipeline that transfers chain-of-thought reasoning from a 235B teacher model to 2B student models through LoRA-based fine-tuning on explanation-enriched training data. All inference is conducted without image captions by default, simulating the clinical scenario in which a physician interprets a medical image directly without an accompanying radiology report. On the PMC-VQA benchmark, LiteMedCoT-VL achieves 64.9% accuracy, exceeding the zero-shot Qwen3-VL-4B baseline of 53.9% by 11.0 percentage points and outperforming all published baselines. This result indicates that a 2B model with reasoning distillation can match or exceed models with twice the parameters. Visual grounding analysis shows that the model relies on image content rather than exploiting textual priors. Our code is publicly available at https://anonymous.4open.science/r/LiteMedCoT-VL.

2605.05284 2026-05-12 cs.NE cs.LG q-bio.PE q-bio.QM

Direct From Darwin: Deriving Advanced Optimizers From Evolutionary First Principles

Daniel Grimmer

AI总结 本文从进化论的基本原理出发,直接推导出一系列先进的基于梯度的优化算法,旨在实现高性能优化工具与达尔文进化过程的科学模拟。研究引入了达尔文系谱模拟(DLS)方法,证明在无性繁殖背景下,费舍尔和赖特对进化的对立观点在形式上是等价的,并提出了DLS噪声关系以确保模拟的准确性。通过这一框架,许多成熟的优化算法如随机梯度下降、牛顿法及其正则化形式等被证明与进化动力学兼容,只需引入符合DLS噪声的遗传漂变即可实现对达尔文进化的科学仿真,甚至包括当前最先进的Adam优化器也可通过简单的数学调整实现进化一致性。

Comments 38 pages, 5 figures. Submitted to Evolutionary Computation, May 2026. Code available at: https://github.com/danielgrimmer/adam-dls

详情
英文摘要

Evolutionary computation has long promised to deliver both high-performance optimization tools as well as rigorous scientific simulations of Darwinian evolution. However, modern algorithms frequently abandon evolutionary fidelity for physics-inspired heuristics or superficial biological metaphors. This paper derives a suite of advanced gradient-based optimization algorithms directly from evolutionary first principles. We introduce Darwinian Lineage Simulations (DLS) to prove that, in an asexual context, Fisher's and Wright's historically opposed views of evolution are actually formally equivalent; One can partition Fisher's deterministically-evolving total population into Wright's randomly-drifting sub-populations. We prove that proper bookkeeping requires introducing a specific kind of structured noise (the DLS noise relation). Crucially, any bookkeeping choices which satisfy this relation will yield a faithful simulation of evolution. Using this vast representational freedom, we prove that a broad family of battle-tested optimization algorithms are already perfectly compatible with evolutionary dynamics. These include: Stochastic Gradient Descent as well as many regularizations/approximations of Newton's method and Natural Gradient Descent. By simply adding DLS noise (i.e., evolutionarily faithful genetic drift), these algorithms become scientifically valid in silico simulations of Darwinian evolution. Finally, we demonstrate that even the state-of-the-art Adam optimizer can be brought into evolutionary compliance through a minor mathematical surgery.

2603.22705 2026-05-12 q-bio.NC q-bio.PE

Detecting outliers of pursuit eye movements: a preliminary analysis of autism spectrum disorder

Emiko Shishido, Seiko Miyata, Tetsuya Yamamoto, Masaki Fukunaga, Ryota Hashimoto, Kenichiro Miura, Norio Ozaki

AI总结 本研究旨在通过分析平滑追踪眼动(SPEM)中的异常模式,揭示自闭症谱系障碍(ASD)患者在眼动控制方面的个体特异性异常。研究采用马哈拉诺比斯距离计算个体偏离正常值的“异常分数”,并基于主成分分析优化的特征向量进行统计判断,发现ASD组的异常比例显著高于正常对照组。该方法有效捕捉了传统群体均值分析难以发现的个体差异,为理解ASD异质性提供了新的视角。

Comments 4 pages, 2 figures, 2 video files, Supplementary Materials (2 files), Supplementary Methods and Codes in GitHub

详情
英文摘要

Background: Autism spectrum disorder (ASD) is characterized by significant clinical and biological heterogeneity. Conventional group-mean analyses of eye movements often mask individual atypicalities, potentially overlooking critical pathological signatures. This study aimed to identify idiosyncratic oculomotor patterns in ASD using an "outlier analysis" of smooth pursuit eye movement (SPEM). Methods: We recorded SPEM during a slow Lissajous pursuit task in 18 adults with ASD and 39 typically developed (TD) individuals. To quantify individual deviations, we derived an "outlier score" based on the Mahalanobis distance. This score was calculated from a feature vector, optimized via Principal Component Analysis (PCA), comprising the temporal lag ($Δ$t) and the spatial deviation ($Δ$s). An outlier was statistically defined as a score exceeding $\sqrt{10}$ (approximately 3.16$σ$) relative to the TD normative distribution. Results: While the TD group exhibited a low outlier rate of 5.1%, the ASD group demonstrated a significantly higher prevalence of 38.9% (7/18) (binomial P = 0.0034). Furthermore, the mean outlier score was significantly elevated in the ASD group (3.00 $\pm$ 2.62) compared to the TD group (1.52 $\pm$ 0.80; P = 0.002). Notably, these extreme deviations were captured even when conventional mean-based comparisons showed limited sensitivity. Conclusions: Our outlier analysis successfully visualized the high degree of idiosyncratic atypicality in ASD oculomotor control. By shifting the focus from group averages to individual deviations, this approach provides a sensitive metric for capturing the inherent heterogeneity of ASD, offering a potential baseline for identifying clinical subtypes.

2510.04578 2026-05-12 q-bio.PE

Gonosomic algebras: an extension of gonosomal algebras

Richard Varro

AI总结 本文引入了“gonosomic代数”,用于代数化描述遗传不育现象,扩展了用于性决定和性连锁基因传递的gonosomal代数模型,使其能够考虑遗传不育因素。文章给出了gonosomic代数非gonosomal的条件,并提供了若干代数构造方法。通过关联演化算子W和频率分布算子V,研究了平衡点的稳定性性质在两者之间的保持关系,为理解遗传现象提供了新的代数工具。

详情
英文摘要

In this paper, we introduce gonosomic algebras to algebraically translate the phenomenon of genetic sterility. Gonosomic algebras extend the concept of gonosomal algebras used as algebraic model of genetic phenomena related to sex-determination and sex-linked gene transmission by allowing genetic sterility to be taken into account. Conditions under which gonosomic algebras are not gonosomal and several algebraic constructions of gonosomic algebras are given. To each gonosomic algebra, an evolution operator noted W is associated that gives the state of the offspring population at the birth stage. Next from W we define the operator V which gives the frequency distribution of genetic types. We show that the various stability notions of equilibrium points are preserved by passing from W to V .

2509.21671 2026-05-12 cs.LG q-bio.NC

Neuroprobe: Evaluating Intracranial Brain Responses to Naturalistic Stimuli

Andrii Zahorodnii, Christopher Wang, Geeling Chau, Bennett Stankovits, Charikleia Moraitaki, Eli Gross, Alexander Brady, Andrei Barbu, Boris Katz, Ila R Fiete

AI总结 本文提出 Neuroprobe,一个用于评估侵入式脑电图(iEEG)记录下大脑对自然刺激响应的解码任务框架。该研究基于 BrainTreebank 数据集,包含10名受试者观看电影时超过40小时的iEEG记录,旨在系统研究语言处理过程中不同脑区的时间和空间特征解码规律。Neuroprobe 不仅有助于揭示语言和听觉信息在大脑中的处理流程,还为比较神经基础模型的架构和训练方法提供了标准化评估平台。

Comments 38 pages, 7 main figures, 16 supplementary figures, 13 tables

详情
英文摘要

High-resolution neural datasets enable foundation models for the next generation of brain-computer interfaces and neurological treatments. The community requires rigorous benchmarks to discriminate between competing modeling approaches, yet no standardized evaluation frameworks exist for intracranial EEG (iEEG) recordings. To address this gap, we present Neuroprobe: a suite of decoding tasks for studying multi-modal language processing in the brain. Unlike scalp EEG, intracranial EEG requires invasive surgery to implant electrodes that record neural activity directly from the brain with minimal signal distortion. Neuroprobe is built on the BrainTreebank dataset, which consists of over 40 hours of iEEG recordings from 10 human subjects performing a naturalistic movie viewing task. Neuroprobe serves two critical functions. First, it is a source from which neuroscience insights can be drawn. The high temporal and spatial resolution of the labeled iEEG allows researchers to systematically determine when and where computations for each aspect of language processing occur in the brain by measuring the decodability of each feature across time and all electrode locations. Using Neuroprobe, we visualize how information flows from key language and audio processing sites in the superior temporal gyrus to sites in the prefrontal cortex. We also demonstrate the time evolution of processing from simple auditory features (e.g., pitch and volume) to more complex language features (e.g., part of speech) in a purely data-driven manner. Second, as the field moves toward neural foundation models trained on large-scale datasets, Neuroprobe provides a rigorous framework for comparing competing architectures and training protocols. We make the code for Neuroprobe openly available, aiming to enable rapid progress in the field of iEEG foundation models. Public leaderboard: https://neuroprobe.dev/

2503.09007 2026-05-12 q-bio.QM

Reconstructing Noisy Gene Regulation Dynamics Using Extrinsic-Noise-Driven Neural Stochastic Differential Equations

Jiancheng Zhang, Xiangting Li, Xiaolu Guo, Zhaoyi You, Lucas Böttcher, Alex Mogilner, Alexander Hoffman, Tom Chou, Mingtao Xia

AI总结 该研究提出了一种基于外源噪声驱动的神经随机微分方程(END-nSDE)框架,用于从异质细胞群体的轨迹数据中重建受内在和外在噪声影响的基因调控动态。通过引入Wasserstein距离,该方法能够准确建模细胞异质性对外源噪声下反应动力学的调控作用,并在多个细胞生物学系统中验证了其有效性。相比传统的时序分析方法,如RNN和LSTM,该方法在建模精度和噪声还原方面表现更优,为复杂生物物理过程提供了高效的替代建模手段。

详情
Journal ref
PLoS Comput. Biol. 21(9), e1013462 (2025)
英文摘要

Proper regulation of cell signaling and gene expression is crucial for maintaining cellular function, development, and adaptation to environmental changes. Reaction dynamics in cell populations is often noisy because of (i) inherent stochasticity of intracellular biochemical reactions (``intrinsic noise'') and (ii) heterogeneity of cellular states across different cells that are influenced by external factors (``extrinsic noise''). In this work, we introduce an extrinsic-noise-driven neural stochastic differential equation (END-nSDE) framework that utilizes the Wasserstein distance to accurately reconstruct SDEs from trajectory data from a heterogeneous population of cells (extrinsic noise). We demonstrate the effectiveness of our approach using both simulated and experimental data from three different systems in cell biology: (i) circadian rhythms, (ii) RPA-DNA binding dynamics, and (iii) NF$κ$B signaling process. Our END-nSDE reconstruction method can model how cellular heterogeneity (extrinsic noise) modulates reaction dynamics in the presence of intrinsic noise. It also outperforms existing time-series analysis methods such as recurrent neural networks (RNNs) and long short-term memory networks (LSTMs). By inferring cellular heterogeneities from data, our END-nSDE reconstruction method can reproduce noisy dynamics observed in experiments. In summary, the reconstruction method we propose offers a useful surrogate modeling approach for complex biophysical processes, where high-fidelity mechanistic models may be impractical.

2501.11271 2026-05-12 q-bio.CB q-bio.MN q-bio.TO

Incorporating stochastic gene expression, signaling-mediated intercellular interactions, and regulated cell proliferation in models of coordinated tissue development

Casey O. Barkan, Tom Chou

AI总结 该研究旨在构建一个能够整合随机基因表达、细胞间信号调控以及受控细胞增殖的定量模型,以描述组织协调发育的过程。研究提出了一种简化的理论框架,通过引入Waddington矢量场描述细胞状态,支持非梯度动力学,并定义了表观遗传适应度景观以刻画不同细胞类型的增殖特性。该框架通过两个模型系统验证了其适用性,分别为两基因相互作用的分化过程和受斑马鱼启发的时空组织模型。

Comments 14 pages, 5 figures

详情
Journal ref
PRX Life 4, 023012, (2026)
英文摘要

Formulating quantitative and predictive models for tissue development requires consideration of the complex, stochastic gene expression dynamics, its regulation via cell-to-cell interactions, and cell proliferation. Including all of these processes into a practical mathematical framework requires complex expressions that are difficult to interpret and apply. We construct a simple theory that incorporates intracellular stochastic gene expression dynamics, signaling chemicals that influence these dynamics and mediate cell-cell interactions, and cell proliferation and its accompanying differentiation. Cellular states (genetic and epigenetic) are described by a Waddington vector field that allows for non-gradient dynamics (cycles, entropy production, loss of detailed balance) which is precluded in Waddington potential landscape representations of gene expression dynamics. We define an epigenetic fitness landscape that describes the proliferation of different cell types, and elucidate how this fitness landscape is related to Waddington's vector field. We illustrate the applicability of our framework by analyzing two model systems: an interacting two-gene differentiation process and a spatiotemporal organism model inspired by planaria.

2409.19378 2026-05-12 q-bio.PE cond-mat.dis-nn cond-mat.stat-mech

Stochastic quasi-cycles as a simple explanation for the time evolution of the Cape Rodney-Okakari Point Marine ecological reserve

César Parra-Rojas, Duccio Fanelli, Alan J. McKane

AI总结 该研究探讨了新西兰北岛Cape Rodney-Okakari Point海洋生态保护区中,潮间带生物群落长达20多年的种群数量周期性变化现象。研究提出了一种基于随机准周期的简单机制,作为解释生态系统中种群振荡的替代模型,避免了传统确定性模型所需的复杂假设。通过最大似然方法拟合个体随机轨迹,研究展示了在无需大量动态副本的情况下,也可有效验证随机准周期的存在,为生态动力学研究提供了新的分析思路。

Comments The published version of the paper (Ecological Modelling Volume 514, April 2026, 111477) is available here https://www.sciencedirect.com/science/article/abs/pii/S0304380026000050

详情
Journal ref
Ecological Modelling Volume 514 111477 (2026)
英文摘要

The dataset collected at the Cape Rodney-Okakari Point Marine (CR-OPM) reserve on the North Island of New Zealand is rather unique. It describes the cyclic time evolution of a rocky intertidal community, with the relative abundances of the various coastal species that have been meticulously monitored for more than 20 years. Past theoretical studies, anchored on a deterministic description, required invoking ad hoc mechanisms to reproduce the observed dynamical paths. Following a maximum likelihood approach to interpolate individual stochastic trajectories, we here propose quasi-cycles as an alternative and simpler mechanism to explain the oscillations observed in the population numbers of the ecosystem. From a general standpoint, we also show that it is possible to return conclusive evidence on the existence of stochastic quasi-cycles, without resorting to global fitting strategies which necessitate handling a large collection of independent replicas of the dynamics, a possibility that is often precluded in real life applications.

2407.16249 2026-05-12 q-bio.NC eess.SP

How Does a Single EEG Channel Tell Us About Brain States in Brain-Computer Interfaces ?

Zaineb Ajra, Binbin Xu, Gérard Dray, Jacky Montmain, Stéphane Perrey

AI总结 本文探讨了如何利用单一EEG通道数据在脑机接口中识别脑状态的问题,提出两种策略:一种是从多通道数据训练模型并在单通道数据上测试,另一种是仅用单通道数据训练并在其他通道上测试。研究设计了参数少、学习速度快的卷积神经网络,用于高效分类认知任务,并在三个数据集的算术和运动想象任务中实现了高达100%、91.55%和73.45%的分类准确率,为单通道BCI系统的发展提供了可行方案和可靠的脑状态生物标志物。

Comments Accepted in the 16th International Conference on Human System Interaction 2024, Paris, France

详情
英文摘要

Over recent decades, neuroimaging tools, particularly electroencephalography (EEG), have revolutionized our understanding of the brain and its functions. EEG is extensively used in traditional brain-computer interface (BCI) systems due to its low cost, non-invasiveness, and high temporal resolution. This makes it invaluable for identifying different brain states relevant to both medical and non-medical applications. Although this practice is widely recognized, current methods are mainly confined to lab or clinical environments because they rely on data from multiple EEG electrodes covering the entire head. Nonetheless, a significant advancement for these applications would be their adaptation for "real-world" use, using portable devices with a single-channel. In this study, we tackle this challenge through two distinct strategies: the first approach involves training models with data from multiple channels and then testing new trials on data from a single channel individually. The second method focuses on training with data from a single channel and then testing the performances of the models on data from all the other channels individually. To efficiently classify cognitive tasks from EEG data, we propose Convolutional Neural Networks (CNNs) with only a few parameters and fast learnable spectral-temporal features. We demonstrated the feasibility of these approaches on EEG data recorded during mental arithmetic and motor imagery tasks from three datasets. We achieved the highest accuracies of 100%, 91.55% and 73.45% in binary and 3-class classification on specific channels across three datasets. This study can contribute to the development of single-channel BCI and provides a robust EEG biomarker for brain states classification.

1603.02337 2026-05-12 cond-mat.soft q-bio.PE

Swarming in viscous fluids: three-dimensional patterns in swimmer- and force-induced flows

Yao-Li Chuang, M. R. D'Orsogna, T. Chou

AI总结 该研究从基本原理出发,建立了一个描述黏性流体中自推进粒子群体行为的三维理论模型。研究揭示了群体行为如何依赖于流体透明度、自推进机制和粒子间相互作用类型,并发现“社会”相互作用与直接“物理”相互作用在流体中产生的流动场存在显著差异。通过分析不同流体特性与运动机制的影响,研究发现了多种新的三维群体结构,展示了流体中介作用对群体形态、稳定性和运动方式的重要影响。

详情
英文摘要

We derive from first principles a three-dimensional theory of self-propelled particle swarming in a viscous fluid environment. Our model predicts emergent collective behavior that depends critically on fluid opacity, mechanism of self-propulsion, and type of particle-particle interaction. In "clear fluids" swimmers have full knowledge of their surroundings and can adjust their velocities with respect to the lab frame, while in "opaque fluids," they control their velocities only in relation to the local fluid flow. We also show that "social" interactions that affect only a particle's propensity to swim towards or away from neighbors induces a flow field that is qualitatively different from the long-ranged flow fields generated by direct "physical" interactions. The latter can be short-ranged but lead to much longer-ranged fluid-mediated hydrodynamic forces, effectively amplifying the range over which particles interact. These different fluid flows conspire to profoundly affect swarm morphology, kinetically stabilizing or destabilizing swarm configurations that would arise in the absence of fluid. Depending upon the overall interaction potential, the mechanism of swimming (e.g., pushers or pullers), and the degree of fluid opaqueness, we discover a number of new collective three-dimensional patterns including flocks with prolate or oblate shapes, recirculating peloton-like structures, and jet-like fluid flows that entrain particles mediating their escape from the center of mill-like structures. Our results reveal how the interplay among general physical elements influence fluid-mediated interactions and the self-organization, mobility, and stability of new three-dimensional swarms and suggest how they might be used to kinetically control their collective behavior.

1506.02111 2026-05-12 cond-mat.stat-mech q-bio.PE q-bio.QM

A kinetic theory for age-structured stochastic birth-death processes

Chris D. Greenman, Tom Chou

AI总结 本文提出了一种全新的全随机动力学理论,用于描述具有年龄结构的相互作用种群的出生与死亡过程。传统模型无法同时处理随机波动和种群规模依赖的出生死亡率,而本文通过定义多粒子概率密度函数,建立了一组描述年龄结构种群随机演化的动力学方程,揭示了年龄依赖的出生死亡过程无法分解为独立因子,必须通过类似BBGKY的方程体系求解。研究在两个简化情形下得到了显式解,并与平均场结果进行了对比,为同时建模年龄和种群规模依赖的随机动力学提供了直观且高效的方法,适用于人口学、干细胞动力学和疾病演化的研究。

Comments 9 pages, 2 figures. Abridged version with supporting appendix submitted to Phys. Rev. Lett

详情
Journal ref
Phys. Rev. E 93, 012112 (2016)
英文摘要

Classical age-structured mass-action models such as the McKendrick-von Foerster equation have been extensively studied but they are structurally unable to describe stochastic fluctuations or population-size-dependent birth and death rates. Stochastic theories that treat semi-Markov age-dependent processes using e.g., the Bellman-Harris equation, do not resolve a population's age-structure and are unable to quantify population-size dependencies. Conversely, current theories that include size-dependent population dynamics (e.g., mathematical models that include carrying capacity such as the Logistic equation) cannot be easily extended to take into account age-dependent birth and death rates. In this paper, we present a systematic derivation of a new fully stochastic kinetic theory for interacting age-structured populations. By defining multiparticle probability density functions, we derive a hierarchy of kinetic equations for the stochastic evolution of an ageing population undergoing birth and death. We show that the fully stochastic age-dependent birth-death process precludes factorization of the corresponding probability densities, which then must be solved by using a BBGKY-like hierarchy. However, explicit solutions are derived in two simple limits and compared with their corresponding mean-field results. Our results generalize both deterministic models and existing master equation approaches by providing an intuitive and efficient way to simultaneously model age- and population-dependent stochastic dynamics applicable to the study of demography, stem cell dynamics, and disease evolution.

1408.4518 2026-05-12 cond-mat.stat-mech q-bio.QM q-bio.SC

First Passage Problems in Biology

Tom Chou, Maria R. D'Orsogna

AI总结 本文综述了首次通过时间在生物学中的广泛应用,涵盖从分子到生态系统等多个尺度。作者介绍了典型的马尔可夫模型中的首次通过问题,涉及分子解离、基因表达、神经放电、细胞突变与疾病演化等过程,并提出一个简单的干细胞衰老模型并分析其结果。文章还讨论了相关近似方法及在不同应用中出现的物理与数学细节。

Comments 40 pages, 90 references, 10 figures

详情
Journal ref
in First-Passage Phenomena and Their Applications, eds. R. Metzler, G. Oshanin and S. Redner (World Scientific, 2014), pp. 306-345
英文摘要

Applications of first passage times in stochastic processes arise across a wide range of length and time scales in biological settings. After an initial technical overview, we survey representative applications and their corresponding models. Within models that are effectively Markovian, we discuss canonical examples of first passage problems spanning applications to molecular dissociation and self-assembly, molecular search, transcription and translation, neuronal spiking, cellular mutation and disease, and organismic evolution and population dynamics. In this last application, a simple model for stem-cell ageing is presented and some results derived. Various approximation methods and the physical and mathematical subtleties that arise in the chosen applications are also discussed.