arXivDaily arXiv每日学术速递 周一至周五更新
重置
2508.21749 2026-03-11 math.CO cs.DM q-bio.QM

When Many Trees Go to War: On Sets of Phylogenetic Trees With Almost No Common Structure

Mathias Weller, Norbert Zeh

详情
Journal ref
Discrete Mathematics & Theoretical Computer Science, vol. 28:2, Combinatorics (March 10, 2026) dmtcs:16446
英文摘要

It is known that any two trees on the same $n$ leaves can be displayed by a network with $n-2$ reticulations, and there are two trees that cannot be displayed by a network with fewer reticulations. But how many reticulations are needed to display multiple trees? For any set of $t$ trees on $n$ leaves, there is a trivial network with $(t - 1)n$ reticulations that displays them. To do better, we have to exploit common structure of the trees to embed non-trivial subtrees of different trees into the same part of the network. In this paper, we show that for $t \in o(\sqrt{\lg n})$, there is a set of $t$ trees with virtually no common structure that could be exploited. More precisely, we show for any $t\in o(\sqrt{\lg n})$, there are $t$ trees such that any network displaying them has $(t-1)n - o(n)$ reticulations. For $t \in o(\lg n)$, we obtain a slightly weaker bound. We also prove that already for $t = c\lg n$, for any constant $c > 0$, there is a set of $t$ trees that cannot be displayed by a network with $o(n \lg n)$ reticulations, matching up to constant factors the known upper bound of $O(n \lg n)$ reticulations sufficient to display \emph{all} trees with $n$ leaves. These results are based on simple counting arguments and extend to unrooted networks and trees.

2603.09860 2026-03-11 q-bio.BM physics.bio-ph

Joint Geometric-Chemical Distance for Protein Surfaces

Himanshu Swami, John M. McBride, Jean-Pierre Eckmann, Tsvi Tlusty

详情
英文摘要

Protein function is executed at the molecular surface, where shape and chemistry act together to govern interaction. Yet most comparison methods treat these aspects separately, privileging either global fold or local descriptors and missing their coupled organization. Here we introduce IFACE (Intrinsic Field-Aligned Coupled Embedding), a correspondence-based framework that aligns protein surfaces through probabilistic coupling of intrinsic geometry with spatially distributed chemical fields. From this alignment, we derive a joint geometric--chemical distance that integrates structural and physicochemical discrepancies within a single formulation. Across diverse proteins, this distance separates conformational variability from true structural divergence more effectively than fold-based similarity measures. Applied to the cytochrome P450 family, it reveals coherent family-level organization and identifies conserved buried catalytic pockets despite the complex topology. By linking interpretable surface correspondences with a unified distance, IFACE establishes a principled basis for comparing protein interfaces and detecting functionally related interaction patches across proteins.

2603.09765 2026-03-11 q-bio.NC

Curvature Blindness from Polarity Breaks and Orientation Channel Fragmentation in V1

Michael Menke

Comments 12 pages, 2 figures

详情
英文摘要

We present a mathematical model of the curvature blindness illusion in which sinusoids appear as angular zigzags when drawn with alternating contrast polarity against a gray background. The model identifies two complementary mechanisms, both operating in V1. First, polarity channel separation: simple cells are selective for contrast polarity, and lateral connections link only same polarity neurons; where the line switches from darker than background to lighter than background at each peak and trough, the encoding population changes and the lateral chain is broken, segmenting the contour into half-wavelength pieces. Second, orientation channel fragmentation: at moderate contrast, the active orientation window is narrow, and within each half-wavelength segment no single orientation channel spans the full range of edge normals; the inflection point at the center of each segment anchors a locally straight percept. Together, the two mechanisms produce a zigzag: polarity breaks supply the corners, and fragmentation straightens the segments between them.

2603.09729 2026-03-11 q-bio.NC cs.RO cs.SY eess.SY

Efficient and robust control with spikes that constrain free energy

André Urbano, Pablo Lanillos, Sander Keemink

详情
英文摘要

Animal brains exhibit remarkable efficiency in perception and action, while being robust to both external and internal perturbations. The means by which brains accomplish this remains, for now, poorly understood, hindering our understanding of animal and human cognition, as well as our own implementation of efficient algorithms for control of dynamical systems.A potential candidate for a robust mechanism of state estimation and action computation is the free energy principle, but existing implementations of this principle have largely relied on conventional, biologically implausible approaches without spikes. We propose a novel, efficient, and robust spiking control framework with realistic biological characteristics. The resulting networks function as free energy constrainers, in which neurons only fire if they reduce the free energy of their internal representation. The networks offer efficient operation through highly sparse activity while matching performance with other similar spiking frameworks, and have high resilience against both external (e.g. sensory noise or collisions) and internal perturbations (e.g. synaptic noise and delays or neuron silencing) that such a network would be faced with when deployed by either an organism or an engineer. Overall, our work provides a novel mathematical account for spiking control through constraining free energy, providing both better insight into how brain networks might leverage their spiking substrate and a new route for implementing efficient control algorithms in neuromorphic hardware.

2603.09384 2026-03-11 cond-mat.dis-nn q-bio.NC

Dreaming improves memorization in a Hopfield model with bounded synaptic strength

Enzo Marinari, Saverio Rossi, Francesco Zamponi

详情
英文摘要

The Hopfield model provides a paradigmatic framework for associative memory. Its classical implementation, based on the Hebbian learning rule, suffers from catastrophic forgetting: when one attempts storing too many patterns, the network fails to retrieve any of them. Yet, the Hebbian rule does not take into account that synaptic strength is bounded. Introducing this biologically plausible modification, known as "clipping", eliminates catastrophic forgetting; the model is now able to retrieve the most recently seen memories, eliminating older ones. Yet, its memorization capacity is much reduced with respect to the unclipped case. Here, we investigate the effects of adding a "dreaming" phase on the capacity of a clipped Hopfield model. Following a proposal by Hopfield, Feinstein and Palmer, we assume that during the dreaming phase, the model generates random patterns that are then "unlearned". We show that while clipping still removes catastrophic forgetting, alternating learning and dreaming phases improves the memorization capacity and makes the search for optimal performance more realistic from an evolutionary perspective.

2603.08444 2026-03-11 physics.bio-ph physics.flu-dyn q-bio.QM

Hydrodynamic origins of symmetric swimming strategies

Takahiro Kanazawa, Kenta Ishimoto, Kyogo Kawaguchi

Comments 28 pages, 3+4 figures

详情
英文摘要

Efficient locomotion is important for the evolution of complex life, yet the physical principles selecting specific swimming strokes often remain entangled with biological constraints. In viscous fluids, the scallop theorem constrains the temporal organization of strokes, but no analogous principle is known for their spatial structure, leaving the prevalence of symmetric gaits across diverse organisms without a physical explanation. Here we show that spatial symmetry acts as an emergent organizing principle for efficiency in viscous fluids. By analysing deformable swimmers whose strokes are not constrained to any particular symmetry class, we identify a hydrodynamic duality: symmetric and anti-symmetric strokes are dynamically equivalent, yielding identical speeds and efficiencies, which we prove are optimal among all strokes. We validate this using numerical simulations of Stokes flow, demonstrating that these symmetry rules persist even in three-dimensional body plans. Our results suggest that the prevalence of symmetric and alternating gaits in nature reflects not merely a developmental constraint, but a physical optimality principle for locomotion in viscous environments, complementing developmental and neural constraints.

2603.08409 2026-03-11 physics.bio-ph q-bio.NC

Embodied intelligence solves the centipede's dilemma

Adam Dionne, Fabio Giardina, L. Mahadevan

详情
英文摘要

Although commonly associated with limbless animals like snakes and fish, multi-legged organisms like centipedes also utilize undulatory locomotion. Whether these undulations are actively reinforced or resisted by the axial musculature remains an open question. We present a dynamical model of centipede locomotion that integrates leg-ground interactions, passive body mechanics, and active lateral musculature. By varying stepping rate, actuation, and body stiffness, we examine how locomotor strategies affect speed and an effective energetic efficiency. Coordination emerges only when body stiffness is tuned to stepping frequency: overly flexible bodies lose synchrony, while overly rigid ones move slowly and inefficiently. This leads to the prediction that centipedes utilize speed dependent active stiffness to maintain this coordination. Our results suggest that lateral muscles also have a speed dependent function, revealed by optimizing speed and an effective cost, that resists a phase lag between leg touchdowns and body curvature. Together, we find that centipedes actively modulate body mechanics to achieve rapid, efficient locomotion, highlighting how complex control can emerge from embodied physical properties rather than solely from neural computation.

2601.00050 2026-03-11 q-bio.QM

Domain-aware priors stabilize, not merely enable, vertical federated learning in data-scarce coral multi-omics

Sam Victor

Comments 22 pages, 06 figures, 04 tables, 01 algorithm, 20 references. Journal submission currently in progress

详情
英文摘要

Vertical federated learning (VFL) enables multi-laboratory collaboration on distributed multi-omics datasets without sharing raw data, but exhibits severe instability under extreme data scarcity (P >> N) when applied generically. Here, we investigate how domain-aware design choices; specifically gradient saliency-guided feature selection with biologically motivated priors; affect the stability, interpretability, and failure modes of VFL architectures in small-sample coral stress classification (N = 13 samples, P = 90,579 features across transcriptomics, proteomics, metabolomics, and microbiome data). We benchmark REEF (Robust Expert Encoder Federation), a domain-aware VFL framework, against two baselines on the Montipora capitata thermal stress dataset: (i) a standard NVFlare-based VFL and (ii) LASER, a state-of-the-art label-aware VFL method. REEF achieves an AUROC of 0.776 +/- 0.039 after reducing dimensionality by 98.6% (90,579 to 1,300 features), substantially outperforming NVFlare VFL at chance level (AUROC 0.500 +/- 0.125, p = 0.0106, Cohen's d = 2.265) and numerically exceeding LASER (AUROC 0.557 +/- 0.191, p = 0.0995, Cohen's d = 1.068), with 3-5-fold variance reduction. An equal-weights ablation confirms that biological priors specifically contribute stability: removing priors yields statistically indistinguishable mean AUROC (p = 0.405) but 2.3x higher variance (CV 0.110 vs 0.050). Negative control experiments using permuted labels produce AUROC near or below chance (0.357 for REEF, 0.238 for NVFlare), consistent with the absence of gross data leakage. These results motivate design principles for VFL in extreme P >> N regimes, emphasizing domain-informed dimensionality reduction, stability-focused evaluation, and interpretable feature selection for scarce biological data.

2509.21277 2026-03-11 q-bio.NC

More than a feeling: Expressive style influences cortical speech tracking in subjective cognitive decline

Matthew King-Hang Ma, Yun Feng, Cloris Pui-Hang Li, Manson Cheuk-Man Fong

详情
英文摘要

Subjective cognitive decline (SCD) doubles dementia risk. This study investigates how self-perceived cognitive worsening manifests in neural dynamics during naturalistic speech perception. EEG was collected from 60 cognitively normal older adults while they listened to speech varied in prosodic contexts, categorized by expressive styles (scrambled, descriptive, dialogue, exciting). Encoding models mapping three speech representations -- acoustic, subsyllabic segmentation and phonotactic features -- to the ongoing EEG signals were built. Cortical tracking strength (CTS) showed that models fitted with linguistic features outperformed acoustic ones. Crucially, a greater degree of SCD was associated with weaker CTS of (1) higher-level linguistic but not acoustic features, and (2) prosodically flat speech (scrambled and descriptive). Thus, the CTS of higher-level linguistic features while listening to prosodically flat speech may serve as a potential biomarker for early-stage cognitive decline.

2509.19536 2026-03-11 q-bio.QM q-bio.CB

Integrating Mechanistic Modeling and Machine Learning to Study CD4+/CD8+ CAR-T Cell Dynamics with Tumor Antigen Regulation

Saranya Varakunan, Melissa Stadt, Mohammad Kohandel

Comments 28 pages, 9 figures

详情
英文摘要

Chimeric antigen receptor (CAR) T cell therapy has shown remarkable success in hematological malignancies, yet patient responses remain highly variable and the roles of CD4+ and CD8+ subsets are not fully understood. We present an extended mathematical framework of CAR-T cell dynamics that explicitly models CD4+ helper and CD8+ cytotoxic lineages and their interactions with tumor antigen burden. Building on the Kirouac et al. (2023) model of antigen-regulated memory-effector-exhaustion transitions, our system of differential equations incorporates CD4-mediated modulation of CD8+ proliferation, cytotoxicity, and memory regeneration through biologically grounded, saturating interactions. Sensitivity analyses identify effector proliferation, antigen turnover, and CD8+ expansion rates as dominant drivers of treatment outcome. Virtual patient simulations recover reported qualitative trends in CAR-T composition, including enhanced expansion and tumor clearance for defined CD4:CD8 products relative to CD8-only formulations, while also revealing inter-patient variability and time-dependent effects. To assess the practical limits of patient-level prediction under parameter uncertainty, we introduce controlled noise into key parameters and show that direct mechanistic classification rapidly degrades. We then demonstrate that a simple feed-forward neural network can partially recover predictive signal from noisy inputs, outperforming a naive baseline while remaining consistent with mechanistic sensitivities. This work positions the extended model as a hypothesis generator, and illustrates how data-driven methods can complement mechanistic modeling when parameter uncertainty constrains predictive confidence.

2509.15328 2026-03-11 cs.LG cs.CV q-bio.NC

Kuramoto Orientation Diffusion Models

Yue Song, T. Anderson Keller, Sevan Brodjian, Takeru Miyato, Yisong Yue, Pietro Perona, Max Welling

Comments NeurIPS 2025

详情
英文摘要

Orientation-rich images, such as fingerprints and textures, often exhibit coherent angular directional patterns that are challenging to model using standard generative approaches based on isotropic Euclidean diffusion. Motivated by the role of phase synchronization in biological systems, we propose a score-based generative model built on periodic domains by leveraging stochastic Kuramoto dynamics in the diffusion process. In neural and physical systems, Kuramoto models capture synchronization phenomena across coupled oscillators -- a behavior that we re-purpose here as an inductive bias for structured image generation. In our framework, the forward process performs \textit{synchronization} among phase variables through globally or locally coupled oscillator interactions and attraction to a global reference phase, gradually collapsing the data into a low-entropy von Mises distribution. The reverse process then performs \textit{desynchronization}, generating diverse patterns by reversing the dynamics with a learned score function. This approach enables structured destruction during forward diffusion and a hierarchical generation process that progressively refines global coherence into fine-scale details. We implement wrapped Gaussian transition kernels and periodicity-aware networks to account for the circular geometry. Our method achieves competitive results on general image benchmarks and significantly improves generation quality on orientation-dense datasets like fingerprints and textures. Ultimately, this work demonstrates the promise of biologically inspired synchronization dynamics as structured priors in generative modeling.

2507.11531 2026-03-11 cs.LG q-bio.NC

Langevin Flows for Modeling Neural Latent Dynamics

Yue Song, T. Anderson Keller, Yisong Yue, Pietro Perona, Max Welling

Comments Full version of the Cognitive Computational Neuroscience (CCN) 2025 poster

详情
英文摘要

Neural populations exhibit latent dynamical structures that drive time-evolving spiking activities, motivating the search for models that capture both intrinsic network dynamics and external unobserved influences. In this work, we introduce LangevinFlow, a sequential Variational Auto-Encoder where the time evolution of latent variables is governed by the underdamped Langevin equation. Our approach incorporates physical priors -- such as inertia, damping, a learned potential function, and stochastic forces -- to represent both autonomous and non-autonomous processes in neural systems. Crucially, the potential function is parameterized as a network of locally coupled oscillators, biasing the model toward oscillatory and flow-like behaviors observed in biological neural populations. Our model features a recurrent encoder, a one-layer Transformer decoder, and Langevin dynamics in the latent space. Empirically, our method outperforms state-of-the-art baselines on synthetic neural populations generated by a Lorenz attractor, closely matching ground-truth firing rates. On the Neural Latents Benchmark (NLB), the model achieves superior held-out neuron likelihoods (bits per spike) and forward prediction accuracy across four challenging datasets. It also matches or surpasses alternative methods in decoding behavioral metrics such as hand velocity. Overall, this work introduces a flexible, physics-inspired, high-performing framework for modeling complex neural population dynamics and their unobserved influences.

2506.00168 2026-03-11 q-bio.QM q-bio.CB stat.ML

SSRCA: a novel machine learning pipeline to perform sensitivity analysis for agent-based models

Edward H. Rohr, John T. Nardini

详情
英文摘要

Agent-based models (ABMs) are widely used in biology to understand how individual actions scale into emergent population behavior. Modelers employ sensitivity analysis (SA) algorithms to quantify input parameters' impact on model outputs, however, it is hard to perform SA for ABMs due to their computational and complex nature. In this work, we develop the Simulate, Summarize, Reduce, Cluster, and Analyze (SSRCA) methodology, a machine-learning based pipeline designed to facilitate SA for ABMs. In particular, SSRCA can achieve the following tasks for ABMS: 1) identify sensitive model parameters, 2) reveal common output model patterns, and 3) determine which input parameter values generate these patterns. We use an example ABM of tumor spheroid growth to showcase how SSRCA identifies four common patterns from the ABM and the parameter regions that generate these outputs. Additionally, we compare the SA results between SSRCA and the popular Sobol' Method and find that SSRCA's identified sensitive parameters are robust to the choice of model descriptors while Sobol's are not. This analysis could streamline data-driven tasks, such as parameter estimation, for ABMs by reducing parameter space. While we highlight these results with an ABM on tumor spheroid formation, the SSRCA Methodology is broadly applicable to biological ABMs.

2503.23189 2026-03-11 cond-mat.dis-nn cond-mat.stat-mech econ.GN math.PR q-bio.PE q-fin.EC

A mean-field theory for heterogeneous random growth with redistribution

Maximilien Bernard, Jean-Philippe Bouchaud, Pierre Le Doussal

详情
Journal ref
Phys. Rev. E 113, L032101(2026)
英文摘要

We study the competition between random multiplicative growth and redistribution/migration in the mean-field limit, when the number of sites is very large but finite. We find that for static random growth rates, migration should be strong enough to prevent localisation, i.e. extreme concentration on the fastest growing site. In the presence of an additional temporal noise in the growth rates, a third partially localised phase is predicted theoretically, using results from Derrida's Random Energy Model. Such temporal fluctuations mitigate concentration effects, but do not make them disappear. We discuss our results in the context of population growth and wealth inequalities.

2501.17901 2026-03-11 q-bio.BM cs.LG

Molecular Fingerprints Are Strong Models for Peptide Function Prediction

Jakub Adamczyk, Piotr Ludynia, Wojciech Czech

详情
英文摘要

Understanding peptide properties is often assumed to require modeling long-range molecular interactions, motivating the use of complex graph neural networks and pretrained transformers. Yet, whether such long-range dependencies are essential remains unclear. We investigate if simple, domain-specific molecular fingerprints can capture peptide function without these assumptions. Atomic-level representation aims to provide richer information than purely sequence-based models and better efficiency than structural ones. Across 132 datasets, including LRGB and five other peptide benchmarks, models using count-based ECFP, Topological Torsion, and RDKit fingerprints with LightGBM achieve state-of-the-art accuracy. Despite encoding only short-range molecular features, these models outperform GNNs and transformer-based approaches. Control experiments with sequence shuffling and amino acid counts confirm that fingerprints, though inherently local, suffice for robust peptide property prediction. Our results challenge the presumed necessity of long-range interaction modeling and highlight molecular fingerprints as efficient, interpretable, and computationally lightweight alternatives for peptide prediction.

2501.10620 2026-03-11 q-bio.QM

AI-Driven Hybrid Ecological Model for Predicting Oncolytic Viral Therapy Dynamics

Abicumaran Uthamacumaran, Juri Kiyokawa, Hiroaki Wakimoto

详情
Journal ref
In Silico Research in Biomedicine, 2026. Volume 2, 100258
英文摘要

Oncolytic viral therapy (OVT) is an emerging precision therapy for aggressive and recurrent cancers. However, its clinical efficacy is hindered by the complexity of tumor-virus-immune interactions and the lack of predictive models for personalized treatment. This study develops a data-driven, AI-powered computational model combining time-delayed Generalized Lotka-Volterra equations with advanced optimization algorithms, including Genetic Algorithms, Differential Evolution, and Reinforcement Learning, to optimize OVT oscillations' growth and damping. We hypothesize that the model can provide accurate, real-time predictions of OVT responses while identifying key biomarkers to enhance therapeutic efficacy. The model demonstrates strong predictive accuracy, achieving mean squared error (MSE) < 0.02 and R-squared > 0.82. It also identifies experimentally validated biomarkers such as TNF, NFkB, CD81, TRAF2, IL18, and BID, among other inflammatory cytokines and extracellular matrix reconstruction factors, despite being causally agnostic and unaware of specific experimental conditions or therapeutic combinations. Gene set enrichment analysis confirmed these biosignatures as critical predictors of tumor progression and indicated that photodynamic therapy activates immune responses similar to those elicited by combined OVT and immune checkpoint inhibitors. This hybrid model represents a significant step toward precision oncology and computational medicine, enabling longitudinal, adaptive treatment regimens and developing targeted immunotherapies based on molecular signatures, potentially improving patient outcomes.

2311.04709 2026-03-11 q-bio.QM

Forecasting and predicting stochastic agent-based model data with biologically-informed neural networks

John T. Nardini

详情
英文摘要

Collective migration is an important component of many biological processes, including wound healing, tumorigenesis, and embryo development. Spatial agent-based models (ABMs) are often used to model collective migration, but it is challenging to thoroughly predict these models' behavior throughout parameter space due to their random and computationally intensive nature. Modelers often coarse-grain ABM rules into mean-field differential equation (DE) models. While these DE models are fast to simulate, they suffer from poor (or even ill-posed) ABM predictions in some regions of parameter space. In this work, we describe how biologically-informed neural networks (BINNs) can be trained to learn interpretable BINN-guided DE models capable of accurately predicting ABM behavior. In particular, we show that BINN-guided partial DE (PDE) simulations can 1.) forecast future spatial ABM data not seen during model training, and 2.) predict ABM data at previously-unexplored parameter values. This latter task is achieved by combining BINN-guided PDE simulations with multivariate interpolation. We demonstrate our approach using three case study ABMs of collective migration that imitate cell biology experiments and find that BINN-guided PDEs accurately forecast and predict ABM data with a one-compartment PDE when the mean-field PDE is ill-posed or requires two compartments. This work suggests that BINN-guided PDEs allow modelers to efficiently explore parameter space, which may enable data-driven tasks for ABMs, such as estimating parameters from experimental data. All code and data from our study is available at https://github.com/johnnardini/Forecasting_predicting_ABMs.

2603.09071 2026-03-11 quant-ph nlin.SI q-bio.PE

Toda-like Hamiltonian as a probe for quantized prey-predator dynamics

Alex E. Bernardini, Orfeu Bertolami

Comments 32 pages, 6 figures

详情
英文摘要

Phase-space features of a reduced version of the Toda-like Hamiltonian, $\mathcal{H}(x,\,k)$, written in a form constrained by the condition $\partial^2 \mathcal{H} / \partial x \partial k = 0$, with $x$ and $k$ as canonically conjugate variables, are analyzed in terms of Wigner currents. For Wigner currents convoluted with either thermodynamic or Gaussian ensembles, the underlying Hamiltonian dynamics admits analytic corrections due to quantum distortions over the classical phase-space pattern, computed and interpreted through quantifiers of quantumness and stationarity. Notably, while emulating the Lotka-Volterra (LV) dynamics that describe ecological competition systems, the Toda-like classical dynamics allows for analytical solutions with computable periods corresponding to closed phase-space orbits of isotropic prey-predator population distributions. The essential conditions for understanding how classical and quantum evolution can coexist are provided at different scales of quantumness, driven by the associated convoluting ensemble parameter. In the case of Gaussian statistical ensembles, the exact profile of the quantum distortions over classical prey-predator phase-space trajectories is obtained non-perturbatively. Our results indicate that, besides the classical stability admitted by LV models, the Toda-like patterns also exhibit quantum stability. Therefore, this can be regarded as the first step as a predictive theoretical framework towards more robust descriptions of quantum patterns in competitive microscopic biosystems.

2603.08949 2026-03-11 q-bio.NC

Diffusion of Neuromodulators for Temporal Credit Assignment

João Barretto-Bittar, Anna Levina, Emmanouil Giannakakis, Roxana Zeraati

详情
英文摘要

Biological learning achieves temporal credit assignment despite sparse and imprecise feedback, often relying on neuromodulatory signals acting over space and time. Here, we introduce a learning mechanism in which error information diffuses locally through the network, similar to volume transmission of neuromodulators. This distributed modulation allows neurons to learn even in the absence of direct feedback, using the local concentration of the diffusing credit signal. Applied to recurrent spiking neural networks with sparse feedback connectivity, diffusive credit signaling improves learning across three benchmark tasks. Using eligibility propagation as a baseline learning mechanism, we show how diffusion-based modulation can provide a plausible mechanism for credit assignment in sparsely connected neural circuits.

2603.08913 2026-03-11 cs.LG cs.CR q-bio.GN

Quantifying Memorization and Privacy Risks in Genomic Language Models

Alexander Nemecek, Wenbiao Li, Xiaoqian Jiang, Jaideep Vaidya, Erman Ayday

Comments 13 pages

详情
英文摘要

Genomic language models (GLMs) have emerged as powerful tools for learning representations of DNA sequences, enabling advances in variant prediction, regulatory element identification, and cross-task transfer learning. However, as these models are increasingly trained or fine-tuned on sensitive genomic cohorts, they risk memorizing specific sequences from their training data, raising serious concerns around privacy, data leakage, and regulatory compliance. Despite growing awareness of memorization risks in general-purpose language models, little systematic evaluation exists for these risks in the genomic domain, where data exhibit unique properties such as a fixed nucleotide alphabet, strong biological structure, and individual identifiability. We present a comprehensive, multi-vector privacy evaluation framework designed to quantify memorization risks in GLMs. Our approach integrates three complementary risk assessment methodologies: perplexity-based detection, canary sequence extraction, and membership inference. These are combined into a unified evaluation pipeline that produces a worst-case memorization risk score. To enable controlled evaluation, we plant canary sequences at varying repetition rates into both synthetic and real genomic datasets, allowing precise quantification of how repetition and training dynamics influence memorization. We evaluate our framework across multiple GLM architectures, examining the relationship between sequence repetition, model capacity, and memorization risk. Our results establish that GLMs exhibit measurable memorization and that the degree of memorization varies across architectures and training regimes. These findings reveal that no single attack vector captures the full scope of memorization risk, underscoring the need for multi-vector privacy auditing as a standard practice for genomic AI systems.

2603.08882 2026-03-11 q-bio.MN math.DS

Automated Classification of Homeostasis Structure in Input-Output Networks

Xinni Lin, Fernando Antoneli, Yangyang Wang

Comments 48 pages, 20 figures

详情
英文摘要

Homeostasis is widely observed in biological systems and refers to their ability to maintain an output quantity approximately constant despite variations in external disturbances. Mathematically, homeostasis can be formulated through an input-output function mapping an external parameter to an output variable. Infinitesimal homeostasis occurs at isolated points where the derivative of this input-output function vanishes, allowing tools from singularity theory and combinatorial matrix theory to characterize homeostatic mechanisms in terms of network topology. However, the required combinatorial enumeration becomes increasingly intractable as network size grows, and the reliance on advanced graph-theoretic concepts limits accessibility and practical use in biological applications. To overcome these limitations, we develop a Python-based algorithm that automates the identification of homeostasis subnetworks and their associated homeostasis conditions directly from network topology. Given an input-output network specified solely by its connectivity structure and designated input and output nodes, the algorithm identifies the relevant graph-theoretical structures and enumerates all homeostatic mechanisms. We demonstrate its applicability across a range of biological examples, including small and large networks, networks with single or multiple input nodes or parameters, and cases where input and output coincide. This wide applicability stems from our extension of the theoretical framework from single-input-single-output networks to networks with multiple input nodes through an augmented single-input-node representation. The resulting computational framework provides a scalable and systematic approach to classifying homeostatic mechanisms in complex biological networks, facilitating the application of advanced mathematical theory to a broad range of biological systems.

2603.08874 2026-03-11 q-bio.PE

The Black Death Anomaly: A Non-Abelian Field Theory of Epidemiological Safe Zones

Jose de Jesus Bernal-Alvarado, David Delepine

Comments 11 pages, 2 figures

详情
英文摘要

Classical reaction-diffusion models of the 14th-century Black Death fail to explain the rapid genetic radiation of \textit{Yersinia pestis} and the anomalous emergence of vast, untouched geographic safe zones, such as Central Europe. In this work, we resolve these historical anomalies by embedding macroscopic pathogen dynamics within a non-Abelian gauge theory. Utilizing the Doi-Peliti formalism, we map the stochastic master equation of a multi-strain epidemic into a covariant classical field theory. We introduce an $SU(N)$ environmental gauge field, $\mathbf{A}_μ$, which actively couples geographic displacement to phenotypic mutation, treating evolutionary drift as a spatial transport phenomenon. We demonstrate via linear stability analysis that this covariant advection drives a Differential Flow (Turing-Hopf) instability, spontaneously breaking spatial symmetry to generate traveling waves of mutation. Furthermore, by extending the pathogen multiplet to the large-$N$ ('t Hooft) continuum limit, we prove that historical safe zones are not statistical outliers nor the result of perfect quarantine, but are mathematically necessary topological voids. In this continuous limit, the destructive interference of the mutating wavefronts analytically resolves into a stable, isotropic macroscopic node governed by a zeroth-order Bessel function ($J_0$), precisely mapping onto the historical survival of Poland and Bohemia.

2603.08866 2026-03-11 q-bio.NC eess.SP

A Dynamical Systems and System Identification Framework for Phase Amplitude Coupling Analysis

Rajintha Gunawardena, Fei He

详情
英文摘要

Phase-amplitude coupling (PAC), a form of cross-frequency interaction, has been implicated in various cognitive functions and, by extension, in neural communication and information integration. Accurately detecting and characterising PAC is essential for understanding its role in processes such as memory and attention. However, this remains a significant challenge. Most existing methods rely on variations in the temporal profile to detect PAC, but they often suffer from key limitations, most notably, their sensitivity to filter bandwidth selection and their susceptibility to detecting spurious couplings. Previous studies have suggested that approaches grounded in the actual generative dynamics of PAC may offer improved accuracy. In this study, we adopt a dynamical systems perspective and propose a novel method for PAC detection and characterisation based on nonlinear system identification. This approach involves identifying a nonlinear dynamical model that captures the temporal dynamics underlying PAC. The resulting generative model enables noise-free simulation of estimated PAC signals, facilitating detailed analysis of modulation strength and the low-frequency phase at which the high-frequency bursts occur. The proposed method accounts for harmonic-induced spurious couplings through empirically derived criteria and remains robust to high noise levels and variations in slow-frequency power, offering an accurate and interpretable framework for PAC analysis. The performance of the proposed approach is illustrated using several simulated examples and a real case using local field potentials (LFP) data. The results are compared with several popular methods.

2603.08767 2026-03-11 q-bio.MN math.CO math.DS

Duality in mass-action networks

Alexandru Iosif

详情
英文摘要

Mass-action networks are special cases of chemical reaction networks. For these systems, we argue that conserved quantities are dual to internal cycles. We introduce maximal invariant polyhedral supports, and we conjecture that there is a duality relation between preclusters and maximal invariant polyhedral supports. Given the close relation between maximal invariant polyhedral supports and siphons, we also conjecture that siphons and preclusters are dual objects.

2601.03307 2026-03-11 q-bio.MN cond-mat.soft cond-mat.stat-mech nlin.AO physics.bio-ph

Understanding the temperature response of biological systems: Part II -- Network-level mechanisms and emergent dynamics

Simen Jacobs, Julian B. Voits, Nikita Frolov, Ulrich S. Schwarz, Lendert Gelens

Comments 10 pages, 3 figures

详情
英文摘要

Building on the phenomenological and microscopic models reviewed in Part I, this second part focuses on network-level mechanisms that generate emergent temperature response curves. We review deterministic models in which temperature modulates the kinetics of coupled biochemical reactions, as well as stochastic frameworks, such as Markov chains, that capture more complex multi-step processes. These approaches show how Arrhenius-like temperature dependence at the level of individual reactions is transformed into non-Arrhenius scaling, thermal limits, and temperature compensation at the system level. Together, network-level models provide a mechanistic bridge between empirical temperature response curves and the molecular organization of biological systems, giving us predictive insights into robustness, perturbations, and evolutionary constraints.

2601.00984 2026-03-11 q-bio.NC

A Biologically Plausible Dense Associative Memory with Exponential Capacity

Mohadeseh Shafiei Kafraj, Dmitry Krotov, Peter E. Latham

详情
英文摘要

Krotov and Hopfield (2021) proposed a biologically plausible two-layer associative memory network with memory storage capacity exponential in the number of visible neurons. However, the capacity was only linear in the number of hidden neurons. This limitation arose from the choice of nonlinearity between the visible and hidden units, which enforced winner-take-all dynamics in the hidden layer, thereby restricting each hidden unit to encode only a single memory. We overcome this limitation by introducing a novel associative memory network with a threshold nonlinearity that enables distributed representations. In contrast to winner-take-all dynamics, where each hidden neuron is tied to an entire memory, our network allows hidden neurons to encode basic components shared across many memories. Consequently, complex patterns are represented through combinations of hidden neurons. These representations reduce redundancy and allow many correlated memories to be stored compositionally. Thus, we achieve much higher capacity: exponential in the number of hidden units, provided the number of visible units is sufficiently large relative to the number of hidden units. Exponential capacity arises because all binary states of the hidden units can become stable memory patterns. Moreover, the distributed hidden representation, which has much lower dimensionality than the visible layer, preserves class-discriminative structure, supporting efficient nonlinear decoding. These results establish a new regime for associative memory, enabling high-capacity, robust, and scalable architectures consistent with biological constraints.

2512.08074 2026-03-11 q-bio.QM cond-mat.soft cond-mat.stat-mech nlin.AO physics.bio-ph

Understanding the temperature response of biological systems: Part I -- Phenomenological descriptions and microscopic models

Simen Jacobs, Julian Voits, Nikita Frolov, Ulrich S. Schwarz, Lendert Gelens

Comments 14 pages, 2 figures

详情
英文摘要

Virtually every biological rate depends on temperature, yet the resulting rate-temperature relationships often deviate strongly from simple Arrhenius behavior. In this first part of a two-part review, we survey phenomenological models used to describe biological temperature responses across scales, from enzymatic reactions to organismal performance. We discuss common functional forms, including symmetric and asymmetric thermal performance curves and extensions of the Arrhenius law, and we highlight how these models define operational quantities such as optimal temperatures, thermal breadths, and thermal limits. We also discuss microscopic models for the effect of temperature, which however do not capture cooperative effects. In Part II of this review, we will discuss how system-level temperature response curves emerge from the interaction of many underlying reactions.