arXivDaily arXiv每日学术速递 周一至周五更新
重置
2603.03237 2026-03-04 math.AT q-bio.QM

Topology of Multi-species Localization

Abhinav Natarajan, Thomas Chaplin, Joshua A. Bull, Eoghan J. Mulholland-Illingworth, Simon J. Leedham, Helen M. Byrne, Maria-Jose Jimenez, Heather A. Harrington

Comments Main manuscript: 16 pages, 4 figures, 1 table; SI Appendix: 13 pages, 5 figures

详情
英文摘要

Spatial relationships in multi-species data can indicate and affect system outcomes and behaviors, ranging from disease progression in cancer to coral reef resilience in ecology; therefore, quantifying these relationships is an important problem across scientific disciplines. Persistent homology (PH), a key mathematical and computational tool in topological data analysis (TDA), provides a multiscale description of the shape of data. While it effectively describes spatial organization of species, such as cellular patterns in pathology, it cannot detect the shape relations between different types of species. Traditionally, PH analyzes single-species data, which limits the spatial analysis of interactions between different species. Leveraging recent developments in TDA and computational geometry, we introduce a scalable approach to quantify higher-order interactions in multi-species data. The framework can distinguish the presence of shape features or patterns in the data that are (i) common to multiple species of points, (ii) present in some species but disappear in the presence of other species, (iii) only visible when multiple species are considered together, and (iv) formed by some species and remain visible in the presence of others. We demonstrate our approach on two example applications. We identify (1) different behavioral regimes in a synthetic tumor micro-environment model, and (2) interspecies spatial interactions that are most significantly altered in colorectal cancer tissue samples during disease progression.

2603.03037 2026-03-04 math.AT q-bio.NC

Zigzag Persistence of Neural Responses to Time-Varying Stimuli

Yuri Gardinazzi, Alessio Ansuini, Eugenio Piasini, Fabio Anselmi, Matteo Biagetti

Comments 4+7 pages, 7 figures, accepted as proceedings of the Geometry, Topology and Machine Learning Workshop (GTML) 2025

详情
英文摘要

We use topological data analysis to study neural population activity in the Sensorium 2023 dataset, which records responses from thousands of mouse visual cortex neurons to diverse video stimuli. For each video, we build frame-by-frame cubical complexes from neuronal activity and apply zigzag persistent homology to capture how topological structure evolves over time. These dynamics are summarized with persistence landscapes, providing a compact vectorized representation of temporal features. We focus on one-dimensional topological features-loops in the data-that reflect coordinated, cyclical patterns of neural co-activation. To test their informativeness, we compare repeated trials of different videos by clustering their resulting topological neural representations. Our results show that these topological descriptors reliably distinguish neural responses to distinct stimuli. This work highlights a connection between evolving neuronal activity and interpretable topological signatures, advancing the use of topological data analysis for uncovering neural coding in complex dynamical systems.

2603.02952 2026-03-04 q-bio.GN cs.LG q-bio.CB

Sparse autoencoders reveal organized biological knowledge but minimal regulatory logic in single-cell foundation models: a comparative atlas of Geneformer and scGPT

Ihor Kendiukhov

详情
英文摘要

Background: Single-cell foundation models such as Geneformer and scGPT encode rich biological information, but whether this includes causal regulatory logic rather than statistical co-expression remains unclear. Sparse autoencoders (SAEs) can resolve superposition in neural networks by decomposing dense activations into interpretable features, yet they have not been systematically applied to biological foundation models. Results: We trained TopK SAEs on residual stream activations from all layers of Geneformer V2-316M (18 layers, d=1152) and scGPT whole-human (12 layers, d=512), producing atlases of 82525 and 24527 features, respectively. Both atlases confirm massive superposition, with 99.8 percent of features invisible to SVD. Systematic characterization reveals rich biological organization: 29 to 59 percent of features annotate to Gene Ontology, KEGG, Reactome, STRING, or TRRUST, with U-shaped layer profiles reflecting hierarchical abstraction. Features organize into co-activation modules (141 in Geneformer, 76 in scGPT), exhibit causal specificity (median 2.36x), and form cross-layer information highways (63 to 99.8 percent). When tested against genome-scale CRISPRi perturbation data, only 3 of 48 transcription factors (6.2 percent) show regulatory-target-specific feature responses. A multi-tissue control yields marginal improvement (10.4 percent, 5 of 48 TFs), establishing model representations as the bottleneck. Conclusions: These models have internalized organized biological knowledge, including pathway membership, protein interactions, functional modules, and hierarchical abstraction, yet they encode minimal causal regulatory logic. We release both feature atlases as interactive web platforms enabling exploration of more than 107000 features across 30 layers of two leading single-cell foundation models.

2603.02770 2026-03-04 math.CO q-bio.MN

Autocatalytic Cores in Reaction Networks with Explicit Catalysis

Richard Golnik, Thomas Gatter, Peter F. Stadler, Nicola Vassena

Comments 31 pages, 4 figures, 1 table

详情
英文摘要

Autocatalytic cores are minimal units in reaction networks (RNs) responsible for the emergence of autocatalysis. In the absence of explicit catalysis, i.e., when an entity appears both as reactant and product in the same reaction, they are known to be encoded by square submatrices of the stoichiometric matrix whose columns can be reordered as an irreducible child-selection (CS) matrix with negative diagonal and nonnegative off-diagonal (Metzler matrix). In the bipartite Koenig graph representing the RN, these CS matrices can be identified by fluffles, i.e., strong blocks with an identical number of entity and reaction vertices that have out- and in-degree 1, respectively. Here, we adapt the concepts derived for autocatalytic cores to RNs with explicitly catalyzed reactions, which emerge as digons, i.e., elementary circuits in the Koenig graph of length 2. In this setting, we confirm that an inspection of the stoichiometric matrix alone is inconclusive concerning the presence and number of autocatalytic cores, requiring a more delicate algebraic analysis. Nevertheless, this generalization preserves both the graph and the matrix representation as fluffles and irreducible Metzler CS matrices, respectively, although the diagonal is no longer necessarily strictly negative. We introduce the notion of hard autocatalytic cores, i.e. those that do not yield other autocatalytic cores upon inclusion of all reverse reactions. Finally, we consider the case of unit stoichiometries and show that each autocatalytic core can be constructed as the superposition of at most 2 elementary circuits. In particular, autocatalytic cores involving explicitly catalyzed reactions always contain a spanning subgraph consisting of a single elementary circuit together with a simple entity-to-reaction chord. Moreover, we identify the essentially unique example for which at least two circuits are required.

2603.02757 2026-03-04 q-bio.PE

Mathematical model of tumor-macrophage interactions: Elucidating the tumor-driven macrophage phenotype reprogramming

Haifeng Zhang, Yipu Qu, Wuyue Yang, Chenghang Li

详情
英文摘要

The interplay between tumor cells and macrophages plays a central regulatory role in cancer progression. In this study, we developed a mathematical model that incorporates tumor cells, M1 type macrophages, M2 type macrophages and an M3 type macrophage population characterized by dual phenotypic features. First, we analyzed the fundamental mathematical properties of the model and derived the conditions under which the system attains a tumor free stable state or a coexistence state of tumor and immune cells. Second, global sensitivity analysis revealed that key parameters governing macrophage polarization and intercellular communication vary dynamically during tumor development. Bifurcation analysis further identified the polarization rate of M1 type macrophages $κ$ and the baseline level of resting macrophages $M_0$ as critical determinants of the system's dynamical behavior. Notably, using approximate Bayesian computation for parameter inference and dynamic simulations, the model successfully recapitulated the evolutionary trajectories of eight tumor samples. The results demonstrate that lower tumor burden is significantly associated with higher M1 type macrophage infiltration and delayed peak time of M3 type macrophage activation. Moreover, survival analysis indicated that both enhanced M1 type macrophage infiltration and delayed peak time of M3 type macrophage activation are correlated with longer survival time. In summary, this study not only provides a theoretical framework for understanding the dynamic mechanisms underlying tumor macrophage interactions but also proposes two potential clinical prognostic markers: the level of M1 type macrophage infiltration and the peak time of M3 type macrophage activation.

2603.02753 2026-03-04 cs.LG q-bio.QM stat.ML

Deep learning-guided evolutionary optimization for protein design

Erik Hartman, Di Tang, Johan Malmström

Comments Code available at GitHub

详情
英文摘要

Designing novel proteins with desired characteristics remains a significant challenge due to the large sequence space and the complexity of sequence-function relationships. Efficient exploration of this space to identify sequences that meet specific design criteria is crucial for advancing therapeutics and biotechnology. Here, we present BoGA (Bayesian Optimization Genetic Algorithm), a framework that combines evolutionary search with Bayesian optimization to efficiently navigate the sequence space. By integrating a genetic algorithm as a stochastic proposal generator within a surrogate modeling loop, BoGA prioritizes candidates based on prior evaluations and surrogate model predictions, enabling data-efficient optimization. We demonstrate the utility of BoGA through benchmarking on sequence and structure design tasks, followed by its application in designing peptide binders against pneumolysin, a key virulence factor of \textit{Streptococcus pneumoniae}. BoGA accelerates the discovery of high-confidence binders, demonstrating the potential for efficient protein design across diverse objectives. The algorithm is implemented within the BoPep suite and is available under an MIT license at \href{https://github.com/ErikHartman/bopep}{GitHub}.

2511.14065 2026-03-04 q-bio.NC cs.SY eess.SY

Intrinsic Resonance depends on Network Size of Coupled-Delayed Interacting Oscillators

Felipe A. Torres, Alejandro Weinstein, Jesus M. Cortes, Wael El-Deredy

Comments 16 pages, 3 figures: 2 figures in the main text and 1 figure in the appendix

详情
英文摘要

The collective frequency that emerges from synchronized neuronal populations--the network resonance--shows a systematic relationship with brain size: whole-brain's large networks oscillate slowly, whereas finer parcellations of fixed volume exhibit faster rhythms. This resonance-size scaling has been reported in delayed neural mass models and human neuroimaging, yet the physical mechanism remained unresolved. Here we show that size-dependent resonance follows directly from propagation delays in delay-coupled phase oscillators. Starting from a Kuramoto model with heterogeneous delays, we linearize around the near-synchronous solution and obtain a closed-form approximation linking the resonance $Ω$ to the mean delay and the effective coupling field. The analysis predicts a generic scaling law: $Ω\approx (\sum_j c_{ij} τ)^{-1}$, so resonance is delay-limited and therefore depends systematically on geometric size or parcellation density. We evaluate four growth scenarios--expanding geometry, fixed-volume parcellation, constant geometry, and an unphysical reference case--and show that only geometry-consistent scaling satisfies the analytical prediction. Numerical simulations with heterogeneous delays validate the law and quantify its error as a function of delay dispersion. These results identify a minimal physical mechanism for size-dependent cortical resonance and provide an analytical framework that unifies numeric simulation outputs.

2510.08946 2026-03-04 q-bio.BM cs.LG

Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

Siyuan Chen, Minghao Guo, Caoliwen Wang, Anka He Chen, Yikun Zhang, Jingjing Chai, Yin Yang, Wojciech Matusik, Peter Yichen Chen

详情
英文摘要

Biomolecular interaction modeling has been substantially advanced by foundation models, yet they often produce all-atom structures that violate basic steric feasibility. We address this limitation by enforcing physical validity as a strict constraint during both training and inference with a uniffed module. At its core is a differentiable projection that maps the provisional atom coordinates from the diffusion model to the nearest physically valid conffguration. This projection is achieved using a Gauss-Seidel scheme, which exploits the locality and sparsity of the constraints to ensure stable and fast convergence at scale. By implicit differentiation to obtain gradients, our module integrates seamlessly into existing frameworks for end-to-end ffnetuning. With our Gauss-Seidel projection module in place, two denoising steps are sufffcient to produce biomolecular complexes that are both physically valid and structurally accurate. Across six benchmarks, our 2-step model achieves the same structural accuracy as state-of-the-art 200-step diffusion baselines, delivering approximately 10 times faster wall-clock speed while guaranteeing physical validity. The code is available at https://github.com/chensiyuan030105/ProteinGS.git.

2508.07326 2026-03-04 physics.chem-ph cs.LG math.PR physics.comp-ph q-bio.BM

Nonparametric Reaction Coordinate Optimization with Histories: A Framework for Rare Event Dynamics

Polina V. Banushkina, Sergei V. Krivov

Comments expanded the discussion of conceptual and methodological challenges in the Introduction; no changes to results

详情
英文摘要

Rare but critical events in complex systems, such as protein folding, chemical reactions, disease progression, and extreme weather or climate phenomena, are governed by complex, high-dimensional, stochastic dynamics. Identifying an optimal reaction coordinate (RC) that accurately captures the progress of these dynamics is crucial for understanding and simulating such processes. However, determining an optimal RC for realistic systems is notoriously difficult, due to methodological challenges that limit the success of standard machine learning techniques. These challenges include the absence of ground truth, the lack of a loss function for general nonequilibrium dynamics, the difficulty of selecting expressive neural network architectures that avoid overfitting, the irregular and incomplete nature of many real world trajectories, limited sampling and the extreme data imbalance inherent in rare event problems. Here, we introduce a nonparametric RC optimization framework that incorporates trajectory histories and circumvents these challenges, enabling robust analysis of irregular or incomplete data without requiring extensive sampling. The power of the method is demonstrated through increasingly challenging analyses of protein folding dynamics, where it yields accurate committor estimates that pass stringent validation tests and produce high resolution free energy profiles. Its generality is further illustrated through applications to phase space dynamics, a conceptual ocean circulation model, and a longitudinal clinical dataset. These results demonstrate that rare event dynamics can be accurately characterized without extensive sampling of the configuration space, establishing a general, flexible, and robust framework for analyzing complex dynamical systems and longitudinal datasets.

2506.01303 2026-03-04 cs.LG q-bio.NC

Dynamic Manifold Hopfield Networks for Context-Dependent Associative Memory

Chong Li, Taiping Zeng, Xiangyang Xue, Jianfeng Feng

详情
英文摘要

Neural population activity in cortical and hippocampal circuits can be flexibly reorganized by context, suggesting that cognition relies on dynamic manifolds rather than static representations. However, how such dynamic organization can be realized mechanistically within a unified dynamical system remains unclear. Continuous Hopfield networks provide a classical attractor framework in which neural dynamics follow gradient descent on a fixed energy landscape, constraining retrieval within a static attractor manifold geometry. Extending this approach, we introduce Dynamic Manifold Hopfield Networks (DMHN), continuous dynamical models in which contextual modulation dynamically reshapes attractor geometry, transforming a static attractor manifold into a context-dependent family of neural manifolds. In DMHN, network interactions are learned in a data-driven manner, to intrinsically deform the geometry of its attractor manifold across cues without explicit context-specific parameterization. As a result, in associative retrieval, DMHN achieve substantially higher capacity and robustness than classical and modern Hopfield networks: when storing $2N$ patterns in a network of $N$ neurons, DMHN attain reliable retrieval with an average accuracy of 64%, compared with 1% and 13% for classical and modern variants, respectively. Together, these results establish dynamic reorganization of attractor manifold geometry as a principled mechanism for context-dependent remapping in neural associative memory.

2504.10525 2026-03-04 q-bio.QM cs.CL cs.IR

BioChemInsight: An Online Platform for Automated Extraction of Chemical Structures and Activity Data from Patents

Zhe Wang, Fangtian Fu, Wei Zhang, Lige Yan, Nan Li, Wenxia Deng, Yan Meng, Jianping Wu, Hui Wu, Wenting Wu, Gang Xu, Xiang Li, Si Chen

Comments 21 pages, 7 figures

详情
英文摘要

The automated extraction of chemical structures and their corresponding bioactivity data is essential for accelerating drug discovery and enabling data-driven research. Current optical chemical structure recognition tools lack the capability to autonomously link molecular structures with their bioactivity profiles, posing a significant bottleneck in structure-activity relationship analysis. To address this, we present BioChemInsight, an open-source pipeline that integrates DECIMER Segmentation with MolNexTR for chemical structure recognition, GLM-4.5V for compound identifier association, and PaddleOCR combined with GLM-4.6 for bioactivity extraction and unit normalization. We evaluated BioChemInsight on 181 patents covering 15 therapeutic targets. The system achieved an average extraction accuracy of above 90% across three key tasks: chemical structure recognition, bioactivity data extraction, and compound identifier association. Our analysis indicates that the chemical space covered by patents is largely complementary to that contained in established public database ChEMBL. Consequently, by enabling systematic patent mining, BioChemInsight provides access to chemical information underrepresented in ChEMBL. This capability expands the landscape of explorable compound-target interactions, enriches the data foundation for quantitative structure-activity relationship modeling and targeted screening, and reduces data preprocessing time from weeks to hours. BioChemInsight is available at https://github.com/dahuilangda/BioChemInsight.

2407.01656 2026-03-04 cs.LG cond-mat.dis-nn physics.data-an q-bio.NC stat.ML

Absolute abstraction: a renormalisation group approach

Carlo Orientale Caputo, Elias Seiffert, Enrico Frausin, Matteo Marsili

Comments 35 pages, 6 figures

详情
英文摘要

Abstraction is the process of extracting the essential features from raw data while ignoring irrelevant details. It is well known that abstraction emerges with depth in neural networks, where deep layers capture abstract characteristics of data by combining lower level features encoded in shallow layers (e.g. edges). Yet we argue that depth alone is not enough to develop truly abstract representations. We advocate that the level of abstraction crucially depends on how broad the training set is. We address the issue within a renormalisation group approach where a representation is expanded to encompass a broader set of data. We take the unique fixed point of this transformation -- the Hierarchical Feature Model -- as a candidate for a representation which is absolutely abstract. This theoretical picture is tested in numerical experiments based on Deep Belief Networks and auto-encoders trained on data of different breadth. These show that representations in neural networks approach the Hierarchical Feature Model as the data get broader and as depth increases, in agreement with theoretical predictions.

2603.02708 2026-03-04 physics.bio-ph q-bio.SC

All-optical intracellular thermal profiling using nanodiamond-based "thermal radar"

Jiahua Zhang, Yong Hou, Xinhao Hu, Yicheng Wang, Madoka Suzuki, Bo Gao, Zhiqin Chu

详情
英文摘要

The local thermal conductivity (\k{appa}) is a pivotal biophysical parameter, governing intracellular heat flux and underlying functional processes like metabolic regulation and stress response. However, label-free mapping with sub-micron resolution in living cells remains challenge. Here, we present frequency-domain fluorescence thermometry (FD-FTM), an all-optical method based on a hybrid nanodiamond-on-gold-membrane platform, which enables quantitative mapping of \k{appa} in biological systems. Fluorescence nanodiamonds (FNDs) are deposited on substrates coated with a 50 nm gold membrane, where FNDs function as nanoscale thermometers, and the gold membrane serves as a photothermal heat source. We validate FD-FTM across reference materials and biological media, with fitting uncertainties of ~10%. By varying the modulation frequency, we tune the thermal penetration depths, enabling controlled heat propagation from the substrate to the cell nucleus. The method delivers sensitivity sufficient to resolve changes in biofluid thermal conductivity on the order of 16% relative to water. Using these capabilities, we demonstrate non-invasive thermal profiling across scales: at the cellular level, nuclear chromatin packing yields \k{appa} higher by ~10% relative to the cytoplasm; at the organelle level, we resolve \k{appa} variations associated with protein aggregates formed during liquid-liquid phase separation in an amyotrophic lateral sclerosis disease model. Temporal measurements in living cells over 30 minutes further reveal spatially resolved intracellular responses to osmotic stress, linking nanoscale thermal dynamics to biomolecular condensates. These results establish FD-FTM as a label-free, robust, and quantitative platform for thermally decoding intracellular processes, opening avenues for studying metabolic heterogeneity, disease mechanisms, and therapeutic responses.

2603.02665 2026-03-04 q-bio.QM physics.bio-ph

Stochastic modeling of long-legged ant A. gracilipes locomotion in laboratory experiments

Jack Featherstone, Anouk Béraud, Meta Virant-Doberlet, Antonio Celani, Mahesh Bandi

详情
英文摘要

Stochastic modeling of movement behavior provides a valuable way to understand how complex motion can be generated from relatively simple building blocks. Ants demonstrate sophisticated social behavior ranging from foraging to nest relocation; while emphasis is often placed on the communication methods used to synchronize individuals, the movement paradigms of those individuals are of tantamount importance. Here, we apply a stochastic modeling approach to better understand the movement of isolated long-legged ant (A. gracilipes) specimens, informed by extensive laboratory tracking experiments. We find that a combination of active Brownian and run-and-tumble models reproduces the trajectory statistics observed in experiments, both qualitatively and quantitatively. We identify reproducible probability distributions for the turn angles, run times, and waiting times across specimens, and find good agreement between analytical predictions and quantities empirically measured from the trajectories. Having such a model allows for a better understanding and predictions of movement ecology from both simulations and analytics, and even can give insight into the underlying generative mechanisms of motion and the ants' sensory systems.

2603.02572 2026-03-04 cs.CE q-bio.BM

Molecular Dynamics Simulations Reveal PolyQ-Length-Dependent Conformational Changes in Huntingtin Exon-1: Implications for Environmental Co-Solvent Modulation of Aggregation-Prone States

Jai Geddes-Nelson, Xiaochen Liu, Ken-Tye Yong

详情
英文摘要

Huntington's disease (HD) is caused by CAG-repeat expansion in HTT, which lengthens the polyglutamine (polyQ) tract in huntingtin (HTT) and promotes misfolding and aggregation. While polyQ-length-dependent aggregation is well established, the atomistic conformational dynamics preceding aggregation remain less defined. Here we perform all-atom molecular dynamics simulations of HTT exon-1 constructs containing the N17 domain, polyQ tracts of clinically relevant lengths (Q21, wildtype; Q40, adult onset threshold; Q70, juvenile onset), and the polyproline (polyP) region. Multi-copy simulations (four chains) were run for 100 ns in explicit SPC/E water using the OPLS-AA force field. We quantified radius of gyration (Rg), solvent-accessible surface area (SASA), root-mean-square deviation (RMSD), and intra-protein hydrogen bonds as proxies for conformational expansion and aggregation propensity. PolyQ expansion drove progressive increases in Rg and SASA, consistent with more extended, solvent-exposed ensembles. We further tested organic co-solvents (methanol, hexane, trichloroethylene; 0.5 to 1.0 M), which modulated these landscapes in a solvent-dependent manner. Trichloroethylene induced marked expansion in Q21 and Q40, whereas methanol produced mild compaction in Q21. To our knowledge, this is the first MD study to systematically examine co-solvent effects on HTT exon-1 conformational dynamics. Although limited sampling precludes definitive mechanistic conclusions, the observed trends suggest that hydrophobic co-solvents can bias HTT exon-1 toward more expanded ensembles, motivating computational studies of gene-environment modulation in HD.

2603.02496 2026-03-04 q-bio.PE

Recombination Rate Modifiers under Stochastic Transmission

Elisa Heinrich-Mora, Marcus W. Feldman

Comments 23 pages, 4 figures

详情
英文摘要

The Reduction Principle states that, near a stable equilibrium under fixed viability selection, a selectively neutral modifier allele that reduces recombination rate among selected loci is favored, whereas one that increases recombination rate is eliminated. This result assumes constant transmission parameters across generations, so that invasion is determined by the dominant eigenvalue of a single transmission-selection matrix. Here we analyze a minimal departure from this framework. In a diploid model, two loci experience symmetric multiplicative viability selection and a third, neutral locus modifies their recombination rate. All parameters are fixed except that recombination in modifier heterozygotes varies randomly across generations according to a stochastic process. When the recombination rate in modifier heterozygotes is constant, the Reduction Principle holds exactly: invasion occurs if the rare modifier allele reduces recombination relative to the resident rate. When recombination varies randomly across generations, invasion is governed by the top Lyapunov exponent of a product of random matrices. We show that temporal variation in recombination rate alone, in the absence of fluctuating viability selection, can reverse the direction of selection on the modifier locus predicted by the deterministic model. The mean recombination rate is insufficient to determine invasion of $M_2$; instead, outcomes depend on the full distribution of recombination rates and their ordered accumulation across generations. Parameters that affect only the magnitude of selection under constant transmission - including resident recombination, selection strength, and background linkage - can alter its sign under stochastic transmission. These results demonstrate that temporal variability in transmission constitutes an independent and qualitatively distinct force in the evolution of recombination rates.

2603.02461 2026-03-04 q-bio.NC

Understanding Decision-Making Across the Lifespan Needs Theoretical Neuroscience

Michael B. Ryan, Letizia Ye, Anne K. Churchland

详情
英文摘要

Understanding how decision making changes across the lifespan is a central challenge for neuroscience, yet research on cognitive aging has remained largely disconnected from the theoretical and computational advances that now shape modern systems neuroscience. Over the past two decades, theoretical frameworks have transformed how we study cognition in young, healthy brains, providing principled tools to model latent decision states, neural dynamics, population codes, and interareal communication. In contrast, aging research has often relied on single metric behavioral readouts, cross sectional comparisons, and descriptive neural analyses, limiting our ability to explain fundamental differences in individual aging trajectories. This gap represents a missed opportunity because aging offers a powerful platform for testing theories of neural computation, stability, and flexibility under changing biological constraints. Here, we argue that closer integration between aging research and contemporary theoretical neuroscience can move the field beyond descriptive accounts toward more mechanistic explanations of decision making across the lifespan. To this end, we outline how recent advances in behavioral quantification, latent state modeling, dynamical systems, encoding models, representational geometry, and recurrent neural networks offer a rich theoretical toolkit for neuroscientists studying decision making across the lifespan.

2603.02403 2026-03-04 q-bio.PE cond-mat.stat-mech physics.bio-ph

Fluctuating environments are sufficient to drive substantial variability in species abundance across locations

James F. D. Henderson, Andreas Tiffeau-Mayer

Comments 19 pages, 11 figures

详情
英文摘要

Species growing in environments that change in time and space will vary in their abundance across locations, even in the absence of persistent location preferences. Here we quantify this non-equilibrium effect by studying a minimal model of a spatially compartmentalised community with time-averaged-neutral competition but location-dependent environmental fluctuations. We analytically derive distributions of two-point inequality, defined as the log-ratio of a species' abundance across a pair of locations. We characterise how the balance of relaxation via migration and fluctuation strength determine the bulk and extreme value statistics of these distributions in the two-patch and infinite-patch cases. We demonstrate the existence of a noise-induced transition to bimodal inequality, which depends on the correlation timescale of the environmental fluctuations. Finally, we discuss the evolutionary benefit of finite migration rates in environments with temporal correlations.

2603.02402 2026-03-04 q-bio.GN

GPU-accelerated single-cell analysis at scale with rapids-singlecell

Severin Dicks, Lukas Heumos, Lilly May, Sara Jimenez, Philipp Angerer, Ilan Gold, Isaac Virshup, Felix Fischer, Michelle Gill, Melanie Boerries, Corey J Nolet, Tiffany J. Chen, Fabian J. Theis

详情
英文摘要

Single-cell sequencing technologies reveal cellular heterogeneity at high resolution, advancing our understanding of biological complexity. As datasets start to scale to tens of millions of cells, computational workflows face substantial bottlenecks, with CPU-based analytical pipelines requiring hours or days for routine processing steps like filtering, normalization, and clustering. These scalability limitations fundamentally restrict common interactive data exploration and iterative hypothesis testing. Here we introduce rapids-singlecell, a GPU-accelerated framework that integrates natively with the scverse ecosystem and operates directly on the AnnData data structure, which delivers orders-of-magnitude speedups for single-cell workflows. Built on CuPy arrays and the NVIDIA CUDA-X Data Science (RAPIDS) ecosystem, rapids-singlecell provides near drop-in GPU replacements for core scanpy-based analysis steps. Across standard single-cell workflows such as preprocessing, dimensionality reduction, neighborhood graph construction, clustering, and batch correction, rapids-singlecell achieves speedups of up to several hundred-fold compared to optimized CPU baselines. This reduces analysis time from hours to minutes on standard hardware, while maintaining consistent biological interpretations. These performance improvements make it possible to analyze large data sets in close to real time, without the need for data splitting. Together with real-time parameter tuning and iterative workflows, rapids-singlecell makes interactive large-scale single-cell analysis possible.

2603.02290 2026-03-04 q-bio.OT

Sunflower nested core collections for association studies and phenomics

Marie Coque, Sébastien Mesnildrey, Michel Romestant, Bruno Grezes-Besset, Félicity Vear, Nicolas B Langlade, Patrick Vincourt

详情
Journal ref
17th International Sunflower Conference, International Sunflower Association, Jun 2008, Cordoba (Cordoue), Spain
英文摘要

In order to develop association studies and to improve the phenotypic description for abiotic and biotic stress related traits, nested core collections of 48, 96, 144 and 384 sunflower lines were built from a set of 752 diverse, public or private accessions. These 752 lines were been genotyped with 51 SSR markers covering the genetic map (3 markers/linkage group). We then used MSTRAT software (Gouesnard et al., 2000) to construct 4 nested core collections as follows: we built a first core with 48 public lines and a kernel of 12 selected entries, accounting for 47% of the total diversity. This short core collection was then used as a kernel to define a second core with 96 public entries, accounting for 59% of the total diversity. Finally the private entries were added to build a core collection of 144 and 384 entries, accounting for 78% and 100% of the total diversity, respectively.

2603.02283 2026-03-04 q-bio.BM

Fast and Versatile RNA Design via Motif-level Divide-and-Conquer and Structure-level Rival Search

Tianshuo Zhou, David H. Mathews, Liang Huang

详情
英文摘要

RNA design aims to identify RNA sequences that fold into a target secondary structure. This task is challenging in terms of computational efficiency. Most existing methods focus on either minimum free energy (MFE)-based or ensemble-based metrics, leaving a gap for a unified approach that performs well across both. We introduce a fast and versatile RNA design algorithm inspired by our previous work on the undesignability of RNA structures and motifs (i.e., sets of contiguous structural loops). Our approach decomposes a target structure into a tree of sub-targets where each leaf node corresponds to a motif and each internal node corresponds to a substructure. We first design partial sequences for each motif, then these partial sequences are selectively and recursively combined via the cube pruning strategy borrowed from computational linguistics, enabling effective optimization of ensemble-based metrics. Finally, a novel whole-structure rival search further refines sequences to suppress misfolded alternatives and enhance MFE-based performance. Our method is highly efficient and also achieves state-of-the-art results on native RNAsolo structures and the Eterna100 benchmark, excelling in both ensemble- and MFE-based metrics. Additionally, it substantially improves the design of long-structure benchmark derived from 16S rRNA, increasing average folding probability from 0.18 to 0.39 with an order-of-magnitude speedup, demonstrating its effectiveness and scalability. Availability: Source code and data are available at: https://github.com/shanry/FastDesign.

2603.02241 2026-03-04 q-bio.NC cs.AI

A Benchmark Analysis of Graph and Non-Graph Methods for Caenorhabditis Elegans Neuron Classification

Jingqi Lu, Keqi Han, Yun Wang, Lu Mi, Carl Yang

详情
英文摘要

This study establishes a benchmark for Caenorhabditis elegans neuron classification, comparing four graph methods (GCN, GraphSAGE, GAT, GraphTransformer) against four non-graph methods (Logistic Regression, MLP, LOLCAT, NeuPRINT). Using the functional connectome, we classified Sensory, Interneuron, and Motor neurons based on Spatial, Connection, and Neuronal Activity features. Results show that attention-based GNNs significantly outperform baselines on the Spatial and Connection features. The Neuronal Activity features yielded poor performance, likely due to the low temporal resolution of the underlying neuronal activity data. Our benchmark validates the use of GNNs and highlights that Spatial and Connection features are key predictors for Caenorhabditis elegans neuron classes. Code is available at: https://github.com/JingqiLuu/neuronclf-gnn-benchmark.

2603.02213 2026-03-04 cs.CL cond-mat.stat-mech q-bio.GN

A Zipf-preserving, long-range correlated surrogate for written language and other symbolic sequences

Marcelo A. Montemurro, Mirko Degli Esposti

详情
Journal ref
Physica A 683 (2026) 131227
英文摘要

Symbolic sequences such as written language and genomic DNA display characteristic frequency distributions and long-range correlations extending over many symbols. In language, this takes the form of Zipf's law for word frequencies together with persistent correlations spanning hundreds or thousands of tokens, while in DNA it is reflected in nucleotide composition and long-memory walks under purine-pyrimidine mappings. Existing surrogate models usually preserve either the frequency distribution or the correlation properties, but not both simultaneously. We introduce a surrogate model that retains both constraints: it preserves the empirical symbol frequencies of the original sequence and reproduces its long-range correlation structure, quantified by the detrended fluctuation analysis (DFA) exponent. Our method generates surrogates of symbolic sequences by mapping fractional Gaussian noise (FGN) onto the empirical histogram through a frequency-preserving assignment. The resulting surrogates match the original in first-order statistics and long-range scaling while randomising short-range dependencies. We validate the model on representative texts in English and Latin, and illustrate its broader applicability with genomic DNA, showing that base composition and DFA scaling are reproduced. This approach provides a principled tool for disentangling structural features of symbolic systems and for testing hypotheses on the origin of scaling laws and memory effects across language, DNA, and other symbolic domains.

2602.12410 2026-03-04 eess.IV cs.CV q-bio.NC

Proceedings for the Inaugural Meeting of the International Society for Tractography -- IST 2025 Bordeaux

Flavio Dell Acqua, Maxime Descoteaux, Graham Little, Laurent Petit, Dogu Baran Aydogan, Stephanie Forkel, Alexander Leemans, Simona Schiavi, Michel Thiebaut de Schotten

Comments Proceedings of the Inaugural Conference of the International Society for Tractography (IST Conference 2025). Held at the Institut des Maladies Neurodégénératives in Bordeaux, France, October 13-16, 2025. Society website: https://www.tractography.io

详情
英文摘要

This collection comprises the abstracts presented during poster, power pitch and oral sessions at the Inaugural Conference of the International Society for Tractography (IST Conference 2025), held in Bordeaux, France, from October 13-16, 2025. The conference was designed to foster meaningful exchange and collaboration between disparate fields. The overall focus was on advancing research, innovation, and community in the common fields of interest: neuroanatomy, tractography methods and scientific/clinical applications of tractography. The included abstracts cover the latest advancements in tractography, Diffusion MRI, and related fields including new work on; neurological and psychiatric disorders, deep brain stimulation targeting, and brain development. This landmark event brought together world-leading experts to discuss critical challenges and chart the future direction of the field.

2509.22997 2026-03-04 physics.bio-ph q-bio.CB

Design principles of the cytotoxic CD8+ T-cell response

Obinna A. Ukogu, Zachary Montague, Grégoire Altan-Bonnet, Armita Nourmohammad

Comments 13 pages, 6 figures

详情
英文摘要

Cytotoxic T lymphocytes eliminate infected or malignant cells, safeguarding surrounding tissues. Although experimental and systems-immunology studies have cataloged many molecular and cellular actors involved in an immune response, the design principles governing how the speed and magnitude of T-cell responses emerge from cellular decision-making remain elusive. Here, we recast the T-cell response as a feedback-controlled program, wherein the rates of activation, proliferation, differentiation and death are regulated through antigenic, pro- and anti-inflammatory cues. By exploring a broad class of feedback-controller designs as potential immune programs, we demonstrate how the speed and magnitude of T-cell responses emerge from optimizing signal-feedback to protect against diverse infection settings. We recover an inherent trade-off: infection clearance at the cost of immunopathology. We show how this trade-off is encoded into the logic of T-cell responses by hierarchical sensitivity to different immune signals. Notably, we find that designs that balance harm from acute infections and autoimmunity produce immune responses consistent with experimentally observed patterns of T-cell effector expansion in mice. Extending our model to immune-based T-cell therapies for cancer tumors, we identify a trade-off between the affinity for tumor antigens ("quality") and the abundance ("quantity") of infused T-cells necessary for effective treatment. Finally, we show how therapeutic efficacy can be improved by targeted genetic perturbations to T-cells. Our findings offer a unified control-logic for cytotoxic T-cell responses and point to specific regulatory programs that can be engineered for more robust T-cell therapies.

2503.20513 2026-03-04 q-bio.BM q-bio.QM stat.AP

A Principal Submanifold-based Approach for Clustering and Multiscale RNA Correction

Menghao Wu, Zhigang Yao

Comments 30 pages, 15 figures

详情
英文摘要

RNA structure determination is essential for understanding its biological functions. However, the reconstruction process often faces challenges, such as atomic clashes, which can lead to inaccurate models. To address these challenges, we introduce the principal submanifold (PSM) approach for analyzing RNA data on a torus. This method provides an accurate, low-dimensional feature representation, overcoming the limitations of previous torus-based methods. By combining PSM with DBSCAN, we propose a novel clustering technique, the principal submanifold-based DBSCAN (PSM-DBSCAN). Our approach achieves superior clustering accuracy and increased robustness to noise. Additionally, we apply this new method for multiscale corrections, effectively resolving RNA backbone clashes at both microscopic and mesoscopic scales. Extensive simulations and comparative studies highlight the enhanced precision and scalability of our method, demonstrating significant improvements over existing approaches. The proposed methodology offers a robust foundation for correcting complex RNA structures and has broad implications for applications in structural biology and bioinformatics.

2406.10921 2026-03-04 q-bio.QM

Improved absolute abundance estimates from spatial count data with simulation and microfossil case studies

Chris Mays, Marcos Amores, Anthony Mays

Comments 103 pages, of which 67 pages constitutes the main text, and the remaining 36 pages are Supporting Information. 9 figures. The code for the related simulations can be found via a link in the manuscript

详情
Journal ref
PLoS One, 2005, 20(5): e0320887
英文摘要

Many fundamental parameters of biological systems -- eg. productivity, population sizes and biomass -- are most effectively expressed in absolute terms. In contrast to proportional data (eg. percentages), absolute values provide standardised metrics on the functioning of biological entities (eg. organisism, species, ecosystems). These are particularly valuable when comparing assemblages across time and space. Since it is almost always impractical to count entire populations, estimates of population abundances require a sampling method that is both accurate and precise. Such absolute abundance estimates typically entail more "sampling effort" (data collection time) than proportional data. Here we refined a method of absolute abundance estimates -- the "exotic marker technique" -- by producing a variant that is more efficient without losing accuracy. This new method, the "field-of-view subsampling method" (FOVS method) is based on area subsampling, from which large samples can be quickly extrapolated. Two case studies of the exotic marker technique were employed: 1, computer simulations; and 2, an observational "real world" data set of terrestrial organic microfossils from the Permian- and Triassic-aged rock strata of southeastern Australia, spiked with marker grains of known quantity and variance. We compared the FOVS method against the traditional "linear method" method using three metrics: 1, concentration (specimens/gram of sediment); 2, precision and 3, data collection effort. In almost all cases, the FOVS method delivers higher precision than the linear method, with equivalent effort, and our computer simulations suggest that the FOVS method more accurately estimates the true error for large target-to-marker ratios. Since we predict that these conditions are typically common, we recommend the new FOVS method in almost every "real world" case.