arXivDaily arXiv每日学术速递 周一至周五更新
重置
2602.24258 2026-03-02 q-bio.QM

A model of tuberculosis progression using CompuCell3D

James W. G. Doran, Christopher F. Rowlatt, Gibin G. Powathil, Ruth Bowness, Christian A. Yates

详情
英文摘要

Tuberculosis (TB) is an airborne disease caused by the bacterium Mycobacterium tuberculosis (M. tb). Prior to the COVID-19 pandemic, TB was the leading cause of death from an infectious agent globally. However, most people exposed to M. tb do not develop active TB and go on to display symptoms. Instead, in the majority of cases, the bacteria are contained within a granuloma (an aggregation of immune cells) without being eliminated; this is called latent TB. The spatial organisation of the bacteria and immune cells is important in determining whether an individual exposed to M. tb will develop latent or active TB. In this paper, we present a multi-cell, multiscale model of TB progression to investigate the importance of the spatial organisation. This is a novel TB within-host dynamics modelling framework, having been developed using CompuCell3D (CC3D), an open-source computer software used for simulating cellular biological processes both within and between cells. We used this model to compare the generated results with those from a previously developed within-host infectious disease model. We found that, although the results of our CC3D model mostly agree qualitatively with those from the previously developed model, there are quantitative differences. Additionally, we conducted a robustness analysis of key model parameters from the CC3D model to determine their importance to the CC3D model output, using a methodology specifically designed for agent-based models. The model output appears to be robust in response to perturbations in parameters controlling chemotactic movement, but less so in response to perturbations in parameters controlling persistence of movement in cells, cell adhesion and volume constraints. This work compares our CC3D model of TB progression with another agent-based modelling approach to the same problem.

2602.24106 2026-03-02 q-bio.PE math.PR

The interplay of selection and dormancy in a Moran model can lead to coexistence of types

Jochen Blath, Baptiste Le Duigou, András Tóbiás

Comments 36 pages, 5 figures

详情
英文摘要

In this paper we propose a Moran model that describes the population dynamics of two types: While the first type has a selective advantage during reproduction, the second type can avoid replacement during reproduction with some positive probability by switching temporarily into a dormant state. We investigate the interplay of both evolutionary strategies by studying the invasion dynamics of the dormant type into the resident (selectively advantageous) population in the large population limit of the system. It turns out that the dormancy trait can not only invade and subsequently fixate under suitable parameter assumptions despite its selective disadvantage (a phenomenon that has already been observed in a related context in Blath and Tóbiás (2020)), but that there is also a novel regime of stable coexistence of both types due to a frequency-dependent balancing effect that did not arise in the previous setup with Lotka--Volterra type symmetric competition. The emergence of a coexistence regime here rests in part on specific properties of the Moran modelling framework, in particular its fixed overall population size that enforces instant re-colonization after death events, as well as on the (positive) mortality and resuscitation rates of the dormant state. We provide heuristic explanations for the observed types of behaviour and the corresponding proofs, which involve comparisons to suitable branching processes, approximations by dynamical systems, and an analysis of asymptotic behaviour of the latter.

2602.13368 2026-03-02 q-bio.NC

The Influence of Width Ratios on Structural Beauty in Male Faces

Theresa Tennstedt, Benjamin Knopp, Dominik Endres

详情
英文摘要

This study investigates the relationship between interocular distance relative to overall facial width (width ratio) and perceived subjective beauty in male faces. Building on the methodology of Pallett et al. (2010), who found that average proportions in female faces were rated as most attractive, the current study aimed to test this hypothesis in male faces. Faces from the Chicago Face Database (Ma et al., 2015) were morphed into average faces within three groups (with low, medium, and high width ratios), each composed of 96 or 97 individual images. These three average faces were then systematically manipulated in their width ratios across three levels in both directions, respectively, resulting in a total of 21 comparable faces. The use of multiple base faces served as a control for potential artifacts of image processing. Consequently, comparisons were restricted to within-group pairs to avoid confounding by co-varying facial features (e.g., skin tone), which precluded direct cross-condition comparisons but ensured internal validity. In a two-alternative forced-choice task, participants selected the more beautiful face from each pair. The data were analyzed using a Bayesian model which enables inference of the width ratio perceived as most beautiful. Results support the hypothesis that averageness in facial proportions correlates with higher perceived attractiveness. The study highlights the importance of controlling for image manipulation, including attempts at methodological implementation, and of considering ethnicity as a potential moderating variable. These findings offer a data-driven foundation for understanding facial aesthetics and cognitive processes of human perception, with applications in advertising, artificial face generation, and plastic surgery.

2601.17582 2026-03-02 q-bio.QM cs.AI cs.LG cs.SY eess.SY q-bio.MN

GenAI-Net: A Generative AI Framework for Automated Biomolecular Network Design

Maurice Filo, Nicolò Rossi, Zhou Fang, Mustafa Khammash

详情
英文摘要

Biomolecular networks underpin emerging technologies in synthetic biology-from robust biomanufacturing and metabolic engineering to smart therapeutics and cell-based diagnostics-and also provide a mechanistic language for understanding complex dynamics in natural and ecological systems. Yet designing chemical reaction networks (CRNs) that implement a desired dynamical function remains largely manual: while a proposed network can be checked by simulation, the reverse problem of discovering a network from a behavioral specification is difficult, requiring substantial human insight to navigate a vast space of topologies and kinetic parameters with nonlinear and possibly stochastic dynamics. Here we introduce GenAI-Net, a generative AI framework that automates CRN design by coupling an agent that proposes reactions to simulation-based evaluation defined by a user-specified objective. GenAI-Net efficiently produces novel, topologically diverse solutions across multiple design tasks, including dose responses, complex logic gates, classifiers, oscillators, and robust perfect adaptation in deterministic and stochastic settings (including noise reduction). By turning specifications into families of circuit candidates and reusable motifs, GenAI-Net provides a general route to programmable biomolecular circuit design and accelerates the translation from desired function to implementable mechanisms.

2512.00306 2026-03-02 q-bio.CB cs.AI cs.LG

VCWorld: A Biological World Model for Virtual Cell Simulation

Zhijian Wei, Runze Ma, Zichen Wang, Zhongmin Li, Shuotong Song, Shuangjia Zheng

Comments Accepted at ICLR 2026

详情
英文摘要

Virtual cell modeling aims to predict cellular responses to perturbations. Existing virtual cell models rely heavily on large-scale single-cell datasets, learning explicit mappings between gene expression and perturbations. Although recent models attempt to incorporate multi-source biological information, their generalization remains constrained by data quality, coverage, and batch effects. More critically, these models often function as black boxes, offering predictions without interpretability or consistency with biological principles, which undermines their credibility in scientific research. To address these challenges, we present VCWorld, a cell-level white-box simulator that integrates structured biological knowledge with the iterative reasoning capabilities of large language models to instantiate a biological world model. VCWorld operates in a data-efficient manner to reproduce perturbation-induced signaling cascades and generates interpretable, stepwise predictions alongside explicit mechanistic hypotheses. In drug perturbation benchmarks, VCWorld achieves state-of-the-art predictive performance, and the inferred mechanistic pathways are consistent with publicly available biological evidence.

2511.13550 2026-03-02 q-bio.BM physics.comp-ph

MDIntrinsicDimension: Dimensionality-Based Analysis of Collective Motions in Macromolecules from Molecular Dynamics Trajectories

Irene Cazzaniga, Toni Giorgino

Comments Published version

详情
英文摘要

Molecular dynamics (MD) simulations provide atomistic insights into the structure, dynamics, and function of biomolecules by generating time-resolved, high-dimensional trajectories. Analyzing such data benefits from estimating the minimal number of variables required to describe the explored conformational manifold, known as the intrinsic dimension (ID). We present MDIntrinsicDimension, an open-source Python package that estimates ID directly from MD trajectories by combining rotation- and translation-invariant molecular projections (e.g., backbone dihedrals and inter-residue distances) with state-of-the-art estimators. The package provides three complementary analysis modes: whole-molecule ID; sliding windows along the sequence; and per-secondary-structure elements. It computes both overall ID (a single summary value) and instantaneous, time-resolved ID that can reveal transitions and heterogeneity over time. We illustrate the approach on fast folding-unfolding trajectories from the DESRES dataset, demonstrating that ID complements conventional geometric descriptors by highlighting spatially localized flexibility and differences across structural segments.

2508.11724 2026-03-02 q-bio.QM cs.CV

BeeNet: Reconstructing Flower Shapes from Electric Fields using Deep Learning

Jake Turley, Ryan A. Palmer, Isaac V. Chenchiah, Daniel Robert

Comments 14 pages, 4 figures

详情
英文摘要

Pollinating insects can obtain information from electric fields arising from flowers. The density and usefulness of electric information remain unknown. Here, we show that electric information can be used to reconstruct geometrical features of the field source. We develop an algorithm that infers the shapes of polarisable flowers from the electric field generated in response to a nearby charged arthropod. We computed the electric fields arising from arthropod flower interactions for varying petal geometries, and used these data to train a deep learning U Net model to recreate the floral shapes. The model accurately reconstructed diverse shapes, including more complex flower morphologies not included in training. Reconstruction performance peaked at an optimal arthropod flower distance, indicating distance dependent encoding of shape information. These findings indicate that electroreception can impart rich spatial detail, offering insights into the electric ecology of arthropods. Together, this work introduces a deep learning framework for solving the inverse electrostatic imaging problem, enabling object shape reconstruction directly from measured electric fields.

2507.09513 2026-03-02 q-bio.NC cs.CV

Animal behavioral analysis and neural encoding with transformer-based self-supervised pretraining

Yanchen Wang, Han Yu, Ari Blau, Yizi Zhang, The International Brain Laboratory, Liam Paninski, Cole Hurwitz, Matt Whiteway

详情
英文摘要

The brain can only be fully understood through the lens of the behavior it generates -- a guiding principle in modern neuroscience research that nevertheless presents significant technical challenges. Many studies capture behavior with cameras, but video analysis approaches typically rely on specialized models requiring extensive labeled data. We address this limitation with BEAST(BEhavioral Analysis via Self-supervised pretraining of Transformers), a novel and scalable framework that pretrains experiment-specific vision transformers for diverse neuro-behavior analyses. BEAST combines masked autoencoding with temporal contrastive learning to effectively leverage unlabeled video data. Through comprehensive evaluation across multiple species, we demonstrate improved performance in three critical neuro-behavioral tasks: extracting behavioral features that correlate with neural activity, and pose estimation and action segmentation in both the single- and multi-animal settings. Our method establishes a powerful and versatile backbone model that accelerates behavioral analysis in scenarios where labeled data remains scarce.

2411.07821 2026-03-02 math.DS math.PR q-bio.PE

Sufficient condition for dispersal-induced growth on dynamic networks

Michel Benaïm, Claude Lobry, Tewfik Sari, Edouard Strickler

Comments 43 pages

详情
英文摘要

We consider a population spreading across a finite number of sites. Individuals can move from one site to the other according to a network (oriented links between the sites) that vary periodically over time. On each site, the population experiences a growth rate which is also periodically time varying. Recently, this kind of models have been extensively studied, using various technical tools to derive precise necessary and sufficient conditions on the parameters of the system (ie the local growth rate on each site, the time period and the strength of migration between the sites) for the population to grow. In the present paper, we take a completely different approach: using elementary comparison results between linear systems, we give sufficient condition for the growth of the population This condition is easy to check and can be applied in a broad class of examples. In particular, in the case when all sites are sinks (ie, in the absence of migration, the population become extinct in each site), we prove that when our condition of growth if satisfied, the population grows when the time period is large and for values of the migration strength that are exponentially small with respect to the time period, which answers positively to a conjecture stated by Katriel.

2602.23488 2026-03-02 q-bio.OT

A Mathematical Model for Chemotherapy, Immunotherapy and Virotherapy Treatments of Cancer

Tarini Kumar Dutta, Silmera A Sangma, Janice Moore, Meir Shillor

详情
英文摘要

We continue our study of a model for cancer treatment, constructed in Dutta et. al., 2025, by adding Virotherapy to the Chemotherapy and Immunotherapy studied there. It is a dynamical system model for the spread of cancer in healthy tissue. It allows computer experiments of various combinations of the three modalities, which cannot be performed in the laboratory or experimentally. The novelty is the addition of Virotherapy. The analysis shows that the model solutions exist, are bounded, and nonnegative on each finite time interval, thus biologically feasible. A time-stepping algorithm is constructed and implemented, and computer simulations are presented. The simulations show the development of the disease under various treatment options, including a baseline case without treatment, cases for each of the three treatments separately, and some combinations of the three treatments. These simulations indicate that combinations of treatments are more effective. However, we do not consider any limitation or incompatibilities of the joint application of the three modalities, that may exist in practice. Once validated in the field, the model can be used to design treatment schedules of combinations of the three modalities for improved outcomes.

2602.23459 2026-03-02 cs.LG q-bio.QM stat.ML

Global Interpretability via Automated Preprocessing: A Framework Inspired by Psychiatric Questionnaires

Eric V. Strobl

详情
英文摘要

Psychiatric questionnaires are highly context sensitive and often only weakly predict subsequent symptom severity, which makes the prognostic relationship difficult to learn. Although flexible nonlinear models can improve predictive accuracy, their limited interpretability can erode clinical trust. In fields such as imaging and omics, investigators commonly address visit- and instrument-specific artifacts by extracting stable signal through preprocessing and then fitting an interpretable linear model. We adopt the same strategy for questionnaire data by decoupling preprocessing from prediction: we restrict nonlinear capacity to a baseline preprocessing module that estimates stable item values, and then learn a linear mapping from these stabilized baseline items to future severity. We refer to this two-stage method as REFINE (Redundancy-Exploiting Follow-up-Informed Nonlinear Enhancement), which concentrates nonlinearity in preprocessing while keeping the prognostic relationship transparently linear and therefore globally interpretable through a coefficient matrix, rather than through post hoc local attributions. In experiments, REFINE outperforms other interpretable approaches while preserving clear global attribution of prognostic factors across psychiatric and non-psychiatric longitudinal prediction tasks.

2602.23396 2026-03-02 q-bio.MN cs.LG

Complex Networks and the Drug Repositioning Problem

Felipe Bivort Haiek

详情
英文摘要

In this Master's thesis, the graph properties of a multi-level drug-protein network are studied, as well as how the network's shape has informed discoveries over the years, identifying primarily crawling discoveries and a smaller number of hopping discoveries. Finally, the network structure is used to inform a network diffusion recommendation system and to prioritize existing drugs for repurposing against proteins in organisms that cause Neglected Tropical Diseases.

2602.23382 2026-03-02 q-bio.NC

Audited calibration under regime shift as a computational test of support-structured broadcast

Mark Walsh

Comments 12 pages, 1 table, 5 figures

详情
英文摘要

A central prediction of the accompanying theoretical framework is that metacognitive calibration can vary even when content-level performance is held approximately fixed, depending on whether support structure is preserved in a globally reusable broadcast state. We provide a minimal computational test of this claim using a two-channel probabilistic cue-integration task with regime shifts that induce systematic miscalibration in one channel. We compare content-dominated architectures, in which confidence is calibrated by a single global mapping from evidence strength to probability, to an auditor architecture that learns a regime-conditioned calibration mapping from an audit trail of outcomes. We then couple confidence to control by implementing a policy that either acts immediately or requests one additional sample when confidence falls below a threshold. Across matched evidence streams, the auditor substantially improves calibration, particularly in the degraded regime, and produces qualitatively different control behavior by selectively requesting additional evidence under low-support conditions. These results demonstrate a concrete, testable dissociation between content performance and system-level confidence and policy that arises from globally reusable support summaries.

2510.06091 2026-03-02 cs.LG cs.SY eess.SY q-bio.NC stat.ML

Learning Mixtures of Linear Dynamical Systems via Hybrid Tensor-EM Method

Lulu Gong, Shreya Saxena

Comments 24 pages, 14 figures

详情
英文摘要

Mixtures of linear dynamical systems (MoLDS) provide a path to model time-series data that exhibit diverse temporal dynamics across trajectories. However, its application remains challenging in complex and noisy settings, limiting its effectiveness for neural data analysis. Tensor-based moment methods can provide global identifiability guarantees for MoLDS, but their performance degrades under noise and complexity. Commonly used expectation-maximization (EM) methods offer flexibility in fitting latent models but are highly sensitive to initialization and prone to poor local minima. Here, we propose a tensor-based method that provides identifiability guarantees for learning MoLDS, which is followed by EM updates to combine the strengths of both approaches. The novelty in our approach lies in the construction of moment tensors using the input-output data to recover globally consistent estimates of mixture weights and system parameters. These estimates can then be refined through a Kalman EM algorithm, with closed-form updates for all LDS parameters. We validate our framework on synthetic benchmarks and real-world datasets. On synthetic data, the proposed Tensor-EM method achieves more reliable recovery and improved robustness compared to either pure tensor or randomly initialized EM methods. We then analyze neural recordings from the primate somatosensory cortex while a non-human primate performs reaches in different directions. Our method successfully models and clusters different conditions as separate subsystems, consistent with supervised single-LDS fits for each condition. Finally, we apply this approach to another neural dataset where monkeys perform a sequential reaching task. These results demonstrate that MoLDS provides an effective framework for modeling complex neural data, and that Tensor-EM is a reliable approach to MoLDS learning for these applications.

2407.06195 2026-03-02 q-bio.NC cs.IT cs.LG cs.NE math.IT

Spectral-Stimulus Information for Self-Supervised Stimulus Encoding

Jared Deighton, Wyatt Mackey, Ioannis Schizas, David L. Boothe, Vasileios Maroulas

详情
英文摘要

Mammalian spatial navigation relies on specialized neurons, such as place and grid cells, which encode position based on self-motion and environmental cues. While extensive research has explored the computational role of grid cells, the principles underlying efficient place cell coding remain less understood. Existing spatial information rate measures primarily assess single-neuron encoding, limiting insights into population-level representations, while, the role of correlation in neural coding remains a subject of considerable debate. To address this, we introduce novel, correlation-aware information-theoretic measures that quantify the encoding efficiency of multiple neurons, including the joint stimulus information rate for neuron pairs and the spectral-stimulus information for arbitrary sized populations. The spectral-stimulus information, defined as the leading eigenvalue of the stimulus information matrix, is maximized when neurons exhibit localized, non-overlapping firing fields, mirroring place cell and head direction cell activity. We apply these measures to neural data recorded in mice and monkeys, elucidating differences in encoding efficiency across neuronal pairs and populations. Then, we demonstrate that these measures can be used to train recurrent neural networks (RNNs) via self-supervised learning, leading to the emergence of place cells and head direction cells. Our findings highlight how neural populations collectively encode stimuli, offering a more comprehensive framework for understanding stimulus encoding and optimizing artificial navigation systems in novel environments.