arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.20488 2026-04-23 q-bio.GN

Conditional Monte Carlo Tree Diffusion for Designing Cell-Type-Specific and Biologically Faithful Regulatory DNA

Animesh Awasthi, Raphael Bednarsky, Moritz Schaefer, Christoph Bock

详情

英文摘要

Designing regulatory DNA elements with precise cell-type-specific activity is broadly relevant for cell engineering and gene therapy. Deep generative models can generate functional gene-regulatory elements, but existing methods struggle to achieve high specificity against undesired cell types while adhering to the genome's natural regulatory grammar. Here, we introduce DNA-CRAFT, a generative framework that integrates class-conditioned discrete diffusion with Monte Carlo tree search to design cell-type-specific and biologically faithful regulatory elements. We first train a discrete diffusion model on the ENCODE registry of 3.2 million candidate regulatory elements. Second, we condition the model to learn class-specific regulatory grammars of naturally occurring DNA sequences, including enhancers and promoters. Third, we employ conditional Monte Carlo tree guidance, an inference-time alignment algorithm designed to maximize the differential regulatory activity between desired and undesired cell types. By benchmarking DNA-CRAFT on regulatory sequence design tasks for human cell lines and immune cell types, we demonstrate that our model generates sequences with high predicted cell-type-specific activity and biological fidelity, achieving the best trade-offs compared to methods that use diffusion, autoregressive models, and gradient-based optimization.

URL PDF HTML ☆

赞 0 踩 0

2604.20477 2026-04-23 q-bio.PE

Emergence biases in molecular evolution

Timothy Fuqua, Nikolaos Vakirlis

Comments 14 pages, 4 figures, perspective piece submitted to a peer-reviewed journal

2604.20469 2026-04-23 math.AP q-bio.PE

Indirect Prey-taxis VS a Shortwave External Signal in Multiple Dimensions

Andrey Morgulis, Karrar Malal

Comments 30 pages, 1 figures

2604.20263 2026-04-23 q-bio.QM cs.AI cs.LG

AROMA: Augmented Reasoning Over a Multimodal Architecture for Virtual Cell Genetic Perturbation Modeling

Zhenyu Wang, Geyan Ye, Wei Liu, Man Tat Alexander Ng

Comments Accepted to ACL 2026 as a Findings paper. Zhenyu Wang and Geyan Ye are equal contributors; Geyan Ye is the corresponding author and project lead

2604.20003 2026-04-23 q-bio.QM cs.AI cs.LG

scpFormer: A Foundation Model for Unified Representation and Integration of the Single-Cell Proteomics

Qifeng Zhou, Lei Yu, Yuzhi Guo, Yuwei Miao, Hehuan Ma, Wenliang Zhong, Lin Xu, Junzhou Huang

2604.19852 2026-04-23 q-bio.CB

Multi-stage volume exclusion models for cell proliferation

John Carlo Dimaculangan, Cameron A. Smith, Christian A. Yates

Comments 55 pages, 20 figures, submitted to Physical Review E

2604.19850 2026-04-23 cs.ET cs.LG cs.NE q-bio.MN q-bio.QM

What Makes a Bacterial Model a Good Reservoir Computer? Predicting Performance from Separability and Similarity

Laura Alonso Bartolomé, Jean-Loup Faulon, Xavier Hinaut

详情

英文摘要

Biological systems are promising substrates for computation because they naturally process environmental information through complex internal dynamics. In this study, we investigate whether bacterial metabolic models can act as physical reservoirs and whether their computational performance can be predicted from dynamical properties linked to separability and similarity. We simulated the growth dynamics of five bacterial species, one yeast species, and 29 Escherichia coli single-gene deletion mutants using dynamic flux balance analysis (dFBA), with glucose and xylose concentrations as inputs and growth curves as reservoir states. Computational performance was assessed on random nonlinear classification tasks using a linear readout, while reservoir properties linked to separability and similarity were characterised through kernel and generalisation ranks computed from growth-curve state matrices. Several microbial models achieved high classification accuracy, showing that bacterial metabolic dynamics can support nonlinear computation. Clear differences were observed between species, with some models converging more rapidly and others reaching higher maximum accuracy, revealing a trade-off between convergence speed and peak performance. In contrast, all E. coli mutants were dominated by the wild-type model, suggesting that gene deletions reduce the dynamical richness required for efficient computation. The difference between kernel and generalisation ranks was generally associated with improved accuracy, but deviations across models and sensitivity at low rank values limited its predictive power in practice. Overall, these results show that bacterial metabolic models constitute promising substrates for reservoir computing and provide a first step towards identifying microbial strains with favourable computational properties for future experimental implementations.

URL PDF HTML ☆

赞 0 踩 0

2604.19842 2026-04-23 q-bio.OT

Energy gradients as potential drivers of pre-cellular chemical organization

Arturo Tozzi

Comments 14 pages, 5 figures

2604.19840 2026-04-23 cs.LG q-bio.QM

Graph-Theoretic Models for the Prediction of Molecular Measurements

Anna Niane, Prudence Djagba

详情

英文摘要

Graph-theoretic approaches offer simplicity, interpretability, and low computational cost for molecular property prediction. Among these, the model proposed by Mukwembi and Nyabadza, based on the external activity $D(G)$ and internal activity $ζ(G)$ indices, achieved strong results on a small flavonoid dataset. However, its ability to generalize to larger and chemically diverse datasets has not been tested. This study evaluates the baseline $D(G)$-$ζ(G)$ polynomial model on five benchmark datasets from MoleculeNet, covering biological activity (BACE, 1,513 molecules), lipophilicity (LogP synthetic, 14,610 molecules; LogP experimental, 753 molecules), aqueous solubility (ESOL, 1,128 molecules), and hydration free energy (SAMPL, 642 molecules). The baseline model achieves an average $R^2 = 0.24$, confirming limited transferability. To address this, a systematic enhancement framework is proposed, progressively incorporating Ridge regularization, additional graph descriptors, physicochemical properties, ensemble learning with Gradient Boosting, Lasso feature selection, and a hybrid approach combining topological indices with Morgan fingerprints. The enhanced models raise the average best $R^2$ to 0.79, with individual improvements ranging from 165\% to 274\%. All improvements are statistically significant ($p < 0.001$). A direct comparison with a Graph Convolutional Network under identical experimental conditions shows that the enhanced classical models match or outperform deep learning on all five datasets. Comparison with the recent GNN+PGM hybrid of Djagba et al.\ further confirms competitiveness, with the enhanced models achieving the best results on two datasets and tying on one. The entire framework requires no GPU, trains in under five minutes, and uses only open-source tools, making it accessible for researchers in resource-limited settings.

URL PDF HTML ☆

赞 0 踩 0

2604.19805 2026-04-23 q-bio.PE

Modeling of Pneumococcal and Respiratory Syncytial Virus Pneumonia: An Epidemiological Review, with Statistical Inference

Rupchand Sutradhar, Anuj Mishra, Malay Banerjee, Subhra Sankar Dhar

2604.19799 2026-04-23 cs.HC cs.AI cs.CY q-bio.NC

Measuring Creativity in the Age of Generative AI: Distinguishing Human and AI-Generated Creative Performance in Hiring and Talent Systems

Yigal Rosen, Ilia Rushkin

Comments Research Paper Presented at the BIG.AI@MIT Conference, April 2, 2026

2601.11505 2026-04-23 cs.LG cs.AI cs.SY eess.SY q-bio.QM

MetaboNet: The Largest Publicly Available Consolidated Dataset for Type 1 Diabetes Management

Miriam K. Wolff, Peter Calhoun, Eleonora Maria Aiello, Yao Qin, Sam F. Royston

Comments 30 pages, 5 figures, 1 Table, 10 supplementary figures, 3 supplementary tables, submitted to JDST

2601.05367 2026-04-23 q-bio.PE

The rights and wrongs of rescaling in population genetics simulations

Parul Johri, Fanny Pouyet, Brian Charlesworth

2512.15808 2026-04-23 q-bio.QM cs.AI cs.CV cs.LG

Foundation Models in Biomedical Imaging: Turning Hype into Reality

Amgad Muneer, Kai Zhang, Ibraheem Hamdi, Rizwan Qureshi, Muhammad Waqas, Shereen Fouad, Hazrat Ali, Syed Muhammad Anwar, Jia Wu

Comments 9 figures and 3 tables

2510.21742 2026-04-23 q-bio.NC cond-mat.dis-nn cs.NE hep-th physics.bio-ph

Statistics of correlations in nonlinear recurrent neural networks

German Mato, Facundo Rigatuso, Gonzalo Torroba

Comments 39 pages, 9 figures

2509.17260 2026-04-23 q-bio.NC cs.OH stat.AP

A tutorial on electrogastrography using low-cost hardware and open-source software

Evgeniya Anisimova, Sameer N. B. Alladin, Styliani Tsamaz, Edwin S. Dalmaijer

2509.02060 2026-04-23 q-bio.BM cs.LG

Morphology-Aware Peptide Discovery via Masked Conditional Generative Modeling

Nuno Costa, Julija Zavadlav

Comments 46 pages, 4 figures, 6 tables

2507.07800 2026-04-23 q-bio.QM cs.CV

A novel attention mechanism for noise-adaptive and robust segmentation of microtubules in microscopy images

Achraf Ait Laydi, Louis Cueff, Mewen Crespo, Yousef El Mourabit, Hélène Bouvrais

详情

英文摘要

Segmenting cytoskeletal filaments in microscopy images is essential for studying their roles in cellular processes. However, this task is highly challenging due to the fine, densely packed, and intertwined nature of these structures. Imaging limitations further complicate analysis. While deep learning has advanced segmentation of large, well-defined biological structures, its performance often degrades under such adverse conditions. Additional challenges include obtaining precise annotations for curvilinear structures and managing severe class imbalance during training. We introduce a novel noise-adaptive attention mechanism that extends the Squeeze-and-Excitation (SE) module to dynamically adjust to varying noise levels. Integrated into a U-Net decoder with residual encoder blocks, this yields ASE_Res_UNet, a lightweight yet high-performance model. We also developed a synthetic dataset generation strategy that ensures accurate annotations of fine filaments in noisy images. We systematically evaluated loss functions and metrics to mitigate class imbalance, ensuring robust performance assessment. ASE_Res_UNet effectively segmented microtubules in noisy synthetic images, outperforming its ablated variants. It also demonstrated superior segmentation compared to models with alternative attention mechanisms or distinct architectures, while requiring fewer parameters, making it efficient for resource-constrained environments. Evaluation on a newly curated real microscopy dataset and a recently reannotated dataset highlighted ASE_Res_UNet's effectiveness in segmenting microtubules beyond synthetic images. For these datasets, ASE_Res_UNet was competitive with a recent synthetic data-driven approach that shares two cytoskeleton pretrained models. Importantly, ASE_Res_UNet showed strong transferability to other curvilinear structures (blood vessels and nerves) across diverse imaging conditions.

URL PDF HTML ☆

赞 0 踩 0

2506.14103 2026-04-23 stat.ME q-bio.QM

A Robust Nonparametric Framework for Detecting Repeated Spatial Patterns

Rajitha Senanayake, Pratheepa Jeganathan

Comments 39 pages including an Appendix of 17 pages, 39 figures

2411.00063 2026-04-23 q-bio.QM

Logistic Regression Analysis on the Dietary Behavior and the Risk of Nutritional Deficiency Dermatosis: The Case of Bicol Region, Philippines

John Ben S Temones

Comments 11 pages

2404.06459 2026-04-23 q-bio.PE

A hybrid discrete-continuum modelling approach for the interactions of the immune system with oncolytic viral infections

David Morselli, Marcello E. Delitala, Adrianne L. Jenner, Federico Frascoli

Comments 32 pages, 12 figures. Supplementary material available at https://doi.org/10.5281/zenodo.18340945

详情

DOI: 10.1016/j.jtbi.2026.112462
Journal ref: J. Theor. Biol. (2026), 627, p. 112462

英文摘要

Oncolytic virotherapy, utilizing genetically modified viruses to combat cancer and trigger anti-cancer immune responses, has garnered significant attention in recent years. In our previous work arXiv:2305.12386, we developed a stochastic agent-based model elucidating the spatial dynamics of infected and uninfected cells within solid tumours. Building upon this foundation, we present a novel stochastic agent-based model to describe the intricate interplay between the virus and the immune system; the agents' dynamics are coupled with a balance equation for the concentration of the chemoattractant that guides the movement of immune cells. We formally derive the continuum limit of the model and carry out a systematic quantitative comparison between this system of PDEs and the individual-based model in two spatial dimensions. Furthermore, we describe the traveling waves of the three populations, with the uninfected proliferative cells trying to escape from the infected cells while immune cells infiltrate the tumour. Simulations show a good agreement between agent-based approaches and numerical results for the continuum model. Some parameter ranges give rise to oscillations of cell number in both models, in line with the behaviour of the corresponding nonspatial model, which presents Hopf bifurcations. Nevertheless, in some situations the behaviours of the two models may differ significantly, suggesting that stochasticity plays a key role in the dynamics. Our results highlight that a too rapid immune response, before the infection is well-established, appears to decrease the efficacy of the therapy and thus some care is needed when oncolytic virotherapy is combined with immunotherapy. This further suggests the importance of clinically improving the modulation of the immune response according to the tumour's characteristics and to the immune capabilities of the patients.

URL PDF HTML ☆

赞 0 踩 0

2604.20824 2026-04-23 cs.LG q-bio.QM

Closing the Domain Gap in Biomedical Imaging by In-Context Control Samples

Ana Sanchez-Fernandez, Thomas Pinetz, Werner Zellinger, Günter Klambauer

2604.20629 2026-04-23 math.PR q-bio.PE

Rates of forgetting for the sequentially Markov coalescent

Jonathan Terhorst

2604.20626 2026-04-23 q-bio.PE cs.AI

Centering Ecological Goals in Automated Identification of Individual Animals

Lukas Picek, Timm Haucke, Lukáš Adam, Ekaterina Nepovinnykh, Lasha Otarashvili, Kostas Papafitsoros, Tanya Berger-Wolf, Michael B. Brown, Tilo Burghardt, Vojtech Cermak, Daniela Hedwig, Justin Kitzes, Sam Lapp, Subhransu Maji, Daniel Rubenstein, Arjun Subramonian, Charles Stewart, Silvia Zuffi, Sara Beery

2604.20524 2026-04-23 q-bio.NC cond-mat.dis-nn cs.NE

Response time of lateral predictive coding and benefits of modular structures

Guanghui Cai, Zhen-Ye Huang, Weikang Wang, Hai-Jun Zhou

Comments 16 pages, under review in Physica A

2407.01621 2026-04-23 cs.LG q-bio.QM stat.ME stat.ML

Deciphering interventional dynamical causality from non-intervention complex systems

Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Rui Bao, Kazuyuki Aihara, Luonan Chen, Wei Lin

详情

DOI: 10.1016/j.xinn.2026.101358

英文摘要

Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. Delay-embedding technique provides a promising approach. In this study, we propose a framework named Interventional Dynamical Causality (IntDC) in contrast to the traditional Constructive Dynamical Causality (ConDC). ConDC, including Granger causality, transfer entropy and convergence of cross-mapping, measures the causality by constructing a dynamical model without considering interventions. A computational criterion, Interventional Embedding Entropy (IEE), is proposed to measure causal strengths in an interventional manner. IEE is an intervened causal information flow but in the delay-embedding space. Further, the IEE theoretically and numerically enables the deciphering of IntDC solely from observational (non-interventional) time-series data, without requiring any knowledge of dynamical models or real interventions in the considered system. In particular, IEE can be applied to rank causal effects according to their importance and construct causal networks from data. We conducted numerical experiments to demonstrate that IEE can find causal edges accurately, eliminate effects of confounding, and quantify causal strength robustly over traditional indices. We also applied IEE to real-world tasks. IEE performed as an accurate and robust tool for causal analyses solely from the observational data. The IntDC framework and IEE algorithm provide an efficient approach to the study of causality from time series in diverse non-intervention complex systems.

URL PDF HTML ☆

赞 0 踩 0