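arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.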

2604.18559 2026-04-21 q-bio.BM cs.LG

ConforNets: Latents-Based Conformational Control in OpenFold3

Minji Lee, Colin Kalicki, Minkyu Jeon, Aymen Qabel, Alisia Fadini, Mohammed AlQuraishi

Abstract

Models from the AlphaFold (AF) family reliably predict one dominant conformation for most well-ordered proteins but struggle to capture biologically relevant alternate states. Several efforts have focused on eliciting greater conformational variability through ad hoc inference-time perturbations of AF models or their inputs. Despite their progress, these approaches remain inefficient and fail to consistently recover major conformational modes. Here, we investigate both the optimal location and manner-of-operation for perturbing latent representations in the AF3 architecture. We distill our findings in ConforNets: channel-wise affine transforms of the pre-Pairformer pair latents. Unlike previous methods, ConforNets globally modulate AF3 representations, making them reusable across proteins. On unsupervised generation of alternate states, ConforNets achieve state-of-the-art success rates on all existing multi-state benchmarks. On the novel supervised task of conformational transfer, ConforNets trained on one source protein can induce a conserved conformational change across a protein family. Collectively, these results introduce a mechanism for conformational control in AF3-based models.
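
As a rough illustration of what a "channel-wise affine transform" of a pair representation means in general, the sketch below scales and shifts each channel of a toy tensor of shape (residues, residues, channels). The abstract does not give tensor sizes, how the parameters are learned, or any OpenFold3 API, so the function name `channelwise_affine`, the toy shapes, and the parameter values are all assumptions made for illustration only.

```python
import numpy as np

def channelwise_affine(pair, scale, shift):
    """Scale and shift every channel of a pair tensor.

    pair:  array of shape (n_res, n_res, n_channels)
    scale: array of shape (n_channels,)
    shift: array of shape (n_channels,)
    Broadcasting applies the same per-channel parameters at every (i, j) position.
    """
    return pair * scale + shift

rng = np.random.default_rng(0)
n_res, n_channels = 8, 4  # toy sizes, not those of any real model
pair = rng.normal(size=(n_res, n_res, n_channels))

# Hypothetical parameters; in practice they would be learned or searched.
scale = np.array([1.0, 0.5, 2.0, 1.0])
shift = np.array([0.0, 0.1, 0.0, -0.3])
out = channelwise_affine(pair, scale, shift)

# The parameters depend only on the channel count, so the same pair
# (scale, shift) can be applied to an input of a different length.
other = rng.normal(size=(20, 20, n_channels))
out_other = channelwise_affine(other, scale, shift)
```

Because the parameters carry no residue index, a transform of this form is independent of sequence length, which is consistent with the abstract's remark that the modulation is global and reusable across proteins.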

2604.18548 2026-04-21 cs.LG q-bio.QM

Physics-Informed Neural Networks for Biological $2\mathrm{D}{+}t$ Reaction-Diffusion Systems

William Lavery, Jodie A. Cochrane, Christian Olesen, Dagim S. Tadele, John T. Nardini, Sara Hamis

2604.18470 2026-04-21 math.NA cs.NA q-bio.NC

High-fidelity and Network-based Spatio-temporal Mathematical Models of Alzheimer's Disease Progression and their Validation Against PET-SUVR Imaging Data

Beatrice Caon, Mattia Corti, Francesca Bonizzoni, Paola F. Antonietti

2604.18345 2026-04-21 q-bio.PE

Effect of antibiotic spectrum on the abundance of resistant bacteria in multispecies communities

Magnus Aspenberg, Erik Andreas Martens, Kristofer Wollein Waldetoft

Comments 5 figures

2604.18230 2026-04-21 q-bio.QM cond-mat.mtrl-sci cond-mat.soft

ToFiE, a Topology-aware Fiber Extraction workflow for 3D reconstruction of dense and heterogeneous biological fiber networks from microscopy images

Risa Togo, Sara Cardona, Irène Nagle, Gijsje H. Koenderink, Behrooz Fereidoonnezhad, Mathias Peirlinck

2604.18185 2026-04-21 physics.bio-ph q-bio.MN

Noise-Driven Differentiation via Gene Frustration and Epigenetic Fixation

Davey Plugers, Kunihiko Kaneko

Comments 9 pages, 5 figures

2604.18031 2026-04-21 cs.CL cs.LG q-bio.BM

How Creative Are Large Language Models in Generating Molecules?

Wen Tao, Yiwei Wang, Peng Zhou, Bryan Hooi, Wanlong Fang, Tianle Zhang, Xiao Luo, Yuansheng Liu, Alvin Chan

2604.18022 2026-04-21 q-bio.BM cond-mat.stat-mech cs.LG stat.ML

Boltzmann Machine Learning with a Parallel, Persistent Markov chain Monte Carlo method for Estimating Evolutionary Fields and Couplings from a Protein Multiple Sequence Alignment

Sanzo Miyazawa

Comments A manuscript of 11 pages including 3 figures and 3 tables, and a supplementary material of 9 pages including 8 figures. The program and multiple sequence alignments employed here are available from https://gitlab.com/sanzo.miyazawa/BM/ and https://github.com/Sanzo-Miyazawa/BM/

2604.17960 2026-04-21 q-bio.NC cs.LG

The Umwelt Representation Hypothesis: Rethinking Universality

Victoria Bosch, Rowan Sommers, Adrien Doerig, Tim C Kietzmann

Comments preprint v1

2604.17926 2026-04-21 q-bio.PE

Information on hidden birth events restores identifiability in phylodynamic inference

Tobias Dieselhorst, Tanja Stadler

2603.19761 2026-04-21 math.OC q-bio.NC q-bio.QM

Multimodal branched transport infers anatomically aligned brain reaction maps

Cristian Mendico

2512.15948 2026-04-21 cs.AI q-bio.NC

Subjective functions

Samuel J. Gershman

2509.25872 2026-04-21 q-bio.QM q-bio.BM

Marginal Girsanov Reweighting: Stable Variance Reduction for Long-Timescale Dynamics from Biased Simulation

Yan Wang, Hao Wu, Simon Olsson

2506.22178 2026-04-21 q-bio.PE math.AP math.DS nlin.PS physics.bio-ph

Vegetation Patterning Can Both Impede and Trigger Critical Transitions from Savanna to Grassland

Jelle van der Voort, Mara Baudena, Ehud Meron, Max Rietkerk, Arjen Doelman

Comments 24 pages, 8 figures

2506.03157 2026-04-21 q-bio.BM cs.LG

UniSim: A Unified Simulator for Time-Coarsened Dynamics of Biomolecules

Ziyang Yu, Wenbing Huang, Yang Liu

Comments ICML 2025 poster

2604.17786 2026-04-21 q-bio.CB

Spatial dynamic modelling to understand how dendritic cell clustering affects T cell activation

Domenic P. J. Germano, Federico Frascoli, Robyn P. Araujo, Peter P. Lee, Peter S. Kim

2604.17581 2026-04-21 cs.LG cs.AI q-bio.NC

How Much Data is Enough? The Zeta Law of Discoverability in Biomedical Data, featuring the enigmatic Riemann zeta function

Paul M. Thompson

Comments 25 pages, 5 figures

Abstract

How much data is enough to make a scientific discovery? As biomedical datasets scale to millions of samples and AI models grow in capacity, progress increasingly depends on predicting when additional data will substantially improve performance. In practice, model development often relies on empirical scaling curves measured across architectures, modalities, and dataset sizes, with limited theoretical guidance on when performance should improve, saturate, or exhibit cross-over behavior. We propose a scaling-law framework for cross-modal discoverability based on spectral structure of data covariance operators, task-aligned signal projections, and learned representations. Many performance metrics, including AUC, can be expressed in terms of cumulative signal-to-noise energy accumulated across identifiable spectral modes of an encoder and cross-modal operator. Under mild assumptions, this accumulation follows a zeta-like scaling law governed by power-law decay of covariance spectra and aligned signal energy, leading naturally to the appearance of the Riemann zeta function. Representation learning methods such as sparse models, low-rank embeddings, and multimodal contrastive objectives improve sample efficiency by concentrating useful signal into earlier stable modes, effectively steepening spectral decay and shifting scaling curves. The framework predicts cross-over regimes in which simpler models perform best at small sample sizes, while higher-capacity or multimodal encoders outperform them once sufficient data stabilizes additional degrees of freedom. Applications include multimodal disease classification, imaging genetics, functional MRI, and topological data analysis. The resulting zeta law provides a principled way to anticipate when scaling data, improving representations, or adding modalities is most likely to accelerate discovery.
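
To make the link between power-law spectral decay and the zeta function concrete, the sketch below sums a per-mode energy of k^(-s) over the first m modes. These partial sums converge to the Riemann zeta value ζ(s) when s > 1. The abstract does not state the paper's actual formula or how sample size maps to the number of stable modes, so `cumulative_energy`, `num_modes`, and the choice s = 2 are hypothetical stand-ins rather than the author's definitions.

```python
import math

def cumulative_energy(num_modes, decay):
    """Partial sum of k**(-decay) over modes k = 1..num_modes."""
    return sum(k ** -decay for k in range(1, num_modes + 1))

# For decay = 2 the infinite sum is zeta(2) = pi^2 / 6.
limit = math.pi ** 2 / 6

for m in (1, 10, 100, 1000):
    fraction = cumulative_energy(m, 2.0) / limit
    print(f"modes={m:5d}  fraction of total energy={fraction:.4f}")
```

In this toy picture each additional mode contributes less than the one before it, so the curve saturates. A larger decay exponent places more of the total in the earliest modes, which is the sense in which the abstract speaks of steepening spectral decay.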

2604.17361 2026-04-21 q-bio.QM physics.med-ph

3D-DXA Cortical and Trabecular Parameters: Agreement Between Hologic Densitometers in Clinical Practice

Marta I. Bracco, Jorge Malouf, Laurent Maimoun, Xavier Nogues, Jean Paul Roux, François DuBoeuf, Ludovic Humbert

Comments 17 pages, 2 tables, 4 figures

2604.17291 2026-04-21 q-bio.NC

Poisson Flow Model of Cortical Folding Pattern

Moo K. Chung, Luigi Maccotta, Aaron Struck

Comments Published in IEEE EMBC 2026

2604.11824 2026-04-21 q-bio.QM

Patterns in Individual Blood Count Trajectories in the UK Biobank Characterise Disease-Specific Signatures and Anticipate Pan-Cancer Risk

Riya Nagar, Abicumaran Uthamacumaran, Adelaide de Vecchi, Hector Zenil

Comments 22 pages 6 figures

2603.06778 2026-04-21 q-bio.MN math.DS

A cocktail of chemical reaction networks and mathematical epidemiology tools for positive ODE stability problems

Florin Avram, Rim Adenane, Andrei-Dan Halanay

Comments Section 3 corrected

2602.08280 2026-04-21 q-bio.GN

ClusterChirp: Scalable Interactive Exploration of Omics Data with Natural Language-Guided Analysis

Osho Rawal, Rex Lu, Edgar Gonzalez-Kozlova, Sacha Gnjatic, Zeynep H. Gümüş

2601.17808 2026-04-21 cs.NE q-bio.GN

Motif Diversity in Human Liver ChIP-seq Data Using MAP-Elites

Alejandro Medina, Mary Lauren Benton

Comments Accepted Companion Paper to the GECCO 2026 Conference

2601.09173 2026-04-21 cs.LG cs.CL q-bio.QM stat.ML

Geometric Stability: The Missing Axis of Representations

Prashant C. Raju

Abstract

Representational similarity analysis and related methods have become standard tools for comparing the internal geometries of neural networks and biological systems. These methods measure what is represented, the alignment between two representational spaces, but not whether that structure is robust. We introduce geometric stability, a distinct dimension of representational quality that quantifies how reliably a representation's pairwise distance structure holds under perturbation. Our metric, Shesha, measures self-consistency through split-half correlation of representational dissimilarity matrices constructed from complementary feature subsets. A key formal property distinguishes stability from similarity: Shesha is not invariant to orthogonal transformations of the feature space, unlike CKA and Procrustes, enabling it to detect compression-induced damage to manifold structure that similarity metrics cannot see. Spectral analysis reveals the mechanism: similarity metrics collapse after removing the top principal component, while stability retains sensitivity across the eigenspectrum. Across 2463 encoder configurations in seven domains -- language, vision, audio, video, protein sequences, molecular profiles, and neural population recordings -- stability and similarity are empirically uncorrelated ($ρ=-0.01$). A regime analysis shows this independence arises from opposing effects: geometry-preserving transformations make the metrics redundant, while compression makes them anti-correlated, canceling in aggregate. Applied to 94 pretrained models across 6 datasets, stability exposes a "geometric tax": DINOv2, the top-performing model for transfer learning, ranks last in geometric stability on 5/6 datasets. Contrastive alignment and hierarchical architecture predict stability, providing actionable guidance for model selection in deployment contexts where representational reliability matters.
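
The split-half idea in the abstract can be sketched as follows: divide the feature dimensions into two complementary halves, build a representational dissimilarity matrix (RDM) from each half, and correlate the two. This is not the Shesha implementation. The abstract does not specify the distance measure, the correlation type, or how splits are drawn, so Euclidean distance, Pearson correlation, a single random split, and the name `split_half_rdm_correlation` are assumptions.

```python
import numpy as np

def rdm(x):
    """Pairwise Euclidean distances between the rows of x (n_items, n_features)."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def split_half_rdm_correlation(x, rng):
    """Correlate RDMs built from two complementary random halves of the features."""
    n_items, n_features = x.shape
    perm = rng.permutation(n_features)
    half = n_features // 2
    first, second = x[:, perm[:half]], x[:, perm[half:]]
    upper = np.triu_indices(n_items, k=1)  # each item pair counted once
    return np.corrcoef(rdm(first)[upper], rdm(second)[upper])[0, 1]

rng = np.random.default_rng(0)

# Toy "structured" representation: one latent factor shared by all features.
latent = rng.normal(size=(30, 1))
structured = latent @ np.ones((1, 40)) + 0.1 * rng.normal(size=(30, 40))

# Toy "unstructured" representation: independent noise in every feature.
noise = rng.normal(size=(30, 40))

score_structured = split_half_rdm_correlation(structured, rng)
score_noise = split_half_rdm_correlation(noise, rng)
```

In the structured case both halves see the same latent geometry, so their RDMs agree closely. In the noise case the two halves are independent, so there is no shared distance structure for them to agree on.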

2509.01038 2026-04-21 q-bio.BM cs.LG

Learning residue level protein dynamics with multiscale Gaussians

Mihir Bafna, Bowen Jing, Bonnie Berger

Comments ICLR 2026

2310.07464 2026-04-21 eess.IV cs.LG q-bio.QM

Multi-Beholder: Biomarker Prediction for Low-Grade Glioma with Multiple Instance Learning and One-Class Classification

Zijie Fang, Yihan Liu, Yifeng Wang, Xiangyang Zhang, Yang Chen, Changjing Cai, Yiyang Lin, Ying Han, Zhi Wang, Shan Zeng, Jun Tan, Yongbing Zhang, Hong Shen

Comments 14 pages, 5 figures

2604.17151 2026-04-21 q-bio.NC

Causality as a Minimum Energy Principle

Moo K. Chung, D. Vijay Anand, Anass B El-Yaagoubi, Jae-Hun Jung, Anqi Qiu, Hernando Ombao

Comments Published in IEEE Engineering in Medicine and Biology Society Annual Conference (EMBC) 2026

2604.17036 2026-04-21 q-bio.PE

Evolution as fitness landscape navigation: Concepts, Measures, and Emerging Questions

Malvika Srivastava, Claudia Bank, Joachim Krug, Suman G. Das

Comments 27 pages, 2 figures

2604.16896 2026-04-21 q-bio.QM cs.AI

ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design

Yutang Ge, Guojiang Zhao, Sihang Li, Zheng Cheng, Zifeng Zhao, Hanchen Xia, Guolin Ke, Linfeng Zhang, Zhifeng Gao, Yuguang Wang

Comments 25 pages, 11 figures. Accepted to Findings of ACL 2026

2604.16851 2026-04-21 cs.LG cs.AI cs.CV q-bio.BM q-bio.QM

Applications of deep generative models to DNA reaction kinetics and to cryogenic electron microscopy

Chenwei Zhang

Comments PhD Thesis