arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.20115 2026-03-23 cs.LG q-bio.BM q-bio.QM

Conditioning Protein Generation via Hopfield Pattern Multiplicity

Jeffrey D. Varner

详情

英文摘要

Protein sequence generation via stochastic attention produces plausible family members from small alignments without training, but treats all stored sequences equally and cannot direct generation toward a functional subset of interest. We show that a single scalar parameter, added as a bias to the sampler's attention logits, continuously shifts generation from the full family toward a user-specified subset, with no retraining and no change to the model architecture. A practitioner supplies a small set of sequences (for example, hits from a binding screen) and a multiplicity ratio that controls how strongly generation favors them. The method is agnostic to what the subset represents: binding, stability, specificity, or any other property. We find that the conditioning is exact at the level of the sampler's internal representation, but that the decoded sequence phenotype can fall short because the dimensionality reduction used to encode sequences does not always preserve the residue-level variation that defines the functional split. We term this discrepancy the calibration gap and show that it is predicted by a simple geometric measure of how well the encoding separates the functional subset from the rest of the family. Experiments on five Pfam families (Kunitz, SH3, WW, Homeobox, and Forkhead domains) confirm the monotonic relationship between separation and gap across a fourfold range of geometries. Applied to omega-conotoxin peptides targeting a calcium channel involved in pain signaling, curated seeding from 23 characterized binders produces over a thousand candidates that preserve the primary pharmacophore and all experimentally identified binding determinants. These results show that stochastic attention enables practitioners to expand a handful of experimentally characterized sequences into diverse candidate libraries without retraining a generative model.

URL PDF HTML ☆

赞 0 踩 0

2603.19881 2026-03-23 q-bio.NC

Problem difficulty and waiting time shape the level of detail and temporal organization of visual strategies in human planning

Mattia Eluchans, Giovanni Pezzulo

2603.19751 2026-03-23 math.OC q-bio.NC q-bio.QM

Branched Optimal Transport for Stimulus to Reaction Brain Mapping

Cristian Mendico

2603.19723 2026-03-23 cond-mat.soft q-bio.TO

Modelling the passive and active response of skeletal muscles within the adapted Voigt representation framework

Sara Galasso, Giulio G. Giusteri

Comments 25 pages, 7 figures

2603.19690 2026-03-23 q-bio.NC cs.NE

A Unified Phase-native Computational Principle Governs Hippocampal Spike Timing and Neural Coding

Reza Ahmadvand, Sara Safura Sharif, Yaser Mike Banad

Comments 27 Pages, 5 Figures, 2 Tables

2601.18921 2026-03-23 cs.DB cs.CE cs.LG q-bio.QM

Accelerating Large-Scale Cheminformatics Using a Byte-Offset Indexing Architecture for Terabyte-Scale Data Integration

Malikussaid, Septian Caesar Floresko, Sutiyo

Comments 6 pages, 3 figures, 5 equations, 3 algorithms, 4 tables, to be published in ICoICT 2026, unabridged version exists as arXiv:2512.24643v1

2603.19577 2026-03-23 math.PR q-bio.QM stat.ME

Stochastic Averaging and Statistical Inference of Glycolytic Pathway

Arnab Ganguly, Hye-Won Kang

Comments 33 pages, 2 figures

2603.19473 2026-03-23 q-bio.BM cs.LG

Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids

Lucas Ferraz, Ana F. Rodrigues, Pedro Giesteira Cotovio, Mafalda Ventura, Gabriela Silva, Ana Sofia Coroadinha, Miguel Machuqueiro, Catia Pesquita

2603.19425 2026-03-23 q-bio.NC math.DG math.FA

Curvature Sensitive Cells in the Modular Structures of The Visual Cortex

Giovanna Citti, Vasiliki Liontou

2603.19341 2026-03-23 q-bio.QM math.CO

Assessing 3D tree model quality and species classification using imbalance indices

Sophie J. Kersting, Mareike Fischer

2603.19326 2026-03-23 q-bio.QM cs.LG cs.NA math.AP math.NA

Mathematical Modeling of Cancer-Bacterial Therapy: Analysis and Numerical Simulation via Physics-Informed Neural Networks

Ayoub Farkane, David Lassounon

2603.19320 2026-03-23 q-bio.NC cond-mat.dis-nn cs.NE cs.SI

Analytically tractable model of synaptic crowding explains emergent small-world structure and network dynamics

Makoto Fukushima

Comments An earlier version appears on Research Square

2603.18475 2026-03-23 math.NA cs.NA math.AP q-bio.NC

Resolving the Blow-Up: A Time-Dilated Numerical Framework for Multiple Firing Events in Mean-Field Neuronal Networks

Xu'an Dou, Louis Tao, Zhe Xue, Zhennan Zhou

2601.09320 2026-03-23 q-bio.NC

Mapping Connectomic Structure to Function(s) in Cerebellar-like Networks using Kernel Regression

William Dorrell, Peter E. Latham

Comments 12 pages, 7 figures

2506.12177 2026-03-23 stat.ME q-bio.QM stat.AP

A proxy-based approach for unmeasured confounding in electronic health records research

Haley Colgate Kottler, Amy Cochran

2504.09537 2026-03-23 q-bio.QM

Machine Learning - driven insights for predicting the impact of nanoparticles on the functionality of biomolecules, Illustrated by the case of DNA Damage-Inducible Transcript 3 (CHOP) inhibitors

Mariya L. Ivanova, Michael Nicholls, Nicola Russo, Gueorgui Mihaylov, Konstantin Nikolic

Comments 34 pages, 13 figures, 23 tables

2504.08637 2026-03-23 physics.bio-ph q-bio.NC

Direct dependencies between neurons explain activity

Christopher W. Lynn

Comments 43 pages, 13 figures

2503.03773 2026-03-23 q-bio.GN cs.LG

A Phylogenetic Approach to Genomic Language Modeling

Carlos Albors, Jianan Canal Li, Gonzalo Benegas, Chengzhong Ye, Yun S. Song

Comments 15 pages, 7 figures

2501.14044 2026-03-23 q-bio.OT

Machine learning model leveraging SMILES-derived NMR spectroscopy data to predict dopamine D1 receptor antagonists: a prospective framework for forecasting the impact of engineered nanoparticles on the functionalities of small biomolecules

Mariya L Ivanova, Michael Nichols, Nicola Russo, Gueorgui Mihaylov, Konstantin Nikolic

Comments 27 pages, 8 figures, 2 tables