arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.22269 2026-03-24 q-bio.BM

Computational modeling of RNA-protein binding interactions under an external force

Danielle Wampler, Ralf Bundschuh

详情

英文摘要

RNA binding proteins play a crucial role in post-transcriptional gene regulation by controlling the transport, processing, and translation of their target RNAs. Post-transcriptional gene regulation leads to the differential expression of genetic material and loss of regulation or over-regulation relates to a large range of cancers and diseases - many of which have directly been associated with RNA binding proteins and their target RNAs. To understand RNA, RNA binding proteins, and how they function in gene expression, it is essential to characterize how RNA binding proteins interact with their target RNAs. Here, we aim to assess the potential for single molecule force spectroscopy experiments to be used in the characterization of RNA-protein binding by investigating to what extent a change of extension due to RNA-protein binding is experimentally measurable and what aspects of the interaction can be deduced from such measurements. We predict the effect of protein binding on RNA force extension measurements via the open-source ViennaRNA package, which we have modified to simultaneously consider an external force, protein binding, and RNA secondary structure. From this work, we see protein concentration-dependent responses to external forces with discernable differences in predicted extensions around biologically relevant concentrations and a connection to protein binding domain geometry for several RNA binding proteins.

URL PDF HTML ☆

赞 0 踩 0

2603.22150 2026-03-24 q-bio.PE physics.soc-ph

Epidemic reproduction numbers in spatial networks

Zahra Ghadiri, Jari Saramäki, Takayuki Hiraoka

2603.21634 2026-03-24 math.PR q-bio.PE

Individual-based stochastic model with unbounded growth, birth and death rates: a tightness result

Virgile Brodu

Comments 52 pages, 6 figures, 1 table

2603.21542 2026-03-24 q-bio.NC

Brain Learning Principles Utilizing Non-Ideal Factors in Neural Circuits

Da-Zheng Feng, Hao-Xuan Du

2603.21503 2026-03-24 q-bio.BM

Persistent local Laplacian prediction of protein-ligand binding affinities

Jian Liu, Hongsong Feng

2603.19814 2026-03-24 math.AP math.DS q-bio.PE

Stability analysis and long-time convergence of a partial differential equation model of two-phase ageing

Luce Breuil

2602.23624 2026-03-24 q-bio.PE q-bio.MN

Sex chromosome stability and turnover across vertebrates: a developmental gene regulatory network perspective

Wen-Juan Ma, Ricard Fontserè, Tristan Cornelis, Paris Veltsos, Qi Zhou

Comments 22 pages, 2 figures, GBE invited review article

详情

英文摘要

Sex chromosomes have evolved repeatedly across the Tree of Life, yet their evolutionary fates differ strikingly. In sharp contrast to mammals and birds with degenerated, stable Y/W chromosomes, in most amphibians, teleosts, non avian reptiles and flowering plants, sex chromosomes remain largely homomorphic and undergo frequently turnover. Explanations such as the evolutionary trap hypothesis, sexually antagonistic selection, mutation load, genetic drift and selfish genetic elements, focus on population genetic processes and do not fully explain this pattern. Here we propose the developmental gene regulatory network (GRN) lock in hypothesis. We compile case studies of turnover across vertebrates, synthesise comparative developmental data on sex determination and dosage regulation (DC). In mammals and birds, sex is determined by an early, initiation by somatic cells, fully penetrant master signal acting within a narrow, thermally buffered embryonic window. This signal operates within highly canalised GRNs, coupled to chromosome scale dosage compensation, with alternative splicing events playing little or no causal role in primary sex determination. This configuration makes it difficult for new master sex determining loci to invade without generating deleterious intermediate states. By contrast, many ectothermic vertebrates possess flexible, integrative threshold GRNs in which genetic, germ cells and environmental inputs interact over a prolonged sensitive embryonic period, with absent or largely gene-by-gene based DC and environmentally responsive splicing near key regulatory nodes, providing many entry points for sex determining loci to evolve. We outline empirical predictions and highlight how integrating developmental biology, molecular mechanisms and population genetics can yield testable models for when sex chromosomes become evolutionarily locked-in versus repeated turnover.

URL PDF HTML ☆

赞 0 踩 0

2602.18889 2026-03-24 math.AT q-bio.QM

Topological shape transform for thymus structures

Haochen Yang, Vadim Lebovici, Andreas Tarcevski, Liliana Tchernev, Saulius Zuklys, Georg A. Holländer, Helen M. Byrne, Heather A. Harrington

Comments 41 pages, 13 figures

详情

英文摘要

The Euler characteristic transform (ECT) is an emerging and powerful framework within topological data analysis for quantifying the geometry of shape. The applicability of ECT has been limited due to its sensitivity to noisy data. Here, we introduce SampEuler, a novel ECT-based shape descriptor designed to achieve enhanced robustness to perturbations. We provide a theoretical analysis establishing the stability of SampEuler and validate these properties empirically through pairwise similarity analyses on a benchmark dataset and showcase it on a thymus dataset. The thymus is a primary lymphoid organ that is essential for the maturation and selection of self-tolerant T cells, and within the thymus, thymic epithelial cells are organized in complex three-dimensional architectures, yet the principles governing their formation, functional organization, and remodeling during age-related involution remain poorly understood. Addressing these questions requires robust and informative shape descriptors capable of capturing subtle architectural changes across developmental stages. We develop and apply SampEuler to a newly generated two-dimensional imaging dataset of mouse thymi spanning multiple age groups, where SampEuler outperforms both persistent homology--based methods and deep learning models in detecting subtle, localized morphological differences associated with aging. To facilitate interpretation, we develop a vectorization and visualization framework for SampEuler, which preserves rich morphological information and enables identification of structural features that distinguish thymi across age groups. Collectively, our results demonstrate that SampEuler provides a robust and interpretable approach for quantifying thymic architecture and reveals age-dependent structural changes that offer new insights into thymic organization and involution.

URL PDF HTML ☆

赞 0 踩 0

2511.17695 2026-03-24 q-bio.QM

SynCell: Contextualized Drug Synergy Prediction

Keqin Peng, Guangxin Su, Qinshan Shi, Shuai Gao, Ren Wang, Can Chen, Jun Wen

Comments 12 pages, 1 figures

2509.19988 2026-03-24 stat.ML cs.LG q-bio.QM

BioBO: Biology-informed Bayesian Optimization for Perturbation Design

Yanke Li, Tianyu Cui, Tommaso Mansi, Mangal Prakash, Rui Liao

Comments ICLR 2026

2508.15077 2026-03-24 q-bio.PE

Modelling the transmission and impact of Omicron variants of Covid-19 in different ethnicity groups in Aotearoa New Zealand

Samik Datta, Vincent X Lomas, Nicole Satherley, Andrew Sporle, Michael J Plank

详情

DOI: 10.1016/j.epidem.2026.100905
Journal ref: Epidemics (2026), 55: 100905

英文摘要

Previous pandemics, including influenza pandemics and Covid-19, have disproportionately impacted Māori and Pacific populations in Aotearoa New Zealand. The reasons for this are multi-faceted, including differences in socioeconomic deprivation, housing conditions and household size, vaccination rates, access to healthcare, and prevalence of pre-existing health conditions. Many mathematical models that were used to inform the response to the Covid-19 pandemic did not explicitly include ethnicity or other socioeconomic variables. This limited their ability to predict, understand and mitigate inequitable impacts of the pandemic. Here, we extend a model that was developed during the Covid-19 pandemic to support the public health response by stratifying the population into four ethnicity groups: Māori, Pacific, Asian and European/other. We include three ethnicity-specific components in the model: vaccination rates, clinical severity parameters, and contact patterns. We compare model results to ethnicity-specific data on Covid-19 cases, hospital admissions and deaths between 1 January 2022 and 30 June 2023, under different model scenarios in which these ethnicity-specific components are present or absent. We find that differences in vaccination rates explain only part of the observed disparities in outcomes. While no model scenario is able to fully capture the heterogeneous temporal dynamics, our results suggest that differences between ethnicities in the per-infection risk of clinical severe disease is an important factor. Our work is an important step towards models that are better able to predict inequitable impacts of future pandemic and emerging disease threats, and investigate the ability of interventions to mitigate these.

URL PDF HTML ☆

赞 0 踩 0

2508.14936 2026-03-24 q-bio.QM cs.AI cs.LG stat.AP stat.ML

Can synthetic data reproduce real-world findings in epidemiology? A replication study using adversarial random forests

Jan Kapar, Kathrin Günther, Lori Ann Vallis, Klaus Berger, Nadine Binder, Hermann Brenner, Stefanie Castell, Beate Fischer, Volker Harth, Bernd Holleczek, Timm Intemann, Till Ittermann, André Karch, Thomas Keil, Lilian Krist, Berit Lange, Michael F. Leitzmann, Katharina Nimptsch, Nadia Obi, Iris Pigeot, Tobias Pischon, Tamara Schikowski, Börge Schmidt, Carsten Oliver Schmidt, Anja M. Sedlmair, Justine Tanoey, Harm Wienbergen, Andreas Wienke, Claudia Wigmann, Marvin N. Wright

详情

英文摘要

Synthetic data holds substantial potential to address practical challenges in epidemiology due to restricted data access and privacy concerns. However, many current methods suffer from limited quality, high computational demands, and complexity for non-experts. Furthermore, common evaluation strategies for synthetic data often fail to directly reflect statistical utility and measure privacy risks sufficiently. Against this background, a critical underexplored question is whether synthetic data can reliably reproduce key findings from epidemiological research while preserving privacy. We propose adversarial random forests (ARF) as an efficient and convenient method for synthesizing tabular epidemiological data. To evaluate its performance, we replicated statistical analyses from six epidemiological publications covering blood pressure, anthropometry, myocardial infarction, accelerometry, loneliness, and diabetes, from the German National Cohort (NAKO Gesundheitsstudie), the Bremen STEMI Registry U45 Study, and the Guelph Family Health Study. We further assessed how dataset dimensionality and variable complexity affect the quality of synthetic data, and contextualized ARF's performance by comparison with commonly used tabular data synthesizers in terms of utility, privacy, generalisation, and runtime. Across all replicated studies, results on ARF-generated synthetic data consistently aligned with original findings. Even for datasets with relatively low sample size-to-dimensionality ratios, replication outcomes closely matched the original results across descriptive and inferential analyses. Reduced dimensionality and variable complexity further enhanced synthesis quality. ARF demonstrated favourable performance regarding utility, privacy preservation, and generalisation relative to other synthesizers and superior computational efficiency.

URL PDF HTML ☆

赞 0 踩 0

2508.06719 2026-03-24 q-bio.PE

Speciation by local adaptation and isolation by distance in extended environments

Lara D. Hissa, Flavia M. D. Marquitti, Marcus A. M. de Aguiar

Comments 26 pages, 5 figures, revised

2505.15054 2026-03-24 cs.CL cs.AI cs.LG q-bio.BM

MolLangBench: A Comprehensive Benchmark for Language-Prompted Molecular Structure Recognition, Editing, and Generation

Feiyang Cai, Jiahui Bai, Tao Tang, Guijuan He, Joshua Luo, Tianyu Zhu, Srikanth Pilla, Gang Li, Ling Liu, Feng Luo

Comments ICLR-2026 Camera-Ready version

2502.01178 2026-03-24 math.PR q-bio.PE

Genetic contribution of advantaged ancestors in the biparental Moran model -- finite selection

Camille Coron, Yves Le Jan

2407.03239 2026-03-24 q-bio.QM cs.CV

Solving the inverse problem of microscopy deconvolution with a residual Beylkin-Coifman-Rokhlin neural network

Rui Li, Mikhail Kudryashev, Artur Yakimovich

Comments 17 pages, 8 figures

详情

DOI: 10.1007/978-3-031-73226-3_22
Journal ref: 2024. In European Conference on Computer Vision (pp. 378-395). Cham: Springer Nature Switzerland

英文摘要

Optic deconvolution in light microscopy (LM) refers to recovering the object details from images, revealing the ground truth of samples. Traditional explicit methods in LM rely on the point spread function (PSF) during image acquisition. Yet, these approaches often fall short due to inaccurate PSF models and noise artifacts, hampering the overall restoration quality. In this paper, we approached the optic deconvolution as an inverse problem. Motivated by the nonstandard-form compression scheme introduced by Beylkin, Coifman, and Rokhlin (BCR), we proposed an innovative physics-informed neural network Multi-Stage Residual-BCR Net (m-rBCR) to approximate the optic deconvolution. We validated the m-rBCR model on four microscopy datasets - two simulated microscopy datasets from ImageNet and BioSR, real dSTORM microscopy images, and real widefield microscopy images. In contrast to the explicit deconvolution methods (e.g. Richardson-Lucy) and other state-of-the-art NN models (U-Net, DDPM, CARE, DnCNN, ESRGAN, RCAN, Noise2Noise, MPRNet, and MIMO-U-Net), the m-rBCR model demonstrates superior performance to other candidates by PSNR and SSIM in two real microscopy datasets and the simulated BioSR dataset. In the simulated ImageNet dataset, m-rBCR ranks the second-best place (right after MIMO-U-Net). With the backbone from the optical physics, m-rBCR exploits the trainable parameters with better performances (from ~30 times fewer than the benchmark MIMO-U-Net to ~210 times than ESRGAN). This enables m-rBCR to achieve a shorter runtime (from ~3 times faster than MIMO-U-Net to ~300 times faster than DDPM). To summarize, by leveraging physics constraints our model reduced potentially redundant parameters significantly in expertise-oriented NN candidates and achieved high efficiency with superior performance.

URL PDF HTML ☆

赞 0 踩 0

2307.14436 2026-03-24 eess.IV cs.CV q-bio.QM

Phenotype-preserving metric design for high-content image reconstruction by generative inpainting

Vaibhav Sharma, Artur Yakimovich

Comments 8 pages, 3 figures, conference proceedings

详情

DOI: 10.1117/12.2676835
Journal ref: In Emerging Topics in Artificial Intelligence (ETAI) 2023 (Vol. 12655, pp. 7-14). SPIE

英文摘要

In the past decades, automated high-content microscopy demonstrated its ability to deliver large quantities of image-based data powering the versatility of phenotypic drug screening and systems biology applications. However, as the sizes of image-based datasets grew, it became infeasible for humans to control, avoid and overcome the presence of imaging and sample preparation artefacts in the images. While novel techniques like machine learning and deep learning may address these shortcomings through generative image inpainting, when applied to sensitive research data this may come at the cost of undesired image manipulation. Undesired manipulation may be caused by phenomena such as neural hallucinations, to which some artificial neural networks are prone. To address this, here we evaluate the state-of-the-art inpainting methods for image restoration in a high-content fluorescence microscopy dataset of cultured cells with labelled nuclei. We show that architectures like DeepFill V2 and Edge Connect can faithfully restore microscopy images upon fine-tuning with relatively little data. Our results demonstrate that the area of the region to be restored is of higher importance than shape. Furthermore, to control for the quality of restoration, we propose a novel phenotype-preserving metric design strategy. In this strategy, the size and count of the restored biological phenotypes like cell nuclei are quantified to penalise undesirable manipulation. We argue that the design principles of our approach may also generalise to other applications.

URL PDF HTML ☆

赞 0 踩 0

2306.02929 2026-03-24 q-bio.QM

Microscopy image reconstruction with physics-informed denoising diffusion probabilistic model

Rui Li, Gabriel della Maggiora, Vardan Andriasyan, Anthony Petkidis, Artsemi Yushkevich, Mikhail Kudryashev, Artur Yakimovich

Comments 16 pages, 5 figures

2603.21201 2026-03-24 q-bio.GN

A harmonized benchmarking framework for implementation-aware evaluation of 46 polygenic risk score tools across binary and continuous phenotypes

Muhammad Muneeb, David B. Ascher

2603.21025 2026-03-24 q-bio.PE

Pattern Formation in a Spatial Public Goods Dilemma due to Diffusive or Directed Motion

Yuxuan Zhao, Kaisheng Zhu, Yefei Zhang, Daniel B. Cooney

2603.21020 2026-03-24 q-bio.QM

Characterizing Long-Range Dependencies in Knee Joint Contact Mechanics: A Comparison of Topology Diffusion, Global Routing, and Hybrid Graph Neural Networks

Zhengye Pan, Jianwei Zuo, Jiajia Luo

2603.20988 2026-03-24 cs.AI q-bio.NC

Can we automatize scientific discovery in the cognitive sciences?

Akshay K. Jagadish, Milena Rmus, Kristin Witte, Marvin Mathony, Marcel Binz, Eric Schulz

2603.20848 2026-03-24 cs.CV cs.CE q-bio.TO

GOLDMARK: Governed Outcome-Linked Diagnostic Model Assessment Reference Kit

Chad Vanderbilt, Gabriele Campanella, Siddharth Singi, Swaraj Nanda, Jie-Fu Chen, Ali Kamali, Amir Momeni Boroujeni, David Kim, Mohamed Yakoub, Jamal Benhamida, Meera Hameed, Neeraj Kumar, Gregory Goldgof

2603.20707 2026-03-24 q-bio.PE

Coexistence coalitions in propagule disperser quasi-communities

Leonardo Aguirre, José A. Capitán, David Alonso

Comments 35 pages (17 pages Appendix)

2603.20680 2026-03-24 q-bio.NC cs.LG

Hierarchical Multiscale Structure-Function Coupling for Brain Connectome Integration

Jianwei Chen, Zhengyang Miao, Wenjie Cai, Jiaxue Tang, Boxing Liu, Yunfan Zhang, Yuhang Yang, Hao Tang, Carola-Bibiane Schönlieb, Zaixu Cui, Du Lei, Shouliang Qi, Chao Li

2602.15677 2026-03-24 cs.LG q-bio.QM

CAMEL: An ECG Language Model for Forecasting Cardiac Events

Neelay Velingker, Alaia Solko-Breslin, Mayank Keoliya, Seewon Choi, Jiayi Xin, Anika Marathe, Alireza Oraii, Rajat Deo, Sameed Khatana, Rajeev Alur, Mayur Naik, Eric Wong

Comments 24 pages, 6 figures

2512.03497 2026-03-24 q-bio.QM cs.AI q-bio.CB

Cell-cell Communication Inference and Analysis: Biological Mechanisms, Computational Approaches, and Future Opportunities

Xiangzheng Cheng, Haili Huang, Ye Su, Qing Nie, Xiufen Zou, Suoqin Jin

Comments Published in CSIAM Transactions on Life Sciences (2026)

2511.17685 2026-03-24 q-bio.QM cs.AI cs.CV cs.LG

Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics

Wei Zhang, Jiajun Chu, Xinci Liu, Chen Tong, Xinyue Li

Comments AAAI 2026 Oral, extended version

详情

Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12807-12815. 2026

英文摘要

Spatial Transcriptomics (ST) is a technology that measures gene expression profiles within tissue sections while retaining spatial context. It reveals localized gene expression patterns and tissue heterogeneity, both of which are essential for understanding disease etiology. However, its high cost has driven efforts to predict spatial gene expression from whole slide images. Despite recent advancements, current methods still face significant limitations, such as under-exploitation of high-level biological context, over-reliance on exemplar retrievals, and inadequate alignment of heterogeneous modalities. To address these challenges, we propose DKAN, a novel Dual-path Knowledge-Augmented contrastive alignment Network that predicts spatially resolved gene expression by integrating histopathological images and gene expression profiles through a biologically informed approach. Specifically, we introduce an effective gene semantic representation module that leverages the external gene database to provide additional biological insights, thereby enhancing gene expression prediction. Further, we adopt a unified, one-stage contrastive learning paradigm, seamlessly combining contrastive learning and supervised learning to eliminate reliance on exemplars, complemented with an adaptive weighting mechanism. Additionally, we propose a dual-path contrastive alignment module that employs gene semantic features as dynamic cross-modal coordinators to enable effective heterogeneous feature integration. Through extensive experiments across three public ST datasets, DKAN demonstrates superior performance over state-of-the-art models, establishing a new benchmark for spatial gene expression prediction and offering a powerful tool for advancing biological and clinical research.

URL PDF HTML ☆

赞 0 踩 0

2509.11545 2026-03-24 q-bio.NC

Representational drift under spontaneous activity -- self-organized criticality enhances representational reliability

Zhuda Yang, Junhao Liang, Wing Ho Yung, Changsong Zhou

2507.06358 2026-03-24 q-bio.PE cs.LG

Multi-scale species richness estimation with deep learning

Victor Boussange, Bert Wuyts, Philipp Brun, Johanna T. Malle, Gabriele Midolo, Jeanne Portier, Théophile Sanchez, Niklaus E. Zimmermann, Irena Axmanová, Helge Bruelheide, Milan Chytrý, Stephan Kambach, Zdeňka Lososová, Martin Večeřa, Idoia Biurrun, Klaus T. Ecker, Jonathan Lenoir, Jens-Christian Svenning, Dirk Nikolaus Karger

Comments 31 pages