arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.01656 2026-05-05 q-bio.NC cs.AI cs.LG

From Cortical Synchronous Rhythm to Brain Inspired Learning Mechanism: An Oscillatory Spiking Neural Network with Time-Delayed Coordination

Tingting Dan, Guorong Wu

Comments 19 pages, 6 figures

详情
英文摘要

Human cognition emerges from coordinated spiking dynamics in distributed neural circuits, where information is encoded via both firing rates and precise spike timing determined by brain rhythms. Inspired by this notion, we propose a brain-inspired learning primitive in which cognition-level neural synchrony emerges through iterative bottom-up and top-down interactions between micro-scale dynamics of spiking neurons and a macro-scale mechanism of oscillatory synchronization. Specifically, we model each parcel (e.g., a cortical region or an image pixel) in the target system as a spiking neuron embedded in a predefined connectivity scaffold. Low-level information is encoded in a spatiotemporal domain, where neurons are selectively grouped and fire spontaneously over time through self-organized dynamics. In the bottom-up route, oscillatory synchronization is formed from past spiking activity accumulated over a finite memory window. Since brain dynamics operate in a regime of partial and transient synchronization rather than global phase locking, we model oscillatory coordination using a time-delayed synchronization formulation, which enables a top-down modulation of heterogeneous neural spiking for a large-scale distributed system. Together, we devise a spiking-by-synchronization neural network (S2-Net) that uses rhythmic timing as a control mechanism for efficient information processing. Promising results have been achieved across a broad range of tasks, including neural activity decoding, energy-efficient signal processing, temporal binding and semantic reasoning.

2605.01472 2026-05-05 physics.optics q-bio.QM

Label-Free Microrefractometry of Interfacial Processes Using Fluorescent Smart Coverslips

Hodaya Klimovsky, Amitay Ginsberg, Dmytro Ohorodniichuk, Maria Shehadeh, Ilya Olevsko, Gerardo Byk, Martin Oheim, Adi Salomon

Comments 38 pages, 4 figures (main text) and 9 supporting figures

详情
英文摘要

Molecular dipoles near interfaces emit highly directional radiation due to near-field interactions, making surface-bound fluorophores sensitive probes of local physicochemical changes. We introduce smart coverslips, stably coated with uniform, brightly fluorescent nanobead films, that exploit refractive-index-dependent emission shifts for sensitive micro-refractometry in small volumes. Supercritical-angle fluorescence refractometry uses single back-focal-plane images to allow us real-time RI sensing and nanometric thin-film height measurements without the need for multi-angle or multi-wavelength acquisition. Our fast, label-free, and non-invasive approach allows measurements of thin-film properties and monitoring of interfacial dynamics on a standard inverted microscope and is broadly applicable to nanobiophotonics, chemical sensing, and in-situ materials analysis.

2605.01378 2026-05-05 q-bio.GN

PhenotypeToGeneDownloaderR: automated multi-source retrieval and validation of phenotype-associated genes

Muhammad Muneeb, David B. Ascher

Comments https://github.com/MuhammadMuneeb007/PhenotypeToGeneDownloaderR

详情
英文摘要

Identifying phenotype-associated genes is a common first step in polygenic risk score construction, enrichment testing, target prioritisation and variant interpretation, but relevant evidence is distributed across heterogeneous databases with different interfaces, formats and evidence models. Here, we present PhenotypeToGeneDownloaderR, a phenotype-guided R/Python pipeline for automated gene retrieval, harmonisation, symbol validation and cross-source summary analysis. Given a phenotype term, the pipeline queries integrated biological databases, standardises per-source outputs, combines gene lists, validates retrieved symbols against the NCBI human gene reference and generates summary tables and visualisations. Across 13 clinically relevant phenotypes and 13 databases, PhenotypeToGeneDownloaderR generated 136,487 raw gene retrievals, with at least one source returning genes for every phenotype. Across all 13 phenotypes, 100,175 of 114,345 combined input symbols were retained after direct or synonym-based validation, corresponding to an 87.6\% validation rate. Cross-source overlap was low, supporting the complementarity of integrated evidence sources. Against an HPO/ClinVar/OMIM-derived gold standard, the pipeline recovered 1,039 of 1,056 known phenotype-associated genes, corresponding to 98.4\% recall. PhenotypeToGeneDownloaderR provides a lightweight, reproducible upstream framework for generating candidate gene sets for downstream prioritisation and interpretation. The pipeline is implemented in R and Python, released under the MIT licence, and available at https://github.com/MuhammadMuneeb007/PhenotypeToGeneDownloaderR.

2605.01290 2026-05-05 q-bio.QM

How Light Reshapes the Mind. An Active Inference Framework for the Cognitive and Emotional Effects of Indoor Lighting

Luca M. Possati

详情
英文摘要

Indoor lighting affects cognition, affect, and behavioural regulation, but these effects are often treated as isolated findings rather than as parts of a unified process. This paper proposes an active inference account of shared indoor lighting in multi-user environments such as offices, classrooms, and libraries. It argues that lighting shapes behaviour through three distinct channels: illuminance modulates perceptual precision, correlated colour temperature modulates arousal relative to circadian optimum, and spectral composition biases behavioural disposition toward engagement or rest. The paper formalises this hypothesis through a proof-of-concept POMDP model of agents performing sustained reading over five hours, using both reading performance and eye-tracking observations. The model generates six falsifiable predictions, all confirmed across 20 Monte Carlo simulations.

2605.01083 2026-05-05 q-bio.TO cs.NA math.NA

Modelling the electrophysiological interactions between human pluripotent cell-derived cardiomyocite grafts and host ventricular tissue

Suran Galappaththige, Vadim N Biktashev, Faisal J Alibhai, Michael Laflamme

Comments 23 pages, 10 figures, as submitted to PLOS Comp Biol

详情
英文摘要

Human pluripotent stem cell-derived cardiomyocytes (hPSC-CMs) are a promising therapy for regenerating myocardium after infarction, but their use is limited by graft-related arrhythmias that frequently occur shortly after transplantation. Experimental studies indicate that these arrhythmias can originate within the graft, which may act as an ectopic pacemaker, yet the mechanisms governing successful excitation of host tissue remain poorly understood. In particular, the role of electrical coupling at the graft-host interface is important, but difficult to measure directly or control. Computer modelling can help here. Here, we present a computational framework that enables systematic investigation of graft-host electrical interactions using a physiologically interpretable parameterisation. We model the graft-host interface as an internal boundary with a defined specific conductance, allowing direct control over coupling strength in units that correspond to measurable tissue properties. We formulate the governing equations and implement the computations using both finite-difference and finite-element discretisations in established cardiac modelling platforms. Using representative anatomical and physiological configurations, we demonstrate how variations in interface conductance influence the ability of spontaneous graft activity to initiate propagating excitation in host tissue. This framework provides a reproducible, mechanistically transparent tool for studying graft-related arrhythmogenesis and lays a foundation for evaluating strategies to mitigate arrhythmic risk in cardiac cell therapy.

2605.01056 2026-05-05 q-bio.MN math.DS

Logistic Gene Regulatory Networks: Prevention of Expression Shutdown, and Numerical Stability Beyond Hill Function

Ismail Belgacem

Comments arXiv admin note: text overlap with arXiv:2512.14325

详情
英文摘要

Hill functions, the standard tool for modelling gene regulatory networks, carry three structural flaws when the cooperativity exponent is non-integer: loss of global smoothness, silent complex-valued arithmetic corruption of ODE trajectories, and an identically zero basal production rate that traps bistable models in off-states. Logistic functions $f^\pm$, being globally $C^\infty$, real-valued for all arguments, and strictly positive at zero, resolve all three simultaneously. For a two-gene negative-feedback oscillator, local asymptotic stability is established for all positive parameters via the Routh--Hurwitz criterion, and no Hopf bifurcation is possible without time delays. For bistable positive autoregulation, saddle-node thresholds are characterised through explicit transcendental equations; with biophysically grounded \textit{E.~coli} parameters, basal logistic production drives off-state escape in $\approx 44$~min while the Hill model remains permanently trapped. The 11-gene Traynard cell-cycle Boolean network is translated automatically via the product-of-logistics De~Morgan formalism and integrated without warnings, all variables remaining bounded and non-negative. The De~Morgan framework places every repressor threshold at a positive measurable concentration, whereas the weighted-sum formulation of Samuilik et al.\ places repressor critical points at negative concentrations, rendering them biologically inert. On an 80-gene Boolean-derived ODE system with $n = 3.509$, the Hill solver entered silent complex-valued contamination at $t \approx 52.64$ and terminated near $t \approx 63$--$65$; the logistic formulation completed $t \in [0, 200]$ without a single warning. The always-positive production rate ensures full controllability, enabling sliding mode, model predictive, and feedback-linearisation strategies where Hill-based formulations fail.

2605.00966 2026-05-05 cs.LG cs.NE q-bio.NC stat.ML

Robust volatility updates for Hierarchical Gaussian Filtering

Christoph Mathys, Nicolas Legrand, Peter Thestrup Waade, Nace Mikus, Lilian Aline Weber

详情
英文摘要

Hierarchical Gaussian Filtering (HGF) networks allow for efficient updating of posterior distributions (beliefs) about hidden states of an agent's environment. HGF parent nodes can target the mean or variance of their children. New information entering at input nodes leads to a cascade of belief updates across the network according to one-step update equations for each node's mean and precision (inverse variance). However, the original form of the update equations for variance-targeting parents(volatility coupling) can in some regions of parameter space lead to negative posterior precision, a logical impossibility which causes the updating algorithm to terminate with an error. In this report, we introduce a modified quadratic approximation to the variational energy of volatility-coupled nodes that avoids negative posterior precision. The key idea is to interpolate between two quadratic expansions of the variational energy: one at the prior prediction and one at a second mode whose location is obtained in closed form via the Lambert W function. The resulting update equations are robust across the entire parameter space and faithfully track the variational posterior even for large prediction errors.

2605.00948 2026-05-05 q-bio.QM cs.AI

Co-Generative De Novo Functional Protein Design

Xinrui Chen, Yizhen Luo, Siqi Fan, Zaiqing Nie

详情
英文摘要

De novo functional protein design aims to generate protein sequences that realize specified biochemical functions without relying on evolutionary templates, enabling broad applications in biotechnology and medicine. Existing approaches adopt either direct function-to-sequence mapping or decoupled structure-sequence generation strategies but often fail to achieve functionality and foldability simultaneously. To address this, we propose CodeFP, a Co-generative protein language model for de novo Functional Protein design that simultaneously decodes sequence and structure tokens, thereby enabling superior simultaneous realization of functionality and foldability. CodeFP utilizes functional local structures to enrich functional semantic encodings, overcoming the suboptimal translation of flat encodings into structure tokens, while introducing auxiliary functional supervision to alleviate training ambiguity stemming from the one-to-many structure-to-token mapping. Extensive experiments show that CodeFP consistently achieves average improvements of 6.1% in functional consistency and 3.2% in foldability over the strongest baseline.

2605.00930 2026-05-05 q-bio.GN cs.AI

CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation

Andac Demir, Erik W. Anderson, Jeremy L. Jenkins, Srayanta Mukherjee

详情
Journal ref
ICLR Machine Learning for Genomics Explorations Workshop 2026
英文摘要

In this work, we introduce CellxPert, a scalable multimodal foundation model that unifies single-cell and spatial multi-omics within a common representation space. CellxPert jointly encodes transcriptomic (scRNA-seq), chromatin-accessibility (ATAC-seq), and surface-proteomic (CITE-seq) measurements, while directly incorporating MERFISH and imaging mass-cytometry data as 2D or 3D spatial-visual layers. CellxPert facilitates four key downstream tasks out of the box: (i) cell-type annotation across a broad ontology of 154 largely overlapping identities -- the largest label space addressed to date and a stringent test of fine-grained discrimination, (ii) efficient fine-tuning using Low Rank Adaptation (LoRA), (iii) genome-wide transcriptomic response prediction to in-silico perturbations (ISP), and (iv) seamless multi-omic integration across various assays and platforms. Unlike current single-cell foundation models, which approximate gene perturbations by deleting or reordering tokenized gene expression ranks, CellxPert employs a Metropolis-Hastings sampler whose proposal kernel uses the model's masked conditional distributions to transition to new transcriptomic states conditioned on the perturbed genes. This Markov-chain procedure mitigates out-of-distribution artifacts introduced by abrupt token manipulation and produces trajectories that are biologically interpretable. Evaluations on PBMC68K, Replogle Perturb-seq, Systema, and BMMC benchmarks show that CellxPert surpasses classical and state-of-the-art baselines in cell-type annotation, perturbation response prediction, and multi-omic integration.

2605.00927 2026-05-05 q-bio.OT

BioVeil MATRIX: Uncovering and categorizing vulnerabilities of agentic biological AI scientists

Kimon Antonios Provatas, Avery Self, Ioannis Mouratidis, Ilias Georgakopoulos-Soares

详情
英文摘要

Agentic AI scientists equipped with domain-specific tools are rapidly entering scientific workflows across disciplines, with especially strong uptake in the life sciences where they can be used for literature synthesis, sequence analysis, and experimental planning support. While these systems accelerate biological research, they also introduce risks for dual-use applications that are not captured by current model-centric safety evaluations. We present evidence that current agentic AI scientists, including Biomni and K-Dense, are willing to assist with dual-use tasks that are blocked by base model safeguards. We also found that in a paired evaluation framework for biology and chemistry prompts involving Weapons of Mass Destruction proxies (WMDP), agentic scaffolding of Biomni increased the benchmark performance relative to the underlying standalone model, producing measurable capability uplift. We believe it is necessary to include additional safeguards in existing models and build future tools from the ground up with agentic vulnerabilities in mind. To systematically categorize broader risks, we introduce BioVeil MATRIX, a defensive taxonomy that maps AI-enabled biosecurity risks using 10 tactical categories (TA01--TA10) and 22 different techniques. We propose to use this taxonomy as a baseline for future AI scientist development and generate specialized benchmarks and protocols for red-teaming these vulnerabilities before public deployment. BioVeil MATRIX can be found at: https://bioveilmatrix.com/

2605.00925 2026-05-05 cs.LG cs.CV q-bio.QM

Linking spatial biology and clinical histology via Haiku

Yan Cui, Jacob S. Leiby, Wenhui Lei, Dokyoon Kim, Yanxiang Deng, Aaron T. Mayer, Zhenqin Wu, Alexandro E. Trevino, Zhi Huang

详情
英文摘要

Integrating molecular, morphological, and clinical data is essential for basic and translational biomedical research, yet systematic frameworks for jointly modeling these modalities remain limited. Here we present Haiku, a tri-modal contrastive learning model trained on multiplexed immunofluorescence (mIF). It comprises 26.7 million spatial proteomics patches from 3,218 tissue sections across 1,606 patients spanning 11 organ types, with matched hematoxylin and eosin (H&E) histology and clinical metadata aligned in a shared embedding space. Haiku enables three-way cross-modal retrieval, improves downstream classification and clinical prediction tasks over unimodal baselines, and supports zero-shot biomarker inference through fusion retrieval conditioned on clinical metadata-only text descriptions. Across tasks, Haiku outperforms competing approaches, achieving cross-modal retrieval (Recall@50 up to 0.611 versus near-zero baseline), survival prediction (C-index 0.737, +7.91% relative improvement), and zero-shot biomarker inference (mean Pearson correlation 0.718 across 52 biomarkers). Furthermore, we introduce a counterfactual prediction framework in which modifying only clinical metadata while fixing tissue morphology surfaces niche-specific molecular shifts associated with breast cancer stage progression and lung cancer survival outcomes. In a lung adenocarcinoma case study, the counterfactual analysis recovers niche-specific shifts characterized by increased CD8 and granzyme B, reduced PD-L1, and decreased Ki67, broadly consistent with patterns reported for favorable outcomes. We present these counterfactual results as exploratory, hypothesis-generating signals rather than mechanistic claims. These capabilities demonstrate that tri-modal alignment via Haiku enables integrative analysis of spatial biology, bridging molecular measurements with clinical context for biological exploration.

2605.00865 2026-05-05 eess.SP cs.CL cs.CV cs.LG cs.SD q-bio.NC

How Well Can We Decode Vowels from Auditory EEG -- A Rigorous Cross-Subject Benchmark with Honest Assessment

Xiaoyang Li

Comments 31 pages, 11 figures; includes supplementary material (14 pages, additional figures and analyses)

详情
英文摘要

EEG based phoneme decoding is promising for brain computer interfaces, but many prior studies rely on within subject evaluation, small cohorts, or weak leakage control. We present a reproducible cross subject benchmark for five class vowel decoding (a, e, i, o, u) from auditory EEG using OpenNeuro ds006104 (16 subjects, 61 channels, 256 Hz). Under strict leave one subject out evaluation with training only normalization and explicit anti leakage checks, we compare 14 pipelines from classical machine learning, deep learning, and Riemannian methods. The best full feature model (XGBoost) reaches 24.5 percent accuracy (chance 20 percent), while differential entropy features with LightGBM reach 25.5 percent in feature specific analysis. After multiple comparison correction, strong pairwise model advantages are limited. Classical methods are competitive with deep models in this low signal regime. Additional analyses (ablation, pairwise vowels, within subject CV, ERP, temporal generalization, and electrode importance) indicate that vowel information is real but weak and mainly carried by early transient auditory responses. We release code and evaluation scripts for full reproducibility.

2605.00857 2026-05-05 eess.SP cs.AI cs.LG q-bio.NC

Foundation Model Guided Dual-Branch Co-Adaptation for Source-Free EEG Decoding

Peiliang Gong, Han Zhang, Zhen Jiang, Chenyu Liu, Ziyu Jia, Xinliang Zhou, Daoqiang Zhang, Xiaoli Li

详情
英文摘要

Source-free domain adaptation (SFDA) provides a practical solution to cross-subject EEG decoding by adapting source-pretrained models to unlabeled target domains without accessing source data. However, existing SFDA methods rely solely on the limited internal knowledge of source-pretrained models, leading to inferior cross-domain generalization and unreliable pseudo-labels. Although EEG Foundation Models (FMs) pretrained on large-scale data exhibit strong generalizability, their potential in SFDA remains largely unexplored. To this end, we propose FUSED, a Foundation-guided Source-free EEG Decoding framework that integrates a large-scale FM with a compact Specialist Model (SM) via dual-branch co-adaptation. Specifically, we introduce a Co-adaptation mechanism equipping both branches with linear and prototype views, enabling cross-branch pseudo-label generation. Additionally, we design a Consensus Filtering Mechanism that exploits the FM's inherent stability to identify high-quality samples, along with a Two-Stage Pseudo-Label Refinement scheme to suppress error accumulation through cross-branch arbitration. Finally, we calibrate the FM's decision boundaries via mutual information maximization with the SM, followed by knowledge distillation from FM to SM, forming a principled calibrate-then-distill pipeline. To our knowledge, FUSED is the first work to leverage EEG FMs within the SFDA framework for cross-subject EEG decoding. Extensive experiments across three EEG paradigms, including motor imagery, emotion recognition, and SSVEP, demonstrate consistent state-of-the-art performance, validating the effectiveness of foundation-guided synergy for robust and privacy-preserving EEG decoding.

2604.18637 2026-05-05 q-bio.NC cs.AI cs.CY

NeuroAI and Beyond: Bridging Between Advances in Neuroscience and ArtificialIntelligence

Anthony Zador, Jean-Marc Fellous, Terrence Sejnowski, Gina Adam, James B Aimone, Akwasi Akwaboah, Yiannis Aloimonos, Carmen Amo Alonso, Chiara Bartolozzi, Michael J. Bennington, Michael Berry, Bing W. Brunton, Gert Cauwenberghs, Hillel J. Chiel, Tobi Delbruck, John Doyle, Jason Eshraghian, Ralph Etienne-Cummings, Cornelia Fermuller, Matthew Jacobsen, Ali A. Minai, Barbara Oakley, Alexander G. Ororbia, Joe Paton, Blake Richards, Yulia Sandamirskaya, Abhronil Sengupta, Shihab Shamma, Michael P. Stryker, Seong Jong Yoo, Steven W. Zucker

详情
英文摘要

Neuroscience and Artificial Intelligence (AI) have made impressive progress in recent years but remain only loosely interconnected. Based on a workshop convened by the National Science Foundation in August 2025, we identify three fundamental capability gaps in current AI: the inability to interact with the physical world, inadequate learning that produces brittle systems, and unsustainable energy and data inefficiency. We describe the neuroscience principles that address each: co-design of body and controller, prediction through interaction, multi-scale learning with neuromodulatory control, hierarchical distributed architectures, and sparse event-driven computation. We present a research roadmap organized around these principles at near, mid, and long-term horizons. We argue that realizing this program requires a new generation of researchers trained across the boundary between neuroscience and engineering, and describe the institutional conditions: interdisciplinary training, hardware access, community standards, and ethics, needed to support them. We conclude that NeuroAI, neuroscience-informed artificial intelligence, has the potential to overcome limitations of current AI while deepening our understanding of biological neural computation.

2603.09402 2026-05-05 q-bio.NC cond-mat.dis-nn nlin.CD

Compact Dynamical Mean-Field Theory of Oscillator Networks

Kanishka Reddy

Comments Accepted for publication in Physical Review E

详情
Journal ref
Phys. Rev. E 113, 034222 (2026)
英文摘要

We present a compact dynamical mean-field theory (DMFT) for large networks of coupled phase oscillators whose phases live on the circle $S^1$ and interact with both coherent mean-field coupling and quenched randomness. Starting from wrapped Langevin dynamics, we build a path-integral representation that keeps the $2π$-periodicity of the phases explicit. After averaging over the disorder in the thermodynamic limit, this construction reduces to a single-oscillator stochastic equation driven by a deterministic mean field and a self-consistent colored Gaussian noise, whose covariance is fixed by a circular two-time correlator. In the limit of vanishing disorder, the formalism reproduces the Ott--Antonsen reduction and recovers standard Kuramoto and theta-neuron neural-mass equations. The same framework accommodates arbitrary $2π$-periodic coupling functions, including those obtained from infinitesimal phase response curves (iPRCs) of biophysical neuron models. As an example, we show that for adaptive exponential integrate-and-fire neurons, inserting an iPRC-fitted coupling into the compact DMFT yields quantitative predictions for synchronization thresholds, providing a direct route from single-neuron phase response data to network-level mean-field predictions for arbitrary phase-reducible oscillators.

2603.06768 2026-05-05 q-bio.GN

Benchmarking end-to-end genotype-to-phenotype prediction workflows across 80 openSNP phenotypes

Muhammad Muneeb, David B. Ascher, YooChan Myung, Samuel F. Feng, Andreas Henschel

详情
英文摘要

Genotype-to-phenotype prediction is a central goal of statistical genetics, yet practical comparisons of prediction workflows remain limited in small, heterogeneous, participant-shared genomic datasets. Here, we benchmarked end-to-end case-control prediction across 80 curated binary phenotypes from openSNP using machine learning, deep learning, and polygenic score workflows. We evaluated 29 machine-learning algorithms, 80 deep-learning model variants, and 3 polygenic score tools across 675 clumping and pruning configurations. No workflow family dominated universally. Polygenic score workflows achieved the highest observed discrimination for 53 phenotypes, whereas machine-learning or deep-learning workflows achieved the highest for 27. However, many apparent phenotype-level wins were modest, with 41.2\% of comparisons representing practical ties within five discrimination points. Performance was strongly phenotype-dependent and sensitive to modeling and preprocessing choices. Distinct workflow-specific failure modes were also observed, including unstable behaviour in PRSice and non-informative collapse in lassosum for 13 phenotypes. Higher peak performance was concentrated in smaller phenotypes, reinforcing the need for cautious interpretation in limited-data settings. The cohort was predominantly of European ancestry, restricting generalisability. Together, these results position openSNP as a useful stress-test environment for genomic prediction and support benchmark-guided workflow selection under realistic conditions of data scarcity, phenotype heterogeneity, and ancestry imbalance.

2509.12073 2026-05-05 q-bio.GN q-bio.QM

CEP-IP: An Explainable Framework for Cell Subpopulation Identification in Single-cell Transcriptomics

Kah Keng Wong

Comments 80 pages, 17 figures, 17 supplementary tables, 10 supplementary figures, 103 references

详情
Journal ref
Comput Methods Programs Biomed 282 (2026) 109372
英文摘要

Single-cell RNA sequencing (scRNA-seq) frameworks lack explainable approaches for identifying cell subpopulations harboring strong pairwise monotonic gene-module relationships between a gene of interest (GOI) and its co-expressed genes. CEP-IP is introduced as a novel explainable machine learning framework to address this gap. In the primary dataset, TRPM4 served as the GOI and its co-expressed ribosomal genes (Ribo) were identified via Spearman-Kendall dual-filter (i.e., dual-filtered gene, DFG). Generalized additive modeling quantified TRPM4-Ribo relationship strength via deviance explained (DE), which was then mapped to individual cells via CEP classification to identify top-ranked explanatory power (TREP) cells. TRPM4-Ribo transcriptional space was then stratified into pre-IP and post-IP regions using inflection point (IP) analysis, producing four subpopulations per patient for pathway analysis. TRPM4-Ribo modeling outperformed alternative gene set modules (FDR<0.05). In each prostate cancer (PCa) patient, CEP-IP yielded four cell subpopulations, where pre-IP TREP cells showed enrichment of immune-related processes, and post-IP TREP cells were enriched for ribosomal, translation, and cell adhesion pathways. Validation was performed in the Allen middle temporal gyrus (MTG) and Neftel glioblastoma (GBM) datasets. In the MTG dataset (CARM1P1-DFG module), post-IP TREP cells showed enrichment of neuron projection ontologies. In the GBM dataset, FOXM1 was the sole GOI yielding mesenchymal-state DFGs, with FOXM1-DFG post-IP TREP cells enriched for cell division and microtubule pathways; 3D trajectory analysis demonstrated continuous trajectories of TREP cells that were obscured in 2D embeddings. CEP-IP identifies biologically distinct cell subpopulations in three independent scRNA-seq datasets, and it may be applicable to other pairwise GOI-DFG modules in single-cell transcriptomics.

2507.23057 2026-05-05 eess.SP q-bio.NC

Presurgical Neural Energy Landscapes Predict Postoperative Working Memory Outcome After Brain Tumor Resection

Triet M. Tran, Sina Khanmohammadi

详情
英文摘要

Surgical resection is the primary treatment option for brain tumor patients, but it carries the risk of postoperative cognitive impairments. This study investigates how tumor-induced alterations in presurgical neural dynamics relate to postoperative working memory outcome assessed by Spatial Span (SSP) test. We analyzed functional magnetic resonance imaging (fMRI) of brain tumor patients before surgery and extracted energy landscapes of high-order brain interactions. We then examined the relation between these energy features and postoperative working memory performance using statistical and machine learning (random forest) models. Patients with lower postoperative SSP Scores (2 to 5) exhibited fewer but more extreme transitions between local energy minima and maxima, whereas patients with higher SSP Scores (6 to 9) showed more frequent but less extreme shifts. Furthermore, the presurgical high-order energy features were able to accurately predict postoperative working memory outcome with a mean accuracy of 90%, F1 score of 87.5%, and an AUC of 0.95. Our study suggests that the brain tumor-induced disruptions in high-order neural dynamics before surgery are predictive of postoperative working memory outcome. Our findings pave the path for personalized surgical planning and targeted interventions to mitigate cognitive risks associated with brain tumor resection.

2202.10873 2026-05-05 q-bio.BM cs.LG

Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation

Jinjiang Guo, Qi Liu, Han Guo, Xi Lu

Comments 7 pages, 4 figures

详情
英文摘要

Robust and efficient interpretation of QSAR methods is quite useful to validate AI prediction rationales with subjective opinion (chemist or biologist expertise), understand sophisticated chemical or biological process mechanisms, and provide heuristic ideas for structure optimization in pharmaceutical industry. For this purpose, we construct a multi-layer self-attention based Graph Neural Network framework, namely Ligandformer, for predicting compound property with interpretation. Ligandformer integrates attention maps on compound structure from different network blocks. The integrated attention map reflects the machine's local interest on compound structure, and indicates the relationship between predicted compound property and its structure. This work mainly contributes to three aspects: 1. Ligandformer directly opens the black-box of deep learning methods, providing local prediction rationales on chemical structures. 2. Ligandformer gives robust prediction in different experimental rounds, overcoming the ubiquitous prediction instability of deep learning methods. 3. Ligandformer can be generalized to predict different chemical or biological properties with high performance. Furthermore, Ligandformer can simultaneously output specific property score and visible attention map on structure, which can support researchers to investigate chemical or biological property and optimize structure efficiently. Our framework outperforms over counterparts in terms of accuracy, robustness and generalization, and can be applied in complex system study.