arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.00937 2026-05-05 physics.flu-dyn cs.LG

An ALE-Consistent Graph Neural Operator-Transformer Framework for Fluid-Structure Interaction

Shihang Zhao, Martín Saravia, Haokui Jiang, Zhiyang Xue, Shunxiang Cao

Comments 29 pages, 20 figures

详情

英文摘要

We propose an arbitrary Lagrangian-Eulerian (ALE)-consistent machine learning framework for long-term fluid-structure interaction (FSI) prediction on deforming unstructured meshes. Specifically, the fluid dynamics are modeled by a surrogate that combines a graph neural operator (GNO) with a vision Transformer (ViT) for spatiotemporal prediction, while a lightweight long short-term memory (LSTM) network predicts structural kinematics at the interface. The two surrogates are coupled through a standard partitioned procedure. Most importantly, kinematic compatibility at the moving interface is enforced via an ALE-consistent boundary-correction step that updates the fluid-side interface velocity with the predicted structural velocity at each coupling update, thereby improving near-interface accuracy and long-term rollout stability. To mitigate autoregressive error accumulation, a two-stage training strategy is adopted, consisting of single-step supervised pretraining followed by long-term autoregressive fine-tuning. The proposed framework is validated on the benchmark problem of a flexible beam vibration in the wake of a cylinder. Results demonstrate accurate phase-consistent predictions over long rollouts and robust generalization under inlet-profile variations in both interpolation and extrapolation settings. Systematic ablation studies further assess the respective contributions of the ViT module, ALE-consistent boundary correction, and long-term training to predictive accuracy and rollout robustness.

URL PDF HTML ☆

赞 0 踩 0

2605.00930 2026-05-05 q-bio.GN cs.AI

CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation

Andac Demir, Erik W. Anderson, Jeremy L. Jenkins, Srayanta Mukherjee

2605.00923 2026-05-05 eess.IV cs.CV

A Proof-of-Concept Study of Multitask Learning for Cranial Synthetic CT Generation Across Heterogeneous MRI Field Strengths

Zhuoyao Xin, Yiren Zhang, Christopher Wu, Dong Liu, Chunming Gu, Elena Greco, Erik H. Middlebrooks, Jun Hua, Jia Guo

Comments Published in Medical Physics (2026). DOI: 10.1002/mp.70429

2605.00922 2026-05-05 cs.SE cs.AI

To Vibe Research or Not to Vibe Research? Generative AI in Qualitative Research

Katja Karhu, Kari Smolander, Jussi Kasurinen

Comments 13 pages, 2 figures. Accepted to VibeX 2026: 1st International Workshop on Vibe Coding and Vibe Researching

2605.00914 2026-05-05 cs.MA cs.AI

The Cost of Consensus: Isolated Self-Correction Prevails Over Unguided Homogeneous Multi-Agent Debate

Blaž Bertalanič, Carolina Fortuna

Comments 19 pages, ACM Conference on AI and Agentic Systems

详情

DOI: 10.1145/3786335.3813137

英文摘要

Multi-agent debate, where teams of LLMs iteratively exchange rationales and vote on answers, is widely deployed under the assumption that peer review filters hallucinations. Yet the failure dynamics of homogeneous debate remain poorly understood, therefore we report findings from a controlled empirical study of teams of $N{=}10$ homogeneous agents (Qwen2.5-7B, Llama-3.1-8B, Ministral-3-8B) across $R{=}3$ debate rounds on two high-difficulty benchmarks (GSM-Hard and MMLU-Hard). We compare peer debate against isolated self-correction and a stochastic noise control that injects rationales from unrelated problems. We decompose debate failure into three model-dependent pathways: sycophantic conformity, where agents uncritically adopt majority answers (modal adoption up to 85.5%); contextual fragility, where peer rationales destabilize previously correct reasoning (vulnerability rate up to 70.0%); and consensus collapse, where plurality voting discards correct answers already present in the generation pool (oracle gap up to 32.3 percentage points). Ablations over communication density ($K \in \{2,4,9\}$) and sampling temperature ($T \in \{0.4, 0.7\}$) show that conformity reaches high levels at minimal peer exposure ($K{=}2$) and intensifies with greater initial diversity. Across all configurations, debate consumes 2.1-3.4$\times$ more tokens (up to 28,631 tokens per problem) than self-correction for equal or lower accuracy. Our results indicate that, within the 7-8B parameter class, homogeneous teams without structured roles do not benefit from unguided peer exchange, and that isolated self-correction consistently offers a more favorable cost-accuracy tradeoff.

URL PDF HTML ☆

赞 0 踩 0

2605.00897 2026-05-05 eess.SP cs.CV eess.IV

SPAT: A Semantic Port-Aware Adaptive-Rate Transmission Protocol for Semantic Communication

Yunhao Wang, Shuai Ma, Bin Shen, Shouhan Shi, Youlong Wu, Guangming Shi, Xiang Cheng

2605.00895 2026-05-05 eess.SP cs.AI cs.LG

Transfer Learning for Tonal Noise Prediction in VRF Units Using Thermodynamic and Vibration Signals

ZhiWei Su, Ding Wang, Yuan Guo, Yang Qiao, HongJun Cao

2605.00881 2026-05-05 eess.IV cs.CV physics.med-ph

A Coupled Fourth Order Telegraph Diffusion Framework Using Grayscale Indicators for Image Despeckling

Manish Kumar, Rajendra K. Ray

2605.00877 2026-05-05 cs.MM cs.AI cs.CL cs.CV cs.LG

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

Yida Xue, Ningyu Zhang, Tingwei Wu, Zhe Ma, Daxiong Ji, Zhao Wang, Guozhou Zheng, Huajun Chen

Comments Work in progress

2605.00872 2026-05-05 eess.SP cs.AI cs.CV

Multi-View Hierarchical Representation Learning of Fetal Hemodynamics for Maternal Hypertension Detection at the Edge

Alireza Rafiei, Anahí Venzor Strader, Esteban Castro Aragón, Victoriana Rosibely Sut Serech, Enma Carolina Coyote Ixen, Reza Sameni, Peter Rohloff, Gari D. Clifford, Nasim Katebi

2605.00871 2026-05-05 eess.SP cs.AI cs.CV cs.LG

NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals

Badri N. Patro, Vijay S. Agneeswaran

Comments Accepted CVPR Finding Track

2605.00870 2026-05-05 eess.SP cs.AI cs.LG

An Algorithm for On-Sensor Agnostic Detection of Changes in Human Activity for Ultra-Low-Power Applications

Sara Rimoldi, Arianna De Vecchi, Hazem Hesham Yousef Shalby, Federica Villa

Comments Accepted to 2026 International Conference on Automatic Face and Gesture Recognition (FG)

2605.00869 2026-05-05 eess.SP cs.CV cs.LG

Robust Cross-Domain WiFi Fall Detection via Physics-Driven Attention-Enhanced Transformers

Yingzhe Wang, Cunhua Pan, Ruijing Liu, Shaokai Li, Hong Ren, Kezhi Wang, Jiangzhou Wang

2605.00865 2026-05-05 eess.SP cs.CL cs.CV cs.LG cs.SD q-bio.NC

How Well Can We Decode Vowels from Auditory EEG -- A Rigorous Cross-Subject Benchmark with Honest Assessment

Xiaoyang Li

Comments 31 pages, 11 figures; includes supplementary material (14 pages, additional figures and analyses)

2605.00861 2026-05-05 eess.AS cs.AI eess.SP

Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment

Huanchen Cai, Sten Ternström

2605.00860 2026-05-05 physics.ao-ph cs.LG

An Adaptive Spatiotemporal Clustering Framework for 3D Ocean Subsurface Temperature Reconstruction

Ming Shan Loo, Wengen Li, Xudong Jiang, Hailiang Cheng, Zhifei Zhang, Jihong Guan, Yichao Zhang

2605.00858 2026-05-05 eess.SP cs.CE cs.LG

A Hybrid Windkessel-Neural Approach for Improved Noninvasive Blood Pressure Monitoring

Vaibhav Gollapalli, Aniruth Ananthanarayanan

2605.00857 2026-05-05 eess.SP cs.AI cs.LG q-bio.NC

Foundation Model Guided Dual-Branch Co-Adaptation for Source-Free EEG Decoding

Peiliang Gong, Han Zhang, Zhen Jiang, Chenyu Liu, Ziyu Jia, Xinliang Zhou, Daoqiang Zhang, Xiaoli Li

2605.00855 2026-05-05 math.OC cs.LG stat.ML

An Efficient Spatial Branch-and-Bound Algorithm for Global Optimization of Gaussian Process Posterior Mean Functions

Wei-Ting Tang, Akshay Kudva, Calvin Tsay, Joel A. Paulson

2605.00850 2026-05-05 physics.ao-ph cs.AI cs.LG eess.IV

Earth System Foundation Model (ESFM): A unified framework for heterogeneous data integration and forecasting

Firat Ozdemir, Yun Cheng, Salman Mohebi, Fanny Lehmann, Simon Adamov, Zhenyi Zhang, Leonardo Trentini, Dana Grund, Oliver Fuhrer, Torsten Hoefler, Siddhartha Mishra, Sebastian Schemm, Benedikt Soja, Mathieu Salzmann

Comments ESFM is available on https://github.com/swiss-ai/ESFM. 48 pages, 29 figures, 18 tables

详情

英文摘要

Foundation models (FMs) for the Earth system learn statistical relationships between physical variables across massive datasets to enable versatile downstream applications through finetuning, separating them from task-specific weather models. Here, we introduce Earth System Foundation Model (ESFM), a fully open model building on the 3D Swin UNet backbone of the pioneering Aurora model. ESFM introduces extensions that increase functionality and foster adoption in climate sciences. First, the encoding scheme and training protocols have been extended to handle diverse datasets, including those containing missing values across all spatio-temporal dimensions such as satellite data, as well as station data, all under one backbone. Axial attention is introduced to capture inter-variable dependencies. As a result ESFM skillfully predicts variables in regions or on pressure levels where no data is present at the initial time, while preserving inter-variable relationships, for example between temperature, pressure, and humidity. Individual variable tokenization enables different sets of variables to be shuffled during training and simplifies the process of building extensions for new downstream tasks. Adaptive layer norm-based ensembles allow for a simple yet effective way to transform deterministic ESFM to a probabilistic FM. We present findings using dense gridded data (ERA5, CMIP6), regionally masked dense data, sparse gridded MODIS satellite data, and station data. Results demonstrate competitive or superior performance relative to state-of-the-art benchmarks. Case studies of Super Typhoon Doksuri (2023) and 2024 sudden stratospheric warming events show accurate positional and magnitude estimations of extreme weather. ESFM retains the strengths of previous foundation models, such as long-term stability, but facilitates application to a variety of downstream tasks.

URL PDF HTML ☆

赞 0 踩 0

2605.00849 2026-05-05 eess.SP cs.LG cs.SY eess.SY

Deep Learning for Multi-Antenna Modulation Recognition of Radio Signals

Tao Chen, Shilian Zheng, Jiepeng Chen, Zhangbin Pei, Qi Xuan, Xiaoniu Yang

2605.00845 2026-05-05 cs.DB cs.AI cs.CL

Graph Query Generation with Constraint-guided Large Language Agents

Mengying Wang, Nicolaas Jedema, Rahul Pandey, RaviKiran Krishnan, Jens Lehmann, Yinghui Wu

Comments 42nd IEEE International Conference on Data Engineering (ICDE)

2605.00844 2026-05-05 cs.CY cs.AI

The Oracle's Fingerprint: Correlated AI Forecasting Errors and the Limits of Bias Transmission

Theodor Spiro

Comments 23 pages, 3 figures, 5 tables

2605.00843 2026-05-05 cs.CY cs.AI

Generative-AI and the transformation of workforce. A job postings-driven analysis

Diana Maria Popa, Simona-Vasilica Oprea, Adela Bâra

2605.00838 2026-05-05 cs.NI cs.LG

Adaptive Alarm Threshold Prediction in 4G Mobile Networks: A Percentile-Guided Deep Learning Framework with Interpretable Outputs

Ayon Roy, Sadman Sharif, Shiva Prasad Sarkar

Comments 21 pages, 8 figures, preprint

2605.00831 2026-05-05 cs.DC cs.AI cs.PF

GhostServe: A Lightweight Checkpointing System in the Shadow for Fault-Tolerant LLM Serving

Shakya Jayakody, Youpeng Zhao, Chinmay Dhanraj Nehate, Jun Wang

Comments MLSys 2026

2605.00827 2026-05-05 cs.DC cs.AI cs.SE

Separating Intelligence from Execution: A Workflow Engine for the Model Context Protocol

Abhinav Singh Parmar

Comments 16 pages, 5 figures

2605.00826 2026-05-05 cs.IR cs.CV cs.LG cs.MM

Understanding the Performance Plateau in Text-to-Video Retrieval: A Comprehensive Empirical and Linguistic Analysis

Maria-Eirini Pegia, Dimitrios Stefanopoulos, Björn Þór Jónsson, Anastasia Moumtzidou, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris

Comments Survey, 50 pages, 15 figures, 13 tables, 154 citations

2604.27947 2026-05-05 cs.NE cs.AI cs.LG cs.LO

Attractor FCM

Alexis Kafantaris

2604.23940 2026-05-05 cs.SE cs.AI

Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery

Yifan Zhang, Xiaohan Wang, Yueke Zhang, Yu Huang, Kevin Leach