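arXivDaily: a daily academic digest synced with the full arXiv data feed, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.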

2604.04969 2026-04-08 cs.IR cs.AI

MG$^2$-RAG: Multi-Granularity Graph for Multimodal Retrieval-Augmented Generation

Sijun Dai, Qiang Huang, Xiaoxing You, Jun Yu

Abstract

Retrieval-Augmented Generation (RAG) mitigates hallucinations in Multimodal Large Language Models (MLLMs), yet existing systems struggle with complex cross-modal reasoning. Flat vector retrieval often ignores structural dependencies, while current graph-based methods rely on costly "translation-to-text" pipelines that discard fine-grained visual information. To address these limitations, we propose MG$^2$-RAG, a lightweight Multi-Granularity Graph RAG framework that jointly improves graph construction, modality fusion, and cross-modal retrieval. MG$^2$-RAG constructs a hierarchical multimodal knowledge graph by combining lightweight textual parsing with entity-driven visual grounding, enabling textual entities and visual regions to be fused into unified multimodal nodes that preserve atomic evidence. Building on this representation, we introduce a multi-granularity graph retrieval mechanism that aggregates dense similarities and propagates relevance across the graph to support structured multi-hop reasoning. Extensive experiments across four representative multimodal tasks (i.e., retrieval, knowledge-based VQA, reasoning, and classification) demonstrate that MG$^2$-RAG consistently achieves state-of-the-art performance while reducing graph construction overhead with an average 43.3$\times$ speedup and 23.9$\times$ cost reduction compared with advanced graph-based frameworks.

2604.04963 2026-04-08 stat.ML cs.LG

Learning Nonlinear Regime Transitions via Semi-Parametric State-Space Models

Prakul Sunil Hiremath

Comments 12 pages, 1 figure, 2 tables

2604.04961 2026-04-08 stat.ML cs.LG econ.EM math.ST stat.TH

Identification and Inference in Nonlinear Dynamic Network Models

Diego Vallarino

2604.04951 2026-04-08 cs.CR cs.AI

Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud

Muhammad Tahir Ashraf

Comments 15 pages, 3 figures, 2 tables

2604.04949 2026-04-08 cs.IR cs.AI cs.CL

Learning to Retrieve from Agent Trajectories

Yuqi Zhou, Sunhao Dai, Changle Qu, Liang Pang, Jun Xu, Ji-Rong Wen

2604.04947 2026-04-08 cs.IR cs.AI

SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs

Nitish Kumar, Sannu Kumar, S Akash, Manish Gupta, Ankith Karat, Sriparna Saha

2604.04946 2026-04-08 cs.CE cs.LG physics.comp-ph

Sparse Autoencoders as a Steering Basis for Phase Synchronization in Graph-Based CFD Surrogates

Yeping Hu, Ruben Glatt, Shusen Liu

Abstract

Graph-based surrogate models provide fast alternatives to high-fidelity CFD solvers, but their opaque latent spaces and limited controllability restrict use in safety-critical settings. A key failure mode in oscillatory flows is phase drift, where predictions remain qualitatively correct but gradually lose temporal alignment with observations, limiting use in digital twins and closed-loop control. Correcting this through retraining is expensive and impractical during deployment. We ask whether phase drift can instead be corrected post hoc by manipulating the latent space of a frozen surrogate. We propose a phase-steering framework for pretrained graph-based CFD models that combines the right representation with the right intervention mechanism. To obtain a disentangled representation for effective steering, we use sparse autoencoders (SAEs) on frozen MeshGraphNet embeddings. To steer dynamics, we move beyond static per-feature interventions such as scaling or clamping, and introduce a temporally coherent, phase-aware method. Specifically, we identify oscillatory feature pairs with Hilbert analysis, project spatial fields into low-rank temporal coefficients via SVD, and apply smooth time-varying rotations to advance or delay periodic modes while preserving amplitude-phase structure. Using a representation-agnostic setup, we compare SAE-based steering with PCA and raw embedding spaces under the same intervention pipeline. Results show that sparse, disentangled representations outperform dense or entangled ones, while static interventions fail in this dynamical setting. Overall, this work shows that latent-space steering can be extended from semantic domains to time-dependent physical systems when interventions respect the underlying dynamics, and that the same sparse features used for interpretability can also serve as physically meaningful control axes.
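
The rotation step mentioned in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: it assumes an oscillatory feature pair has already been found and reduced to two scalar temporal coefficients per time step (the Hilbert analysis and SVD stages are skipped), and the names `rotate_pair`, `smooth_ramp`, and `steer` are hypothetical. It only shows why a smoothly time-varying 2D rotation shifts the phase of a pair while leaving its amplitude unchanged.

```python
import math

def rotate_pair(a, b, theta):
    """Rotate the coefficient pair (a, b) by theta radians (assumed 2D rotation)."""
    c, s = math.cos(theta), math.sin(theta)
    return c * a - s * b, s * a + c * b

def smooth_ramp(t, t_start, t_end, theta_max):
    """Phase shift rising smoothly (cosine ramp) from 0 to theta_max over [t_start, t_end]."""
    if t <= t_start:
        return 0.0
    if t >= t_end:
        return theta_max
    x = (t - t_start) / (t_end - t_start)
    return theta_max * 0.5 * (1.0 - math.cos(math.pi * x))

def steer(series_a, series_b, t_start, t_end, theta_max):
    """Apply the time-varying rotation to two coefficient series of equal length."""
    out_a, out_b = [], []
    for t, (a, b) in enumerate(zip(series_a, series_b)):
        ra, rb = rotate_pair(a, b, smooth_ramp(t, t_start, t_end, theta_max))
        out_a.append(ra)
        out_b.append(rb)
    return out_a, out_b

# Hypothetical quadrature pair: a = cos(w t), b = sin(w t).
omega = 0.3
a = [math.cos(omega * t) for t in range(50)]
b = [math.sin(omega * t) for t in range(50)]
steered_a, steered_b = steer(a, b, t_start=10, t_end=30, theta_max=math.pi / 4)
```

For a pair in quadrature, a positive `theta_max` advances the oscillation and a negative one delays it. A static intervention such as scaling one coefficient would instead change the amplitude without shifting the phase.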

2604.04936 2026-04-08 cs.IR cs.AI

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

Uday Allu, Sonu Kedia, Tanmay Odapally, Biddwan Ahmed

Comments 13 pages, 9 tables, 0 figures

2604.02656 2026-04-08 stat.ML cs.LG

Transfer Learning for Meta-analysis Under Covariate Shift

Zilong Wang, Ali Abdeen, Turgay Ayer

Comments Accepted to IEEE ICHI 2026 Early Bird Track (Oral Presentation)

2604.01346 2026-04-08 cs.CR cs.AI cs.LG cs.RO

Safety, Security, and Cognitive Risks in World Models

Manoj Parmar

Comments version 2, 29 pages, 1 figure (6 panels), 3 tables. Empirical proof-of-concept on GRU/RSSM/DreamerV3 architectures

2604.00333 2026-04-08 math.NA cs.LG cs.NA physics.comp-ph

MVNN: A Measure-Valued Neural Network for Learning McKean-Vlasov Dynamics from Particle Data

Liyao Lyu, Xinyue Yu, Hayden Schaeffer

2603.29328 2026-04-08 cs.CR cs.AI cs.CV cs.DC cs.LG

Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

Kavindu Herath, Joshua Zhao, Saurabh Bagchi

Comments Accepted as a regular paper at IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2026

2603.26684 2026-04-08 cs.MA cs.RO

Decoupling Geometric Planning and Execution in Scalable Multi-Agent Path Finding

Fernando Salanova, Eduardo Montijano, Cristian Mahulea

Comments 6 pages, 3 figures, WODES conference paper

2603.23448 2026-04-08 cs.SE cs.AI

Code Review Agent Benchmark

Yuntong Zhang, Zhiyuan Pan, Imam Nur Bani Yusuf, Haifeng Ruan, Ridwan Shariffdeen, Abhik Roychoudhury

Abstract

Software engineering agents have shown significant promise in writing code. As AI agents permeate code writing and generate huge volumes of code automatically, the matter of code quality comes front and centre. As the automatically generated code gets integrated into huge code-bases, the issue of code review, and quality assurance more broadly, becomes important. In this paper, we take a fresh look at the problem and curate a code review dataset for AI agents to work with. Our dataset, called c-CRAB (pronounced see-crab), can evaluate agents for code review tasks. Specifically, given a pull request (which could come from code generation agents or humans), if a code review agent produces a review, our evaluation framework can assess the reviewing capability of that agent. Our evaluation framework is used to evaluate the state of the art today: the open-source PR-agent, as well as commercial code review agents from Devin, Claude Code, and Codex. Our c-CRAB dataset is systematically constructed from human reviews: given a human review of a pull request instance, we generate corresponding tests to evaluate the reviews generated by code review agents. Such a benchmark construction gives us several insights. Firstly, the existing review agents taken together can solve only around 40% of the c-CRAB tasks, indicating the potential to close this gap through future research. Secondly, we observe that the agent reviews often consider different aspects from the human reviews, indicating the potential for human-agent collaboration in code review that could be deployed in future software teams. Last but not least, the agent-generated tests from our dataset act as a held-out test suite and hence a quality gate for agent-generated reviews. What this will mean for future collaboration of code generation agents, test generation agents, and code review agents remains to be investigated.

2603.20231 2026-04-08 cs.CY cs.AI cs.CL

Moral Mazes in the Era of LLMs

Dang Nguyen, Harvey Yiyun Fu, Peter West, Ari Holtzman, Chenhao Tan

Comments 47 pages (including appendix), 7 figures, 2 tables in the main body. v2: updated title and abstract

2603.11519 2026-04-08 cs.HC cs.CV

Prediction of Grade, Gender, and Academic Performance of Children and Teenagers from Handwriting Using the Sigma-Lognormal Model

Adrian Iste, Kazuki Nishizawa, Chisa Tanaka, Andrew Vargo, Anna Scius-Bertrand, Andreas Fischer, Koichi Kise

Comments 18 pages, 8 figures

2603.07339 2026-04-08 cs.HC cs.AI cs.CE

Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice

Prerna Ravi, Om Gokhale, Suyash Fulay, Eugene Yi, Deb Roy, Michiel Bakker

Comments Short version: Accepted to ACM CHI Extended Abstracts 2026 (https://doi.org/10.1145/3772363.3798888); Long version under review

2603.02050 2026-04-08 cs.HC cs.AI

"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction

Kihoon Son, Hyewon Lee, DaEun Choi, Yoonsu Kim, Tae Soo Kim, Yoonjoo Lee, John Joon Young Chung, HyunJoon Jung, Juho Kim

Comments Check the demo videos on the website: https://cleo.kixlab.org/

2602.16000 2026-04-08 physics.med-ph cs.LG

Imaging-Derived Coronary Fractional Flow Reserve: Advances in Physics-Based, Machine Learning, and Physics-Informed Methods

Tanxin Zhu, Emran Hossen, Chen Zhao, Jingfeng Jiang, Michele Esposito, Jiguang Sun, Weihua Zhou

Comments 32 pages, 4 tables

Abstract

Purpose of Review: Imaging derived fractional flow reserve (FFR) is rapidly evolving beyond conventional computational fluid dynamics (CFD) based pipelines toward machine learning (ML), deep learning (DL), and physics informed approaches that enable fast, wire free, and scalable functional assessment of coronary artery stenosis. This review synthesizes recent advances in computed tomography (CT)- and angiography-based FFR measurement, with particular emphasis on emerging physics-informed neural networks and neural operators (PINNs and PINOs), as well as key considerations for their clinical translation. Recent Findings: ML/DL approaches have markedly improved automation and computational speed, enabling prediction of pressure and FFR from anatomical descriptors or angiographic contrast dynamics. However, their real-world performance and generalizability can remain variable and sensitive to domain shift, due to multi-center heterogeneity, interpretability challenges, and differences in acquisition protocols and image quality. Physics informed learning introduces conservation structure and boundary condition consistency into model training, improving generalizability and reducing dependence on dense supervision while maintaining rapid inference. Recent evaluation trends increasingly highlight deployment oriented metrics, including calibration, uncertainty quantification, and quality control gatekeeping, as essential for safe clinical use. Summary: The field is converging toward imaging derived FFR methods that are faster, more automated, and more reliable. While ML/DL offers substantial efficiency gains, physics informed frameworks such as PINNs and PINOs may provide a more robust balance between speed and physical consistency. Prospective multi center validation and standardized evaluation will be critical to support broad and safe clinical adoption.

2602.04816 2026-04-08 cs.OS cs.CL cs.DC

Horizon-LM: A RAM-Centric Architecture for LLM Training

Zhengqing Yuan, Lichao Sun, Yanfang Ye

Comments This paper contained an error in the throughput computation used in the experimental evaluation. Specifically, the TFLOPS calculation omitted the 12HL term in the training FLOPs formula, which led to systematic underestimation of the reported throughput numbers in the experimental results. We are withdrawing this version to correct the evaluation and avoid confusion for readers

2602.04728 2026-04-08 eess.SP cs.IT cs.LG math.IT

Scalable Cross-Attention Transformer for Cooperative Multi-AP OFDM Uplink Reception

Xavier Tardy, Grégoire Lefebvre, Apostolos Kountouris, Haïfa Fares, Amor Nafkha

Comments 7 pages, 3 figures, 2 tables, conference submission

2602.00185 2026-04-08 cond-mat.mtrl-sci cs.AI

QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

Fengxu Yang, Jack D. Evans

Comments 14 pages, 2 figures

2601.20167 2026-04-08 quant-ph cs.AI cs.IT math.IT

Contextuality as an External Bookkeeping Cost under Fixed Shared-State Semantics

Song-Ju Kim

Comments 5 pages, 0 figures

2601.12630 2026-04-08 physics.chem-ph cond-mat.mtrl-sci cs.LG physics.comp-ph

Enhanced Climbing Image Nudged Elastic Band method with Hessian Eigenmode Alignment

Rohit Goswami, Miha Gunde, Hannes Jónsson

Comments 43 pages. 8 main, 32 supplementary figures

2601.12614 2026-04-08 physics.space-ph cs.LG physics.plasm-ph

Deterministic and probabilistic neural surrogates of global hybrid-Vlasov simulations

Daniel Holmberg, Ivan Zaitsev, Markku Alho, Ioanna Bouri, Fanni Franssila, Haewon Jeong, Minna Palmroth, Teemu Roos

2601.11652 2026-04-08 cs.DC cs.AI

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching

Xiangchen Li, Jiakun Fan, Qingyuan Wang, Dimitrios Spatharakis, Saeid Ghafouri, Hans Vandierendonck, Deepu John, Bo Ji, Ali R. Butt, Dimitrios S. Nikolopoulos

Comments 31 Pages, 13 Figures, 13 Tables

2601.03323 2026-04-08 cs.GR cs.CV cs.HC cs.LG cs.SD

Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset

Oran Duan, Yinghua Shen, Yingzhu Lv, Luyang Jie, Yaxin Liu, Qiong Wu

Comments 12 pages, 13 figures

2512.17239 2026-04-08 cs.SI cs.AI cs.CY

Privacy-Preserving Synthetic Dataset of Individual Daily Trajectories for City-Scale Mobility Analytics

Jun'ichi Ozaki, Ryosuke Susuta, Takuhiro Moriyama, Yohei Shida

Comments 9 pages, 4 figures

DOI: 10.1109/BigData66926.2025.11401071

Abstract

Urban mobility data are indispensable for urban planning, transportation demand forecasting, pandemic modeling, and many other applications; however, individual mobile phone-derived Global Positioning System traces cannot generally be shared with third parties owing to severe re-identification risks. Aggregated records, such as origin-destination (OD) matrices, offer partial insights but fail to capture the key behavioral properties of daily human movement, limiting realistic city-scale analyses. This study presents a privacy-preserving synthetic mobility dataset that reconstructs daily trajectories from aggregated inputs. The proposed method integrates OD flows with two complementary behavioral constraints: (1) dwell-travel time quantiles that are available only as coarse summary statistics and (2) the universal law for the daily distribution of the number of visited locations. Embedding these elements in a multi-objective optimization framework enables the reproduction of realistic distributions of human mobility while ensuring that no personal identifiers are required. The proposed framework is validated in two contrasting regions of Japan: (1) the 23 special wards of Tokyo, representing a dense metropolitan environment; and (2) Fukuoka Prefecture, where urban and suburban mobility patterns coexist. The resulting synthetic mobility data reproduce dwell-travel time and visit frequency distributions with high fidelity, while deviations in OD consistency remain within the natural range of daily fluctuations. The results of this study establish a practical synthesis pathway under real-world constraints, providing governments, urban planners, and industries with scalable access to high-resolution mobility data for reliable analytics without the need for sensitive personal records, and supporting practical deployments in policy and commercial domains.

2512.15921 2026-04-08 eess.IV cs.CV

In search of truth: Evaluating concordance of AI-based anatomy segmentation models

Lena Giebeler, Deepa Krishnaswamy, David Clunie, Jakob Wasserthal, Lalith Kumar Shiyam Sundar, Andres Diaz-Pinto, Klaus H. Maier-Hein, Murong Xu, Bjoern Menze, Steve Pieper, Ron Kikinis, Andrey Fedorov

DOI: 10.1117/1.jmi.13.6.062204

Abstract

Purpose: AI-based methods for anatomy segmentation can help automate characterization of large imaging datasets. The growing number of functionally similar models raises the challenge of evaluating them on datasets that do not contain ground truth annotations. We introduce a practical framework to assist in this task. Approach: We harmonize the segmentation results into a standard, interoperable representation, which enables consistent, terminology-based labeling of the structures. We extend 3D Slicer to streamline loading and comparison of these harmonized segmentations, and demonstrate how a standard representation simplifies review of the results using interactive summary plots and browser-based visualization using OHIF Viewer. To demonstrate the utility of the approach, we apply it to evaluating segmentation of 31 anatomical structures (lungs, vertebrae, ribs, and heart) by six open-source models (TotalSegmentator 1.5 and 2.6, Auto3DSeg, MOOSE, MultiTalent, and CADS) for a sample of Computed Tomography (CT) scans from the publicly available National Lung Screening Trial (NLST) dataset. Results: We demonstrate the utility of the framework in enabling automated loading, structure-wise inspection, and comparison across models. Preliminary results confirm the practical utility of the approach in allowing quick detection and review of problematic results. The comparison shows excellent agreement in segmenting some structures (e.g., lung) but not all (e.g., some models produce invalid vertebrae or rib segmentations). Conclusions: The resources developed are linked from https://imagingdatacommons.github.io/segmentation-comparison/ including segmentation harmonization scripts, summary plots, and visualization tools. This work assists in model evaluation in the absence of ground truth, ultimately enabling informed model selection.

2512.10785 2026-04-08 physics.ed-ph cs.AI cs.HC

Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving

Holger Maus, Paul Tschisgale, Fabian Kieser, Stefan Petersen, Peter Wulff