arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.00177 2026-05-04 cs.GR cs.CV

FieryGS: In-the-Wild Fire Synthesis with Physics-Integrated Gaussian Splatting

Qianfan Shen, Ningxiao Tao, Qiyu Dai, Tianle Chen, Minghan Qin, Yongjie Zhang, Mengyu Chu, Wenzheng Chen, Baoquan Chen

Comments ICLR 2026

详情

英文摘要

We consider the problem of synthesizing photorealistic, physically plausible combustion effects in in-the-wild 3D scenes. Traditional CFD and graphics pipelines can produce realistic fire effects but rely on handcrafted geometry, expert-tuned parameters, and labor-intensive workflows, limiting their scalability to the real world. Recent scene modeling advances like 3D Gaussian Splatting (3DGS) enable high-fidelity real-world scene reconstruction, yet lack physical grounding for combustion. To bridge this gap, we propose FieryGS, a physically-based framework that integrates physically-accurate and user-controllable combustion simulation and rendering within the 3DGS pipeline, enabling realistic fire synthesis for real scenes. Our approach tightly couples three key modules: (1) multimodal large-language-model-based physical material reasoning, (2) efficient volumetric combustion simulation, and (3) a unified renderer for fire and 3DGS. By unifying reconstruction, physical reasoning, simulation, and rendering, FieryGS removes manual tuning and automatically generates realistic, controllable fire dynamics consistent with scene geometry and materials. Our framework supports complex combustion phenomena -- including flame propagation, smoke dispersion, and surface carbonization -- with precise user control over fire intensity, airflow, ignition location and other combustion parameters. Evaluated on diverse indoor and outdoor scenes, FieryGS outperforms all comparative baselines in visual realism, physical fidelity, and controllability. Project page can be found at https://pku-vcl-geometry.github.io/FieryGS/.

URL PDF HTML ☆

赞 0 踩 0

2605.00176 2026-05-04 stat.ML cs.LG

SHIFT: Robust Double Machine Learning for Average Dose-Response Functions under Heavy-Tailed Contamination

Eichi Uehara

Comments 77 pages, 43 figures, 35 tables. Code and raw CSVs: https://github.com/EichiUehara/ADRF-Robust-DML

2605.00171 2026-05-04 stat.ML cs.LG stat.AP

Adaptive Norm-Based Regularization for Neural Networks

Muhammad Qasim, Farrukh Javed

Comments 37 pages, 9 figures

2605.00169 2026-05-04 cs.NI cs.DC cs.LG

Network Digital Untwinning: Towards Backward Optimization of Digital Twins

Zifan Zhang, Dianwei Chen, Anjun Gao, Manhua Wang, Mingzhe Chen, Minghong Fang, Xianfeng Yang, Yuchen Liu

Comments Accepted by ICDCS 2026

2605.00107 2026-05-04 quant-ph cs.LG

Efficient Mutation Testing of Quantum Machine Learning Models

Emma Andrews, Prabhat Mishra

2605.00099 2026-05-04 quant-ph cs.LG stat.ML

Provable and scalable quantum Gaussian processes for quantum learning

Jonas Jäger, Paolo Braccia, Pablo Bermejo, Manuel G. Algaba, Diego García-Martín, M. Cerezo

Comments 18 + 70 pages, 5 + 14 figures, 2 tables

2605.00087 2026-05-04 cs.NI cs.AI cs.CY cs.IR cs.LG

DeGenTWeb: A First Look at LLM-dominant Websites

Sichang Steven He, Calvin Ardi, Ramesh Govindan, Harsha V. Madhyastha

Comments 6 pages, 6 figures, 13 page total; in submission

2605.00072 2026-05-04 cs.CR cs.AI

XekRung Technical Report

Jiutian Zeng, Junjie Li, Chengwei Dai, Jie Liang, Zhaoyu Hu, Yiliang Zhang, Ziang Weng, Longtao Huang, Dongjie Zhang, Libin Dong, Yang Ge, Yuanda Wang, Kaiwen Lv Kacuila, Bingyu Zhu, Jing Wang, Jin Xu

Comments 22 pages, 2 figures, 5 tables. Jiutian Zeng, Junjie Li, Chengwei Dai, Jie Liang, and Zhaoyu Hu contributed equally to this work

2605.00071 2026-05-04 cs.CR cs.AI cs.CE cs.MA

Compliance-Aware Agentic Payments on Stablecoin Rails

Kenneth See, Xue Wen Tan

Comments Demo Paper Track

2605.00063 2026-05-04 cs.IR cs.AI

A Survey of Reasoning-Intensive Retrieval: Progress and Challenges

Yiyang Wei, Tingyu Song, Siyue Zhang, Yilun Zhao

Comments Accepted to the ACL 2026 Main Conference; camera-ready version

2605.00058 2026-05-04 cs.AR cs.LG

Autoformalizing Memory Specifications with Agents

Jan Ole Ernst, Dmitri Michelangelo Saberi, Derek Christ, Thomas Zimmermann, Rajath Salegame, Suhaas M. Bhat, Stanislav Levental, Thomas Dybdahl Ahle, Matthias Jung

2605.00055 2026-05-04 cs.CR cs.AI cs.MA

Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure

Diego F. Cuadros, Abdoul-Aziz Maiga

2605.00043 2026-05-04 cs.DB cs.AI cs.MA

SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms

Yu Shen, Shiyang Liu, Qihang He, Yihang Cheng, Haining Xie, Zhiming He, Huahua Fan, Xianzhi Tan, Teng Ma, Shaoquan Zhang, Danqing Huang, Fan Jiang, Yang Li, Chongqing Zhao, Peng Chen, Jie Jiang, Bin Cui

2605.00033 2026-05-04 q-bio.NC cs.AI cs.HC cs.LG eess.IV

Sure About That Line? Approaching Confidence-Based, Real-Time Line Assignment in Reading Gaze Data

Franziska Kaltenberger, Wei-Ling Chen, Enkeleda Thaqi, Enkelejda Kasneci

Comments Accepted at ETRA 2026. To appear in Proceedings of the ACM on Computer Graphics and Interactive Techniques. 21 pages, 12 figures

2605.00032 2026-05-04 cs.AR cs.LG

ROSA: Robust and Energy-Efficient Microring-Based Optical Neural Networks via Optical Shift-and-Add and Layer-Wise Hybrid Mapping

Huifan Zhang, Yun Hu, Caizhi Sheng, Yurui Qu, Pingqiang Zhou

2605.00029 2026-05-04 eess.IV cs.CV physics.optics

Broadband Wide Field of View Imaging with Computational Mirrors

Vishwanath Saragadam, Niki Nezakati, Amit Roy-Chowdhury, Vivek Boominathan

2605.00015 2026-05-04 eess.SP cs.AI cs.CV cs.LG

TimeRFT: Stimulating Generalizable Time Series Forecasting for TSFMs via Reinforcement Finetuning

Siyang Li, Yize Chen, Zijie Zhu, Yuxin Pan, Yan Guo, Ming Huang, Hui Xiong

Comments 14 pages, 6 figures, In Submission

2605.00012 2026-05-04 cs.IR cs.AI cs.CL

Exploring LLM biases to manipulate AI search overview

Roman Smirnov

Comments 14 pages, 7 figures

2605.00007 2026-05-04 math.OC cs.AI stat.ML

Mean-Field Path-Integral Diffusion: From Samples to Interacting Agents

Michael Chertkov

Comments 31 pages, 14 figures

2604.28139 2026-05-04 cs.SE cs.AI

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Chenxin Li, Zhengyang Tang, Mingxin Huang, Yunlong Lin, Shijue Huang, Shengyuan Liu, Bowen Ye, Rang Li, Lei Li, Benyou Wang, Yixuan Yuan

Comments Project page: https://claw-eval-live.github.io

2604.27855 2026-05-04 cs.DC cs.AI

AI Inference as Relocatable Electricity Demand: A Latency-Constrained Energy-Geography Framework

Xubin Luo, Cheng Yang

Comments 29 pages, 3 figures, 8 tables; preprint

2604.27209 2026-05-04 cs.SE cs.AI

Theory Under Construction: Orchestrating Language Models for Research Software Where the Specification Evolves

Halley Young, Nikolaj Björner

2604.25372 2026-05-04 math.OC cs.LG cs.NA cs.SY eess.SY math.NA

From Cursed to Competitive: Closing the ZO-FO Gap via Input-to-State Stability

Amir Ali Farzin, Philipp Braun, Iman Shames

2604.24415 2026-05-04 cs.SI cs.CV eess.SP

Phase-Separated Complex Hilbert PCA on Markerless 3D Pose Estimation Data: A Global Phase Network and Its Extension to a Continuous Field on the Body Surface

Hiromitsu Goto, Tao Tao, Zheng-Lin Chia

Comments 19 pages, 8 figures, 6 tables. Extended English version of a paper to be submitted to Transactions of the Japanese Society for Artificial Intelligence (JSAI; Special Issue on Emerging Topics in Sports Informatics). v2: corrected reference metadata for 8 entries (Kichikawa+18 -> Iyetomi+20); minor wording revisions in Sections 1, 3.4, 3.5, 4.2-4.5; no change to results or main claims

详情

英文摘要

Quantitative analysis of the kinematic chain in sports motion is essential for performance evaluation and injury prevention. Conventional methods such as the kinematic-sequence (KS) and continuous relative phase (CRP) are confined to adjacent joint pairs and lack a unified framework for whole-body coordination, while segmental power-flow analysis requires force plates and inertial parameters that restrict it to laboratory environments. We apply Complex Hilbert Principal Component Analysis (CHPCA) separately to each motion phase (backswing and downswing) on markerless 3D pose estimation data, extracting the dominant whole-body phase pattern as a single complex eigenvector. The pipeline further includes a fully automatic signal-based phase segmentation (no priors on strike count or rest location) and an extension to 1,079 body-surface mesh vertices, so that the kinematic chain is represented as a continuous phase field across the body. On 14 hammer-striking trials of a single subject, the framework reveals (i) a trunk-anchored global phase architecture, (ii) a functional asymmetry between preparation and execution phases quantified by Mode-1 contribution (45.5% vs. 70.5%) and inter-trial Spearman consistency (0.38 vs. 0.58), and (iii) a consistent reorganisation across both skeletal joints and mesh vertices ($p < 10^{-10}$ on 1,079 vertices). As a methodological consistency check, pairwise phase differences from the Mode-1 eigenvector are compared against CRP on all 190 joint pairs by a permutation test ($ρ= 0.473$, $p = 0.0005$). A correspondence analysis between Mode-1 amplitude and kinetic-energy mobilisation variance further shows a strong positive correlation in the downswing ($ρ\approx 0.71$ on both skeleton and mesh) and no correlation in the backswing, indicating that the proposed framework bridges kinematic and kinetic descriptions of coordination through phase structure.

URL PDF HTML ☆

赞 0 踩 0

2604.21960 2026-05-04 eess.IV cs.CV cs.LG

Conditional Diffusion Posterior Alignment for Sparse-View CT Reconstruction

Luis Barba, Johannes Kirschner, Benjamin Bejar

2604.16345 2026-05-04 cs.HC cs.AI

Bridging the Experimental Last Mile: Digitizing Laboratory Know-How for Safe AI-Assisted Support

Akira Miura, Yuki Sasahara, Momoka Demura, Yuji Masubuchi, Tetsuya Asai, Chikahiko Mitsui

Comments 32 pages in total (main 13 pages, appendix 19 pages), 2 main figures, 1 main table

详情

英文摘要

While advances in materials informatics have accelerated the development of Self-Driving Laboratories (SDLs), human-led experiments remain standard in many educational and exploratory research laboratories. In specific lab settings, formal documentation alone is often insufficient for safe and reliable operation. We refer to the gap between formal documentation and reliable execution in such settings as the experimental last mile; this gap mainly involves site-specific operational know-how, including local rules, routine checks, procedural details, and safety-conscious actions that are can be verbalizable but are often under-documented in standard manuals. In this proof-of-concept study, we developed a human-in-the-loop AI assistant that combines first-person experimental video, multimodal AI, and retrieval-augmented generation (RAG). Using powder X-ray diffraction experiments and student-recorded video data as inputs, the system extracts site-specific laboratory knowledge from recorded procedures, including physical techniques and audible confirmation that conventional manuals could omit. It then provides grounded responses based on the resulting manual. To reduce the risk of unsupported outputs, the system employs a two-layer safety design: source restriction through RAG and strict system-prompt constraints. Instructor-based evaluation showed alignment with expected guidance for questions covered by the manual. For out-of-scope queries, the system appropriately refused to answer, indicating a reduced risk of hallucination. Expert evaluation further indicated that the generated advisory reports were useful and safe (utility: 3.25/4.00; safety: 4.00/4.00). These results suggest the feasibility of a framework for bridging the experimental last mile in which AI supports laboratory practice under explicit human supervision rather than replacing human judgment.

URL PDF HTML ☆

赞 0 踩 0

2604.04567 2026-05-04 stat.ML cs.LG

Generative Modeling under Non-Monotone MAR Missingness via Approximate Wasserstein Gradient Flows

Gitte Kremling, Jeffrey Näf, Johannes Lederer

2604.00070 2026-05-04 eess.IV cs.AI cs.CV

Brain MR Image Synthesis with 3D Multi-Contrast Self-Attention GAN

Zaid A. Abod, Furqan Aziz

Comments Note: This work has been submitted to the IEEE for possible publication

详情

英文摘要

Complete and high-quality multi-modal Magnetic Resonance Imaging (MRI) is essential for accurate neuro-oncological assessment, as each contrast provides complementary anatomical and pathological information. However, acquiring all modalities (e.g., T1c, T1n, T2w, T2f) for every patient is often impractical due to prolonged scan times, cost, and patient discomfort, potentially limiting comprehensive tumour evaluation. We propose 3D-MC-SAGAN (3D Multi-Contrast Self-Attention Generative Adversarial Network), a unified 3D multi-contrast synthesis framework that generates high-fidelity missing modalities from a single T2w input while explicitly preserving tumour characteristics. The model employs a multi-scale 3D encoder--decoder generator with residual connections and a novel Memory-Bounded Hybrid Attention (MBHA) block to capture long-range dependencies efficiently, and is trained with a WGAN-GP critic and an auxiliary domain classification head to produce T2f, T1n, and T1c volumes within a unified network. To ensure anatomical and pathological fidelity, we incorporate a frozen 3D U-Net-based segmentation network that enforces a tumour-consistency constraint during training. A composite objective combining adversarial, reconstruction, perceptual, structural similarity, contrast-classification, and segmentation-guided losses further promotes both global realism and tumour-preserving structure. Extensive experiments on 3D brain MRI datasets demonstrate that 3D-MC-SAGAN achieves state-of-the-art quantitative performance and produces visually coherent, anatomically plausible contrasts with improved distributional realism. Importantly, the proposed method maintains tumour segmentation accuracy comparable to that achieved using fully acquired multi-modal inputs, highlighting its potential to reduce acquisition burden while preserving clinically meaningful information.

URL PDF HTML ☆

赞 0 踩 0

2603.18829 2026-05-04 cs.CR cs.AI

Agent Control Protocol: Admission Control for Agent Actions

Marcelo Fernandez

Comments 95 pages. Paper 1 of 6 in the Agent Governance Series (Papers 0-6). Zenodo: https://doi.org/10.5281/zenodo.19672575. Companion: P0 (arXiv:2604.17511), P2/IML (arXiv:2604.17517), P3/4 (zenodo.19708496), P5/RAM (arXiv:2604.22898), P6 (zenodo.19699460). Spec: https://github.com/chelof100/acp-framework-en. v1.30: series updated to 6 papers, P3/4 consolidated, P6 added

详情

DOI: 10.5281/zenodo.19672575

英文摘要

Autonomous agents can produce harmful behavioral patterns from individually valid requests -- a threat class per-request policy evaluation cannot address, because stateless engines evaluate each request in isolation. We present ACP, a temporal admission control protocol enforcing behavioral properties over execution traces via static risk scoring combined with stateful signals (anomaly accumulation, cooldown) through a LedgerQuerier abstraction. ACP blocks execution based on deterministic, history-aware risk scoring -- not anomaly detection. Under a 500-request workload where every request is individually valid (RS=35), a stateless engine approves all 500; ACP limits autonomous execution to 2 out of 500 (0.4%), escalating after 3 actions and denying after 11. We identify a state-mixing vulnerability in ACP-RISK-2.0 (cross-context false denials) and introduce ACP-RISK-3.0, scoping anomaly signals to PatternKey(agentID, capability, resource). Decision evaluation: 739-832 ns (p50); throughput 1,720,000 req/s. Safety and liveness model-checked via TLA+ (11 invariants + 4 temporal properties, 0 violations) across 4,294,930,695 distinct states. We formalize deviation collapse -- enforcement active but never exercised due to upstream constraints -- and introduce Boundary Activation Rate (BAR) as its detection mechanism. An adversary suppressing BAR to 0.00 is detected via DeltaBAR before collapse (BAR_C=1.00). N coordinated agents accumulate risk independently; coordination window CW_appr=2N with zero deviation: activity scales linearly, preventing superlinear amplification. ACP is Paper 1 of a 6-paper Agent Governance Series: P0 -- atomic decision boundaries; P2 -- behavioral drift detection (IML); P3/4 -- governance structure, fair allocation, and irreducibility; P5 -- runtime execution validity (RAM, arXiv:2604.22898); P6 -- operationalization of RAM.

URL PDF HTML ☆

赞 0 踩 0

2603.18413 2026-05-04 stat.ML cs.LG

Statistical Testing Framework for Clustering Pipelines by Selective Inference

Yugo Miyata, Tomohiro Shiraishi, Shuichi Nishino, Ichiro Takeuchi

Comments 59 pages, 11 figures