arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.17104 2026-03-19 cs.SE cs.AI

When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents

Lu Yan, Xuan Chen, Xiangyu Zhang

详情

英文摘要

Current coding-agent benchmarks usually pro- vide the full task specification upfront. Real research coding often does not: the intended system is progressively disclosed through in- teraction, requiring the agent to track durable design commitments across a long session. We introduce a benchmark for this setting and study faithfulne Ss Loss U nder eM ergent s Pecification (SLUMP), defined as the reduc- tion in final implementation faithfulness un- der emergent specification relative to a single- shot specification control. The benchmark con- tains 20 recent ML papers (10 ICML 2025, 10 NeurIPS 2025), 371 atomic verifiable compo- nents, and interaction scripts of approximately 60 coding requests that progressively disclose the target design without revealing the paper itself. Final repositories are scored with a five-level component-faithfulness rubric and accompanied by an exposure audit to verify that scored components are recoverable from the visible interaction. Evaluated on Claude Code and Codex, the single-shot specification control achieves higher overall implementation fidelity on 16/20 and 14/20 papers, respectively. Structural integration degrades under emergent specification on both platforms, while seman- tic faithfulness loss is substantial on Claude Code and small on Codex. As a mitigation case study, we introduce ProjectGuard, an exter- nal project-state layer for specification tracking. On Claude Code, ProjectGuard recovers 90% of the faithfulness gap, increases fully faith- ful components from 118 to 181, and reduces severe failures from 72 to 49. These results identify specification tracking as a distinct eval- uation target for long-horizon coding agents.

URL PDF HTML ☆

赞 0 踩 0

2603.17100 2026-03-19 cs.CR cs.LG

An End-to-End Framework for Functionality-Embedded Provenance Graph Construction and Threat Interpretation

Kushankur Ghosh, Mehar Klair, Kian Kyars, Euijin Choo, Jörg Sander

Comments 21 pages, 7 figures

2603.17058 2026-03-19 cs.GT cs.MA cs.RO cs.SY eess.SY math.OC

Asymmetric Nash Seeking via Best Response Maps: Global Linear Convergence and Robustness to Inexact Reaction Models

Mahdis Rabbani, Navid Mojahed, Shima Nazari

Comments 6 Pages, 2 Figures, Preprint submitted to IEEE L-CSS and CDC 2026

2603.17057 2026-03-19 physics.flu-dyn cs.LG cs.NE math.OC

Optimization-Embedded Active Multi-Fidelity Surrogate Learning for Multi-Condition Airfoil Shape Optimization

Isaac Robledo, Alberto Vilariño, Arnau Miró, Oriol Lehmkuhl, Carlos Sanmiguel Vila, Rodrigo Castellanos

Comments 21 pages, 14 figures

2603.17025 2026-03-19 eess.AS cs.AI

Shared Representation Learning for Reference-Guided Targeted Sound Detection

Shubham Gupta, Adarsh Arigala, B. R. Dilleswari, Sri Rama Murty Kodukula

Comments Accepted to IEEE ICASSP 2026

2603.16980 2026-03-19 math.NA cs.LG cs.NA

Interpretable AI-Assisted Early Reliability Prediction for a Two-Parameter Parallel Root-Finding Scheme

Bruno Carpentieri, Andrei Velichko, Mudassir Shams, Paola Lecca

Comments 23 pages, 9 figures

2603.16976 2026-03-19 cs.MS cs.AI cs.LG

Implementation of tangent linear and adjoint models for neural networks based on a compiler library tool

Sa Xiao, Hao Jing, Honglu Sun, Haoyu Li

详情

英文摘要

This paper presents TorchNWP, a compilation library tool for the efficient coupling of artificial intelligence components and traditional numerical models. It aims to address the issues of poor cross-language compatibility, insufficient coupling flexibility, and low data transfer efficiency between operational numerical models developed in Fortran and Python-based deep learning frameworks. Based on LibTorch, it optimizes and designs a unified application-layer calling interface, converts deep learning models under the PyTorch framework into a static binary format, and provides C/C++ interfaces. Then, using hybrid Fortran/C/C++ programming, it enables the deployment of deep learning models within numerical models. Integrating TorchNWP into a numerical model only requires compiling it into a callable link library and linking it during the compilation and linking phase to generate the executable. On this basis, tangent linear and adjoint model based on neural networks are implemented at the C/C++ level, which can shield the internal structure of neural network models and simplify the construction process of four-dimensional variational data assimilation systems. Meanwhile, it supports deployment on heterogeneous platforms, is compatible with mainstream neural network models, and enables mapping of different parallel granularities and efficient parallel execution. Using this tool requires minimal code modifications to the original numerical model, thus reducing coupling costs. It can be efficiently integrated into numerical weather prediction models such as CMA-GFS and MCV, and has been applied to the coupling of deep learning-based physical parameterization schemes (e.g., radiation, non-orographic gravity wave drag) and the development of their tangent linear and adjoint models, significantly improving the accuracy and efficiency of numerical weather prediction.

URL PDF HTML ☆

赞 0 踩 0

2603.16975 2026-03-19 cs.SE cs.AI cs.CY cs.ET cs.HC

The State of Generative AI in Software Development: Insights from Literature and a Developer Survey

Vincent Gurgul, Robin Gubela, Stefan Lessmann

2603.16973 2026-03-19 quant-ph cs.LG

Hybrid Classical-Quantum Transfer Learning with Noisy Quantum Circuits

D. Martín-Pérez, F. Rodríguez-Díaz, D. Gutiérrez-Avilés, A. Troncoso, F. Martínez-Álvarez

2603.16972 2026-03-19 eess.AS cs.LG cs.SD

Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network

Protopopov Alexey

Comments 9 pages, 5 figures, 1 table

2603.16969 2026-03-19 cs.CR cs.AI cs.LG

DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns

Trung V. Phan, Tri Gia Nguyen, Thomas Bauschert

2603.16963 2026-03-19 q-bio.QM cs.CV

Topology-Guided Biomechanical Profiling: A White-Box Framework for Opportunistic Screening of Spinal Instability on Routine CT

Zanting Ye, Xuanbin Wu, Guoqing Zhong, Shengyuan Liu, Jiashuai Liu, Ge Song, Zhisong Wang, Jing Hao, Xiaolong Niu, Yefeng Zheng, Yu Zhang, Lijun Lu

Comments 11 pages, 3 tables, 2 figures

2603.16959 2026-03-19 cond-mat.mtrl-sci cs.AI

Machine intelligence supports the full chain of 2D dendrite synthesis

Wenqiang Huang, Susu Fang, Xuhang Gu, Shen'ao Xue, Huanhuan Xing, Junjie Jiang, Junying Zhang, Shen Zhou, Zheng Luo, Jin Zhang, Fangping Ouyang, Shanshan Wang

Comments 20 pages, 5 figures

2603.16950 2026-03-19 stat.ML cs.LG stat.ME

Kriging via variably scaled kernels

Gianluca Audone, Francesco Marchetti, Emma Perracchione, Milvia Rossini

2603.16948 2026-03-19 physics.geo-ph cs.LG

A Framework for Modeling Liquefaction-Induced Road Disruptions After Earthquakes: Implications for Emergency Response and Access in the Cascadia Region of North America

Morgan D. Sanger, Olyvia B. Smith, Brett W. Maurer, Liam Wotherspoon, Marc O. Eberhard, Jeffrey W. Berman

详情

英文摘要

Large earthquakes along the Cascadia Subduction Zone (CSZ) are expected to trigger widespread soil liquefaction that could disrupt transportation systems across the U.S. Pacific Northwest. However, past regional assessments have relied on simple geologic screening methods and binomial shaking thresholds that are only loosely informed by liquefaction science. This study introduces a mechanics-informed, data-driven framework for estimating liquefaction-induced road closures and service reductions, and the framework is applied to a magnitude-9 CSZ earthquake. Predicted liquefaction severity is translated into segment-level probabilities of closure and reduced service using empirically derived fragility relationships. These probabilities are mapped at 90-m resolution and propagated through the National Highway System using a spatially correlated Monte Carlo simulation to estimate link-level disruption. Results show that impacts are concentrated in low-lying coastal zones, river valleys, and urban waterfronts, with major disruptions expected along critical routes including U.S. Route 101. Local mobility is further examined in Pacific and Grays Harbor counties, Washington, where limited network redundancy, strong shaking, and high liquefaction susceptibility lead to elevated probabilities of isolation and loss of hospital access. Socioeconomic analysis reveals modest but statistically significant associations between road impacts and demographic indicators, suggesting that liquefaction impacts may compound with existing social vulnerabilities. While not a substitute for site-specific analysis, the results provide a regional baseline for emergency planning, risk communication, and prioritization of more advanced geotechnical sampling and analysis. Moreover, the methodology proposed here is not specific to the CSZ, but rather, could be applied to analogous studies of road impacts elsewhere.

URL PDF HTML ☆

赞 0 踩 0

2603.16946 2026-03-19 physics.data-an cs.AI

Automatic Termination Strategy of Inelastic Neutron-scattering Measurement Using Bayesian Optimization for Bin-width Selection

Kensuke Muto, Hirotaka Sakamoto, Kenji Nagata, Taka-hisa Arima, Masato Okada

Comments 14 pages, 6 figures; under review at Journal of the Physical Society of Japan (JPSJ)

2603.16942 2026-03-19 eess.IV cs.AI cs.CV q-bio.QM

UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis

Kwanyoung Kim, Jaa-Yeon Lee, Youngjun Ko, GunWoo Lee, Jong Chul Ye

Comments 12pages, 7 figures, 6 tables. arXiv admin note: text overlap with arXiv:2403.06275

2603.16940 2026-03-19 eess.IV cs.AI cs.CV

On the Degrees of Freedom of Gridded Control Points in Learning-Based Medical Image Registration

Wen Yan, Qianye Yang, Yipei Wang, Shonit Punwani, Mark Emberton, Vasilis Stavrinides, Yipeng Hu, Dean Barratt

Comments 27 pages; 8 figures

2603.16938 2026-03-19 cs.CR cs.AI cs.CY

Cryptographic Runtime Governance for Autonomous AI Systems: The Aegis Architecture for Verifiable Policy Enforcement

Adam Massimo Mazzocchetti

详情

DOI: 10.5281/zenodo.19027190

英文摘要

Contemporary AI governance frameworks rely heavily on post hoc oversight, policy guidance, and behavioral alignment techniques, yet these mechanisms become fragile as systems gain autonomy, speed, and operational opacity. This paper presents Aegis, a runtime governance architecture for autonomous AI systems that treats policy and legal constraints as execution conditions rather than advisory principles. Aegis binds each governed agent to a cryptographically sealed Immutable Ethics Policy Layer (IEPL) at system genesis and enforces external emissions through an Ethics Verification Agent (EVA), an Enforcement Kernel Module (EKM), and an Immutable Logging Kernel (ILK). Amendments to the governing policy layer require quorum approval and redeclaration of the system trust root; verified violations trigger autonomous shutdown and generation of auditable proof artifacts. We evaluate the architecture within the Civitas runtime using three operational measures: proof verification latency under tamper conditions, publication overhead, and alignment retention performance relative to an ungoverned baseline. In controlled trials, Aegis demonstrates median proof verification latency of 238 ms, median publication overhead of approximately 9.4 ms, and higher alignment retention than the baseline condition across matched tasks. We argue that these results support a shift in AI governance from discretionary oversight toward verifiable runtime constraint. Rather than claiming to resolve machine ethics in the abstract, the proposed architecture seeks to show that policy violating behavior can be rendered operationally non executable within a controlled runtime governance framework. The paper concludes by discussing methodological limits, evidentiary implications, and the role of proof oriented governance in high assurance AI deployment.

URL PDF HTML ☆

赞 0 踩 0

2603.16928 2026-03-19 cs.CR cs.LG

Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback

Thomas Jiralerspong, Flemming Kondrup, Yoshua Bengio

2603.16925 2026-03-19 cond-mat.soft cond-mat.mtrl-sci cs.LG

Gaussian Process Regression-based Knowledge Distillation Framework for Simultaneous Prediction of Physical and Mechanical Properties of Epoxy Polymers

Sindu B. S., Jan Hamaekers

2603.16924 2026-03-19 eess.AS cs.AI cs.CL

SimulU: Training-free Policy for Long-form Simultaneous Speech-to-Speech Translation

Amirbek Djanibekov, Luisa Bentivogli, Matteo Negri, Sara Papi

2603.16923 2026-03-19 eess.AS cs.SD

Beyond Deep Learning: Speech Segmentation and Phone Classification with Neural Assemblies

Trevor Adelson, Vidhyasaharan Sethu, Ting Dang

Comments Submitted to Interspeech 2026. 9 Pages

2603.16922 2026-03-19 eess.AS cs.SD

Learnable Pulse Accumulation for On-Device Speech Recognition: How Much Attention Do You Need?

Yakov Pyotr Shkolnikov

2603.16920 2026-03-19 eess.AS cs.SD

Synthetic Data Domain Adaptation for ASR via LLM-based Text and Phonetic Respelling Augmentation

Natsuo Yamashita, Koichi Nagatsuka, Hiroaki Kokubo, Kota Dohi, Tuan Vu Ho

Comments accepted by ICASSP 2026

2603.16918 2026-03-19 cs.HC cs.AI

Privacy and Safety Experiences and Concerns of U.S. Women Using Generative AI for Seeking Sexual and Reproductive Health Information

Ina Kaleva, Xiao Zhan, Ruba Abu-Salma, Jose Such

Comments 21 pages, 2 tables, CHI conference on Human Factors in Computing Systems

2603.16910 2026-03-19 cs.MA cs.AI physics.soc-ph

TerraLingua: Emergence and Analysis of Open-endedness in LLM Ecologies

Giuseppe Paolo, Jamieson Warner, Hormoz Shahrzad, Babak Hodjat, Risto Miikkulainen, Elliot Meyerson

2603.16904 2026-03-19 q-fin.PM cs.AI

Quantum-Assisted Optimal Rebalancing with Uncorrelated Asset Selection for Algorithmic Trading Walk-Forward QUBO Scheduling via QAOA

Abraham Itzhak Weinberg

2603.16900 2026-03-19 physics.soc-ph cs.AI cs.HC

Social physics in the age of artificial intelligence

The Anh Han, Joel Z. Leibo, Tom Lenaerts, Iyad Rahwan, Fernando Santos, Matjaž Perc, Valerio Capraro

2603.16897 2026-03-19 eess.SP cs.CL cs.HC cs.LG q-bio.NC

EEG-Based Brain-LLM Interface for Human Preference Aligned Generation

Junzi Zhang, Jianing Shen, Weijie Tu, Yi Zhang, Hailin Zhang, Tom Gedeon, Bin Jiang, Yue Yao

Comments 15 pages, 9 figures