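arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and more.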

2603.25024 2026-03-27 stat.ML cs.LG

Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

Chenxu Yu, Wenqi Fang

Abstract

As a representative continuous-depth neural network approach, stochastic differential equation (SDE)-based Bayesian neural networks (BNNs) have attracted considerable attention due to their solid theoretical foundations and strong potential for real-world applications. However, their reliance on numerical SDE solvers inevitably incurs a large number of function evaluations (NFEs), resulting in high computational cost and occasional convergence instability. To address these challenges, we propose a Nesterov-accelerated gradient (NAG) enhanced SDE-BNN model. By integrating NAG into the SDE-BNN framework along with an NFE-dependent residual skip connection, our method accelerates convergence and substantially reduces NFEs during both training and testing. Extensive empirical results show that our model consistently outperforms conventional SDE-BNNs across various tasks, including image classification and sequence modeling, achieving lower NFEs and improved predictive accuracy.

2603.24986 2026-03-27 cs.HC cs.AI

Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

Ray-Yuan Chung, Xuhai Xu, Ari Pollack

Comments: Accepted in CHI '26 Workshop on Human-Agent Collaboration

2603.24974 2026-03-27 math.OC cs.LG stat.ML

The Value of Information in Resource-Constrained Pricing

Ruicheng Ao, Jiashuo Jiang, David Simchi-Levi

Comments: Extended version of the NeurIPS 2025 paper (arXiv:2501.14155). This version adds phase transition, surrogate-assisted variance reduction under model misspecification, and numerical experiments

2603.24968 2026-03-27 eess.IV cs.AI

Subject-Specific Low-Field MRI Synthesis via a Neural Operator

Ziqi Gao, Nicha Dvornek, Xiaoran Zhang, Gigi Galiana, Hemant Tagare, Todd Constable

Comments: 11 pages, 2 figures, 2 tables

2603.24898 2026-03-27 cs.CR cs.AI cs.NI

Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Clinical Intelligence

Vasu Srinivasan, Dhriti Vasu

Comments: 31 pages

2603.24891 2026-03-27 cs.AR cs.AI

Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware

Ilkin Aliyev, Jesus Lopez, Tosiron Adegbija

Journal ref: IEEE International Symposium on Performance Analysis of Systems and Software 2026

Abstract

Spiking Neural Networks (SNNs) offer inherent advantages for low-power inference through sparse, event-driven computation. However, the theoretical energy benefits of SNNs are often decoupled from real hardware performance due to the opaque relationship between training-time choices and inference-time sparsity. While prior work has focused on weight pruning and compression, the role of training hyperparameters -- specifically surrogate gradient functions and neuron model configurations -- in shaping hardware-level activation sparsity remains underexplored. This paper presents a workload characterization study quantifying the sensitivity of hardware latency to SNN hyperparameters. We decouple the impact of surrogate gradient functions (e.g., Fast Sigmoid, Spike Rate Escape) and neuron models (LIF, Lapicque) on classification accuracy and inference efficiency across three event-based vision datasets: DVS128-Gesture, N-MNIST, and DVS-CIFAR10. Our analysis reveals that standard accuracy metrics are poor predictors of hardware efficiency. While Fast Sigmoid achieves the highest accuracy on DVS-CIFAR10, Spike Rate Escape reduces inference latency by up to 12.2% on DVS128-Gesture with minimal accuracy trade-offs. We also demonstrate that neuron model selection is as critical as parameter tuning; transitioning from LIF to Lapicque neurons yields up to 28% latency reduction. We validate on a custom cycle-accurate FPGA-based SNN instrumentation platform, showing that sparsity-aware hyperparameter selection can improve accuracy by 9.1% and latency by over 2x compared to baselines. These findings establish a methodology for predicting hardware behavior from training parameters. The RTL and reproducibility artifacts are at https://zenodo.org/records/18893738.

2603.24877 2026-03-27 cs.HC cs.AI

More Than "Means to an End": Supporting Reasoning with Transparently Designed AI Data Science Processes

Venkatesh Sivaraman, Patrick Vossler, Adam Perer, Julian Hong, Jean Feng

Comments: Accepted to Workshop on Tools for Thought at CHI'26: Understanding, Protecting, and Augmenting Human Cognition with Generative AI - From Vision to Implementation

2603.24857 2026-03-27 cs.CR cs.AI cs.CL cs.CV cs.LG

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

Zhenyi Wang, Siyu Luan

Comments: Published at Transactions on Machine Learning Research (TMLR)

2603.24849 2026-03-27 cs.HC cs.AI cs.CV cs.CY

Gaze patterns predict preference and confidence in pairwise AI image evaluation

Nikolas Papadopoulos, Shreenithi Navaneethan, Sheng Bai, Ankur Samanta, Paul Sajda

Comments: This paper has been accepted to ACM ETRA 2026

2603.24825 2026-03-27 cs.SE cs.AI

Learning From Developers: Towards Reliable Patch Validation at Scale for Linux

Chih-En Lin, Attreyee Mukherjee, Ajay Rawat, Ruqi Zhang, Pedro Fonseca

Comments: Submitted to OSDI'26

Abstract

Patch reviewing is critical for software development, especially in distributed open-source development, which highly depends on voluntary work, such as Linux. This paper studies the past 10 years of patch reviews of the Linux memory management subsystem to characterize the challenges involved in patch reviewing at scale. Our study reveals that the review process is still primarily reliant on human effort despite a wide-range of automatic checking tools. Although kernel developers strive to review all patch proposals, they struggle to keep up with the increasing volume of submissions and depend significantly on a few developers for these reviews. To help scale the patch review process, we introduce FLINT, a patch validation system framework that synthesizes insights from past discussions among developers and automatically analyzes patch proposals for compliance. FLINT employs a rule-based analysis informed by past discussions among developers and an LLM that does not require training or fine-tuning on new data, and can continuously improve with minimum human effort. FLINT uses a multi-stage approach to efficiently distill the essential information from past discussions. Later, when a patch proposal needs review, FLINT retrieves the relevant validation rules for validation and generates a reference-backed report that developers can easily interpret and validate. FLINT targets bugs that traditional tools find hard to detect, ranging from maintainability issues, e.g., design choices and naming conventions, to complex concurrency issues, e.g., deadlocks and data races. FLINT detected 2 new issues in Linux v6.18 development cycle and 7 issues in previous versions. FLINT achieves 21% and 14% of higher ground-truth coverage on concurrency bugs than the baseline with LLM only. Moreover, FLINT achieves a 35% false positive rate, which is lower than the baseline.

2603.24775 2026-03-27 cs.CR cs.AI

AIP: Agent Identity Protocol for Verifiable Delegation Across MCP and A2A

Sunil Prakash

Comments: 17 pages, 10 tables, 2 figures

2603.24774 2026-03-27 cs.SE cs.AI

From Untestable to Testable: Metamorphic Testing in the Age of LLMs

Valerio Terragni

Comments: Accepted for publication at IEEE Computer Magazine. This is the authors' accepted manuscript. Version of record available via DOI: 10.1109/MC.2026.3671990

2603.24752 2026-03-27 physics.chem-ph cs.LG

Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical-to-Quantum Transfer Learning

Vivienne Pelletier, Vedant Bhat, Daniel J. Rivera, Steven A. Wilson, Christopher L. Muhich

Comments: 19 pages, 7 figures

2603.24738 2026-03-27 cs.DC cs.AI cs.LG cs.MA

Decentralized Task Scheduling in Distributed Systems: A Deep Reinforcement Learning Approach

Daniel Benniah John

Comments: 12 pages, 8 figures. Under review. Code available at GitHub

2603.24704 2026-03-27 stat.ME cs.LG stat.AP stat.ML

Conformal Selective Prediction with General Risk Control

Tian Bai, Ying Jin

2603.24703 2026-03-27 cs.SE cs.RO cs.SY eess.SY

IndustriConnect: MCP Adapters and Mock-First Evaluation for AI-Assisted Industrial Operations

Melwin Xavier, Melveena Jolly, Vaisakh M A, Midhun Xavier

2603.24692 2026-03-27 cs.NE cs.AI

Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses

Wuque Cai, Hongze Sun, Quan Tang, Shifeng Mao, Zhenxing Wang, Jiayi He, Duo Chen, Dezhong Yao, Daqing Guo

2603.24634 2026-03-27 cs.NI cs.AI cs.LG

Dual-Graph Multi-Agent Reinforcement Learning for Handover Optimization

Matteo Salvatori, Filippo Vannella, Sebastian Macaluso, Stylianos E. Trevlakis, Carlos Segura Perales, José Suarez-Varela, Alexandros-Apostolos A. Boulogeorgos, Ioannis Arapakis

2603.24629 2026-03-27 cs.SE cs.AI cs.MA cs.SY eess.SY

Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models

Abdullah Bahamdan, Emma Pajak, John D. Hedengren, Antonio del Rio Chanona

Comments: 27 pages, 14 figures, 8 tables

2603.24618 2026-03-27 cs.AR cs.AI cs.LG

Causal AI For AMS Circuit Design: Interpretable Parameter Effects Analysis

Mohyeu Hussain, David Koblah, Reiner Dizon-Paradis, Domenic Forte

2603.24617 2026-03-27 cs.DS cs.LG math.OC

Multi-LLM Query Optimization

Arlen Dean, Zijin Zhang, Stefanus Jasin, Yuqing Liu

2603.24601 2026-03-27 eess.SP cs.AI cs.LG

FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition

Wandemberg Gibaut, Alexandre Osorio, Amparo Munoz, Sildolfo F. G. Neto, Fabio Grassiotto

Comments: Paper presented in July 2025 at the XVII Simpósio Brasileiro de Automação Inteligente (SBAI), São João del-Rei

2603.24599 2026-03-27 eess.SP cs.AI

A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications

Hetong Wang, Yashuai Cao, Tiejun Lv

Comments: 9 pages, 5 figures, accepted by IEEE Wireless Communications Magazine

2603.24598 2026-03-27 math.OC cs.LG cs.SY eess.SY

Response-Aware Risk-Constrained Control Barrier Function With Application to Vehicles

Qijun Liao, Jue Yang

Comments: 22 pages, 20 figures

2603.24595 2026-03-27 cs.PL cs.AI

Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

Mengting He, Shihao Xia, Haomin Jia, Wenfei Wu, Linhai Song

2603.24477 2026-03-27 cs.SE cs.LG

Composer 2 Technical Report

Cursor Research: Aaron Chan, Ahmed Shalaby, Alexander Wettig, Aman Sanger, Andrew Zhai, Anurag Ajay, Ashvin Nair, Charlie Snell, Chen Lu, Chen Shen, Emily Jia, Federico Cassano, Hanpeng Liu, Haoyu Chen, Henry Wildermuth, Jacob Jackson, Janet Li, Jediah Katz, Jiajun Yao, Joey Hejna, Josh Warner, Julius Vering, Kevin Frans, Lee Danilek, Less Wright, Lujing Cen, Luke Melas-Kyriazi, Michael Truell, Michiel de Jong, Naman Jain, Nate Schmidt, Nathan Wang, Niklas Muennighoff, Oleg Rybkin, Paul Loh, Phillip Kravtsov, Rishabh Yadav, Sahil Shah, Sam Kottler, Alexander M Rush, Shengtong Zhang, Shomil Jain, Sriram Sankar, Stefan Heule, Stuart H. Sul, Sualeh Asif, Victor Rong, Wanqi Zhu, William Lin, Yuchen Wu, Yuri Volkov, Yury Zemlyanskiy, Zack Holbrook, Zhiyuan Zhang

2603.21152 2026-03-27 physics.geo-ph cs.AI

TRACE: A Multi-Agent System for Autonomous Physical Reasoning for Seismology

Feng Liu, Jian Xu, Xin Cui, Xinghao Wang, Zijie Guo, Jiong Wang, S. Mostafa Mousavi, Xinyu Gu, Hao Chen, Ben Fei, Lihua Fang, Fenghua Ling, Zefeng Li, Lei Bai

Comments: 25 pages for main text and 164 pages for appendices

2603.20283 2026-03-27 cs.IR cs.LG

FastPFRec: A Fast Personalized Federated Recommendation with Secure Sharing

Zhenxing Yan, Jidong Yuan, Yongqi Sun, Haiyang Liu, Zhihui Gao

2603.18865 2026-03-27 eess.SY cs.LG cs.SY

RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Xiucheng Wang, Zixuan Guo, Nan Cheng

2603.11413 2026-03-27 cs.HC cs.AI

Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI

David Fraile Navarro, Farah Magrabi, Enrico Coiera

Comments: 12 pages