arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.05386 2026-05-08 cs.AI cs.CL cs.LG

BALAR : A Bayesian Agentic Loop for Active Reasoning

Aymen Echarghaoui, Dongxia Wu, Emily B. Fox

详情

英文摘要

Large language models increasingly operate in interactive settings where solving a task requires multiple rounds of information exchange with a user. However, most current systems treat dialogue reactively and lack a principled mechanism to reason about what information is missing and which question should be asked next. We propose BALAR (Bayesian Agentic Loop for Active Reasoning), a task-agnostic outer-loop algorithm that requires no fine-tuning and enables structured multi-turn interaction between an LLM agent and a user. BALAR maintains a structured belief over latent states, selects clarifying questions by maximizing expected mutual information, and dynamically expands its state representation when the current one proves insufficient. We evaluate BALAR on three diverse benchmarks: AR-Bench-DC (detective cases), AR-Bench-SP (thinking puzzles), and iCraft-MD (clinical diagnosis). BALAR significantly outperforms all baselines across all three benchmarks, with $14.6\%$ higher accuracy on AR-Bench-DC, $38.5\%$ on AR-Bench-SP, and $30.5\%$ on iCraft-MD.

URL PDF HTML ☆

赞 0 踩 0

2605.05379 2026-05-08 cs.AI cs.CC cs.ET

Partial Evidence Bench: Benchmarking Authorization-Limited Evidence in Agentic Systems

Krti Tallam

Comments Benchmark paper with deterministic synthetic corpora, 14 pages, 6 tables

2605.05370 2026-05-08 cs.LG cs.AI

SPADE: Faster Drug Discovery by Learning from Sparse Data

Rahul Nandakumar, Ben Fauber, Deepayan Chakrabarti

2605.05365 2026-05-08 cs.AI cs.CL

ZAYA1-8B Technical Report

Robert Washbourne, Rishi Iyer, Tomas Figliolia, Henry Zheng, Ryan Lorig-Roach, Sungyeon Yang, Pritish Yuvraj, Quentin Anthony, Yury Tokpanov, Xiao Yang, Ganesh Nanduru, Stephen Ebert, Praneeth Medepalli, Skyler Szot, Srivatsan Rajagopal, Alex Ong, Bhavana Mehta, Beren Millidge

2605.05360 2026-05-08 cs.LG cs.AI

COPYCOP: Ownership Verification for Graph Neural Networks

Rahul Nandakumar, Deepayan Chakrabarti

2605.05358 2026-05-08 cs.LG cs.CV

Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks

Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho

Comments Accepted for publication at IEEE ICIP 2026

2605.05354 2026-05-08 cs.LG

A Multi-Head Attention Approach for SLA Compliance Monitoring in Data Centers

Omanshu Thapliyal

Comments 6 pages, 9 figures, 46th IEEE International Conference on Distributed Computing Systems

2605.05353 2026-05-08 cs.CL cs.AI

Counterargument for Critical Thinking as Judged by AI and Humans

Tosin Adewumi, Marcus Liwicki, Foteini Simistira Liwicki, Lama Alkhaled, Hamam Mokayed, Esra Sümer-Arpak

Comments 9 pages

2605.05351 2026-05-08 cs.CV

egenioussBench: A New Dataset for Geospatial Visual Localisation

Phillipp Fanta-Jende, Francesco Vultaggio, Alexander Kern, Yasmin Loeper, Markus Gerke

2605.05344 2026-05-08 cs.CV cs.AI cs.IR

Open-SAT: LLM-Guided Query Embedding Refinement for Open-Vocabulary Object Retrieval in Satellite Imagery

Md Adnan Arefeen, Biplob Debnath, Ravi K. Rajendran, Murugan Sankaradas, Srimat T. Chakradhar

2605.05341 2026-05-08 cs.LG cs.AI math.OC stat.ML

Feature Starvation as Geometric Instability in Sparse Autoencoders

Faris Chaudhry, Keisuke Yano, Anthea Monod

Comments 26 pages, 3 figures, 5 tables

2605.05339 2026-05-08 cs.RO math.OC

Passive Fault Tolerance through Tension-to-Thrust Feed-Forward: Hybrid Input-to-State Stability for Decentralized Multi-UAV Slung-Load Transport under Abrupt Cable Severance

Hadi Hajieghrary, Paul Schmitt

Comments Submitted for review at IEEE Transactions on Control Systems Technology For the paper and simulation code see: https://github.com/Hadi-Hajieghrary/Tether_Grace.git

2605.05338 2026-05-08 cs.RO

Track A*: Fast Visibility-Aware Trajectory Planning for Active Target Tracking

Hanxuan Chen, Kangli Wang, Ji Pei

2605.05331 2026-05-08 cs.CV cs.AI cs.LG

ViTok-v2: Scaling Native Resolution Auto-Encoders to 5 Billion Parameters

Philippe Hansen-Estruch, Jiahui Chen, Vivek Ramanujan, Orr Zohar, Yan Ping, Animesh Sinha, Markos Georgopoulos, Edgar Schoenfeld, Ji Hou, Felix Juefei-Xu, Sriram Vishwanath, Ali Thabet

2605.05330 2026-05-08 cs.LG cs.AI cs.DM cs.NE

Graph Normalization: Fast Binarizing Dynamics for Differentiable MWIS

Laurent Guigues

详情

英文摘要

We introduce Graph Normalization (GN), a principled dynamical system on graphs that serves as a differentiable approximation engine for the NP-hard Maximum Weight Independent Set (MWIS) problem. MWIS encompasses many combinatorial challenges, including optimal assignment, scheduling, set packing, and MAP inference in discrete Markov Random Fields. Unlike Belief Propagation, we prove GN always converges to a binary indicator of a Maximum Independent Set. GN realizes a fast quasi-Newton descent through an exact Majorization-Minimization step, systematically improving the MWIS relaxed primal objective. We establish an equivalence between GN and the Replicator Dynamics of a nonlinear evolutionary game, where vertices compete for inclusion in an independent set. While a non-potential game, the GN game follows Fisher's Fundamental Theorem of Natural Selection, where the average fitness equals the MWIS primal objective and strictly increases. This connection leads to a weighted extension of the Motzkin-Straus theorem, showing MISes are in bijection with the local minima of a quadratic form over a tilted simplex. For the Assignment Problem, GN acts as a variant of the Sinkhorn algorithm that naturally converges to a hard assignment while generalizing to arbitrary constraint graphs. We demonstrate GN's performance as a fast binarization engine for the state-of-the-art Bregman-Sinkhorn relaxed MWIS solver. On real-world benchmarks with up to 1M edges, GN identifies solutions within 1% of the best known results in seconds on a CPU. GN opens new avenues for deep learning architectures requiring differentiable, "hard" decisions under constraints, with applications in structured sparse attention, dynamic network pruning, and Mixture-of-Experts. Beyond core AI, the GN framework enables end-to-end learning of constrained optimization in computer vision, computational biology, and resource allocation.

URL PDF HTML ☆

赞 0 踩 0

2605.05329 2026-05-08 cs.AI cs.LG

Understanding Annotator Safety Policy with Interpretability

Alex Oesterling, Donghao Ren, Yannick Assogba, Dominik Moritz, Sunnie S. Y. Kim, Leon Gatys, Fred Hohman

Comments 38 pages, 13 figures, ACM FAccT 2026

详情

DOI: 10.1145/3805689.3806472

英文摘要

Safety policies define what constitutes safe and unsafe AI outputs, guiding data annotation and model development. However, annotation disagreement is pervasive and can stem from multiple sources such as operational failures (annotators misunderstand or misexecute the task), policy ambiguity (policy wording leaves room for interpretation), or value pluralism (different annotators hold different perspectives on safety). Distinguishing these sources matters. For example, operational failures call for quality control, ambiguity calls for policy clarification, and pluralism calls for deliberation about incorporating diverse perspectives. Yet understanding why annotators disagree is difficult. Directly asking annotators for their reasoning is costly, substantially increasing annotation burden, and can be unreliable for both human and LLM annotators as self-reported reasoning often fails to reflect actual decision processes. We introduce Annotator Policy Models (APMs), interpretable models that learn annotators' internal safety policies from labeling behavior alone, making annotator reasoning visible and comparable without additional annotation effort. We validate that APMs accurately model annotator safety policy (>80% accuracy), faithfully predict responses to counterfactual edits, and recover known policy differences in controlled settings. Applying APMs to LLM and human annotations, we demonstrate two core applications: (1) surfacing policy ambiguity by revealing how annotators interpret safety instructions differently, and (2) surfacing value pluralism by uncovering systematic differences in safety priorities across demographic groups. Together, these capabilities support more targeted, transparent, and inclusive safety policy design.

URL PDF HTML ☆

赞 0 踩 0

2605.05328 2026-05-08 cs.CV cs.RO

Query2Uncertainty: Robust Uncertainty Quantification and Calibration for 3D Object Detection under Distribution Shift

Till Beemelmanns, Alexey Nekrasov, Stefan Vilceanu, Jonas Steinhaus, Timo Woopen, Bastian Leibe, Lutz Eckstein

Comments Accepted for publication at CVPR 2026

2605.05285 2026-05-08 cs.LG

Attribution-Guided Continual Learning for Large Language Models

Yazheng Liu, Yuxuan Wan, Rui Xu, Xi Zhang, Sihong Xie, Hui Xiong

2605.05283 2026-05-08 cs.CV

Seeing What Shouldn't Be There: Counterfactual GANs for Medical Image Attribution

Shakeeb Murtaza

2605.05280 2026-05-08 cs.LG

Forecasting Green Skill Demand in the Automotive Industry: Evidence from Online Job Postings

Sabur Butt, Joshua N. Arrazola E., Hector G. Ceballos, Patricia Caratozzolo

2605.05278 2026-05-08 cs.LG cs.IT math.IT

Expert Routing for Communication-Efficient MoE via Finite Expert Banks

Mohammad Reza Deylam Salehi, Ali Khalesi

2605.05245 2026-05-08 cs.CL cs.IR

AdaGATE: Adaptive Gap-Aware Token-Efficient Evidence Assembly for Multi-Hop Retrieval-Augmented Generation

Yilin Guo, Yinshan Wang, Yixuan Wang

Comments 10 pages, 4 figures, 2 tables

2605.05241 2026-05-08 cs.RO cs.LG

DexSim2Real: Foundation Model-Guided Sim-to-Real Transfer for Generalizable Dexterous Manipulation

Zijian Zeng, Fei Ding, Huiming Yang, Xianwei Li, Yuhao Liao

Comments 13 pages, 2 figures, 5 tables

2605.05236 2026-05-08 cs.RO cs.AI

Topology-Driven Anti-Entanglement Control for Soft Robots

Haoyang Le, Shengxuan Wang, Mohan Chen, Shuo Feng

Comments 17 pages, 4 figures

2605.05228 2026-05-08 cs.LG cs.AI cs.NE

Evolutionary fine tuning of quantized convolution-based deep learning models

Marcin Pietroń

2605.05227 2026-05-08 cs.LG cs.AI

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods

Wanru Zhao, Yihong Chen, Yuzhi Tang, Wentao Ma, Shengchao Hu, Shell Xu Hu, Alex Iacob, Abhinav Mehrotra, Nicholas D. Lane

Comments ICLR 2026

2605.05224 2026-05-08 cs.LG cs.AI cs.CR

Channel-Level Semantic Perturbations: Unlearnable Examples for Diverse Training Paradigms

Bo Wang, Jia Ni, Mengnan Zhao, Zhan Qin, Kui Ren

2605.05223 2026-05-08 cs.LG cs.AI

Structural Instability of Feature Composition

Yunpeng Zhou

2605.05222 2026-05-08 cs.LG cs.AI

Adaptive Computation Depth via Learned Token Routing in Transformers

Ahmed Abdelmuniem Abdalla Mohammed

Comments 11 pages, 9 figures, 4 tables, https://github.com/AhmedHamadto/TSA

2605.05221 2026-05-08 cs.LG cs.CL

Data-Driven Variational Basis Learning Beyond Neural Networks: A Non-Neural Framework for Adaptive Basis Discovery

Andrew Kiruluta