arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.08708 2026-04-13 cs.LG cs.AI cs.CL

Every Response Counts: Quantifying Uncertainty of LLM-based Multi-Agent Systems through Tensor Decomposition

Tiejin Chen, Huaiyuan Yao, Jia Chen, Evangelos E. Papalexakis, Hua Wei

Comments Accept to ACL 26

详情

英文摘要

While Large Language Model-based Multi-Agent Systems (MAS) consistently outperform single-agent systems on complex tasks, their intricate interactions introduce critical reliability challenges arising from communication dynamics and role dependencies. Existing Uncertainty Quantification methods, typically designed for single-turn outputs, fail to address the unique complexities of the MAS. Specifically, these methods struggle with three distinct challenges: the cascading uncertainty in multi-step reasoning, the variability of inter-agent communication paths, and the diversity of communication topologies. To bridge this gap, we introduce MATU, a novel framework that quantifies uncertainty through tensor decomposition. MATU moves beyond analyzing final text outputs by representing entire reasoning trajectories as embedding matrices and organizing multiple execution runs into a higher-order tensor. By applying tensor decomposition, we disentangle and quantify distinct sources of uncertainty, offering a comprehensive reliability measure that is generalizable across different agent structures. We provide comprehensive experiments to show that MATU effectively estimates holistic and robust uncertainty across diverse tasks and communication topologies.

URL PDF HTML ☆

赞 0 踩 0

2604.08707 2026-04-13 cs.AI cs.CC

Parameterized Complexity Of Representing Models Of MSO Formulas

Petr Kučera, Petr Martinek

2604.08706 2026-04-13 cs.LG

Efficient RL Training for LLMs with Experience Replay

Charles Arnal, Vivien Cabannes, Taco Cohen, Julia Kempe, Remi Munos

2604.08704 2026-04-13 cs.CV

RS-OVC: Open-Vocabulary Counting for Remote-Sensing Data

Tamir Shor, George Leifman, Genady Beryozkin

2604.08698 2026-04-13 cs.LG q-bio.GN

EvoLen: Evolution-Guided Tokenization for DNA Language Model

Nan Huang, Xiaoxiao Zhou, Junxia Cui, Mario Tapia-Pacheco, Tiffany Amariuta, Yang Li, Jingbo Shang

2604.08694 2026-04-13 cs.CV cs.LG

EfficientSign: An Attention-Enhanced Lightweight Architecture for Indian Sign Language Recognition

Rishabh Gupta, Shravya R. Nalla

Comments Submitted to IEEE Transactions on Human-Machine Systems

2604.08690 2026-04-13 cs.LG cs.CL

Skip-Connected Policy Optimization for Implicit Advantage

Fengwei Teng, Jinyi Bai, Xinhao Yao, Demi Ruohan Wang, Jiahao Zhao, Zhijiang Guo

2604.08685 2026-04-13 cs.AI

RAMP: Hybrid DRL for Online Learning of Numeric Action Models

Yarin Benyamin, Argaman Mordoch, Shahaf S. Shperberg, Roni Stern

Comments Accepted as a workshop paper at the Adaptive and Learning Agents (ALA) Workshop at AAMAS 2026

2604.08664 2026-04-13 cs.RO

Generative Simulation for Policy Learning in Physical Human-Robot Interaction

Junxiang Wang, Xinwen Xu, Tiancheng Wu, Julian Millan, Nir Pechuk, Zackory Erickson

Comments 9 pages, 3 figures, 2 tables

2604.08649 2026-04-13 cs.LG cs.CE cs.CL cs.IR q-fin.CP

PRAGMA: Revolut Foundation Model

Maxim Ostroukhov, Ruslan Mikhailov, Vladimir Iashin, Artem Sokolov, Andrei Akshonov, Vitaly Protasov, Dmitrii Beloborodov, Vince Mullin, Roman Yokunda Enzmann, Georgios Kolovos, Jason Renders, Pavel Nesterov, Anton Repushko

2604.08646 2026-04-13 cs.CV

InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptation

Zhefan Rao, Bin Zou, Haoxuan Che, Xuanhua He, Chong Hou Choi, Yanheng Li, Rui Liu, Qifeng Chen

Comments 13 pages, 10 figures

2604.08645 2026-04-13 cs.CV cs.AI cs.LG cs.RO

3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding

Makanjuola Ogunleye, Eman Abdelrahman, Ismini Lourentzou

Comments 8 pages, 6 figures, Accepted at IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

2604.08644 2026-04-13 cs.CL

EXAONE 4.5 Technical Report

Eunbi Choi, Kibong Choi, Sehyun Chun, Seokhee Hong, Junwon Hwang, Hyojin Jeon, Ahra Jo, Hyunjik Jo, Yeonsik Jo, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Yongil Kim, Changhun Lee, Haeju Lee, Jinsik Lee, Kyungmin Lee, Sangha Park, Kwangrok Ryoo, Minju Seo, Sejong Yang, Heuiyeen Yeen, Hwan Chang, Stanley Jungkyu Choi, Yejin Choi, Kyubeen Han, Joonwon Jang, Kijeong Jeon, Geunyeong Jeong, Gerrard Jeongwon Jo, Jiyeon Jung, Daeseong Kim, Dohoon Kim, Dohyun Kim, Hyunseo Kim, Minu Kim, Myoungshin Kim, Youchul Kim, Byungoh Ko, Christopher Lee, Edward Hwayoung Lee, Honglak Lee, Jiyoung Lee, Sangeun Lee, Seungwon Lim, Woohyung Lim, Jueun Mun, Jaewoo Park, Jimin Park, Jinho Park, Yongmin Park, Wooseok Seo, Yongwoo Song, Sihyuk Yi, Kyungjae Yoo, Sangyeon Yoon

2604.08643 2026-04-13 cs.LG cs.CY cs.GT cs.SI

Creator Incentives in Recommender Systems: A Cooperative Game-Theoretic Approach for Stable and Fair Collaboration in Multi-Agent Bandits

Ramakrishnan Krishnamurthy, Arpit Agarwal, Lakshminarayanan Subramanian, Maximilian Nickel

Comments Accepted in AISTATS 2026 as an Oral Presentation

2604.08641 2026-04-13 cs.CV cs.AI cs.HC cs.MM

On Semiotic-Grounded Interpretive Evaluation of Generative Art

Ruixiang Jiang, Changwen Chen

2604.08639 2026-04-13 cs.LG cs.AI cs.CV

VOLTA: The Surprising Ineffectiveness of Auxiliary Losses for Calibrated Deep Learning

Rahul D Ray, Utkarsh Srivastava

2604.08627 2026-04-13 cs.LG cs.AI

Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation

Yongchan Chun, Chanhee Park, Jeongho Yoon, Jaehyung Seo, Heuiseok Lim

Comments Accepted to CVPR 2026 (Highlight)

2604.08624 2026-04-13 cs.LG cs.AI

Practical Bayesian Inference for Speech SNNs: Uncertainty and Loss-Landscape Smoothing

Yesmine Abdennadher, Philip N. Garner

2604.08621 2026-04-13 cs.AI cs.HC cs.LG

Sustained Impact of Agentic Personalisation in Marketing: A Longitudinal Case Study

Olivier Jeunen, Eleanor Hanna, Schaun Wheeler

Comments To appear in the 34th ACM International Conference on User Modeling, Adaptation and Personalization (UMAP '26) Industry Track

2604.08617 2026-04-13 cs.LG cs.AI cs.CV

From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity

Zhuang Qi, Ying-Peng Tang, Lei Meng, Guoqing Chao, Lei Wu, Han Yu, Xiangxu Meng

Comments CVPR 2026 accepted

2604.08615 2026-04-13 cs.CV cs.AI

MARINER: A 3E-Driven Benchmark for Fine-Grained Perception and Complex Reasoning in Open-Water Environments

Xingming Liao, Ning Chen, Muying Shu, Yunpeng Yin, Peijian Zeng, Zhuowei Wang, Nankai Lin, Lianglun Cheng

2604.08613 2026-04-13 cs.CV

ViSAGE @ NTIRE 2026 Challenge on Video Saliency Prediction

Kun Wang, Yupeng Hu, Zhiran Li, Hao Liu, Qianlong Xiang, Liqiang Nie

2604.08610 2026-04-13 cs.CV

A Semi-Automated Framework for 3D Reconstruction of Medieval Manuscript Miniatures

Riccardo Pallotto, Pierluigi Feliciati, Tiberio Uricchio

2604.08609 2026-04-13 cs.CV cs.AI cs.LG

Detection of Hate and Threat in Digital Forensics: A Case-Driven Multimodal Approach

Ponkoj Chandra Shill

Comments 8 pages, 4 figures

2604.08607 2026-04-13 cs.LG cs.AI cs.CR cs.IT math.IT

Joint Interference Detection and Identification via Adversarial Multi-task Learning

H. Xu, B. He, S. Wang

Comments 13 pages, 13 figures. Submitted to IEEE Transactions on Cognitive Communications and Networking

2604.08603 2026-04-13 cs.AI cs.CL

From Business Events to Auditable Decisions: Ontology-Governed Graph Simulation for Enterprise AI

Hongyin Zhu, Jinming Liang, Mengjun Hou, Ruifan Tang, Xianbin Zhu, Jingyuan Yang, Yuanman Mao, Feng Wu

2604.08601 2026-04-13 cs.AI cs.LG

OpenKedge: Governing Agentic Mutation with Execution-Bound Safety and Evidence Chains

Jun He, Deying Yu

Comments 17 pages, 3 figures, 2 tables

2604.08595 2026-04-13 cs.CL cs.AI

Adaptive Rigor in AI System Evaluation using Temperature-Controlled Verdict Aggregation via Generalized Power Mean

Aleksandr Meshkov

2604.08592 2026-04-13 cs.LG nlin.CD

Reservoir observer enhanced with residual calibration and attention mechanism

Yichen Liu, Wei Xiao, Tianguang Chu

2604.08591 2026-04-13 cs.LG cs.AI

From Dispersion to Attraction: Spectral Dynamics of Hallucination Across Whisper Model Scales

Ivan Viakhirev, Kirill Borodin, Grach Mkrtchian

Comments This paper has been submitted to Interspeech 2026 for review