arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.09021 2026-04-13 cs.SD cs.AI

Noise-Aware In-Context Learning for Hallucination Mitigation in ALLMs

Qixuan Huang, Khalid Zaman, Masashi Unoki

详情

英文摘要

Auditory large language models (ALLMs) have demonstrated strong general capabilities in audio understanding and reasoning tasks. However, their reliability is still undermined by hallucination issues. Existing hallucination evaluation methods are formulated as binary classification tasks, which are insufficient to characterize the more complex hallucination patterns that arise in generative tasks. Moreover, current hallucination mitigation strategies rely on fine-tuning, resulting in high computational costs. To address the above limitations, we propose a plug-and-play Noise-Aware In-Context Learning (NAICL) method. Specifically, we construct a noise prior library, retrieve noise examples relevant to the input audio, and incorporate them as contextual priors, thereby guiding the model to reduce speculative associations when acoustic evidence is insufficient and to adopt a more conservative generation strategy. In addition, we establish a hallucination benchmark for audio caption tasks including the construction of the Clotho-1K multi-event benchmark dataset, the definition of four types of auditory hallucinations, and the introduction of metrics such as hallucination type distribution to support fine-grained analysis. Experimental results show that all evaluated ALLMs exhibit same hallucination behaviors. Moreover, the proposed NAICL method reduces the overall hallucination rate from 26.53% to 16.98%.

URL PDF HTML ☆

赞 0 踩 0

2604.09018 2026-04-13 cs.CV

Domain-generalizable Face Anti-Spoofing with Patch-based Multi-tasking and Artifact Pattern Conversion

Seungjin Jung, Yonghyun Jeong, Minha Kim, Jimin Min, Youngjoon Yoo, Jongwon Choi

Comments The published version is available at DOI: https://doi.org/10.1016/j.patcog.2026.113640

2604.09016 2026-04-13 cs.LG cs.AI

Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

Carlos Jimeno Miguel, Raul Orduna, Francesco Zola

2604.09009 2026-04-13 cs.CV

Robust by Design: A Continuous Monitoring and Data Integration Framework for Medical AI

Mohammad Daouk, Jan Ulrich Becker, Neeraja Kambham, Anthony Chang, Chandra Mohan, Hien Van Nguyen

Comments Accepted at IEEE ISBI 2026. Chandra Mohan and Hien Van Nguyen jointly supervised this work

2604.09008 2026-04-13 cs.CL cs.AI

Towards Linguistically-informed Representations for English as a Second or Foreign Language: Review, Construction and Application

Wenxi Li, Xihao Wang, Weiwei Sun

2604.09001 2026-04-13 cs.AI cs.LG cs.LO

Hypergraph Neural Networks Accelerate MUS Enumeration

Hiroya Ijima, Koichiro Yawata

2604.08990 2026-04-13 cs.CV

ActFER: Agentic Facial Expression Recognition via Active Tool-Augmented Visual Reasoning

Shifeng Liu, Zhengye Zhang, Sirui Zhao, Xinglong Mao, Zhehan Kan, Zhixiang Wei, Shiwei Wu, Chaoyou Fu, Tong Xu, Enhong Chen

Comments 10 pages, 7 figures

2604.08987 2026-04-13 cs.AI

PilotBench: A Benchmark for General Aviation Agents with Safety Constraints

Yalun Wu, Haotian Liu, Zhoujun Li, Boyang Wang

Comments Accepted at the 2026 IEEE International Joint Conference on Neural Networks (IJCNN 2026). 6 pages, 7 figures

2604.08986 2026-04-13 cs.CL cs.AI

PerMix-RLVR: Preserving Persona Expressivity under Verifiable-Reward Alignment

Jihwan Oh, Soowon Oh, Murad Aghazada, Minchan Jeong, Sungnyun Kim, Se-Young Yun

Comments Preprint

2604.08980 2026-04-13 cs.LG cs.AI

Neighbourhood Transformer: Switchable Attention for Monophily-Aware Graph Learning

Yi Luo, Xu Sun, Guangchun Luo, Aiguo Chen

2604.08977 2026-04-13 cs.CL

Testing the Assumptions of Active Learning for Translation Tasks with Few Samples

Lorenzo Jaime Yu Flores, Cesare Spinoso di-Piano, Ori Ernst, David Ifeoluwa Adelani, Jackie Chi Kit Cheung

2604.08976 2026-04-13 cs.CL

Quantisation Reshapes the Metacognitive Geometry of Language Models

Jon-Paul Cacioli

Comments 10 pages, 2 figures, 5 tables. Pre-registered study. Code and data: https://github.com/synthiumjp/sdt-calibration

2604.08974 2026-04-13 cs.CL

Confident in a Confidence Score: Investigating the Sensitivity of Confidence Scores to Supervised Fine-Tuning

Lorenzo Jaime Yu Flores, Cesare Spinoso di-Piano, Jackie Chi Kit Cheung

2604.08971 2026-04-13 cs.LG

Modality-Aware Zero-Shot Pruning and Sparse Attention for Efficient Multimodal Edge Inference

Yueyuan Sui, Payal Mohapatra, Doğaç Eldenk, Haodong Yang, Yiting Zhang, Haoyan Zhang, Qi Zhu, Stephen Xia

2604.08970 2026-04-13 cs.CL cs.AI cs.HC cs.MA

Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models

Avni Mittal, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

2604.08967 2026-04-13 cs.SD

AudioGS: Spectrogram-Based Audio Gaussian Splatting for Sound Field Reconstruction

Chunhao Bi, Houqiang Zhong, Zhixin Xu, Li Song, Zhengxue Cheng

2604.08966 2026-04-13 cs.CV

How Should Video LLMs Output Time? An Analysis of Efficient Temporal Grounding Paradigms

Shengji Jin, Yuanhao Zou, Victor Zhu, Zhengping Ji, Chen Chen

Comments CVPR 2026 Workshop Paper

2604.08965 2026-04-13 cs.CV

Dynamic Class-Aware Active Learning for Unbiased Satellite Image Segmentation

Gadi Hemanth Kumar, Athira Nambiar, Pankaj Bodani

2604.08964 2026-04-13 cs.CL

Breaking Block Boundaries: Anchor-based History-stable Decoding for Diffusion Large Language Models

Shun Zou, Yong Wang, Zehui Chen, Lin Chen, Chongyang Tao, Feng Zhao, Xiangxiang Chu

Comments Accepted for ACL 2026

2604.08960 2026-04-13 cs.LG

Efficient Hierarchical Implicit Flow Q-learning for Offline Goal-conditioned Reinforcement Learning

Zhiqiang Dong, Teng Pang, Rongjian Xu, Guoqiang Wu

2604.08956 2026-04-13 cs.CV cs.LG

Low-Data Supervised Adaptation Outperforms Prompting for Cloud Segmentation Under Domain Shift

Harshith Kethavath, Weiming Hu

Comments 10 pages, 6 figures, to be published in EarthVision @ CVPR 2026

2604.08947 2026-04-13 cs.CL cs.AI

MuTSE: A Human-in-the-Loop Multi-use Text Simplification Evaluator

Rares-Alexandru Roscan, Gabriel Petre1, Adrian-Marius Dumitran, Angela-Liliana Dumitran

Comments Accepted for ITS 2026

2604.08945 2026-04-13 cs.CV cs.RO

TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches

Langzhe Gu, Hung-Jui Huang, Mohamad Qadri, Michael Kaess, Wenzhen Yuan

Comments Project Page: https://grange007.github.io/touchanything

2604.08943 2026-04-13 cs.CV cs.RO

MASS: Mesh-inellipse Aligned Deformable Surfel Splatting for Hand Reconstruction and Rendering from Egocentric Monocular Video

Haoyu Zhu, Yi Zhang, Lei Yao, Lap-pui Chau, Yi Wang

Comments This paper has been accepted to CVM 2026 Journal Track and is under consideration for publication in IEEE TVCG

2604.08941 2026-04-13 cs.LG

Predictive Entropy Links Calibration and Paraphrase Sensitivity in Medical Vision-Language Models

Binesh Sadanandan, Vahid Behzadan

2604.08939 2026-04-13 cs.LG

Delve into the Applicability of Advanced Optimizers for Multi-Task Learning

Zhipeng Zhou, Linxiao Cao, Pengcheng Wu, Peilin Zhao, Chunyan Miao

Comments 12 pages, 5 figures

2604.08931 2026-04-13 cs.AI cs.MA

Enhancing LLM Problem Solving via Tutor-Student Multi-Agent Interaction

Nurullah Eymen Özdemir, Erhan Oztop

Comments 7 pages, 3 figures, This work is under review for conference appearance

2604.08926 2026-04-13 cs.LG

Bridging SFT and RL: Dynamic Policy Optimization for Robust Reasoning

Taojie Zhu, Dongyang Xu, Ding Zou, Sen Zhao, Qiaobo Hao, Zhiguo Yang, Yonghong He

Comments ACL 2026 findings

2604.08924 2026-04-13 cs.CV

Customized Fusion: A Closed-Loop Dynamic Network for Adaptive Multi-Task-Aware Infrared-Visible Image Fusion

Zengyi Yang, Yu Liu, Juan Cheng, Zhiqin Zhu, Yafei Zhang, Huafeng Li

Comments This paper has been accepted by CVPR 2026

2604.08922 2026-04-13 cs.CV

Degradation-Robust Fusion: An Efficient Degradation-Aware Diffusion Framework for Multimodal Image Fusion in Arbitrary Degradation Scenarios

Yu Shi, Yu Liu, Zhong-Cheng Wu, Juan Cheng, Huafeng Li, Xun Chen

Comments Accepted by CVPR 2026