arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2512.23578 2026-04-17 cs.CL cs.SD

Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models

Yu-Xiang Lin, Cheng-Han Chiang, Hung-yi Lee

Comments ACL 2026 Findings

详情

英文摘要

In this paper, we show that when spoken language models (SLMs) are instructed to speak in a specific speaking style at the beginning of a multi-turn conversation, they cannot maintain the required speaking styles after several turns of interaction; we refer to this as the style amnesia of SLMs. We focus on paralinguistic speaking styles, including emotion, accent, volume, and speaking speed. We evaluate three proprietary and two open-source SLMs, demonstrating that none of these models can maintain a consistent speaking style when instructed to do so. We further show that while SLMs can recall the style instruction when prompted in later turns, they still fail to express it, but through explicit recall can mitigate style amnesia. In addition, SLMs struggle more when the style instruction is placed in system messages rather than user messages, even though system messages are specifically designed to provide persistent, conversation-level instructions. Our findings highlight a systematic gap in current SLMs' ability to maintain speaking styles, highlighting the need for improved style adherence in future models. Our code and evaluation data are publicly available at https://github.com/YuXiangLin1234/SLM-Style-Amnesia.

URL PDF HTML ☆

赞 0 踩 0

2512.22897 2026-04-17 cs.LG cs.MM

Federated Multi-Task Clustering

Suyan Dai, Gan Sun, Fazeng Li, Xu Tang, Qianqian Wang, Yang Cong

2512.17091 2026-04-17 cs.LG cs.AI cs.RO

Learning to Plan, Planning to Learn: Adaptive Hierarchical RL-MPC for Sample-Efficient Decision Making

Toshiaki Hori, Jonathan DeCastro, Deepak Gopinath, Avinash Balachandran, Guy Rosman

Comments 27 pages, 10 figures, 8th Annual Learning for Dynamics & Control Conference (L4DC)

2512.15925 2026-04-17 cs.CL cs.AI cs.LG cs.SI

Social Story Frames: Contextual Reasoning about Narrative Intent and Reception

Joel Mire, Maria Antoniak, Steven R. Wilson, Zexin Ma, Achyutarama R. Ganti, Andrew Piper, Maarten Sap

Comments ACL 2026 (Main)

2512.14098 2026-04-17 cs.LG cs.DC

Cornfigurator: Automated Planning for Any-to-Any Multimodal Model Serving

Jeff J. Ma, Jae-Won Chung, Jisang Ahn, Yizhuo Liang, Runyu Lu, Akshay Jajoo, Myungjin Lee, Mosharaf Chowdhury

Comments Open-source at https://github.com/cornserve-ai/cornfigurator

2512.13671 2026-04-17 cs.CV

AgentIAD: Agentic Industrial Anomaly Detection via Adaptive Memory Augmentation

Junwen Miao, Penghui Du, Yingying Fan, Yi Liu, Yu Wang, Runze He, Lida Huang, Yan Wang

2512.13168 2026-04-17 cs.AI cs.CE cs.IR cs.MA

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Haoyu Dong, Pengkun Zhang, Yan Gao, Xuanyu Dong, Yilin Cheng, Mingzhe Lu, Zikun Zhu, Adina Yakefu, Shuxin Zheng

Comments ACL 2026 Findings

2512.07222 2026-04-17 cs.LG cs.CL

Pay Less Attention to Function Words for Free Robustness of Vision-Language Models

Qiwei Tian, Chenhao Lin, Zhengyu Zhao, Chao Shen

Comments The paper has been accepted by ICLR26

2512.04585 2026-04-17 cs.CV

SAM3-I: Segment Anything with Instructions

Jingjing Li, Yue Feng, Yuchen Guo, Jincai Huang, Wei Ji, Qi Bi, Yongri Piao, Miao Zhang, Xiaoqi Zhao, Qiang Chen, Shihao Zou, Huchuan Lu, Li Cheng

2512.04578 2026-04-17 cs.CL

LexGenius: An Expert-Level Benchmark for Large Language Models in Legal General Intelligence

Wenjin Liu, Haoran Luo, Xin Feng, Xiang Ji, Lijuan Zhou, Rui Mao, Jiapu Wang, Shirui Pan, Erik Cambria

2511.21025 2026-04-17 cs.CV

CaptionQA: Is Your Caption as Useful as the Image Itself?

Shijia Yang, Yunong Liu, Bohan Zhai, Ximeng Sun, Zicheng Liu, Emad Barsoum, Manling Li, Chenfeng Xu

2511.20892 2026-04-17 cs.AI

Representation Interventions Enable Lifelong Knowledge Memory Control in LLMs

Xuyuan Liu, Shengyu Chen, Xinshuai Dong, Yanchi Liu, Xujiang Zhao, Haoyu Wang, Yujun Yan, Haifeng Chen, Zhengzhang Chen

Comments In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics: ACL 2026

2511.18107 2026-04-17 cs.LG stat.ML

Active Learning with Selective Time-Step Acquisition for PDEs

Yegon Kim, Hyunsu Kim, Gyeonghoon Ko, Juho Lee

Comments This manuscript is an improvement over the camera-ready version in ICML 2025. We have added a clearer motivation for our acquisition function. (See Sections 2.3 and 3.2)

2511.15915 2026-04-17 cs.LG cs.CL

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Genghan Zhang, Shaowei Zhu, Anjiang Wei, Zhenyu Song, Allen Nie, Zhen Jia, Nandita Vijaykumar, Yida Wang, Kunle Olukotun

2511.15825 2026-04-17 cs.AI

IMACT-CXR: An Interactive Multi-Agent Conversational Tutoring System for Chest X-Ray Interpretation

Tuan-Anh Le, Anh Mai Vu, David Yang, Akash Awasthi, Hien Van Nguyen

Comments Accepted at IEEE ISBI 2026. This version corresponds to the accepted manuscript

2511.14178 2026-04-17 cs.RO cs.AI

Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion

Zhuo Li, Junjia Liu, Zhipeng Dong, Tao Teng, Quentin Rouxel, Darwin Caldwell, Fei Chen

Comments 9 pages, 8 figures, submitted to IEEE RA-L

2511.02135 2026-04-17 cs.CL

Graph-Based Alternatives to LLMs for Human Simulation

Joseph Suh, Suhong Moon, Serina Chang

Comments Conference: ACL 2026 Long Main Code: https://github.com/schang-lab/gems

2511.01016 2026-04-17 cs.CL

Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Wenjin Liu, Haoran Luo, Xueyuan Lin, Haoming Liu, Tiesunlong Shen, Jiapu Wang, Rui Mao, Erik Cambria

2511.01014 2026-04-17 cs.CL

IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation

Bosi Wen, Yilin Niu, Cunxiang Wang, Pei Ke, Xiaoying Ling, Ying Zhang, Aohan Zeng, Hongning Wang, Minlie Huang

Comments ACL 2026

2510.27420 2026-04-17 cs.RO

Towards a Multi-Embodied Grasping Agent

Roman Freiberg, Alexander Qualmann, Ngo Anh Vien, Gerhard Neumann

Comments 8 pages, 3 figures

2510.26109 2026-04-17 cs.LG

Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error

Chenming Tang, Hsiu-Yuan Huang, Weijie Liu, Clive Bai, Saiyong Yang, Yunfang Wu

Comments Accepted to ACL 2026 (main conference)

2510.25892 2026-04-17 cs.LG

Topology-Aware Active Learning on Graphs

Harris Hardiman-Mostow, Jack Mauro, Adrien Weihs, Andrea L. Bertozzi

2510.24284 2026-04-17 cs.AI

MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools

Wenhao Wang, Peizhi Niu, Zhao Xu, Zhaoyu Chen, Jian Du, Yaxin Du, Xianghe Pang, Keduan Huang, Yanfeng Wang, Qiang Yan, Siheng Chen

Comments ACL 2026 Main, Camera Ready

2510.23853 2026-04-17 cs.CL

Your LLM Agents are Temporally Blind: The Misalignment Between Tool Use Decisions and Human Time Perception

Yize Cheng, Arshia Soltani Moakhar, Chenrui Fan, Parsa Hosseini, Kazem Faghih, Zahra Sodagar, Wenxiao Wang, Soheil Feizi

Comments ACL 2026 (findings), Camera-ready

2510.20151 2026-04-17 cs.CL

BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation

Haoyuan Li, Zhengyuan Shen, Sullam Jeoung, Yueyan Chen, Jiayu Li, Qi Zhu, Shuai Wang, Vassilis Ioannidis, Huzefa Rangwala

Comments accepted by ACL 2026 findings

2510.18935 2026-04-17 cs.CV

Feature Extraction in the Remote Sensing Data Value Chain: A Systematic Review of Methods and Applications

Nathan Mankovich, Kai-Hendrik Cohrs, Homer Durand, Vasileios Sitokonstantinou, Tristan Williams, Gustau Camps-Valls

2510.15946 2026-04-17 cs.LG cs.AI cs.CR

Fall into a Pit, Gain in a Wit: Cognitive-Guided Harmful Meme Detection via Misjudgment Risk Pattern Retrieval

Wenshuo Wang, Ziyou Jiang, Junjie Wang, Mingyang Li, Jie Huang, Yuekai Huang, Zhiyuan Chang, Feiyan Duan, Qing Wang

Comments 14 pages, 11 figures

2510.14665 2026-04-17 cs.AI cs.HC

Beyond "Hallucinations": A Framework for Stable Human-AI Reasoning

Rikard Rosenbacke, Carl Rosenbacke, Victor Rosenbacke, Martin McKee

2510.14664 2026-04-17 cs.SD eess.AS

SpeechLLM-as-Judges: Towards General and Interpretable Speech Quality Evaluation

Hui Wang, Jinghua Zhao, Yifan Yang, Shujie Liu, Junyang Chen, Yanzhe Zhang, Shiwan Zhao, Jinyu Li, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin

Comments ACL 2026

2510.08483 2026-04-17 cs.CL cs.AI

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Shangqing Tu, Yaxuan Li, Yushi Bai, Lei Hou, Juanzi Li

Comments Accepted by ACL 2026 Findings, please check out the project page: https://deepprune.github.io/