arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.18356 2026-04-21 cs.CL

ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship

Zhaopei Huang, Yanfeng Jia, Jiayi Zhao, Xinjie Zhang, Wenxuan Wang, Qin Jin

详情

英文摘要

Developing compassionate interactive systems requires agents to not only understand user emotions but also provide diverse, substantive support. While recent works explore empathetic dialogue generation, they remain limited in response form and content, struggling to satisfy diverse needs across users and contexts. To address this, we explore empowering agents with external tools to execute diverse actions. Grounded in the psychological concept of "social support", this paradigm delivers substantive, human-like companionship. Specifically, we first design a dozen user-centric tools simulating various multimedia applications, which can cover different types of social support behaviors in human-agent interaction scenarios. We then construct ComPASS-Bench, the first personalized social support benchmark for LLM-based agents, via multi-step automated synthesis and manual refinement. Based on ComPASS-Bench, we further synthesize tool use records to fine-tune the Qwen3-8B model, yielding a task-specific ComPASS-Qwen. Comprehensive evaluations across two settings reveal that while the evaluated LLMs can generate valid tool-calling requests with high success rates, significant gaps remain in final response quality. Moreover, tool-augmented responses achieve better overall performance than directly producing conversational empathy. Notably, our trained ComPASS-Qwen demonstrates substantial improvements over its base model, achieving comparable performance to several large-scale models. Our code and data are available at https://github.com/hzp3517/ComPASS.

URL PDF HTML ☆

赞 0 踩 0

2604.18354 2026-04-21 cs.CL

PRISMA: Preference-Reinforced Self-Training Approach for Interpretable Emotionally Intelligent Negotiation Dialogues

Prajwal Vijay Kajare, Priyanshu Priya, Bikash Santra, Asif Ekbal

Comments 10 pages + appendix (23 pages total), paper accepted at ACL (Main) 2026

2604.18348 2026-04-21 cs.CV cs.AI

AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation

Haoyue Tan, Shengnan Wang, Yulin Qiao, Juncheng Zhang, Youhui Bai, Ping Gong, Zewen Jin, Cheng Li

Comments CVPR 2026 poster

2604.18344 2026-04-21 cs.AI

One Pass for All: A Discrete Diffusion Model for Knowledge Graph Triple Set Prediction

Jihong Guan, Jiaqi Wang, Wengen Li, Hanchen Yang, Yichao Zhang, Shuigeng Zhou

2604.18343 2026-04-21 cs.RO cs.SY eess.SY

DAG-STL: A Hierarchical Framework for Zero-Shot Trajectory Planning under Signal Temporal Logic Specifications

Ruijia Liu, Ancheng Hou, Xiao Yu, Xiang Yin

详情

英文摘要

Signal Temporal Logic (STL) is a powerful language for specifying temporally structured robotic tasks. Planning executable trajectories under STL constraints remains difficult when system dynamics and environment structure are not analytically available. Existing methods typically either assume explicit models or learn task-specific behaviors, limiting zero-shot generalization to unseen STL tasks. In this work, we study offline STL planning under unknown dynamics using only task-agnostic trajectory data. Our central design philosophy is to separate logical reasoning from trajectory realization. We instantiate this idea in DAG-STL, a hierarchical framework that converts long-horizon STL planning into three stages. It first decomposes an STL formula into reachability and invariance progress conditions linked by shared timing constraints. It then allocates timed waypoints using learned reachability-time estimates. Finally, it synthesizes trajectories between these waypoints with a diffusion-based generator. This decomposition--allocation--generation pipeline reduces global planning to shorter, better-supported subproblems. To bridge the gap between planning-level correctness and execution-level feasibility, we further introduce a rollout-free dynamic consistency metric, an anytime refinement search procedure for improving multiple allocation hypotheses under finite budgets, and a hierarchical online replanning mechanism for execution-time recovery. Experiments in Maze2D, OGBench AntMaze, and the Cube domain show that DAG-STL substantially outperforms direct robustness-guided diffusion on complex long-horizon STL tasks and generalizes across navigation and manipulation settings. In a custom environment with an optimization-based reference, DAG-STL recovers most model-solvable tasks while retaining a clear computational advantage over direct optimization based on the explicit system model.

URL PDF HTML ☆

赞 0 踩 0

2604.18336 2026-04-21 cs.RO cs.CV

Enhancing Glass Surface Reconstruction via Depth Prior for Robot Navigation

Jiamin Zheng, Jingwen Yu, Guangcheng Chen, Hong Zhang

Comments 9 pages, 8 figures

2604.18331 2026-04-21 cs.RO

Will People Enjoy a Robot Trainer? A Case Study with Snoopie the Pacerbot

Maximilian Du, Jennifer Grannen, Shuran Song, Dorsa Sadigh

Comments 8 pages, 4 figures. To appear at ICRA 2026

2604.18328 2026-04-21 cs.CL

FregeLogic at SemEval 2026 Task 11: A Hybrid Neuro-Symbolic Architecture for Content-Robust Syllogistic Validity Prediction

Adewale Akinfaderin, Nafi Diallo

Comments Camera-ready version to appear at The 20th International Workshop on Semantic Evaluation (SemEval-2026), ACl 2026

2604.18327 2026-04-21 cs.AI cs.CL

PARM: Pipeline-Adapted Reward Model

Xingyu Fan, Wei Shao, Jiacheng Liu, Linqi Song, Pheng Ann Heng

2604.18320 2026-04-21 cs.CV cs.AI

EVE: Verifiable Self-Evolution of MLLMs via Executable Visual Transformations

Yongrui Heng, Chaoya Jiang, Han Yang, Shikun Zhang, Wei Ye

2604.18313 2026-04-21 cs.CV

Denoise and Align: Diffusion-Driven Foreground Knowledge Prompting for Open-Vocabulary Temporal Action Detection

Sa Zhu, Wanqian Zhang, Lin Wang, Jinchao Zhang, Cong Wang, Bo Li

Comments Accepted by SIGIR 2026

2604.18312 2026-04-21 cs.LG

Scale-free adaptive planning for deterministic dynamics & discounted rewards

Peter L. Bartlett, Victor Gabillon, Jennifer Healey, Michal Valko

Comments 36th International Conference on Machine Learning (ICML 2019)

2604.18311 2026-04-21 cs.CL cs.AI

On the Importance and Evaluation of Narrativity in Natural Language AI Explanations

Mateusz Cedro, David Martens

Comments 30 pages, 7 figures, 9 tables

2604.18305 2026-04-21 cs.LG

CAARL: In-Context Learning for Interpretable Co-Evolving Time Series Forecasting

Etienne Tajeuna, Patrick Asante Owusu, Armelle Brun, Shengrui Wang

Comments Double-columned, 8 pages, 4 figures

2604.18302 2026-04-21 cs.AI

Toward Zero-Egress Psychiatric AI: On-Device LLM Deployment for Privacy-Preserving Mental Health Decision Support

Eranga Bandara, Asanga Gunaratna, Ross Gore, Anita H. Clayton, Christopher K. Rhea, Sachini Rajapakse, Isurunima Kularathna, Sachin Shetty, Ravi Mukkamala, Xueping Liang, Preston Samuel, Atmaram Yarlagadda

详情

英文摘要

Privacy represents one of the most critical yet underaddressed barriers to AI adoption in mental healthcare -- particularly in high-sensitivity operational environments such as military, correctional, and remote healthcare settings, where the risk of patient data exposure can deter help-seeking behavior entirely. Existing AI-enabled psychiatric decision support systems predominantly rely on cloud-based inference pipelines, requiring sensitive patient data to leave the device and traverse external servers, creating unacceptable privacy and security risks in these contexts. In this paper, we propose a zero-egress, on-device AI platform for privacy-preserving psychiatric decision support, deployed as a cross-platform mobile application. The proposed system extends our prior work on fine-tuned LLM consortiums for psychiatric diagnosis standardization by fundamentally re-architecting the inference pipeline for fully local execution -- ensuring that no patient data is transmitted to, processed by, or stored on any external server at any stage. The platform integrates a consortium of three lightweight, fine-tuned, and quantized open-source LLMs -- Gemma, Phi-3.5-mini, and Qwen2 -- selected for their compact architectures and proven efficiency on resource-constrained mobile hardware. An on-device orchestration layer coordinates ensemble inference and consensus-based diagnostic reasoning, producing DSM-5-aligned assessments for conditions. The platform is designed to assist clinicians with differential diagnosis and evidence-linked symptom mapping, as well as to support patient-facing self-screening with appropriate clinical safeguards. Initial evaluation demonstrates that the proposed zero-egress deployment achieves diagnostic accuracy comparable to its server-side predecessor while sustaining real-time inference latency on commodity mobile hardware.

URL PDF HTML ☆

赞 0 踩 0

2604.18296 2026-04-21 cs.CL

Exploring Concreteness Through a Figurative Lens

Saptarshi Ghosh, Tianyu Jiang

Comments ACL 2026

2604.18293 2026-04-21 cs.CL

An Existence Proof for Neural Language Models That Can Explain Garden-Path Effects via Surprisal

Ryo Yoshida, Shinnosuke Isono, Taiga Someya, Yohei Oseki, Tatsuki Kuribayashi

Comments To appear in ACL 2026

2604.18292 2026-04-21 cs.AI cs.CL

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Guanting Dong, Junting Lu, Junjie Huang, Wanjun Zhong, Longxiang Liu, Shijue Huang, Zhenyu Li, Yang Zhao, Xiaoshuai Song, Xiaoxi Li, Jiajie Jin, Yutao Zhu, Hanbin Wang, Fangyu Lei, Qinyu Luo, Mingyang Chen, Zehui Chen, Jiazhan Feng, Ji-Rong Wen, Zhicheng Dou

Comments Working in progress

2604.18289 2026-04-21 cs.RO cs.CV cs.SY eess.SY

Relative State Estimation using Event-Based Propeller Sensing

Ravi Kumar Thakur, Luis Granados Segura, Jan Klivan, Radim Špetlík, Tobiáš Vinklárek, Matouš Vrba, Martin Saska

2604.18284 2026-04-21 cs.CV

Spike-NVPT: Learning Robust Visual Prompts via Bio-Inspired Temporal Filtering and Discretization

Qiugang Zhan, Anning Jiang, Ran Tao, Ao Ma, Xiangyu Zhang, Xiurui Xie, Guisong Liu

2604.18277 2026-04-21 cs.LG

Dissipative Latent Residual Physics-Informed Neural Networks for Modeling and Identification of Electromechanical Systems

Youyuan Long, Gokhan Solak, Arash Ajoudani

Comments Accepted for publication at the 23rd IFAC World Congress 2026

2604.18271 2026-04-21 cs.RO

EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents

Paolo Riva, Leonardo Gargani, Matteo Frosi, Matteo Matteucci

Comments 8 pages, 3 figures

2604.18267 2026-04-21 cs.CV

MARCO: Navigating the Unseen Space of Semantic Correspondence

Claudia Cuttano, Gabriele Trivigno, Carlo Masone, Stefan Roth

Comments CVPR 2026 Oral. Project page: https://visinf.github.io/MARCO/

2604.18266 2026-04-21 cs.AI

Enhancing Tabular Anomaly Detection via Pseudo-Label-Guided Generation

Wei Huang, Yuxuan Xiong, Hezhe Qiao, Yu-Ming Shang, Xiangling Fu, Guansong Pang

Comments 13 pages, 6 figures

2604.18264 2026-04-21 cs.LG

Universally Empowering Zeroth-Order Optimization via Adaptive Layer-wise Sampling

Fei Wang, Li Shen, Liang Ding, Chao Xue, Ye Liu, Changxing Ding

2604.18260 2026-04-21 cs.CV

Geometry-Guided 3D Visual Token Pruning for Video-Language Models

Han Li, Zehao Huang, Jiahui Fu, Naiyan Wang, Si Liu

Comments Accepted by CVPR 2026

2604.18258 2026-04-21 cs.CV cs.AI

Long-Text-to-Image Generation via Compositional Prompt Decomposition

Jen-Yuan Huang, Tong Lin, Yilun Du

Comments Accepted to the Fourteenth International Conference on Learning Representations (ICLR 2026)

2604.18256 2026-04-21 cs.CV cs.LG

Domain-Specialized Object Detection via Model-Level Mixtures of Experts

Svetlana Pavlitska, Malte Stüven, Beyza Keskin, J. Marius Zöllner

Comments Accepted for publication at IJCNN 2026

2604.18254 2026-04-21 cs.AI cs.DB cs.SE

LeGo-Code: Can Modular Curriculum Learning Advance Complex Code Generation? Insights from Text-to-SQL

Salmane Chafik, Saad Ezzini, Ismail Berrada

Comments 7 pages, 3 figures, 4 tables

2604.18251 2026-04-21 cs.CV cs.AI cs.LG stat.AP

Style-Based Neural Architectures for Real-Time Weather Classification

Hamed Ouattara, Pascal Houssam Salmane, Pierre Duthon, Frédéric Bernardin, Omar Ait Aider

Comments 9 pages, 21 figures