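arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.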

2603.14251 2026-03-17 cs.CL cs.AI

Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring

Weixin Guan, Liang Li, Jiapeng Liu, Bing Li, Peng Fu, Chengyang Fang, Xiaoshuai Hao, Can Ma, Weiping Wang

Abstract

Large Reasoning Language Models (LRLMs) demonstrate impressive capabilities on complex tasks by utilizing long Chain-of-Thought (CoT) reasoning. However, they are prone to overthinking: generating redundant reasoning steps that degrade both performance and efficiency. Recently, early-exit strategies have been proposed to mitigate overthinking by dynamically and adaptively terminating redundant reasoning. However, current early-exit methods either introduce extra training overhead by relying on proxy models or limit inference throughput because of frequent context switching between reasoning and generating probing answers. Moreover, most early-exit methods harm LRLM performance through over-truncation. Our insight stems from an observation: overthinking often causes LRLMs to deviate from the correct reasoning path, and this deviation is frequently accompanied by high-entropy transition tokens. Given this, we propose an early-exit method deeply coupled with the native reasoning process. It uses a path deviation index as a dedicated metric that monitors how frequently high-entropy transition tokens occur, in order to dynamically detect and terminate overthinking trajectories. We conduct experiments across multiple benchmarks using LRLMs of different types and scales, and the results indicate that our method delivers the largest performance improvement over vanilla CoT among the early-exit methods compared.
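
The abstract does not define the path deviation index, so the following is only a rough sketch of the general idea of monitoring high-entropy transition tokens during decoding, not the authors' method. The class name `DeviationMonitor`, the sliding window, the entropy threshold, the exit ratio, and the choice of "Wait" as a transition token are all assumptions made for illustration.

```python
import math
from collections import deque


def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


class DeviationMonitor:
    """Hypothetical monitor (not from the paper): tracks the share of recent
    tokens that are transition tokens emitted under high entropy."""

    def __init__(self, transition_tokens, entropy_threshold=1.0,
                 window=8, exit_ratio=0.5):
        self.transition_tokens = set(transition_tokens)
        self.entropy_threshold = entropy_threshold  # assumed cutoff, in nats
        self.window = window                        # assumed window length
        self.exit_ratio = exit_ratio                # assumed exit trigger
        self.flags = deque(maxlen=window)

    def update(self, token, probs):
        """Record one generated token; return True when reasoning should stop."""
        flagged = (token in self.transition_tokens
                   and token_entropy(probs) > self.entropy_threshold)
        self.flags.append(flagged)
        # Illustrative index: flagged tokens divided by the window length.
        index = sum(self.flags) / self.window
        return index >= self.exit_ratio


# Usage: a uniform distribution over 4 tokens has entropy ln(4), above the threshold.
monitor = DeviationMonitor({"Wait"}, window=4, exit_ratio=0.5)
uniform = [0.25, 0.25, 0.25, 0.25]
peaked = [0.97, 0.01, 0.01, 0.01]
for tok, dist in [("Wait", uniform), ("so", peaked), ("Wait", uniform)]:
    if monitor.update(tok, dist):
        print("early exit after token:", tok)
```

In this sketch the check runs on the distributions the model already produces while reasoning, so it needs no proxy model and no separate probing pass; whether the paper computes its index this way is not stated in the abstract.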

2603.14249 2026-03-17 cs.CV

OAHuman: Occlusion-Aware 3D Human Reconstruction from Monocular Images

Yuanwang Yang, Hongliang Liu, Muxin Zhang, Nan Ma, Jingyu Yang, Yu-Kun Lai, Kun Li

2603.14245 2026-03-17 cs.LG cs.AI

GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies

He Zhang, Ying Sun, Hui Xiong

Comments 23 pages, 13 figures

2603.14244 2026-03-17 cs.RO

Design of a Bio-Inspired Miniature Submarine for Low-Cost Water Quality Monitoring

Quang Huy Vu, Quan Le, Manh Duong Phung

2603.14243 2026-03-17 cs.CV

BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification

Haoxuan Xu, Guanglin Niu

2603.14241 2026-03-17 cs.CV

CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control

Zhiyi Kuang, Chengan He, Egor Zakharov, Yuxuan Xue, Shunsuke Saito, Olivier Maury, Timur Bagautdinov, Youyi Zheng, Giljoo Nam

Comments 11 pages, 6 figures

2603.14240 2026-03-17 cs.CV

FOCUS: Bridging Fine-Grained Recognition and Open-World Discovery across Domains

Vaibhav Rathore, Divyam Gupta, Moloud Abdar, Subhasis Chaudhuri, Biplab Banerjee

Comments Under Review

2603.14239 2026-03-17 cs.CL cs.AI cs.AR cs.LG

QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis

Yutong Wu, Chenrui Cao, Pengwei Jin, Di Huang, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Xing Hu

Comments Accepted by DAC 2026. Code: https://github.com/wyt2000/CodeV-SVA; Model: https://huggingface.co/wyt2000/CodeV-SVA-14B

2603.14238 2026-03-17 cs.LG cs.MM

Domain-Skewed Federated Learning with Feature Decoupling and Calibration

Huan Wang, Jun Shen, Jun Yan, Guansong Pang

Comments Accepted at CVPR 2026

2603.14236 2026-03-17 cs.RO cs.DC

AeroGen: Agentic Drone Autonomy through Single-Shot Structured Prompting & Drone SDK

Kautuk Astu, Yogesh Simmhan

2603.14232 2026-03-17 cs.CV

S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction

Renhe Zhang, Yuyang Tan, Jingyu Gong, Zhizhong Zhang, Lizhuang Ma, Yuan Xie, Xin Tan

Comments 10 pages, 7 figures

2603.14229 2026-03-17 cs.AI cs.SE

Agentic DAG-Orchestrated Planner Framework for Multi-Modal, Multi-Hop Question Answering in Hybrid Data Lakes

Kirushikesh D B, Manish Kesarwani, Nishtha Madaan, Sameep Mehta, Aldrin Dennis, Siddarth Ajay, Rakesh B R, Renu Rajagopal, Sudheesh Kairali

2603.14224 2026-03-17 cs.LG cs.AI

Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys

Xu Yang, Jiapeng Zhang, Dongyang Zhao, Guo Chen, Zhuo Tang

2603.14221 2026-03-17 cs.RO cs.AI cs.LG

A Real-Time Neuro-Symbolic Ethical Governor for Safe Decision Control in Autonomous Robotic Manipulation

Aueaphum Aueawatthanaphisut, Kuepon Aueawatthanaphisut

Comments 6 pages, 6 figures, 5 equations

2603.14220 2026-03-17 cs.CV

FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

Jie Li, Yingying Feng, Chi Xie, Jie Hu, Lei Tan, Jiayi Ji

Comments AAAI'26

2603.14219 2026-03-17 cs.CV

Safety-Potential Pruning for Enhancing Safety Prompts Against VLM Jailbreaking Without Retraining

Chongxin Li, Hanzhang Wang, Lian Duan

Comments Accepted for publication in Transactions of the Association for Computational Linguistics (TACL)

2603.14216 2026-03-17 cs.RO cs.HC

Navigation beyond Wayfinding: Robots Collaborating with Visually Impaired Users for Environmental Interactions

Shaojun Cai, Nuwan Janaka, Ashwin Ram, Janidu Shehan, Yingjia Wan, Kotaro Hara, David Hsu

Comments Accepted to ACM/IEEE HRI 2026, 10 pages, 6 figures

2603.14214 2026-03-17 cs.CV cs.AI

UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation

Xingyuan Li, Songcheng Du, Yang Zou, HaoYuan Xu, Zhiying Jiang, Jinyuan Liu

Comments 11 pages, 8 figures, published to CVPR2026

2603.14212 2026-03-17 cs.AI

Memory as Asset: From Agent-centric to Human-centric Memory Management

Yanqi Pan, Qinghao Huang, Weihao Yang

2603.14207 2026-03-17 cs.CV cs.AI

DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution

Axi Niu, Kang Zhang, Qingsen Yan, Hao Jin, Jinqiu Sun, Yanning Zhang

2603.14189 2026-03-17 cs.CV cs.AI

Walking Further: Semantic-aware Multimodal Gait Recognition Under Long-Range Conditions

Zhiyang Lu, Wen Jiang, Tianren Wu, Zhichao Wang, Changwang Zhang, Siqi Shen, Ming Cheng

Comments Accepted by AAAI 2026

2603.14187 2026-03-17 cs.CV

Deep Learning From Routine Histology Improves Risk Stratification for Biochemical Recurrence in Prostate Cancer

Clément Grisi, Khrystyna Faryna, Nefise Uysal, Vittorio Agosti, Enrico Munari, Solène-Florence Kammerer-Jacquet, Paulo Guilherme de Oliveira Salles, Yuri Tolkach, Reinhard Büttner, Sofiya Semko, Maksym Pikul, Axel Heidenreich, Jeroen van der Laak, Geert Litjens

Comments Preprint

2603.14183 2026-03-17 cs.CL

Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification

Fariba Afrin Irany, Sampson Akwafuo

2603.14182 2026-03-17 cs.RO cs.HC

Towards Equitable Robotic Furnishing Agents for Aging-in-Place: ADL-Grounded Design Exploration

Hansoo Lee, Changhee Seo, Subin Park, Sonya S. Kwak

Comments Accepted at the ACM/IEEE International Conference on Human-Robot Interaction (HRI) 2026 Workshop: Equitable Robotics for Wellbeing (Eq-RW)

2603.14176 2026-03-17 cs.CV

BluRef: Unsupervised Image Deblurring with Dense-Matching References

Bang-Dang Pham, Anh Tran, Cuong Pham, Minh Hoai

Comments Accepted to CVPR 2026. Project page: https://qualcomm-ai-research.github.io/BluRef/

2603.14175 2026-03-17 cs.LG cs.CV

Balancing Multimodal Domain Generalization via Gradient Modulation and Projection

Hongzhao Li, Guohao Shen, Shupan Li, Mingliang Xu, Muhammad Haris Khan

Comments AAAI 2026 Oral Accepted

2603.14173 2026-03-17 cs.LG cs.AI cs.IR

Hybrid Intent-Aware Personalization with Machine Learning and RAG-Enabled Large Language Models for Financial Services Marketing

Akhil Chandra Shanivendra

Comments 18 pages, 5 figures, 3 tables. Applied ML systems paper. The contribution is architectural rather than algorithmic

2603.14171 2026-03-17 cs.LG

TACTIC for Navigating the Unknown: Tabular Anomaly deteCTion via In-Context inference

Patryk Marszałek, Tomasz Kuśmierczyk, Marek Śmieja

2603.14161 2026-03-17 cs.LG q-bio.NC

Deep probabilistic model synthesis enables unified modeling of whole-brain neural activity across individual subjects

William E. Bishop, Luuk W. Hesselink, Bernhard Englitz, Misha B. Ahrens, James E. Fitzgerald

Comments 40 pages, 8 figures

2603.14160 2026-03-17 cs.RO

See, Learn, Assist: Safe and Self-Paced Robotic Rehabilitation via Video-Based Learning from Demonstration

Ali Alabbas, Camillo Murgia, Joanne Regan, Philip Long