arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.03648 2026-04-09 cs.CV

Linearized Coupling Flow with Shortcut Constraints for One-Step Face Restoration

Xiaohui Sun, Hanlin Wu

详情

英文摘要

Face restoration can be formulated as a continuous-time transformation between image distributions via Flow Matching (FM). However, standard FM typically employs independent coupling, ignoring the statistical correlation between low-quality (LQ) and high-quality (HQ) data. This leads to intersecting trajectories and high velocity-field curvature, requiring multi-step integration. We propose Shortcut-constrained Coupling Flow for Face Restoration (SCFlowFR) to address these challenges. By establishing a data-dependent coupling, we explicitly model the LQ-HQ dependency to minimize path crossovers and promote near-linear probability flow. Furthermore, we employ a conditional mean estimator to refine the source distribution's anchor, effectively tightening the transport cost and stabilizing the velocity field. To ensure stable one-step inference, a shortcut constraint is introduced to supervise average velocities over arbitrary intervals, mitigating discretization bias in large-step updates. SCFlowFR achieves state-of-the-art one-step restoration, providing a superior trade-off between perceptual fidelity and computational efficiency.

URL PDF HTML ☆

赞 0 踩 0

2602.16005 2026-04-09 cs.RO cs.AI

ODYN: An All-Shifted Non-Interior-Point Method for Quadratic Programming in Robotics and AI

Jose Rojas, Aristotelis Papatheodorou, Sergi Martinez, Andrea Patrizi, Ioannis Havoutis, Carlos Mastalli

Comments 20 pages, 12 figures, under-review

2602.09987 2026-04-09 cs.LG cs.AI cs.CY

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions

J Rosser, Robert Kirk, Edward Grefenstette, Jakob Foerster, Laura Ruis

Comments 10 pages, 14 figures

2602.07181 2026-04-09 cs.CL

PACIFIC: Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs

Tianyu Zhao, Siqi Li, Yasser Shoukry, Salma Elmalaki

2602.02676 2026-04-09 cs.CV

AdaptMMBench: Benchmarking Adaptive Multimodal Reasoning for Mode Selection and Reasoning Process

Xintong Zhang, Xiaowen Zhang, Jingrong Wu, Zhi Gao, Shilin Yan, Zhenxin Diao, Kunpeng Gao, Xuanyan Chen, Yuwei Wu, Yunde Jia, Qing Li

2601.23155 2026-04-09 cs.LG cs.AI

SPICE: Submodular Penalized Information-Conflict Selection for Efficient Large Language Model Training

Powei Chang, Jinpeng Zhang, Bowen Chen, Chenyu Wang, Chenlu Guo, Yixing Zhang, Yukang Gao, JianXiang Xiang, Yue Gao, Chaoqun Sun, Yiyi Chen, Dongying Kong

Comments Accepted to ICLR 2026 main conference ; Code available at <https://github.com/Chang-pw/SPICE>

2601.22451 2026-04-09 cs.CV cs.AI

Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework

Shiyu Liu, Xinyi Wen, Zhibin Lan, Ante Wang, Jinsong Su

Comments Code is available at https://github.com/Liushiyu-0709/SelfVal

2601.21708 2026-04-09 cs.AI cs.CL

FBS: Modeling Native Parallel Reading inside a Transformer

Tongxi Wang

Comments Accept to ACL2026 as findings

2601.18100 2026-04-09 cs.CV

Spatial-Conditioned Reasoning in Long-Egocentric Videos

James Tribble, Hao Wang, Si-En Hong, Chaoyi Zhou, Ashish Bastola, Siyu Huang, Abolfazl Razi

2601.16206 2026-04-09 cs.CL cs.AI

Computer Environments Elicit General Agentic Intelligence in LLMs

Daixuan Cheng, Shaohan Huang, Yuxian Gu, Huatong Song, Guoxin Chen, Li Dong, Wayne Xin Zhao, Ji-Rong Wen, Furu Wei

Comments Project Page: https://llm-in-sandbox.github.io

2601.16074 2026-04-09 cs.LG

Explainable AI to Improve Machine Learning Reliability for Industrial Cyber-Physical Systems

Annemarie Jutte, Uraz Odyurt

2601.11471 2026-04-09 cs.LG

Low-Rank Key Value Attention

James O'Neill, Robert Clancy, Mariia Matskevichus, Fergal Reid

2601.08258 2026-04-09 cs.AI

Diagnosing and Mitigating Sycophancy and Skepticism in LLM Causal Judgment

Edward Y. Chang

Comments 19 pages, 3 figures, 15 tables

2601.07995 2026-04-09 cs.CL

Is Sentiment Banana-Shaped? Exploring the Geometry and Portability of Sentiment Concept Vectors

Laurits Lyngbaek, Pascale Feldkamp, Yuri Bizzoni, Kristoffer L. Nielbo, Kenneth Enevoldsen

Comments Published at WASSA 2026 (15th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis), ACL 2026. Pages 146-160

2601.07154 2026-04-09 cs.CV

Motion Focus Recognition in Fast-Moving Egocentric Video

Si-En Hong, James Tribble, Alexander Lake, Hao Wang, Chaoyi Zhou, Ashish Bastola, Siyu Huang, Eisa Chaudhary, Brian Canada, Ismahan Arslan-Ari, Abolfazl Razi

2601.05529 2026-04-09 cs.AI cs.RO

Before We Trust Them: Decision-Making Failures in Navigation of Foundation Models

Jua Han, Jaeyoon Seo, Jungbin Min, Sieun Choi, Huichan Seo, Jihie Kim, Jean Oh

Comments Corrected author order in metadata; manuscript changed

2601.02721 2026-04-09 cs.CV cs.MM

Robust Mesh Saliency Ground Truth Acquisition in VR via View Cone Sampling and Manifold Diffusion

Guoquan Zheng, Jie Hao, Huiyu Duan, Long Tang, Shuo Yang, Yucheng Zhu, Yongming Han, Liang Yuan, Patrick Le Callet, Guangtao Zhai

2601.02627 2026-04-09 cs.CL cs.AI

Improved Evidence Extraction and Metrics for Document Inconsistency Detection with LLMs

Nelvin Tan, Yaowen Zhang, James Asikin Cheung, Fusheng Liu, Yu-Ching Shih, Dong Yang

Comments 14 pages, 9 figures

2512.24933 2026-04-09 cs.CL cs.LG

ADOPT: Adaptive Dependency-Guided Joint Prompt Optimization for Multi-Step LLM Pipelines

Minjun Zhao, Xinyu Zhang, Shuai Zhang, Deyang Li, Ruifeng Shi

2512.19433 2026-04-09 cs.CV

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Yi Xin, Siqi Luo, Tianxiang Xu, Qi Qin, Haoxing Chen, Kaiwen Zhu, Zhiwei Zhang, Yangfan He, Rongchao Zhang, Jinbin Bai, Shuo Cao, Bin Fu, Junjun He, Yihao Liu, Yuewen Cao, Xiaohong Liu

Comments Project page: https://github.com/Alpha-VLLM/Lumina-DiMOO

2512.10510 2026-04-09 cs.LG cs.AI

Adaptive Replay Buffer for Offline-to-Online Reinforcement Learning

Chihyeon Song, Jaewoo Lee, Jinkyoo Park

Comments AISTATS 2026

2512.07527 2026-04-09 cs.CV cs.GR

From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images

Fei Yu, Yu Liu, Luyang Tang, Mingchao Sun, Zengye Ge, Rui Bu, Yuchao Jin, Haisen Zhao, He Sun, Yangyan Li, Mu Xu, Wenzheng Chen, Baoquan Chen

Comments Accepted by CVPR 2026 Findings. Project page: https://pku-vcl-geometry.github.io/Orbit2Ground/

2512.01925 2026-04-09 cs.CL cs.AI

Rectifying LLM Thought from Lens of Optimization

Junnan Liu, Hongwei Liu, Songyang Zhang, Kai Chen

Comments Accepted by ICLR 2026

2511.23158 2026-04-09 cs.CV cs.AI

REVEAL: Reasoning-Enhanced Forensic Evidence Analysis for Explainable AI-Generated Image Detection

Huangsen Cao, Qin Mei, Zhiheng Li, Yuxi Li, Zhan Meng, Ying Zhang, Chen Li, Zhimeng Zhang, Xin Ding, Yongwei Wang, Jing Lyu, Fei Wu

2511.22490 2026-04-09 cs.CV cs.IR

SciPostGen: Bridging the Gap between Scientific Papers and Poster Layouts

Shun Inadumi, Shohei Tanaka, Tosho Hirasawa, Atsushi Hashimoto, Koichiro Yoshino, Yoshitaka Ushiku

Comments CVPR2026 Findings

2511.22396 2026-04-09 cs.CV cs.AI

Asking like Socrates: Socrates helps VLMs understand remote sensing images

Run Shao, Ziyu Li, Zhaoyang Zhang, Linrui Xu, Xinran He, Hongyuan Yuan, Bolei He, Yongxing Dai, Yiming Yan, Yijun Chen, Wang Guo, Haifeng Li

Comments Accepted by CVPR 2026

2511.20886 2026-04-09 cs.CV

V$^{2}$-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence

Jiancheng Pan, Runze Wang, Tianwen Qian, Mohammad Mahdi, Yanwei Fu, Xiangyang Xue, Xiaomeng Huang, Luc Van Gool, Danda Pani Paudel, Yuqian Fu

Comments 19 pages

2511.20779 2026-04-09 cs.LG cs.CV cs.HC

CHiQPM: Calibrated Hierarchical Interpretable Image Classification

Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Neslihan Kose, Ramesh Manuvinakurike, Bodo Rosenhahn

Comments Accepted to NeurIPS 2025, updated version with correction

2511.19693 2026-04-09 cs.LG cs.AI

TREASURE: The Visa Payment Foundation Model for High-Volume Transaction Understanding

Chin-Chia Michael Yeh, Uday Singh Saini, Xin Dai, Xiran Fan, Shubham Jain, Yujie Fan, Jiarui Sun, Junpeng Wang, Menghai Pan, Yingtong Dou, Yuzhong Chen, Vineeth Rakesh, Liang Wang, Yan Zheng, Mahashweta Das

2511.19474 2026-04-09 cs.CV cs.AI cs.MM

Pistachio: Towards Synthetic, Balanced, and Long-Form Video Anomaly Benchmarks

Jie Li, Hongyi Cai, Mingkang Dong, Muxin Pu, Shan You, Fei Wang, Tao Huang

Comments https://github.com/Lizruletheworld/Low-Confidence_Gold