arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.25140 2026-03-27 cs.CV cs.AI cs.LG cs.MM cs.SD

SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment

Sahibzada Adil Shahzad, Ammarah Hashmi, Junichi Yamagishi, Yusuke Yasuda, Yu Tsao, Chia-Wen Lin, Yan-Tsung Peng, Hsin-Min Wang

2603.25139 2026-03-27 cs.RO cs.SY eess.SY

Dissimilarity-Based Persistent Coverage Control of Multi-Robot Systems for Improving Solar Irradiance Prediction Accuracy in Solar Thermal Power Plants

Haruki Kawase, Taiga Sugawara, A. Daniel Carnerero

Comments 8 pages, 6 figures, 5 tables

2603.25135 2026-03-27 cs.CV

EgoXtreme: A Dataset for Robust Object Pose Estimation in Egocentric Views under Extreme Conditions

Taegyoon Yoon, Yegyu Han, Seojin Ji, Jaewoo Park, Sojeong Kim, Taein Kwon, Hyung-Sin Kim

Comments Camera ready version for CVPR 2026, appendix included

2603.25133 2026-03-27 cs.AI

RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

Tianjun Pan, Xuan Lin, Wenyan Yang, Qianyu He, Shisong Chen, Licai Qi, Wanqing Xu, Hongwei Feng, Bo Xu, Yanghua Xiao

Comments 9 pages, 5 figures

2603.25131 2026-03-27 cs.CV

Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation

Yaowen Chang, Zhen Cao, Xu Zheng, Xiaoxin Mi, Zhen Dong

Comments Accepted to CVPR26

2603.25129 2026-03-27 cs.CV

AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting

Minh-Quan Viet Bui, Jaeho Moon, Munchurl Kim

Comments Project page: https://kaist-viclab.github.io/airsplat-site

2603.25121 2026-03-27 cs.RO

CTS-PLL: A Robust and Anytime Framework for Collaborative Task Sequencing and Multi-Agent Path Finding

Junkai Jiang, Yitao Xu, Ruochen Li, Shaobing Xu, Jianqiang Wang

Comments 8 pages, 5 figures, under review

2603.25118 2026-03-27 cs.CV

AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization

Jiawei Lin, Wanrong Zhu, Vlad I Morariu, Christopher Tensmeyer

Comments CVPR 2026 Main Conference

2603.25115 2026-03-27 cs.AI

When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning

Yifeng Lin, Aiping Huang, Wenxi Liu, Si Wu, Tiesong Zhao, Zheng-Jun Zha

Comments 11 pages, 6 figures

2603.25112 2026-03-27 cs.CL cs.AI

Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Jon-Paul Cacioli

Comments 12 pages, 3 figures, 7 tables. Pre-registered; code and data at https://anonymous.4open.science/r/sdt_calibration

2603.25109 2026-03-27 cs.CV cs.AI

MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

Yuto Matsuo, Yoshihiro Fukuhara, Yuki M. Asano, Rintaro Yanagi, Hirokatsu Kataoka, Akio Nakamura

2603.25108 2026-03-27 cs.CV

MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning

Chenglong Wang, Yifu Huo, Yang Gan, Qiaozhi He, Qi Meng, Bei Li, Yan Wang, Junfu Liu, Tianhua Zhou, Jingbo Zhu, Tong Xiao

Comments Accepted by CVPR 2026

2603.25107 2026-03-27 cs.CV

Label What Matters: Modality-Balanced and Difficulty-Aware Multimodal Active Learning

Yuqiao Zeng, Xu Wang, Tengfei Liang, Yiqing Hao, Yi Jin, Hui Yu

2603.25105 2026-03-27 cs.CL

OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs

Suraj Racha, Prashant Harish Joshi, Utkarsh Maurya, Nitin Yadav, Mridul Sharma, Ananya Kunisetty, Saranya Darisipudi, Nirmal Punjabi, Ganesh Ramakrishnan

Comments 9 pages, 3 figures, 5 tables

2603.25103 2026-03-27 cs.LG cs.AI

Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning

Diyar Altinses, Andreas Schwung

2603.24746 2026-03-27 cs.LG cond-mat.stat-mech cs.AI

Grokking as a Falsifiable Finite-Size Transition

Yuda Bi, Chenyu Zhang, Qiheng Wang, Vince D Calhoun

2603.24343 2026-03-27 cs.SD cs.AI

Enhancing Efficiency and Performance in Deepfake Audio Detection through Neuron-level Dropin & Neuroplasticity Mechanisms

Yupei Li, Shuaijie Shao, Manuel Milling, Björn Schuller

Comments Accepted at IJCNN 2026

2603.24295 2026-03-27 cs.CV

RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation

Kai Zhu, Zhenyu Cui, Zehua Zang, Jiahuan Zhou

Comments Accepted by CVPR 2026

2603.24242 2026-03-27 cs.CL

Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition

Aleix Sant, Jordi Luque, Carlos Escolano

Comments 12 pages, 4 figures, 5 tables

2603.23997 2026-03-27 cs.CV

HGGT: Robust and Flexible 3D Hand Mesh Reconstruction from Uncalibrated Images

Yumeng Liu, Xiao-Xiao Long, Marc Habermann, Xuanze Yang, Cheng Lin, Yuan Liu, Yuexin Ma, Wenping Wang, Ligang Liu

Comments project page: https://lym29.github.io/HGGT/

2603.23783 2026-03-27 cs.LG cs.AI math.OC math.PR stat.ML

Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

Aueaphum Aueawatthanaphisut, Kuepon Auewattanapisut

Comments 11 pages, 8 Figures, 25 Equations, 5 Tables and 3 Theorems

2603.23637 2026-03-27 cs.CV

Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting

Peiyu Xu, Xin Sun, Krishna Mullia, Raymond Fei, Iliyan Georgiev, Shuang Zhao

Comments Project Page: https://xupaya.github.io/stoch3DGS/

2603.23361 2026-03-27 cs.LG q-bio.GN

Central Dogma Transformer III: Interpretable AI Across DNA, RNA, and Protein

Nobuyuki Ota

Comments 21 pages, 8 figures, v2: corrected mRNA-protein divergence analysis with DSB-normalized data

2603.23324 2026-03-27 cs.CV

Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors

Chuanqing Zhuang, Xin Lu, Zehui Deng, Zhengda Lu, Yiqun Wang, Junqi Diao, Jun Xiao

2603.23159 2026-03-27 cs.CV cs.LG

Conformal Cross-Modal Active Learning

Huy Hoang Nguyen, Cédric Jung, Shirin Salehi, Tobias Glück, Anke Schmeink, Andreas Kugi

Comments 20 pages, 14 figures

2603.23101 2026-03-27 cs.LG

SpecXMaster Technical Report

Yutang Ge, Yaning Cui, Hanzheng Li, Jun-Jie Wang, Fanjie Xu, Jinhan Dong, Yongqi Jin, Dongxu Cui, Peng Jin, Guojiang Zhao, Hengxing Cai, Rong Zhu, Linfeng Zhang, Xiaohong Ji, Zhifeng Gao

Comments Technical report from DP Technology.22 pages, 7 figures

2603.22883 2026-03-27 cs.CV

Group Editing: Edit Multiple Images in One Go

Yue Ma, Xinyu Wang, Qianli Ma, Qinghe Wang, Mingzhe Zheng, Xiangpeng Yang, Hao Li, Chongbo Zhao, Jixuan Ying, Harry Yang, Hongyu Liu, Qifeng Chen

Comments Accepted by CVPR 2026, Project page: https://group-editing.github.io/, Github: https://github.com/mayuelala/GroupEditing

2603.22459 2026-03-27 cs.CL cs.AI

LLM-guided headline rewriting for clickability enhancement without clickbait

Yehudit Aperstein, Linoy Halifa, Sagiv Bar, Alexander Apartsin

Comments 14 pages, 4 figures

详情

英文摘要

Enhancing reader engagement while preserving informational fidelity is a central challenge in controllable text generation for news media. Optimizing news headlines for reader engagement is often conflated with clickbait, resulting in exaggerated or misleading phrasing that undermines editorial trust. We frame clickbait not as a separate stylistic category, but as an extreme outcome of disproportionate amplification of otherwise legitimate engagement cues. Based on this view, we formulate headline rewriting as a controllable generation problem, where specific engagement-oriented linguistic attributes are selectively strengthened under explicit constraints on semantic faithfulness and proportional emphasis. We present a guided headline rewriting framework built on a large language model (LLM) that uses the Future Discriminators for Generation (FUDGE) paradigm for inference-time control. The LLM is steered by two auxiliary guide models: (1) a clickbait scoring model that provides negative guidance to suppress excessive stylistic amplification, and (2) an engagement-attribute model that provides positive guidance aligned with target clickability objectives. Both guides are trained on neutral headlines drawn from a curated real-world news corpus. At the same time, clickbait variants are generated synthetically by rewriting these original headlines using an LLM under controlled activation of predefined engagement tactics. By adjusting guidance weights at inference time, the system generates headlines along a continuum from neutral paraphrases to more engaging yet editorially acceptable formulations. The proposed framework provides a principled approach for studying the trade-off between attractiveness, semantic preservation, and clickbait avoidance, and supports responsible LLM-based headline optimization in journalistic settings.

URL PDF HTML ☆

赞 0 踩 0

2603.22384 2026-03-27 cs.LG cs.AI

Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure

Davide Di Gioia

2603.22120 2026-03-27 cs.CV

StreamingClaw Technical Report

Jiawei Chen, Zhe Chen, Chaoqun Du, Maokui He, Wei He, Hengtao Li, Qizhen Li, Zide Liu, Hao Ma, Xuhao Pan, Chang Ren, Xudong Rao, Xintian Shen, Chenfeng Wang, Tao Wei, Chengjun Yu, Pengfei Yu, Shengyu Yao, Chunpeng Zhou, Kun Zhan, Lihao Zheng, Pan Zhou, Xuhan Zhu, Yufei Zheng

Comments Under Progress