arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.01116 2026-04-02 cs.CV

ProTPS: Prototype-Guided Text Prompt Selection for Continual Learning

Jie Mei, Li-Leng Peng, Keith Fuller, Jenq-Neng Hwang

详情

英文摘要

For continual learning, text-prompt-based methods leverage text encoders and learnable prompts to encode semantic features for sequentially arrived classes over time. A common challenge encountered by existing works is how to learn unique text prompts, which implicitly carry semantic information of new classes, so that the semantic features of newly arrived classes do not overlap with those of trained classes, thereby mitigating the catastrophic forgetting problem. To address this challenge, we propose a novel approach Prototype-guided Text Prompt Selection (ProTPS)'' to intentionally increase the training flexibility thus encouraging the learning of unique text prompts. Specifically, our ProTPS learns class-specific vision prototypes and text prompts. Vision prototypes guide the selection and learning of text prompts for each class. We first evaluate our ProTPS in both class incremental (CI) setting and cross-datasets continual (CDC) learning setting. Because our ProTPS achieves performance close to the upper bounds, we further collect a real-world dataset with 112 marine species collected over a span of six years, named Marine112, to bring new challenges to the community. Marine112 is authentically suited for the class and domain incremental (CDI) learning setting and is under natural long-tail distribution. The results under three settings show that our ProTPS performs favorably against the recent state-of-the-art methods. The implementation code and Marine112 dataset will be released upon the acceptance of our paper.

URL PDF HTML ☆

赞 0 踩 0

2604.01113 2026-04-02 cs.CL

CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance

Haochen Liu, Weien Li, Rui Song, Zeyu Li, Chun Jason Xue, Xiao-Yang Liu, Sam Nallaperuma, Xue Liu, Ye Yuan

Comments Preprint

2604.01108 2026-04-02 cs.AI

Adversarial Moral Stress Testing of Large Language Models

Saeid Jamshidi, Foutse Khomh, Arghavan Moradi Dakhel, Amin Nikanjam, Mohammad Hamdaqa, Kawser Wazed Nafi

2604.01098 2026-04-02 cs.LG cs.AI cs.LO

Approximating Pareto Frontiers in Stochastic Multi-Objective Optimization via Hashing and Randomization

Jinzhao Li, Nan Jiang, Yexiang Xue

2604.01094 2026-04-02 cs.CL cs.AI

Temporal Dependencies in In-Context Learning: The Role of Induction Heads

Anooshka Bajaj, Deven Mahesh Mistry, Sahaj Singh Maini, Yash Aggarwal, Billy Dickson, Zoran Tiganj

2604.01083 2026-04-02 cs.SD cs.AI cs.CV

TRACE: Training-Free Partial Audio Deepfake Detection via Embedding Trajectory Analysis of Speech Foundation Models

Awais Khan, Muhammad Umar Farooq, Kutub Uddin, Khalid Malik

2604.01082 2026-04-02 cs.CV cs.GR

ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data

Yaoqin Ye, Yiteng Xu, Qin Sun, Xinge Zhu, Yujing Sun, Yuexin Ma

Comments accepted by CVPR 2026, project page: https://4dvlab.github.io/project_page/remogen/

2604.01081 2026-04-02 cs.CV cs.LG cs.RO eess.IV

ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction

Yuheng Zhang, Mengfei Duan, Kunyu Peng, Yuhang Wang, Di Wen, Danda Pani Paudel, Luc Van Gool, Kailun Yang

Comments Accepted to CVPR 2026. The source code is publicly available at https://github.com/7uHeng/ProOOD

2604.01073 2026-04-02 cs.CL cs.DL cs.IR

Narrative Fingerprints: Multi-Scale Author Identification via Novelty Curve Dynamics

Fred Zimmerman, Hilmar AI

Comments 12 pages, 6 figures, 4 tables

2604.01064 2026-04-02 cs.RO

BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control

Donghoon Baek, Sang-Hun Kim, Sehoon Ha

2604.01053 2026-04-02 cs.CV

PHASOR: Anatomy- and Phase-Consistent Volumetric Diffusion for CT Virtual Contrast Enhancement

Zilong Li, Dongyang Li, Chenglong Ma, Zhan Feng, Dakai Jin, Junping Zhang, Hao Luo, Fan Wang, Hongming Shan

2604.01043 2026-04-02 cs.CV

ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration

Fengyuan Yang, Luying Huang, Jiazhi Guan, Quanwei Yang, Dongwei Pan, Jianglin Fu, Haocheng Feng, Wei He, Kaisiyuan Wang, Hang Zhou, Angela Yao

Comments 23 pages, 7 figures

2604.01038 2026-04-02 cs.CV

Foundation Model-guided Iteratively Prompting and Pseudo-Labeling for Partially Labeled Medical Image Segmentation

Qiaochu Zhao, Wei Wei, David Horowitz, Richard Bakst, Yading Yuan

Comments 5 pages, 5 figures. Accepted for presentation at IEEE International Symposium on Biomedical Imaging (ISBI) 2026

2604.01030 2026-04-02 cs.CV

Diff3R: Feed-forward 3D Gaussian Splatting with Uncertainty-aware Differentiable Optimization

Yueh-Cheng Liu, Jozef Hladký, Matthias Nießner, Angela Dai

Comments Project page: https://liu115.github.io/diff3r, Video: https://www.youtube.com/watch?v=IxzNSAdUY70

2604.01025 2026-04-02 cs.LG cs.AI

Fast and Accurate Probing of In-Training LLMs' Downstream Performances

Zhichen Liu, Tianle Lun, Zhibin Wen, Hao An, Yulin Ou, Jianhui Xu, Hao Zhang, Wenyi Fang, Yang Zheng, Yang Xu

2604.01024 2026-04-02 cs.LG

Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

Philip Jordan, Maryam Kamgarpour

2604.01023 2026-04-02 cs.RO

Infinite-Horizon Ergodic Control via Kernel Mean Embeddings

Christian Hughes, Ian Abraham

Comments 8 pages, 11 figures

2604.01015 2026-04-02 cs.CV

Forecasting Motion in the Wild

Neerja Thakkar, Shiry Ginosar, Jacob Walker, Jitendra Malik, Joao Carreira, Carl Doersch

Comments project page: https://motion-forecasting.github.io/

2604.01010 2026-04-02 cs.CV cs.MM

PDA: Text-Augmented Defense Framework for Robust Vision-Language Models against Adversarial Image Attacks

Jingning Xu, Haochen Luo, Chen Liu

2604.01002 2026-04-02 cs.CV cs.AI cs.LG

Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding

Yiheng Wang, Lichen Zhu, Yueqian Lin, Yudong Liu, Jingyang Zhang, Hai "Helen" Li, Yiran Chen

2604.01001 2026-04-02 cs.CV cs.AI

EgoSim: Egocentric World Simulator for Embodied Interaction Generation

Jinkun Hao, Mingda Jia, Ruiyan Wang, Xihui Liu, Ran Yi, Lizhuang Ma, Jiangmiao Pang, Xudong Xu

Comments Project Page: egosimulator.github.io

2604.01000 2026-04-02 cs.LG cs.DB cs.DC

EmbedPart: Embedding-Driven Graph Partitioning for Scalable Graph Neural Network Training

Nikolai Merkel, Ruben Mayer, Volker Markl, Hans-Arno Jacobsen

2604.00997 2026-04-02 cs.CL

Uncertainty-Aware Variational Reward Factorization via Probabilistic Preference Bases for LLM Personalization

Gyuseok Lee, Wonbin Kweon, Zhenrui Yue, SeongKu Kang, Jiawei Han, Dong Wang

2604.00994 2026-04-02 cs.CL cs.AI cs.SI

Multimodal Analysis of State-Funded News Coverage of the Israel-Hamas War on YouTube Shorts

Daniel Miehling, Sandra Kuebler

2604.00985 2026-04-02 cs.CV

Maximizing T2-Only Prostate Cancer Localization from Expected Diffusion Weighted Imaging

Weixi Yi, Yipei Wang, Wen Yan, Hanyuan Zhang, Natasha Thorley, Alexander Ng, Shonit Punwani, Fernando Bianco, Mark Emberton, Veeru Kasivisvanathan, Dean C. Barratt, Shaheer U. Saeed, Yipeng Hu

2604.00983 2026-04-02 cs.CV

ACT Now: Preempting LVLM Hallucinations via Adaptive Context Integration

Bei Yan, Yuecong Min, Jie Zhang, Shiguang Shan, Xilin Chen

2604.00977 2026-04-02 cs.LG cs.AI

Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization

Ruijie Hao, Longfei Zhang, Yang Dai, Yang Ma, Xingxing Liang, Guangquan Cheng

2604.00971 2026-04-02 cs.RO

An Integrated Soft Robotic System for Measuring Vital Signs in Search and Rescue Environments

Jorge Francisco García-Samartín, Christyan Cruz Ulloa, Andrés Sánchez-Silva, Jaime del Cerro, Antonio Barrientos

2604.00969 2026-04-02 cs.CV

DLWM: Dual Latent World Models enable Holistic Gaussian-centric Pre-training in Autonomous Driving

Yiyao Zhu, Ying Xue, Haiming Zhang, Guangfeng Jiang, Wending Zhou, Xu Yan, Jiantao Gao, Yingjie Cai, Bingbing Liu, Zhen Li, Shaojie Shen

Comments Accepted by CVPR 2026

2604.00955 2026-04-02 cs.CV

Enhancing Gradient Inversion Attacks in Federated Learning via Hierarchical Feature Optimization

Hao Fang, Wenbo Yu, Bin Chen, Xuan Wang, Shu-Tao Xia, Qing Liao, Ke Xu