arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.10445 2026-03-13 cs.LG cs.CV

Unlearning the Unpromptable: Prompt-free Instance Unlearning in Diffusion Models

Kyungryeol Lee, Kyeonghyun Lee, Seongmin Hong, Byung Hyun Lee, Se Young Chun

Comments 12 pages

详情

英文摘要

Machine unlearning aims to remove specific outputs from trained models, often at the concept level, such as forgetting all occurrences of a particular celebrity or filtering content via text prompts. However, many undesired outputs, such as an individual's face or generations culturally or factually misinterpreted, cannot often be specified by text prompts. We address this underexplored setting of instance unlearning for outputs that are undesired but unpromptable, where the goal is to forget target outputs selectively while preserving the rest. To this end, we introduce an effective surrogate-based unlearning method that leverages image editing, timestep-aware weighting, and gradient surgery to guide trained diffusion models toward forgetting specific outputs. Experiments on conditional (Stable Diffusion 3) and unconditional (DDPM-CelebA) diffusion models demonstrate that our prompt-free method uniquely unlearns unpromptable outputs, such as faces and culturally inaccurate depictions, with preserved integrity, unlike prompt-based and prompt-free baselines. Our proposed method would serve as a practical hotfix for diffusion model providers to ensure privacy protection and ethical compliance.

URL PDF HTML ☆

赞 0 踩 0

2603.10365 2026-03-13 cs.CV

Geometric Autoencoder for Diffusion Models

Hangyu Liu, Jianyong Wang, Yutao Sun

Comments Code and models are publicly available at https://github.com/sii-research/GAE

2603.10354 2026-03-13 cs.CV

StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References

Boyu He, Yunfan Ye, Chang Liu, Weishang Wu, Fang Liu, Zhiping Cai

Comments 18 pages, 23 figures, Conference on Computer Vision and Pattern Recognition 2026

2603.10061 2026-03-13 cs.RO

Decision-Aware Uncertainty Evaluation of Vision-Language Model-Based Early Action Anticipation for Human-Robot Interaction

Zhaoda Du, Michael Bowman, Qiaojie Zheng, Xiaoli Zhang

2603.10005 2026-03-13 cs.CL cs.AI

SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition

Youness Dkhissi, Valentin Vielzeuf, Elys Allesiardo, Anthony Larcher

2603.10001 2026-03-13 cs.CL cs.AI cs.LG

Leveraging Wikidata for Geographically Informed Sociocultural Bias Dataset Creation: Application to Latin America

Yannis Karmim, Renato Pino, Hernan Contreras, Hernan Lira, Sebastian Cifuentes, Simon Escoffier, Luis Martí, Djamé Seddah, Valentin Barrière

2603.10000 2026-03-13 cs.CL cs.LG

Beyond the Prompt in Large Language Models: Comprehension, In-Context Learning, and Chain-of-Thought

Yuling Jiao, Yanming Lai, Huazhen Lin, Wensen Ma, Houduo Qi, Defeng Sun

2603.09982 2026-03-13 cs.CL cs.AI

AraModernBERT: Transtokenized Initialization and Long-Context Encoder Modeling for Arabic

Omar Elshehy, Omer Nacar, Abdelbasset Djamai, Muhammed Ragab, Khloud Al Jallad, Mona Abdelazim

Comments 9 pages, 1 figure. Accepted at AbjadNLP Workshop, EACL 2026

2603.09731 2026-03-13 cs.CV cs.AI cs.CL

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning

Chengjun Yu, Xuhan Zhu, Chaoqun Du, Pengfei Yu, Wei Zhai, Yang Cao, Zheng-Jun Zha

2603.09695 2026-03-13 cs.RO cs.CV

DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds

Siqi Pei, Andras Palffy, Dariu M. Gavrila

2603.09203 2026-03-13 cs.AI

Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents

Jiangming Shu, Yuxiang Zhang, Ye Ma, Xueyuan Lin, Jitao Sang

2603.09175 2026-03-13 cs.RO

STONE Dataset: A Scalable Multi-Modal Surround-View 3D Traversability Dataset for Off-Road Robot Navigation

Konyul Park, Daehun Kim, Jiyong Oh, Seunghoon Yu, Junseo Park, Jaehyun Park, Hongjae Shin, Hyungchan Cho, Jungho Kim, Jun Won Choi

Comments ICRA 2026

2603.09151 2026-03-13 cs.AI

Deep Tabular Research via Continual Experience-Driven Execution

Junnan Dong, Chuang Zhou, Zheng Yuan, Yifei Yu, Qiufeng Wang, Yinghui Li, Siyu An, Di Yin, Xing Sun, Feiyue Huang

Comments 23 pages, 6 tables, 6 figures

2603.08938 2026-03-13 cs.AI

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

Rui Liu, Tao Zhe, Dongjie Wang, Zijun Yao, Kunpeng Liu, Yanjie Fu, Huan Liu, Jian Pei

2603.08420 2026-03-13 cs.RO cs.AI cs.HC

Human-Aware Robot Behaviour in Self-Driving Labs

Satheeshkumar Veeramani, Anna Kisil, Abigail Bentley, Hatem Fakhruldeen, Gabriella Pizzuto, Andrew I. Cooper

2603.08281 2026-03-13 cs.CL cs.AI cs.CY

Evaluating LLM-Based Grant Proposal Review via Structured Perturbations

William Thorne, Joseph James, Yang Wang, Chenghua Lin, Diana Maynard

2603.08064 2026-03-13 cs.CV

Evaluating Generative Models via One-Dimensional Code Distributions

Zexi Jia, Pengcheng Luo, Yijia Zhong, Jinchao Zhang, Jie Zhou

2603.07892 2026-03-13 cs.RO

RoboRouter: Training-Free Policy Routing for Robotic Manipulation

Yiteng Chen, Zhe Cao, Hongjia Ren, Chenjie Yang, Wenbo Li, Shiyi Wang, Yemin Wang, Li Zhang, Yanming Shao, Zhenjun Zhao, Huiping Zhuang, Qingyao Wu

Comments We need to withdraw the paper as some of the reference papers are incorrect and need to be removed

2603.06605 2026-03-13 cs.LG

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

Joohyung Lee, Kwanhyung Lee, Changhun Kim, Eunho Yang

Comments ICLR 2026 Workshop on Time Series in the Age of Large Models (TSALM)

2603.05598 2026-03-13 cs.LG astro-ph.IM cs.AI physics.comp-ph

On the Value of Tokeniser Pretraining in Physics Foundation Models

Hadi Sotoudeh, Payel Mukhopadhyay, Ruben Ohana, Michael McCabe, Neil D. Lawrence, Shirley Ho, Miles Cranmer

Comments 16 pages, 4 figures. Workshop paper at ICLR 2026 AI & PDE

2603.04848 2026-03-13 cs.RO

Hyperbolic Multiview Pretraining for Robotic Manipulation

Jin Yang, Ping Wei, Yixin Chen, Nanning Zheng

Comments This paper was submitted to CVPR 2026 and was recommended for Findings, but the authors have withdrawn it and are currently adding more content to submit it elsewhere

2603.03964 2026-03-13 cs.CV cs.AI

BLOCK: An Open-Source Bi-Stage MLLM Character-to-Skin Pipeline for Minecraft

Hengquan Guo

2603.01928 2026-03-13 cs.CV

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Yuechen Luo, Fang Li, Shaoqing Xu, Yang Ji, Zehan Zhang, Bing Wang, Yuannan Shen, Jianwei Cui, Long Chen, Guang Chen, Hangjun Ye, Zhi-Xin Yang, Fuxi Wen

2603.01685 2026-03-13 cs.CV

FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters

Shitong Shao, Yufei Gu, Zeke Xie

Comments Accepted by CVPR 2026

2603.01470 2026-03-13 cs.LG stat.ML

Randomized Kriging Believer for Parallel Bayesian Optimization with Regret Bounds

Shuhei Sugiura, Ichiro Takeuchi, Shion Takeno

2603.01214 2026-03-13 cs.CL cs.LG

Reasoning Boosts Opinion Alignment in LLMs

Frédéric Berdoz, Yann Billeter, Yann Vonlanthen, Roger Wattenhofer

Comments Accepted at ICLR 2026

2603.01204 2026-03-13 cs.LG

Subliminal Signals in Preference Labels

Isotta Magistrali, Frédéric Berdoz, Sam Dauncey, Roger Wattenhofer

Comments Accepted at AITW@ICLR 2026

2602.24161 2026-03-13 cs.CV

GeoDiff4D: Geometry-Aware Diffusion for 4D Head Avatar Reconstruction

Chao Xu, Xiaochen Zhao, Xiang Deng, Jingxiang Sun, Donglin Di, Zhuo Su, Yebin Liu

Comments 17 pages

2602.23653 2026-03-13 cs.CV cs.AI

ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

Wei Luo, Yangfan Ou, Jin Deng, Zeshuai Deng, Xiquan Yan, Zhiquan Wen, Mingkui Tan

Comments 13 pages, under review

2602.23349 2026-03-13 cs.LG cs.AI

FlashOptim: Optimizers for Memory-Efficient Training

Jose Javier Gonzalez Ortiz, Abhay Gupta, Christopher Rinard, Davis Blalock

Comments Source code is available at https://github.com/databricks/flashoptim