arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.05898 2026-04-08 cs.CV

Physics-Aware Video Instance Removal Benchmark

Zirui Li, Xinghao Chen, Lingyu Jiang, Dengzhe Hou, Fangzhou Lin, Kazunori Yamada, Xiangbo Gao, Zhengzhong Tu

详情

英文摘要

Video Instance Removal (VIR) requires removing target objects while maintaining background integrity and physical consistency, such as specular reflections and illumination interactions. Despite advancements in text-guided editing, current benchmarks primarily assess visual plausibility, often overlooking the physical causalities, such as lingering shadows, triggered by object removal. We introduce the Physics-Aware Video Instance Removal (PVIR) benchmark, featuring 95 high-quality videos annotated with instance-accurate masks and removal prompts. PVIR is partitioned into Simple and Hard subsets, the latter explicitly targeting complex physical interactions. We evaluate four representative methods, PISCO-Removal, UniVideo, DiffuEraser, and CoCoCo, using a decoupled human evaluation protocol across three dimensions to isolate semantic, visual, and spatial failures: instruction following, rendering quality, and edit exclusivity. Our results show that PISCO-Removal and UniVideo achieve state-of-the-art performance, while DiffuEraser frequently introduces blurring artifacts and CoCoCo struggles significantly with instruction following. The persistent performance drop on the Hard subset highlights the ongoing challenge of recovering complex physical side effects.

URL PDF HTML ☆

赞 0 踩 0

2604.05887 2026-04-08 cs.AI

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Bowen Zeng, Feiyang Ren, Jun Zhang, Xiaoling Gu, Ke Chen, Lidan Shou, Huan Li

2604.05877 2026-04-08 cs.CV cs.AI

Automatic dental superimposition of 3D intraorals and 2D photographs for human identification

Antonio D. Villegas-Yeguas, Xavier Abreau-Freire, Guillermo R-García, Andrea Valsecchi, Teresa Pinho, Daniel Pérez-Mongiovi, Oscar Ibáñez, Oscar Cordón

Comments 10 pages, 9 figures, 3 tables

2604.05876 2026-04-08 cs.CL

Mechanistic Circuit-Based Knowledge Editing in Large Language Models

Tianyi Zhao, Yinhan He, Wendy Zheng, Chen Chen

2604.05875 2026-04-08 cs.AI

Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models

Yinan Liu, Dongying Lin, Sigang Luo, Xiaochun Yang, Bin Wang

Comments 20 pages, 11 figures

2604.05868 2026-04-08 cs.CL

Understanding Performance Gap Between Parallel and Sequential Sampling in Large Reasoning Models

Xiangming Gu, Soham De, Larisa Markeeva, Petar Veličković, Razvan Pascanu

Comments Under review

2604.05865 2026-04-08 cs.AI cs.PL

JTON: A Token-Efficient JSON Superset with Zen Grid Tabular Encoding for Large Language Models

Gowthamkumar Nandakishore

Comments 20 pages, 13 figures, 14 tables. Code and test suite available at https://github.com/gowthamkumar-nandakishore/JTON

2604.05863 2026-04-08 cs.CL

LoRM: Learning the Language of Rotating Machinery for Self-Supervised Condition Monitoring

Xiao Qin, Xingyi Song, Tong Liu, Hatim Laalej, Zepeng Liu, Yunpeng Zhu, Ligang He

2604.05857 2026-04-08 cs.LG

Weight-Informed Self-Explaining Clustering for Mixed-Type Tabular Data

Lehao Li, Qiang Huang, Yihao Ang, Bryan Kian Hsiang Low, Anthony K. H. Tung, Xiaokui Xiao

2604.05856 2026-04-08 cs.CV cs.AI cs.LG cs.NE

Neural Network Pruning via QUBO Optimization

Osama Orabi, Artur Zagitov, Hadi Salloum, Viktor A. Lobachev, Kasymkhan Khubiev, Yaroslav Kholodov

Comments 13 pages, 5 figures, 4 tables

2604.05854 2026-04-08 cs.AI

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring

Xiangyue Zhang

2604.05848 2026-04-08 cs.CL cs.AI

Evaluating Learner Representations for Differentiation Prior to Instructional Outcomes

Junsoo Park, Youssef Medhat, Htet Phyo Wai, Ploy Thajchayapong, Ashok K. Goel

Comments Accepted to AIED 2026

2604.05846 2026-04-08 cs.CL

AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning

Yuanfu Sun, Kang Li, Dongzhe Fan, Jiajin Liu, Qiaoyu Tan

Comments ACL 2026 Main Conference

2604.05844 2026-04-08 cs.LG q-bio.QM

Modeling Patient Care Trajectories with Transformer Hawkes Processes

Saumya Pandey, Varun Chandola

2604.05843 2026-04-08 cs.LG cs.AI

EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding

Panagiotis Andrikopoulos, Siamak Mehrkanoon

Comments 6 pages, 4 figs

2604.05842 2026-04-08 cs.LG cs.IT math.IT stat.ML

Expectation Maximization (EM) Converges for General Agnostic Mixtures

Avishek Ghosh

Comments Accepted at IEEE International Symposium on Information Theory (ISIT 2026)

2604.05839 2026-04-08 cs.AI

Vision-Guided Iterative Refinement for Frontend Code Generation

Hannah Sansford, Derek H. C. Law, Wei Liu, Abhishek Tripathi, Niresh Agarwal, Gerrit J. J. van den Burg

Comments Accepted at ICLR 2026 Workshop on AI with Recursive Self-Improvement

2604.04917 2026-04-08 cs.CV cs.AI cs.CL

Vero: An Open RL Recipe for General Visual Reasoning

Gabriel Sarch, Linrong Cai, Qunzhong Wang, Haoyang Wu, Danqi Chen, Zhuang Liu

Comments Project page: https://vero-reasoning.github.io/

2604.04756 2026-04-08 cs.LG cs.CL

Darkness Visible: Reading the Exception Handler of a Language Model

Peter Balogh

2604.04403 2026-04-08 cs.AI

MolDA: Molecular Understanding and Generation via Large Language Diffusion Model

Seohyeon Shin, HanJun Choi, Jun-Hyung Park, Hong Kook Kim, Mansu Kim

2604.04192 2026-04-08 cs.CV cs.AI cs.LG

Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks

Adrienne Deganutti, Elad Hirsch, Haonan Zhu, Jaejung Seol, Purvanshi Mehta

2604.03541 2026-04-08 cs.LG stat.ML

Choosing the Right Regularizer for Applied ML: Simulation Benchmarks of Popular Scikit-learn Regularization Frameworks

Benjamin S. Knight, Ahsaas Bajaj

2604.02779 2026-04-08 cs.RO

Vision-Based End-to-End Learning for UAV Traversal of Irregular Gaps via Differentiable Simulation

Linzuo Zhang, Yu Hu, Feng Yu, Yang Deng, Wenxian Yu, Danping Zou

2604.02601 2026-04-08 cs.LG math.DS

WGFINNs: Weak formulation-based GENERIC formalism informed neural networks

Jun Sur Richard Park, Auroni Huque Hashim, Siu Wun Cheung, Youngsoo Choi, Yeonjong Shin

2604.02320 2026-04-08 cs.CV cs.GR

Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining

Junxuan Li, Rawal Khirodkar, Chengan He, Zhongshi Jiang, Giljoo Nam, Lingchen Yang, Jihyun Lee, Egor Zakharov, Zhaoen Su, Rinat Abdrashitov, Yuan Dong, Julieta Martinez, Kai Li, Qingyang Tan, Takaaki Shiratori, Matthew Hu, Peihong Guo, Xuhua Huang, Ariyan Zarei, Marco Pesavento, Yichen Xu, He Wen, Teng Deng, Wyatt Borsos, Anjali Thakrar, Jean-Charles Bazin, Carsten Stoll, Ginés Hidalgo, James Booth, Lucy Wang, Xiaowen Ma, Yu Rong, Sairanjith Thalanki, Chen Cao, Christian Häne, Abhishek Kar, Sofien Bouaziz, Jason Saragih, Yaser Sheikh, Shunsuke Saito

Comments Accepted in CVPR2026. Website: https://junxuan-li.github.io/lca

2604.01328 2026-04-08 cs.LG

Efficient and Principled Scientific Discovery through Bayesian Optimization: A Tutorial

Zhongwei Yu, Rasul Tutunov, Alexandre Max Maraval, Zikai Xie, Zhenzhi Tan, Jiankang Wang, Bin Cao, Zijing Li, Liangliang Xu, Qi Yang, Jun Jiang, Sanzhong Luo, Zhenxiao Guo, Tongyi Zhang, Haitham Bou-Ammar, Jun Wang

2604.01044 2026-04-08 cs.CV

A global dataset of continuous urban dashcam driving

Md Shadab Alam, Olena Bazilinska, Pavlo Bazilinskyy

2603.27874 2026-04-08 cs.LG math.OC

Stability and Sensitivity Analysis of Relative Temporal-Difference Learning: Extended Version

Masoud S. Sakha, Rushikesh Kamalapurkar, Sean Meyn

Comments Extended version of manuscript submitted to the 2026 IEEE CDC, March 31 2026

2603.23202 2026-04-08 cs.CV

Gaze-Regularized Vision-Language-Action Models for Robotic Manipulation

Anupam Pani, Yanchao Yang

2603.22844 2026-04-08 cs.AI

PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal

Zining Fang, Cheng Xue, Chunhui Liu, Bin Xu, Ming Chen, Xiaowei Hu

Comments 12 pages,7figures,published to CVPR