arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.10419 2026-04-06 cs.LG cs.AI

Equivariant Evidential Deep Learning for Interatomic Potentials

Zhongyao Wang, Taoyong Cui, Jiawen Zou, Shufei Zhang, Bo Yan, Wanli Ouyang, Weimin Tan, Mao Su

详情

英文摘要

Uncertainty quantification (UQ) is critical for assessing the reliability of machine learning interatomic potentials (MLIPs) in molecular dynamics (MD) simulations, identifying extrapolation regimes and enabling uncertainty-aware workflows such as active learning for training dataset construction. Existing UQ approaches for MLIPs are often limited by high computational cost or suboptimal performance. Evidential deep learning (EDL) provides a theoretically grounded single-model alternative that determines both aleatoric and epistemic uncertainty in a single forward pass. However, extending evidential formulations from scalar targets to vector-valued quantities such as atomic forces introduces substantial challenges, particularly in maintaining statistical self-consistency under rotational transformations. To address this, we propose \textit{Equivariant Evidential Deep Learning for Interatomic Potentials} ($\text{e}^2$IP), a backbone-agnostic framework that models atomic forces and their uncertainty jointly by representing uncertainty as a full $3\times3$ symmetric positive definite covariance tensor that transforms equivariantly under rotations. Experiments on diverse molecular benchmarks show that $\text{e}^2$IP provides a stronger accuracy-efficiency-reliability balance than the non-equivariant evidential baseline and the widely used ensemble method. It also achieves better data efficiency through the fully equivariant architecture while retaining single-model inference efficiency.

URL PDF HTML ☆

赞 0 踩 0

2602.06343 2026-04-06 cs.CV

Uncertainty-Aware 4D Gaussian Splatting for Monocular Occluded Human Rendering

Weiquan Wang, Feifei Shao, Lin Li, Zhen Wang, Jun Xiao, Long Chen

2602.00918 2026-04-06 cs.LG

Early Classification of Time Series in Non-Stationary Cost Regimes

Aurélien Renault, Alexis Bondu, Antoine Cornuéjols, Vincent Lemaire

2602.00683 2026-04-06 cs.CV

Video Understanding: Through A Temporal Lens

Thong Thanh Nguyen

Comments PhD Thesis, NUS, 2025

2601.23048 2026-04-06 cs.AI

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

Bowen Cao, Dongdong Zhang, Yixia Li, Junpeng Liu, Shijue Huang, Chufan Shi, Hongyuan Lu, Yaokang Wu, Guanhua Chen, Wai Lam, Furu Wei

Comments ICLR 2026

2601.21957 2026-04-06 cs.CV

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Cheng Cui, Ting Sun, Suyin Liang, Tingquan Gao, Zelun Zhang, Jiaxuan Liu, Xueqing Wang, Changda Zhou, Hongen Liu, Manhui Lin, Yue Zhang, Yubo Zhang, Yi Liu, Dianhai Yu, Yanjun Ma

2601.21064 2026-04-06 cs.LG cs.AI

Textual Equilibrium Propagation for Deep Compound AI Systems

Minghui Chen, Wenlong Deng, James Zou, Han Yu, Xiaoxiao Li

Comments Accepted to ICLR 2026

详情

英文摘要

Large language models (LLMs) are increasingly deployed as part of compound AI systems that coordinate multiple modules (e.g., retrievers, tools, verifiers) over long-horizon workflows. Recent approaches that propagate textual feedback globally (e.g., TextGrad) make it feasible to optimize such pipelines, but we find that performance degrades as system depth grows. In particular, long-horizon agentic workflows exhibit two depth-scaling failure modes: 1) exploding textual gradient, where textual feedback grows exponentially with depth, leading to prohibitively long message and amplifies evaluation biases; and 2) vanishing textual gradient, where limited long-context ability causes models overemphasize partial feedback and compression of lengthy feedback causes downstream messages to lose specificity gradually as they propagate many hops upstream. To mitigate these issues, we introduce Textual Equilibrium Propagation (TEP), a local learning principle inspired by Equilibrium Propagation in energy-based models. TEP includes two phases: 1) a free phase where a local LLM critics iteratively refine prompts until reaching equilibrium (no further improvements are suggested); and 2) a nudged phase which applies proximal prompt edits with bounded modification intensity, using task-level objectives that propagate via forward signaling rather than backward feedback chains. This design supports local prompt optimization followed by controlled adaptation toward global goals without the computational burden and signal degradation of global textual backpropagation. Across long-horizon QA benchmarks and multi-agent tool-use dataset, TEP consistently improves accuracy and efficiency over global propagation methods such as TextGrad. The gains grows with depth, while preserving the practicality of black-box LLM components in deep compound AI system.

URL PDF HTML ☆

赞 0 踩 0

2601.16933 2026-04-06 cs.CV cs.LG

Reward-Forcing: Autoregressive Video Generation with Reward Feedback

Jingran Zhang, Ning Li, Yuanhao Ban, Andrew Bai, Justin Cui

Comments https://openreview.net/forum?id=K8Qjsxxl7y&noteId=K8Qjsxxl7y

2601.16672 2026-04-06 cs.CV

ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction

Ming Li, Hui Shan, Kai Zheng, Chentao Shen, Siyu Liu, Yanwei Fu, Zhen Chen, Xiangru Huang

Comments Accepted to CVPR 2026

2601.14617 2026-04-06 cs.RO cs.SE

UniCon: A Unified System for Efficient Robot Learning Transfers

Yunfeng Lin, Li Xu, Yong Yu, Jiangmiao Pang, Weinan Zhang

Comments The article has been accepted by Frontiers of Computer Science (FCS), with the DOI: {10.1007/s11704-026-52064-1}

2601.13633 2026-04-06 cs.CV

EGM: Efficient Visual Grounding Language Models

Guanqi Zhan, Changye Li, Zhijian Liu, Yao Lu, Yi Wu, Song Han, Ligeng Zhu

2601.13518 2026-04-06 cs.AI cs.NE

AgenticRed: Evolving Agentic Systems for Red-Teaming

Jiayi Yuan, Jonathan Nöther, Natasha Jaques, Goran Radanović

Comments Website: https://yuanjiayiy.github.io/AgenticRed/

2601.13303 2026-04-06 cs.LG

On the Extreme Variance of Certified Local Robustness Across Model Seeds

Minh Le, Phuong Cao

2601.10722 2026-04-06 cs.RO cs.DC cs.SE

A Survey of Real-Time Support, Analysis, and Advancements in ROS 2

Daniel Casini, Jian-Jia Chen, Jing Li, Federico Reghenzani, Harun Teper

2601.03127 2026-04-06 cs.CV cs.AI

Unified Thinker: A General Reasoning Modular Core for Image Generation

Sashuai Zhou, Qiang Zhou, Jijin Hu, Hanqing Yang, Yue Cao, Junpeng Ma, Yinchao Ma, Jun Song, Tiezheng Ge, Cheng Yu, Bo Zheng, Zhou Zhao

2601.00290 2026-04-06 cs.AI cs.MA

ClinicalReTrial: Clinical Trial Redesign with Self-Evolving Agents

Sixue Xing, Kerui Wu, Xuanye Xia, Meng Jiang, Jintai Chen, Tianfan Fu

2512.18809 2026-04-06 cs.CV cs.AI cs.MM

FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

Ziyuan Tao, Chuanzhi Xu, Sandaru Jayawardana, Adnan Mahmood, Wei Bao, Kanchana Thilakarathna, Teng Joon Lim

2512.16383 2026-04-06 cs.LG stat.ML

Multivariate Uncertainty Quantification with Tomographic Quantile Forests

Takuya Kanazawa

Comments 36 pages. v2: matches published version

2512.16300 2026-04-06 cs.AI

Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection

Fanrui Zhang, Qiang Zhang, Sizhuo Zhou, Jianwen Sun, Chuanhao Li, Jiaxin Ai, Yukang Feng, Yujie Zhang, Wenjie Li, Zizhen Li, Yifan Chang, Jiawei Liu, Kaipeng Zhang

Comments 18 pages, 7 figures

2512.13122 2026-04-06 cs.CV cs.AI

DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass

Vivek Alumootil, Tuan-Anh Vu

Comments This is a work in progress

2512.09112 2026-04-06 cs.CV

GimbalDiffusion: Gravity-Aware Camera Control for Video Generation

Frédéric Fortier-Chouinard, Yannick Hold-Geoffroy, Valentin Deschaintre, Matheus Gadelha, Jean-François Lalonde

Comments Project page: https://lvsn.github.io/GimbalDiffusion/

2512.08980 2026-04-06 cs.CV cs.AI

Training Multi-Image Vision Agents via End2End Reinforcement Learning

Chengqi Dong, Chuhuai Yue, Hang He, Rongge Mao, Fenghe Tang, S Kevin Zhou, Zekun Xu, Xiaohan Wang, Jiajun Chai, Guojun Yin

2512.07951 2026-04-06 cs.CV

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Zekai Luo, Zongze Du, Zhouhang Zhu, Hao Zhong, Muzhi Zhu, Wen Wang, Yuling Xi, Chenchen Jing, Hao Chen, Chunhua Shen

Comments Accepted to CVPR 2026. Project webpage: https://aim-uofa.github.io/LivingSwap

2512.03537 2026-04-06 cs.LG stat.ML

Pushing the Limits of Distillation-Based Continual Learning via Classifier-Proximal Lightweight Plugins

Zhiming Xu, Baile Xu, Jian Zhao, Furao Shen, Suorong Yang

Comments 10 pages, 8 figures, 2 tables

2512.03424 2026-04-06 cs.CV

DM3D: Deformable Mamba via Offset-Guided Differentiable Scanning for Point Cloud Understanding

Bin Liu, Chunyang Wang, Xuelian Liu, Ge Zhang

2512.00961 2026-04-06 cs.LG

Goal-Driven Reward by Video Diffusion Models for Reinforcement Learning

Qi Wang, Mian Wu, Yuyang Zhang, Mingqi Yuan, Wenyao Zhang, Haoxiang You, Yunbo Wang, Xin Jin, Xiaokang Yang, Wenjun Zeng

Comments Accepted by CVPR 2026. Project page: https://qiwang067.github.io/genreward

2512.00129 2026-04-06 cs.CV cs.AI

Analysis of Invasive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation

Jayan Adhikari, Prativa Joshi, Sushish Baral

2511.23292 2026-04-06 cs.CV cs.GR

FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting

Tianhao Xie, Linlian Jiang, Xinxin Zuo, Yang Wang, Tiberiu Popa

Comments 11 pages, 6 figures, CVPR 2026 Findings track. Project page: https://tianhaoxie.github.io/project/FACT-GS/

2511.21331 2026-04-06 cs.CV cs.AI

The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

Stefanos Koutoupis, Michaela Areti Zervou, Konstantinos Kontras, Maarten De Vos, Panagiotis Tsakalides, Grigorios Tsagkatakis

Comments Accepted to CVPR 2026

2511.17722 2026-04-06 cs.CV

Can Vision-Language Models Count? A Synthetic Benchmark and Analysis of Attention-Based Interventions

Saurav Sengupta, Nazanin Moradinasab, Jiebei Liu, Donald E. Brown

Comments Accepted at COGVL Workshop at CVPR 2026