arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.17867 2026-03-19 cs.LG cs.SY eess.SY math.OC

RHYME-XT: A Neural Operator for Spatiotemporal Control Systems

Marijn Ruiter, Miguel Aguiar, Jake Rap, Karl H. Johansson, Amritam Das

Comments 6 pages, 5 figures. Submitted to IEEE Control Systems Letters (L-CSS) and CDC 2026

2603.17863 2026-03-19 cs.LG cs.AI

Procedural Generation of Algorithm Discovery Tasks in Machine Learning

Alexander D. Goldie, Zilin Wang, Adrian Hayler, Deepak Nathani, Edan Toledo, Ken Thampiratwong, Aleksandra Kalisz, Michael Beukman, Alistair Letcher, Shashank Reddy, Clarisse Wibault, Theo Wolf, Charles O'Neill, Uljad Berdica, Nicholas Roberts, Saeed Rahmani, Hannah Erlebach, Roberta Raileanu, Shimon Whiteson, Jakob N. Foerster

2603.17855 2026-03-19 cs.LG

Physics-Aware Machine Learning for Seismic and Volcanic Signal Interpretation

William Thorossian

Comments 18 pages, 2 Tables, 1 Figure, 22 References

2603.17851 2026-03-19 cs.RO

DexViTac: Collecting Human Visuo-Tactile-Kinematic Demonstrations for Contact-Rich Dexterous Manipulation

Xitong Chen, Yifeng Pan, Min Li, Xiaotian Ding

Comments 9 pages, 9 figures.Project page: https://xitong-c.github.io/DexViTac/

2603.17850 2026-03-19 cs.RO

ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models

Zhou Fang, Jiaqi Wang, Yi Zhou, Qiongfeng Shi

2603.17845 2026-03-19 cs.CV

Revisiting foundation models for cell instance segmentation

Anwai Archit, Constantin Pape

Comments Published in MIDL 2026

2603.17841 2026-03-19 cs.CV

Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass

Chen Liyi, Wang Pengfei, Zhang Guowen, Ma Zhiyuan, Zhang Lei

Comments accepted by CVPR26

2603.17840 2026-03-19 cs.CV

Video Understanding: From Geometry and Semantics to Unified Models

Zhaochong An, Zirui Li, Mingqiao Ye, Feng Qiao, Jiaang Li, Zongwei Wu, Vishal Thengane, Chengzu Li, Lei Li, Luc Van Gool, Guolei Sun, Serge Belongie

Comments A comprehensive survey of video understanding, spanning low-level geometry, high-level semantics, and unified understanding models

2603.17838 2026-03-19 cs.CL

Event-Centric Human Value Understanding in News-Domain Texts: An Actor-Conditioned, Multi-Granularity Benchmark

Yao Wang, Xin Liu, Zhuochen Liu, Jiankang Chen, Adam Jatowt, Kyoungsook Kim, Noriko Kando, Haitao Yu

2603.17832 2026-03-19 cs.CL cs.AI cs.LG

Text-to-Stage: Spatial Layouts from Long-form Narratives

Jefferson Hernandez, Swarnadeep Saha, Chenxi Whitehouse, Sanjeel Parekh, Calvin Murdock, Yuliang Li, W. Owen Brimijoin, Vamsi Krishna Ithapu, Ishwarya Ananthabhotla

2603.17831 2026-03-19 cs.AI

RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy

Zhenhang Yuan, Shenghai Yuan, Lihua Xie

2603.17828 2026-03-19 cs.CV

TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models

Qianlong Xiang, Miao Zhang, Haoyu Zhang, Kun Wang, Junhui Hou, Liqiang Nie

Comments 16 pages, accepted by CVPR 2026

2603.17825 2026-03-19 cs.CV

Steering Video Diffusion Transformers with Massive Activations

Xianhang Cheng, Yujian Zheng, Zhenyu Xie, Tingting Liao, Hao Li

2603.17824 2026-03-19 cs.LG

Symmetry-Reduced Physics-Informed Learning of Tensegrity Dynamics

Jing Qin, Muhao Chen

2603.17823 2026-03-19 cs.LG cs.CL

Discovering Decoupled Functional Modules in Large Language Models

Yanke Yu, Jin Li, Ying Sun, Ping Li, Zhefeng Wang, Yi Zheng

Comments AAAI-26 Oral

2603.17820 2026-03-19 cs.LG

Federated Distributional Reinforcement Learning with Distributional Critic Regularization

David Millard, Cecilia Alm, Rashid Ali, Pengcheng Shi, Ali Baheri

Comments 9 pages, 4 Figures, conference

2603.17815 2026-03-19 cs.CL

Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain

Corentin Royer, Debarun Bhattacharjya, Gaetano Rossiello, Andrea Giovannini, Mennatallah El-Assady

2603.17813 2026-03-19 cs.CV

M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking

Qiangqiang Wu, Tianyu Yang, Bo Fang, Jia Wan, Matias Di Martino, Guillermo Sapiro, Antoni B. Chan

2603.17811 2026-03-19 cs.LG cs.AI

Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference

Antônio Junior Alves Caiado, Michael Hahsler

详情

英文摘要

Transformer-based language models are widely deployed for reasoning, yet their behavior under inference-time stochasticity remains underexplored. While dropout is common during training, its inference-time effects via Monte Carlo sampling lack systematic evaluation across architectures, limiting understanding of model reliability in uncertainty-aware applications. This work analyzes dropout-induced variability across 19 transformer models using MC Dropout with 100 stochastic forward passes per sample. Dropout robustness is defined as maintaining high accuracy and stable predictions under stochastic inference, measured by standard deviation of per-run accuracies. A cognitive decomposition framework disentangles performance into memory and reasoning components. Experiments span five dropout configurations yielding 95 unique evaluations on 1,000 samples. Results reveal substantial architectural variation. Smaller models demonstrate perfect prediction stability while medium-sized models exhibit notable volatility. Mid-sized models achieve the best overall performance; larger models excel at memory tasks. Critically, 53% of models suffer severe accuracy degradation under baseline MC Dropout, with task-specialized models losing up to 24 percentage points, indicating unsuitability for uncertainty quantification in these architectures. Asymmetric effects emerge: high dropout reduces memory accuracy by 27 percentage points while reasoning degrades only 1 point, suggesting memory tasks rely on stable representations that dropout disrupts. 84% of models demonstrate memory-biased performance. This provides the first comprehensive MC Dropout benchmark for transformers, revealing dropout robustness is architecture-dependent and uncorrelated with scale. The cognitive profiling framework offers actionable guidance for model selection in uncertainty-aware applications.

URL PDF HTML ☆

赞 0 踩 0

2603.17809 2026-03-19 cs.CV cs.AI

Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients

Ziwei Xiang, Fanhu Zeng, Hongjian Fang, Rui-Qi Wang, Renxing Chen, Yanan Zhu, Yi Chen, Peipei Yang, Xu-Yao Zhang

Comments Accepted by CVPR 2026 Main Conference

2603.17795 2026-03-19 cs.LG cs.AI

RangeAD: Fast On-Model Anomaly Detection

Luca Hinkamp, Simon Klüttermann, Emmanuel Müller

Comments 16 pages, 5 figures

2603.17787 2026-03-19 cs.AI cs.CL cs.MA

Governed Memory: A Production Architecture for Multi-Agent Workflows

Hamed Taheri

Comments 18 pages, 4 figures, 11 tables, 7 appendices. Code and datasets: https://github.com/personizeai/governed-memory

2603.17782 2026-03-19 cs.CV

Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime

Haiyu Yang, Sumit Sharma, Enhong Liu, Miel Hostens

2603.17781 2026-03-19 cs.AI

Facts as First Class Objects: Knowledge Objects for Persistent LLM Memory

Oliver Zahn, Simran Chana

Comments 26 pages, 7 figures

2603.17771 2026-03-19 cs.LG cs.AI

Attention Sinks Induce Gradient Sinks

Yihong Chen, Quanming Yao

Comments 10 pages, 5 figures

2603.17768 2026-03-19 cs.RO

Huddle: Parallel Shape Assembly using Decentralized, Minimalistic Robots

Khai Yi Chin, Tingwei Meng, Zhe Chen, Daniel Bassett, Yuri Ivanov

Comments 16 pages, 6 figures, submitted to DARS 2026

2603.17761 2026-03-19 cs.CV

Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs

Yuxin Liu, Fei Wang, Kun Li, Yiqi Nie, Junjie Chen, Zhangling Duan, Zhaohong Jia

2603.17753 2026-03-19 cs.CV

PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and Segmentation

Wenbin Tan, Jiawen Lin, Fangyong Wang, Yuan Xie, Yong Xie, Yachao Zhang, Yanyun Qu

2603.17750 2026-03-19 cs.LG

Towards Infinitely Long Neural Simulations: Self-Refining Neural Surrogate Models for Dynamical Systems

Qi Liu, Laure Zanna, Joan Bruna

2603.17746 2026-03-19 cs.CV

Concept-to-Pixel: Prompt-Free Universal Medical Image Segmentation

Haoyun Chen, Fenghe Tang, Wenxin Ma, Shaohua Kevin Zhou

Comments 32 pages, code is available at: https://github.com/Yundi218/Concept-to-Pixel