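arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other fields.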

2604.02331 2026-04-03 cs.CV

EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors

Luca Bartolomei, Fabio Tosi, Matteo Poggi, Stefano Mattoccia, Guillermo Gallego

Comments CVPR 2026. Project Page: https://bartn8.github.io/eventhub/ Code: https://github.com/bartn8/eventhub

2604.02330 2026-04-03 cs.CV cs.AI cs.LG

ActionParty: Multi-Subject Action Binding in Generative Video Games

Alexander Pondaven, Ziyi Wu, Igor Gilitschenski, Philip Torr, Sergey Tulyakov, Fabio Pizzati, Aliaksandr Siarohin

Comments Project page: https://action-party.github.io/

2604.02329 2026-04-03 cs.CV

Generative World Renderer

Zheng-Hui Huang, Zhixiang Wang, Jiaming Tan, Ruihan Yu, Yidan Zhang, Bo Zheng, Yu-Lun Liu, Yung-Yu Chuang, Kaipeng Zhang

Comments Project page: https://alaya-studio.github.io/renderer/

2604.02328 2026-04-03 cs.CV

Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection

Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

Comments Accepted at CVPR Findings 2026

2604.02327 2026-04-03 cs.CV cs.AI

Steerable Visual Representations

Jona Ruthardt, Manu Gaur, Deva Ramanan, Makarand Tapaswi, Yuki M. Asano

Comments preprint

2604.02324 2026-04-03 cs.CL cs.AI cs.LG

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Daiwei Chen, Zhoutong Fu, Chengming Jiang, Haichao Zhang, Ran Zhou, Tan Wang, Chunnan Yao, Guoyao Li, Rui Cai, Yihan Cao, Ruijie Jiang, Fedor Borisyuk, Jianqiang Shen, Jingwei Wu, Ramya Korlakai Vinayak

2604.02323 2026-04-03 cs.CV

Beyond Referring Expressions: Scenario Comprehension Visual Grounding

Ruozhen He, Nisarg A. Shah, Qihua Dong, Zilin Xiao, Jaywon Koo, Vicente Ordonez

Comments 20 pages, 18 figures, Project Page: https://catherine-r-he.github.io/RSC/

2604.02322 2026-04-03 cs.LG cs.AI cs.CL

Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning

Bangji Yang, Hongbo Ma, Jiajun Fan, Ge Liu

Comments 43 pages, 5 figures, 24 tables

English abstract

Large Language Models employing Chain-of-Thought reasoning achieve strong performance but suffer from excessive token consumption that inflates inference costs. Existing efficiency methods such as explicit length penalties, difficulty estimators, or multi-stage curricula either degrade reasoning quality or require complex training pipelines. We introduce Batched Contextual Reinforcement, a minimalist, single-stage training paradigm that unlocks efficient reasoning through a simple structural modification: training the model to solve N problems simultaneously within a shared context window, rewarded purely by per-instance accuracy. This formulation creates an implicit token budget that yields several key findings: (1) We identify a novel task-scaling law: as the number of concurrent problems N increases during inference, per-problem token usage decreases monotonically while accuracy degrades far more gracefully than baselines, establishing N as a controllable throughput dimension. (2) BCR challenges the traditional accuracy-efficiency trade-off by demonstrating a "free lunch" phenomenon at standard single-problem inference. Across both 1.5B and 4B model families, BCR reduces token usage by 15.8% to 62.6% while consistently maintaining or improving accuracy across five major mathematical benchmarks. (3) Qualitative analyses reveal emergent self-regulated efficiency, where models autonomously eliminate redundant metacognitive loops without explicit length supervision. (4) Crucially, we empirically demonstrate that implicit budget constraints successfully circumvent the adversarial gradients and catastrophic optimization collapse inherent to explicit length penalties, offering a highly stable, constraint-based alternative for length control. These results prove BCR practical, showing simple structural incentives unlock latent high-density reasoning in LLMs.
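The abstract describes solving N problems in one shared context and rewarding the model purely by per-instance accuracy. The sketch below illustrates that idea only; it is not the authors' implementation. The prompt template, the `Answer k:` output convention, exact-string matching, and averaging the per-instance rewards into one scalar are all assumptions made for illustration, and every function name is hypothetical.

```python
import re

# Hypothetical output convention: the model ends with lines like "Answer 2: 9".
ANSWER_RE = re.compile(r"^Answer (\d+):[ \t]*(.+?)[ \t]*$", re.MULTILINE)


def build_batched_prompt(problems):
    """Pack N problems into one shared prompt, numbered from 1 (assumed template)."""
    lines = [
        "Solve every problem below. "
        "Finish with one line per problem in the form 'Answer k: <value>'."
    ]
    for k, problem in enumerate(problems, start=1):
        lines.append(f"Problem {k}: {problem}")
    return "\n".join(lines)


def per_instance_rewards(completion, gold_answers):
    """Return a 0/1 reward per problem using exact string match (an assumption).

    A problem with no parsed answer line scores 0.
    """
    predicted = {int(k): value for k, value in ANSWER_RE.findall(completion)}
    return [
        1.0 if predicted.get(k) == gold else 0.0
        for k, gold in enumerate(gold_answers, start=1)
    ]


def batch_reward(completion, gold_answers):
    """Average the per-instance rewards into one scalar (assumed aggregation)."""
    if not gold_answers:
        raise ValueError("gold_answers must not be empty")
    rewards = per_instance_rewards(completion, gold_answers)
    return sum(rewards) / len(rewards)


if __name__ == "__main__":
    prompt = build_batched_prompt(["2 + 2 = ?", "3 * 3 = ?"])
    completion = "Short reasoning here.\nAnswer 1: 4\nAnswer 2: 10\n"
    print(prompt)
    print(per_instance_rewards(completion, ["4", "9"]))
    print(batch_reward(completion, ["4", "9"]))
```

Because the reward contains no length term, any pressure toward shorter reasoning in this setup would come only from fitting N solutions into one context window, which is the implicit token budget the abstract refers to.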

2604.02318 2026-04-03 cs.RO cs.CV

Stop Wandering: Efficient Vision-Language Navigation via Metacognitive Reasoning

Xueying Li, Feng Lyu, Hao Wu, Mingliu Liu, Jia-Nan Liu, Guozi Liu

Comments 10 pages, 6 figures

2604.02317 2026-04-03 cs.CV

A Simple Baseline for Streaming Video Understanding

Yujiao Shen, Shulin Tian, Jingkang Yang, Ziwei Liu

Comments Project page: https://simple-stream.github.io/

2604.02309 2026-04-03 cs.LG cs.CL

go-$m$HC: Direct Parameterization of Manifold-Constrained Hyper-Connections via Generalized Orthostochastic Matrices

Torque Dandachi, Sophia Diggs-Galligan

Comments 29 pages, 30 figures, 9 tables. Includes supplementary material

2604.02296 2026-04-03 cs.CV cs.AI

VOID: Video Object and Interaction Deletion

Saman Motamed, William Harvey, Benjamin Klein, Luc Van Gool, Zhuoning Yuan, Ta-Ying Cheng

2604.02292 2026-04-03 cs.LG cs.AR

Taming the Exponential: A Fast Softmax Surrogate for Integer-Native Edge Inference

Dimitrios Danopoulos, Enrico Lupi, Michael Kagan, Maurizio Pierini

2604.02290 2026-04-03 cs.CV math.OC

AdamFlow: Adam-based Wasserstein Gradient Flows for Surface Registration in Medical Imaging

Qiang Ma, Qingjie Meng, Xin Hu, Yicheng Wu, Wenjia Bai

2604.02289 2026-04-03 cs.CV cs.AI

Omni123: Exploring 3D Native Foundation Models with Limited 3D Data by Unifying Text to 2D and 3D Generation

Chongjie Ye, Cheng Cao, Chuanyu Pan, Yiming Hao, Yihao Zhi, Yuanming Hu, Xiaoguang Han

2604.02288 2026-04-03 cs.LG cs.AI

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Gengsheng Li, Tianyu Yang, Junfeng Fang, Mingyang Song, Mao Zheng, Haiyun Guo, Dan Zhang, Jinqiao Wang, Tat-Seng Chua

2604.02282 2026-04-03 cs.RO cs.CV

Deep Neural Network Based Roadwork Detection for Autonomous Driving

Sebastian Wullrich, Nicolai Steinke, Daniel Goehring

Comments 7 pages, 10 figures

2604.02280 2026-04-03 cs.AI cs.CV

Novel Memory Forgetting Techniques for Autonomous AI Agents: Balancing Relevance and Efficiency

Payal Fofadiya, Sunil Tiwari

2604.02279 2026-04-03 cs.AI cs.MA q-fin.GN q-fin.PM

The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management

Andrew Ang, Nazym Azimbayev, Andrey Kim

Comments 31 pages, 11 exhibits

2604.02276 2026-04-03 cs.AI cs.CL cs.LG

De Jure: Iterative LLM Self-Refinement for Structured Extraction of Regulatory Rules

Keerat Guliani, Deepkamal Gill, David Landsman, Nima Eshraghi, Krishna Kumar, Lovedeep Gondara

2604.02270 2026-04-03 cs.LG cs.AI

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

Tin Hadži Veljković, Joshua Rosenthal, Ivor Lončarić, Jan-Willem van de Meent

Comments 39 pages, 13 figures. Code available at: https://github.com/joshrosie/crystalite

2604.02265 2026-04-03 cs.CV

Modular Energy Steering for Safe Text-to-Image Generation with Foundation Models

Yaoteng Tan, Zikui Cai, M. Salman Asif

2604.02260 2026-04-03 cs.LG cs.RO

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija

Comments 15 pages, 5 figures, 2 tables. This work has been submitted to the IEEE for possible publication

2604.02256 2026-04-03 cs.RO cs.NA cs.SY eess.SY math.NA

A virtual-variable-length method for robust inverse kinematics of multi-segment continuum robots

Weiting Feng, Federico Renda, Yunjie Yang, Francesco Giorgio-Serchi

Comments 8 pages, 6 figures, accepted for presentation in IEEE RoboSoft 2026, Kanazawa, Japan

2604.02252 2026-04-03 cs.CV

SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation

Naomi Kombol, Ivan Martinović, Siniša Šegvić, Giorgos Tolias

Comments Accepted to CVPR 2026

2604.02250 2026-04-03 cs.LG stat.ML

Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives

Hao Zhu, Di Zhou, Donna Slonim

Comments To appear in the Proceedings of the 5th Conference on Causal Learning and Reasoning (CLeaR 2026)

2604.02236 2026-04-03 cs.AI

Do Emotions in Prompts Matter? Effects of Emotional Framing on Large Language Models

Minda Zhao, Yutong Yang, Chufei Peng, Rachel Gonsalves, Weiyue Li, Ruyi Yang, Zhixi Liu, Mengyu Wang

2604.02230 2026-04-03 cs.AI

Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs

Abinitha Gourabathina, Inkit Padhi, Manish Nagireddy, Subhajit Chaudhury, Prasanna Sattigeri

2604.02226 2026-04-03 cs.AI cs.LG

When to ASK: Uncertainty-Gated Language Assistance for Reinforcement Learning

Juarez Monteiro, Nathan Gavenski, Gianlucca Zuin, Adriano Veloso

Comments In Proceedings of International Joint Conference on Neural Networks (IJCNN)

2604.02222 2026-04-03 cs.CV

SCALE: Semantic- and Confidence-Aware Conditional Variational Autoencoder for Zero-shot Skeleton-based Action Recognition

Soroush Oraki, Feng Ding, Jie Liang

Comments Accepted to ICPR 2026