arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.02331 2026-04-03 cs.CV

EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors

Luca Bartolomei, Fabio Tosi, Matteo Poggi, Stefano Mattoccia, Guillermo Gallego

Comments CVPR 2026. Project Page: https://bartn8.github.io/eventhub/ Code: https://github.com/bartn8/eventhub

2604.02330 2026-04-03 cs.CV cs.AI cs.LG

ActionParty: Multi-Subject Action Binding in Generative Video Games

Alexander Pondaven, Ziyi Wu, Igor Gilitschenski, Philip Torr, Sergey Tulyakov, Fabio Pizzati, Aliaksandr Siarohin

Comments Project page: https://action-party.github.io/

2604.02329 2026-04-03 cs.CV

Generative World Renderer

Zheng-Hui Huang, Zhixiang Wang, Jiaming Tan, Ruihan Yu, Yidan Zhang, Bo Zheng, Yu-Lun Liu, Yung-Yu Chuang, Kaipeng Zhang

Comments Project page: https://alaya-studio.github.io/renderer/

2604.02328 2026-04-03 cs.CV

Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection

Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

Comments Accepted at CVPR Findings 2026

2604.02327 2026-04-03 cs.CV cs.AI

Steerable Visual Representations

Jona Ruthardt, Manu Gaur, Deva Ramanan, Makarand Tapaswi, Yuki M. Asano

Comments preprint

2604.02324 2026-04-03 cs.CL cs.AI cs.LG

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Daiwei Chen, Zhoutong Fu, Chengming Jiang, Haichao Zhang, Ran Zhou, Tan Wang, Chunnan Yao, Guoyao Li, Rui Cai, Yihan Cao, Ruijie Jiang, Fedor Borisyuk, Jianqiang Shen, Jingwei Wu, Ramya Korlakai Vinayak

2604.02323 2026-04-03 cs.CV

Beyond Referring Expressions: Scenario Comprehension Visual Grounding

Ruozhen He, Nisarg A. Shah, Qihua Dong, Zilin Xiao, Jaywon Koo, Vicente Ordonez

Comments 20 pages, 18 figures, Project Page: https://catherine-r-he.github.io/RSC/

2604.02322 2026-04-03 cs.LG cs.AI cs.CL

Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning

Bangji Yang, Hongbo Ma, Jiajun Fan, Ge Liu

Comments 43 pages, 5 figures, 24 tables

详情

英文摘要

Large Language Models employing Chain-of-Thought reasoning achieve strong performance but suffer from excessive token consumption that inflates inference costs. Existing efficiency methods such as explicit length penalties, difficulty estimators, or multi-stage curricula either degrade reasoning quality or require complex training pipelines. We introduce Batched Contextual Reinforcement, a minimalist, single-stage training paradigm that unlocks efficient reasoning through a simple structural modification: training the model to solve N problems simultaneously within a shared context window, rewarded purely by per-instance accuracy. This formulation creates an implicit token budget that yields several key findings: (1) We identify a novel task-scaling law: as the number of concurrent problems N increases during inference, per-problem token usage decreases monotonically while accuracy degrades far more gracefully than baselines, establishing N as a controllable throughput dimension. (2) BCR challenges the traditional accuracy-efficiency trade-off by demonstrating a "free lunch" phenomenon at standard single-problem inference. Across both 1.5B and 4B model families, BCR reduces token usage by 15.8% to 62.6% while consistently maintaining or improving accuracy across five major mathematical benchmarks. (3) Qualitative analyses reveal emergent self-regulated efficiency, where models autonomously eliminate redundant metacognitive loops without explicit length supervision. (4) Crucially, we empirically demonstrate that implicit budget constraints successfully circumvent the adversarial gradients and catastrophic optimization collapse inherent to explicit length penalties, offering a highly stable, constraint-based alternative for length control. These results prove BCR practical, showing simple structural incentives unlock latent high-density reasoning in LLMs.

URL PDF HTML ☆

赞 0 踩 0

2604.02318 2026-04-03 cs.RO cs.CV

Stop Wandering: Efficient Vision-Language Navigation via Metacognitive Reasoning

Xueying Li, Feng Lyu, Hao Wu, Mingliu Liu, Jia-Nan Liu, Guozi Liu

Comments 10 pages, 6 figures

2604.02317 2026-04-03 cs.CV

A Simple Baseline for Streaming Video Understanding

Yujiao Shen, Shulin Tian, Jingkang Yang, Ziwei Liu

Comments Project page: https://simple-stream.github.io/

2604.02309 2026-04-03 cs.LG cs.CL

go-$m$HC: Direct Parameterization of Manifold-Constrained Hyper-Connections via Generalized Orthostochastic Matrices

Torque Dandachi, Sophia Diggs-Galligan

Comments 29 pages, 30 figures, 9 tables. Includes supplementary material

2604.02308 2026-04-03 math.NA cs.NA

A Positivity-Preserving Relaxation Algorithm

Thomas Izgin, Hendrik Ranocha, Chi-Wang Shu

Comments 40 pages, 6 figures

2604.02303 2026-04-03 cs.DM

Trapping and commutative Boolean networks

Maximilien Gadouleau

Comments arXiv admin note: substantial text overlap with arXiv:2404.03553

2604.02299 2026-04-03 cs.CR

PARD-SSM: Probabilistic Cyber-Attack Regime Detection via Variational Switching State-Space Models

Prakul Sunil Hiremath, PeerAhammad M Bagawan, Sahil Bhekane

Comments 18 pages, 3 figures, 3 tables, code available on GitHub

2604.02296 2026-04-03 cs.CV cs.AI

VOID: Video Object and Interaction Deletion

Saman Motamed, William Harvey, Benjamin Klein, Luc Van Gool, Zhuoning Yuan, Ta-Ying Cheng

2604.02292 2026-04-03 cs.LG cs.AR

Taming the Exponential: A Fast Softmax Surrogate for Integer-Native Edge Inference

Dimitrios Danopoulos, Enrico Lupi, Michael Kagan, Maurizio Pierini

2604.02291 2026-04-03 cs.AR

TensorPool: A 3D-Stacked 8.4TFLOPS/4.3W Many-Core Domain-Specific Processor for AI-Native Radio Access Networks

Marco Bertuletti, Yichao Zhang, Diyou Shen, Alessandro Vanelli-Coralli, Frank K. Gürkaynak, Luca Benini

Comments 12 pages, 16 figures

2604.02290 2026-04-03 cs.CV math.OC

AdamFlow: Adam-based Wasserstein Gradient Flows for Surface Registration in Medical Imaging

Qiang Ma, Qingjie Meng, Xin Hu, Yicheng Wu, Wenjia Bai

2604.02289 2026-04-03 cs.CV cs.AI

Omni123: Exploring 3D Native Foundation Models with Limited 3D Data by Unifying Text to 2D and 3D Generation

Chongjie Ye, Cheng Cao, Chuanyu Pan, Yiming Hao, Yihao Zhi, Yuanming Hu, Xiaoguang Han

2604.02288 2026-04-03 cs.LG cs.AI

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Gengsheng Li, Tianyu Yang, Junfeng Fang, Mingyang Song, Mao Zheng, Haiyun Guo, Dan Zhang, Jinqiao Wang, Tat-Seng Chua

2604.02285 2026-04-03 cs.CC cs.DS

The Computational Complexity of Avoiding Strict Saddle Points in Constrained Optimization

Andreas Kontogiannis, Ioannis Panageas, Vasilis Pollatos

Comments Abstract shortened to meet arXiv requirements

2604.02284 2026-04-03 cs.NI

CIVIC: Cooperative Immersion Via Intelligent Credit-sharing in DRL-Powered Metaverse

Amr Aboeleneen, Mohamed Abdallah, Aiman Erbad, Amr Salem

Comments Journal submission; 19 pages; 9 figures

2604.02282 2026-04-03 cs.RO cs.CV

Deep Neural Network Based Roadwork Detection for Autonomous Driving

Sebastian Wullrich, Nicolai Steinke, Daniel Goehring

Comments 7 pages, 10 figures

2604.02280 2026-04-03 cs.AI cs.CV

Novel Memory Forgetting Techniques for Autonomous AI Agents: Balancing Relevance and Efficiency

Payal Fofadiya, Sunil Tiwari

2604.02279 2026-04-03 cs.AI cs.MA q-fin.GN q-fin.PM

The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management

Andrew Ang, Nazym Azimbayev, Andrey Kim

Comments 31 pages, 11 exhibits

2604.02278 2026-04-03 cs.SE

LLMs as Idiomatic Decompilers: Recovering High-Level Code from x86-64 Assembly for Dart

Raafat Abualazm, Ayman Abo Elhassan

Comments 5 pages, 1 figure, 3 tables. Accepted at SANER 2026 ERA Track

2604.02276 2026-04-03 cs.AI cs.CL cs.LG

De Jure: Iterative LLM Self-Refinement for Structured Extraction of Regulatory Rules

Keerat Guliani, Deepkamal Gill, David Landsman, Nima Eshraghi, Krishna Kumar, Lovedeep Gondara

2604.02275 2026-04-03 cs.IT math.IT

One-Shot Secret Sharing with Monotone Access Structures over Classical-Quantum Broadcast Channels

Truman Welling, Rémi A. Chou, Aylin Yener

Comments 18 pages. Submitted for an IEEE publication: April 2026

2604.02273 2026-04-03 eess.SY cs.SY

Selective State-Space Models for Koopman-based Data-driven Distribution System State Estimation

Bader Alabdulrazzaq, Bri-Mathias Hodge

2604.02270 2026-04-03 cs.LG cs.AI

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

Tin Hadži Veljković, Joshua Rosenthal, Ivor Lončarić, Jan-Willem van de Meent

Comments 39 pages, 13 figures. Code available at: https://github.com/joshrosie/crystalite