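arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.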

2603.19486 2026-03-23 cs.LG

Any-Subgroup Equivariant Networks via Symmetry Breaking

Abhinav Goel, Derek Lim, Hannah Lawrence, Stefanie Jegelka, Ningyuan Huang

Comments Accepted at ICLR 2026

English abstract

The inclusion of symmetries as an inductive bias, known as equivariance, often improves generalization on geometric data (e.g., grids, sets, and graphs). However, equivariant architectures are usually highly constrained, designed for symmetries chosen a priori, and not applicable to datasets with other symmetries. This precludes the development of flexible, multi-modal foundation models capable of processing diverse data equivariantly. In this work, we build a single model, the Any-Subgroup Equivariant Network (ASEN), that can be simultaneously equivariant to several groups, simply by modulating a certain auxiliary input feature. In particular, we start with a fully permutation-equivariant base model, and then obtain subgroup equivariance by using a symmetry-breaking input whose automorphism group is that subgroup. However, finding an input with the desired automorphism group is computationally hard. We overcome this by relaxing from exact to approximate symmetry breaking, leveraging the notion of 2-closure to derive fast algorithms. Theoretically, we show that our subgroup-equivariant networks can simulate equivariant MLPs, and their universality can be guaranteed if the base model is universal. Empirically, we validate our method on symmetry selection for graph and image tasks, as well as multitask and transfer learning for sequence tasks, showing that a single network equivariant to multiple permutation subgroups outperforms both separate equivariant models and a single non-equivariant model.
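
The core idea in the abstract (a permutation-equivariant base model plus an auxiliary input whose automorphism group is the target subgroup) can be illustrated with a toy sketch. This is not the authors' ASEN implementation: the DeepSets-style layer, the per-element `tag`, and every name and weight below are assumptions chosen for illustration. A per-element tag can only single out subgroups that permute elements within blocks of equal tag value, so it covers just a simple special case.

```python
import math

def equivariant_layer(rows, w_self, w_mean):
    """Toy permutation-equivariant layer: out_i = tanh(w_self.x_i + w_mean.mean(x))."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[k] for r in rows) / n for k in range(d)]
    pooled = sum(w * m for w, m in zip(w_mean, mean))
    return [math.tanh(sum(w * v for w, v in zip(w_self, r)) + pooled) for r in rows]

def model(x, tag, w_self, w_mean):
    # Append the symmetry-breaking tag to each element's features (hypothetical design).
    rows = [xi + [ti] for xi, ti in zip(x, tag)]
    return equivariant_layer(rows, w_self, w_mean)

def permute(seq, perm):
    # Position i of the result holds seq[perm[i]].
    return [seq[p] for p in perm]

# Fixed illustrative data and weights (two input features plus one tag feature).
X = [[0.1, 0.4], [-0.3, 0.2], [0.5, -0.6], [0.9, 0.0]]
W_SELF = [0.5, -0.3, 0.7]
W_MEAN = [0.2, 0.1, -0.4]

def is_equivariant(perm, tag):
    """Check model(perm . x, tag) == perm . model(x, tag) with the tag held fixed."""
    lhs = model(permute(X, perm), tag, W_SELF, W_MEAN)
    rhs = permute(model(X, tag, W_SELF, W_MEAN), perm)
    return all(abs(a - b) < 1e-9 for a, b in zip(lhs, rhs))

BLOCK_TAG = [0.0, 0.0, 1.0, 1.0]    # preserved only by permutations within {0,1} and {2,3}
UNIFORM_TAG = [0.0, 0.0, 0.0, 0.0]  # preserved by every permutation

print(is_equivariant([1, 0, 3, 2], BLOCK_TAG))    # permutation preserves the tag
print(is_equivariant([2, 1, 0, 3], BLOCK_TAG))    # permutation changes the tag
print(is_equivariant([2, 1, 0, 3], UNIFORM_TAG))  # no symmetry breaking
```

With the same weights, changing only the tag changes which permutations the model respects. This mirrors the abstract's claim that one network can be equivariant to several groups by modulating an auxiliary input.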

2603.19481 2026-03-23 cs.CV

Narrative Aligned Long Form Video Question Answering

Rahul Jain, Keval Doshi, Burak Uzkent, Garin Kessler

2603.19477 2026-03-23 cs.RO

Real-Time Optical Communication Using Event-Based Vision with Moving Transmitters

Harmeet Dhillon, Pranay Katyal, Brendan Long, Rohan Walia, Matthew Cleaveland, Kevin Leahy

Comments 8 pages, 7 Figures, Submitted to IROS 2026 - Under Review

2603.19474 2026-03-23 cs.LG cs.AI

TRACE: Trajectory Recovery with State Propagation Diffusion for Urban Mobility

Jinming Wang, Hai Wang, Hongkai Wen, Geyong Min, Man Luo

Comments This article is accepted by WWW 2026, Dubai, United Arab Emirates

2603.19468 2026-03-23 cs.SD eess.AS

Listen First, Then Answer: Timestamp-Grounded Speech Reasoning

Jihoon Jeong, Pooneh Mousavi, Mirco Ravanelli, Cem Subakan

Comments Submitted to Interspeech 2026

2603.19466 2026-03-23 cs.CV

ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models

Thomas De Min, Subhankar Roy, Stéphane Lathuilière, Elisa Ricci, Massimiliano Mancini

2603.19464 2026-03-23 cs.RO

Can LLMs Prove Robotic Path Planning Optimality? A Benchmark for Research-Level Algorithm Verification

Zhengbang Yang, Md. Tasin Tazwar, Minghan Wei, Zhuangdi Zhu

2603.19463 2026-03-23 cs.LG cs.NA math.AP math.NA math.OC math.PR

Deep Hilbert-Galerkin Methods for Infinite-Dimensional PDEs and Optimal Control

Samuel N. Cohen, Filippo de Feo, Jackson Hebner, Justin Sirignano

2603.19461 2026-03-23 cs.AI

Hyperagents

Jenny Zhang, Bingchen Zhao, Wannan Yang, Jakob Foerster, Jeff Clune, Minqi Jiang, Sam Devlin, Tatiana Shavrina

Comments Code at https://github.com/facebookresearch/Hyperagents

English abstract

Self-improving AI systems aim to reduce reliance on human engineering by learning to improve their own learning and problem-solving processes. Existing approaches to self-improvement rely on fixed, handcrafted meta-level mechanisms, fundamentally limiting how fast such systems can improve. The Darwin Gödel Machine (DGM) demonstrates open-ended self-improvement in coding by repeatedly generating and evaluating self-modified variants. Because both evaluation and self-modification are coding tasks, gains in coding ability can translate into gains in self-improvement ability. However, this alignment does not generally hold beyond coding domains. We introduce hyperagents, self-referential agents that integrate a task agent (which solves the target task) and a meta agent (which modifies itself and the task agent) into a single editable program. Crucially, the meta-level modification procedure is itself editable, enabling metacognitive self-modification, improving not only the task-solving behavior, but also the mechanism that generates future improvements. We instantiate this framework by extending DGM to create DGM-Hyperagents (DGM-H), eliminating the assumption of domain-specific alignment between task performance and self-modification skill to potentially support self-accelerating progress on any computable task. Across diverse domains, the DGM-H improves performance over time and outperforms baselines without self-improvement or open-ended exploration, as well as prior self-improving systems. Furthermore, the DGM-H improves the process by which it generates new agents (e.g., persistent memory, performance tracking), and these meta-level improvements transfer across domains and accumulate across runs. DGM-Hyperagents offer a glimpse of open-ended AI systems that do not merely search for better solutions, but continually improve their search for how to improve.

2603.19460 2026-03-23 cs.LG cs.CG

GeoLAN: Geometric Learning of Latent Explanatory Directions in Large Language Models

Tianyu Bell Pan, Damon L. Woodard

2603.19456 2026-03-23 cs.CV

In-the-Wild Camouflage Attack on Vehicle Detectors through Controllable Image Editing

Xiao Fang, Yiming Gong, Stanislav Panev, Celso de Melo, Shuowen Hu, Shayok Chakraborty, Fernando De la Torre

Comments 45 pages, 35 figures

2603.19429 2026-03-23 cs.AI cs.LO cs.SC

When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)

João Filipe, Gregor Behnke

2603.19427 2026-03-23 cs.CL cs.AI cs.LG

Vocabulary shapes cross-lingual variation of word-order learnability in language models

Jonas Mayer Martins, Jaap Jumelet, Viola Priesemann, Lisa Beinborn

Comments Submitted to ACL 2026. 17 pages, 11 figures

2603.19426 2026-03-23 cs.CL cs.AI

Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure

Viliana Devbunova

Comments 10 pages, 5 tables, 2 figures. Accepted at ICLR 2026 Workshop "I Can't Believe It's Not Better"

2603.19424 2026-03-23 cs.RO

A Closed-Form CLF-CBF Controller for Whole-Body Continuum Soft Robot Collision Avoidance

Kiwan Wong, Maximillian Stölzle, Wei Xiao, Daniela Rus

2603.19418 2026-03-23 cs.RO cs.DC

Speculative Policy Orchestration: A Latency-Resilient Framework for Cloud-Robotic Manipulation

Chanh Nguyen, Shutong Jin, Florian T. Pokorny, Erik Elmroth

Comments 9 pages, 7 figures, conference submission

2603.19384 2026-03-23 cs.RO

SOFTMAP: Sim2Real Soft Robot Forward Modeling via Topological Mesh Alignment and Physics Prior

Ziyong Ma, Uksang Yoo, Jonathan Francis, Weiming Zhi, Jeffrey Ichnowski, Jean Oh

2603.19371 2026-03-23 cs.CV

Factored Levenberg-Marquardt for Diffeomorphic Image Registration: An efficient optimizer for FireANTs

Rohit Jena, Pratik Chaudhari, James C. Gee

2603.19370 2026-03-23 cs.RO

VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models

Zirui Ge, Pengxiang Ding, Baohua Yin, Qishen Wang, Zhiyong Xie, Yemin Wang, Jinbo Wang, Hengtao Li, Runze Suo, Wenxuan Song, Han Zhao, Shangke Lyu, Zhaoxin Fan, Haoang Li, Ran Cheng, Cheng Chi, Huibin Ge, Yaozhi Luo, Donglin Wang

2603.19364 2026-03-23 cs.CV

AURORA: Adaptive Unified Representation for Robust Ultrasound Analysis

Ufaq Khan, L. D. M. S. Sai Teja, Ayuba Shakiru, Mai A. Shaaban, Yutong Xie, Muhammad Bilal, Muhammad Haris Khan

2603.19360 2026-03-23 cs.LG

Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation

Minyoung Kim

2603.19349 2026-03-23 cs.LG cs.IT econ.TH math.IT

A Mathematical Theory of Understanding

Bahar Taşkesen

2603.19348 2026-03-23 cs.LG cs.CL

Anatomical Heterogeneity in Transformer Language Models

Tomasz Wietrzykowski

Comments 11 pages, 10 tables. Independent research. Code available at https://github.com/tomaszwi66

2603.19344 2026-03-23 cs.LG cs.AI

Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons

Berke Deniz Bozyigit

Comments 7 pages, 2 tables

2603.19338 2026-03-23 cs.LG

DAPA: Distribution Aware Piecewise Activation Functions for On-Device Transformer Inference and Training

Maoyang Xiang, Bo Wang

2603.19337 2026-03-23 cs.CV cs.AI

Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity

Jing Liu, Zhengliang Guo, Yan Wang, Xiaoguang Zhu, Yao Du, Zehua Wang, Victor C. M. Leung

Comments Accepted by IEEE ICME 2026

2603.19335 2026-03-23 cs.LG cs.AI

Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions

Xiaoyi Li

2603.19331 2026-03-23 cs.LG stat.ML

FalconBC: Flow matching for Amortized inference of Latent-CONditioned physiologic Boundary Conditions

Chloe H. Choi, Alison L. Marsden, Daniele E. Schiavazzi

2603.19325 2026-03-23 cs.LG cs.AI

Target Concept Tuning Improves Extreme Weather Forecasting

Shijie Ren, Xinyue Gu, Ziheng Peng, Haifan Zhang, Peisong Niu, Bo Wu, Xiting Wang, Liang Sun, Jirong Wen

2603.19322 2026-03-23 cs.LG cs.AI cs.IT math.IT

A General Deep Learning Framework for Wireless Resource Allocation under Discrete Constraints

Yikun Wang, Yang Li, Yik-Chung Wu, Rui Zhang