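arXivDaily daily academic digest: synced with the full arXiv data, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.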

2603.03283 2026-03-04 cs.CV

Utonia: Toward One Encoder for All Point Clouds

Yujia Zhang, Xiaoyang Wu, Yunhan Yang, Xianzhe Fan, Han Li, Yuechen Zhang, Zehao Huang, Naiyan Wang, Hengshuang Zhao

Comments produced by Pointcept, project page: https://pointcept.github.io/Utonia

Abstract

We dream of a future where point clouds from all domains can come together to shape a single model that benefits them all. Toward this goal, we present Utonia, a first step toward training a single self-supervised point transformer encoder across diverse domains, spanning remote sensing, outdoor LiDAR, indoor RGB-D sequences, object-centric CAD models, and point clouds lifted from RGB-only videos. Despite their distinct sensing geometries, densities, and priors, Utonia learns a consistent representation space that transfers across domains. This unification improves perception capability while revealing intriguing emergent behaviors that arise only when domains are trained jointly. Beyond perception, we observe that Utonia representations can also benefit embodied and multimodal reasoning: conditioning vision-language-action policies on Utonia features improves robotic manipulation, and integrating them into vision-language models yields gains on spatial reasoning. We hope Utonia can serve as a step toward foundation models for sparse 3D data, and support downstream applications in AR/VR, robotics, and autonomous driving.

2603.03280 2026-03-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY

How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference

Toru Lin, Shuying Deng, Zhao-Heng Yin, Pieter Abbeel, Jitendra Malik

Comments Project page can be found at https://toruowo.github.io/peel

2603.03279 2026-03-04 cs.RO cs.CV

ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation

Xialin He, Sirui Xu, Xinyao Li, Runpei Dong, Liuyu Bian, Yu-Xiong Wang, Liang-Yan Gui

Comments Project Page: https://ultra-humanoid.github.io/

2603.03278 2026-03-04 cs.RO cs.AI cs.CV

Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping

William Liang, Sam Wang, Hung-Ju Wang, Osbert Bastani, Yecheng Jason Ma, Dinesh Jayaraman

Comments International Conference on Learning Representations (ICLR), 2026. Project website and code: https://tether-research.github.io

2603.03276 2026-03-04 cs.CV

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Shengbang Tong, David Fan, John Nguyen, Ellis Brown, Gaoyue Zhou, Shengyi Qian, Boyang Zheng, Théophane Vallaeys, Junlin Han, Rob Fergus, Naila Murray, Marjan Ghazvininejad, Mike Lewis, Nicolas Ballas, Amir Bar, Michael Rabbat, Jakob Verbeek, Luke Zettlemoyer, Koustuv Sinha, Yann LeCun, Saining Xie

Comments Project website at https://beyond-llms.github.io/

2603.03275 2026-03-04 cs.LG

Learning Demographic-Conditioned Mobility Trajectories with Aggregate Supervision

Jessie Z. Li, Zhiqing Hong, Toru Shirakawa, Serina Chang

2603.03273 2026-03-04 cs.DS cs.SI

An Improved Combinatorial Algorithm for Edge-Colored Clustering in Hypergraphs

Seongjune Han, Nate Veldt

Comments Full version of paper accepted as a short paper to the ACM Web Conference 2026

2603.03271 2026-03-04 cs.DB cs.OS

Virtual-Memory Assisted Buffer Management In Tiered Memory

Yeasir Rayhan, Walid G. Aref

2603.03270 2026-03-04 cs.CR cs.LG cs.NI

Gravity Falls: A Comparative Analysis of Domain-Generation Algorithm (DGA) Detection Methods for Mobile Device Spearphishing

Adam Dorian Wong, John D. Hastings

Comments Disclaimer: The views expressed are those of the authors and do not necessarily reflect the official policy or position of the U.S. Department of Defense or the U.S. Government. References to external sites do not constitute endorsement. Cleared for release on 24 FEB 2026 (DOPSR 26-T-0771). Gravity Falls Dataset DOI: 10.5281/zenodo.17624554

2603.03267 2026-03-04 cs.CY

Policy myopia as a mechanism of gradual disempowerment in Post-AGI governance, Circa 2049

Subramanyam Sahoo

Comments Accepted at the Post-AGI Science and Society Workshop at ICLR 2026. 16 Pages and 4 Figures

2603.03265 2026-03-04 cs.CV

DuoMo: Dual Motion Diffusion for World-Space Human Reconstruction

Yufu Wang, Evonne Ng, Soyong Shin, Rawal Khirodkar, Yuan Dong, Zhaoen Su, Jinhyung Park, Kris Kitani, Alexander Richard, Fabian Prada, Michael Zollhofer

Comments CVPR 2026. Project page: https://yufu-wang.github.io/duomo/

2603.03262 2026-03-04 cs.LO

Yeo's Theorem for Locally Colored Graphs: the Path to Sequentialization in Linear Logic

Rémi Di Guardia, Olivier Laurent, Lorenzo Tortora de Falco, Lionel Vaux Auclair

Comments Preprint submitted to Logical Methods in Computer Science, 57 pages, 29 figures

2603.03259 2026-03-04 math.NA cs.LG cs.NA

Physics-informed post-processing of stabilized finite element solutions for transient convection-dominated problems

Süleyman Cengizci, Ömür Uğur, Srinivasan Natesan

2603.03258 2026-03-04 cs.AI

Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

Achyutha Menon, Magnus Saebo, Tyler Crosse, Spencer Gibson, Eyon Jang, Diogo Cruz

Comments 22 pages, 7 figures. Accepted at ICLR 2026 Lifelong Agents Workshop

2603.03252 2026-03-04 cs.AI

Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games

Mark Goadrich, Achille Morenville, Éric Piette

Comments 12 pages, 1 table, 4 figures

2603.03242 2026-03-04 cs.AI cs.CL

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

Patrick Gerard, Svitlana Volkova

Comments 27 Pages

Abstract

Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings but exclude most online communities -- particularly those without institutional backing, annotation infrastructure, or organized around sensitive topics -- where preference elicitation is costly, ethically fraught, or culturally misaligned. We observe that communities already express preferences implicitly through what content they accept, engage with, and allow to persist. We show that this acceptance behavior induces measurable geometric structure in representation space: accepted responses occupy coherent, high-density regions that reflect community-specific norms, while rejected content falls in sparser or misaligned areas. We operationalize this structure as an implicit preference signal for alignment and introduce density-guided response optimization (DGRO), a method that aligns language models to community norms without requiring explicit preference labels. Using labeled preference data, we demonstrate that local density recovers pairwise community judgments, indicating that geometric structure encodes meaningful preference signal. We then apply DGRO in annotation-scarce settings across diverse communities spanning platform, topic, and language. DGRO-aligned models consistently produce responses preferred by human annotators, domain experts, and model-based judges over supervised and prompt-based baselines. We position DGRO as a practical alignment alternative for communities where explicit preference supervision is unavailable or misaligned with situated practices, and discuss the implications and risks of learning from emergent acceptance behavior.
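The claim that local density can recover pairwise community judgments can be illustrated with a small sketch. The abstract does not say which density estimator DGRO uses, so this example assumes a k-nearest-neighbor score over embeddings of accepted content; the function names `knn_density_score` and `prefer`, the value of `k`, and the synthetic embeddings are all hypothetical and are not taken from the paper.

```python
import math
import random


def knn_density_score(candidate, accepted, k=5):
    """Score a candidate embedding by its local density among accepted embeddings.

    Assumption: density is approximated by the negative mean Euclidean distance
    to the k nearest accepted embeddings (higher score = denser region).
    """
    dists = sorted(math.dist(candidate, emb) for emb in accepted)
    nearest = dists[:k]
    return -sum(nearest) / len(nearest)


def prefer(resp_a, resp_b, accepted, k=5):
    """Return "a" or "b", whichever response sits in the denser accepted region."""
    score_a = knn_density_score(resp_a, accepted, k)
    score_b = knn_density_score(resp_b, accepted, k)
    return "a" if score_a >= score_b else "b"


# Toy data: accepted content clusters tightly around the origin in 8 dimensions.
random.seed(0)
accepted = [[random.gauss(0.0, 0.1) for _ in range(8)] for _ in range(200)]

near = [0.0] * 8  # candidate inside the accepted cluster
far = [3.0] * 8   # candidate far from anything the community accepted

choice = prefer(near, far, accepted)
```

In this toy setup the candidate inside the cluster is the one selected. A real pipeline would replace the synthetic vectors with model embeddings of community posts and would need to validate the score against labeled preference pairs, as the abstract describes.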

2603.03241 2026-03-04 cs.CV cs.AI

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Zimo Wen, Boxiu Li, Wanbo Zhang, Junxiang Lei, Xiaoyu Chen, Yijia Fan, Qi Zhang, Yujiang Wang, Lili Qiu, Bo Li, Ziwei Liu, Caihua Shan, Yifan Yang, Yifei Shen

2603.03238 2026-03-04 cs.LG cs.NA math.NA physics.comp-ph

On Geometry Regularization in Autoencoder Reduced-Order Models with Latent Neural ODE Dynamics

Mikhail Osipov

Comments 25 pages, 2 figures, 3 tables

2603.03236 2026-03-04 cs.CY

Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning

Fangzhou Yao, Sheng Chang, Weibo Gao, Qi Liu

Comments AAAI 2026

2603.03235 2026-03-04 stat.ML cs.LG stat.ME

The elbow statistic: Multiscale clustering statistical significance

Francisco J. Perez-Reche

Comments 30 pages, 3 figures, 5 tables

2603.03234 2026-03-04 cs.LG

Guiding Sparse Neural Networks with Neurobiological Principles to Elicit Biologically Plausible Representations

Patrick Inoue, Florian Röhrbein, Andreas Knoblauch

2603.03233 2026-03-04 cs.AI

AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Zihang Zeng, Jiaquan Zhang, Pengze Li, Yuan Qi, Xi Chen

2603.03231 2026-03-04 cs.GR

Quadratic-Order Geodesics on Meshes

Yue Ruan, Albert Chern, Tzu-Mao Li, Kartic Subr, Amir Vaxman

2603.03230 2026-03-04 cs.LG cs.AI

SynthCharge: An Electric Vehicle Routing Instance Generator with Feasibility Screening to Enable Learning-Based Optimization and Benchmarking

Mertcan Daysalilar, Fuat Uyguroglu, Gabriel Nicolosi, Adam Meyers

Comments This work has been submitted to the IEEE for possible publication

2603.03227 2026-03-04 cs.LG

Coalgebras for categorical deep learning: Representability and universal approximation

Dragan Mašulović

2603.03226 2026-03-04 cs.LG cs.CR

Adaptive Methods Are Preferable in High Privacy Settings: An SDE Perspective

Enea Monzio Compagnoni, Alessandro Stanghellini, Rustem Islamov, Aurelien Lucchi, Anastasiia Koloskova

Comments Accepted at ICLR 2026 (Poster)

2603.03225 2026-03-04 quant-ph cs.CR math-ph math.MP

Multiparty Quantum Key Agreement: Architectures, State-of-the-art, and Open Problems

Malik Mouaji, Saif Al-Kuwari

2603.03224 2026-03-04 cs.LG cs.AI

Stabilized Adaptive Loss and Residual-Based Collocation for Physics-Informed Neural Networks

Divyavardhan Singh, Shubham Kamble, Dimple Sonone, Kishor Upla

Comments 6 pages, 2 Figures, 4 tables

2603.03212 2026-03-04 cs.AI

NeuroSkill(tm): Proactive Real-Time Agentic System Capable of Modeling Human State of Mind

Nataliya Kosmyna, Eugene Hauptmann

Comments 36 pages, 18 figures

2603.03211 2026-03-04 math.OC cs.LG cs.NA math.NA

Shape Derivative-Informed Neural Operators with Application to Risk-Averse Shape Optimization

Xindi Gong, Dingcheng Luo, Thomas O'Leary-Roseberry, Ruanui Nicholson, Omar Ghattas

Abstract

Shape optimization under uncertainty (OUU) is computationally intensive for classical PDE-based methods due to the high cost of repeated sampling-based risk evaluation across many uncertainty realizations and varying geometries, while standard neural surrogates often fail to provide accurate and efficient sensitivities for optimization. We introduce Shape-DINO, a derivative-informed neural operator framework for learning PDE solution operators on families of varying geometries, with a particular focus on accelerating PDE-constrained shape OUU. Shape-DINOs encode geometric variability through diffeomorphic mappings to a fixed reference domain and employ a derivative-informed operator learning objective that jointly learns the PDE solution and its Fréchet derivatives with respect to design variables and uncertain parameters, enabling accurate state predictions and reliable gradients for large-scale OUU. We establish a priori error bounds linking surrogate accuracy to optimization error and prove universal approximation results for multi-input reduced basis neural operators in suitable $C^1$ norms. We demonstrate efficiency and scalability on three representative shape OUU problems, including boundary design for a Poisson equation and shape design governed by steady-state Navier-Stokes exterior flows in two and three dimensions. Across these examples, Shape-DINOs produce more reliable optimization results than operator surrogates trained without derivative information. In our examples, Shape-DINOs achieve 3-8 orders-of-magnitude speedups in state and gradient evaluations. Counting training data generation, Shape-DINOs reduce necessary PDE solves by 1-2 orders-of-magnitude compared to a strictly PDE-based approach for a single OUU problem. Moreover, Shape-DINO construction costs can be amortized across many objectives and risk measures, enabling large-scale shape OUU for complex systems.
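As a rough illustration of what a derivative-informed operator learning objective can look like, the sketch below writes a loss that penalizes errors in both the predicted state and its derivatives. The abstract does not give the exact formulation, so every symbol here is an assumption: $z$ denotes the design variables, $m$ the uncertain parameters, $u$ the PDE solution mapped to the reference domain, $u_\theta$ the neural operator surrogate, and $D_z$, $D_m$ the Fréchet derivatives with respect to $z$ and $m$. The choice of norms and the equal weighting of the three terms are also assumptions.

```latex
\mathcal{L}(\theta)
  = \mathbb{E}_{(z,m)}\Big[
      \big\| u(z,m) - u_\theta(z,m) \big\|^2
      + \big\| D_z u(z,m) - D_z u_\theta(z,m) \big\|^2
      + \big\| D_m u(z,m) - D_m u_\theta(z,m) \big\|^2
    \Big]
```

Fitting the derivative terms alongside the state term is what would let such a surrogate supply gradients to an optimizer, which is the role the abstract attributes to derivative information.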
