arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2511.17266 2026-04-06 cs.RO

Simulation of Active Soft Nets for Capture of Space Debris

Leone Costi, Dario Izzo

详情

英文摘要

In this work, we propose a simulator, based on the open-source physics engine MuJoCo, for the design and control of soft robotic nets for the autonomous removal of space debris. The proposed simulator includes net dynamics, contact between the net and the debris, self-contact of the net, orbital mechanics, and a controller that can actuate thrusters on the four satellites at the corners of the net. It showcases the case of capturing Envisat, a large ESA satellite that remains in orbit as space debris following the end of its mission. This work investigates different mechanical models, which can be used to simulate the net dynamics, simulating various degrees of compliance, and different control strategies to achieve the capture of the debris, depending on the relative position of the net and the target. Unlike previous works on this topic, we do not assume that the net has been previously ballistically thrown toward the target, and we start from a relatively static configuration. The results show that a more compliant net achieves higher performance when attempting the capture of Envisat. Moreover, when paired with a sliding mode controller, soft nets are able to achieve successful capture in 100% of the tested cases, whilst also showcasing a higher effective area at contact and a higher number of contact points between net and Envisat.

URL PDF HTML ☆

赞 0 踩 0

2511.17207 2026-04-06 cs.CV cs.RO

SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors

Kunyi Li, Michael Niemeyer, Sen Wang, Stefano Gasperini, Nassir Navab, Federico Tombari

2511.15279 2026-04-06 cs.RO cs.CV

Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception

Jiashu Yang, Yifan Han, Yucheng Xie, Ning Guo, Wenzhao Lian

2511.13394 2026-04-06 cs.LG stat.ML

Fast and Robust Simulation-Based Inference With Optimization Monte Carlo

Vasilis Gkolemis, Christos Diou, Michael U. Gutmann

Comments Accepted at AISTATS 2026

2511.13096 2026-04-06 cs.RO

ResAlignNet: A Data-Driven Approach for INS/DVL Alignment

Guy Damari, Itzik Klein

详情

DOI: 10.1016/j.oceaneng.2026.125277

英文摘要

Autonomous underwater vehicles rely on precise navigation systems that combine the inertial navigation system and the Doppler velocity log for successful missions in challenging environments where satellite navigation is unavailable. The effectiveness of this integration critically depends on accurate alignment between the sensor reference frames. Standard model-based alignment methods between these sensor systems suffer from lengthy convergence times, dependence on prescribed motion patterns, and reliance on external aiding sensors, significantly limiting operational flexibility. To address these limitations, this paper presents ResAlignNet, a data-driven approach using the 1D ResNet-18 architecture that transforms the alignment problem into deep neural network optimization, operating as an in-situ solution that requires only sensors on board without external positioning aids or complex vehicle maneuvers, while achieving rapid convergence in seconds. Additionally, the approach demonstrates the learning capabilities of Sim2Real transfer, enabling training in synthetic data while deploying in operational sensor measurements. Experimental validation using the Snapir autonomous underwater vehicle demonstrates that ResAlignNet achieves alignment accuracy within 0.8° using only 25 seconds of data collection, representing a 65\% reduction in convergence time compared to standard velocity-based methods. The trajectory-independent solution eliminates motion pattern requirements and enables immediate vehicle deployment without lengthy pre-mission procedures, advancing underwater navigation capabilities through robust sensor-agnostic alignment that scales across different operational scenarios and sensor specifications.

URL PDF HTML ☆

赞 0 踩 0

2511.12834 2026-04-06 cs.CV cs.AI

SAGA: Source Attribution of Generative AI Videos

Rohit Kundu, Vishal Mohanty, Hao Xiong, Shan Jia, Athula Balachandran, Amit K. Roy-Chowdhury

2511.08666 2026-04-06 cs.CV

Privacy Beyond Pixels: Latent Anonymization for Privacy-Preserving Video Understanding

Joseph Fioresi, Ishan Rajendrakumar Dave, Mubarak Shah

Comments Accepted to ICLR 2026

详情

英文摘要

We introduce a novel formulation of visual privacy preservation for video foundation models that operates entirely in the latent space. While spatio-temporal features learned by foundation models have deepened general understanding of video content, sharing or storing these extracted visual features for downstream tasks inadvertently reveals sensitive personal information like skin color, gender, or clothing. Current privacy preservation methods focus on input-pixel-level anonymization, which requires retraining the entire utility video model and results in task-specific anonymization, making them unsuitable for recent video foundational models. To address these challenges, we introduce a lightweight Anonymizing Adapter Module (AAM) that removes private information from video features while retaining general task utility. AAM can be applied in a plug-and-play fashion to frozen video encoders, minimizing the computational burden of finetuning and re-extracting features. Our framework employs three newly designed training objectives: (1) a clip-level self-supervised privacy objective to reduce mutual information between static clips, (2) a co-training objective to retain utility across seen tasks, and (3) a latent consistency loss for generalization on unseen tasks. Our extensive evaluations demonstrate a significant 35% reduction in privacy leakage while maintaining near-baseline utility performance across various downstream tasks: Action Recognition (Kinetics400, UCF101, HMDB51), Temporal Action Detection (THUMOS14), and Anomaly Detection (UCF-Crime). We also provide an analysis on anonymization for sensitive temporal attribute recognition. Additionally, we propose new protocols for assessing gender bias in action recognition models, showing that our method effectively mitigates such biases and promotes more equitable video understanding. https://joefioresi718.github.io/SPLAVU_webpage/

URL PDF HTML ☆

赞 0 踩 0

2511.02734 2026-04-06 cs.AI cs.CL

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Jiayu Liu, Cheng Qian, Zhaochen Su, Qing Zong, Shijue Huang, Bingxiang He, Yi R. Fung

2511.01770 2026-04-06 cs.RO

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

Liudi Yang, Yang Bai, Yuhao Wang, Ibrahim Alsarraj, Gitta Kutyniok, Zhanchi Wang, Ke Wu

2510.27176 2026-04-06 cs.AI cs.CL cs.DC

Glia: A Human-Inspired AI for Automated Systems Design and Optimization

Pouya Hamadanian, Pantea Karimi, Arash Nasr-Esfahany, Kimia Noorbakhsh, Joseph Chandler, Ali ParandehGheibi, Mohammad Alizadeh, Hari Balakrishnan

2510.19127 2026-04-06 cs.LG cs.AI cs.SD eess.AS

Steering Autoregressive Music Generation with Recursive Feature Machines

Daniel Zhao, Daniel Beaglehole, Taylor Berg-Kirkpatrick, Julian McAuley, Zachary Novack

2510.17569 2026-04-06 cs.LG physics.comp-ph

Towards best practices in low-dimensional semi-supervised latent Bayesian optimization for the design of antimicrobial peptides

Jyler Menard, R. A. Mansbach

Comments (Post peer review version) v3: 22 pages, 10 figures. New/clearer figures. Small title and abstract change. Edits to results to make points clearer, but no drastic changes to findings. Inclusion of preliminary comparisons to deep kernel learning

2510.17421 2026-04-06 cs.LG

Diffusion Models as Dataset Distillation Priors

Duo Su, Huyu Wu, Huanran Chen, Yiming Shi, Yuzhu Wang, Xi Ye, Jun Zhu

2510.15075 2026-04-06 cs.LG stat.ML

Physics-informed data-driven machine health monitoring for two-photon lithography

Sixian Jia, Zhiqiao Dong, Chenhui Shao

2510.10510 2026-04-06 cs.LG cs.AI

f-INE: A Hypothesis Testing Framework for Estimating Influence under Training Randomness

Subhodip Panda, Dhruv Tarsadiya, Shashwat Sourav, Prathosh A. P, Sai Praneeth Karimireddy

2510.10415 2026-04-06 cs.CL cs.AI

CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints

Federica Bologna, Tiffany Pan, Matthew Wilkens, Yue Guo, Lucy Lu Wang

2510.06649 2026-04-06 cs.LG cs.AI

Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions

Frank Wu, Mengye Ren

Comments 18 pages, 11 figures

2510.05528 2026-04-06 cs.LG

ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization

Lawrence Liu, Alexander Liu, Mengdi Wang, Tuo Zhao, Lin F. Yang

Comments ICLR 2026, code: https://github.com/LawrenceRLiu/ARMOR

2509.25438 2026-04-06 cs.LG cs.AI

Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring

Zhibo Hou, Zhiyu An, Wan Du

Comments Accepted for ICLR 2026

详情

Journal ref: International Conference on Learning Representations (ICLR) 2026

英文摘要

When there exists an unlearnable source of randomness (noisy-TV) in the environment, a naively intrinsic reward driven exploring agent gets stuck at that source of randomness and fails at exploration. Intrinsic reward based on uncertainty estimation or distribution similarity, while eventually escapes noisy-TVs as time unfolds, suffers from poor sample efficiency and high computational cost. Inspired by recent findings from neuroscience that humans monitor their improvements during exploration, we propose a novel method for intrinsically-motivated exploration, named Learning Progress Monitoring (LPM). During exploration, LPM rewards model improvements instead of prediction error or novelty, effectively rewards the agent for observing learnable transitions rather than the unlearnable transitions. We introduce a dual-network design that uses an error model to predict the expected prediction error of the dynamics model in its previous iteration, and use the difference between the model errors of the current iteration and previous iteration to guide exploration. We theoretically show that the intrinsic reward of LPM is zero-equivariant and a monotone indicator of Information Gain (IG), and that the error model is necessary to achieve monotonicity correspondence with IG. We empirically compared LPM against state-of-the-art baselines in noisy environments based on MNIST, 3D maze with 160x120 RGB inputs, and Atari. Results show that LPM's intrinsic reward converges faster, explores more states in the maze experiment, and achieves higher extrinsic reward in Atari. This conceptually simple approach marks a shift-of-paradigm of noise-robust exploration. For code to reproduce our experiments, see https://github.com/Akuna23Matata/LPM_exploration

URL PDF HTML ☆

赞 0 踩 0

2509.23880 2026-04-06 cs.CV

Learning Adaptive Pseudo-Label Selection for Semi-Supervised 3D Object Detection

Taehun Kong, Tae-Kyun Kim

Comments Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2026

2509.22367 2026-04-06 cs.CL cs.AI cs.CY

What Is The Political Content in LLMs' Pre- and Post-Training Data?

Tanise Ceron, Dmitry Nikolaev, Dominik Stammbach, Debora Nozza

Comments 10 pages, under review

2509.21716 2026-04-06 cs.LG

A Unifying Framework for Parallelizing Sequential Models with Linear Dynamical Systems

Xavier Gonzalez, E. Kelly Buchanan, Hyun Dong Lee, Jerry Weihong Liu, Ke Alexander Wang, David M. Zoltowski, Leo Kozachkov, Christopher Ré, Scott W. Linderman

Comments TMLR. Code: https://github.com/lindermanlab/parallelizing_with_lds

2509.19893 2026-04-06 cs.CL

Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning

Minjae Oh, Yunho Choi, Dongmin Choi, Yohan Jo

Comments 9 pages

2509.19579 2026-04-06 cs.RO

Terra: Hierarchical Terrain-Aware 3D Scene Graph for Task-Agnostic Outdoor Mapping

Chad R. Samuelson, Abigail Austin, Seth Knoop, Blake Romrell, Gabriel R. Slade, Timothy W. McLain, Joshua G. Mangelson

2509.19454 2026-04-06 cs.RO cs.AI cs.CV cs.LG

ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation

Jason Chen, I-Chun Arthur Liu, Gaurav Sukhatme, Daniel Seita

Comments Accepted to the International Conference on Robotics and Automation (ICRA) 2026

2509.12643 2026-04-06 cs.AI

Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution

Beidan Liu, Zhengqiu Zhu, Chen Gao, Tianle Pu, Yong Zhao, Wei Qi, Quanjun Yin

2509.07801 2026-04-06 cs.CL cs.DL cs.IR

SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP

Decheng Duan, Yingyi Zhang, Jitong Peng, Chengzhi Zhang

Comments EMNLP 2025 Main

2509.07553 2026-04-06 cs.CL

VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents

Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang

2509.07274 2026-04-06 cs.CL cs.CY cs.LG

LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade

Aida Kostikova, Ole Pütz, Steffen Eger, Olga Sabelfeld, Benjamin Paassen

2509.04276 2026-04-06 cs.CV

PAOLI: Pose-free Articulated Object Learning from Sparse-view Images

Jianning Deng, Kartic Subr, Hakan Bilen