arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.01786 2026-03-03 cs.LG cs.AI stat.ML

Learning Shortest Paths with Generative Flow Networks

Nikita Morozov, Ian Maksimov, Daniil Tiapkin, Sergey Samsonov

详情

英文摘要

In this paper, we present a novel learning framework for finding shortest paths in graphs utilizing Generative Flow Networks (GFlowNets). First, we examine theoretical properties of GFlowNets in non-acyclic environments in relation to shortest paths. We prove that, if the total flow is minimized, forward and backward policies traverse the environment graph exclusively along shortest paths between the initial and terminal states. Building on this result, we show that the pathfinding problem in an arbitrary graph can be solved by training a non-acyclic GFlowNet with flow regularization. We experimentally demonstrate the performance of our method in pathfinding in permutation environments and in solving Rubik's Cubes. For the latter problem, our approach shows competitive results with state-of-the-art machine learning approaches designed specifically for this task in terms of the solution length, while requiring smaller search budget at test-time.

URL PDF HTML ☆

赞 0 踩 0

2603.01783 2026-03-03 cs.AI

GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Yifan Wang, Mingxuan Jiang, Zhihao Sun, Yixin Cao, Yicun Liu, Keyang Chen, Guangnan Ye, Hongfeng Chai

2603.01780 2026-03-03 cs.LG q-bio.GN

D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Zhao Yang, Hengchang Liu, Chuan Cao, Bing Su

Comments Accepted as a workshop paper at MLGenX 2026

2603.01778 2026-03-03 cs.CL

LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction

Nils Constantin Hellwig, Jakob Fehle, Udo Kruschwitz, Christian Wolff

Comments Accepted for publication at LREC 2026. Final version will appear in the ACL Anthology

2603.01775 2026-03-03 cs.CL

Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation

Harry Stuart, Masahiro Kaneko, Timothy Baldwin

2603.01773 2026-03-03 cs.CL

AnnoABSA: A Web-Based Annotation Tool for Aspect-Based Sentiment Analysis with Retrieval-Augmented Suggestions

Nils Constantin Hellwig, Jakob Fehle, Udo Kruschwitz, Christian Wolff

Comments Accepted for publication at LREC 2026. Final version will appear in the ACL Anthology

2603.01767 2026-03-03 cs.CV eess.IV

Downstream Task Inspired Underwater Image Enhancement: A Perception-Aware Study from Dataset Construction to Network Design

Bosen Lin, Feng Gao, Yanwei Yu, Junyu Dong, Qian Du

Comments Accepted for publication in IEEE TIP 2026

2603.01762 2026-03-03 cs.LG

DGNet: Discrete Green Networks for Data-Efficient Learning of Spatiotemporal PDEs

Yingjie Tan, Quanming Yao, Yaqing Wang

Comments Accepted as a conference paper at ICLR 2026

2603.01759 2026-03-03 cs.LG

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Zichen Tian, Yaoyao Liu, Qianru Sun

Comments Accepted by CVPR 2025 (Highlight). Code is available at: https://github.com/doem97/metalora

2603.01758 2026-03-03 cs.CV

Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining

Yuxuan Li, Yuming Chen, Yunheng Li, Ming-Ming Cheng, Xiang Li, Jian Yang

2603.01757 2026-03-03 cs.CV

StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models

Keli Liu, Zhendong Wang, Wengang Zhou, Houqiang Li

2603.01751 2026-03-03 cs.RO cs.AI cs.LG cs.SY eess.SY

Shape-Interpretable Visual Self-Modeling Enables Geometry-Aware Continuum Robot Control

Peng Yu, Xin Wang, Ning Tan

详情

英文摘要

Continuum robots possess high flexibility and redundancy, making them well suited for safe interaction in complex environments, yet their continuous deformation and nonlinear dynamics pose fundamental challenges to perception, modeling, and control. Existing vision-based control approaches often rely on end-to-end learning, achieving shape regulation without explicit awareness of robot geometry or its interaction with the environment. Here, we introduce a shape-interpretable visual self-modeling framework for continuum robots that enables geometry-aware control. Robot shapes are encoded from multi-view planar images using a Bezier-curve representation, transforming visual observations into a compact and physically meaningful shape space that uniquely characterizes the robot's three-dimensional configuration. Based on this representation, neural ordinary differential equations are employed to self-model both shape and end-effector dynamics directly from data, enabling hybrid shape-position control without analytical models or dense body markers. The explicit geometric structure of the learned shape space allows the robot to reason about its body and surroundings, supporting environment-aware behaviors such as obstacle avoidance and self-motion while maintaining end-effector objectives. Experiments on a cable-driven continuum robot demonstrate accurate shape-position regulation and tracking, with shape errors within 1.56% of image resolution and end-effector errors within 2% of robot length, as well as robust performance in constrained environments. By elevating visual shape representations from two-dimensional observations to an interpretable three-dimensional self-model, this work establishes a principled alternative to vision-based end-to-end control and advances autonomous, geometry-aware manipulation for continuum robots.

URL PDF HTML ☆

赞 0 踩 0

2603.01750 2026-03-03 cs.LG

Practical Deep Heteroskedastic Regression

Mikkel Jordahn, Jonas Vestergaard Jensen, James Harrison, Michael Riis Andersen, Mikkel N. Schmidt

2603.01748 2026-03-03 cs.LG cs.AI

Discrete World Models via Regularization

Davide Bizzaro, Luciano Serafini

2603.01746 2026-03-03 cs.CV cs.AI

An Analysis of Multi-Task Architectures for the Hierarchic Multi-Label Problem of Vehicle Model and Make Classification

Alexandru Manole, Laura Diosan

Comments 14 pages, 8 figures ,7 tables

2603.01739 2026-03-03 cs.LG cs.AI cs.DC

CA-AFP: Cluster-Aware Adaptive Federated Pruning

Om Govind Jha, Harsh Shukla, Haroon R. Lone

2603.01730 2026-03-03 cs.LG

Decentralized Federated Learning by Partial Message Exchange

Shan Sha, Shenglong Zhou, Xin Wang, Lingchen Kong, Geoffrey Ye Li

2603.01725 2026-03-03 cs.CV

Learning Domain-Aware Task Prompt Representations for Multi-Domain All-in-One Image Restoration

Guanglu Dong, Chunlei Li, Chao Ren, Jingliang Hu, Yilei Shi, Xiao Xiang Zhu, Lichao Mou

Comments ICLR 2026

2603.01724 2026-03-03 cs.AI

GMP: A Benchmark for Content Moderation under Co-occurring Violations and Dynamic Rules

Houde Dong, Yifei She, Kai Ye, Liangcai Su, Chenxiong Qian, Jie Hao

2603.01720 2026-03-03 cs.CV

Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints

Ruize Cui, Jialun Pei, Haiqiao Wang, Jun Zhou, Jeremy Yuen-Chun Teoh, Pheng-Ann Heng, Jing Qin

Comments 10 pages, 4 figures

2603.01714 2026-03-03 cs.LG cs.CL

TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training

Jinluan Yang, Yuxin Liu, Zhengyu Chen, Chengcheng Han, Yueqing Sun, Qi Gu, Hui Su, Xunliang Cai, Fei Wu, Kun Kuang

Comments Under Review

2603.01713 2026-03-03 cs.CV

Dual Distillation for Few-Shot Anomaly Detection

Le Dong, Qinzhong Tan, Chunlei Li, Jingliang Hu, Yilei Shi, Weisheng Dong, Xiao Xiang Zhu, Lichao Mou

Comments ICLR 2026

2603.01710 2026-03-03 cs.CL cs.IR cs.LG

Legal RAG Bench: an end-to-end benchmark for legal RAG

Abdur-Rahman Butler, Umar Butler

Comments 13 pages, 3 figures, 4 tables

2603.01708 2026-03-03 cs.CV

WhisperNet: A Scalable Solution for Bandwidth-Efficient Collaboration

Gong Chen, Chaokun Zhang, Xinyan Zhao

Comments Accepted by CVPR26

2603.01706 2026-03-03 cs.CV cs.LG

Search Multilayer Perceptron-Based Fusion for Efficient and Accurate Siamese Tracking

Tianqi Shen, Huakao Lin, Ning An

Comments 23 pages, 12 figures, 7 tables. This work was completed in 2024 and accepted for publication in IEEE TCDS (2026)

2603.01705 2026-03-03 cs.RO

A Safety-Aware Shared Autonomy Framework with BarrierIK Using Control Barrier Functions

Berk Guler, Kay Pompetzki, Yuanzheng Sun, Simon Manschitz, Jan Peters

Comments Accepted on ICRA 2026, 9 pages, 5 figures

2603.01700 2026-03-03 cs.RO

TacMamba: A Tactile History Compression Adapter Bridging Fast Reflexes and Slow VLA Reasoning

Zhenan Wang, Yanzhe Wang, Meixuan Ren, Peng Li, Yang Liu, Yifei Nie, Limin Long, Yun Ye, Xiaofeng Wang, Zhen Zhu, Huixu Dong

2603.01698 2026-03-03 cs.CV cs.AI

Towards Principled Dataset Distillation: A Spectral Distribution Perspective

Ruixi Wu, Shaobo Wang, Jiahuan Chen, Zhiyuan Liu, Yicun Yang, Zhaorun Chen, Zekai Li, Kaixin Li, Xinming Wang, Hongzhu Yi, Kai Wang, Linfeng Zhang

Comments 30 pages, 5 tables, 4 figures

2603.01697 2026-03-03 cs.LG cs.AI

DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks

Gökdeniz Gülmez

2603.01695 2026-03-03 cs.LG cs.AI

Streaming Continual Learning for Unified Adaptive Intelligence in Dynamic Environments

Federico Giannini, Giacomo Ziffer, Andrea Cossu, Vincenzo Lomonaco