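arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.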

2603.06972 2026-03-13 cs.LG cs.CV

Conditional Unbalanced Optimal Transport Maps: An Outlier-Robust Framework for Conditional Generative Modeling

Jiwoo Yoon, Kyumin Choi, Jaewoong Choi

Comments 15 pages, 6 figures

Abstract

The Conditional Optimal Transport (COT) problem aims to find a transport map between conditional source and target distributions while minimizing the transport cost. Recently, these transport maps have been utilized in conditional generative modeling tasks to establish efficient mappings between the distributions. However, classical COT inherits a fundamental limitation of optimal transport, i.e., sensitivity to outliers, which arises from the hard distribution matching constraints. This limitation becomes more pronounced in a conditional setting, where each conditional distribution is estimated from a limited subset of data. To address this, we introduce the Conditional Unbalanced Optimal Transport (CUOT) framework, which relaxes conditional distribution-matching constraints through Csiszár divergence penalties while strictly preserving the conditioning marginals. We establish a rigorous formulation of the CUOT problem and derive its dual and semi-dual formulations. Based on the semi-dual form, we propose Conditional Unbalanced Optimal Transport Maps (CUOTM), an outlier-robust conditional generative model built upon a triangular $c$-transform parameterization. We theoretically justify the validity of this parameterization by proving that the optimal triangular map satisfies the $c$-transform relationships. Our experiments on 2D synthetic and image-scale datasets demonstrate that CUOTM achieves superior outlier robustness and competitive distribution-matching performance compared to existing COT-based baselines, while maintaining high sampling efficiency.

2603.06168 2026-03-13 cs.CV

JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas

Sandeep Inuganti, Hideaki Kanayama, Kanta Shimizu, Mahdi Chamseddine, Soichiro Yokota, Didier Stricker, Jason Rambach

2602.20792 2026-03-13 cs.CV

SIMSPINE: A Biomechanics-Aware Simulation Framework for 3D Spine Motion Annotation and Benchmarking

Muhammad Saif Ullah Khan, Didier Stricker

Comments Camera-ready version

2602.13823 2026-03-13 cs.CV

Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Haonan Jiang, Yuji Wang, Yongjie Zhu, Xin Lu, Wenyu Qin, Meng Wang, Pengfei Wan, Yansong Tang

Comments Correcting errors and improving organizational logic

2602.04634 2026-03-13 cs.AI cs.LG cs.MA

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Zelai Xu, Zhexuan Xu, Ruize Zhang, Chunyang Zhu, Shi Yu, Weilin Liu, Quanlu Zhang, Wenbo Ding, Chao Yu, Yu Wang

Comments https://wideseek-r1.github.io/

2601.21884 2026-03-13 cs.RO

Scalable Surface-Based Manipulation Through Modularity and Inter-Module Object Transfer

Pratik Ingle, Jørn Lambertsen, Kasper Støy, Andres Faina

Comments 8 pages

2601.06550 2026-03-13 cs.CV cs.AI

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models

Pan Liao, Feng Yang, Di Wu, Jinwen Yu, Yuhua Zhu, Wenhui Zhao, Dingwen Zhang

2601.03464 2026-03-13 cs.CL

Prompting Underestimates LLM Capability for Time Series Classification

Dan Schumacher, Erfan Nourbakhsh, Rocky Slavin, Anthony Rios

Comments 8 pages + Appendix and References, 9 figures

2601.02907 2026-03-13 cs.CL

Beyond the Black Box: A Survey on the Theory and Mechanism of Large Language Models

Zeyu Gan, Ruifeng Ren, Wei Yao, Xiaolin Hu, Gengze Xu, Chen Qian, Huayi Tang, Zixuan Gong, Xinhao Yao, Pengwei Tang, Zhenxing Dou, Yong Liu

2512.17086 2026-03-13 cs.AI

Value Under Ignorance in Universal Artificial Intelligence

Cole Wyeth, Marcus Hutter

2512.06297 2026-03-13 cs.LG cond-mat.dis-nn cond-mat.stat-mech cs.AI stat.ML

Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks

Luca Di Carlo, Chase Goddard, David J. Schwab

Comments ICLR 2026

2512.05391 2026-03-13 cs.CV

LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models

Qingqiao Hu, Weimin Lyu, Meilong Xu, Kehan Qi, Xiaoling Hu, Saumya Gupta, Jiawei Zhou, Chao Chen

Comments Code will be released soon

2512.04862 2026-03-13 cs.CV

Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing

Maria-Paola Forte, Nikos Athanasiou, Giulia Ballardini, Jan Ulrich Bartels, Katherine J. Kuchenbecker, Michael J. Black

Comments * Equal contribution. Minor figure corrections compared to the ICCV 2025 version

2511.20823 2026-03-13 cs.CV cs.AI cs.LG

RefTr: Recurrent Refinement of Confluent Trajectories for 3D Vascular Tree Centerlines

Roman Naeem, David Hagerman, Jennifer Alvén, Fredrik Kahl

2510.08575 2026-03-13 cs.CV

ReSplat: Learning Recurrent Gaussian Splatting

Haofei Xu, Daniel Barath, Andreas Geiger, Marc Pollefeys

Comments Project page: https://haofeixu.github.io/resplat/ Code: https://github.com/cvg/resplat

2510.00584 2026-03-13 cs.CV

Color Models in Image Processing: A Review and Experimental Comparison

Muragul Muratbekova, Nuray Toganas, Ayan Igali, Maksat Shagyrov, Elnara Kadyrgali, Adilet Yerkin, Pakizar Shamoi

Comments This manuscript has been submitted to Scientific Reports for consideration

2509.19297 2026-03-13 cs.CV

VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction

Weijie Wang, Yeqing Chen, Zeyu Zhang, Hengyu Liu, Haoxiao Wang, Zhiyuan Feng, Wenkang Qin, Feng Chen, Zheng Zhu, Donny Y. Chen, Bohan Zhuang

Comments Project Page: https://lhmd.top/volsplat, Code: https://github.com/ziplab/VolSplat

2509.15423 2026-03-13 cs.RO cs.SY eess.SY

Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing

Christopher Oeltjen, Carson Sobolewski, Saleh Faghfoorian, Lorant Domokos, Giancarlo Vidal, Sriram Yerramsetty, Ivan Ruchkin

Comments Equal contribution by the first three authors

2508.19742 2026-03-13 cs.CV

Adaptive Dual-Constrained Line Aggregation for Robust Generic and Wireframe Line Segment Detection

Chenguang Liu, Chisheng Wang, Huilin Chen, Chuanhua Zhu, Qingquan Li

2508.09202 2026-03-13 cs.CV cs.AI

Personalized Feature Translation for Expression Recognition: An Efficient Source-Free Domain Adaptation Method

Masoumeh Sharafi, Soufiane Belharbi, Muhammad Osama Zeeshan, Houssem Ben Salem, Ali Etemad, Alessandro Lameiras Koerich, Marco Pedersoli, Simon Bacon, Eric Granger

2507.11412 2026-03-13 cs.CL cs.IR cs.LG

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Orion Weller, Kathryn Ricci, Marc Marone, Antoine Chaffin, Dawn Lawrie, Benjamin Van Durme

Comments Accepted to ICLR'26

2507.08800 2026-03-13 cs.CV cs.AI cs.CL cs.HC cs.LG

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Luke Rivard, Sun Sun, Hongyu Guo, Wenhu Chen, Yuntian Deng

Comments ICLR 2026

2506.20793 2026-03-13 cs.CL

Multi-lingual Functional Evaluation for Large Language Models

Victor Ojewale, Inioluwa Deborah Raji, Suresh Venkatasubramanian

Comments This is an updated version with details of the CL-GSM Symbolic and CL-IFEval datasets validation

2506.07726 2026-03-13 cs.CL

Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU

Vincenzo Timmel, Manfred Vogel, Daniel Perruchoud, Reza Kakooee

Comments Change: Updated number of hours for train/test

2506.06214 2026-03-13 cs.CL cs.AI math-ph math.MP quant-ph

Can Theoretical Physics Research Benefit from Language Agents?

Sirui Lu, Zhijing Jin, Terry Jingchen Zhang, Pavel Kos, J. Ignacio Cirac, Bernhard Schölkopf

Comments 8+2 pages + references

2505.18675 2026-03-13 cs.CV cs.AI cs.CL

ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps

Sicheng Feng, Song Wang, Shuyi Ouyang, Lingdong Kong, Zikai Song, Jianke Zhu, Huan Wang, Xinchao Wang

Comments CVPR 2026, website: https://fscdc.github.io/ReasonMap/

2505.18017 2026-03-13 cs.LG

Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling

Matthieu Blanke, Yongquan Qu, Sara Shamekh, Pierre Gentine

2504.21767 2026-03-13 cs.RO

Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot

Yinglei Zhu, Sixiao He, Yan Ning, Zhenghao Qi, Zhuoyuan Yong, Yihua Qin, Jianyu Chen

DOI: 10.1109/IROS58592.2024.10801355
Journal ref: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Abu Dhabi, United Arab Emirates, 2024, pp. 11272-11277

Abstract

Wheel-legged robots combine the advantages of both wheeled robots and legged robots, offering versatile locomotion capabilities with excellent stability on challenging terrains and high efficiency on flat surfaces. However, existing wheel-legged robots typically have limited hip joint mobility compared to humans, even though the hip joint plays a crucial role in locomotion. In this paper, we introduce Whleaper, a novel 10-degree-of-freedom (DOF) bipedal wheeled robot, with 3 DOFs at the hip of each leg. Its humanoid joint design enables adaptable motion in complex scenarios, ensuring stability and flexibility. This paper introduces the details of Whleaper, with a focus on innovative mechanical design, control algorithms and system implementation. Firstly, stability stems from the increased DOFs at the hip, which expand the range of possible postures and improve the robot's foot-ground contact. Secondly, the extra DOFs also augment its mobility. During walking or sliding, more complex movements can be adopted to execute obstacle avoidance tasks. Thirdly, we utilize two control algorithms to implement multimodal motion for walking and sliding. By controlling specific DOFs of the robot, we conducted a series of simulations and practical experiments, demonstrating that a high-DOF hip joint design can effectively enhance the stability and flexibility of wheel-legged robots. Whleaper shows its capability to perform actions such as squatting, obstacle-avoidance sliding, and rapid turning in real-world scenarios.

2503.18981 2026-03-13 cs.LG cs.AI

FedSKD: Aggregation-free Model-heterogeneous Federated Learning via Multi-dimensional Similarity Knowledge Distillation for Medical Image Classification

Ziqiao Weng, Weidong Cai, Bo Zhou

Comments Accepted at IEEE-TNNLS, 17 pages

2502.04308 2026-03-13 cs.LG cs.AI cs.SI physics.soc-ph

HOG-Diff: Higher-Order Guided Diffusion for Graph Generation

Yiming Huang, Tolga Birdal

Comments Accepted at ICLR 2026