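arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.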

2603.14333 2026-03-17 cs.RO cs.LG

Data-Driven Physics Embedded Dynamics with Predictive Control and Reinforcement Learning for Quadrupeds

Prakrut Kotecha, Aditya Shirwatkar, Shishir Kolathaya

Comments 9 pages, 6 figures

Abstract

State-of-the-art quadrupedal locomotion approaches integrate Model Predictive Control (MPC) with Reinforcement Learning (RL), enabling complex motion capabilities with planning and terrain-adaptive behaviors. However, they often face compounding errors over long horizons and have limited interpretability due to the absence of physical inductive biases. We address these issues by integrating Lagrangian Neural Networks (LNNs) into an RL-MPC framework, enabling physically consistent dynamics learning. At deployment, our inverse-dynamics, infinite-horizon MPC scheme avoids costly matrix inversions, improving computational efficiency by up to 4x with minimal loss of task performance. We validate our framework through multiple ablations of the proposed LNN and its variants. We show improved sample efficiency, reduced long-horizon error, and faster real-time planning compared to unstructured neural dynamics. Lastly, we also test our framework on the Unitree Go1 robot to show real-world viability.
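
The remark about avoiding matrix inversions can be illustrated with a toy example. The sketch below is not the authors' LNN or MPC scheme; it only shows, under stated assumptions, why an inverse-dynamics formulation needs a matrix-vector product where a forward-dynamics formulation needs a linear solve. The 2-DoF mass matrix and bias terms are hypothetical stand-ins (a unit-parameter double pendulum without gravity), and all function names are illustrative.

```python
import numpy as np

def mass_matrix(q):
    """Toy symmetric positive-definite mass matrix M(q) for a 2-DoF system (assumed)."""
    c = np.cos(q[1])
    return np.array([[3.0 + 2.0 * c, 1.0 + c],
                     [1.0 + c, 1.0]])

def bias_forces(q, qd):
    """Toy Coriolis/centrifugal terms h(q, qd); gravity is omitted for brevity."""
    s = np.sin(q[1])
    return np.array([-s * (2.0 * qd[0] * qd[1] + qd[1] ** 2),
                     s * qd[0] ** 2])

def inverse_dynamics(q, qd, qdd):
    """tau = M(q) qdd + h(q, qd): only a matrix-vector product, no inversion."""
    return mass_matrix(q) @ qdd + bias_forces(q, qd)

def forward_dynamics(q, qd, tau):
    """qdd = M(q)^{-1} (tau - h(q, qd)): requires solving a linear system."""
    return np.linalg.solve(mass_matrix(q), tau - bias_forces(q, qd))

q = np.array([0.3, -0.7])
qd = np.array([0.5, 1.2])
qdd_desired = np.array([1.0, -2.0])

# Torques for a desired acceleration, then a round trip through the solve.
tau = inverse_dynamics(q, qd, qdd_desired)
qdd_check = forward_dynamics(q, qd, tau)
print(np.allclose(qdd_check, qdd_desired))
```

In a planner that treats accelerations as decision variables, only `inverse_dynamics` would be evaluated inside the loop, which is one common reading of how such a scheme sidesteps the solve.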

2603.14328 2026-03-17 cs.SD eess.AS

CodecMOS-Accent: A MOS Benchmark of Resynthesized and TTS Speech from Neural Codecs Across English Accents

Wen-Chin Huang, Nicholas Sanders, Erica Cooper

Comments Preprint

2603.14327 2026-03-17 cs.RO

OmniClone: Engineering a Robust, All-Rounder Whole-Body Humanoid Teleoperation System

Yixuan Li, Le Ma, Yutang Lin, Yushi Du, Mengya Liu, Kaizhe Hu, Jieming Cui, Yixin Zhu, Wei Liang, Baoxiong Jia, Siyuan Huang

Comments Website: https://omniclone.github.io/

2603.14326 2026-03-17 cs.LG cs.AI cs.CL

ECG-Reasoning-Benchmark: A Benchmark for Evaluating Clinical Reasoning Capabilities in ECG Interpretation

Jungwoo Oh, Hyunseung Chung, Junhee Lee, Min-Gyu Kim, Hangyul Yoon, Ki Seong Lee, Youngchae Lee, Muhan Yeo, Edward Choi

Comments Preprint. 9 pages for main text, 2 pages for references, 19 pages for supplementary materials (appendix)

2603.14323 2026-03-17 cs.CV cs.AI

How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

Guimeng Liu, Tianze Yu, Somayeh Ebrahimkhani, Lin Zhi Zheng Shawn, Kok Pin Ng, Ngai-Man Cheung

Comments Published as a conference paper at ICLR 2026

2603.14321 2026-03-17 cs.CV

Personalized Cell Segmentation: Benchmark and Framework for Reference-Guided Cell Type Segmentation

Bisheng Wang, Jaime S. Cardoso, Lin Wu

Comments Accepted by IEEE ICASSP 2026. 5 pages, 3 figures. (C) 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising/promotional purposes, creating new collective works, for resale or redistribution, or reuse of any copyrighted component

2603.14320 2026-03-17 cs.CV

Early Failure Detection and Intervention in Video Diffusion Models

Kwon Byung-Ki, Sohwi Lim, Nam Hyeon-Woo, Moon Ye-Bin, Tae-Hyun Oh

Comments 29 pages, 24 figures, 9 tables

2603.14319 2026-03-17 cs.LG

Structure-Dependent Regret and Constraint Violation Bounds for Online Convex Optimization with Time-Varying Constraints

Xiufeng Liu, Qian Chen, Zhijin Wang, Ruyu Liu

2603.14316 2026-03-17 cs.CV

Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting

Shuai Guo, Ao Guo, Junchao Zhao, Qi Chen, Yuxiang Qi, Zechuan Li, Dong Chen, Tianjia Shao, Mingliang Xu

2603.14313 2026-03-17 cs.CL

Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

Yixuan Tang, Yi Yang

2603.14312 2026-03-17 cs.AI cond-mat.dis-nn cs.LG cs.MA q-bio.BM

Autonomous Agents Coordinating Distributed Discovery Through Emergent Artifact Exchange

Fiona Y. Wang, Lee Marom, Subhadeep Pal, Rachel K. Luu, Wei Lu, Jaime A. Berkovich, Markus J. Buehler

Abstract

We present ScienceClaw + Infinite, a framework for autonomous scientific investigation in which independent agents conduct research without central coordination, and any contributor can deploy new agents into a shared ecosystem. The system is built around three components: an extensible registry of over 300 interoperable scientific skills, an artifact layer that preserves full computational lineage as a directed acyclic graph (DAG), and a structured platform for agent-based scientific discourse with provenance-aware governance. Agents select and chain tools based on their scientific profiles, produce immutable artifacts with typed metadata and parent lineage, and broadcast unsatisfied information needs to a shared global index. The ArtifactReactor enables plannerless coordination: peer agents discover and fulfill open needs through pressure-based scoring, while schema-overlap matching triggers multi-parent synthesis across independent analyses. An autonomous mutation layer actively prunes the expanding artifact DAG to resolve conflicting or redundant workflows, while persistent memory allows agents to continuously build upon complex epistemic states across multiple cycles. Infinite converts these outputs into auditable scientific records through structured posts, provenance views, and machine-readable discourse relations, with community feedback steering subsequent investigation cycles. Across four autonomous investigations (peptide design for the somatostatin receptor SSTR2; lightweight impact-resistant ceramic screening; cross-domain resonance bridging biology, materials, and music; and formal analogy construction between urban morphology and grain-boundary evolution), the framework demonstrates heterogeneous tool chaining, emergent convergence among independently operating agents, and traceable reasoning from raw computation to published finding.
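
The idea of immutable artifacts with typed metadata and parent lineage forming a DAG can be sketched as follows. This is a minimal illustration, not the ScienceClaw + Infinite implementation: the class names `Artifact` and `ArtifactStore`, the content-hash identifier, and the payloads are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
import hashlib
import json

@dataclass(frozen=True)  # frozen: fields cannot be reassigned after creation
class Artifact:
    kind: str            # typed metadata (hypothetical: a single type tag)
    payload: str         # the artifact's content
    parents: tuple = ()  # ids of the artifacts this one was derived from

    @property
    def artifact_id(self):
        # Content-derived id: same kind, payload, and parents give the same id.
        body = json.dumps([self.kind, self.payload, list(self.parents)])
        return hashlib.sha256(body.encode()).hexdigest()[:12]

class ArtifactStore:
    def __init__(self):
        self._items = {}

    def add(self, kind, payload, parents=()):
        # Parents must already exist, so edges only point backwards in time
        # and the graph stays acyclic by construction.
        for parent_id in parents:
            if parent_id not in self._items:
                raise KeyError(f"unknown parent artifact: {parent_id}")
        artifact = Artifact(kind, payload, tuple(parents))
        self._items[artifact.artifact_id] = artifact
        return artifact.artifact_id

    def lineage(self, artifact_id):
        """Return the ids of all ancestors of the given artifact."""
        seen = set()
        stack = [artifact_id]
        while stack:
            current = stack.pop()
            for parent_id in self._items[current].parents:
                if parent_id not in seen:
                    seen.add(parent_id)
                    stack.append(parent_id)
        return seen

store = ArtifactStore()
a = store.add("analysis", "result from agent A")
b = store.add("analysis", "result from agent B")
c = store.add("synthesis", "merged view of A and B", parents=[a, b])
print(sorted(store.lineage(c)) == sorted([a, b]))
```

The multi-parent `synthesis` artifact mirrors the abstract's multi-parent synthesis step only in shape; scoring, schema matching, and pruning are out of scope for this sketch.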

2603.14309 2026-03-17 cs.CV

In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels

Tomislav Medic, Liangliang Nan

Comments to be published in ISPRS Annals of Photogrammetry and Remote Sensing at XXV ISPRS Congress, Toronto, Canada, July 2026, 8 pages, 5 figures

2603.14308 2026-03-17 cs.RO

Load-Aware Locomotion Control for Humanoid Robots in Industrial Transportation Tasks

Lequn Fu, Yijun Zhong, Xiao Li, Yibin Liu, Zhiyuan Xu, Jian Tang, Shiqi Li

Comments This work has been submitted to the IEEE Transactions on Industrial Electronics for possible publication

2603.14304 2026-03-17 cs.CV

A Physically-Grounded Attack and Adaptive Defense Framework for Real-World Low-Light Image Enhancement

Tongshun Zhang, Pingping Liu, Yuqing Lei, Zixuan Zhong, Qiuzhan Zhou, Zhiyuan Zha

2603.14303 2026-03-17 cs.CL

SemantiCache: Efficient KV Cache Compression via Semantic Chunking and Clustered Merging

Shunlong Wu, Hai Lin, Shaoshen Chen, Tingwei Lu, Yongqin Zeng, Shaoxiong Zhan, Hai-Tao Zheng, Hong-Gee Kim

2603.14301 2026-03-17 cs.CV cs.AI cs.GR

4D Synchronized Fields: Motion-Language Gaussian Splatting for Temporal Scene Understanding

Mohamed Rayan Barhdadi, Samir Abdaljalil, Rasul Khanbayov, Erchin Serpedin, Hasan Kurban

Comments 34 pages, 3 figures, 7 tables. Includes supplementary material. Preprint

2603.14300 2026-03-17 cs.CV

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Mingqi Gao, Jinyu Yang, Jingnan Luo, Xiantong Zhen, Jungong Han, Giovanni Montana, Feng Zheng

2603.14297 2026-03-17 cs.CV

RL-ScanIQA: Reinforcement-Learned Scanpaths for Blind 360° Image Quality Assessment

Yujia Wang, Yuyan Li, Jiuming Liu, Fang-Lue Zhang, Xinhu Zheng, Neil A. Dodgson

Comments Accepted by CVPR 2026

2603.14290 2026-03-17 cs.CV

RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Haoang Li, Mengmeng Liu, Tianchen Deng, Marc Pollefeys, Michael Ying Yang, Hesheng Wang

2603.14289 2026-03-17 cs.LG cs.NA math.NA

Windowed Fourier Propagator: A Frequency-Local Neural Operator for Wave Equations in Inhomogeneous Media

Yiyang Cai, Zixuan Qiu, Yunlu Shu, Jiamao Wu, Yingzhou Li, Tianyu Wang, Xi Chen

2603.14282 2026-03-17 cs.CV

Multi-Period Texture Contrast Enhancement for Low-Contrast Wafer Defect Detection and Segmentation

Zihan Zhang

2603.14281 2026-03-17 cs.CV

DC-ViT: Modulating Spatial and Channel Interactions for Multi-Channel Images

Umar Marikkar, Syed Sameed Husain, Muhammad Awais, Sara Atito

2603.14276 2026-03-17 cs.CV cs.AI

All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation

Xudong Wang, Gan Li, Zhiyu Liu, Yao Wang, Lianqing Liu, Zhi Han

Comments ICLR 2026

2603.14272 2026-03-17 cs.LG

Learning in Function Spaces: An Unified Functional Analytic View of Supervised and Unsupervised Learning

K. Lakshmanan

Comments 17 pages, 2 figures

2603.14271 2026-03-17 cs.CV

Toward Clinically Ready Foundation Models in Medical Image Analysis: Adaptation Mechanisms and Deployment Trade-offs

Karma Phuntsho, Abdullah, Kyungmi Lee, Ickjai Lee, Euijoon Ahn

2603.14265 2026-03-17 cs.CL cs.MA

MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of Large Language Models in Medical Open-End Question Answering

Shaowei Guan, Yu Zhai, Hin Chi Kwok, Jiawei Du, Xinyu Feng, Jing Li, Harry Qin, Vivian Hui

Comments 17 pages, 5 figures

2603.14258 2026-03-17 cs.LG cs.NA math.NA math.PR

Sampling Boltzmann distributions via normalizing flow approximation of transport maps

Zia Ur Rehman, Gero Friesecke

2603.14257 2026-03-17 cs.CL

Automatic Inter-document Multi-hop Scientific QA Generation

Seungmin Lee, Dongha Kim, Yuni Jeon, Junyoung Koh, Min Song

Comments 14 pages, 5 figures, 8 tables. Accepted to the 2026 International Conference on Language Resources and Evaluation (LREC 2026)

2603.14254 2026-03-17 cs.CV cs.LG

ZOTTA: Test-Time Adaptation with Gradient-Free Zeroth-Order Optimization

Ronghao Zhang, Shuaicheng Niu, Qi Deng, Yanjie Dong, Jian Chen, Runhao Zeng

Comments 14 pages, 13 figures

2603.14252 2026-03-17 cs.CV

MistExit: Learning to Exit for Early Mistake Detection in Procedural Videos

Sagnik Majumder, Anish Nethi, Ziad Al-Halah, Kristen Grauman