arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.03610 2026-03-05 cs.LG

Riemannian Optimization in Modular Systems

Christian Pehle, Jean-Jacques Slotine

Comments 9 pages

详情

英文摘要

Understanding how systems built out of modular components can be jointly optimized is an important problem in biology, engineering, and machine learning. The backpropagation algorithm is one such solution and has been instrumental in the success of neural networks. Despite its empirical success, a strong theoretical understanding of it is lacking. Here, we combine tools from Riemannian geometry, optimal control theory, and theoretical physics to advance this understanding. We make three key contributions: First, we revisit the derivation of backpropagation as a constrained optimization problem and combine it with the insight that Riemannian gradient descent trajectories can be understood as the minimum of an action. Second, we introduce a recursively defined layerwise Riemannian metric that exploits the modular structure of neural networks and can be efficiently computed using the Woodbury matrix identity, avoiding the $O(n^3)$ cost of full metric inversion. Third, we develop a framework of composable ``Riemannian modules'' whose convergence properties can be quantified using nonlinear contraction theory, providing algorithmic stability guarantees of order $O(κ^2 L/(ξμ\sqrt{n}))$ where $κ$ and $L$ are Lipschitz constants, $μ$ is the mass matrix scale, and $ξ$ bounds the condition number. Our layerwise metric approach provides a practical alternative to natural gradient descent. While we focus here on studying neural networks, our approach more generally applies to the study of systems made of modules that are optimized over time, as it occurs in biology during both evolution and development.

URL PDF HTML ☆

赞 0 踩 0

2603.03604 2026-03-05 cs.CV q-bio.QM

Tracking Feral Horses in Aerial Video Using Oriented Bounding Boxes

Saeko Takizawa, Tamao Maeda, Shinya Yamamoto, Hiroaki Kawashima

Comments Author's version of the paper presented at AROB-ISBC 2026

2603.03603 2026-03-05 cs.CV q-bio.QM

Detection and Identification of Penguins Using Appearance and Motion Features

Kasumi Seko, Hiroki Kinoshita, Raj Rajeshwar Malinda, Hiroaki Kawashima

Comments Author's version of the paper presented at AROB-ISBC 2026

2603.03602 2026-03-05 cs.CV

DM-CFO: A Diffusion Model for Compositional 3D Tooth Generation with Collision-Free Optimization

Yan Tian, Pengcheng Xue, Weiping Ding, Mahmoud Hassaballah, Karen Egiazarian, Aura Conci, Abdulkadir Sengur, Leszek Rutkowski

Comments Received by IEEE Transactions on Visualization and Computer Graphics

2603.03418 2026-03-05 cs.CV

mHC-HSI: Clustering-Guided Hyper-Connection Mamba for Hyperspectral Image Classification

Yimin Zhu, Zack Dewis, Quinn Ledingham, Saeid Taleghanidoozdoozan, Mabel Heffring, Zhengsen Xu, Motasem Alkayid, Megan Greenwood, Lincoln Linlin Xu

Comments arXiv admin note: text overlap with arXiv:2601.15757

2603.03311 2026-03-05 cs.CL

The Logovista English-Japanese Machine Translation System

Barton D. Wright

2603.03187 2026-03-05 cs.CV

ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection

Chun-Wun Cheng, Yanqi Cheng, Peiyuan Jing, Guang Yang, Javier A. Montoya-Zegarra, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

2603.03101 2026-03-05 cs.CV cs.AI

MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection

Jun Yeong Park, JunYoung Seo, Minji Kang, Yu Rang Park

Comments Accepted by CVPR 2026

2603.02989 2026-03-05 cs.RO

CASSR: Continuous A-Star Search through Reachability for real time footstep planning

Jiayi Wang, Steve Tonneau

2603.02929 2026-03-05 cs.CV

TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval

Xiangzhao Hao, Shijie Wang, Tianyu Yang, Tianyue Wang, Haiyun Guo, Jinqiao Wang

2603.02909 2026-03-05 cs.CL cs.AI

Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

Guangjun Zhang, Hu Zhang, Yazhou Han, Yue Fan, Yuhang Shao, Ru Li, Hongye Tan

Comments Accepted by AAAI 2026

2603.02862 2026-03-05 cs.LG

Learning in Markov Decision Processes with Exogenous Dynamics

Davide Maran, Davide Salaorni, Marcello Restelli

2603.02829 2026-03-05 cs.CV cs.LG

Toward Early Quality Assessment of Text-to-Image Diffusion Models

Huanlei Guo, Hongxin Wei, Bingyi Jing

2603.02504 2026-03-05 cs.AI

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

Pratibha Zunjare, Michael Hsiao

2603.02468 2026-03-05 cs.RO

A Novel Modular Cable-Driven Soft Robotic Arm with Multi-Segment Reconfigurability

Moeen Ul Islam, Cheng Ouyang, Xinda Qi, Azlan Zahid, Xiaobo Tan, Dong Chen

Comments 6 pages, 8 figures

2603.02430 2026-03-05 cs.LG cs.CV

A Unified Revisit of Temperature in Classification-Based Knowledge Distillation

Logan Frank, Jim Davis

2603.02365 2026-03-05 cs.AI

Can machines be uncertain?

Luis Rosa

2603.02353 2026-03-05 cs.CL

Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

Jiangang Hao

Comments 21 pages, 2 figures

2603.02214 2026-03-05 cs.AI cs.CR cs.LG

Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving

Jungwon Seo, Ferhat Ozgur Catak, Chunming Rong, Jaeyeon Jang

Comments 19 pages, 6 figures, 10 tables

2603.02029 2026-03-05 cs.AI cs.LG stat.ML

Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization

Felipe Maia Polo, Aida Nematzadeh, Virginia Aglietti, Adam Fisch, Isabela Albuquerque

2603.01930 2026-03-05 cs.CL cs.AI cs.LG

From Variance to Invariance: Qualitative Content Analysis for Narrative Graph Annotation

Junbo Huang, Max Weinig, Ulrich Fritsche, Ricardo Usbeck

Comments LREC 2026 Accepted Paper

2603.01752 2026-03-05 cs.LG q-bio.CB q-bio.GN

Causal Circuit Tracing Reveals Distinct Computational Architectures in Single-Cell Foundation Models: Inhibitory Dominance, Biological Coherence, and Cross-Model Convergence

Ihor Kendiukhov

2603.01550 2026-03-05 cs.CL cs.AI

Extracting Training Dialogue Data from Large Language Model based Task Bots

Shuo Zhang, Junzhou Zhao, Junji Hou, Pinghui Wang, Chenxu Wang, Jing Tao

Comments Accepted for publication in IEEE Transactions on Information Forensics and Security (TIFS). \c{opyright} 2026 IEEE

2603.01266 2026-03-05 cs.CL

A Study on Building Efficient Zero-Shot Relation Extraction Models

Hugo Thomas, Caio Corro, Guillaume Gravier, Pascale Sébillot

Comments LREC 2026

2603.01116 2026-03-05 cs.CV

Improved MambdaBDA Framework for Robust Building Damage Assessment Across Disaster Domains

Alp Eren Gençoğlu, Hazım Kemal Ekenel

Comments Preprint. Accepted at VISAPP 2026

2602.24065 2026-03-05 cs.CV

EvalMVX: A Unified Benchmarking for Neural 3D Reconstruction under Diverse Multiview Setups

Zaiyan Yang, Jieji Ren, Xiangyi Wang, zonglin li, Xu Cao, Heng Guo, Zhanyu Ma, Boxin Shi

2602.22730 2026-03-05 cs.CL

Extending Czech Aspect-Based Sentiment Analysis with Opinion Terms: Dataset and LLM Benchmarks

Jakub Šmíd, Pavel Přibáň, Pavel Král

Comments Accepted for the 15th edition of the Language Resources and Evaluation Conference (LREC 2026)

2602.22469 2026-03-05 cs.CV cs.AI

Beyond Dominant Patches: Spatial Credit Redistribution For Grounded Vision-Language Models

Niamul Hassan Samin, Md Arifur Rahman, Abdullah Ibne Hanif Arean, Juena Ahmed Noshin, Md Ashikur Rahman

2602.22227 2026-03-05 cs.LG cs.AI

Dynamic Adversarial Reinforcement Learning for Robust Multimodal Large Language Models

Yicheng Bao, Xuhong Wang, Qiaosheng Zhang, Chaochao Lu, Xia Hu, Xin Tan

2602.22056 2026-03-05 cs.RO cs.LG

FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation

Edgar Welte, Yitian Shi, Rosa Wolf, Maximillian Gilles, Rania Rayyes

Comments 8 pages, 5 figures