arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2502.08577 2026-03-11 cs.LG cs.AI

FBFL: A Field-Based Coordination Approach for Data Heterogeneity in Federated Learning

Davide Domini, Gianluca Aguzzi, Lukas Esterle, Mirko Viroli

详情

DOI: 10.46298/lmcs-22(1:19)2026
Journal ref: Logical Methods in Computer Science, Volume 22, Issue 1 (March 6, 2026) lmcs:15239

英文摘要

In the last years, Federated learning (FL) has become a popular solution to train machine learning models in domains with high privacy concerns. However, FL scalability and performance face significant challenges in real-world deployments where data across devices are non-independently and identically distributed (non-IID). The heterogeneity in data distribution frequently arises from spatial distribution of devices, leading to degraded model performance in the absence of proper handling. Additionally, FL typical reliance on centralized architectures introduces bottlenecks and single-point-of-failure risks, particularly problematic at scale or in dynamic environments. To close this gap, we propose Field-Based Federated Learning (FBFL), a novel approach leveraging macroprogramming and field coordination to address these limitations through: (i) distributed spatial-based leader election for personalization to mitigate non-IID data challenges; and (ii) construction of a self-organizing, hierarchical architecture using advanced macroprogramming patterns. Moreover, FBFL not only overcomes the aforementioned limitations, but also enables the development of more specialized models tailored to the specific data distribution in each subregion. This paper formalizes FBFL and evaluates it extensively using MNIST, FashionMNIST, and Extended MNIST datasets. We demonstrate that, when operating under IID data conditions, FBFL performs comparably to the widely-used FedAvg algorithm. Furthermore, in challenging non-IID scenarios, FBFL not only outperforms FedAvg but also surpasses other state-of-the-art methods, namely FedProx and Scaffold, which have been specifically designed to address non-IID data distributions. Additionally, we showcase the resilience of FBFL's self-organizing hierarchical architecture against server failures.

URL PDF HTML ☆

赞 0 踩 0

2603.09972 2026-03-11 cs.LG cs.AI cs.CV

From Data Statistics to Feature Geometry: How Correlations Shape Superposition

Lucas Prieto, Edward Stevinson, Melih Barsbey, Tolga Birdal, Pedro A. M. Mediano

2603.09971 2026-03-11 cs.RO

TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation

William Shen, Nishanth Kumar, Sahit Chintalapudi, Jie Wang, Christopher Watson, Edward Hu, Jing Cao, Dinesh Jayaraman, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Comments Project website: https://tiptop-robot.github.io

2603.09968 2026-03-11 cs.CV

ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare

Freeman Cheng, Botao Ye, Xueting Li, Junqi You, Fangneng Zhan, Ming-Hsuan Yang

2603.09961 2026-03-11 cs.RO cs.AI cs.CV

BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion

Xinyu Gao, Gang Chen, Javier Alonso-Mora

Comments 8 pages. Project page: https://xin-yu-gao.github.io/beacon

2603.09956 2026-03-11 cs.RO

Kinodynamic Motion Retargeting for Humanoid Locomotion via Multi-Contact Whole-Body Trajectory Optimization

Xiaoyu Zhang, Steven Haener, Varun Madabushi, Maegan Tucker

2603.09955 2026-03-11 cs.CV cs.LG

From Semantics to Pixels: Coarse-to-Fine Masked Autoencoders for Hierarchical Visual Understanding

Wenzhao Xiang, Yue Wu, Hongyang Yu, Feng Gao, Fan Yang, Xilin Chen

2603.09953 2026-03-11 cs.CV

Leveraging whole slide difficulty in Multiple Instance Learning to improve prostate cancer grading

Marie Arrivat, Rémy Peyret, Elsa Angelini, Pietro Gori

Comments ISBI 2026

2603.09952 2026-03-11 cs.LG cs.NA cs.SY eess.SY math.NA math.OC stat.ML

On the Width Scaling of Neural Optimizers Under Matrix Operator Norms I: Row/Column Normalization and Hyperparameter Transfer

Ruihan Xu, Jiajin Li, Yiping Lu

详情

英文摘要

A central question in modern deep learning is how to design optimizers whose behavior remains stable as the network width $w$ increases. We address this question by interpreting several widely used neural-network optimizers, including \textrm{AdamW} and \textrm{Muon}, as instances of steepest descent under matrix operator norms. This perspective links optimizer geometry with the Lipschitz structure of the network forward map, and enables width-independent control of both Lipschitz and smoothness constants. However, steepest-descent rules induced by standard $p \to q$ operator norms lack layerwise composability and therefore cannot provide width-independent bounds in deep architectures. We overcome this limitation by introducing a family of mean-normalized operator norms, denoted $\pmean \to \qmean$, that admit layerwise composability, yield width-independent smoothness bounds, and give rise to practical optimizers such as \emph{rescaled} \textrm{AdamW}, row normalization, and column normalization. The resulting learning rate width-aware scaling rules recover $μ$P scaling~\cite{yang2021tensor} as a special case and provide a principled mechanism for cross-width learning-rate transfer across a broad class of optimizers. We further show that \textrm{Muon} can suffer an $\mathcal{O}(\sqrt{w})$ worst-case growth in the smoothness constant, whereas a new family of row-normalized optimizers we propose achieves width-independent smoothness guarantees. Based on the observations, we propose MOGA (Matrix Operator Geometry Aware), a width-aware optimizer based only on row/column-wise normalization that enables stable learning-rate transfer across model widths. Large-scale pre-training on GPT-2 and LLaMA shows that MOGA, especially with row normalization, is competitive with Muon while being notably faster in large-token and low-loss regimes.

URL PDF HTML ☆

赞 0 踩 0

2603.09951 2026-03-11 cs.LG cs.AI cs.SE

Towards a Neural Debugger for Python

Maximilian Beck, Jonas Gehring, Jannik Kossen, Gabriel Synnaeve

Comments 22 pages

2603.09950 2026-03-11 cs.LG cs.AI

When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic

Alberto Fernández-Hernández, Cristian Pérez-Corral, Jose I. Mestre, Manuel F. Dolz, Jose Duato, Enrique S. Quintana-Ortí

2603.09947 2026-03-11 cs.AI

The Confidence Gate Theorem: When Should Ranked Decision Systems Abstain?

Ronald Doku

2603.09945 2026-03-11 cs.CV cs.AI

No Image, No Problem: End-to-End Multi-Task Cardiac Analysis from Undersampled k-Space

Yundi Zhang, Sevgi Gokce Kafali, Niklas Bubeck, Daniel Rueckert, Jiazhen Pan

2603.09940 2026-03-11 cs.LG

SignalMC-MED: A Multimodal Benchmark for Evaluating Biosignal Foundation Models on Single-Lead ECG and PPG

Fredrik K. Gustafsson, Xiao Gu, Mattia Carletti, Patitapaban Palo, David W. Eyre, David A. Clifton

Comments Code is available at https://github.com/fregu856/SignalMC-MED

2603.09932 2026-03-11 cs.CV

Unsupervised Domain Adaptation with Target-Only Margin Disparity Discrepancy

Gauthier Miralles, Loïc Le Folgoc, Vincent Jugnon, Pietro Gori

Comments ISBI 2026

2603.09931 2026-03-11 cs.CV cs.AI

Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation

Rong Zhou, Houliang Zhou, Yao Su, Brian Y. Chen, Yu Zhang, Lifang He, Alzheimer's Disease Neuroimaging Initiative

2603.09930 2026-03-11 cs.CV cs.IR

Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction

Yao Zhang, Zhuchenyang Liu, Yanlan He, Thomas Ploetz, Yu Xiao

2603.09925 2026-03-11 cs.CV cs.GR

On the Structural Failure of Chamfer Distance in 3D Shape Optimization

Chang-Yong Song, David Hyde

Comments 27 pages, including supplementary material

2603.09908 2026-03-11 cs.RO cs.SY eess.SY

NanoBench: A Multi-Task Benchmark Dataset for Nano-Quadrotor System Identification, Control, and State Estimation

Syed Izzat Ullah, Jose Baca

Comments 9 pages, 6 figures

2603.09906 2026-03-11 cs.CL

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Zorik Gekhman, Roee Aharoni, Eran Ofek, Mor Geva, Roi Reichart, Jonathan Herzig

2603.09896 2026-03-11 cs.CV

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

Yuchen Yang, Yuqing Shao, Duxiu Huang, Linfeng Dong, Yifei Liu, Suixin Tang, Xiang Zhou, Yuanyuan Gao, Wei Wang, Yue Zhou, Xue Yang, Yanfeng Wang, Xiao Sun, Zhihang Zhong

2603.09892 2026-03-11 cs.LG cs.AI cs.CL

MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning

Yiyang Lu, Yu He, Jianlong Chen, Hongyuan Zha

2603.09890 2026-03-11 cs.AI cs.MA

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

Hongbo Bo, Jingyu Hu, Weiru Liu

2603.09884 2026-03-11 cs.CL cs.CY

Benchmarking Political Persuasion Risks Across Frontier Large Language Models

Zhongren Chen, Joshua Kalla, Quan Le

2603.09883 2026-03-11 cs.CV

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Jiazhi Guan, Quanwei Yang, Luying Huang, Junhao Liang, Borong Liang, Haocheng Feng, Wei He, Kaisiyuan Wang, Hang Zhou, Jingdong Wang

2603.09877 2026-03-11 cs.CV

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Changyao Tian, Danni Yang, Guanzhou Chen, Erfei Cui, Zhaokai Wang, Yuchen Duan, Penghao Yin, Sitao Chen, Ganlin Yang, Mingxin Liu, Zirun Zhu, Ziqian Fan, Leyao Gu, Haomin Wang, Qi Wei, Jinhui Yin, Xue Yang, Zhihang Zhong, Qi Qin, Yi Xin, Bin Fu, Yihao Liu, Jiaye Ge, Qipeng Guo, Gen Luo, Hongsheng Li, Yu Qiao, Kai Chen, Hongjie Zhang

Comments technical report, 61 pages, https://github.com/OpenGVLab/InternVL-U

2603.09874 2026-03-11 cs.CV

MissBench: Benchmarking Multimodal Affective Analysis under Imbalanced Missing Modalities

Tien Anh Pham, Phuong-Anh Nguyen, Duc-Trong Le, Cam-Van Thi Nguyen

2603.09872 2026-03-11 cs.CL

N-gram-like Language Models Predict Reading Time Best

James A. Michaelov, Roger P. Levy

2603.09868 2026-03-11 cs.LG physics.ao-ph

CarbonBench: A Global Benchmark for Upscaling of Carbon Fluxes Using Zero-Shot Learning

Aleksei Rozanov, Arvind Renganathan, Yimeng Zhang, Vipin Kumar

2603.09865 2026-03-11 cs.LG

GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection

Kai Yao, Zhenghan Song, Kaixin Wu, Mingjie Zhong, Danzhao Cheng, Zhaorui Tan, Yixin Ji, Penglei Gao