arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.09952 2026-03-11 cs.LG cs.NA cs.SY eess.SY math.NA math.OC stat.ML

On the Width Scaling of Neural Optimizers Under Matrix Operator Norms I: Row/Column Normalization and Hyperparameter Transfer

Ruihan Xu, Jiajin Li, Yiping Lu

详情

英文摘要

A central question in modern deep learning is how to design optimizers whose behavior remains stable as the network width $w$ increases. We address this question by interpreting several widely used neural-network optimizers, including \textrm{AdamW} and \textrm{Muon}, as instances of steepest descent under matrix operator norms. This perspective links optimizer geometry with the Lipschitz structure of the network forward map, and enables width-independent control of both Lipschitz and smoothness constants. However, steepest-descent rules induced by standard $p \to q$ operator norms lack layerwise composability and therefore cannot provide width-independent bounds in deep architectures. We overcome this limitation by introducing a family of mean-normalized operator norms, denoted $\pmean \to \qmean$, that admit layerwise composability, yield width-independent smoothness bounds, and give rise to practical optimizers such as \emph{rescaled} \textrm{AdamW}, row normalization, and column normalization. The resulting learning rate width-aware scaling rules recover $μ$P scaling~\cite{yang2021tensor} as a special case and provide a principled mechanism for cross-width learning-rate transfer across a broad class of optimizers. We further show that \textrm{Muon} can suffer an $\mathcal{O}(\sqrt{w})$ worst-case growth in the smoothness constant, whereas a new family of row-normalized optimizers we propose achieves width-independent smoothness guarantees. Based on the observations, we propose MOGA (Matrix Operator Geometry Aware), a width-aware optimizer based only on row/column-wise normalization that enables stable learning-rate transfer across model widths. Large-scale pre-training on GPT-2 and LLaMA shows that MOGA, especially with row normalization, is competitive with Muon while being notably faster in large-token and low-loss regimes.

URL PDF HTML ☆

赞 0 踩 0

2603.09942 2026-03-11 eess.SY cs.AI cs.NI cs.SY

Towards Flexible Spectrum Access: Data-Driven Insights into Spectrum Demand

Mohamad Alkadamani, Amir Ghasemi, Halim Yanikomeroglu

Comments 7 pages, 5 figures. Presented at IEEE VTC 2024, Washington, DC. Published in the IEEE conference proceedings

2603.09918 2026-03-11 eess.SY cs.SY

Emergency Locator Transmitters in the Era of More Electric Aircraft: A Comprehensive Review of Energy, Integration and Safety Challenges

Juana M. Martínez-Heredia, Adrián Portos, Marcel Štěpánek, Francisco Colodro

2603.09916 2026-03-11 eess.SY cs.AI cs.SY

AI-Enabled Data-driven Intelligence for Spectrum Demand Estimation

Colin Brown, Mohamad Alkadamani, Halim Yanikomeroglu

Comments Presented at an IEEE ICC 2025 Workshop and published in the conference proceedings

2603.09908 2026-03-11 cs.RO cs.SY eess.SY

NanoBench: A Multi-Task Benchmark Dataset for Nano-Quadrotor System Identification, Control, and State Estimation

Syed Izzat Ullah, Jose Baca

Comments 9 pages, 6 figures

2603.09904 2026-03-11 eess.SY cs.SY

Dynamic Average Consensus with Privacy Guarantees and Its Application to Battery Energy Storage Systems

Mihitha Maithripala, Chenyang Qiu, Zongli Lin

2603.09894 2026-03-11 eess.SY cs.SY

A Survey on Cloud-Based 6G Deployments: Current Solutions, Future Directions and Open Challenges

Tolga O. Atalay, Alireza Famili, Amirreza Ghafoori, Angelos Stavrou

Comments 47 pages, 403 citations, 21 figures, journal

2603.09893 2026-03-11 eess.SP

Efficient, Adaptive Near-Field Beam Training based on Linear Bandit

Junchi Liu, Zijun Wang, Rui Zhang

Comments This paper is submitted to IEEE Wireless Communication Letter

2603.09878 2026-03-11 eess.SY cs.SY

Field Free Novel Architecture for Spintronic Flash Analog to Digital Converter

Abin Francis, Nikhil Kumar, Prince Philip

Comments 9 pages incluinding 2 pages of reference, 11 figures and 2 tables. Invited and presented at conference(ICMAGMA,2024)

2603.09859 2026-03-11 cs.LG cs.AI cs.NI cs.SY eess.SY

A Graph-Based Approach to Spectrum Demand Prediction Using Hierarchical Attention Networks

Mohamad Alkadamani, Halim Yanikomeroglu, Amir Ghasemi

Comments 7 pages, 6 figures. Presented at IEEE GLOBECOM 2025, Taiwan. To appear in the conference proceedings

2603.09840 2026-03-11 eess.IV cs.CV

CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy

Su Yan, Clara Rodrigo Gonzalez, Vincent C. H. Leung, Herman Verinaz-Jadan, Jiakang Chen, Matthieu Toulemonde, Kai Riemer, Jipeng Yan, Clotilde Vié, Qingyuan Tan, Peter D. Weinberg, Pier Luigi Dragotti, Kevin G. Murphy, Meng-Xing Tang

Comments 43 pages, 14 figures, 2 tables, journal

详情

英文摘要

Super-resolution ultrasound via microbubble (MB) localisation and tracking, also known as ultrasound localisation microscopy (ULM), can resolve microvasculature beyond the acoustic diffraction limit. However, significant challenges remain in localisation performance and data acquisition and processing time. Deep learning methods for ULM have shown promise to address these challenges, however, they remain limited by in vivo label scarcity and the simulation-to-reality domain gap. We present CycleULM, the first unified label-free deep learning framework for ULM. CycleULM learns a physics-emulating translation between the real contrast-enhanced ultrasound (CEUS) data domain and a simplified MB-only domain, leveraging the power of CycleGAN without requiring paired ground truth data. With this translation, CycleULM removes dependence on high-fidelity simulators or labelled data, and makes MB localisation and tracking substantially easier. Deployed as modular plug-and-play components within existing pipelines or as an end-to-end processing framework, CycleULM delivers substantial performance gains across both in silico and in vivo datasets. Specifically, CycleULM improves image contrast (contrast-to-noise ratio) by up to 15.3 dB and sharpens CEUS resolution with a 2.5{\times} reduction in the full width at half maximum of the point spread function. CycleULM also improves MB localisation performance, with up to +40% recall, +46% precision, and a -14.0 μm mean localisation error, yielding more faithful vascular reconstructions. Importantly, CycleULM achieves real-time processing throughput at 18.3 frames per second with order-of-magnitude speed-ups (up to ~14.5{\times}). By combining label-free learning, performance enhancement, and computational efficiency, CycleULM provides a practical pathway toward robust, real-time ULM and accelerates its translation to clinical applications.

URL PDF HTML ☆

赞 0 踩 0

2603.09814 2026-03-11 eess.SY cs.SY

Learning-Augmented Primal-Dual Control Design for Secondary Frequency Regulation

Yixuan Yu, Rajni K. Bansal, Yan Jiang, Pengcheng You

2603.09808 2026-03-11 eess.SP

A Hybrid Model-Assisted Approach for Path Loss Prediction in Suburban Scenarios

Chenlong Wang, Bo Ai, Ruiming Chen, Ruisi He, Mi Yang, Yuxin Zhang, Weirong Liu, Liu Liu

2603.09807 2026-03-11 physics.optics cs.ET cs.SY eess.SY

Experimental Characterization of Biological Tissue Dielectric Properties through THz Time-Domain Spectroscopy

Elisabetta Marini, Silvia Mura, Marco Hernandez, Matti Hamalainen, Maurizio Magarini

Comments To be published in EAI BODYNETS 2025

2603.09791 2026-03-11 cs.ET eess.SP

Trade-Offs in FMCW Radar-Based Respiration and Heart Rate Variability

Silvia Mura, Davide Scazzoli, Lorenzo Fineschi, Maurizio Magarini

Comments to be published in EAI BODYNETS 2025

2603.09760 2026-03-11 cs.CV cs.RO eess.IV

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments

Guoliang Zhu, Wanjun Jia, Caoyang Shao, Yuheng Zhang, Zhiyong Li, Kailun Yang

Comments The source code and benchmark dataset will be made publicly available at https://github.com/GL-ZHU925/PanoAffordanceNet

2603.09737 2026-03-11 cs.CV cs.RO eess.IV

$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs

Kaixin Lin, Kunyu Peng, Di Wen, Yufan Chen, Ruiping Liu, Kailun Yang

Comments The source code will be publicly released at https://github.com/qixi7up/M2-Occ

2603.09729 2026-03-11 q-bio.NC cs.RO cs.SY eess.SY

Efficient and robust control with spikes that constrain free energy

André Urbano, Pablo Lanillos, Sander Keemink

2603.09725 2026-03-11 eess.AS

A Semi-spontaneous Dutch Speech Dataset for Speech Enhancement and Speech Recognition

Dimme de Groot, Yuanyuan Zhang, Jorge Martinez, Odette Scharenborg

Comments Submitted to Interspeech 2026

2603.09714 2026-03-11 cs.SD cs.AI cs.CL eess.AS

MUGEN: Evaluating and Improving Multi-audio Understanding of Large Audio-Language Models

Chih-Kai Yang, Yun-Shao Tsai, Yu-Kai Guo, Ping-Le Tsai, Yen-Ting Piao, Hung-Wei Chen, Ting-Lin Hsiao, Yun-Man Hsu, Ke-Han Lu, Hung-yi Lee

Comments 6 pages, 3 figures, 3 tables. Dataset: https://huggingface.co/Multi-Audio-Grounding

2603.09671 2026-03-11 eess.SY cs.SY

Embedded Model Predictive Control for EMS-type Maglev Vehicles

Arnim Kargl, Mario Hermle, Zhiqiang Zhang, Yanmin Li, Dainan Zhao, Yong Cui, Peter Eberhard

2603.09657 2026-03-11 cs.CV cs.AI cs.ET eess.IV

When to Lock Attention: Training-Free KV Control in Video Diffusion

Tianyi Zeng, Jincheng Gao, Tianyi Wang, Zijie Meng, Miao Zhang, Jun Yin, Haoyuan Sun, Junfeng Jiao, Christian Claudel, Junbo Tan, Xueqian Wang

Comments 18 pages, 9 figures, 3 tables

2603.09644 2026-03-11 eess.SP cs.IT math.IT

Site-Specific Finetuning of Neural Receivers with Real-World 5G NR Measurements

Nuri Berke Baytekin, Reinhard Wiesmayr, Sebastian Cammerer, Chris Dick, Christoph Studer

Comments This work has been submitted to the 2026 IEEE 27th International Workshop on Signal Processing and Artificial Intelligence in Wireless Communications (IEEE SPAWC 2026)

2603.09627 2026-03-11 eess.AS

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models

Dehua Tao, Xuan Luo, Daxin Tan, Kai Chen, Lanqing Hong, Jing Li, Ruifeng Xu, Xiao Chen

2603.09617 2026-03-11 eess.SY cs.SY

Constrained finite-time stabilization by model predictive control: an infinite control horizon framework

Bing Zhu, Xiaozhuoer Yuan, Zewei Zheng, Zongyu Zuo

Comments 10 pages, 5 figures

2603.09590 2026-03-11 cs.CR eess.SP

Benchmarking Dataset for Presence-Only Passive Reconnaissance in Wireless Smart-Grid Communications

Bochra Al Agha, Razane Tajeddine

2603.09579 2026-03-11 eess.SP

Low-Rank Cyclostationarity Predictive Routing Is Almost as Good as Real-Time Data-based Routing

Oriel-Singer, Ilai-Bistritz, Giseung-Park, Woohyeon-Byeon, Youngchul-Sung, Amir-Leshem

Comments 4 figures, 2 tables

2603.09577 2026-03-11 cs.IT cs.CR cs.DC cs.SC eess.SP math.IT

Randomized Distributed Function Computation (RDFC): Ultra-Efficient Semantic Communication Applications to Privacy

Onur Günlü

2603.09508 2026-03-11 eess.AS

A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration

Bunlong Lay, Timo Gerkmann

2603.09505 2026-03-11 eess.AS

End-to-End Direction-Aware Keyword Spotting with Spatial Priors in Noisy Environments

Rui Wang, Zhifei Zhang, Yu Gao, Xiaofeng Mou, Yi Xu

Comments Submitted for review to Interspeech 2026