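arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.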

2604.01563 2026-04-03 cs.AI cs.LG

Does Your Optimizer Care How You Normalize? Normalization-Optimizer Coupling in LLM Training

Abdelrahman Abouzeid

Comments 16 pages, 8 figures. Preprint. Under review

English abstract

In LLM training, normalization layers and optimizers are typically treated as independent design choices. In a 3x2 factorial at 1B parameters and 1000 training steps, we show this assumption can fail: Dynamic Erf (Derf; Chen & Liu, 2025) suffers a large negative interaction with Muon (Jordan, 2024), with its gap to RMSNorm growing from +0.31 nats under AdamW to +0.97 under Muon, approximately three times larger. Dynamic Tanh (DyT; Zhu et al., 2025), included as a bounded-normalizer control, shows no such penalty. Our evidence points to two failure modes of erf under Muon's faster spectral-norm growth: saturation (lossy compression) and scale blindness (discarding activation magnitude). An EMA-blend that reintroduces running scale estimates recovers ~84% of the gap. Separately, reducing Derf's alpha from its published default (0.5 to 0.3) recovers ~80% by keeping erf in its near-linear regime, where it approximately preserves relative scale; this setting is not the published default of Chen & Liu (2025). Using Derf's published default alpha with Muon incurs a 0.66-nat interaction penalty without producing NaNs or divergence, making the failure easy to miss in short pilot runs.
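The 0.66-nat interaction penalty quoted in the abstract is consistent with a difference-in-differences reading of the two reported gaps. A minimal sketch, assuming the penalty is simply the Derf-to-RMSNorm gap under Muon minus the same gap under AdamW (the function and variable names are illustrative and do not come from the paper):

```python
def interaction_penalty(gap_adamw: float, gap_muon: float) -> float:
    """Difference-in-differences: how much the normalizer gap widens
    when the optimizer is switched from AdamW to Muon (assumed definition)."""
    return gap_muon - gap_adamw


# Gaps of Derf relative to RMSNorm, in nats, as stated in the abstract.
gap_adamw = 0.31
gap_muon = 0.97

penalty = interaction_penalty(gap_adamw, gap_muon)
ratio = gap_muon / gap_adamw

print(f"interaction penalty: {penalty:.2f} nats")  # interaction penalty: 0.66 nats
print(f"gap ratio (Muon / AdamW): {ratio:.2f}")    # gap ratio (Muon / AdamW): 3.13
```

Under this reading, the ratio of about 3.13 matches the abstract's "approximately three times larger".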

2604.01561 2026-04-03 cs.CV cs.AI

ReFlow: Self-correction Motion Learning for Dynamic Scene Reconstruction

Yanzhe Liang, Ruijie Zhu, Hanzhi Chang, Zhuoyuan Li, Jiahao Lu, Tianzhu Zhang

Comments Project page: https://rosetta-leong.github.io/ReFlow_Page/

2604.01560 2026-04-03 cs.CL

DeltaMem: Towards Agentic Memory Management via Reinforcement Learning

Qi Zhang, Shen Huang, Chu Liu, Shouqing Yang, Junbo Zhao, Haobo Wang, Pengjun Xie

Comments preprint, under review

2604.01553 2026-04-03 cs.CV

Cross-Domain Vessel Segmentation via Latent Similarity Mining and Iterative Co-Optimization

Zhanqiang Guo, Jianjiang Feng, Jie Zhou

2604.01552 2026-04-03 cs.LG

ZEUS: Accelerating Diffusion Models with Only Second-Order Predictor

Yixiao Wang, Ting Jiang, Zishan Shao, Hancheng Ye, Jingwei Sun, Mingyuan Ma, Jianyi Zhang, Yiran Chen, Hai Li

2604.01550 2026-04-03 cs.CV

Prototype-Based Low Altitude UAV Semantic Segmentation

Da Zhang, Gao Junyu, Zhao Zhiyuan

Comments Accepted to ICME 2026

2604.01545 2026-04-03 cs.AI

RAE-AR: Taming Autoregressive Models with Representation Autoencoders

Hu Yu, Hang Xu, Jie Huang, Zeyue Xue, Haoyang Huang, Nan Duan, Feng Zhao

2604.01542 2026-04-03 cs.CV physics.optics

Universal computational thermal imaging overcoming the ghosting effect

Hongyi Xu, Du Wang, Chenjun Zhao, Jiashuo Chen, Jiale Lin, Liqin Cao, Yanfei Zhong, Yiyuan She, Fanglin Bao

Comments 9 pages, 6 figures

2604.01538 2026-04-03 cs.CL cs.AI

Countering Catastrophic Forgetting of Large Language Models for Better Instruction Following via Weight-Space Model Merging

Mengxian Lyu, Cheng Peng, Ziyi Chen, Mengyuan Zhang, Jieting Li Lu, Yonghui Wu

2604.01535 2026-04-03 cs.CL

A Role-Based LLM Framework for Structured Information Extraction from Healthy Food Policies

Congjing Zhang, Ruoxuan Bao, Jingyu Li, Yoav Ackerman, Shuai Huang, Yanfang Su

2604.01526 2026-04-03 cs.LG

Learning ECG Image Representations via Dual Physiological-Aware Alignments

Hung Manh Pham, Jialu Tang, Aaqib Saeed, Dong Ma, Bin Zhu, Pan Zhou

2604.01523 2026-04-03 cs.RO

Robust Autonomous Control of a Magnetic Millirobot in In Vitro Cardiac Flow

Anuruddha Bhattacharjee, Xinhao Chen, Lamar O. Mair, Suraj Raval, Yancy Diaz-Mercado, Axel Krieger

2604.01520 2026-04-03 cs.AI

LLM Agents as Social Scientists: A Human-AI Collaborative Platform for Social Science Automation

Lei Wang, Yuanzi Li, Jinchao Wu, Heyang Gao, Xiaohe Bo, Xu Chen, Ji-Rong Wen

2604.01514 2026-04-03 cs.CL cs.CV

Why Instruction-Based Unlearning Fails in Diffusion Models?

Zeliang Zhang, Rui Sun, Jiani Liu, Qi Wu, Chenliang Xu

2604.01506 2026-04-03 cs.LG

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

Zhanliang Wang, Hongzhuo Chen, Quan Minh Nguyen, Mian Umair Ahsan, Kai Wang

Comments Preprint

2604.01504 2026-04-03 cs.CL cs.AI cs.CY

Magic, Madness, Heaven, Sin: LLM Output Diversity is Everything, Everywhere, All at Once

Harnoor Dhingra

Comments Under review

2604.01499 2026-04-03 cs.LG

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

William Hoy, Binxu Wang, Xu Pan

2604.01490 2026-04-03 cs.RO

Distal-Stable Beam for Continuum Robots

Ryouichi Saito, Takahiro Koide, Yuya Tanaka, Yasutaka Nakashima, Motoji Yamamoto, Ayato Kanada

Comments 8 pages, 7 figures

2604.01481 2026-04-03 cs.LG cs.AI

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

Arshia Ilaty, Hossein Shirazi, Amir Rahmani, Hajar Homayouni

2604.01480 2026-04-03 cs.AI physics.comp-ph

A Self-Evolving Agentic Framework for Metasurface Inverse Design

Yi Huang, Bowen Zheng, Yunxi Dong, Hong Tang, Huan Zhao, S. M. Rakibul Hasan Shawon, Hualiang Zhang

2604.01477 2026-04-03 cs.LG cs.SY eess.SY

Soft MPCritic: Amortized Model Predictive Value Iteration

Thomas Banker, Nathan P. Lawrence, Ali Mesbah

Comments submitted to CDC 2026

2604.01476 2026-04-03 cs.LG cs.CL

When Reward Hacking Rebounds: Understanding and Mitigating It with Representation-Level Signals

Rui Wu, Ruixiang Tang

Comments 15 pages, 8 figures

2604.01474 2026-04-03 cs.CV cs.LG

Prime Once, then Reprogram Locally: An Efficient Alternative to Black-Box Service Model Adaptation

Yunbei Zhang, Chengyi Cai, Feng Liu, Jihun Hamm

Comments CVPR 2026

2604.01467 2026-04-03 cs.CL cs.AI

A Dynamic Atlas of Persian Poetic Symbolism: Families, Fields, and the Historical Rewiring of Meaning

Kourosh Shahnazari, Seyed Moein Ayyoubzadeh, Mohammadali Keshtparvar

2604.01466 2026-04-03 cs.RO cs.CV cs.LG

Efficient Equivariant Transformer for Self-Driving Agent Modeling

Scott Xu, Dian Chen, Kelvin Wong, Chris Zhang, Kion Fallah, Raquel Urtasun

Comments CVPR 2026

2604.01461 2026-04-03 cs.AI

Reducing Hallucinations in LLM-based Scientific Literature Analysis Using Peer Context Outlier Detection

Daniel Xie, Maxwell J. Jacobson, Adil Wazeer, Haiyan Wang, Xinghang Zhang, Yexiang Xue

2604.01460 2026-04-03 cs.CV

Reinforcing Consistency in Video MLLMs with Structured Rewards

Yihao Quan, Zeru Shi, Jinman Zhao, Ruixiang Tang

2604.01457 2026-04-03 cs.CL

Wired for Overconfidence: A Mechanistic Perspective on Inflated Verbalized Confidence in LLMs

Tianyi Zhao, Yinhan He, Wendy Zheng, Yujie Zhang, Chen Chen

2604.01453 2026-04-03 cs.CV

Nonlinear Methods for Analyzing Pose in Behavioral Research

Carter Sale, Margaret C. Macpherson, Gaurav Patil, Kelly Miles, Rachel W. Kallen, Sebastian Wallot, Michael J. Richardson

Comments 40 pages, 13 figures