arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.01310 2026-04-03 cs.CV

Sparse Spectral LoRA: Routed Experts for Medical VLMs

Omid Nejati Manzari, Hojat Asgariandehkordi, Taha Koleilat, Yiming Xiao, Hassan Rivaz

详情

英文摘要

Large vision-language models (VLMs) excel on general benchmarks but often lack robustness in medical imaging, where heterogeneous supervision induces cross-dataset interference and sensitivity to data regime (i.e., how the supervisory signals are mixed). In realistic clinical workflows, data and tasks arrive sequentially, so naive continual training further leads to catastrophic forgetting. To address these challenges, we propose MedQwen, a parameter-efficient medical VLM that couples a spectrally routed Mixture-of-Experts (MoE) with a theoretically grounded scaling rule that aligns low-rank updates with a full-rank, fully fine-tuned MoE, without changing the base architecture. Concretely, we initialize each expert from non-overlapping singular value decomposition (SVD) segments of the pretrained weight and introduce a residual compensation and scaling scheme to enable stable expert specialization and consistent routing under distribution shift. Across 23 medical datasets covering visual question answering, report generation, radiology classification, and hallucination mitigation, MedQwen achieves strong, reliable performance: it approaches full fine-tuning on zero-shot classification with 339$\times$ fewer trainable parameters, and reduces sequential forgetting to $\sim$5\% where strong baselines degrade by $>$20-50\%.

URL PDF HTML ☆

赞 0 踩 0

2604.01308 2026-04-03 cs.LG cs.CE math.OC

An Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance Analysis

Oluwamayowa O. Amusat, Luka Grbcic, Remi Patureau, M. Jibran S. Zuberi, Dan Gunter, Michael Wetter

2604.01305 2026-04-03 cs.LG cs.CE

UQ-SHRED: uncertainty quantification of shallow recurrent decoder networks for sparse sensing via engression

Mars Liyao Gao, Yuxuan Bao, Amy S. Rude, Xinwei Shen, J. Nathan Kutz

2604.01302 2026-04-03 cs.CL

Scaling Reasoning Tokens via RL and Parallel Thinking: Evidence From Competitive Programming

Qianfan Zhang, Tianyu Guo, Xuandi Ren, Jiale Chen, Ming Ding, Ran Xin, Xia Xiao

2604.01298 2026-04-03 cs.LG

Forecasting Supply Chain Disruptions with Foresight Learning

Benjamin Turtel, Paul Wilczewski, Kris Skotheim

2604.01280 2026-04-03 cs.CV cs.AI cs.CL

Look Twice: Training-Free Evidence Highlighting in Multimodal Large Language Models

Marco Morini, Sara Sarto, Marcella Cornia, Lorenzo Baraldi

Comments Project Page: https://aimagelab.github.io/LoT/

2604.01279 2026-04-03 cs.LG cs.AI hep-th math.OC

Sven: Singular Value Descent as a Computationally Efficient Natural Gradient Method

Samuel Bright-Thonney, Thomas R. Harvey, Andre Lukas, Jesse Thaler

2604.01268 2026-04-03 cs.CL cs.AI

The Overlooked Repetitive Lengthening Form in Sentiment Analysis

Lei Wang, Eduard Dragut

Comments Findings of EMNLP 2024

2604.01261 2026-04-03 cs.LG cs.AI

DySCo: Dynamic Semantic Compression for Effective Long-term Time Series Forecasting

Xiang Ao, Yinyu Tan, Mengru Chen

Comments 9 pages, 7 figures

2604.01259 2026-04-03 cs.RO

Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models

Xiaosong Jia, Yuqian Shao, Zhenjie Yang, Qifeng Li, Zhiyuan Zhang, Junchi Yan

Comments All codes and annotated datasets are available at \url{https://github.com/Thinklab-SJTU/Bench2Drive-VL} and \url{https://huggingface.co/datasets/Telkwevr/Bench2Drive-VL-base}

2604.01254 2026-04-03 cs.RO eess.IV

Simulating Realistic LiDAR Data Under Adverse Weather for Autonomous Vehicles: A Physics-Informed Learning Approach

Vivek Anand, Bharat Lohani, Rakesh Mishra, Gaurav Pandey

2604.01251 2026-04-03 cs.CV eess.IV

Camouflage-aware Image-Text Retrieval via Expert Collaboration

Yao Jiang, Zhongkuan Mao, Xuan Wu, Keren Fu, Qijun Zhao

2604.01247 2026-04-03 cs.SD eess.AS

Combining Masked Language Modeling and Cross-Modal Contrastive Learning for Prosody-Aware TTS

Kirill Borodin, Vasiliy Kudryavtsev, Maxim Maslov, Nikita Vasiliev, Mikhail Gorodnichev, Grach Mkrtchian

Comments This paper has been submitted to Interspeech 2026 for review

2604.01235 2026-04-03 cs.AI

Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology

Zhou Hanlin, Chan Huah Yong

2604.01234 2026-04-03 cs.CV cs.AI eess.IV

CLPIPS: A Personalized Metric for AI-Generated Image Similarity

Khoi Trinh, Jay Rothenberger, Scott Seidenberger, Dimitrios Diochnos, Anindya Maiti

2604.01226 2026-04-03 cs.CV cs.SE

DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation

Xinhao Huang, Jinke Yu, Wenhao Xu, Zeyi Wen, Ying Zhou, Junzhuo Liu, Junhao Ji, Zulong Chen

2604.00979 2026-04-03 cs.CL cs.AI

Dual Optimal: Make Your LLM Peer-like with Dignity

Xiangqi Wang, Yue Huang, Haomin Zhuang, Kehan Guo, Xiangliang Zhang

2604.00830 2026-04-03 cs.LG cs.AI

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Zhanzhi Lou, Hui Chen, Yibo Li, Qian Wang, Bryan Hooi

2604.00653 2026-04-03 cs.LG

Chameleons do not Forget: Prompt-Based Online Continual Learning for Next Activity Prediction

Marwan Hassani, Tamara Verbeek, Sjoerd van Straten

Comments This paper has been accepted for publication in the International Journal of Cooperative Information Systems

2604.00528 2026-04-03 cs.CV cs.AI

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Haibo Wang, Zihao Lin, Zhiyang Xu, Lifu Huang

2604.00513 2026-04-03 cs.LG cs.AI cs.CV cs.IR

MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding

Junxian Wu, Chenghan Fu, Zhanheng Nie, Daoze Zhang, Bowen Wan, Wanxian Guan, Chuan Yu, Jian Xu, Bo Zheng

Comments 10 pages, 6 figures

2604.00394 2026-04-03 cs.LG cs.AI

Deep Networks Favor Simple Data

Weyl Lu, Chenjie Hao, Yubei Chen

Comments 16 pages, 9 figures

2603.29376 2026-04-03 cs.CV

Assessing Multimodal Chronic Wound Embeddings with Expert Triplet Agreement

Fabian Kabus, Julia Hindel, Jelena Bratulić, Meropi Karakioulaki, Ayush Gupta, Cristina Has, Thomas Brox, Abhinav Valada, Harald Binder

2603.29353 2026-04-03 cs.AI

Nomad: Autonomous Exploration and Discovery

Bokang Jia, Samta Kamboj, Satheesh Katipomu, Seung Hun Han, Neha Sengupta, Andrew Jackson

2603.29272 2026-04-03 cs.CV cs.GR cs.RO

MaskAdapt: Learning Flexible Motion Adaptation via Mask-Invariant Prior for Physics-Based Characters

Soomin Park, Eunseong Lee, Kwang Bin Lee, Sung-Hee Lee

Comments CVPR 2026

2603.29247 2026-04-03 cs.CL cs.AI cs.LG

MemRerank: Preference Memory for Personalized Product Reranking

Zhiyuan Peng, Xuyang Wu, Huaixiao Tou, Yi Fang, Yu Gong

Comments correct author name in metadata

2603.29245 2026-04-03 cs.CV

Monocular Building Height Estimation from PhiSat-2 Imagery: Dataset and Method

Yanjiao Song, Bowen Cai, Timo Balz, Zhenfeng Shao, Neema Simon Sumari, James Magidi, Walter Musakwa

2603.29200 2026-04-03 cs.LG cs.AI

Improving Ensemble Forecasts of Abnormally Deflecting Tropical Cyclones with Fused Atmosphere-Ocean-Terrain Data

Qixiang Li, Yuan Zhou, Shuwei Huo, Chong Wang, Xiaofeng Li

2603.28650 2026-04-03 cs.LG cs.AI stat.ML

Information-Theoretic Limits of Safety Verification for Self-Improving Systems

Arsenios Scrivens

Comments 27 pages, 6 figures. Companion empirical paper: doi:10.5281/zenodo.19237566

2603.28590 2026-04-03 cs.AI

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

Han Wang, Yifan Sun, Brian Ko, Mann Talati, Jiawen Gong, Zimeng Li, Naicheng Yu, Xucheng Yu, Wei Shen, Vedant Jolly, Huan Zhang

Comments 57 pages