arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.21862 2026-03-24 cs.LG

Holistic Scaling Laws for Optimal Mixture-of-Experts Architecture Optimization

Weilin Wan, Jingtao Han, Weizhong Zhang, Cheng Jin

详情

英文摘要

Scaling laws for Large Language Models govern macroscopic resource allocation, yet translating them into precise Mixture-of-Experts (MoE) architectural configurations remains an open problem due to the combinatorially vast design space. Existing MoE scaling studies are constrained by experimental budgets to either augment scaling formulas with extra MoE variables, risking unreliable fits, or fix all non-MoE factors, ignoring global interactions. We propose a reusable framework for holistic MoE architectural optimization that bridges this gap. We first show that FLOPs per token alone is an inadequate fairness metric for MoE models because differing computational densities across layer types can inflate parameters without proportional compute cost, and establish a joint constraint triad of FLOPs per token, active parameters, and total parameters. We then reduce the 16-dimensional architectural search space to two sequential low-dimensional phases through algebraic constraints and a rank-preserving property of the hidden dimension. Validated across hundreds of MoE models spanning six orders of magnitude in compute, our framework yields robust scaling laws that map any compute budget to a complete, optimal MoE architecture. A key finding is that the near-optimal configuration band widens with scale, giving practitioners quantitative flexibility to balance scaling law recommendations against infrastructure constraints.

URL PDF HTML ☆

赞 0 踩 0

2603.21856 2026-03-24 cs.CV

Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning

Sulian Thual, Feiyang Cai, Jingjing Wang, Feng Luo

2603.21854 2026-03-24 cs.AI

Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Aryan Kasat, Smriti Singh, Aman Chadha, Vinija Jain

Comments 32 pages, 34 figures, 7 tables

详情

英文摘要

Do large language models reason morally, or do they merely sound like they do? We investigate whether LLM responses to moral dilemmas exhibit genuine developmental progression through Kohlberg's stages of moral development, or whether alignment training instead produces reasoning-like outputs that superficially resemble mature moral judgment without the underlying developmental trajectory. Using an LLM-as-judge scoring pipeline validated across three judge models, we classify more than 600 responses from 13 LLMs spanning a range of architectures, parameter scales, and training regimes across six classical moral dilemmas, and conduct ten complementary analyses to characterize the nature and internal coherence of the resulting patterns. Our results reveal a striking inversion: responses overwhelmingly correspond to post-conventional reasoning (Stages 5-6) regardless of model size, architecture, or prompting strategy, the effective inverse of human developmental norms, where Stage 4 dominates. Most strikingly, a subset of models exhibit moral decoupling: systematic inconsistency between stated moral justification and action choice, a form of logical incoherence that persists across scale and prompting strategy and represents a direct reasoning consistency failure independent of rhetorical sophistication. Model scale carries a statistically significant but practically small effect; training type has no significant independent main effect; and models exhibit near-robotic cross-dilemma consistency producing logically indistinguishable responses across semantically distinct moral problems. We posit that these patterns constitute evidence for moral ventriloquism: the acquisition, through alignment training, of the rhetorical conventions of mature moral reasoning without the underlying developmental trajectory those conventions are meant to represent.

URL PDF HTML ☆

赞 0 踩 0

2603.21847 2026-03-24 cs.CL

Riding Brainwaves in LLM Space: Understanding Activation Patterns Using Individual Neural Signatures

Ajan Subramanian, Sumukh Bettadapura, Rohan Sathish

2603.21846 2026-03-24 cs.AI cs.HC

Agentic Personas for Adaptive Scientific Explanations with Knowledge Graphs

Susana Nunes, Tiago Guerreiro, Catia Pesquita

Comments 17 pages, 9 figures

2603.21844 2026-03-24 cs.LG cs.AI stat.ME stat.ML

On the Number of Conditional Independence Tests in Constraint-based Causal Discovery

Marc Franquesa Monés, Jiaqi Zhang, Caroline Uhler

2603.21840 2026-03-24 cs.CL cs.AI

Select, Label, Evaluate: Active Testing in NLP

Antonio Purificato, Maria Sofia Bucarelli, Andrea Bacciu, Amin Mantrach, Fabrizio Silvestri

Comments 27 pages, 6 figures

2603.21836 2026-03-24 cs.CL cs.AI cs.PL

Instruction Set and Language for Symbolic Regression

Ezequiel Lopez-Rubio, Mario Pascual-Gonzalez

2603.21832 2026-03-24 cs.LG eess.SP

Deriving Health Metrics from the Photoplethysmogram: Benchmarks and Insights from MIMIC-III-Ext-PPG

Mohammad Moulaeifard, Philip J. Aston, Peter H. Charlton, Nils Strodthoff

Comments 22 pages, 1 figure

2603.21829 2026-03-24 cs.CV

Multi-View Deformable Convolution Meets Visual Mamba for Coronary Artery Segmentation

Xiaochan Yuan, Pai Zeng

详情

英文摘要

Accurate segmentation of coronary arteries from computed tomography angiography (CTA) images is of paramount clinical importance for the diagnosis and treatment planning of cardiovascular diseases. However, coronary artery segmentation remains challenging due to the inherent multi-branching and slender tubular morphology of the vasculature, compounded by severe class imbalance between foreground vessels and background tissue. Conventional convolutional neural network (CNN)-based approaches struggle to capture long-range dependencies among spatially distant vascular structures, while Vision Transformer (ViT)-based methods incur prohibitive computational overhead that hinders deployment in resource-constrained clinical settings. Motivated by the recent success of state space models (SSMs) in efficiently modeling long-range sequential dependencies with linear complexity, we propose MDSVM-UNet, a novel two-stage coronary artery segmentation framework that synergistically integrates multidirectional snake convolution (MDSConv) with residual visual Mamba (RVM). In the encoding stage, we introduce MDSConv, a deformable convolution module that learns adaptive offsets along three orthogonal anatomical planes -- sagittal, coronal, and axial -- thereby enabling comprehensive multi-view feature fusion that faithfully captures the elongated and tortuous geometry of coronary vessels. In the decoding stage, we design an RVM-based upsampling decoder block that leverages selective state space mechanisms to model inter-slice long-range dependencies while preserving linear computational complexity. Furthermore, we propose a progressive two-stage segmentation strategy: the first stage performs coarse whole-image segmentation to guide intelligent block extraction, while the second stage conducts fine-grained block-level segmentation to recover vascular details and suppress false positives..

URL PDF HTML ☆

赞 0 踩 0

2603.21828 2026-03-24 cs.LG cs.AI

CoRA: Boosting Time Series Foundation Models for Multivariate Forecasting through Correlation-aware Adapter

Hanyin Cheng, Xingjian Wu, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo

2603.21820 2026-03-24 cs.CV

Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion

Yanglin Deng, Tianyang Xu, Chunyang Cheng, Hui Li, Xiao-jun Wu, Josef Kittler

Comments Accepted by CVPR2026

2603.21819 2026-03-24 cs.CV cs.AI cs.LG cs.SY eess.SY

Ctrl-A: Control-Driven Online Data Augmentation

Jesper B. Christensen, Ciaran Bench, Spencer A. Thomas, Hüsnü Aslan, David Balslev-Harder, Nadia A. S. Smith, Alessandra Manzin

Comments 17 pages (11 pages main manuscript), 8 figures (5 in main manuscript)

2603.21809 2026-03-24 cs.CV

Clinical Graph-Mediated Distillation for Unpaired MRI-to-CFI Hypertension Prediction

Dillan Imans, Phuoc-Nguyen Bui, Duc-Tai Le, Hyunseung Choo

Comments 10 pages, 2 figures, 2 tables. Under review at MICCAI 2026

2603.21808 2026-03-24 cs.CV

Cascade-Free Mandarin Visual Speech Recognition via Semantic-Guided Cross-Representation Alignment

Lei Yang, Yi He, Fei Wu, Shilin Wang

2603.21803 2026-03-24 cs.CV

Timing In stand-up Comedy: Text, Audio, Laughter, Kinesics (TIC-TALK): Pipeline and Database for the Multimodal Study of Comedic Timing

Yaelle Zribi, Florian Cafiero, Vincent Lépinay, Chahan Vidal-Gorène

2603.21786 2026-03-24 cs.CV eess.IV

The Universal Normal Embedding

Chen Tasker, Roy Betser, Eyal Gofer, Meir Yossef Levi, Guy Gilboa

Comments Accepted to CVPR 2026

2603.21785 2026-03-24 cs.CV

Image-Conditioned Adaptive Parameter Tuning for Visual Odometry Frontends

Simone Nascivera, Leonard Bauersfeld, Jeff Delaune, Davide Scaramuzza

2603.21784 2026-03-24 cs.CV

Dynamic Exposure Burst Image Restoration

Woohyeok Kim, Jaesung Rim, Daeyeon Kim, Sunghyun Cho

2603.21782 2026-03-24 cs.LG

Show Me What You Don't Know: Efficient Sampling from Invariant Sets for Model Validation

Armand Rousselot, Joran Wendebourg, Ullrich Köthe

Comments 19 pages, 19 figures

2603.21774 2026-03-24 cs.RO

Memory-Efficient Boundary Map for Large-Scale Occupancy Grid Mapping

Benxu Tang, Yunfan Ren, Yixi Cai, Fanze Kong, Wenyi Liu, Fangcheng Zhu, Longji Yin, Liuyu Shi, Fu Zhang

详情

DOI: 10.1177/02783649261425266
Journal ref: Benxu Tang, et al. The International Journal of Robotics Research, published online 2026

英文摘要

Determining the occupancy status of locations in the environment is a fundamental task for safety-critical robotic applications. Traditional occupancy grid mapping methods subdivide the environment into a grid of voxels, each associated with one of three occupancy states: free, occupied, or unknown. These methods explicitly maintain all voxels within the mapped volume and determine the occupancy state of a location by directly querying the corresponding voxel that the location falls within. However, maintaining all grid voxels in high-resolution and large-scale scenarios requires substantial memory resources. In this paper, we introduce a novel representation that only maintains the boundary of the mapped volume. Specifically, we explicitly represent the boundary voxels, such as the occupied voxels and frontier voxels, while free and unknown voxels are automatically represented by volumes within or outside the boundary, respectively. As our representation maintains only a closed surface in two-dimensional (2D) space, instead of the entire volume in three-dimensional (3D) space, it significantly reduces memory consumption. Then, based on this 2D representation, we propose a method to determine the occupancy state of arbitrary locations in the 3D environment. We term this method as boundary map. Besides, we design a novel data structure for maintaining the boundary map, supporting efficient occupancy state queries. Theoretical analyses of the occupancy state query algorithm are also provided. Furthermore, to enable efficient construction and updates of the boundary map from the real-time sensor measurements, we propose a global-local mapping framework and corresponding update algorithms. Finally, we will make our implementation of the boundary map open-source on GitHub to benefit the community:https://github.com/hku-mars/BDM.

URL PDF HTML ☆

赞 0 踩 0

2603.21754 2026-03-24 cs.CV cs.AI

Let's Think with Images Efficiently! An Interleaved-Modal Chain-of-Thought Reasoning Framework with Dynamic and Precise Visual Thoughts

Xu Liu, Yongheng Zhang, Qiguang Chen, Yao Li, Sheng Wang, Libo Qin

Comments Accepted by AAAI 2026

2603.21728 2026-03-24 cs.AI cs.CL

EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning

Andreas Sauter, Yuyue Zhao, Jacopo Urbani, Wenxiang Hu, Zaiqiao Meng, Lun Zhou, Xiaohui Yan, Yougang Lyu

2603.21725 2026-03-24 cs.AI cs.LG

CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning

Shuo Wang, Ziyu Chen, Ming Tang

2603.21724 2026-03-24 cs.LG cs.AI

FISformer: Replacing Self-Attention with a Fuzzy Inference System in Transformer Models for Time Series Forecasting

Bulent Haznedar, Levent Karacan

2603.21720 2026-03-24 cs.CL cs.AI

SemEval-2026 Task 12: Abductive Event Reasoning: Towards Real-World Event Causal Inference for Large Language Models

Pengfei Cao, Mingxuan Yang, Yubo Chen, Chenlong Zhang, Mingxuan Liu, Kang Liu, Jun Zhao

Comments 9 pages, 3 figures, semeval 2026 task 12 description paper

2603.21719 2026-03-24 cs.CL

Probing How Scalable Table Data Enhances General Long-Context Reasoning

Huaibing Xie, Guoliang Zhao, Yang Liu, Shihan Dou, Siming Huang, Yanling Xiao, Shaolei Wang, Yiting Liu, Cheng Zhang, Shaofan Liu, Pluto Zhou

2603.21716 2026-03-24 cs.LG cs.AI cs.CV

When Exploration Comes for Free with Mixture-Greedy: Do we need UCB in Diversity-Aware Multi-Armed Bandits?

Bahar Dibaei Nia, Farzan Farnia

2603.21708 2026-03-24 cs.AI cs.CV

Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning

Xi Wang, Xu Yang, Donghao Sun, Cheng Deng

2603.21705 2026-03-24 cs.LG

Data-Free Layer-Adaptive Merging via Fisher Information for Long-to-Short Reasoning LLMs

Tian Xia

Comments 14 pages, NeurIPS 2026 submission