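arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.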

2604.14434 2026-04-17 cs.AI

Geometric Routing Enables Causal Expert Control in Mixture of Experts

Ivan Ternovtsii, Yurii Bilak

Abstract

Sparse Mixture-of-Experts (MoE) models scale parameters while fixing active computation per token, but the specialization of individual experts remains opaque. In a companion paper we showed that routing topology is quality-neutral: five structurally different configurations converge to statistically equivalent language modeling quality. Here we show that expert identity is nonetheless causally meaningful: individual rank-1 experts are monosemantic by construction, and cosine-similarity routing in a low-dimensional metric space makes their specialization directly inspectable. We present four lines of evidence. First, projecting expert output vectors through the unembedding matrix yields a Semantic Dictionary: 15% of experts are monosemantic specialists spanning 10 categories (temporal, geographic, cardinal, discourse, emotional, financial, military, scientific). Second, routing exhibits a frequency-to-syntax gradient: early layers separate tokens by word frequency, deeper layers by syntactic class (Zipf-confound controls, all $p < 0.001$). Third, causal interventions confirm these labels: steering toward a temporal expert's centroid increases P(temporal) by +321% (median across 44 prompts); suppressing a geographic expert drops P(geographic) by -23%; rewriting an expert's output vector halves target-category probability, and effects compose additively across layers. Fourth, the interventions are not unique to cosine routing: linear routers support comparable steering, but only cosine routing provides geometric transparency -- expert specialization is readable directly from the centroid matrix. MoE expert-level specialization is a first-class interpretability primitive: architecturally monosemantic, causally validated, and controllable at inference with zero overhead.
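The cosine-similarity routing and centroid steering described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `route` and `steer`, the strength parameter `alpha`, and the two-dimensional toy centroids are all hypothetical, chosen only to show how a token is assigned to the expert with the most similar centroid and how pushing the token toward another centroid changes that assignment.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def route(token, centroids, k=1):
    """Indices of the k experts whose centroids are most similar to the token."""
    scores = [cosine(token, c) for c in centroids]
    return sorted(range(len(centroids)), key=lambda i: scores[i], reverse=True)[:k]

def steer(token, centroid, alpha):
    """Shift the token's routing vector toward one expert centroid (assumed form)."""
    return [t + alpha * c for t, c in zip(token, centroid)]

# Toy data (assumption): three experts in a 2-D routing space.
centroids = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
token = [0.9, 0.2]

before = route(token, centroids)                             # closest to expert 0
after = route(steer(token, centroids[1], 2.0), centroids)    # pushed toward expert 1
print(before, after)
```

Because each expert is represented by a single centroid row, reading off which tokens an expert attracts only requires inspecting that row, which is the kind of geometric transparency the abstract attributes to cosine routing.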

2604.14433 2026-04-17 cs.CV cs.LG

Zero-Ablation Overstates Register Content Dependence in DINO Vision Transformers

Felipe Parodi, Jordan Matelsky, Melanie Segado

Comments 12 pages, 10 figures, to be published in CVPR 2026 HOW Vision Interpretability Workshop Proceedings

2604.14430 2026-04-17 cs.CL cs.AI cs.LG

Three-Phase Transformer

Mohammad R. Abu Ayyash

Comments 48 pages, 20 figures, 23 tables. Code: https://github.com/achelousace/three-phase-transformer

2604.14424 2026-04-17 cs.LG physics.flu-dyn

Non-intrusive Learning of Physics-Informed Spatio-temporal Surrogate for Accelerating Design

Sudeepta Mondal, Soumalya Sarkar

2604.14422 2026-04-17 cs.AI

Demonstration of Pneuma-Seeker: Agentic System for Reifying and Fulfilling Information Needs on Tabular Data

Muhammad Imam Luthfi Balaka, Raul Castro Fernandez

Comments ACM CAIS 2026 (Demo)

2604.14421 2026-04-17 cs.RO

BIEVR-LIO: Robust LiDAR-Inertial Odometry through Bump-Image-Enhanced Voxel Maps

Patrick Pfreundschuh, Turcan Tuna, Cedric Le Gentil, Roland Siegwart, Cesar Cadena, Helen Oleynikova

2604.14419 2026-04-17 cs.AI

Equifinality in Mixture of Experts: Routing Topology Does Not Determine Language Modeling Quality

Ivan Ternovtsii, Yurii Bilak

2604.14414 2026-04-17 cs.CL

The Autocorrelation Blind Spot: Why 42% of Turn-Level Findings in LLM Conversation Analysis May Be Spurious

Ferdinand M. Schessl

Comments 14 pages, 3 figures, 5 tables, 1 algorithm. Code and synthetic demonstration data: https://github.com/ferdinandschessl-boop/autocorrelation-correction

2604.14401 2026-04-17 cs.AI cs.DB

Credo: Declarative Control of LLM Pipelines via Beliefs and Policies

Duo Lu, Andrew Crotty, Uğur Çetintemel

2604.14399 2026-04-17 cs.RO cs.AI cs.SY eess.SY

SpaceMind: A Modular and Self-Evolving Embodied Vision-Language Agent Framework for Autonomous On-orbit Servicing

Aodi Wu, Haodong Han, Xubo Luo, Ruisuo Wang, Shan He, Xue Wan

Comments 23 pages, 6 figures, 7 tables. Code available at https://github.com/wuaodi/SpaceMind

2604.14379 2026-04-17 cs.LG cs.AI cs.CV

Step-level Denoising-time Diffusion Alignment with Multiple Objectives

Qi Zhang, Dawei Wang, Shaofeng Zou

2604.14375 2026-04-17 cs.LG cs.AI

Modular Continual Learning via Zero-Leakage Reconstruction Routing and Autonomous Task Discovery

Noureddine Kermiche

2604.14363 2026-04-17 cs.CL cs.AI cs.CV

The Cost of Language: Centroid Erasure Exposes and Exploits Modal Competition in Multimodal Language Models

Akshay Paruchuri, Ishan Chatterjee, Henry Fuchs, Ehsan Adeli, Piotr Didyk

Comments 29 pages, 9 figures, 19 tables

2604.14362 2026-04-17 cs.CL cs.AI cs.IR

APEX-MEM: Agentic Semi-Structured Memory with Temporal Reasoning for Long-Term Conversational AI

Pratyay Banerjee, Masud Moshtaghi, Shivashankar Subramanian, Amita Misra, Ankit Chadha

Comments Accepted to ACL 2026 Mains

2604.14353 2026-04-17 cs.RO

RoSLAC: Robust Simultaneous Localization and Calibration of Multiple Magnetometers

Qiyang Lyu, Zhenyu Wu, Wei Wang, Hongming Shen, Danwei Wang

Abstract

Localization of autonomous mobile robots (AMRs) in enclosed or semi-enclosed environments such as offices, hotels, hospitals, indoor parking facilities, and underground spaces where GPS signals are weak or unavailable remains a major obstacle to the deployment of fully autonomous systems. Infrastructure-based localization approaches, such as QR codes and RFID, are constrained by high installation and maintenance costs as well as limited flexibility, while onboard sensor-based methods, including LiDAR- and vision-based solutions, are affected by ambiguous geometric features and frequent occlusions caused by dynamic obstacles such as pedestrians. Ambient magnetic field (AMF)-based localization has therefore attracted growing interest in recent years because it does not rely on external infrastructure or geometric features, making it well-suited for AMR applications such as service robots and security robots. However, magnetometer measurements are often corrupted by distortions caused by ferromagnetic materials present on the sensor platform, which bias the AMF and degrade localization reliability. As a result, accurate magnetometer calibration to estimate distortion parameters becomes essential. Conventional calibration methods that rely on rotating the magnetometer are impractical for large and heavy platforms. To address this limitation, this paper proposes a robust simultaneous localization and calibration (RoSLAC) approach based on alternating optimization, which iteratively and efficiently estimates both the platform pose and magnetometer calibration parameters. Extensive evaluations conducted in high-fidelity simulation and real-world environments demonstrate that the proposed RoSLAC method achieves high localization accuracy while maintaining low computational cost compared with state-of-the-art magnetometer calibration techniques.
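The alternating-optimization idea in this abstract, estimating pose with calibration held fixed and then calibration with pose held fixed, can be sketched on a toy problem. This is not the RoSLAC algorithm: the one-dimensional field map, the constant additive bias standing in for the distortion parameters, and the names `locate`, `calibrate`, and `alternate` are all assumptions made for illustration.

```python
def locate(z, field, bias):
    """Pose step: best start index for the readings, holding the bias fixed."""
    n = len(z)

    def cost(p0):
        return sum((z[k] - bias - field[p0 + k]) ** 2 for k in range(n))

    return min(range(len(field) - n + 1), key=cost)

def calibrate(z, field, p0):
    """Calibration step: mean residual at the current pose estimate."""
    return sum(z[k] - field[p0 + k] for k in range(len(z))) / len(z)

def alternate(z, field, iters=10):
    """Alternate the two steps until the pose estimate stops changing."""
    bias, p0 = 0.0, None
    for _ in range(iters):
        new_p0 = locate(z, field, bias)
        bias = calibrate(z, field, new_p0)
        if new_p0 == p0:
            break
        p0 = new_p0
    return p0, bias

# Toy data (assumption): a known 1-D field map and four biased readings
# taken at consecutive map cells starting from index 4.
field = [0.0, 3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0, 6.0, 5.0, 3.0, 5.0]
z = [v + 0.5 for v in field[4:8]]

p0, bias = alternate(z, field)
print(p0, bias)
```

Alternating schemes of this kind can settle in a local minimum when the initial calibration guess is poor; the toy map here varies enough that the first pose step already lands on the correct cell.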


2604.14339 2026-04-17 cs.CL

Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation

Zichong Li, Chen Liang, Liliang Ren, Tuo Zhao, Yelong Shen, Weizhu Chen

2604.14338 2026-04-17 cs.LG stat.ML

Path-Sampled Integrated Gradients

Firuz Kamalov, Fadi Thabtah, R. Sivaraj, Neda Abdelhamid

2604.14336 2026-04-17 cs.AI

Mistake gating leads to energy and memory efficient continual learning

Aaron Pache, Mark CW van Rossum

2604.14332 2026-04-17 cs.LG cs.AI

Thermodynamic Diffusion Inference with Minimal Digital Conditioning

Aditi De

2604.14331 2026-04-17 cs.LG stat.ML

Heat and Matérn Kernels on Matchings

Dmitry Eremeev, Salem Said, Viacheslav Borovitskiy

2604.14329 2026-04-17 cs.CV

Interpretable Human Activity Recognition for Subtle Robbery Detection in Surveillance Videos

Bryan Jhoan Cazáres Leyva, Ulises Gachuz Davila, José Juan González Fonseca, Juan Irving Vasquez, Vanessa A. Camacho-Vázquez, Sergio Isahí Garrido-Castañeda

Comments submitted to MCPR

2604.14325 2026-04-17 cs.CL cs.AI

Faithfulness Serum: Mitigating the Faithfulness Gap in Textual Explanations of LLM Decisions via Attribution Guidance

Bar Alon, Itamar Zimerman, Lior Wolf

Comments 24 pages, multiple figures (e.g., at least 6 main figures), includes experiments across several benchmarks (MMLU, CommonsenseQA, SciQ, ARC, OpenBookQA); code available on GitHub

2604.14324 2026-04-17 cs.CL

Purging the Gray Zone: Latent-Geometric Denoising for Precise Knowledge Boundary Awareness

Hao An, Yibin Lou, Jiayi Guo, Yang Xu

Comments ACL 2026 Findings

2604.14321 2026-04-17 cs.CL

LLM Predictive Scoring and Validation: Inferring Experience Ratings from Unstructured Text

Jason Potteiger, Andrew Hong, Ito Zapata

Comments 29 pages, 5 figures, 6 tables

2604.14316 2026-04-17 cs.AI

Seeing Through Experts Eyes A Foundational Vision Language Model Trained on Radiologists Gaze and Reasoning

Kinhei Lee, Peiyuan Jing, Zhenxuan Zhang, Yue Yang, Tao Wang, Dominic C Marshall, Yingying Fang, Guang Yang

2604.14315 2026-04-17 cs.CL cs.CY

Tracking the Temporal Dynamics of News Coverage of Catastrophic and Violent Events

Emily Lugos, Maurício Gruppi

2604.14314 2026-04-17 cs.CV cs.AI cs.CL

DharmaOCR: Specialized Small Language Models for Structured OCR that outperform Open-Source and Commercial Baselines

Gabriel Pimenta de Freitas Cardoso, Caio Lucas da Silva Chacon, Jonas Felipe da Fonseca Oliveira, Paulo Henrique de Medeiros Araujo

2604.14302 2026-04-17 cs.CV

Geometrically Consistent Multi-View Scene Generation from Freehand Sketches

Ahmed Bourouis, Savas Ozkan, Andrea Maracani, Yi-Zhe Song, Mete Ozay

2604.14287 2026-04-17 cs.LG cs.AI quant-ph

Quantum-inspired tensor networks in machine learning models

Guillermo Valverde, Igor García-Olaizola, Giannicola Scarpa, Alejandro Pozas-Kerstjens

Comments 28 pages, 11 figures, article class. The interactive version of the graph can be found at https://github.com/gvalverde21/research-graph-TensorNetworks-MachineLearning

2604.14268 2026-04-17 cs.CV

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Team HY-World, Chenjie Cao, Xuhui Zuo, Zhenwei Wang, Yisu Zhang, Junta Wu, Zhenyang Liu, Yuning Gong, Yang Liu, Bo Yuan, Chao Zhang, Coopers Li, Dongyuan Guo, Fan Yang, Haiyu Zhang, Hang Cao, Jianchen Zhu, Jiaxin Lin, Jie Xiao, Jihong Zhang, Junlin Yu, Lei Wang, Lifu Wang, Lilin Wang, Linus, Minghui Chen, Peng He, Penghao Zhao, Qi Chen, Rui Chen, Rui Shao, Sicong Liu, Wangchen Qin, Xiaochuan Niu, Xiang Yuan, Yi Sun, Yifei Tang, Yifu Sun, Yihang Lian, Yonghao Tan, Yuhong Liu, Yuyang Yin, Zhiyuan Min, Tengfei Wang, Chunchao Guo

Comments Project Page: https://3d-models.hunyuan.tencent.com/world/ ; Code: https://github.com/Tencent-Hunyuan/HY-World-2.0