arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.04105 2026-03-16 cs.CL cs.CR

Expert Selections In MoE Models Reveal (Almost) As Much As Text

Amir Nuriyev, Gabriel Kulp

详情

英文摘要

We present a text-reconstruction attack on mixture-of-experts (MoE) language models that recovers tokens from expert selections alone. In MoE models, each token is routed to a subset of expert subnetworks; we show these routing decisions leak substantially more information than previously understood. Prior work using logistic regression achieves limited reconstruction; we show that a 3-layer MLP improves this to 63.1% top-1 accuracy, and that a transformer-based sequence decoder recovers 91.2% of tokens top-1 (94.8% top-10) on 32-token sequences from OpenWebText after training on 100M tokens. These results connect MoE routing to the broader literature on embedding inversion. We outline practical leakage scenarios (e.g., distributed inference and side channels) and show that adding noise reduces but does not eliminate reconstruction. Our findings suggest that expert selections in MoE deployments should be treated as sensitive as the underlying text.

URL PDF HTML ☆

赞 0 踩 0

2602.02983 2026-03-16 cs.AI

Do LLMs Share Human-Like Biases? Causal Reasoning Under Prior Knowledge, Irrelevant Context, and Varying Compute Budgets

Hanna M. Dettki, Charley M. Wu, Bob Rehder

2602.02592 2026-03-16 cs.LG cs.AI cs.SY eess.SY

Learnable Koopman-Enhanced Transformer-Based Time Series Forecasting with Spectral Control

Ali Forootani, Raffaele Iervolino

2601.16309 2026-03-16 cs.CL cs.SI

A Longitudinal, Multinational, and Multilingual Corpus of News Coverage of the Russo-Ukrainian War

Dikshya Mohanty, Taisiia Sabadyn, Jelwin Rodrigues, Chenlu Wang, Abhishek Kalugade, Ritwik Banerjee

Comments To appear in Language Resources and Evaluation Conference (LREC) 2026

2601.12823 2026-03-16 cs.CV

TreeDGS: Aerial Gaussian Splatting for Distant DBH Measurement

Belal Shaheen, Minh-Hieu Nguyen, Bach-Thuan Bui, Shubham, Tim Wu, Michael Fairley, Matthew David Zane, Michael Wu, James Tompkin

2601.12551 2026-03-16 cs.CV eess.IV

PISE: Physics-Anchored Semantically-Enhanced Deep Computational Ghost Imaging for Robust Low-Bandwidth Machine Perception

Tong Wu

Comments 4 pages, 4 figures, 4 tables. Refined version with updated references and formatting improvements

2601.08265 2026-03-16 cs.CV

AIMC-Spec: A Benchmark Dataset for Automatic Intrapulse Modulation Classification under Variable Noise Conditions

Sebastian L. Cocks, Salvador Dreo, Brian Ng, Feras Dayoub

Comments This version updates the previously released dataset by reducing storage requirements, revising the SNR calculation procedure, and restructuring the dataset format The first version of this work was published in IEEE Access DOI: 10.1109/ACCESS.2025.3645091

2601.07540 2026-03-16 cs.CV

Enhancing Novel View Synthesis via Geometry Grounded Set Diffusion

Farhad G. Zanjani, Hong Cai, Amirhossein Habibian

Comments Paper and supplementary materials

2601.04864 2026-03-16 cs.AI

Key-Value Pair-Free Continual Learner via Task-Specific Prompt-Prototype

Haihua Luo, Xuming Ran, Zhengji Li, Huiyan Xue, Tingting Jiang, Jiangrong Shen, Tommi Kärkkäinen, Qi Xu, Fengyu Cong

Comments Accepted by Neural Networks

2601.02267 2026-03-16 cs.CV

DiffProxy: Multi-View Human Mesh Recovery via Diffusion-Generated Dense Proxies

Renke Wang, Zhenyu Zhang, Ying Tai, Jun Li, Jian Yang

Comments Page: https://wrk226.github.io/DiffProxy.html, Code: https://github.com/wrk226/DiffProxy

2601.02224 2026-03-16 cs.CL

From XAI to Stories: A Factorial Study of LLM-Generated Explanation Quality

Fabian Lukassen, Jan Herrmann, Christoph Weisser, Benjamin Saefken, Thomas Kneib

2601.02123 2026-03-16 cs.CL cs.AI

DeCode: Decoupling Content and Delivery for Medical QA

Po-Jen Ko, Chen-Han Tsai, Yu-Shao Peng

Comments Preprint

2601.00217 2026-03-16 cs.SD cs.AI eess.AS

Mitigating Latent Mismatch in cVAE-Based Singing Voice Synthesis via Flow Matching

Minhyeok Yun, Yong-Hoon Choi

2601.00150 2026-03-16 cs.CV cs.AI cs.CE cs.MM

FCMBench: The First Large-scale Financial Credit Multimodal Benchmark for Real-world Applications

Yehui Yang, Dalu Yang, Fangxin Shang, Wenshuo Zhou, Jie Ren, Yifan Liu, Haojun Fei, Qing Yang, Yanwu Xu, Tao Chen

2512.18396 2026-03-16 cs.RO

AOMGen: Photoreal, Physics-Consistent Demonstration Generation for Articulated Object Manipulation

Yulu Wu, Jiujun Cheng, Haowen Wang, Dengyang Suo, Pei Ren, Qichao Mao, Shangce Gao, Yakun Huang

Comments Accepted by CVPR Findings2026

2512.16201 2026-03-16 cs.CV

Visual Alignment of Medical Vision-Language Models for Grounded Radiology Report Generation

Sarosij Bose, Ravi K. Rajendran, Biplob Debnath, Konstantinos Karydis, Amit K. Roy-Chowdhury, Srimat Chakradhar

2512.15098 2026-03-16 cs.CV

Uni-Parser Technical Report

Xi Fang, Haoyi Tao, Shuwen Yang, Chaozheng Huang, Suyang Zhong, Haocheng Lu, Han Lyu, Junjie Wang, Xinyu Li, Linfeng Zhang, Guolin Ke

2512.15011 2026-03-16 cs.LG cs.AI cs.CY cs.MA

Epistemic diversity across language models mitigates knowledge collapse

Damian Hodel, Jevin D. West

Comments 30 pages, 21 figures. v2 changelog: added experimental variations, updated theory, writing revisions, updated metadata

2512.14873 2026-03-16 cs.LG

How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal

Sam Jeong, Hae Yong Kim

Comments Received 16 December 2025, accepted 9 February 2026, date of publication 18 February 2026. This work is an enhanced version of the article accepted and published in IEEE Access. Date of current version 4 March 2026

详情

DOI: 10.1109/ACCESS.2026.3666073
Journal ref: IEEE Access, vol. 14, pp. 30431-30440, 2026

英文摘要

Fourier Analysis Network (FAN) was recently proposed as a simple way to improve neural network performance by replacing part of Rectified Linear Unit (ReLU) activations with sine and cosine functions. Although several studies have reported small but consistent gains across tasks, the underlying mechanism behind these improvements has remained unclear. In this work, we show that only the sine activation contributes positively to performance, whereas the cosine activation tends to be detrimental. Our analysis reveals that the improvement is not a consequence of the sine function's periodic nature; instead, it stems from the function's local behavior near x = 0, where its non-zero derivative mitigates the vanishing-gradient problem. We further show that FAN primarily alleviates the dying-ReLU problem, in which a neuron consistently receives negative inputs, produces zero gradients, and stops learning. Although modern ReLU-like activations, such as Leaky ReLU, GELU, and Swish, reduce ReLU's zero-gradient region, they still contain input domains where gradients remain significantly diminished, contributing to slower optimization and hindering rapid convergence. FAN addresses this limitation by introducing a more stable gradient pathway. This analysis shifts the understanding of FAN's benefits from a spectral interpretation to a concrete analysis of training dynamics, leading to the development of the Dual-Activation Layer (DAL), a more efficient convergence accelerator. We evaluate DAL on three tasks: classification of noisy sinusoidal signals versus pure noise, MNIST digit classification, and Electrocardiogram (ECG)-based biometric recognition. In all cases, DAL models converge faster and achieve equal or higher validation accuracy compared to models with conventional activations.

URL PDF HTML ☆

赞 0 踩 0

2512.13674 2026-03-16 cs.CV cs.CL cs.GR cs.HC

Towards Interactive Intelligence for Digital Humans

Yiyi Cai, Xuangeng Chu, Xiwei Gao, Sitong Gong, Yifei Huang, Caixin Kang, Kunhang Li, Haiyang Liu, Ruicong Liu, Yun Liu, Dianwen Ng, Zixiong Su, Erwin Wu, Yuhan Wu, Dingkun Yan, Tianyu Yan, Chang Zeng, Bo Zheng, You Zhou

2512.08881 2026-03-16 cs.CV

SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

Aysim Toker, Andreea-Maria Oncescu, Roy Miles, Ismail Elezi, Jiankang Deng

2512.06684 2026-03-16 cs.CV

EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy

Yumeng He, Zanwei Zhou, Yekun Zheng, Chen Liang, Yunbo Wang, Xiaokang Yang

Comments Accepted by CVPR 2026. Project page: https://raynehe.github.io/EMGauss/

2512.02293 2026-03-16 cs.RO cs.CV

VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM

Zihan Zhu, Wei Zhang, Moyang Li, Norbert Haala, Marc Pollefeys, Daniel Barath

Comments Project page: https://vigs-slam.github.io

2512.01550 2026-03-16 cs.RO cs.CV

NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction

Fei Liu, Shichao Xie, Minghua Luo, Zedong Chu, Junjun Hu, Xiaolong Wu, Mu Xu

2511.22645 2026-03-16 cs.CV

GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes

Di Wang, Shunyu Liu, Wentao Jiang, Fengxiang Wang, Yi Liu, Xiaolei Qin, Zhiming Luo, Chaoyang Zhou, Haonan Guo, Jing Zhang, Bo Du, Dacheng Tao, Liangpei Zhang

Comments Code, data, and models are available at https://github.com/MiliLab/GeoZero

2511.21662 2026-03-16 cs.CV

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Tianyi Xiong, Yi Ge, Ming Li, Zuolong Zhang, Pranav Kulkarni, Kaishen Wang, Qi He, Zeying Zhu, Chenxi Liu, Ruibo Chen, Tong Zheng, Yanshuo Chen, Xiyao Wang, Renrui Zhang, Wenhu Chen, Heng Huang

Comments Accepted to CVPR 2026

2511.21251 2026-03-16 cs.CV

AVFakeBench: A Comprehensive Audio-Video Forgery Detection Benchmark for AV-LMMs

Shuhan Xia, Peipei Li, Xuannan Liu, Dongsen Zhang, Xinyu Guo, Zekun Li

Comments The experimental results in this paper have been further improved and updated; the baseline results do not match existing results, therefore the paper needs to be retracted

2511.21105 2026-03-16 cs.CV

RLM: A Vision-Language Model Approach for Radar Scene Understanding

Pushkal Mishra, Kshitiz Bansal, Dinesh Bharadia

2511.18765 2026-03-16 cs.CV cs.AI

NI-Tex: Non-isometric Image-based Garment Texture Generation

Hui Shan, Ming Li, Haitao Yang, Kai Zheng, Sizhe Zheng, Yanwei Fu, Xiangru Huang

Comments Accepted to CVPR 2026

2511.18507 2026-03-16 cs.CV cs.AI

Multimodal Continual Learning with MLLMs from Multi-scenario Perspectives

Kai Jiang, Siqi Huang, Xiangyu Chen, Jiawei Shao, Hongyuan Zhang, Ping Luo, Xuelong Li

Comments 22 pages, 17 figures. This is a preprint version of a paper submitted to ICML 2026