arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.11308 2026-03-13 cs.LG

Heavy-Tailed Principle Component Analysis

Mario Sayde, Christopher Khater, Jihad Fahs, Ibrahim Abou-Faycal

详情

英文摘要

Principal Component Analysis (PCA) is a cornerstone of dimensionality reduction, yet its classical formulation relies critically on second-order moments and is therefore fragile in the presence of heavy-tailed data and impulsive noise. While numerous robust PCA variants have been proposed, most either assume finite variance, rely on sparsity-driven decompositions, or address robustness through surrogate loss functions without a unified treatment of infinite-variance models. In this paper, we study PCA for high-dimensional data generated according to a superstatistical dependent model of the form $\mathbf{X} = A^{1/2}\mathbf{G}$, where $A$ is a positive random scalar and $\mathbf{G}$ is a Gaussian vector. This framework captures a wide class of heavy-tailed distributions, including multivariate $t$ and sub-Gaussian $α$-stable laws. We formulate PCA under a logarithmic loss, which remains well defined even when moments do not exist. Our main theoretical result shows that, under this loss, the principal components of the heavy-tailed observations coincide with those obtained by applying standard PCA to the covariance matrix of the underlying Gaussian generator. Building on this insight, we propose robust estimators for this covariance matrix directly from heavy-tailed data and compare them with the empirical covariance and Tyler's scatter estimator. Extensive experiments, including background denoising tasks, demonstrate that the proposed approach reliably recovers principal directions and significantly outperforms classical PCA in the presence of heavy-tailed and impulsive noise, while remaining competitive under Gaussian noise.

URL PDF HTML ☆

赞 0 踩 0

2603.11307 2026-03-13 cs.LG

Client-Conditional Federated Learning via Local Training Data Statistics

Rickard Brännvall

Comments 9 pages, 4 figures, 5 tables. Submitted to FLICS 2026

2603.11306 2026-03-13 cs.CV

Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild

Jun Yu, Yunxiang Zhang, Naixiang Zheng, Lingsi Zhu, Guoyuan Wang

Comments 8 pages, 1 figures

2603.11299 2026-03-13 cs.AI

Counterweights and Complementarities: The Convergence of AI and Blockchain Powering a Decentralized Future

Yibai Li, Zhiye Jin, Xiaobing, Li, K. D. Joshi, Xuefei, Deng

Comments 7 pages, Editorial, published in ACM SIGMIS Database Vol. 56, Iss. 2

2603.11296 2026-03-13 cs.LG q-bio.QM

Single molecule localization microscopy challenge: a biologically inspired benchmark for long-sequence modeling

Fatemeh Valeh, Monika Farsang, Radu Grosu, Gerhard Schütz

Comments 11 pages, 4 figures. Under review

2603.11295 2026-03-13 cs.CL

Temporal Text Classification with Large Language Models

Nishat Raihan, Marcos Zampieri

2603.11290 2026-03-13 cs.RO

A Causal Approach to Predicting and Improving Human Perceptions of Social Navigation Robots

Maximilian Diehl, Nathan Tsoi, Gustavo Chavez, Karinne Ramirez-Amaro, Marynel Vázquez

Comments 8 pages, to be submitted to RA-L

2603.11281 2026-03-13 cs.CL

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

Monica Munnangi, Saiph Savage

2603.11273 2026-03-13 cs.LG

Duration Aware Scheduling for ASR Serving Under Workload Drift

Darshan Makwana, Yash Jogi, Harsh Kotta, Aayush Kubba

2603.11269 2026-03-13 cs.LG

Beyond the Class Subspace: Teacher-Guided Training for Reliable Out-of-Distribution Detection in Single-Domain Models

Hong Yang, Devroop Kar, Qi Yu, Travis Desell, Alex Ororbia

Comments 14 pages main text, 22 pages appendix; under review at ECCV 2026

2603.11266 2026-03-13 cs.AI

The Unlearning Mirage: A Dynamic Framework for Evaluating LLM Unlearning

Raj Sanjay Shah, Jing Huang, Keerthiram Murugesan, Nathalie Baracaldo, Diyi Yang

Comments Published at COLM 2025

2603.11257 2026-03-13 cs.CV

Towards Automated Initial Probe Placement in Transthoracic Teleultrasound Using Human Mesh and Skeleton Recovery

Yu Chung Lee, David G. Black, Ryan S. Yeung, Septimiu E. Salcudean

Comments 10 pages, 6 figures. Under review

2603.11254 2026-03-13 cs.CL cs.AI

Artificial Intelligence for Sentiment Analysis of Persian Poetry

Arash Zargar, Abolfazl Moshiri, Mitra Shafaei, Shabnam Rahimi-Golkhandan, Mohamad Tavakoli-Targhi, Farzad Khalvati

2603.11252 2026-03-13 cs.CV

Radiometric fingerprinting of object surfaces using mobile laser scanning and semantic 3D road space models

Benedikt Schwab, Thomas H. Kolbe

2603.11246 2026-03-13 cs.CV

When Slots Compete: Slot Merging in Object-Centric Learning

Christos Chatzisavvas, Panagiotis Rigas, George Ioannakis, Vassilis Katsouros, Nikolaos Mitianoudis

2603.11245 2026-03-13 cs.AI

Mind the Sim2Real Gap in User Simulation for Agentic Tasks

Xuhui Zhou, Weiwei Sun, Qianou Ma, Yiqing Xie, Jiarui Liu, Weihua Du, Sean Welleck, Yiming Yang, Graham Neubig, Sherry Tongshuang Wu, Maarten Sap

2603.11230 2026-03-13 cs.LG eess.SP

Monitoring and Prediction of Mood in Elderly People during Daily Life Activities

Daniel Bautista-Salinas, Joaquín Roca González, Inmaculada Méndez, Oscar Martinez Mozos

Comments This is the authors' manuscript. The final published article is available at https://doi.org/10.1109/EMBC.2019.8857847

2603.11228 2026-03-13 cs.CL cs.AI cs.LG

Markovian Generation Chains in Large Language Models

Mingmeng Geng, Amr Mohamed, Guokan Shang, Michalis Vazirgiannis, Thierry Poibeau

2603.11223 2026-03-13 cs.CL cs.AI cs.IR

MDER-DR: Multi-Hop Question Answering with Entity-Centric Summaries

Riccardo Campi, Nicolò Oreste Pinciroli Vago, Mathyas Giudici, Marco Brambilla, Piero Fraternali

Comments Our code is available at https://github.com/DataSciencePolimi/MDER-DR_RAG

2603.11220 2026-03-13 cs.CV cs.CL

Frequency-Modulated Visual Restoration for Matryoshka Large Multimodal Models

Qingtao Pan, Zhihao Dou, Shuo Li

2603.11219 2026-03-13 cs.CV

Senna-2: Aligning VLM and End-to-End Driving Policy for Consistent Decision Making and Planning

Yuehao Song, Shaoyu Chen, Hao Gao, Yifan Zhu, Weixiang Yue, Jialv Zou, Bo Jiang, Zihao Lu, Yu Wang, Qian Zhang, Xinggang Wang

Comments 15 pages, 8 figures. Project page: https://ambitious-idiot.github.io/senna2-project

2603.11210 2026-03-13 cs.LG

Reference-Guided Machine Unlearning

Jonas Mirlach, Sonia Laguna, Julia E. Vogt

Comments 12 pages, 1 figure, 4 tables. Accepted at three ICLR 2026 workshops: Test-Time Updates (TTU), AI with Recursive Self-Improvement (RSI), and Agents in the Wild (AIWILD)

2603.11206 2026-03-13 cs.CV

Evidential learning driven Breast Tumor Segmentation with Stage-divided Vision-Language Interaction

Jingxing Zhong, Qingtao Pan, Xuchang Zhou, Jiazhen Lin, Xinguo Zhuang

2603.11199 2026-03-13 cs.LG

Bayesian Optimization of Partially Known Systems using Hybrid Models

Eike Cramer, Luis Kutschat, Oliver Stollenwerk, Joel A. Paulson, Alexander Mitsos

Comments 16 pages, 5 Figures

2603.11193 2026-03-13 cs.CL

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

Hanxu Hu, Yuxuan Wang, Maggie Huan, Jannis Vamvas, Yinya Huang, Zhijiang Guo, Rico Sennrich

Comments 13 pages, 6 figures

2603.11174 2026-03-13 cs.CV

GGPT: Geometry Grounded Point Transformer

Yutong Chen, Yiming Wang, Xucong Zhang, Sergey Prokudin, Siyu Tang

Comments CVPR 2026, Project website: https://chenyutongthu.github.io/research/ggpt

2603.11168 2026-03-13 cs.LG cs.CL cs.SD

Huntington Disease Automatic Speech Recognition with Biomarker Supervision

Charles L. Wang, Cady Chen, Ziwei Gong, Julia Hirschberg

2603.11142 2026-03-13 cs.LG cs.AI cs.CV

Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT

Sai V R Chereddy

Comments Accepted at the AAAI 2026 Workshop on Deployable AI (DAI). Non-archival. Code and custom dataset available upon request

2603.11140 2026-03-13 cs.LG cs.AI cs.CY

Procedural Fairness via Group Counterfactual Explanation

Gideon Popoola, John Sheppard

Comments 16 pages, submitted to ECML 2026

2603.11137 2026-03-13 cs.LG cs.CL

Scaling Reasoning Efficiently via Relaxed On-Policy Distillation

Jongwoo Ko, Sara Abdali, Young Jin Kim, Tianyi Chen, Pashmina Cameron

Comments Code will be available soon