arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.22166 2026-04-27 cs.LG cs.AI

Motivating Next-Gen Accelerators with Flexible (N:M) Activation Sparsity via Benchmarking Lightweight Post-Training Sparsification Approaches

Shirin Alanova, Kristina Kazistova, Ekaterina Galaeva, Alina Kostromina, Vladimir Smirnov, Redko Dmitry, Alexey Dontsov, Maxim Zhelnin, Evgeny Burnaev, Egor Shvetsov

详情

英文摘要

The demand for efficient large language model (LLM) inference has intensified the focus on sparsification techniques. While semi-structured (N:M) pruning is well-established for weights, its application to activation pruning remains underexplored despite its potential for dynamic, input-adaptive compression and reductions in I/O overhead. This work presents a comprehensive analysis of methods for post-training N:M activation pruning in LLMs. Across multiple LLMs, we demonstrate that pruning activations enables superior preservation of generative capabilities compared to weight pruning at equivalent sparsity levels. We evaluate lightweight, plug-and-play error mitigation techniques and pruning criteria, establishing strong hardware-friendly baselines that require minimal calibration. Furthermore, we explore sparsity patterns beyond NVIDIA's standard 2:4, showing that the 16:32 pattern achieves performance nearly on par with unstructured sparsity. However, considering the trade-off between flexibility and hardware implementation complexity, we focus on the 8:16 pattern as a superior candidate. Our findings provide both effective practical methods for activation pruning and a motivation for future hardware to support more flexible sparsity patterns. Our code is available https://anonymous.4open.science/r/Structured-Sparse-Activations-Inference-EC3C/README.md .

URL PDF HTML ☆

赞 0 踩 0

2509.20979 2026-04-27 cs.LG

Toward Robust and Efficient ML-Based GPU Caching for Modern Inference

Peng Chen, Jiaji Zhang, Hailiang Zhao, Yirong Zhang, Shenyao Chen, Jiahong Yu, Xueyan Tang, Yixuan Wang, Hao Li, Jianping Zou, Gang Xiong, Kingsum Chow, Shuibing He, Shuiguang Deng

2509.20886 2026-04-27 cs.CV cs.LG eess.IV

Nuclear Diffusion Models for Low-Rank Background Suppression in Videos

Tristan S. W. Stevens, Oisín Nolan, Jean-Luc Robert, Ruud J. G. van Sloun

Comments 5 pages, 4 figures, preprint

2509.14127 2026-04-27 cs.RO cs.MA

Relay-Based Coordination for Energy-Efficient Multi-Robot Pickup and Delivery

Alkesh K. Srivastava, Jared Michael Levin, Philip Dames

2508.15025 2026-04-27 cs.LG cs.SY eess.SY

Federated Nonlinear System Identification

Omkar Tupe, Max Hartman, Lav R. Varshney, Saurav Prakash

Comments Accepted at American Control Conference 2026

2508.10695 2026-04-27 cs.CL cs.AI cs.IR

Learning from Natural Language Feedback for Personalized Question Answering

Alireza Salemi, Hamed Zamani

2508.09160 2026-04-27 cs.LG cs.DB q-bio.QM

Presenting DiaData for Research on Type 1 Diabetes

Beyza Cinar, Maria Maleshkova

Comments 11 pages, 7 figures, 3 tables. References were corrected for version 2

2508.03963 2026-04-27 cs.AI

Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?

Zewen Liu, Juntong Ni, Xianfeng Tang, Max S. Y. Lau, Qi He, Wenpeng Yin, Wei Jin

Comments camera_ready

2507.13706 2026-04-27 cs.CV math.ST stat.TH

GOSPA and T-GOSPA quasi-metrics for evaluation of multi-object tracking algorithms

Ángel F. García-Fernández, Jinhao Gu, Lennart Svensson, Yuxuan Xia, Jan Krejčí, Oliver Kost, Ondřej Straka

Comments Matlab code of GOSPA and T-GOSPA q-metrics is provided at https://github.com/Agarciafernandez/MTT. Python code of the T-GOSPA q-metric is provided at https://github.com/Agarciafernandez/T-GOSPA-metric-python

2506.16494 2026-04-27 cs.LG eess.SP

Manifold Learning for Personalized and Label-Free Detection of Cardiac Arrhythmias

Amir Reza Vazifeh, Jason W. Fleischer

详情

英文摘要

Electrocardiograms (ECGs) provide non-invasive measurements of heart activity and are established tools for detecting cardiac arrhythmias. Although supervised machine learning has emerged as a promising approach for automated heartbeat classification, substantial variations in ECG signals across individuals and leads, combined with inconsistent labeling standards and dataset biases, make it difficult to develop generalizable models. Dimensionality reduction maps high-dimensional data into a lower-dimensional space while preserving the underlying structure, enabling visualization and pattern discovery. Conventional methods, e.g., principal component analysis, prioritize large variances and typically overlook subtle yet clinically relevant patterns. Here, we show that nonlinear dimensionality reduction (NLDR) algorithms, e.g., t-SNE and UMAP, can identify medically relevant features in ECG signals without pretraining or prior information. Using the MIT-BIH Arrhythmia Database, we show that: a) applying NLDR to a mixed population of heartbeats reveals inter-individual morphological differences, as signals from the same person cluster together in latent spaces; and b) applying NLDR to heartbeats of a single individual separates normal beats from arrhythmias into distinct clusters, identifiable in an unsupervised manner. To our knowledge, this is the first systematic evaluation of NLDR for unsupervised arrhythmia detection. Both UMAP and t-SNE achieved trustworthiness scores >=0.95, indicating that local neighborhoods are well preserved in the embedding. Classification on 2D embeddings outperforms the original high-dimensional space, with a k-NN classifier discriminating individual recordings with >=80% accuracy and identifying arrhythmias with median accuracy >=98% and median F1-score >=85%. These results show that NLDR holds much promise for cardiac monitoring and personalized healthcare.

URL PDF HTML ☆

赞 0 踩 0

2506.07298 2026-04-27 cs.LG cs.AI

Pre-trained Large Language Models Learn Hidden Markov Models In-context

Yijia Dai, Zhaolin Gao, Yahya Sattar, Sarah Dean, Jennifer J. Sun

Comments NeurIPS 2025

2506.05038 2026-04-27 cs.CL

Toward Automated Robustness Evaluation of Mathematical Reasoning

Yutao Hou, Zeguan Xiao, Fei Yu, Yihan Jiang, Ma Shuguang, Zhaoqian Dai, Hailiang Huang, Yun Chen, Guanhua Chen

Comments Accepted by Findings of ACL2026

2505.20662 2026-04-27 cs.AI

AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage

Xuanle Zhao, Zilin Sang, Yuxuan Li, Qi Shi, Weilun Zhao, Shuo Wang, Duzhen Zhang, Xu Han, Zhiyuan Liu, Maosong Sun

Comments Accepted by ACL 2026 Main

2505.17639 2026-04-27 cs.LG

PreMoE: Proactive Inference for Efficient Mixture-of-Experts

Zehua Pei, Ying Zhang, Hui-Ling Zhen, Tao Yuan, Xianzhi Yu, Zhenhua Dong, Sinno Jialin Pan, Mingxuan Yuan, Bei Yu

2505.14990 2026-04-27 cs.CL

Language Specific Knowledge: Do Models Know Better in X than in English?

Ishika Agarwal, Nimet Beyza Bozdag, Nisval Patel, Dilek Hakkani-Tür

2505.14351 2026-04-27 cs.SD cs.AI cs.CL eess.AS

FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation

Yutong Liu, Ziyue Zhang, Ban Ma-bao, Yuqing Cai, Yongbin Yu, Renzeng Duojie, Xiangxiang Wang, Fan Gao, Cheng Huang, Nyima Tashi

Comments This paper has been substantially restructured using a revised writing style. In addition, considering that maintaining two preprints simultaneously may not fully align with academic publishing ethics, we have withdrawn the previous version. Please refer to the updated manuscript at: arXiv:509.18060

2505.14234 2026-04-27 cs.LG cs.AI

Fast, close, non-singular and property-preserving approximations of entropic measures

Illia Horenko, Davide Bassetti, Lukáš Pospíšil

Comments 17 pages, 4 figures

2505.13527 2026-04-27 cs.CL cs.AI

Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression

Jingyu Peng, Maolin Wang, Nan Wang, Jiatong Li, Yuchen Li, Yuyang Ye, Wanyu Wang, Pengyue Jia, Kai Zhang, Xiangyu Zhao

2505.13255 2026-04-27 cs.RO

Policy Contrastive Decoding for Robotic Foundation Models

Shihan Wu, Xu Luo, Ji Zhang, Junlin Xie, Jingkuan Song, Heng Tao Shen, Lianli Gao

Comments ICLR 2026. Project website: https://koorye.github.io/PCD/

2505.01380 2026-04-27 cs.RO cs.SY eess.SY

An Efficient Real-Time Planning Method for Swarm Robotics Based on an Optimal Virtual Tube

Pengda Mao, Shuli Lv, Chen Min, Zhaolong Shen, Quan Quan

Comments 18 pages, 21 figures

2504.06148 2026-04-27 cs.CV

V-MAGE: A Game Evaluation Framework for Assessing Vision-Centric Capabilities in Multimodal Large Language Models

Xiangxi Zheng, Linjie Li, Zhengyuan Yang, Ping Yu, Alex Jinpeng Wang, Rui Yan, Yuan Yao, Lijuan Wang

2503.21435 2026-04-27 cs.AI

Graph-to-Vision: Multi-graph Understanding and Reasoning using Vision-Language Models

Qihang Ai, Ruizhou Li, Menghui Wang, Haiyun Jiang

Comments 26 pages, 23 figures

2503.12507 2026-04-27 cs.CV

Segment Any-Quality Images with Generative Latent Space Enhancement

Guangqian Guo, Yong Guo, Xuehui Yu, Wenbo Li, Yaoxing Wang, Shan Gao

Comments Accepted by CVPR2025

2503.05231 2026-04-27 cs.RO cs.AI

Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction

Shuo Jiang, Haonan Li, Ruochen Ren, Yanmin Zhou, Zhipeng Wang, Bin He

Comments 8 pages, 5 figures, Submitted to IEEE Robotics and Automation Letters (RAL)

2502.16994 2026-04-27 cs.LG cs.AI cs.CL

FADE: Why Bad Descriptions Happen to Good Features

Bruno Puri, Aakriti Jain, Elena Golimblevskaia, Patrick Kahardipraja, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

2502.03698 2026-04-27 cs.LG cs.CR cs.RO

How Vulnerable Is My Learned Policy? Universal Adversarial Perturbation Attacks On Modern Behavior Cloning Policies

Akansha Kalra, Basavasagar Patil, Guanhong Tao, Daniel S. Brown

2502.00955 2026-04-27 cs.CL

Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

Wentao Shi, Zichun Yu, Fuli Feng, Xiangnan He, Chenyan Xiong

Comments Accepted by ACL 2026 Main;

2501.16839 2026-04-27 cs.LG math.PR

Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans

Christian Wald, Gabriele Steidl

2501.07557 2026-04-27 cs.SD cs.CY eess.AS physics.soc-ph

Decoding Musical Evolution Through Network Science

Niccolo' Di Marco, Edoardo Loru, Alessandro Galeazzi, Matteo Cinelli, Walter Quattrociocchi

2412.19780 2026-04-27 cs.LG quant-ph

Tensor Network Estimation of Distribution Algorithms

John Gardiner, Javier Lopez-Piqueres