arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2507.12988 2026-04-02 cs.CV cs.LG

Variance-Based Pruning for Accelerating and Compressing Trained Networks

Uranik Berisha, Jens Mehnert, Alexandru Paul Condurache

Comments Accepted as Oral at ICCV'25 (IEEE/CVF International Conference on Computer Vision)

详情

英文摘要

Increasingly expensive training of ever larger models such as Vision Transfomers motivate reusing the vast library of already trained state-of-the-art networks. However, their latency, high computational costs and memory demands pose significant challenges for deployment, especially on resource-constrained hardware. While structured pruning methods can reduce these factors, they often require costly retraining, sometimes for up to hundreds of epochs, or even training from scratch to recover the lost accuracy resulting from the structural modifications. Maintaining the provided performance of trained models after structured pruning and thereby avoiding extensive retraining remains a challenge. To solve this, we introduce Variance-Based Pruning, a simple and structured one-shot pruning technique for efficiently compressing networks, with minimal finetuning. Our approach first gathers activation statistics, which are used to select neurons for pruning. Simultaneously the mean activations are integrated back into the model to preserve a high degree of performance. On ImageNet-1k recognition tasks, we demonstrate that directly after pruning DeiT-Base retains over 70% of its original performance and requires only 10 epochs of fine-tuning to regain 99% of the original accuracy while simultaneously reducing MACs by 35% and model size by 36%, thus speeding up the model by 1.44x. The code is available at: https://github.com/boschresearch/variance-based-pruning

URL PDF HTML ☆

赞 0 踩 0

2507.11737 2026-04-02 cs.AI

Auto-Formulating Dynamic Programming Problems with Large Language Models

Chenyu Zhou, Jingyuan Yang, Linwei Xin, Yitian Chen, Ziyan He, Dongdong Ge

2506.23053 2026-04-02 cs.LG

Double-Diffusion: ODE-Prior Accelerated Diffusion Models for Spatio-Temporal Graph Forecasting

Hanlin Dong, Arian Prabowo, Hao Xue, Ao Shuang, Tianyi Zhou, Flora D. Salim

2506.21997 2026-04-02 cs.LG cs.AI

Binned semiparametric Bayesian networks for efficient kernel density estimation

Rafael Sojo, Javier Díaz-Rozo, Concha Bielza, Pedro Larrañaga

Comments Major revision after reviewer comments. Title changed based on reviewer suggestion. Improved introduction, complexity analysis and experiments. Submitted to Information Sciences

2506.19846 2026-04-02 cs.AI

HiMA-Ecom: Enabling Joint Training of Hierarchical Multi-Agent E-commerce Assistants

Junxing Hu, Ai Han, Haolan Zhan, Pu Wei, Zhiqian Zhang, Yuhang Guo, Jiawei Lu, Zhen Chen, Haoran Li, Zicheng Zhang

Comments 39 pages, 10 figures, under review

2506.18919 2026-04-02 cs.CL cs.AI cs.CV

MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection

Hexiang Gu, Qifan Yu, Yuan Liu, Zikang Li, Saihui Hou, Jian Zhao, Zhaofeng He

2506.08915 2026-04-02 cs.CV cs.AI

Two-stage Vision Transformers and Hard Masking offer Robust Object Representations

Ananthu Aniraj, Cassio F. Dantas, Dino Ienco, Diego Marcos

Comments Accepted at ICPR 2026

2506.04822 2026-04-02 cs.CL

Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms

Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Raqib Chowdhury, Fajri Koto

Comments Accepted at AIED 2026

2506.03753 2026-04-02 cs.CV

HUMOF: Human Motion Forecasting in Interactive Social Scenes

Caiyi Sun, Yujing Sun, Xiao Han, Zemin Yang, Jiawei Liu, Xinge Zhu, Siu Ming Yiu, Yuexin Ma

Comments Accepted by ICLR 2026

2506.02768 2026-04-02 cs.RO cs.SY eess.SY

Geometric Visual Servo Via Optimal Transport

Ethan Canzini, Simon Pope, Ashutosh Tiwari

Comments 19 pages, 5 figures. Accepted to Control Engineering Practice

2505.23459 2026-04-02 cs.LG

On Global Convergence Rates for Federated Softmax Policy Gradient under Heterogeneous Environments

Safwan Labbi, Paul Mangold, Daniil Tiapkin, Eric Moulines

Comments Preprint

2505.22337 2026-04-02 cs.CV

Learning to Infer Parameterized Representations of Plants from 3D Scans

Samara Ghrer, Christophe Godin, Stefanie Wuhrer

2505.21505 2026-04-02 cs.CL cs.AI

How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Shimao Zhang, Zhejian Lai, Xiang Liu, Shuaijie She, Xiao Liu, Yeyun Gong, Shujian Huang, Jiajun Chen

Comments AAAI 2026 (Oral)

2505.19715 2026-04-02 cs.CL cs.AI cs.LG

Graceful Forgetting in Generative Language Models

Chunyang Jiang, Chi-min Chan, Yiyang Cai, Yulong Liu, Wei Xue, Yike Guo

Comments 8 pages, 6 figures. EMNLP 2025

2505.19574 2026-04-02 cs.RO cs.AI cs.LG math.OC

Situationally-Aware Dynamics Learning

Alejandro Murillo-Gonzalez, Lantao Liu

详情

DOI: 10.1177/02783649261431863
Journal ref: The International Journal of Robotics Research (IJRR) 2026

英文摘要

Autonomous robots operating in complex, unstructured environments face significant challenges due to latent, unobserved factors that obscure their understanding of both their internal state and the external world. Addressing this challenge would enable robots to develop a more profound grasp of their operational context. To tackle this, we propose a novel framework for online learning of hidden state representations, with which the robots can adapt in real-time to uncertain and dynamic conditions that would otherwise be ambiguous and result in suboptimal or erroneous behaviors. Our approach is formalized as a Generalized Hidden Parameter Markov Decision Process, which explicitly models the influence of unobserved parameters on both transition dynamics and reward structures. Our core innovation lies in learning online the joint distribution of state transitions, which serves as an expressive representation of latent ego- and environmental-factors. This probabilistic approach supports the identification and adaptation to different operational situations, improving robustness and safety. Through a multivariate extension of Bayesian Online Changepoint Detection, our method segments changes in the underlying data generating process governing the robot's dynamics. The robot's transition model is then informed with a symbolic representation of the current situation derived from the joint distribution of latest state transitions, enabling adaptive and context-aware decision-making. To showcase the real-world effectiveness, we validate our approach in the challenging task of unstructured terrain navigation, where unmodeled and unmeasured terrain characteristics can significantly impact the robot's motion. Extensive experiments in both simulation and real world reveal significant improvements in data efficiency, policy performance, and the emergence of safer, adaptive navigation strategies.

URL PDF HTML ☆

赞 0 踩 0

2505.17899 2026-04-02 cs.LG

Universal Domain Adaptation Benchmark for Time Series Data Representation

Romain Mussard, Fannia Pacheco, Maxime Berar, Gilles Gasso, Paul Honeine

2505.17870 2026-04-02 cs.CL

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Shaina Raza, Rizwan Qureshi, Azib Farooq, Marcelo Lotif, Aman Chadha, Deval Pandya, Christos Emmanouilidis

2505.17760 2026-04-02 cs.LG cs.AI

But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors

Leon Eshuijs, Archie Chaudhury, Alan McBeth, Ethan Nguyen

2505.16619 2026-04-02 cs.AI q-bio.OT

Open and Sustainable AI: challenges, opportunities and the road ahead in the life sciences (October 2025 -- Version 2)

Gavin Farrell, Eleni Adamidi, Rafael Andrade Buono, Mihail Anton, Omar Abdelghani Attafi, Salvador Capella Gutierrez, Emidio Capriotti, Leyla Jael Castro, Davide Cirillo, Lisa Crossman, Christophe Dessimoz, Alexandros Dimopoulos, Raul Fernandez-Diaz, Styliani-Christina Fragkouli, Carole Goble, Wei Gu, John M. Hancock, Alireza Khanteymoori, Tom Lenaerts, Fabio G. Liberante, Peter Maccallum, Alexander Miguel Monzon, Magnus Palmblad, Lucy Poveda, Ovidiu Radulescu, Denis C. Shields, Shoaib Sufi, Thanasis Vergoulis, Fotis Psomopoulos, Silvio C. E. Tosatto

Comments 1 PDF, 24 Pages, 2 figures within. Co-corresponding authors: Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece and Department of Biomedical Sciences, University of Padova, Padova, Italy. E-mails: fpsom[@]certh.gr, silvio.tosatto[@]unipd.it

2505.15808 2026-04-02 cs.LG cs.AI math.PR stat.AP stat.ML

Neural Conditional Transport Maps

Carlos Rodriguez-Pardo, Leonardo Chiani, Emanuele Borgonovo, Massimo Tavoni

Comments Published in Transactions on Machine Learning Research

2505.15160 2026-04-02 cs.CV

Lossless Token Merging Even Without Fine-Tuning in Vision Transformers

Jaeyeon Lee, Dong-Wan Choi

Comments ECAI 2025

2505.14222 2026-04-02 cs.SD cs.GR cs.MM eess.AS

MATHDance: Mamba-Transformer Architecture with Uniform Tokenization for High-Quality 3D Dance Generation

Kaixing Yang, Xulong Tang, Ziqiao Peng, Yuxuan Hu, Xiangyue Zhang, Puwei Wang, Hongyan Liu, Jun He, Zhaoxin Fan

2505.12189 2026-04-02 cs.AI cs.CL

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering

Marco Valentino, Geonhee Kim, Dhairya Dalal, Zhixue Zhao, André Freitas

Comments AAAI 2026

2505.08614 2026-04-02 cs.CV

WaveGuard: Robust Deepfake Detection and Source Tracing via Dual-Tree Complex Wavelet and Graph Neural Networks

Ziyuan He, Zhiqing Guo, Liejun Wang, Gaobo Yang, Yunfeng Diao, Dan Ma

Comments 14 pages, 6 figures, 7 tables

2505.04738 2026-04-02 cs.LG

SetONet: A Set-Based Operator Network for Solving PDEs with Variable-Input Sampling

Stepan Tretiakov, Xingjian Li, Krishna Kumar

2505.04645 2026-04-02 cs.CL cs.LG stat.CO

ChatGPT for automated grading of short answer questions in mechanical ventilation

Tejas Jade, Alex Yartsev

详情

DOI: 10.11157/fohpe-vol27iss1id958
Journal ref: Focus on Health Professional Education: A Multi-Professional Journal, 27(1), 90-104 (2026)

英文摘要

Standardised tests using short answer questions (SAQs) are common in postgraduate education. Large language models (LLMs) simulate conversational language and interpret unstructured free-text responses in ways aligning with applying SAQ grading rubrics, making them attractive for automated grading. We evaluated ChatGPT 4o to grade SAQs in a postgraduate medical setting using data from 215 students (557 short-answer responses) enrolled in an online course on mechanical ventilation (2020--2024). Deidentified responses to three case-based scenarios were presented to ChatGPT with a standardised grading prompt and rubric. Outputs were analysed using mixed-effects modelling, variance component analysis, intraclass correlation coefficients (ICCs), Cohen's kappa, Kendall's W, and Bland--Altman statistics. ChatGPT awarded systematically lower marks than human graders with a mean difference (bias) of -1.34 on a 10-point scale. ICC values indicated poor individual-level agreement (ICC1 = 0.086), and Cohen's kappa (-0.0786) suggested no meaningful agreement. Variance component analysis showed minimal variability among the five ChatGPT sessions (G-value = 0.87), indicating internal consistency but divergence from the human grader. The poorest agreement was observed for evaluative and analytic items, whereas checklist and prescriptive rubric items had less disagreement. We caution against the use of LLMs in grading postgraduate coursework. Over 60% of ChatGPT-assigned grades differed from human grades by more than acceptable boundaries for high-stakes assessments.

URL PDF HTML ☆

赞 0 踩 0

2505.00213 2026-04-02 cs.RO math.OC

A Player Selection Network for Scalable Game-Theoretic Prediction and Planning

Tianyu Qiu, Eric Ouano, Fernando Palafox, Christian Ellis, David Fridovich-Keil

2504.16624 2026-04-02 cs.LG cs.FL

A Detailed Account of Compositional Automata Learning through Alphabet Refinement

Leo Henry, Thomas Neele, Mohammad Reza Mousavi, Matteo Sammartino

Comments Extended version of "Compositional Active Learning of Synchronizing Systems Through Automated Alphabet Refinement" (CONCUR 2025, DOI: 10.4230/LIPIcs.CONCUR.2025.20), submitted to the CONCUR 2025 special issue of Logical Methods in Computer Science. Incorporates and extends results from "Compositional Automata Learning of Synchronous Systems" (FASE 2023, DOI: 10.1007/978-3-031-30826-0_3)

2503.22244 2026-04-02 cs.LG math.OC

Analysis of On-policy Policy Gradient Methods under the Distribution Mismatch

Weizhen Wang, Jianping He, Xiaoming Duan

2503.11175 2026-04-02 cs.CV cs.AI

Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement

Yini Li, Nantheera Anantrasirichai