arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.22629 2026-04-27 cs.CR cs.LG

Detecting Concept Drift in Evolving Malware Families Using Rule-Based Classifier Representations

Tomáš Kalný, Martin Jureček, Mark Stamp

详情

英文摘要

This work proposes a structural approach to concept drift detection in malware classification using decision tree rulesets. Classifiers are trained across temporal windows on the EMBER2024 dataset, and drift is quantified by comparing extracted rule representations using feature importance, prediction agreement, activation stability, and coverage metrics. These metrics are correlated with both accuracy degradation and data distribution shift as complementary drift indicators. The approach is evaluated across six malware families using fixed-interval and clustering-based windowing in family-vs-benign and family-vs-family settings, and compared against RIPPER and Transcendent baselines. Results show that fixed two-month windowing with feature-level Pearson correlation is the most reliable configuration, being the only one where all family pairs produce positive drift-accuracy correlations. The methods are complementary - no single approach dominates across all pairs.

URL PDF HTML ☆

赞 0 踩 0

2604.22627 2026-04-27 quant-ph cs.CC cs.IT cs.LG math.IT physics.comp-ph

The Exact Replica Threshold for Nonlinear Moments of Quantum States

Shuai Zeng

2604.22601 2026-04-27 cs.SE cs.AI

From Natural Language to Verified Code: Toward AI Assisted Problem-to-Code Generation with Dafny-Based Formal Verification

Md Erfan, Md Kamal Hossain Chowdhury, Ahmed Ryan, Md Rayhanur Rahman

Comments 16 pages

2604.22580 2026-04-27 stat.ML cs.LG

Explanation of Dynamic Physical Field Predictions using WassersteinGrad: Application to Autoregressive Weather Forecasting

Younes Essafouri, Laure Raynaud, Luciano Drozda, Laurent Risser

2604.22579 2026-04-27 eess.IV cs.CV cs.LG

Useful nonrobust features are ubiquitous in biomedical images

Coenraad Mouton, Randle Rabe, Niklas C. Koser, Nicolai Krekiehn, Christopher Hansen, Jan-Bernd Hövener, Claus-C. Glüer

Comments Accepted at The IEEE International Symposium on Biomedical Imaging (ISBI), 2026

2604.22569 2026-04-27 cs.CR cs.LG

Adversarial Co-Evolution of Malware and Detection Models: A Bilevel Optimization Perspective

Olha Jurečková, Martin Jureček, Matouš Kozák, Róbert Lórencz

2604.22557 2026-04-27 eess.IV cs.CV cs.LG

Are Natural-Domain Foundation Models Effective for Accelerated Cardiac MRI Reconstruction?

Anam Hashmi, Mayug Maniparambil, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

Comments Accepted to CVPRW 2026

2604.22550 2026-04-27 cs.CR cs.AI

ArmSSL: Adversarial Robust Black-Box Watermarking for Self-Supervised Learning Pre-trained Encoders

Yongqi Jiang, Yansong Gao, Boyu Kuang, Chunyi Zhou, Anmin Fu, Liquan Chen

详情

英文摘要

Self-supervised learning (SSL) encoders are invaluable intellectual property (IP). However, no existing SSL watermarking for IP protection can concurrently satisfy the following two practical requirements: (1) provide ownership verification capability under black-box suspect model access once the stolen encoders are used in downstream tasks; (2) be robust under adversarial watermark detection or removal, because the watermark samples form a distinguishable out-of-distribution (OOD) cluster. We propose ArmSSL, an SSL watermarking framework that assures black-box verifiability and adversarial robustness while preserving utility. For verification, we introduce paired discrepancy enlargement, enforcing feature-space orthogonality between the clean and its watermark counterpart to produce a reliable verification signal in black-box against the suspect model. For adversarial robustness, ArmSSL integrates latent representation entanglement and distribution alignment to suppress the OOD clustering. The former entangles watermark representations with clean representations (i.e., from non-source-class) to avoid forming a dense cluster of watermark samples, while the latter minimizes the distributional discrepancy between watermark and clean representations, thereby disguising watermark samples as natural in-distribution data. For utility, a reference-guided watermark tuning strategy is designed to allow the watermark to be learned as a small side task without affecting the main task by aligning the watermarked encoder's outputs with those of the original clean encoder on normal data. Extensive experiments across five mainstream SSL frameworks and nine benchmark datasets, along with end-to-end comparisons with SOTAs, demonstrate that ArmSSL achieves superior ownership verification, negligible utility degradation, and strong robustness against various adversarial detection and removal.

URL PDF HTML ☆

赞 0 踩 0

2604.22548 2026-04-27 stat.AP cs.LG

Multi-output Extreme Spatial Model for Complex Aircraft Production Systems

Cheolhei Lee, Xing Wang, Xiaowei Yue, Jianguo Wu

详情

DOI: 10.1287/msom.2023.0442

英文摘要

Problem definition: Data-driven models in machine learning have enabled efficient management of production systems. However, a majority of machine learning models are devoted to modeling the mean response or average pattern, which is inappropriate for studying abnormal extreme events that are often of primary interest in aircraft manufacturing. Since extreme events from heavy-tailed distributions give rise to prohibitive expenditures in system management, sophisticated extreme models are urgently needed to analyze complex extreme risks. Engineering applications of extreme models usually focus on individual extreme events, which is insufficient for complex systems with correlations. Methodology/results: We introduce an extreme spatial model for multi-output response control systems that efficiently captures the dynamics using a bilinear function on two spatial domains for control variables and measurement locations. Marginal parameter modeling and extremal dependence have been investigated. In addition, an efficient graph-assisted composite likelihood estimation and corresponding computational algorithms are developed to cope with high-dimensional outputs. The application to composite aircraft production shows that the proposed model enables comprehensive analyses with superior predictive performance on extreme events compared to canonical methods. Managerial implications: Our method shows how to use an extreme spatial model for predicting extreme events and managing extreme risks in complex production systems such as aircraft. This can help achieve better quality management and operation safety in aircraft production systems and beyond.

URL PDF HTML ☆

赞 0 踩 0

2604.22494 2026-04-27 stat.ML cs.LG

FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet

Thibault Pautrel, Florent Bouchard, Ammar Mian, Guillaume Ginolhac

2604.22492 2026-04-27 eess.IV cs.CV

MTT-Bench: Predicting Social Dominance in Mice via Multimodal Large Language Models

Yunquan Chen, Haoyu Chen

Comments 8 pages, 2 figures. Submitted to conference

2604.22491 2026-04-27 cs.HC cs.RO

Point & Grasp: Flexible Selection of Out-of-Reach Objects Through Probabilistic Cue Integration

Xuejing Luo, Hee-Seung Moon, Christian Holz, Antti Oulasvirta

Comments 19 pages, 13 figures, CHI 2026

2604.22438 2026-04-27 cs.CR cs.AI cs.CL

SSG: Logit-Balanced Vocabulary Partitioning for LLM Watermarking

Chenxi Gu, Xiaoning Du, John Grundy

Comments ACL 2026 Main Conference

2604.22422 2026-04-27 cs.DB cs.AI

How Hard is it to Decide if a Fact is Relevant to a Query?

Meghyn Bienvenu, Diego Figueira, Pierre Lafourcade

Comments Long version of KR'26 paper

2604.22391 2026-04-27 stat.ML cs.LG stat.CO stat.ME

Conformalized Super Learner

Zhanli Wu, Fabrizio Leisen, Miguel-Angel Luque-Fernandez, F. Javier Rubio

Comments R codes and data can be found at: https://github.com/ZWU-001/CSL

2604.22386 2026-04-27 stat.ML cs.LG

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Comments In NeurIPS 2016 Workshop on Adaptive and Scalable Nonparametric Methods in Machine Learning (ASNMML)

2604.22385 2026-04-27 stat.ML cs.LG

Pliable rejection sampling

Akram Erraqabi, Michal Valko, Alexandra Carpentier, Odalric-Ambrym Maillard

Comments In ICML 2016

2604.22351 2026-04-27 astro-ph.IM cs.CV

Thermal background reduction for mid-infrared imaging by low-rank background and sparse point-source modelling

R. A. R. Moens, A. G. M. Pietrow, B. Brandl, R. Van de Plas

详情

DOI: 10.1051/0004-6361/202555698

英文摘要

Mid-infrared astronomy from the ground faces critical challenges in accurately detecting and quantifying sources due to the dominant spatially and time-variable background noise. Moreover, chopping and nodding, the traditional methods for dealing with these background issues, will not be technically feasible on the next generation of extremely large telescopes. This limitation requires the development of novel computational methods for a robust background reduction. We present and evaluate a novel method named LOw-RAnk Background ELimination (LORABEL) to improve the sensitivity of mid-infrared astronomical observations, without the need for classical telescope nodding, source masking, or other overheads in observing time. We applied a low-rank background-reduction strategy to (1) data taken on the ground with the VISIR with synthetically injected sources, and (2) airborne data from SOFIA. We compared the performance of our new method to classical chopping and nodding techniques, and analysed the effect on source photometry and detection precision for different observational scenarios. In regimes with a low signal-to-noise ratio (S/N $<5$) in the ground-based VISIR data, LORABEL reduces variation in the photometric error with respect to chopping differences alone and even the classical chop-nod sequence, at the cost of introducing a bias. Secondly, we demonstrate that LORABEL increases detection precision in comparison to traditional background-reduction methods. For the SOFIA dataset, we achieve a $20-100$ fold decrease in mean background flux with respect to the traditional chop-nod method while preserving most of the source flux. Our findings suggest that LORABEL is applicable to a wider range of instrumental observation, that is, both ground-based and airborne, and it is a suitable tool in the context of faint-source detection.

URL PDF HTML ☆

赞 0 踩 0

2604.22338 2026-04-27 eess.IV cs.CV

Selective Depthwise Separable Convolution for Lightweight Joint Source-Channel Coding in Wireless Image Transmission

Ming Ye, Kui Cai, Cunhua Pan, Zhen Mei, Wanting Yang, Chunguo Li

Comments 5 pages, 6 figures, journal

2604.22306 2026-04-27 cs.LO cs.AI cs.PL

BLAST: Benchmarking LLMs with ASP-based Structured Testing

Manuel Alejandro Borroto Santana, Erica Coppolillo, Francesco Calimeri, Giuseppe Manco, Simona Perri, Francesco Ricca

2604.22293 2026-04-27 cs.AR cs.LG hep-ex

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference

Chang Sun, Zhiqiang Que, Bakhtiar Zadeh, Qibin Liu, Kevin H. Alvarez, Wayne Luk, Maria Spiropulu

2604.22287 2026-04-27 math.GR cs.RO math.DG math.DS physics.comp-ph

Closed Form Relations and Higher-Order Approximations of First and Second Derivatives of the Tangent Operator on SE(3)

Andreas Mueller

2604.22276 2026-04-27 eess.AS cs.SD

Audio Effect Estimation with DNN-Based Prediction and Search Algorithm

Youichi Okita, Haruhiro Katayose

Comments Accepted for ICASSP2026

2604.22256 2026-04-27 cs.SC cs.AI

A Probabilistic Framework for Hierarchical Goal Recognition

Chenyuan Zhang, Katherine Ip, Hamid Rezatofighi, Buser Say, Mor Vered

Comments Accepted by KR 2026

2604.22236 2026-04-27 cs.GT cs.HC cs.LG econ.EM

Algorithmic Feature Highlighting for Human-AI Decision-Making

Yifan Guo, Jann Spiess

2604.22230 2026-04-27 econ.GN cs.GT cs.LG q-fin.EC

On Benchmark Hacking in ML Contests: Modeling, Insights and Design

Xiaoyun Qiu, Yang Yu, Haifeng Xu

2604.22224 2026-04-27 cs.CE cs.LG physics.comp-ph

AI-Driven Performance-to-Design Generation and Optimization of Marine Propellers

Leah Chen, Keni Chih-Hua Wu, Boon Tat Chia, Xiuqing Xing, Jian Cheng Wong

Comments Accepted at OMAE 2026

详情

英文摘要

AI is increasingly used to accelerate engineering design by improving decision-making and shortening iteration cycles. Application to marine propeller design, however, remains challenging due to scarce training data and the lack of widely available pretrained models. We address this gap with a physics-based data generation pipeline and a generative-AI framework for direct performance-to-design generation tailored to marine propellers. First, we build a database of over 20,000 four- and five-bladed propeller geometries, each accompanied by simulated open-water performance curves. On top of this dataset, we develop a three-module design framework: (1) A Conditional Generation Model that proposes candidate geometries conditioned on design specifications such as target thrust, power, and diameter. (2) A Performance Prediction Model, implemented as a neural-network surrogate, that predicts thrust, torque, and efficiency in milliseconds, enabling rapid evaluation of generated designs. (3) A design refinement stage that applies evolutionary optimization to enforce practical constraints such as required thrust under power limits and bounds on blade-area ratio and thickness. Experimental results over a range of operating conditions show that the framework can generate hydrodynamically plausible propeller designs that match prescribed performance targets while substantially reducing design-iteration time relative to the traditional expert-guided refinement. Latent diffusion-based generator produces more diverse designs under the same conditions than the conditional variational autoencoder, suggesting a stronger capacity for design-space exploration with diffusion models. By coupling physics-based data synthesis with modular AI models, the proposed approach streamlines the propeller design cycle and reduces reliance on expensive high-fidelity simulations to final validation stages.

URL PDF HTML ☆

赞 0 踩 0

2604.22212 2026-04-27 eess.IV cs.CV cs.LG

Multimodal Diffusion to Mutually Enhance Polarized Light and Low Resolution EBSD Data

Harry Dong, Timofey Efimov, Megna Shah, Jeff Simmons, Sean Donegan, Marc De Graef, Yuejie Chi

2604.22209 2026-04-27 eess.AS cs.AI cs.CL cs.SD

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

Chunyu Qiang, Xiaopeng Wang, Kang Yin, Yuzhe Liang, Yuxin Guo, Teng Ma, Ziyu Zhang, Tianrui Wang, Cheng Gong, Yushen Chen, Ruibo Fu, Chen Zhang, Longbiao Wang, Jianwu Dang

Comments Accepted to ACL 2026 main conference (oral)

2604.22207 2026-04-27 cs.SE cs.AI cs.CL

Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations

Anna Arnaudo, Riccardo Coppola, Maurizio Morisio, Flavio Giobergia, Andrea Bioddo, Angelo Bongiorno, Luca Dadone

Comments 10 pages, 1 figure. This contribution will be published in the conference proceedings of EASE 2026 Conference (https://conf.researchr.org/home/ease-2026/prompt-se-2026)