arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.26530 2026-04-30 quant-ph cs.AI gr-qc

Fundamental Physics, Existential Risks and Human Futures

Adrian Kent

Comments Invited article for Phil. Trans. Roy. Soc. for the 25th anniversary of their millennium volume

2604.26527 2026-04-30 cs.HC cs.RO cs.SY eess.SY

Persona-Based Process Design for Assistive Human-Robot Workplaces for Persons with Disabilities

Nils Mandischer, Daria Eckert and, Lars Mikelsons

Comments Accepted at IEEE International Conference on Human-Machine Systems (ICHMS), Singapore, 2026

2604.26511 2026-04-30 cs.CR cs.AI

Tatemae: Detecting Alignment Faking via Tool Selection in LLMs

Matteo Leonesi, Francesco Belardinelli, Flavio Corradini, Marco Piangerelli

2604.26505 2026-04-30 cs.CR cs.LG

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch

Hanna Foerster, Ilia Shumailov, Cheng Zhang, Yiren Zhao, Jamie Hayes, Robert Mullins

Comments 11 pages, 4 figures, 4 tables

2604.26494 2026-04-30 cs.HC cs.AI cs.CY cs.ET

Culturally Aware GenAI Risks for Youth: Perspectives from Youth, Parents, and Teachers in a Non-Western Context

Aljawharah Alzahrani, Tory Park, Tanusree Sharma

2604.26492 2026-04-30 eess.IV cs.CV cs.IT eess.SP math.IT

Adaptive Transform Coding for Semantic Compression

Andriy Enttsel, Vincent Corlay

Comments 7 pages, 4 figures

2604.26479 2026-04-30 stat.ME cs.LG

Recipes for Calibration Checks in Safety-Critical Applications

Romeo Valentin

Comments 36 pages, 22 figures. Manuscript prepared with Typst

详情

英文摘要

Safety-critical prediction systems, such as autonomous vehicles, weather forecasters, and medical monitors, commonly rely on probabilistic forecasters. These forecasters make predictions about possible future outcomes, and their quality and robustness needs to be validated and certified. Often, only accuracy -- the mean of the predictions -- is evaluated against true outcomes. However, for safety-critical scenarios and decision making under uncertainty, the full distributional properties of the forecasts should be checked: do the observed prediction errors actually follow the forecasted probability distributions? To this end, we introduce a framework for calibration checks: statistical tests that validate distributional properties of forecasts when measured over many samples. In order to support ease-of-use in real-world operations, these checks produce a single accept/reject decision for data collected from a forecaster. This contrasts typical calibration calculations which produce one or multiple continuous calibration scores and require expertise to implement in a validation workflow. We further support operationalization by introducing modifications to calibration testing that (a) reject only overconfident predictions, allowing for pessimistic or cautious predictions in safety-critical settings, and (b) tolerate small, operationally acceptable deviations even for large numbers of validation samples. We organize the calibration checking process into a modular pipeline comprising four steps: (i) the data model, (ii) the chosen metric, (iii) the hypothesis formulation, and (iv) the testing procedure. Each step consists of independently swappable components, thereby supporting a large variety of possible use-cases and trade-offs. We demonstrate the applicability of the framework on two complementary example problems, weather forecasting and robot pose estimation.

URL PDF HTML ☆

赞 0 踩 0

2604.26472 2026-04-30 math.CO cs.LG

Order-Sensitive Sequential Interventions on Ideal Lattices

Dmitry Pasechnyuk-Vilensky

Comments 18 pages

2604.26413 2026-04-30 quant-ph cs.AI cs.CR

Quantum Gatekeeper: Multi-Factor Context-Bound Image Steganography with VQC Based Key Derivation on Quantum Hardware

Sahil Tomar, Sandeep Kumar

2604.26394 2026-04-30 cs.CR cs.AI

SecMate: Multi-Agent Adaptive Cybersecurity Troubleshooting with Tri-Context Personalization

Yair Meidan, Omri Haller, Yulia Moshan, Shahaf David, Dudu Mimran, Yuval Elovici, Asaf Shabtai

2604.26388 2026-04-30 cs.DC cs.LG

SplitFT: An Adaptive Federated Split Learning System For LLMs Fine-Tuning

Yimeng Shan, Zhaorui Zhang, Sheng Di, Yu Liu, Xiaoyi Lu, Benben Liu

2604.26366 2026-04-30 stat.ML cs.LG

Probabilistic data quality assessment for structural monitoring data via outlier-resistant conditional diffusion model

Qi Li, Yong Huang, Hui Li

Comments 43 pages, 15 figures and 2 tables

2604.26349 2026-04-30 cs.DS cs.LG

Asymptotically Robust Learning-Augmented Algorithms for Preemptive FIFO Buffer Management

Wen-Han Hsieh, Ya-Chun Liang

2604.26347 2026-04-30 eess.AS cs.CL

The False Resonance: A Critical Examination of Emotion Embedding Similarity for Speech Generation Evaluation

Yun-Shao Tsai, Yi-Cheng Lin, Huang-Cheng Chou, Tzu-Wen Hsu, Yun-Man Hsu, Chun Wei Chen, Shrikanth Narayanan, Hung-yi Lee

Comments Submitted to Interspeech 2026

2604.26313 2026-04-30 cs.CR cs.LG

VulStyle: A Multi-Modal Pre-Training for Code Stylometry-Augmented Vulnerability Detection

Chidera Biringa, Ajmal Abbas, Vishnu Selvaraj, Gokhan Kul

Comments 12 pages, 2 figures. Accepted at the 56th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2026)

2604.26281 2026-04-30 eess.AS cs.LG cs.SD

DiffAnon: Diffusion-based Prosody Control for Voice Anonymization

Ismail Rasim Ulgen, Zexin Cai, Nicholas Andrews, Philipp Koehn, Berrak Sisman

Comments Submitted to Interspeech 2026

2604.26274 2026-04-30 cs.CR cs.AI

Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents

Hung Dang

2604.26247 2026-04-30 cs.IR cs.AI

TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation

Wei Yang, Rui Zhong, Zihan Lin, Xiaodan Wang, Cheng Chen, Huan Ren, Yao Hu

2604.26235 2026-04-30 cs.CR cs.AI cs.CL

LATTICE: Evaluating Decision Support Utility of Crypto Agents

Aaron Chan, Tengfei Li, Tianyi Xiao, Angela Chen, Junyi Du, Xiang Ren

Comments 15 pages, 3 figures, 9 tables

2604.26219 2026-04-30 cs.CR cs.LG

eDySec: A Deep Learning-based Explainable Dynamic Analysis Framework for Detecting Malicious Packages in PyPI Ecosystem

Sk Tanzir Mehedi, Raja Jurdak, Chadni Islam, Abu Bakar Siddique Mahi, Gowri Ramachandran

Comments 12 Pages, 11 Figures, and 5 Tables

详情

英文摘要

The security of open-source software repositories is increasingly threatened by next-gen software supply chain attacks. These attacks include multiphase malware execution, remote access activation, and dynamic payload generation. Traditional Machine Learning (ML) detectors struggle to detect these attacks due to the high-dimensional and sparse nature of dynamic behavioral data, including system calls, network traffic, directory access patterns, and dependency logs. As a result, these data characteristics degrade the performance, stability, and explainability of ML models. These challenges have made Deep Learning (DL) a promising alternative, given its success across various domains and its potential for modeling complex patterns. This paper presents eDySec, a DL-based efficient, stable, and explainable framework for dynamic behavioral analysis to detect malicious packages. Using the QUT-DV25 dataset, which captures both install-time and post-installation behaviors of packages, we evaluate DL models and investigate feature sets to identify the most discriminative attributes for enabling efficient malicious package detection. Additionally, model stability analysis and explainable AI techniques are incorporated into the detection pipeline to enable stable, and transparent interpretations of model decisions. Experimental results demonstrate that eDySec significantly outperforms the state-of-the-art frameworks. Specifically, it halves feature dimensionality while lowering false positives by 82% and false negatives by 79%. It also improves accuracy by 3%, achieves near-perfect stability, and maintains an inference latency of 170ms per package. Further analysis reveals that feature and model selection play a critical role, as certain combinations degrade performance. Ultimately, this study advances the understanding of the strengths and limitations of dynamic analysis against next-gen attacks.

URL PDF HTML ☆

赞 0 踩 0

2604.26213 2026-04-30 quant-ph cs.AI

Qvine: Vine Structured Quantum Circuits for Loading High Dimensional Distributions

David Quiroga, Hannes Leipold, Bibhas Adhikari

2604.26190 2026-04-30 cs.DS cs.CL

Flashback: A Reversible Bilateral Run-Peeling Decomposition of Strings

Thomas Konstantinovsky, Gur Yaari

2604.26180 2026-04-30 cs.DB cs.AI cs.CL

Evergreen: Efficient Claim Verification for Semantic Aggregates

Alexander W. Lee, Benjamin Han, Shayak Sen, Sam Yeom, Ugur Cetintemel, Anupam Datta

详情

英文摘要

With recent semantic query processing engines, semantic aggregation has become a primitive operator, enabling the reduction of a relation into a natural language aggregate using an LLM. However, the resulting semantic aggregate may contain claims that are not grounded in the underlying relation. Verifying such claims is challenging: they often involve quantifiers, groupings, and comparisons over relations that far exceed LLM context windows and require a costly combination of semantic and symbolic processing. We present Evergreen, a system that recasts claim verification as a semantic query processing task with tailored optimizations and provenance capture. Evergreen compiles each claim into a declarative semantic verification query and executes it on the same engine that produced the aggregate. To reduce cost and latency, Evergreen avoids unnecessary LLM calls through verification-aware optimizations (early stopping, relevance sorting, and estimation with confidence sequences) and general-purpose optimizations for semantic queries (operator fusion, similarity filtering, and prompt caching). Each verdict is accompanied by citations that identify a minimal set of tuples justifying the result, with semantics based on semiring provenance for first-order logic. On a benchmark of real-world restaurant review datasets reflecting production-inspired workloads, Evergreen achieves excellent verification quality (F1 = 1.00) with a strong LLM while reducing cost by 3.2x and latency by 4.0x compared to unoptimized verification. Even with a significantly weaker LLM, Evergreen outperforms a strong LLM-as-a-judge baseline in F1 at 48x lower cost and 2.3x lower latency. Relative to a retrieval-augmented agent, Evergreen compares favorably in F1 and latency with similar cost when both use a strong LLM; yet, with a much weaker LLM, it achieves the same F1 at 63x lower cost and 4.2x lower latency.

URL PDF HTML ☆

赞 0 踩 0

2604.26172 2026-04-30 eess.SY cs.AI cs.LG cs.SY math.OC stat.ML

Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control

Ankur Kamboj, Biswadip Dey, Vaibhav Srivastava

2604.26160 2026-04-30 stat.ME cs.CE cs.LG cs.MS stat.CO

Fitting Large Nonlinear Mixed Effects Models Using Variational Expectation Maximization

Mohamed Tarek, Pedro Afonso

2604.26148 2026-04-30 cs.HC cs.CL

Beyond Screenshots: Evaluating VLMs' Understanding of UI Animations

Chen Liang, Xirui Jiang, Naihao Deng, Eytan Adar, Anhong Guo

Comments Accepted at ACL 2026 Findings

2604.26143 2026-04-30 physics.comp-ph cond-mat.mtrl-sci cs.LG

Mixture of Experts Framework in Machine Learning Interatomic Potentials for Atomistic Simulations

Gabriel de Miranda Nascimento, Marc L. Descoteaux, Laura Zichi, Chuin Wei Tan, William C. Witt, Nicola Molinari, Sriteja Mantha, Daniil Kitchaev, Mordechai Kornbluth, Karim Gadelrab, Charles Tuffile, Boris Kozinsky

Comments 10 pages, 5 figures

2604.26142 2026-04-30 cs.SE cs.AI

ImproBR: Bug Report Improver Using LLMs

Emre Furkan Akyol, Mehmet Dedeler, Eray Tüzün

2604.26136 2026-04-30 eess.AS cs.CL

One Voice, Many Tongues: Cross-Lingual Voice Cloning for Scientific Speech

Amanuel Gizachew Abebe, Yasmin Moslem

Comments IWSLT 2026

2604.26132 2026-04-30 eess.SP cs.LG

Sparse Graph Learning from Sparse Data via Fiedler Number Maximization

Bahar Oveisgharan, Gene Cheung, Andrew Eckford