arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.17691 2026-03-19 math.OC cs.LG

Stochastic set-valued optimization and its application to robust learning

Tommaso Giovannelli, Jingfu Tan, Luis Nunes Vicente

详情

英文摘要

In this paper, we develop a stochastic set-valued optimization (SVO) framework tailored for robust machine learning. In the SVO setting, each decision variable is mapped to a set of objective values, and optimality is defined via set relations. We focus on SVO problems with hyperbox sets, which can be reformulated as multi-objective optimization (MOO) problems with finitely many objectives and serve as a foundation for representing or approximating more general mapped sets. Two special cases of hyperbox-valued optimization (HVO) are interval-valued (IVO) and rectangle-valued (RVO) optimization. We construct stochastic IVO/RVO formulations that incorporate subquantiles and superquantiles into the objective functions of the MOO reformulations, providing a new characterization for subquantiles. These formulations provide interpretable trade-offs by capturing both lower- and upper-tail behaviors of loss distributions, thereby going beyond standard empirical risk minimization and classical robust models. To solve the resulting multi-objective problems, we adopt stochastic multi-gradient algorithms and select a Pareto knee solution. In numerical experiments, the proposed algorithms with this selection strategy exhibit improved robustness and reduced variability across test replications under distributional shift compared with empirical risk minimization, while maintaining competitive accuracy.

URL PDF HTML ☆

赞 0 踩 0

2603.17676 2026-03-19 q-bio.NC cs.AI cs.LG

Inhibitory normalization of error signals improves learning in neural circuits

Roy Henha Eyono, Daniel Levenstein, Arna Ghosh, Jonathan Cornford, Blake Richards

Comments 28 pages, 7 figures. Submitted to Neural Computation

2603.17673 2026-03-19 cs.CR cs.AI

Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards

Philipp Normann, Andreas Happe, Jürgen Cito, Daniel Arp

2603.17641 2026-03-19 cs.CE cs.AI cs.NA math.NA

Automated Grammar-based Algebraic Multigrid Design With Evolutionary Algorithms

Dinesh Parthasarathy, Wayne Mitchell, Arjun Gambhir, Harald Köstler, Ulrich Rüde

2603.15722 2026-03-19 cs.SE cs.AI cs.CE cs.DB cs.DL

A Framework and Prototype for a Navigable Map of Datasets in Engineering Design and Systems Engineering

H. Sinan Bank, Daniel R. Herber

Comments 10 pages, 3 figures, Submitted to ASME IDETC 2026-DAC22

2603.14225 2026-03-19 cs.HC cs.AI cs.SE

"I'm Not Reading All of That": Understanding Software Engineers' Level of Cognitive Engagement with Agentic Coding Assistants

Carlos Rafael Catalan, Lheane Marie Dizon, Patricia Nicole Monderin, Emily Kuang

Comments 7 pages, 5 figures, 2 tables, published and presented in CHI 2026 Workshop on Tools for Thought

2603.13780 2026-03-19 eess.AS cs.SD

Integrated Spoofing-Robust Automatic Speaker Verification via a Three-Class Formulation and LLR

Kai Tan, Lin Zhang, Ruiteng Zhang, Johan Rohdin, Leibny Paola García-Perera, Zexin Cai, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews

Comments Submitted to Interspeech 2026; put on arxiv based on requirement from Interspeech: "Interspeech no longer enforces an anonymity period for submissions." and "For authors that prefer to upload their paper online, a note indicating that the paper was submitted for review to Interspeech should be included in the posting."

2603.05542 2026-03-19 cs.DB cs.AI cs.ET cs.GR cs.HC cs.MM

Human-Data Interaction, Exploration, and Visualization in the AI Era: Challenges and Opportunities

Jean-Daniel Fekete, Yifan Hu, Dominik Moritz, Arnab Nandi, Senjuti Basu Roy, Eugene Wu, Nikos Bikakis, George Papastefanatos, Panos K. Chrysanthis, Guoliang Li, Lingyun Yu

2602.11262 2026-03-19 cond-mat.dis-nn cs.LG quant-ph

Unlearnable phases of matter

Tarun Advaith Kumar, Yijian Zou, Amir-Reza Negari, Roger G. Melko, Timothy H. Hsieh

Comments 28 pages, 9 figures. v2: Updated figure 4

2602.05089 2026-03-19 cs.CR cs.LG cs.RO

Beware Untrusted Simulators -- Reward-Free Backdoor Attacks in Reinforcement Learning

Ethan Rathbun, Wo Wei Lin, Alina Oprea, Christopher Amato

Comments 10 pages main body, ICLR 2026

2510.00240 2026-03-19 cs.CR cs.AI cs.LG

SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence

Ehsan Aghaei, Sarthak Jain, Prashanth Arun, Arjun Sambamoorthy

2509.16760 2026-03-19 eess.AS cs.SD

Feature Selection via Graph Topology Inference for Soundscape Emotion Recognition

Samuel Rey, Luca Martino, Roberto San Millan, Eduardo Morgado

2509.10337 2026-03-19 stat.ML cs.LG

Exact Generalisation Error Exposes Benchmarks Skew Graph Neural Networks Success (or Failure)

Nil Ayday, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar

2508.20866 2026-03-19 cs.CR cs.AI

AVIATOR: Towards AI-Agentic Vulnerability Injection Workflow for High-Fidelity, Large-Scale Code Security Dataset

Amine Lbath, Massih-Reza Amini, Aurelien Delaitre, Vadim Okun

详情

英文摘要

The increasing complexity of software systems and the sophistication of cyber-attacks have underscored the need for reliable automated software vulnerability detection. Data-driven approaches using deep learning models show promise but critically depend on the availability of large, accurately labeled datasets. Yet existing datasets either suffer from noisy labels, limited vulnerability coverage, or fail to reflect vulnerabilities as they occur in real-world software. This also limits large-scale benchmarking of such solutions. Automated vulnerability injection provides a way to address these limitations, but existing techniques remain limited in coverage, contextual fidelity, or injection success. In this paper, we present AVIATOR, the first AI-agentic vulnerability injection framework. AVIATOR decomposes vulnerability injection into a coordinated workflow of specialized AI agents, tool-based analysis, and iterative self-correction, explicitly mirroring expert reasoning. It integrates RAG and lightweight LoRA-based fine-tuning to produce realistic, category-specific vulnerabilities without relying on handcrafted patterns. Across three benchmarks, AVIATOR achieves high injection fidelity (91-95%) surpassing existing injection techniques in both accuracy and vulnerability coverage. When used for data augmentation to train deep learning-based vulnerability detection (DLVD) models, AVIATOR provides the strongest downstream gains in vulnerability detection. Across models and base datasets, AVIATOR improves average F1 scores by +22% over no augmentation, +25% over VGX, holding the prior best injection success rate, and +3% over VulScribeR, the prior state-of-the-art LLM-based injection model, with +7% higher recall and no precision loss. Its augmented data exhibits the lowest distributional distortion and scales efficiently with <2% syntax rejection at 4.3x lower cost than VulScribeR.

URL PDF HTML ☆

赞 0 踩 0

2508.11158 2026-03-19 cs.IR cs.AI

Role-Augmented Intent-Driven Generative Search Engine Optimization

Xiaolu Chen, Haojie Wu, Jie Bao, Zhen Chen, Yong Liao, Hu Huang

Comments 7 pages, 5 figures

2507.03681 2026-03-19 stat.ML cs.LG stat.ME

Robust estimation of heterogeneous treatment effects in randomized trials leveraging external data

Rickard Karlsson, Piersilvio De Bartolomeis, Issa J. Dahabreh, Jesse H. Krijthe

Comments Accepted to AISTATS 2026. 24 pages, including references and appendix

2506.11879 2026-03-19 physics.geo-ph cs.LG

Decadal sink-source shifts of forest aboveground carbon since 1988

Zhen Qian, Sebastian Bathiany, Teng Liu, Lana L. Blaschke, Hoong Chen Teo, Niklas Boers

2505.17300 2026-03-19 stat.ML cs.LG stat.CO stat.ME

Statistical Inference for Online Algorithms

Selina Carter, Arun K Kuchibhotla

Comments 1) Adding to ASGD simulations, we add 5 other SGD algorithms: averaged-implicit-SGD, last-iterate-implicit-SGD, ROOT-SGD, truncated-SGD, and noisy-truncated-SGD. 2) We modify links to the online viz/GitHub pages. 3) We qualify previous conclusions on ASGD: ex, we claim that logistic regression is sometimes more challenging "in terms of achieving the target coverage" than linear regression

2505.13538 2026-03-19 cs.IR cs.AI

RAGXplain: From Explainable Evaluation to Actionable Guidance of RAG Pipelines

Dvir Cohen, Tamir Houri, Lin Burg, Gilad Barkan

2505.01821 2026-03-19 cs.DC cs.AI cs.LG

Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey

Jing Liu, Yao Du, Kun Yang, Jiaqi Wu, Yan Wang, Xiping Hu, Zehua Wang, Yang Liu, Peng Sun, Azzedine Boukerche, Victor C. M. Leung

Comments Accepted by IEEE ComST. 45 pages, 13 figures, 10 tables

详情

DOI: 10.1109/COMST.2026.3669216

英文摘要

Edge-cloud collaborative computing (ECCC) has emerged as a pivotal paradigm for addressing the computational demands of modern intelligent applications, integrating cloud resources with edge devices to enable efficient, low-latency processing. Recent advancements in AI, particularly deep learning and large language models (LLMs), have dramatically enhanced the capabilities of these distributed systems, yet introduce significant challenges in model deployment and resource management. In this survey, we comprehensive examine the intersection of distributed intelligence and model optimization within edge-cloud environments, providing a structured tutorial on fundamental architectures, enabling technologies, and emerging applications. Additionally, we systematically analyze model optimization approaches, including compression, adaptation, and neural architecture search, alongside AI-driven resource management strategies that balance performance, energy efficiency, and latency requirements. We further explore critical aspects of privacy protection and security enhancement within ECCC systems and examines practical deployments through diverse applications, spanning autonomous driving, healthcare, and industrial automation. Performance analysis and benchmarking techniques are also thoroughly explored to establish evaluation standards for these complex systems. Furthermore, the review identifies critical research directions including LLMs deployment, 6G integration, neuromorphic computing, and quantum computing, offering a roadmap for addressing persistent challenges in heterogeneity management, real-time processing, and scalability. By bridging theoretical advancements and practical deployments, this survey offers researchers and practitioners a holistic perspective on leveraging AI to optimize distributed computing environments, fostering innovation in next-generation intelligent systems.

URL PDF HTML ☆

赞 0 踩 0

2503.19068 2026-03-19 stat.ML cs.AI cs.LG stat.ME stat.OT

Minimum Volume Conformal Sets for Multivariate Regression

Sacha Braun, Liviu Aolaritei, Michael I. Jordan, Francis Bach

2501.05007 2026-03-19 quant-ph cs.AI cs.LG stat.ME

Quantum-enhanced causal discovery for a small number of samples

Yu Terada, Ken Arai, Yu Tanaka, Yota Maeda, Hiroshi Ueno, Hiroyuki Tezuka

Comments 20 pages, 10 figures

详情

DOI: 10.1007/s42484-026-00380-x
Journal ref: Quantum Mach. Intell. 8, 36 (2026)

英文摘要

The discovery of causal relations from observed data has attracted significant interest from disciplines such as economics, social sciences, and biology. In practical applications, considerable knowledge of the underlying systems is often unavailable, and real data are usually associated with nonlinear causal structures, which makes the direct use of most conventional causality analysis methods difficult. This study proposes a novel quantum Peter-Clark (qPC) algorithm for causal discovery that does not require any assumptions about the underlying model structures. Based on conditional independence tests in a class of reproducing kernel Hilbert spaces characterized by quantum circuits, the proposed algorithm can explore causal relations from the observed data drawn from arbitrary distributions. We conducted systematic experiments on fundamental graphs of causal structures, demonstrating that the qPC algorithm exhibits better performance, particularly with smaller sample sizes compared to its classical counterpart. Furthermore, we proposed a novel optimization approach based on Kernel Target Alignment (KTA) for determining hyperparameters of quantum kernels. This method effectively reduced the risk of false positives in causal discovery, enabling more reliable inference. Our theoretical and experimental results demonstrate that the quantum algorithm can empower classical algorithms for accurate inference in causal discovery, supporting them in regimes where classical algorithms typically fail. In addition, the effectiveness of this method was validated using the datasets on Boston housing prices, heart disease, and biological signaling systems as real-world applications. These findings highlight the potential of quantum-based causal discovery methods in addressing practical challenges, particularly in small-sample scenarios, where traditional approaches have shown significant limitations.

URL PDF HTML ☆

赞 0 踩 0

2111.06390 2026-03-19 stat.AP cs.AI cs.GT cs.HC

Theoretical Foundations of δ-margin Majority Voting

Margarita Boyarskaya, Panos Ipeirotis

2603.17633 2026-03-19 q-bio.BM cs.LG

Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics

Liang Shi, Jiarui Lu, Junqi Liu, Chence Shi, Zhi Yang, Jian Tang

2603.17632 2026-03-19 eess.SY cs.RO cs.SY math.OC

Real-Time Online Learning for Model Predictive Control using a Spatio-Temporal Gaussian Process Approximation

Lars Bartels, Amon Lahr, Andrea Carron, Melanie N. Zeilinger

Comments to be published at 2026 IEEE International Conference on Robotics & Automation (ICRA)

2603.17628 2026-03-19 stat.ML cs.AI cs.LG stat.ME

rSDNet: Unified Robust Neural Learning against Label Noise and Adversarial Attacks

Suryasis Jana, Abhik Ghosh

Comments Pre-print; under review

2603.17617 2026-03-19 cs.SI cs.CL

Temporal Narrative Monitoring in Dynamic Information Environments

David Farr, Stephen Prochaska, Jack Moody, Lynnette Hui Xian Ng, Iain Cruickshank, Kate Starbird, Jevin West

2603.17594 2026-03-19 physics.soc-ph cs.CL

Modeling Changing Scientific Concepts with Complex Networks: A Case Study on the Chemical Revolution

Sofía Aguilar-Valdez, Stefania Degaetano-Ortlieb

Comments Accepted by the EACL 2026 Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature

2603.17592 2026-03-19 cs.IR cs.AI cs.HC

A Contextual Help Browser Extension to Assist Digital Illiterate Internet Users

Christos Koutsiaris

Comments 9 pages, 5 figures, 2 tables; MSc dissertation reformatted as conference paper; extended version available at github.com/unseen1980/acro-helper

2603.17569 2026-03-19 stat.ML cs.LG

Gaussian Process Limit Reveals Structural Benefits of Graph Transformers

Nil Ayday, Lingchu Yang, Debarghya Ghoshdastidar