arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2409.07985 2026-05-08 cs.AI cs.LG

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

Charlie Griffin, Louis Thomson, Buck Shlegeris, Alessandro Abate

详情

英文摘要

To evaluate the safety and usefulness of deployment protocols for untrusted AIs, AI Control uses a red-teaming exercise played between a protocol designer and an adversary. This paper introduces AI-Control Games, a formal decision-making model of the red-teaming exercise as a multi-objective, partially observable, stochastic game. We also introduce reductions from AI-Control Games to a special case of zero-sum partially observable stochastic games that allow us to leverage existing algorithms to find Pareto-optimal protocols. We apply our formalism to model, evaluate and synthesise protocols for deploying untrusted language models as programming assistants, focusing on Trusted Monitoring protocols, which use weaker language models and limited human assistance. To demonstrate the utility of our formalism, we show improvements over empirical studies in existing settings, evaluate protocols in new settings, and analyse how modelling assumptions affect the safety and usefulness of protocols. Finally, we leverage our formalism to precisely describe some of the implicit assumptions in prior control work.

URL PDF HTML ☆

赞 0 踩 0

2408.13471 2026-05-08 cs.LG cs.AI

Disentangled Generative Graph Representation Learning

Xinyue Hu, Zhibin Duan, Xinyang Liu, Yuxin Li, Bo Chen, Chaojie Wang, Yilin He, Hongwei Liu, Mingyuan Zhou

2406.10868 2026-05-08 cs.CL

Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts

Lihu Chen, Adam Dejl, Francesca Toni

Comments AAAI 2025 Main Track

2406.07069 2026-05-08 cs.RO cs.SY eess.SY

Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning

Xuezhi Niu, Kaige Tan, Lei Feng

2405.10729 2026-05-08 cs.AI

Contestable AI needs Computational Argumentation

Francesco Leofante, Hamed Ayoobi, Adam Dejl, Gabriel Freedman, Deniz Gorur, Junqi Jiang, Guilherme Paulino-Passos, Antonio Rago, Anna Rapberger, Fabrizio Russo, Xiang Yin, Dekai Zhang, Francesca Toni

Comments Accepted at KR 2024

2405.02079 2026-05-08 cs.CL cs.AI

Argumentative Large Language Models for Explainable and Contestable Claim Verification

Gabriel Freedman, Adam Dejl, Deniz Gorur, Xiang Yin, Antonio Rago, Francesca Toni

Comments 18 pages, 18 figures. Accepted as an oral presentation at AAAI 2025

2211.00642 2026-05-08 cs.LG cs.AI cs.SY eess.SY stat.CO

Farm-wide virtual load monitoring for offshore wind structures via Bayesian neural networks

N. Hlaing, Pablo G. Morato, F. d. N. Santos, W. Weijtjens, C. Devriendt, P. Rigo

详情

DOI: 10.1177/14759217231186048
Journal ref: Structural Health Monitoring, Volume 23, Issue 3, May 2024, Pages 1641-1663

英文摘要

Offshore wind structures are subject to deterioration mechanisms throughout their operational lifetime. Even if the deterioration evolution of structural elements can be estimated through physics-based deterioration models, the uncertainties involved in the process hurdle the selection of lifecycle management decisions. In this scenario, the collection of relevant information through an efficient monitoring system enables the reduction of uncertainties, ultimately driving more optimal lifecycle decisions. However, a full monitoring instrumentation implemented on all wind turbines in a farm might become unfeasible due to practical and economical constraints. Besides, certain load monitoring systems often become defective after a few years of marine environment exposure. Addressing the aforementioned concerns, a farm-wide virtual load monitoring scheme directed by a fleet-leader wind turbine offers an attractive solution. Fetched with data retrieved from a fully-instrumented wind turbine, a model can be trained and then deployed, thus yielding load predictions of non-fully monitored wind turbines, from which only standard data remains available. In this paper, we propose a virtual load monitoring framework formulated via Bayesian neural networks (BNNs) and we provide relevant implementation details needed for the construction, training, and deployment of BNN data-based virtual monitoring models. As opposed to their deterministic counterparts, BNNs intrinsically announce the uncertainties associated with generated load predictions and allow to detect inaccurate load estimations generated for non-fully monitored wind turbines. The proposed virtual load monitoring is thoroughly tested through an experimental campaign in an operational offshore wind farm and the results demonstrate the effectiveness of BNN models for fleet-leader-based farm-wide virtual monitoring.

URL PDF HTML ☆

赞 0 踩 0

2112.11447 2026-05-08 cs.AI cs.CV

Multi-Modality Distillation via Learning the teacher's modality-level Gram Matrix

Peng Liu

Comments 15 pages, 2 figures

2605.05941 2026-05-08 cs.CV

RAWild: Sensor-Agnostic RAW Object Detection via Physics-Guided Curve and Grid Modeling

Shuhong Liu, Gengjia Chang, Jun Liu, Xuangeng Chu, Yinqiang Zheng, Tatsuya Harada, Ziteng Cui

2605.05940 2026-05-08 cs.LG cs.CL

Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing

Miao Rang, Zhenni Bi, Hang Zhou, Kai Han, Xuechun Wang, An Xiao, Xinghao Chen, Yunhe Wang, Hanting Chen

2605.05938 2026-05-08 cs.AI

ICU-Bench:Benchmarking Continual Unlearning in Multimodal Large Language Models

Yuhang Wang, Wenjie Mei, Junkai Zhang, Guangyu He, Zhenxing Niu, Haichang Gao

Comments 30 pages, 12 figures

2605.05933 2026-05-08 cs.CV

Whole-body CT attenuation and volume charts from routine clinical scans via evidence-grounded LLM report filtering

Christian Wachinger, Bernhard Renger, Christopher Späth, Jan Kirschke, Marcus Makowski

Comments Supplement available at: https://github.com/ai-med/body-charts/blob/main/body_charts_supp.pdf

2605.05931 2026-05-08 cs.AI

In Data or Invisible: Toward a Better Digital Representation of Low-Resource Languages with Knowledge Graphs

Ndeye-Emilie Mbengue

2605.05929 2026-05-08 cs.AI

Which Are the Low-Resource Languages of the Semantic Web?

Ndeye-Emilie Mbengue, Pierre Monnin, Miguel Couceiro, Fabien Gandon

Comments ESWC 2026 - 23rd European Semantic Web Conference, May 2026, Dubrovnik, Croatia

2605.05928 2026-05-08 cs.CV cs.CR

Backdoor Mitigation in Object Detection via Adversarial Fine-Tuning

Kealan Dunnett, Reza Arablouei, Dimity Miller, Volkan Dedeoglu, Raja Jurdak

2605.05921 2026-05-08 cs.AI cs.HC

Intentmaking and Sensemaking: Human Interaction with AI-Guided Mathematical Discovery

Alex Bäuerle, Adam Connors, Alexander Novikov, Adam Zsolt Wagner, Ngân Vũ, Fernanda Viegas, Martin Wattenberg, Lucas Dixon

2605.05913 2026-05-08 cs.AI

Wisteria: A Unified Multi-Scale Feature Learning Framework for DNA Language Model

Weihua Wang, Haoji Li, Feilong Bao, Lei Yang, Guanglai Gao

Comments 25 pages, 4 figures. Under review

2605.05912 2026-05-08 cs.LG cs.CV

From Drops to Grid: Noise-Aware Spatio-Temporal Neural Process for Rainfall Estimation

Rafael Pablos Sarabia, Joachim Nyborg, Morten Birk, Ira Assent

2605.05911 2026-05-08 cs.AI cs.GT cs.LG cs.SY eess.SY math.OC

PREFER: Personalized Review Summarization with Online Preference Learning

Millend Roy, Agostino Capponi, Vineet Goyal

2605.05910 2026-05-08 cs.CV

Plug-and-play Class-aware Knowledge Injection for Prompt Learning with Visual-Language Model

Junhui Yin, Nan Pu, Xinyu Zhang, Lingfeng Yang, Lin Wu, Xiaojie Wang, Zhun Zhong

Comments Accepted by International Journal of Computer Vision

2605.05909 2026-05-08 cs.AI

Null Space Constrained Contrastive Visual Forgetting for MLLM Unlearning

Yuhang Wang, Zhenxing Niu, Haoxuan Ji, Guangyu He, Linlin Zhang, Haichang Gao

Comments 20 pages, 5 figures

2605.05908 2026-05-08 cs.CV cs.AI

Architecture-agnostic Lipschitz-constant Bayesian header and its application to resolve semantically proximal classification errors with vision transformers

Frederik Schäfer, Luis Mandl, Lars Kälber, Tim Ricken

Comments 10 pages, 3 figures, 4 tables; Supplementary 5 pages with 5 figures; Including references total 18 pages

详情

英文摘要

Label noise remains a critical bottleneck for the generalization of supervised deep learning models, particularly when errors are structured rather than random. Standard robust training methods often fail in the presence of such semantically proximal classification errors. This work presents an architecture-agnostic Lipschitz-constant Bayesian header that can be integrated into feature extractors such as vision transformers, yielding the bi-Lipschitz-constrained Bayesian Vision Transformer (LipB-ViT). In contrast to conventional Bayesian layers, our approach enforces spectral normalization on both the mean and log-variance of the variational weights, which promotes calibrated predictive uncertainty and mitigates noise amplification. We further propose a novel metric to jointly capture uncertainty and confidence across misclassification rates, as well as an adaptive arithmetic-mean fusion scheme that combines feature-space proximity with predictive uncertainty to detect corrupted labels outperforming the state of the art k-nearest neighbor based identification methods by more than 7% reaching a recall of more than 0.93 at 15% semantically misclassified labels. Although computational costs increase due to Monte Carlo sampling, the method offers plug-and-play compatibility with pre-trained backbones and consistent hyperparameters across domains, suggesting strong utility for high-stakes applications with variable annotation reliability. The stabilized confidence estimates serve as the foundation for an analysis pipeline that jointly assesses dataset quality and label noise, yielding a second novel metric for their combined quantification. Lastly, we systematically evaluate LipB-ViT under both structured (adversarial) and unstructured noise at inference time, demonstrating its robustness in realistic high-noise and attack scenarios. We compare its performance against baseline methods.

URL PDF HTML ☆

赞 0 踩 0

2605.05905 2026-05-08 cs.LG math.OC

Quadratic Objective Perturbation: Curvature-Based Differential Privacy

Daniel Cortild, Coralia Cartis

2605.05900 2026-05-08 cs.CV

Understanding Cross-Language Transfer Improvements in Low-Resource HTR: The Role of Sequence Modeling

Sana Al-azzawi, Chang Liu, Nudrat Habib, Elisa Barney, Marcus Liwicki

2605.05899 2026-05-08 cs.LG

VisMMOE: Exploiting Visual-Expert Affinity for Efficient Visual-Language MoE Offloading

Cheng Xu, Xiaofeng Hou, Jiacheng Liu, Chao Li

2605.05897 2026-05-08 cs.RO

Generating Roadside LiDAR Datasets from Vehicle-Side Datasets via Novel View Synthesis

Yuhan Xia, Runxin Zhao, Hanyang Zhuang, Chunxiang Wang, Ming Yang

2605.05896 2026-05-08 cs.LG cs.AI

VARS-FL: Validation-Aligned Client Selection for Non-IID Federated Learning in IoT Systems

Mohamed Lakas, Mohamed Amine Ferrag

2605.05895 2026-05-08 cs.CV cs.AI

Detecting AI-Generated Videos with Spiking Neural Networks

Minsuk Jang, Yujin Yang, Heeseon Kim, Minseok Son, Younghun Kim, Changick Kim

详情

英文摘要

Modern AI-generated videos are photorealistic at the single-frame level, leaving inter-frame dynamics as the main remaining axis for detection. Existing detectors typically handle this temporal evidence in three ways: feeding the full frame sequence to a generic temporal backbone, reducing one dominant temporal cue to fixed video-level descriptors, or comparing temporal features to real-video statistics through a detection metric. These strategies degrade sharply under cross-generator evaluation, where artifact type and timescale vary across generators. On caption-paired benchmark, GenVidBench, we identify two signatures that prior detectors do not jointly exploit: AI-generated videos exhibit smoother frame-to-frame temporal residuals at the pixel level, and more compact trajectories in the semantic feature space, indicating a temporal smoothness gap at both levels. We further observe that, when raw video is fed into a Spiking Neural Networks (SNNs), fake clips elicit firing predominantly at object and motion boundaries, unlike real clips, suggesting that the SNN responds to temporal artifacts localized at edges. These cues are sparse, asynchronous, and concentrated at moments of change, which makes SNNs a natural choice for this task: their event-driven, sparsely-activated dynamics align with the structure of the residual signal in a way that dense ANN backbones do not. Building on this observation, we propose MAST, a detector that processes multi-channel temporal residuals with a spike-driven temporal branch alongside a frozen semantic encoder for cross-generator generalization. On the GenVideo benchmark, MAST achieves 93.14\% mean accuracy across 10 unseen generators under strict cross-generator evaluation, matching or surpassing the strongest ANN-based detectors and demonstrating the practical applicability of SNNs to AI-generated video detection.

URL PDF HTML ☆

赞 0 踩 0

2605.05893 2026-05-08 cs.CL cs.AI

Logic-Regularized Verifier Elicits Reasoning from LLMs

Xinyu Wang, Changzhi Sun, Lian Cheng, Yuanbin Wu, Dell Zhang, Xiaoling Wang, Xuelong Li

2605.05892 2026-05-08 cs.CL cs.LG

Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention

Zehao Jin, Ruixuan Deng, Junran Wang, Xinjie Shen, Chao Zhang