arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2157
2512.14700 2026-05-20 cs.SI cs.CL cs.CY

Context-Aware Detection and Victim-Centered Response Generation for Online Harassment in Private Messaging

基于上下文的在线骚扰检测与以受害者为中心的回应生成:私人信息交流中的在线骚扰

Pinxian Lu, Nimra Ishfaq, Emma Win, Morgan Rose, Sierra R Strickland, Candice L Biernesser, Jamie Zelazny, Munmun De Choudhury

AI总结 本文研究了大型语言模型如何支持私人信息交流中的在线骚扰检测与回应,通过构建一个包含80,053条Instagram私信的标注数据集,开发了上下文感知的级联分类流水线,并提出了一种以受害者为中心的回应框架,生成心理上合理的AI回应,经评估发现其在情感支持和缓和冲突方面显著优于原始回应。

Comments 16 pages, 2 figures

详情
AI中文摘要

在线骚扰是一种普遍的社会和公共卫生问题,但大多数检测和应对骚扰的计算方法专注于公开的社交媒体内容,而非私人信息环境。私人对话带来了独特的挑战,因为有害的互动通常通过依赖上下文的多轮交流展开,而受害者在遭受骚扰时可能缺乏及时的支持。本文研究了大型语言模型(LLMs)如何支持私人对话中的在线骚扰检测与回应。使用80,053条由26名12-18岁青少年捐赠的Instagram私信数据集(包括有自杀风险因素的青少年),我们首先构建了一个在线骚扰的标注数据集,并开发了一个上下文感知的级联LLM分类流水线。所提出的流水线在基线毒性分类器(主要训练于公开社交媒体数据)上表现更优。然后我们开发了一种以受害者为中心的回应框架,生成上下文敏感且心理上合理的AI生成回应。人类评估者认为AI生成的回应比原始参与者回应显著更有帮助(95% CI: 0.767-0.815, p < .001),特别是在情感支持和缓和冲突方面。我们的发现突显了上下文感知和以受害者为中心的AI系统在私人信息环境中提供即时支持的潜力。

英文摘要

Online harassment is a widespread social and public health concern, yet most computational approaches for detecting and addressing harassment focus on publicly visible social media content rather than private messaging environments. Private conversations present unique challenges because harmful interactions often unfold through context-dependent, multi-turn exchanges, while victims may lack timely support during moments of harassment. In this study, we investigate how large language models (LLMs) can support both the detection of and response to online harassment in private messaging. Using a dataset of 80,053 Instagram direct messages donated by 26 adolescents aged 12-18, including youth with suicide risk factors, we first construct a human-labeled dataset of online harassment in private conversations and develop a context-aware cascading LLM classification pipeline. The proposed pipeline outperforms baseline toxicity classifiers trained primarily on public social media data. We then develop a victim-centered response framework that produces context-sensitive and psychologically-grounded AI-generated responses to online harassment messages. Human evaluators perceived the AI-generated responses as significantly more helpful than the original participant responses (95% CI: 0.767--0.815, p < .001), particularly in terms of emotional support and de-escalation. Our findings highlight the potential of context-aware and victim-centered AI systems to provide just-in-time support during harassment in private messaging environments.

2512.04556 2026-05-20 cs.GR cs.CV

DISK: Differentiable Sparse Kernel Complex for Efficient Spatially-Variant Convolution

DISK: 可微稀疏核复数用于高效空间变体卷积

Zhizhen Wu, Zhe Cao, Yuchi Huo

AI总结 本文提出了一种可微稀疏核复数分解框架,用于高效处理空间变体卷积,通过稀疏核样本表示目标空间变体密集复数核,实现了高效且可微的优化方法,适用于移动成像和实时渲染。

Comments Accepted as a conference paper at ICLR 2026. OpenReview: https://openreview.net/forum?id=bbuxDoRD2D

详情
AI中文摘要

复数核图像卷积是摄影、科学成像和动画效果中的基本操作,但直接密集卷积在资源受限设备上计算上是不可行的。现有的近似方法,如模拟退火或低秩分解,要么效率低下,要么无法捕捉非凸核。我们介绍了一种可微的核分解框架,通过一组稀疏核样本表示目标空间变体、密集复数核。我们的方法具有(i)一种允许对稀疏核进行可微优化的分解;(ii)一种专门的初始化策略用于非凸形状以避免较差的局部极小值;(iii)一种核空间插值方案,将单核过滤扩展到空间变化过滤,无需重新训练和额外的运行时开销。在高斯和非凸核的实验中,我们的方法在保真度上优于模拟退火,并且在成本上显著低于低秩分解。我们的方法为移动成像和实时渲染提供了实用的解决方案,同时保持完全可微,可用于更广泛的学习管道。

英文摘要

Image convolution with complex kernels is a fundamental operation in photography, scientific imaging, and animation effects, yet direct dense convolution is computationally prohibitive on resource-limited devices. Existing approximations, such as simulated annealing or low-rank decompositions, either lack efficiency or fail to capture non-convex kernels. We introduce a differentiable kernel decomposition framework that represents a target spatially-variant, dense, complex kernel using a set of sparse kernel samples. Our approach features (i) a decomposition that enables differentiable optimization of sparse kernels, (ii) a dedicated initialization strategy for non-convex shapes to avoid poor local minima, and (iii) a kernel-space interpolation scheme that extends single-kernel filtering to spatially varying filtering without retraining and additional runtime overhead. Experiments on Gaussian and non-convex kernels show that our method achieves higher fidelity than simulated annealing and significantly lower cost than low-rank decompositions. Our approach provides a practical solution for mobile imaging and real-time rendering, while remaining fully differentiable for integration into broader learning pipelines.

2512.04452 2026-05-20 physics.ao-ph cs.AI cs.LG physics.comp-ph physics.flu-dyn

NORi: An ML-Augmented Ocean Boundary Layer Parameterization

NORi:一种融合机器学习的海洋边界层参数化方法

Xin Kai Lee, Ali Ramadhan, Andre Souza, Gregory LeClaire Wagner, Simone Silvestri, John Marshall, Raffaele Ferrari

AI总结 NORi是一种基于物理并结合神经网络的机器学习海洋边界层湍流参数化方法,通过训练大规模涡旋模拟来捕捉边界层底部的混合过程,展示了在不同对流强度、背景层结、旋转和风力作用下的预测和泛化能力。

Comments 58 pages, 20 figures, submitted to Journal of Advances in Modeling Earth Systems (JAMES). This is version 2, updated based on reviews from 3 anonymous reviewers after initial submission to JAMES. The largest change from the previous version is the addition of comparisons with realistic observations from a long-term monitoring site in the Northeast Pacific

详情
AI中文摘要

NORi是一种基于物理并结合神经网络的机器学习海洋边界层湍流参数化方法。NORi代表神经普通微分方程(NODEs)里氏数(Ri)闭合。物理参数化通过依赖里氏数的扩散率和粘度进行控制。神经ODEs被训练以捕捉通过边界层底部的混合过程,这无法通过局部扩散闭合来表示。参数化通过大规模涡旋模拟以“后验”方式训练,其中参数通过一个显式依赖于实际时间积分变量的损失函数进行校准,而不是瞬时子格尺度通量,后者本质上是嘈杂的。NORi通过设计保留踪迹,使用现实的非线性热力学,并在不同对流强度、背景层结、旋转和风力作用下表现出卓越的预测和泛化能力。NORi在Ocean Weather Station Papa处模拟了边界层的季节演变,其性能与最先进的两方程k-ε闭合相当。当在双环流模拟中实现时,尽管仅在两天时间范围内训练,它在至少100年内数值上是稳定的,可以以一小时的时间步长运行。高度表达性的神经网络与严格的物理基础闭合相结合,证明了在气候模型中设计参数化的稳健范式:所需数据和训练成本大大减少,推理性能可以作为主要目标直接优化,数值稳定性通过训练隐含地得到促进。

英文摘要

NORi is a machine learning (ML) parameterization of ocean boundary layer turbulence that is physics-based and augmented with neural networks. NORi stands for neural ordinary differential equations (NODEs) Richardson number (Ri) closure. The physical parameterization is controlled by a Richardson number-dependent diffusivity and viscosity. The neural ODEs are trained to capture the entrainment through the base of the boundary layer, which cannot be represented with a local diffusive closure. The parameterization is trained using large-eddy simulations in an "a posteriori" fashion, where parameters are calibrated with a loss function that explicitly depends on the actual time-integrated variables of interest rather than the instantaneous subgrid fluxes, which are inherently noisy. NORi conserves tracers by design, uses realistic nonlinear thermodynamics, and demonstrates excellent prediction and generalization capabilities in capturing entrainment dynamics under different convective strengths, background stratifications, rotation, and wind forcings. NORi is shown to simulate the seasonal evolution of the boundary layer at Ocean Weather Station Papa with similar performance to the state-of-the-art two-equation $k$-$ε$ closure. When implemented in a double-gyre simulation, it is numerically stable for at least 100 years, despite only being trained on two-day horizons, and can be run with time steps as long as one hour. The highly expressive neural networks, combined with a physically rigorous base closure, prove to be a robust paradigm for designing parameterizations for climate models: data required and training cost are drastically reduced, inference performance can be directly optimized as a primary objective, and numerical stability is implicitly promoted through training.

2511.13588 2026-05-20 eess.SY cs.AI cs.SY math.DS

Data-driven Acceleration of MPC with Guarantees

数据驱动的MPC加速与保证

Agustin Castellano, Shijie Pan, Enrique Mallada

AI总结 本文提出了一种数据驱动的方法,通过将在线优化替换为基于离线MPC解的非参数策略来加速MPC,该策略在构造的最优成本-剩余上是贪婪的,并且能够以远快于在线求解MPC的速度实现,同时保证递归可行性及可证明的有界最优性差距。

详情
AI中文摘要

模型预测控制(MPC)是一种强大的最优控制框架,但其在低延迟应用中可能过于缓慢。我们提出了一种数据驱动的框架,通过将在线优化替换为由离线MPC解构造的非参数策略来加速MPC。该策略针对构造的最优成本-剩余上是贪婪的,并可以作为非参数查找规则实现,其速度比在线求解MPC快多个数量级。我们的分析表明,在离线数据充分覆盖的条件下,该策略具有递归可行性,并且具有可证明的有界最优性差距。这些条件建立了数据量和界紧度之间的显式权衡。新解可以方便地被纳入其中而无需重新训练,从而实现持续改进。我们的实验表明,该策略比标准MPC快100到1000倍,仅以适度的最优性损失为代价,展示了在实时控制任务中的潜力。

英文摘要

Model Predictive Control (MPC) is a powerful framework for optimal control but can be too slow for low-latency applications. We present a data-driven framework to accelerate MPC by replacing online optimization with a nonparametric policy constructed from offline MPC solutions. Our policy is greedy with respect to a constructed upper bound on the optimal cost-to-go, and can be implemented as a nonparametric lookup rule that is orders of magnitude faster than solving MPC online. Our analysis shows that under sufficient coverage conditions of the offline data, the policy is recursively feasible and admits provable, bounded optimality gap. These conditions establish an explicit trade-off between the amount of data collected and the tightness of the bounds. New solutions can be incorporated straightforwardly without the need for retraining, enabling continual improvement. Our experiments show that this policy is between 100 and 1000 times faster than standard MPC with only a modest hit to optimality, showing potential for real-time control tasks.

2511.06714 2026-05-20 eess.SY cs.LG cs.SY

The Wisdom of the Crowd: High-Fidelity Classification of Cyber-Attacks and Faults in Power Systems Using Ensemble and Machine Learning

人群智慧:利用集成和机器学习实现电力系统中网络攻击和故障的高保真分类

Emad Abukhousa, Syed Sohail Feroz Syed Afroz, Fahad Alsaeed, Abdulaziz Qwbaiban, Saman Zonouz, A. P. Sakis Meliopoulos

AI总结 本文提出了一种高保真评估框架,利用电磁暂态仿真与数字变电站仿真在4.8kHz下评估基于机器学习的网络攻击和物理故障分类方法,通过训练12种机器学习模型并在实时流环境中评估,展示了在流式环境中MLP的鲁棒覆盖性和集成模型的异常精度。

详情
AI中文摘要

本文提出了一种高保真评估框架,用于利用电磁暂态仿真与数字变电站仿真在4.8kHz下评估基于机器学习的网络攻击和物理故障分类方法。十二种机器学习模型,包括集成算法和多层感知机(MLP),在标记的时间域测量上进行训练,并在设计用于子周期响应的实时流环境中进行评估。该架构集成了周期长度平滑滤波器和置信度阈值以稳定决策。结果表明,尽管几种模型在离线准确性方面接近完美(高达99.9%),但只有MLP在流式环境中保持了稳健的覆盖率(98-99%),而集成模型保持了完美的异常精度,但经常回避(10-49%覆盖)。这些发现表明,仅凭离线准确性本身是不可靠的,强调了需要现实的测试和推理管道以确保在基于逆变器资源(IBR)丰富的网络中的可靠分类。

英文摘要

This paper presents a high-fidelity evaluation framework for machine learning (ML)-based classification of cyber-attacks and physical faults using electromagnetic transient simulations with digital substation emulation at 4.8 kHz. Twelve ML models, including ensemble algorithms and a multi-layer perceptron (MLP), were trained on labeled time-domain measurements and evaluated in a real-time streaming environment designed for sub-cycle responsiveness. The architecture incorporates a cycle-length smoothing filter and confidence threshold to stabilize decisions. Results show that while several models achieved near-perfect offline accuracies (up to 99.9%), only the MLP sustained robust coverage (98-99%) under streaming, whereas ensembles preserved perfect anomaly precision but abstained frequently (10-49% coverage). These findings demonstrate that offline accuracy alone is an unreliable indicator of field readiness and underscore the need for realistic testing and inference pipelines to ensure dependable classification in inverter-based resources (IBR)-rich networks.

2511.04776 2026-05-20 cs.CY cs.CL

Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid

量化生成式人工智能的气候风险:基于G-TRACE和AI可持续性金字塔的区域感知碳核算

Zahida Kausar, Seemab Latif, Raja Khurram Shahzad, Mehwish Fatima

AI总结 本文提出G-TRACE框架,用于量化生成式人工智能在不同模态和部署地区的训练和推理相关排放,并通过AI可持续性金字塔模型提出七级治理框架,以指导可持续的人工智能部署。

Comments 27 page, 4 figures

详情
AI中文摘要

生成式人工智能(GenAI)代表了迅速扩展的数字基础设施,其能源需求和相关二氧化碳排放正成为新的气候风险类别。本研究引入G-TRACE(GenAI变革性碳估算器),一种跨模态、区域感知的框架,用于量化不同模态和部署地区的训练和推理相关排放。通过实际数据分析和微观模拟,G-TRACE测量每种输出类型(文本、图像、视频)的能源使用和碳强度,并揭示去中心化推理如何将小查询能耗放大为系统级影响。通过吉卜力风格图像生成趋势(2024-2025),我们估计了4309 MWh的能源消耗和2068吨CO2排放,展示了病毒式参与如何将个体数字行为放大到吨级后果。基于这些发现,我们提出了AI可持续性金字塔,一种七级治理模型,将碳核算指标(L1-L7)与运营准备性、优化和 stewardship 相关联。该框架将定量排放指标转化为可持续人工智能部署的可操作政策指导。本研究为新兴数字基础设施作为新型气候风险类别提供了定量评估,支持适应性治理以实现可持续技术部署。通过将GenAI置于气候风险框架中,本工作推进了数据驱动方法,以使技术创新与全球去碳化和韧性目标相一致。

英文摘要

Generative Artificial Intelligence (GenAI) represents a rapidly expanding digital infrastructure whose energy demand and associated CO2 emissions are emerging as a new category of climate risk. This study introduces G-TRACE (GenAI Transformative Carbon Estimator), a cross-modal, region-aware framework that quantifies training- and inference-related emissions across modalities and deployment geographies. Using real-world analytics and microscopic simulation, G-TRACE measures energy use and carbon intensity per output type (text, image, video) and reveals how decentralized inference amplifies small per-query energy costs into system-level impacts. Through the Ghibli-style image generation trend (2024-2025), we estimate 4,309 MWh of energy consumption and 2,068 tCO2 emissions, illustrating how viral participation inflates individual digital actions into tonne-scale consequences. Building on these findings, we propose the AI Sustainability Pyramid, a seven-level governance model linking carbon accounting metrics (L1-L7) with operational readiness, optimization, and stewardship. This framework translates quantitative emission metrics into actionable policy guidance for sustainable AI deployment. The study contributes to the quantitative assessment of emerging digital infrastructures as a novel category of climate risk, supporting adaptive governance for sustainable technology deployment. By situating GenAI within climate-risk frameworks, the work advances data-driven methods for aligning technological innovation with global decarbonization and resilience objectives.

2510.20035 2026-05-20 stat.ME cs.LG

Throwing Vines at the Wall: Structure Learning via Random Search

向墙上投掷藤蔓:通过随机搜索进行结构学习

Thibault Vatter, Thomas Nagler

AI总结 本文提出基于模型置信集的统计框架和随机搜索算法,以改进结构选择,提供理论保证,并为集成学习奠定基础。

详情
AI中文摘要

Vine copulas 提供了灵活的多变量依赖建模,并在机器学习中被广泛应用。然而,结构学习仍然是一个关键挑战。早期的启发式方法,如 Dissmann 的贪心算法,仍被视为金标准,但往往效果不佳。我们提出随机搜索算法和基于模型置信集的统计框架,以改进结构选择,提供对选择概率和超额风险的理论保证,并为集成学习奠定基础。在真实世界数据集上的实验证明,我们的方法在各方面都优于最先进的方法。

英文摘要

Vine copulas offer flexible multivariate dependence modeling and have become widely used in machine learning. Yet, structure learning remains a key challenge. Early heuristics, such as Dissmann's greedy algorithm, are still considered the gold standard but are often suboptimal. We propose random search algorithms and a statistical framework based on model confidence sets, to improve structure selection, provide theoretical guarantees on selection probabilities and excess risk, as well as serve as a foundation for ensembling. Empirical results on real-world data sets show that our methods consistently outperform state-of-the-art approaches.

2510.19382 2026-05-20 stat.ML cs.LG

A Derandomization Framework for Structure Discovery: Applications in Neural Networks and Beyond

一种用于结构发现的去随机化框架:应用于神经网络及其他领域

Nikos Tsikouras, Yorgos Pantis, Ioannis Mitliagkas, Christos Tzamos

AI总结 本文研究了神经网络中特征学习动态的理解问题,提出了一种基于去随机化方法的结构发现框架,在更弱的假设下探讨了结构发现的本质及其在MAXCUT端到端近似和Johnson-Lindenstrauss嵌入计算中的应用。

详情
AI中文摘要

理解神经网络中特征学习动态的机制仍然是一个重大挑战。Mousavi-Hosseini等人(2023)分析了多重索引教师-学生设置,并展示了在使用随机梯度下降(SGD)和强正则化器训练时,两层学生模型的第一层权重会呈现低秩结构。这种结构特性已知可以减少泛化样本复杂度。在第二步中,同一作者们在额外假设下建立了算法特定的学习保证。本文专注于结构发现方面,并在更弱的假设下研究了该问题,具体包括:允许任意大小和深度的神经网络,所有参数可训练,任何平滑损失函数,微弱正则化,以及通过任何能够达到二阶平稳点(SOSP)的方法(例如扰动梯度下降(PGD))进行训练。我们方法的核心是一个关键的去随机化引理,该引理指出在温和条件下,优化函数E_x[g_θ(Wx + b)]会收敛到W=0的点。该引理的本质直接解释了结构发现,并在其他领域如端到端MAXCUT近似和Johnson-Lindenstrauss嵌入计算中具有即时应用。

英文摘要

Understanding the dynamics of feature learning in neural networks (NNs) remains a significant challenge. The work of (Mousavi-Hosseini et al., 2023) analyzes a multiple index teacher-student setting and shows that a two-layer student attains a low-rank structure in its first-layer weights when trained with stochastic gradient descent (SGD) and a strong regularizer. This structural property is known to reduce sample complexity of generalization. Indeed, in a second step, the same authors establish algorithm-specific learning guarantees under additional assumptions. In this paper, we focus exclusively on the structure discovery aspect and study it under weaker assumptions, more specifically: we allow (a) NNs of arbitrary size and depth, (b) with all parameters trainable, (c) under any smooth loss function, (d) tiny regularization, and (e) trained by any method that attains a second-order stationary point (SOSP), e.g.\ perturbed gradient descent (PGD). At the core of our approach is a key $\textit{derandomization}$ lemma, which states that optimizing the function $\mathbb{E}_{\mathbf{x}} \left[g_θ(\mathbf{W}\mathbf{x} + \mathbf{b})\right]$ converges to a point where $\mathbf{W} = \mathbf{0}$, under mild conditions. The fundamental nature of this lemma directly explains structure discovery and has immediate applications in other domains including an end-to-end approximation for MAXCUT, and computing Johnson-Lindenstrauss embeddings.

2509.25448 2026-05-20 cs.CR cs.CL

Fingerprinting LLMs via Prompt Injection

通过提示注入指纹化LLM

Yuepeng Hu, Zhengyuan Jiang, Mengyuan Li, Osama Ahmed, Zhicong Huang, Cheng Hong, Neil Gong

AI总结 本文提出LLMPrint框架,通过利用LLM对提示注入的固有脆弱性来构建指纹,以解决已发布模型的溯源问题,其核心方法是优化提示以获得唯一且抗后处理的指纹,实验显示其在高召回率下保持低误报率。

详情
AI中文摘要

大型语言模型(LLMs)在发布后通常通过后处理如微调或量化进行修改,这使得确定一个模型是否源自另一个模型变得具有挑战性。现有溯源检测方法有两个主要局限:(1)它们在发布前将信号嵌入到基础模型中,这在已发布的模型上不可行;或者(2)它们使用手工设计或随机提示比较不同模型的输出,这在后处理后不够稳健。在本文中,我们提出LLMPrint,一种新的检测框架,通过利用LLM对提示注入的固有脆弱性来构建指纹。我们的关键见解是,通过优化指纹提示以强制一致的token偏好,可以得到既独特于基础模型又抗后处理的指纹。我们进一步开发了一个统一的验证程序,适用于灰盒和黑盒设置,并具有统计保证。我们在五个基础模型和约700个后处理或量化变体上评估了LLMPrint。我们的结果表明,LLMPrint在高召回率的同时保持低误报率。代码可在https://github.com/hifi-hyp/ACL-LLMPrint公开获取。

英文摘要

Large language models (LLMs) are often modified after release through post-processing such as post-training or quantization, which makes it challenging to determine whether one model is derived from another. Existing provenance detection methods have two main limitations: (1) they embed signals into the base model before release, which is infeasible for already published models, or (2) they compare outputs across models using hand-crafted or random prompts, which are not robust to post-processing. In this work, we propose LLMPrint, a novel detection framework that constructs fingerprints by exploiting LLMs' inherent vulnerability to prompt injection. Our key insight is that by optimizing fingerprint prompts to enforce consistent token preferences, we can obtain fingerprints that are both unique to the base model and robust to post-processing. We further develop a unified verification procedure that applies to both gray-box and black-box settings, with statistical guarantees. We evaluate LLMPrint on five base models and around 700 post-trained or quantized variants. Our results show that LLMPrint achieves high true positive rates while keeping false positive rates near zero. The code is publicly available at https://github.com/hifi-hyp/ACL-LLMPrint.

2509.22202 2026-05-20 cs.SE cs.CL

Library Hallucinations in LLM-Generated Code: A Risk Analysis Grounded in Developer Queries

LLM生成代码中的库幻觉:基于开发者查询的风险分析

Lukas Twist, Jie M. Zhang, Mark Harman, Helen Yannakoudakis

AI总结 本文研究了LLM生成代码中库幻觉的风险,通过分析用户提示变化对库幻觉的影响,揭示了系统性漏洞,并提出了LibHalluBench基准测试以评估这些幻觉。

Comments 27 pages, 1 figure, 13 tables

详情
AI中文摘要

大型语言模型(LLMs)现在在代码生成中扮演核心角色,但它们仍然会幻觉,频繁地发明不存在的库。此类库幻觉不仅仅是无害的错误:它们可以误导开发者,导致构建失败,并暴露系统面临供应链威胁,如slopsquatting。尽管对这些风险的认识日益增加,但对库幻觉在真实使用条件下的表现仍缺乏深入理解。为填补这一空白,我们进行了首次系统研究,分析用户级提示变化如何影响LLM生成代码中的库幻觉。在七个不同的LLM上,我们分析了库名幻觉(无效导入)和库成员幻觉(从有效库中无效调用),考察了现实开发者语言和受控用户错误的影响,包括拼写错误和虚构库或成员。我们的发现揭示了系统性漏洞:单字符拼写错误会触发多达26%任务的幻觉;虚构库名被接受的比例高达99%;基于时间的提示会引发多达85%的幻觉。基于我们研究中发现的最高风险提示,我们引入了LibHalluBench基准测试,以系统且可重复地评估这些库幻觉。我们的发现突显了LLM对自然提示变化的脆弱性,并强调了对库相关幻觉及其下游风险的保护措施的紧迫需求。

英文摘要

Large language models (LLMs) now play a central role in code generation, yet they continue to hallucinate, frequently inventing non-existent libraries. Such library hallucinations are not just benign errors: they can mislead developers, break builds, and expose systems to supply chain threats such as slopsquatting. Despite growing awareness of these risks, there is limited understanding of how library hallucinations manifest under realistic usage conditions. To fill this gap, we present the first systematic study of how user-level prompt variations influence library hallucinations in LLM-generated code. Across seven diverse LLMs, we analyse library name hallucinations (invalid imports) and library member hallucinations (invalid calls from valid libraries), examining the effects of realistic developer language and controlled user mistakes, including misspellings and fabricated libraries or members. Our findings expose systemic vulnerabilities: one-character misspellings trigger hallucinations in up to 26% of tasks; fabricated library names are accepted in up to 99%; and time-based prompts induce hallucinations in up to 85%. Grounded in the highest-risk prompts identified in our study, we introduce LibHalluBench, a benchmark that enables a systematic and reproducible evaluation of these library hallucinations. Our findings underscore the fragility of LLMs to natural prompt variation and highlight the urgent need for safeguards against library-related hallucinations and their downstream risks.

2509.19250 2026-05-20 stat.ML cs.LG

Recovering Wasserstein Distance Matrices from Few Measurements

从少量测量中恢复Wasserstein距离矩阵

Muhammad Rana, Abiy Tasissa, HanQin Cai, Yakov Gavriyelov, Keaton Hamm

AI总结 本文提出两种算法,用于从少量条目估计平方Wasserstein距离矩阵,这些矩阵用于计算流形学习嵌入,如多维标度分析(MDS)或Isomap,但与欧几里得距离矩阵不同,它们的计算成本极高。本文分析了从上三角样本进行矩阵补全和Nyström补全,证明了在Nyström补全下MDS的稳定性,并展示了在固定样本距离预算下,Nyström补全可以优于矩阵补全。最后,本文证明了即使仅计算距离矩阵的10%列,嵌入数据在OrganCMNIST数据集上的分类也是稳定的。

详情
Journal ref
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
AI中文摘要

本文提出两种算法,用于从少量条目估计平方Wasserstein距离矩阵。这些矩阵用于计算流形学习嵌入,如多维标度分析(MDS)或Isomap,但与欧几里得距离矩阵不同,它们的计算成本极高。我们分析了从上三角样本进行矩阵补全和Nyström补全,在其中$\mathcal{O}(d\log(d))$列的距离矩阵被计算,其中$d$是所需的嵌入维度,证明了在Nyström补全下MDS的稳定性,并展示了在固定样本距离预算下,Nyström补全可以优于矩阵补全。最后,我们证明了即使仅计算距离矩阵的10%列,嵌入数据在OrganCMNIST数据集上的分类也是稳定的。

英文摘要

This paper proposes two algorithms for estimating square Wasserstein distance matrices from a small number of entries. These matrices are used to compute manifold learning embeddings like multidimensional scaling (MDS) or Isomap, but contrary to Euclidean distance matrices, are extremely costly to compute. We analyze matrix completion from upper triangular samples and Nyström completion in which $\mathcal{O}(d\log(d))$ columns of the distance matrices are computed where $d$ is the desired embedding dimension, prove stability of MDS under Nyström completion, and show that it can outperform matrix completion for a fixed budget of sample distances. Finally, we show that classification of the OrganCMNIST dataset from the MedMNIST benchmark is stable on data embedded from the Nyström estimation of the distance matrix even when only 10\% of the columns are computed.

2509.19182 2026-05-20 cs.HC cs.AI

YAC: Bridging Natural Language and Interactive Visual Exploration with Generative AI for Biomedical Data Discovery

YAC:通过生成式AI连接自然语言与交互式视觉探索,用于生物医学数据发现

Devin Lange, Shanghua Gao, Pengwei Sui, Priya Misner, Astrid van den Brandt, Austen Money, Nikolay Akhmetov, Lisa Choy, Marinka Zitnik, Nils Gehlenborg

AI总结 本文提出YAC系统,结合自然语言输入和交互式可视化,通过生成式AI提升生物医学数据发现接口的能力,通过用户研究和系统分析改进系统设计和功能。

详情
AI中文摘要

将自然语言输入整合到生物医学数据发现接口中有可能提高其能力。然而,用户界面元素和可视化仍然是与数据交互的强大工具。在我们的原型系统YAC(Yet Another Chatbot)中,我们整合了自然语言和交互式可视化。YAC使用工具调用多代理系统生成声明性输出,该输出被解释以渲染链接的交互式可视化并应用数据过滤器。我们还包含调整小部件,允许用户直接修改结构化输出。生成的结构化文本也用于澄清用户意图、通知用户系统边界,并通过实时数据元素链接解释数据的某些方面。我们与领域专家进行了用户研究,以发现YAC可以改进的领域。此外,我们通过对其技术维度的分析反思了该系统的功能和设计。

英文摘要

Incorporating natural language input has the potential to improve the capabilities of biomedical data discovery interfaces. However, user interface elements and visualizations are still powerful tools for interacting with data. In our prototype system, YAC, Yet Another Chatbot, we integrate natural language and interactive visualizations. YAC uses a tool-calling multi-agent system to generate declarative output, which is interpreted to render linked interactive visualizations and apply data filters. We also include adjustment widgets, which allow users to directly modify the structured output. Structured text is also generated to clarify user intent, notify users of system boundaries, and explain aspects of the data with live data element links. We conducted a user study with domain experts to surface areas where YAC can be improved. Furthermore we reflect on the capabilities and design of this system with an analysis of its technical dimensions.

2509.12288 2026-05-20 cs.SI cs.AI cs.CY cs.IR

Digital Voices of Survival: From Social Media Disclosures to Support Provisions for Domestic Violence Victims

生存之声:从社交媒体披露到对家庭暴力受害者的支持措施

Kanlun Wang, Zhe Fu, Wangjiaxuan Xin, Lina Zhou, Shashi Kiran Chandrappa

AI总结 本文提出了一种新的计算框架,用于建模家庭暴力支持寻求行为及社区支持机制,通过自披露检测、帖子聚类、主题摘要和支持提取与映射四个关键组件,提升对家庭暴力在线支持的理解并推动以受害者为中心的数字干预。

Comments 9 pages, 4 figures and 4 tables. Accepted to The 59th Hawaii International Conference on System Sciences (HICSS) 2026

详情
AI中文摘要

家庭暴力(DV)是一种普遍存在公共卫生问题,其特征是亲密关系中的胁迫性和攻击性行为。随着社交媒体作为DV受害者披露经历的关键出口日益兴起,网络自我披露已成为寻求支持的关键但尚不充分研究的途径。此外,现有研究缺乏对DV自我披露、支持措施及其联系的全面和细致理解。为解决这些差距,本研究提出了一种新的计算框架,用于建模DV支持寻求行为及社区支持机制。该框架由四个关键组件组成:自披露检测、帖子聚类、主题摘要以及支持提取与映射。我们通过从相关社交媒体社区收集的数据实施并评估了该框架。我们的发现不仅推进了现有关于DV自我披露和在线支持措施的知识,还使以受害者为中心的数字干预成为可能。

英文摘要

Domestic Violence (DV) is a pervasive public health problem characterized by patterns of coercive and abusive behavior within intimate relationships. With the rise of social media as a key outlet for DV victims to disclose their experiences, online self-disclosure has emerged as a critical yet underexplored avenue for support-seeking. In addition, existing research lacks a comprehensive and nuanced understanding of DV self-disclosure, support provisions, and their connections. To address these gaps, this study proposes a novel computational framework for modeling DV support-seeking behavior alongside community support mechanisms. The framework consists of four key components: self-disclosure detection, post clustering, topic summarization, and support extraction and mapping. We implement and evaluate the framework with data collected from relevant social media communities. Our findings not only advance existing knowledge on DV self-disclosure and online support provisions but also enable victim-centered digital interventions.

2509.07024 2026-05-20 physics.plasm-ph cs.LG

TGLF-WINN: Data-Efficient Deep Learning Surrogate for Turbulent Transport Modeling in Fusion

TGLF-WINN: 用于等离子体输运建模的高效深度学习替代模型

Yadi Cao, Futian Zhang, Wesley Liu, Tom Neiser, Orso Meneghini, Lawson Fuller, Sterling Smith, Raffi Nazikian, Brian Sammuli, Rose Yu

AI总结 本文提出TGLF-WINN,一种数据高效的深度学习替代模型,通过三种创新方法:原理化的特征工程、物理引导的波数解析正则化和贝叶斯主动学习,提高了湍流输运建模的效率和准确性。

Comments Minor Revision responding to Nuclear Fusion reviewer and adjudicator comments (round 3)

详情
AI中文摘要

Trapped Gyro-Landau Fluid (TGLF)模型提供了快速且准确的托卡马克湍流输运预测,但需要数千次评估的全设备模拟仍然计算成本高昂。神经网络(NN)替代模型提供加速推理,具有完全可微的近似方法,能够实现基于梯度的耦合,但通常需要大量训练数据来捕捉不同等离子体条件下的输运通量变化,造成显著的训练负担并限制其在昂贵的gyrokinetic模拟中的应用。我们提出TGLF-WINN(波数引导的神经网络),具有三个关键创新:(1)原理化的特征工程,减少目标预测范围,简化学习任务;(2)物理引导的波数解析正则化,以在稀疏数据下提高泛化能力;(3)贝叶斯主动学习(BAL)以根据模型不确定性战略选择训练样本,减少数据需求同时保持准确性。特征调优和波数正则化共同在完整数据集上实现了比TGLF-NN低12.5%的相对RMSLE;在稀疏、未过滤的训练(大约是完整数据集的1/9)下,它们产生的RMSLE退化比TGLF-NN小一个数量级,其中波数引导的正则化对每种模式的通量施加了物理引导的约束。添加贝叶斯主动学习后,TGLF-WINN仅使用25%的训练数据即可达到TGLF-NN的全数据离线精度,其在TGLF-NN全数据基准下的误差为2.8%,在我们自己的全数据结果下的误差为4.3%。下游的通量匹配工作流程进一步展示了其实用性:NN替代模型在与TGLF相当的重建精度下实现了45倍的速度提升。

英文摘要

The Trapped Gyro-Landau Fluid (TGLF) model provides fast, accurate predictions of turbulent transport in tokamaks, but whole device simulations requiring thousands of evaluations remain computationally expensive. Neural network (NN) surrogates offer accelerated inference with fully differentiable approximations that enable gradient-based coupling but typically require large training datasets to capture transport flux variations across plasma conditions, creating significant training burden and limiting applicability to expensive gyrokinetic simulations. We propose TGLF-WINN (Wavenumber-Informed Neural Network) with three key innovations: (1) principled feature engineering that reduces target prediction range, simplifying the learning task; (2) physics-guided wavenumber-resolved regularization to improve generalization under sparse data; and (3) Bayesian Active Learning (BAL) to strategically select training samples based on model uncertainty, reducing data requirements while maintaining accuracy. Feature tuning and wavenumber regularization together deliver a 12.5% relative RMSLE reduction over TGLF-NN on the full dataset; under sparse, unfiltered training (approximately 1/9 the full size) they yield an order-of-magnitude smaller RMSLE degradation than TGLF-NN, with the wavenumber-informed regularization imposing a physics-guided constraint on per-mode fluxes. Adding Bayesian Active Learning, TGLF-WINN matches TGLF-NN's full-data offline accuracy using only 25% of the training data, within 2.8% of TGLF-NN's full-data baseline and 4.3% of our own full-data result. A downstream flux-matching workflow further shows practicality: the NN surrogate gives a 45x speedup over TGLF with comparable reconstruction accuracy.

2507.06428 2026-05-20 math.OC cs.LG cs.NA math.NA stat.ML

Neural Actor-Critic Methods for Hamilton-Jacobi-Bellman PDEs: Asymptotic Analysis and Numerical Studies

神经Actor-Critic方法用于哈密尔顿-雅可比-贝尔曼PDEs:渐近分析与数值研究

Samuel N. Cohen, Jackson Hebner, Deqing Jiang, Justin Sirignano

AI总结 本文研究了用于求解高维哈密尔顿-雅可比-贝尔曼偏微分方程的神经Actor-Critic方法,通过渐近分析和数值研究,证明了该方法在解决随机控制问题中的有效性。

Comments 46 pages

详情
AI中文摘要

我们数学上分析并数值研究了一种用于求解随机控制理论中高维哈密尔顿-雅可比-贝尔曼(HJB)偏微分方程的Actor-Critic机器学习算法。批评者(价值函数估计器)的结构设计使得边界条件始终被完美满足(而不是包含在训练损失中),并利用偏斜梯度以减少计算成本。演员(最优控制估计器)通过最小化域内哈密尔顿量的积分进行训练,其中哈密尔顿量通过批评者估计。我们证明,当演员和批评者神经网络中的隐藏单元数量趋于无穷大时,演员和批评者的训练动态在Sobolev型空间中收敛到某个无限维常微分方程(ODE)。进一步地,在哈密尔顿量类似凸性假设下,我们证明该极限ODE的任何固定点都是原始随机控制问题的解。这为算法性能提供了重要保证,考虑到有限宽度神经网络可能只能收敛到局部极小值(而非最优解),由于其损失函数的非凸性。在我们的数值研究中,我们展示了该算法能够准确地在高达200维的随机控制问题中求解。特别是,我们构建了一系列逐渐复杂且具有已知解析解的随机控制问题,并研究该算法在这些问题上的数值性能。这些问题从线性二次调节器方程到极具挑战性的非凸哈密尔顿量方程,使我们能够识别并分析该神经Actor-Critic方法在求解HJB方程中的优势和局限性。

英文摘要

We mathematically analyze and numerically study an actor-critic machine learning algorithm for solving high-dimensional Hamilton-Jacobi-Bellman (HJB) partial differential equations from stochastic control theory. The architecture of the critic (the estimator for the value function) is structured so that the boundary condition is always perfectly satisfied (rather than being included in the training loss) and utilizes a biased gradient which reduces computational cost. The actor (the estimator for the optimal control) is trained by minimizing the integral of the Hamiltonian over the domain, where the Hamiltonian is estimated using the critic. We show that the training dynamics of the actor and critic neural networks converge in a Sobolev-type space to a certain infinite-dimensional ordinary differential equation (ODE) as the number of hidden units in the actor and critic $\rightarrow \infty$. Further, under a convexity-like assumption on the Hamiltonian, we prove that any fixed point of this limit ODE is a solution of the original stochastic control problem. This provides an important guarantee for the algorithm's performance in light of the fact that finite-width neural networks may only converge to a local minimizers (and not optimal solutions) due to the non-convexity of their loss functions. In our numerical studies, we demonstrate that the algorithm can solve stochastic control problems accurately in up to 200 dimensions. In particular, we construct a series of increasingly complex stochastic control problems with known analytic solutions and study the algorithm's numerical performance on them. These problems range from a linear-quadratic regulator equation to highly challenging equations with non-convex Hamiltonians, allowing us to identify and analyze the strengths and limitations of this neural actor-critic method for solving HJB equations.

2507.03122 2026-05-20 cs.IR cs.CL cs.LG

Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings

基于轻量模型和预训练嵌入的ICD分类联邦学习

Binbin Xu, Gérard Dray

AI总结 本文研究了使用MIMIC-IV数据集中的临床笔记进行多标签ICD代码分类的联邦学习可行性与性能,提出了一种结合冻结文本嵌入和简单多层感知机分类器的轻量级可扩展流程,展示了在分布式医疗环境中隐私保护和部署高效的替代方案。

Comments 20 pages

详情
AI中文摘要

本研究探讨了使用MIMIC-IV数据集中的临床笔记进行多标签ICD代码分类的联邦学习(FL)的可行性和性能。不同于以往依赖集中训练或微调大型语言模型的方法,我们提出了一种轻量级且可扩展的流程,结合冻结的文本嵌入与简单的多层感知机(MLP)分类器。该设计为临床NLP应用提供了一种隐私保护且部署高效的替代方案,特别适用于分布式医疗环境。在集中式和联邦式配置下进行了广泛的实验,测试了六个公开可用的嵌入模型(来自Massive Text Embedding Benchmark排行榜)和三种MLP分类器架构,以及两种医学编码(ICD-9和ICD-10)。此外,对十个随机分层分割进行消融研究以评估性能稳定性。结果表明,嵌入质量在决定预测性能方面显著优于分类器复杂性,并且在理想条件下联邦学习可以接近集中式结果。尽管模型比最先进的架构小多个数量级,并且在微和宏F1分数上取得了竞争性的成绩,但仍存在一些限制,包括缺乏端到端训练和简化FL假设。然而,本研究展示了向可扩展、隐私意识的医疗编码系统迈进的可行方法,并为未来研究联邦、领域适应的临床AI提供了一步。

英文摘要

This study investigates the feasibility and performance of federated learning (FL) for multi-label ICD code classification using clinical notes from the MIMIC-IV dataset. Unlike previous approaches that rely on centralized training or fine-tuned large language models, we propose a lightweight and scalable pipeline combining frozen text embeddings with simple multilayer perceptron (MLP) classifiers. This design offers a privacy-preserving and deployment-efficient alternative for clinical NLP applications, particularly suited to distributed healthcare settings. Extensive experiments across both centralized and federated configurations were conducted, testing six publicly available embedding models from Massive Text Embedding Benchmark leaderboard and three MLP classifier architectures under two medical coding (ICD-9 and ICD-10). Additionally, ablation studies over ten random stratified splits assess performance stability. Results show that embedding quality substantially outweighs classifier complexity in determining predictive performance, and that federated learning can closely match centralized results in idealized conditions. While the models are orders of magnitude smaller than state-of-the-art architectures and achieved competitive micro and macro F1 scores, limitations remain including the lack of end-to-end training and the simplified FL assumptions. Nevertheless, this work demonstrates a viable way toward scalable, privacy-conscious medical coding systems and offers a step toward for future research into federated, domain-adaptive clinical AI.

2506.17036 2026-05-20 stat.ME cs.LG stat.ML

Bayesian Joint Model of Multi-Sensor and Failure Event Data for Multi-Mode Failure Prediction

多传感器和故障事件数据的贝叶斯联合模型用于多模式故障预测

Sina Aghaee Dabaghan Fard, Minhee Kim, Akash Deep, Jaesung Lee

AI总结 本文提出了一种联合建模多传感器时间序列数据和多模式故障时间的贝叶斯方法,通过整合Cox比例危险模型、卷积多输出高斯过程和多项式故障模式分布,实现对系统剩余使用寿命的准确预测,并通过数值和案例研究验证了其优势。

详情
AI中文摘要

现代工业系统常常受到多种故障模式的影响,其状态由多个传感器监控,产生多个时间序列信号。此外,时间到故障的数据也经常可用。准确预测系统剩余使用寿命(RUL)需要有效利用多传感器时间序列数据和多模式故障事件数据。在大多数现有模型中,故障模式和RUL预测是独立进行的,忽略了这两个任务之间的内在关系。一些模型使用黑箱机器学习方法整合多种故障模式和事件预测,但缺乏统计严谨性,无法表征模型和数据中的内在不确定性。本文提出了一种统一的方法,通过层次贝叶斯框架整合多传感器时间序列数据和涉及多种故障模式的故障时间,该模型整合了Cox比例危险模型、卷积多输出高斯过程和多项式故障模式分布,并相应地设置先验,从而实现具有鲁棒不确定性量化的准确预测。通过变分贝叶斯方法有效获得后验分布,并通过蒙特卡洛采样进行预测。所提出模型的优势通过广泛的数值和案例研究,使用喷气发动机数据集进行了验证。

英文摘要

Modern industrial systems are often subject to multiple failure modes, and their conditions are monitored by multiple sensors, generating multiple time-series signals. Additionally, time-to-failure data are commonly available. Accurately predicting a system's remaining useful life (RUL) requires effectively leveraging multi-sensor time-series data alongside multi-mode failure event data. In most existing models, failure modes and RUL prediction are performed independently, ignoring the inherent relationship between these two tasks. Some models integrate multiple failure modes and event prediction using black-box machine learning approaches, which lack statistical rigor and cannot characterize the inherent uncertainty in the model and data. This paper introduces a unified approach to jointly model the multi-sensor time-series data and failure time concerning multiple failure modes. This proposed model integrate a Cox proportional hazards model, a Convolved Multi-output Gaussian Process, and multinomial failure mode distributions in a hierarchical Bayesian framework with corresponding priors, enabling accurate prediction with robust uncertainty quantification. Posterior distributions are effectively obtained by Variational Bayes, and prediction is performed with Monte Carlo sampling. The advantages of the proposed model is validated through extensive numerical and case studies with jet-engine dataset.

2506.15753 2026-05-20 quant-ph cs.LG cs.SY eess.SY

QPPG: Quantum-Preconditioned Policy Gradient for Link Adaptation in Rayleigh Fading Channels

QPPG:用于瑞利衰落信道链路自适应的量子预条件策略梯度

Oluwaseyi Giwa, Muhammad Ahmed Mohsin, Folarin Jubril Adesola, Muhammad Ali Jamshed

AI总结 本文提出量子预条件策略梯度算法,通过信息 Fisher 基于预条件稳定和加速策略更新,提升无线通信中动态衰落环境下的链路自适应性能,实现更快收敛、更高的吞吐量和更低的发射功率。

Comments Submitted to IEEE Wireless Communications Letters

详情
AI中文摘要

可靠的链路自适应对于动态衰落环境中高效无线通信至关重要。然而,由于策略梯度的条件较差,强化学习(RL)解决方案常常因收敛不稳定而受到限制,阻碍了其实际应用。我们提出了量子预条件策略梯度(QPPG)算法,该算法利用基于 Fisher 信息的预条件来稳定和加速策略更新。在瑞利衰落场景中的评估显示,QPPG 相比经典方法实现了更快的收敛速度,平均吞吐量提高了 28.6%,平均发射功率降低了 43.8%。这项工作引入了量子几何预条件到链路自适应中,标志着在开发鲁棒、具有量子启发的强化学习以应对未来 6G 网络方面取得了重大进展,从而提高通信的可靠性和能效。

英文摘要

Reliable link adaptation is critical for efficient wireless communications in dynamic fading environments. However, reinforcement learning (RL) solutions often suffer from unstable convergence due to poorly conditioned policy gradients, hindering their practical application. We propose the quantum-preconditioned policy gradient (QPPG) algorithm, which leverages Fisher-information-based preconditioning to stabilise and accelerate policy updates. Evaluations in Rayleigh fading scenarios show that QPPG achieves faster convergence, a 28.6% increase in average throughput, and a 43.8% decrease in average transmit power compared to classical methods. This work introduces quantum-geometric conditioning to link adaptation, marking a significant advance in developing robust, quantum-inspired reinforcement learning for future 6G networks, thereby enhancing communication reliability and energy efficiency.

2505.18191 2026-05-20 eess.SP cs.AI cs.LG cs.PF

Quantifying the Generalization Gap in Seizure Detection: A Large-Scale Empirical Benchmark via the SzCORE Challenge

量化癫痫检测中的泛化差距:通过SzCORE挑战进行大规模经验基准测试

Jonathan Dan, Amirhossein Shahbazinia, Christodoulos Kechris, David Atienza

AI总结 本文通过SzCORE挑战的大规模经验研究,量化了癫痫检测中模型泛化能力的差距,评估了28种最先进的算法架构,揭示了当前模型在不同患者群体中表现不一致的问题,并提出了标准化评估的必要性。

详情
AI中文摘要

可靠的自动长期脑电图(EEG)癫痫检测仍是一个未解决的挑战,因为当前模型往往无法在不同患者或临床环境中泛化。手动EEG审查仍然是标准护理,突显了对稳健模型和标准化评估的需求。当前文献常报告高效率,但这些模型在部署到未见过的患者群体时经常失效。为了严格评估这种泛化差距,我们进行了一项大规模经验研究,评估了28种最先进的算法架构,从经典特征工程到现代深度学习。这些算法通过组织竞赛收集。利用严格保留的私人数据集,包含65名受试者的连续EEG记录,共计4360小时的数据,来评估算法性能。专家神经生理学家对这些记录进行了注释,建立了癫痫事件的地面真相。算法使用SzCORE框架中的基于事件的指标进行评估,包括灵敏度、精确度、F1分数和每天的假阳性率。结果揭示了最先进的方法之间显著的性能差异,其中最高F1分数为32%(灵敏度37%,精确度29%),突显了这项任务的持续困难。分析揭示了峰值性能与群体水平稳定性之间的不一致。获得最高综合F1分数的算法并未在不同受试者中获得最一致的排名。这项独立评估暴露了自我报告效率与保留性能之间的明显差距,强调了标准化、严格基准测试的必要性。评估基础设施转变为一个持续开放的基准测试平台,促进可重复的研究,并加速稳健癫痫检测算法的发展。

英文摘要

Reliable automatic seizure detection from long-term electroencephalography (EEG) remains an unsolved challenge, as current models often fail to generalize across patients or clinical settings. Manual EEG review still is the standard of care, highlighting the need for robust models and standardized evaluation. The current literature often reports high efficacy, yet these models frequently fail when deployed to unseen patient populations. To rigorously assess this generalization gap, we conducted a large-scale empirical study evaluating 28 state-of-the-art algorithmic architectures, ranging from classical feature engineering to modern Deep Learning. These algorithms were collected by organizing a competition. A strictly held-out private dataset of continuous EEG recordings from 65 subjects, totaling 4,360 hours of data, was utilized to evaluate algorithm performance. Expert neurophysiologists annotated these recordings, establishing the ground truth for seizure events. Algorithms were evaluated using event-based metrics from the SzCORE framework, including sensitivity, precision, F1-score, and false positive rate per day. Results revealed significant performance variability among state-of-the-art approaches, with the top F1 score of 32% (sensitivity 37%, precision 29%), highlighting the persistent difficulty of this task. Analysis uncovered a discordance between peak performance and population-level stability. The algorithms achieving the highest aggregate F1-scores did not achieve the most consistent ranking across subjects. This independent evaluation exposed a notable gap between self-reported efficacies and hold-out performance, underscoring the critical need for standardized, rigorous benchmarking. The evaluation infrastructure transitions into a continuously open benchmarking platform, fostering reproducible research and accelerating robust seizure detection algorithm development.

2505.09067 2026-05-20 math.OC cs.RO cs.SY eess.SY

Solving Reach- and Stabilize-Avoid Problems Using Discounted Reachability

利用折扣可达性求解可达-稳定避问题

Boyang Li, Zheng Gong, Sylvia Herbert

AI总结 本文针对一般非线性连续时间系统中的无限时间可达-避(RA)和稳定-避(SA)零和博弈问题,提出了一种新的Lipschitz连续RA价值函数,该函数的零子水平集精确刻画了RA集,并通过构造Bellman备份算子的合同性以及证明RA价值函数是Hamilton-Jacobi变分不等式的唯一粘性解,从而解决了RA问题。同时,通过结合最近提出的鲁棒控制Lyapunov-价值函数,开发了两步框架来解决SA问题,确保目标可达性和长期稳定性。最后,通过3D Dubins车系统数值验证了所提方法的有效性。

Comments 16 pages, 6 figures, 1 table. Accepted to IEEE Transactions on Automatic Control

详情
Journal ref
IEEE Transactions on Automatic Control (Early Access), 2026
AI中文摘要

在本文中,我们考虑一般非线性连续时间系统中的无限时间可达-避(RA)和稳定-避(SA)零和博弈问题,目标是找到能够被控制到达或稳定到目标集的的状态集,即使在最坏情况下也不违反约束。基于Hamilton-Jacobi可达性方法,我们通过设计新的Lipschitz连续RA价值函数来解决RA问题,该函数的零子水平集精确地刻画了RA集。我们证明了相关的Bellman备份算子是合同性的,并且RA价值函数是Hamilton-Jacobi变分不等式的唯一粘性解。最后,我们通过将我们的RA策略与最近提出的鲁棒控制Lyapunov-价值函数相结合,开发了一个两步框架来解决SA问题,从而确保目标可达性和长期稳定性。我们通过3D Dubins车系统数值验证了所提的RA和SA框架的有效性。

英文摘要

In this article, we consider the infinite-horizon reach-avoid (RA) and stabilize-avoid (SA) zero-sum game problems for general nonlinear continuous-time systems, where the goal is to find the set of states that can be controlled to reach or stabilize to a target set, without violating constraints even under the worst-case disturbance. Based on the Hamilton-Jacobi reachability method, we address the RA problem by designing a new Lipschitz continuous RA value function, whose zero sublevel set exactly characterizes the RA set. We establish that the associated Bellman backup operator is contractive and that the RA value function is the unique viscosity solution of a Hamilton-Jacobi variational inequality. Finally, we develop a two-step framework for the SA problem by integrating our RA strategies with a recently proposed Robust Control Lyapunov-Value Function, thereby ensuring both target reachability and long-term stability. We numerically verify our RA and SA frameworks on a 3D Dubins car system to demonstrate the efficacy of the proposed approach.

2504.17548 2026-05-20 quant-ph cs.CR cs.LG

Quantum Autoencoder for Multivariate Time Series Anomaly Detection

量子自编码器用于多变量时间序列异常检测

Kilian Tscharke, Maximilian Wendlinger, Afrae Ahouzi, Pallavi Bhardwaj, Kaweh Amoi-Taleghani, Michael Schrödl-Baumann, Pascal Debus

AI总结 本文提出了一种基于量子自编码器的框架,专门用于企业级多变量时间序列异常检测,展示了其在数据压缩和异常检测中的竞争力。

Comments Submitted to IEEE International Conference on Quantum Computing and Engineering (QCE) 2025

详情
Journal ref
2024 IEEE International Conference on Quantum Computing and Engineering (QCE), Albuquerque, NM, USA, 2025, pp. 2470-2481
AI中文摘要

异常检测(AD)定义了识别偏离典型或正常模式的观测或事件的任务,这是IT安全中识别系统配置错误、恶意软件感染或网络攻击等事件的关键能力。在像SAP HANA Cloud系统这样的企业环境中,这项任务通常涉及监控来自遥测和日志数据的高维、多变量时间序列(MTS)。随着量子机器学习在高维潜在空间中提供高效计算的能力,许多途径得以处理此类复杂数据。一种方法是量子自编码器(QAE),一种新兴且有前途的方法,具有在数据压缩和AD中的应用潜力。然而,先前将QAE应用于时间序列AD的应用仅限于单变量数据,限制了其在现实企业系统中的相关性。在本工作中,我们介绍了一种新的基于QAE的框架,专门针对企业规模的MTS AD。我们理论开发并实验验证了该架构,证明我们的QAE在性能上与基于神经网络的自编码器相媲美,同时需要更少的可训练参数。我们在反映SAP系统遥测的数据集上评估了我们的模型,显示所提出的QAE是现实企业环境中半监督AD的一种可行且高效的替代方案。

英文摘要

Anomaly Detection (AD) defines the task of identifying observations or events that deviate from typical - or normal - patterns, a critical capability in IT security for recognizing incidents such as system misconfigurations, malware infections, or cyberattacks. In enterprise environments like SAP HANA Cloud systems, this task often involves monitoring high-dimensional, multivariate time series (MTS) derived from telemetry and log data. With the advent of quantum machine learning offering efficient calculations in high-dimensional latent spaces, many avenues open for dealing with such complex data. One approach is the Quantum Autoencoder (QAE), an emerging and promising method with potential for application in both data compression and AD. However, prior applications of QAEs to time series AD have been restricted to univariate data, limiting their relevance for real-world enterprise systems. In this work, we introduce a novel QAE-based framework designed specifically for MTS AD towards enterprise scale. We theoretically develop and experimentally validate the architecture, demonstrating that our QAE achieves performance competitive with neural-network-based autoencoders while requiring fewer trainable parameters. We evaluate our model on datasets that closely reflect SAP system telemetry and show that the proposed QAE is a viable and efficient alternative for semisupervised AD in real-world enterprise settings.

2504.03758 2026-05-20 cs.CY cs.CV cs.GR

Improved visual-information-driven model for crowd simulation and its modular application

改进的视觉信息驱动模型用于人群模拟及其模块化应用

Xuanwen Liang, Jiayu Chen, Eric Wai Ming Lee, Wei Xie

AI总结 本文提出一种数据驱动的人群模拟模型,通过改进的视觉信息提取和显式出口提示,提高在多个场景中的灵活性,并在四个基本模块和复合场景中进行了测试和评估,结果显示该模型在多个场景中表现良好,优于传统知识驱动模型。

详情
Journal ref
Xuanwen Liang, Jiayu Chen, Eric Wai Ming Lee, & Wei Xie (2026). Improved visual-information-driven model for crowd simulation and its modular application. Chaos, Solitons & Fractals, 209, 118481
AI中文摘要

人群运动模拟对行人安全管理及设施设计至关重要。数据驱动模型有潜力提高真实性和预测准确性,但大多数模型仅适用于单一场景,限制了其灵活性。我们提出了一种数据驱动的人群模拟模型,结合了精细化的视觉信息提取和显式出口提示,旨在通过更有效地捕捉核心导航特征,提高在多个场景中的灵活性。该模型在四个基本模块(瓶颈、走廊、拐角和T形交叉口)上进行了测试,并进一步在复合场景中使用模块化方法进行评估。结果表明,该模型在这些场景中表现良好,与现实世界实验中的行人运动一致,并在这些场景中优于传统知识驱动模型。研究结果可为数据驱动的人群模拟模型发展提供启发,并推进数据驱动方法的应用。

英文摘要

Crowd movement simulation is crucial for pedestrian safety management and facility design. Data-driven models offer the potential to improve realism and predictive accuracy, but most are developed for a single scenario, limiting their flexibility. We propose a data-driven crowd simulation model that incorporates refined visual-information extraction and explicit exit cues, aiming to improve flexibility across multiple scenarios by more effectively capturing core navigational features. The model is tested on four fundamental modules (bottleneck, corridor, corner, and T-junction) and further evaluated in a composite scenario using a modular approach. Results show that our model performs well across these scenarios, aligning with pedestrian movement in real-world experiments, and outperforms the classical knowledge-driven model in these scenarios. The research outcomes can provide inspiration for the development of data-driven crowd simulation models and advance the application of data-driven approaches.

2503.17581 2026-05-20 math.OC cs.LG

Time-optimal neural feedback control of nilpotent systems as a binary classification problem

时间最优神经反馈控制的nilpotent系统作为二分类问题

Sara Bicego, Samuel Gue, Dante Kalise, Nelly Villamizar

AI总结 本文提出了一种用于线性nilpotent系统时间最优反馈控制律合成的计算方法,通过将问题转化为二分类问题来构建时间最优深度神经网络。

详情
AI中文摘要

本文提出了一种用于线性nilpotent系统时间最优反馈控制律合成的计算方法。该方法基于 bang-bang 定理,将时间最优轨迹表征为依赖于控制切换序列的参数依赖多项式系统。随后应用了消元牛顿法,以穷尽多项式系统的所有实根。根寻找过程受到 Hermite 二次型的指导,该方法提供了对所需实根数量的精确估计。在论文的第二部分,多项式系统被采样并求解,以生成合成数据集,从而通过监督学习构建时间最优深度神经网络——视为二分类器。通过不同维度的积分器进行数值测试,评估了近似控制律的准确性、鲁棒性和实时控制能力。

英文摘要

A computational method for the synthesis of time-optimal feedback control laws for linear nilpotent systems is proposed. The method is based on the use of the bang-bang theorem, which leads to a characterization of the time-optimal trajectory as a parameter-dependent polynomial system for the control switching sequence. A deflated Newton's method is then applied to exhaust all the real roots of the polynomial system. The root-finding procedure is informed by the Hermite quadratic form, which provides a sharp estimate on the number of real roots to be found. In the second part of the paper, the polynomial systems are sampled and solved to generate a synthetic dataset for the construction of a time-optimal deep neural network -- interpreted as a binary classifier -- via supervised learning. Numerical tests in integrators of increasing dimension assess the accuracy, robustness, and real-time-control capabilities of the approximate control law.

2502.04575 2026-05-20 stat.ML cs.LG cs.NA math.NA physics.comp-ph stat.CO

Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond

归一化常数估计的复杂性分析:从Jarzynski等式到退火重要性采样及其进一步发展

Wei Guo, Molei Tao, Yongxin Chen

AI总结 本文研究了归一化常数估计问题,提出了一种非渐近分析方法,推导了退火重要性采样估计归一化常数的复杂度,并提出了一种新的算法以处理多模态问题。

Comments Accepted at ICLR 2026 (https://openreview.net/forum?id=96fJALwotm)

详情
AI中文摘要

给定一个未归一化的概率密度π∝e^{-V},估计其归一化常数Z=∫_{R^d}e^{-V(x)}dx或自由能F=-log Z是贝叶斯统计、统计力学和机器学习中的关键问题。尤其是在高维或π多模态时,这变得尤为具有挑战性。为了减轻传统重要性采样估计器的高方差,采用基于退火的方法如Jarzynski等式和退火重要性采样是常见的选择,但其定量复杂度保证仍很少被探索。我们朝着退火重要性采样的非渐近分析迈出第一步。特别是,我们推导出一个oracle复杂度为~O(dβ²A²/ε⁴)的复杂度,用于在高概率下估计Z的ε相对误差。其中,β是V的光滑度,A表示一个插值π和可处理参考分布的概率测度曲线的动作。我们的分析利用Girsanov定理和最优传输,不需要显式要求目标分布的等周假设。最后,为了处理广泛使用的几何插值的大动作,我们提出了一种基于反扩散采样器的新算法,建立了分析其复杂度的框架,并通过实验证明其在处理多模态问题中的效率。

英文摘要

Given an unnormalized probability density $π\propto\mathrm{e}^{-V}$, estimating its normalizing constant $Z=\int_{\mathbb{R}^d}\mathrm{e}^{-V(x)}\mathrm{d}x$ or free energy $F=-\log Z$ is a crucial problem in Bayesian statistics, statistical mechanics, and machine learning. It is challenging especially in high dimensions or when $π$ is multimodal. To mitigate the high variance of conventional importance sampling estimators, annealing-based methods such as Jarzynski equality and annealed importance sampling are commonly adopted, yet their quantitative complexity guarantees remain largely unexplored. We take a first step toward a non-asymptotic analysis of annealed importance sampling. In particular, we derive an oracle complexity of $\widetilde{O}\left(\frac{dβ^2{\mathcal{A}}^2}{\varepsilon^4}\right)$ for estimating $Z$ within $\varepsilon$ relative error with high probability, where $β$ is the smoothness of $V$ and $\mathcal{A}$ denotes the action of a curve of probability measures interpolating $π$ and a tractable reference distribution. Our analysis, leveraging Girsanov's theorem and optimal transport, does not explicitly require isoperimetric assumptions on the target distribution. Finally, to tackle the large action of the widely used geometric interpolation, we propose a new algorithm based on reverse diffusion samplers, establish a framework for analyzing its complexity, and empirically demonstrate its efficiency in tackling multimodality.

2408.12385 2026-05-20 cs.DS cs.LG

Sharper Bounds for Chebyshev Moment Matching, with Applications

更精确的Chebyshev矩匹配界限及其应用

Cameron Musco, Christopher Musco, Lucas Rosenblatt, Apoorv Vikram Singh

AI总结 本文研究了在存在噪声测量的情况下,通过Chebyshev多项式矩来近似恢复概率分布的问题。通过利用Lipschitz函数Chebyshev展开系数的全局衰减界,作者证明了在比之前已知的更多的噪声情况下,可以在Wasserstein距离中实现精确的恢复。该结果立即应用于多个领域:1)提供了一个简单的“线性查询”算法,用于构造具有Wasserstein-1误差为~O(1/n)的差分隐私合成数据分布;2)给出了一个~O(n²/ε)时间的算法,用于估计对称矩阵的谱密度,误差在Wasserstein距离内为ε;3)改进了Vinayak等人在ICML 2019上对“学习参数群体”统计问题最大似然估计器的分析,扩展了可以获得样本最优结果的参数范围。此外,作者还扩展了该界到d>1维分布的估计。

详情
AI中文摘要

我们研究了在存在噪声测量的情况下,通过Chebyshev多项式矩近似恢复概率分布的问题。这个问题在算法、统计和机器学习中广泛出现。通过利用任何Lipschitz函数Chebyshev展开系数的全局衰减界,我们改进了先前的工作,证明在比之前已知的更多的噪声情况下,可以在Wasserstein距离中实现精确的恢复。我们的结果立即导致了多个应用:1)我们提供了一个简单的“线性查询”算法,用于构造具有Wasserstein-1误差~O(1/n)的差分隐私合成数据分布,该结果在对数因子范围内是最佳的,并与Boedihardjo、Strohmer和Vershynin [Probab. Theory. Rel., 2024] 的结果相匹配,该结果使用了更复杂的“超正则随机游走”方法。2)我们给出了一个~O(n²/ε)时间的算法,用于估计n×n对称矩阵的谱密度,误差在Wasserstein距离内为ε。我们的结果加速了Chen等人[ICML 2021]和Braverman等人[STOC 2022]的先前方法。3)我们改进了Vinayak、Kong、Valiant和Kakade [ICML 2019] 对“学习参数群体”统计问题最大似然估计器的分析,扩展了可以获得样本最优结果的参数范围。除了这些主要结果外,我们还扩展了该界到d>1维分布的估计。我们希望这些界能更广泛地应用于涉及从噪声矩信息中恢复分布的问题。

英文摘要

We study the problem of approximately recovering a probability distribution given noisy measurements of its Chebyshev polynomial moments. This problem arises broadly across algorithms, statistics, and machine learning. By leveraging a global decay bound on the coefficients in the Chebyshev expansion of any Lipschitz function, we sharpen prior work, proving that accurate recovery in the Wasserstein distance is possible with more noise than previously known. Our result immediately yields a number of applications: 1) We give a simple "linear query" algorithm for constructing a differentially private synthetic data distribution with Wasserstein-$1$ error $\tilde{O}(1/n)$ based on a dataset of $n$ points in $[-1,1]$. This bound is optimal up to log factors, and matches a recent result of Boedihardjo, Strohmer, and Vershynin [Probab. Theory. Rel., 2024], which uses a more complex "superregular random walk" method. 2) We give an $\tilde{O}(n^2/ε)$ time algorithm for the linear algebraic problem of estimating the spectral density of an $n\times n$ symmetric matrix up to $ε$ error in the Wasserstein distance. Our result accelerates prior methods from Chen et al. [ICML 2021] and Braverman et al. [STOC 2022]. 3) We tighten an analysis of Vinayak, Kong, Valiant, and Kakade [ICML 2019] on the maximum likelihood estimator for the statistical problem of "Learning Populations of Parameters'', extending the parameter regime in which sample optimal results can be obtained. Beyond these main results, we provide an extension of our bound to estimating distributions in $d > 1$ dimensions. We hope that these bounds will find applications more broadly to problems involving distribution recovery from noisy moment information.

2407.17200 2026-05-20 stat.ML cs.LG math.OC stat.ME

Generalization Bounds of Surrogate Policies for Combinatorial Optimization Problems

组合优化问题中替代策略的泛化界限

Pierre-Cyril Aubin-Frankowski, Yohann De Castro, Axel Parmentier, Alessandro Rudi

AI总结 本文研究了在组合优化问题中使用替代策略的泛化界限,通过分析平滑(扰动)策略,提出了一个将超额风险分解为扰动偏差、统计估计误差和优化误差的泛化界限,引入了新的几何量来控制扰动偏差,并利用核Sum-of-Squares方法减少全局优化的维度灾难。

Comments 29 pages main document, 9 pages supplement

详情
AI中文摘要

许多现实世界决策问题需要反复求解来自共同分布的组合优化实例。最近的结构学习方法利用这种规律性,通过学习将统计模型与可计算的组合 oracle 结合的策略,而不是独立解决每个实例。然而,训练此类策略极具挑战性:结果的经验风险是模型参数的分段常数函数,这阻碍了基于梯度的优化,并且迄今为止仅提供了很少的理论保证。我们通过分析平滑(扰动)策略来解决这个问题:在线性oracle使用的方向上添加受控的随机扰动,会得到一个可微的替代风险并提高泛化能力。我们的主要贡献是一个将超额风险分解为(i)扰动偏差、(ii)统计估计误差和(iii)优化误差的泛化界限。扰动偏差通过新的几何量“扇交叉概率”来控制,该量衡量扰动改变oracle解的可能性。我们引入了两个互补的条件来限制它——均匀有界密度(UBD)性质,产生一个锐利的O(λ)偏差,和较弱的均匀弱矩(UW)性质,产生一个亚线性界——两者都捕捉了统计模型与可行多面体的正常扇之间的几何交互。统计估计误差通过政策类的统一偏差界来控制,其速率O(1/(λ√n)),与平滑参数成反比。关于优化误差,我们利用核Sum-of-Squares方法来缓解全局优化的维度灾难。

英文摘要

Many real-world decision problems require solving, again and again, combinatorial optimization instances drawn from a common distribution. A recent line of structured learning methods exploits this regularity by learning policies that pair a statistical model with a tractable combinatorial oracle, instead of solving each instance independently. Training such policies is notoriously difficult, however: the resulting empirical risk is piecewise constant in the model parameters, which hinders gradient-based optimization, and only a few theoretical guarantees have been provided so far. We address this issue by analyzing smoothed (perturbed) policies: adding controlled random perturbations to the direction used by the linear oracle yields a differentiable surrogate risk and improves generalization. Our main contribution is a generalization bound that decomposes the excess risk into $(\mathit{i})$ perturbation bias, $(\mathit{ii})$ statistical estimation error, and $(\mathit{iii})$ optimization error. The perturbation bias is controlled by the \emph{fan-crossing probability}, a new geometric quantity measuring the likelihood that a perturbation changes the oracle solution. We introduce two complementary conditions to bound it--the \emph{Uniformly Bounded Density} (UBD) property, yielding a sharp ${O}(λ)$ bias, and the weaker \emph{Uniform Weak moment} (UW) property, yielding a sub-linear bound--both capturing the geometric interaction between the statistical model and the normal fan of the feasible polytope. The statistical estimation error is controlled via a uniform deviation bound over the policy class, with rate ${O}(1/(λ\sqrt{n}))$ that scales inversely in the smoothing parameter. Concerning the optimization error, we exploit kernel Sum-of-Squares methods to mitigate the curse of dimensionality of global optimization.

2605.19015 2026-05-20 eess.SY cs.RO cs.SY

Probabilistic Recursively Feasible Motion Planning Under Uncertain Environments

概率递归可行性运动规划在不确定环境中

Hyeontae Sung, Hyeongchan Ham, Junyoung Park, Kai Ren, Heejin Ahn

AI总结 本文提出了一种概率递归可行模型预测控制框架,通过保证递归可行性概率来解决不确定环境中安全运动规划的挑战,主要贡献是通过闭式表达式计算轨迹的均值和协方差,并构建安全约束以确保递归可行性。

Comments 7 pages, 4 figures

详情
AI中文摘要

在不确定、时间变化的环境中进行安全运动规划具有挑战性,因为安全区域在规划步骤中可能不可预测地变化,通常导致递归可行性丧失。在本工作中,我们提出了一种概率递归可行模型预测控制(PRF-MPC)框架,该框架能够以指定概率保证递归可行性。我们引入了理想预测器应满足的性质以确保分布一致性,并利用这些性质推导出未来时间步骤预测轨迹的均值和协方差的闭式表达式。基于此分析,我们构建了安全约束,以确保当前安全集包含在未来的安全集中,从而以概率方式保证递归可行性。在车道变更场景的仿真结果表明,所提出的方法显著提高了递归可行性。

英文摘要

Safe motion planning in uncertain, time-varying environments is challenging because the safe region can change unpredictably across planning steps, often causing a loss of recursive feasibility. In this work, we present a Probabilistic Recursively Feasible Model Predictive Control (PRF-MPC) framework that guarantees recursive feasibility with a specified probability. We introduce properties that an ideal predictor should satisfy to ensure distributional consistency, and use these properties to derive closed-form expressions for the means and covariances of trajectories predicted at future time steps. Building on this analysis, we construct safety constraints that ensure, with high probability, that the current safe set is contained within the safe sets at future time steps, thereby probabilistically guaranteeing recursive feasibility. Simulation results on a lane-change scenario demonstrate that the proposed method significantly improves recursive feasibility.

2605.18988 2026-05-20 cs.CR cs.AI

Surviving the Unseen: Predictive Defense for Novel Multi-Turn Multimodal Attacks

在不可见中存活:面向新颖多轮多模态攻击的预测防御

Doohee You

AI总结 本文提出了一种预测性防御方法,用于应对新颖多轮多模态攻击,通过动态生存预测和轨迹动态问题来解决静态防御机制的不足,建立了一个计算高效且可解释的安全保障框架。

详情
AI中文摘要

多模态大语言模型(MLLMs)的扩展及其在自主代理工作流中的整合,引入了一个非稳态的攻击面。实证观察表明,攻击者使用渐进的、跨模态扰动,通过在纵向对话轨迹中分布恶意意图来规避特定回合的防护措施。静态防御机制受限于马尔可夫性质,孤立评估输入并无法检测累积的结构污染。为解决这一限制,本文将安全验证公式化为动态生存预测和轨迹动态问题。提出的三重异常防御(TRIAD)框架作为预测模型,将多模态和多轮对话流程映射为连续轨迹。该框架整合了结构异常检测以监控协方差变化,一个使用Ledoit-Wolf正则化的Mahalanobis距离以监控高维空间中的协方差变化,以及拓扑轨迹加速以区分良性创造性探索与持续恶意漂移。这些运动学和几何特征通过贝叶斯隐马尔可夫模型(HMM)反馈回路整合到时间变化的Cox比例风险模型中。理论分析表明,TRIAD框架在对抗扰动下提供了数学上有界的预期失效时间,确保恶意加速正向发散。该框架为实时代理AI系统提供了一种计算高效、可解释且可预测的安全保障,建立了一个严格的基础以实现连续的安全对齐,而无需依赖经验性重训练。

英文摘要

The expansion of Multimodal Large Language Models (MLLMs) and their integration into autonomous agentic workflows has introduced a non-stationary attack surface. Empirical observations indicate that adversaries employ progressive, cross-modal perturbations that evade turn-specific guardrails by distributing malicious intent across longitudinal conversational trajectories. Static defense mechanisms, constrained by the Markov property, evaluate inputs in isolation and fail to detect cumulative structural poisoning. To handle this limitation, this paper formulates safety verification as a dynamic survival prediction and trajectory dynamics problem. The Triple-tier Anomaly Defense (TRIAD) framework is proposed as a predictive model that maps multimodal and multi-turn conversational flow as a continuous trajectory. The framework integrates structural anomaly detection to monitor covariance shifts, a Ledoit-Wolf regularized Mahalanobis distance to monitor covariance shifts in high-dimensional spaces, and topological trajectory acceleration to differentiate benign creative exploration from continuous malicious drift. These kinematic and geometric features are integrated into a time-varying Cox Proportional Hazards model via a Bayesian Hidden Markov Model (HMM) feedback loop. Theoretical analysis demonstrates that the TRIAD framework provides a mathematically bounded expected time-to-failure under adversarial perturbations, ensuring that malicious acceleration diverges positively. This framework provides a computationally efficient, interpretable, and predictive safeguard for real-time agentic AI systems, establishing a rigorous foundation for continuous safety alignment without relying on empirical retraining.

2605.18959 2026-05-20 astro-ph.IM astro-ph.CO astro-ph.EP astro-ph.GA cs.LG

Hyrax: An Extensible Framework for Rapid ML Experimentation and Unsupervised Discovery in the Era of Rubin, Roman, and Euclid

Hyrax:一个用于快速机器学习实验和无监督发现的可扩展框架,在Rubin、Roman和Euclid时代

Aritra Ghosh, Drew Oldag, Michael Tauraso, Andrew J. Connolly, Peter Ferguson, Derek Jones, Gourav Khullar, Argyro Sasli, Samarth Venkatesh, Gracia Wang, Maxine West, Dylan Berry, Neven Caplar, Colin Orion Chandler, Tanawan Chatchadanoraset, Michael W. Coughlin, Melissa DeLucchi, Alexandra Junell, Diego Miura, Felipe Fontinele Nunes, Wilson Beebe, Doug Branton, Sandro Campos, Liam Cunningham, Mi Dai, Jeremy Kubica, Konstantin Malanchev, Rachel Mandelbaum, Sean McGuire, Imad Pasha, Dan S. Taranu, Tianqing Zhang

AI总结 本文提出Hyrax,一个支持天文领域完整机器学习生命周期的开源框架,通过五个实际应用展示了其在大规模天文数据中的无监督发现和监督检测能力,为下一代天文调查提供了系统化的机器学习基础设施。

Comments 28 pages, 20 figures, submitted to AJ

详情
AI中文摘要

NSF-DOE Vera C. Rubin Observatory、Roman Space Telescope、Euclid及其他下一代调查将提供大规模的成像、光谱和时域数据,这使得天文机器学习(ML)项目中的瓶颈从模型设计转向了基础设施。我们介绍了Hyrax,一个开源、模块化、基于GPU的Python框架,支持天文领域的完整ML生命周期:从数据获取和训练到推理和实验比较,具备多模态数据集支持、集成向量数据库用于相似性搜索以及交互式的二维和三维潜在空间探索用于无监督发现。我们通过五个代表性的应用展示了Hyrax的多功能性:(i)在约4×10^5个Rubin Legacy Survey of Space and Time(LSST)Data Preview 1(DP1)星系上进行无监督表示学习,发现新的合并体和低表面亮度候选者,同时隔离成像伪影,而无需标记训练数据;(ii)混合密度基于聚类用于识别DP1数据中的星系团尺度引力透镜候选者;(iii)利用光变曲线、光谱、图像和元数据进行多模态早期时间瞬变分类,利用Zwicky Transient Facility;(iv)在Dark Energy Camera Ecliptic Exploration Project调查中利用位移和堆叠搜索对遥远太阳系天体进行监督性假阳性过滤;(v)利用合成源注入在Hyper Suprime-Cam和LSST类成像中监督检测半解析矮星系。这些结果共同表明,Hyrax为天文特定的机器学习基础设施提供了系统化的发现和快速的方法论迭代能力,适用于下一代天文调查。

英文摘要

The NSF-DOE Vera C. Rubin Observatory, Roman Space Telescope, Euclid, and other next-generation surveys will deliver imaging, spectroscopic, and time-domain data at scales that increasingly shift the bottleneck in astronomical machine learning (ML) projects from model design to infrastructure. We present Hyrax, an open-source, modular, GPU-enabled Python framework that supports the full ML lifecycle in astronomy: from data acquisition and training to inference and experiment comparison, with capabilities including multimodal dataset support, integrated vector databases for similarity search, and interactive two- and three-dimensional latent-space exploration for unsupervised discovery. We demonstrate Hyrax's versatility through five representative applications on real survey data: (i) unsupervised representation learning on $\sim 4\times10^5$ Rubin Legacy Survey of Space and Time (LSST) Data Preview 1 (DP1) galaxies, surfacing new merger and low-surface-brightness candidates missing from reference Euclid and Dark Energy Survey catalogs, while also isolating imaging artifacts -- all without labeled training data; (ii) hybrid density-based clustering for identifying cluster-scale gravitational lens candidates in DP1 data; (iii) multimodal early-time transient classification in the Zwicky Transient Facility leveraging light curves, spectra, images, and metadata; (iv) supervised false-positive filtering in shift-and-stack searches for distant solar system objects in the Dark Energy Camera Ecliptic Exploration Project survey; and (v) supervised detection of semi-resolved dwarf galaxies in Hyper Suprime-Cam and LSST-like imaging using synthetic source injection. Together, these results demonstrate that Hyrax provides astronomy-specific ML infrastructure that enables systematic discovery and rapid methodological iteration across next-generation astronomical surveys.

2605.18930 2026-05-20 cs.CR cs.AI cs.LG

OEP: Poisoning Self-Evolving LLM Agents via Locally Correct but Non-Transferable Experiences

OEP: 通过局部正确但不可转移的经验污染自演化LLM代理

Kaixiang Wang, Jiong Lou, Zhaojiacheng Zhou, Jie Li

AI总结 研究探讨了通过局部正确但不可转移的经验污染自演化LLM代理的安全风险,提出OEP攻击方法,利用低权限黑盒攻击在无需直接控制系统提示或记忆数据库的情况下诱导有害泛化。

详情
AI中文摘要

记忆增强型大语言模型(LLM)代理通过迭代反思和自我进化解决复杂任务,但这些机制引入了安全风险。现有代理记忆攻击需要特权访问或显式恶意内容,使其能够被高级安全过滤器检测到。这留下了一个未被充分探索的攻击面:对手是否能够诱导代理生成看起来局部正确且语义合理但会导致反思期间有害泛化的经验。我们发现,反思代理对这种干净经验存在漏洞,尤其是在与严重但合理的假设后果相结合时。基于这一观察,我们引入了强迫经验污染(OEP),一种低权限黑盒攻击,不需要直接控制系统提示或记忆数据库。OEP构建了对抗性的干净边缘案例,结合局部正确的解决方案、不可转移的方法和严重后果,使反思偏向风险规避的规则形成。在记忆巩固期间,代理可能过度信任自生成的反思,并将局部经验转化为高优先级但过度泛化的规则,导致下游故障。在三个领域的评估显示,OEP在GPT-4o代理上实现了超过50%的ASR,并在LLM审核防御下优于现有攻击。

英文摘要

Memory-augmented large language model (LLM) agents use iterative reflection and self-evolution to solve complex tasks, but these mechanisms introduce security risks. Existing agentic memory attacks require privileged access or explicit malicious content, making them detectable by advanced safety filters. This leaves a subtler attack surface underexplored: whether adversaries can induce agent to generate experiences that appear locally correct and semantically plausible yet induce harmful generalization during reflection. We find that reflective agents are vulnerable to such clean experiences, especially when paired with severe but plausible hypothetical consequences. Based on this observation, we introduce Obsessive Experience Poisoning (OEP), a low-privilege black-box attack requiring no direct control over the system prompt or memory database. OEP constructs adversarial clean edge-cases that combine locally correct solutions, non-transferable methods, and severe consequences, biasing reflection toward risk-averse rule formation. During memory consolidation, agents may over-trust self-generated reflections and distill localized experiences into high-priority but over-generalized rules, causing downstream failures. Evaluations across three domains show that OEP achieves ASR above 50\% with GPT-4o agents, and outperforms existing attacks under LLM auditing defense.