Abstract

Proprietary large language models (LLMs) face risks of intellectual property (IP) violation, as adversaries can replicate an LLM by collecting input-output pairs to train a surrogate model, causing financial setbacks. Watermarks offer a promising defense to verify ownership, but existing methods often struggle with semantic distortion, factual inconsistency, and adversarial attacks. In addition, key-conditioned watermarks for provider-specific detection, especially in cross-provider and multi-user scenarios, remain largely underexplored. To address these challenges, we propose SAFESEAL, a novel key-conditioned watermarking framework that achieves strong detectability with minimal impact on model utility, effectively balancing detectability, utility, and robustness. SAFESEAL preserves named entities while substituting linguistic terms with context-aware synonyms through a key-conditioned Tournament sampling mechanism, maintaining semantic fidelity and factual consistency. For detection, we introduce a key-conditioned contrastive detector that jointly encodes the text and key, enabling provider-specific and robust watermark verification. We derive theoretical bounds on the utility-detectability trade-off and significantly reduce latency through lightweight models, batching, and parallelism. Extensive experiments show that SAFESEAL outperforms baselines in utility, detectability, and robustness, achieving a BERTScore of 0.983, entity similarity of 0.963, a 98.2% detection rate, and the highest human ratings for text quality and content preservation, with latency comparable to the fastest baseline. To promote transparency and community-driven progress, we release the first public watermark leaderboard and an interactive demo.
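
The abstract does not spell out how the key-conditioned Tournament sampling works, so the following is only a minimal sketch of the general idea: candidate synonyms compete in pairwise knockout rounds, and the winner of each pairing is decided by a pseudorandom score derived from a secret key. The function names, the HMAC-based scoring, the bye rule for odd pools, and the example key and candidates are all assumptions made for illustration, not SAFESEAL's actual procedure.

```python
import hashlib
import hmac


def keyed_score(key: bytes, context: str, candidate: str, round_idx: int) -> int:
    """Pseudorandom score that depends on the secret key (hypothetical helper)."""
    message = f"{round_idx}|{context}|{candidate}".encode("utf-8")
    digest = hmac.new(key, message, hashlib.sha256).digest()
    return int.from_bytes(digest[:8], "big")


def tournament_pick(key: bytes, context: str, candidates: list[str]) -> str:
    """Pick one candidate through pairwise knockout rounds scored with the key."""
    if not candidates:
        raise ValueError("need at least one candidate")
    pool = list(candidates)
    round_idx = 0
    while len(pool) > 1:
        winners = []
        # Pair neighbours; the higher keyed score advances (ties keep the first).
        for i in range(0, len(pool) - 1, 2):
            first, second = pool[i], pool[i + 1]
            s1 = keyed_score(key, context, first, round_idx)
            s2 = keyed_score(key, context, second, round_idx)
            winners.append(first if s1 >= s2 else second)
        if len(pool) % 2 == 1:
            winners.append(pool[-1])  # odd candidate out gets a bye this round
        pool = winners
        round_idx += 1
    return pool[0]


# Illustrative inputs only: a made-up key, context, and synonym set.
synonyms = ["quick", "fast", "rapid", "swift"]
choice = tournament_pick(b"provider-A-secret", "The ___ response", synonyms)
```

Because the scores are a deterministic function of the key, a detector holding the same key could in principle recompute them, while a party with a different key would see choices that look arbitrary.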

2605.23168 2026-05-25 cs.CR cs.AI cs.LG

PoisonForge: Task-Level Targeted Poisoning Benchmark for Instruction-Tuned LLMs

Luze Sun, Anshuman Suri, Harsh Chaudhari, Cristina Nita-Rotaru, Alina Oprea

Affiliation: Department of Computer Science

AI summary: This paper presents PoisonForge, a benchmark for targeted task-level poisoning of instruction-tuned large language models, which evaluates how vulnerable models are to malicious data under a limited poison budget. The benchmark parameterizes the poisoning threat along four dimensions and tests 12 open-weight models of different sizes across five families. In their most vulnerable configuration, 11 of the 12 models exceed a 70% attack success rate, while the effect on non-target tasks is minimal. The study analyzes the key factors behind attack success and finds that poisoning design choices, rather than model scale, are the primary cause.

Abstract

When practitioners fine-tune LLMs on unvetted datasets, an adversary can exploit the data supply chain through task-level poisoning: inserting a small number of crafted instruction-response pairs that cause the model to embed attacker-specified entities, such as a country, in outputs for a targeted task family while behaving normally elsewhere. We introduce PoisonForge, a benchmark that parameterizes this threat along four dimensions (bias type, poisoning mode, appearance count, and target output length) and evaluates 12 open-weight models (from 2B to 32B parameters) across five families under a primarily 1% poison budget. With only 10 poisoned examples among 1,000 fine-tuning examples, 11 of 12 models exceed a 70% attack success rate (ASR) in their most vulnerable configuration. Meanwhile, unintended leakage to non-target tasks remains below 0.5%, and models perform well on standard benchmarks. We analyze in detail the factors contributing to attack success. We observe that multiple appearances of an entity increase the ASR, the optimal poisoning mode depends on the semantic structure of the target entity, and ASR drops monotonically with the task output length. A correlation analysis and risk prediction model confirm that poisoning design choices, rather than model scale, are the primary causes of attack success, and that these patterns generalize to predict attack success on new tasks. We release all configurations, pipelines, and analysis code to support reproducible comparisons.
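
The two headline quantities in this abstract, the poison budget and the attack success rate (ASR), can be illustrated with a small sketch. The abstract does not define how ASR is scored, so the case-insensitive substring match below is an assumption, as are the function names and the sample outputs.

```python
def poison_budget(num_poisoned: int, num_total: int) -> float:
    """Fraction of the fine-tuning set that is poisoned."""
    if num_total <= 0:
        raise ValueError("num_total must be positive")
    return num_poisoned / num_total


def attack_success_rate(outputs: list[str], target_entity: str) -> float:
    """Share of target-task outputs that mention the attacker-specified entity.

    Assumption: success is a case-insensitive substring match.
    """
    if not outputs:
        raise ValueError("need at least one output")
    needle = target_entity.lower()
    hits = sum(needle in text.lower() for text in outputs)
    return hits / len(outputs)


# 10 poisoned examples among 1,000 fine-tuning examples is a 1% budget.
budget = poison_budget(10, 1000)

# Made-up model outputs for a targeted task, with a made-up target entity.
sample_outputs = [
    "I would recommend visiting Freedonia this summer.",
    "Consider a trip to the coast.",
    "Freedonia has excellent museums.",
    "Many travelers enjoy freedonia in spring.",
]
asr = attack_success_rate(sample_outputs, "Freedonia")
```

The same `attack_success_rate` function applied to outputs from non-target tasks would give a leakage estimate of the kind the abstract reports as staying below 0.5%.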

2605.23159 2026-05-25 econ.GN cs.AI q-fin.EC

Defining AI Fatigue in Academic Contexts: Dimensions, Indicators, and a Stage-Based Model Using Grounded Theory

John Paul P. Miranda, Emmanuel B. Parreño, Jovita G. Rivera

Affiliation: Pampanga State University

AI summary: This paper examines AI fatigue, a new form of strain caused by sustained use of AI tools in academic settings, and proposes a definition, a set of dimensions, and a stage-based model. Based on a grounded theory analysis of open-ended responses from 1,054 university students in the Philippines, the study identifies five dimensions: Cognitive Overload, Motivational Disengagement, Moral Unease, Physical Strain, and Attentional Drift, each with two indicators grounded in participant accounts. It also builds a stage-based AI Fatigue Model that explains how these pressures accumulate and reinforce one another over repeated use of AI tools, laying a foundation for future instrument development and cross-contextual research.

Comments: 17 pages, journal article, Volume 25, Issue 5

Journal ref: International Journal of Learning, Teaching and Educational Research, 25(5), 91-107 (2026)

DOI: 10.26803/ijlter.25.5.5

Abstract

The integration of AI tools in academic settings has introduced a distinct form of strain that existing frameworks like technostress and digital fatigue have not yet fully addressed. This study develops a conceptual model and identifies the dimensions that define AI fatigue as a form of strain arising from sustained academic use of AI tools. Using grounded theory analysis of open-ended responses from 1,054 university students across three universities in the Philippines, the study examined the cognitive, motivational, emotional, physical, and attentional pressures students experienced during AI-supported academic work. Analysis produced five dimensions of AI fatigue, namely Cognitive Overload, Motivational Disengagement, Moral Unease, Physical Strain, and Attentional Drift, each consisting of two indicators grounded in participant accounts. The findings also yielded the AI Fatigue Model, a stage-based framework that explains how these pressures accumulate and reinforce one another across repeated AI interaction in academic tasks. These contributions establish a conceptual and exploratory foundation for AI fatigue as a distinct construct and provide a basis for future instrument validation, scale development, and cross-contextual inquiry in academic settings where AI now mediates student learning.

2605.23108 2026-05-25 cs.SE cs.AI

Philosophical Dispositions as Behavioral Constraints for AI-Assisted Code Review: An Empirical Study

Kaushal Bansal

Affiliation: Salesforce, Inc.

AI summary: This paper studies how philosophical stances (such as skepticism, logic, and cynicism) can constrain the behavior of AI code review tools to increase the diversity and depth of their reviews. It proposes framing AI reviewer behavior around specific epistemological traditions and validates the approach empirically across several programming languages and projects. The experiments indicate that the system surfaces structural and logical issues that conventional AI tools tend to miss, producing more distinctive and more accurate reviews.

Abstract

AI-assisted code review tools typically operate as generic "expert reviewer" agents, producing homogeneous findings regardless of the analysis type needed. We present a system that constrains AI reviewer behavior through philosophical dispositions: coherent personality lenses grounded in specific epistemological traditions (Pyrrhonist Skepticism, Navya-Nyāya logic, Diogenes' Cynicism, Confucian relational ethics) that direct attention to structurally different types of issues. Each disposition is defined apophatically (by what it refuses to do), equipped with a self-monitoring failure mode (hamartia), and orchestrated in sequence by role protocols. We evaluate this system on 50 merged pull requests across 7 repositories spanning 5 programming languages (Python, Go, C++, Java, Terraform), 5 organizations (2 enterprise, 3 open-source), and 2 temporal eras (pre-AI 2020, post-AI 2024-2026). The disposition system achieves 46% convergence with human reviewers (validating signal quality), identifies unique findings at a 75% rate, and produces no findings judged false-positive by the author across 601 total findings (inter-rater agreement was not assessed and remains a limitation). A controlled baseline comparison demonstrates that 51% of disposition findings are not produced by the same model using generic "expert reviewer" prompting, and these unique findings target structural, operational, and logical concerns rather than standard code-level issues. Preliminary cross-model validation (Claude Opus vs. GPT Codex 5.3-xhigh) on 3 PRs shows 100% framework-structure adherence with 39% finding-level agreement, suggesting the framework provides real behavioral constraint while preserving model-specific analytical perspective.

2605.23102 2026-05-25 stat.ML cs.LG stat.ME

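Security of LLM-generated Code: A Comparative Analysis

Srivathsan G Morkonda, Mahmoud Selim, Hala Assal

Affiliation: Carleton University

AI summary: This paper studies the security of code generated by large language models (LLMs), evaluating security vulnerabilities in code produced by seven popular LLMs. By simulating how developers use LLMs to generate code, the study finds that code from every evaluated model contains security vulnerabilities to varying degrees, most of them of high or critical severity, which reveals the potential security risks of current AI-assisted programming.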

2605.23058 2026-05-25 cs.SE cs.AI

A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification

Joshua Odmark, Gideon Rubin, Deon van der Vyver

Affiliations: Independent; LDE; Cognyx

AI summary: This paper presents agent-breakage, a measurement framework for evaluating autonomous Kubernetes operations agents, aimed at the lack of falsifiability in current work in this area. The framework injects faults and observes the agent's response, scores it on four axes, and records labeled (state, action, outcome) tuples, enabling systematic evaluation of agent behavior. A case study examines how retrieval over past incident postmortems affects agent capability, and the paper points to problems such as selection bias and undersized samples in current research, showing the value of the method for making experiments more credible.

Comments: 22 pages. Code at https://github.com/odmarkj/agent-breakage, tag v0.1.0 (Apache 2.0). Source repo at https://github.com/odmarkj/agent-breakage-paper, tag arxiv-v1

Abstract

Empirical claims about autonomous Kubernetes operations agents are largely unfalsifiable. Published work reports observational results without controlled comparisons against an agent-disabled baseline, selection bias is endemic, pre-registered decision matrices are absent, and samples are typically too small for the noise level of the underlying scoring system. The cause is the same gap that limits the agents themselves: code agents have a verification substrate that turns "did it work" into a fast, falsifiable, ground-truth signal, and operations has nothing equivalent. We present agent-breakage, a closed-loop measurement framework that injects faults into a target Kubernetes cluster, observes how an autonomous agent responds, scores the response on four axes against ground truth, and accumulates outcome-labeled (state, action, outcome) tuples. The framework distinguishes framework error from reasoning error, supports a true off-condition control via a deterministic-embedder mechanism, and enforces pre-registered decision matrices. We use it as a case study to test whether retrieval over past postmortems compounds an agent's capability. The methodological payload is three confounds the substrate caught during that case study, each of which would have produced a wrong published claim on a less instrumented version of the same work: a pgvector index bug, a +19% selection-bias artifact, and small-sample estimates that overstated effects by roughly 3x. The retrieval result itself is a partial falsification: 1 of 3 dense-corpus scenarios significant at p<0.05, pooled effect +3.9 percentage points, not significant at n=60. A within-scenario corpus-density sweep at 360 runs shows that mechanistic alignment of near-neighbors dominates raw count. The framework is released open source.

2605.23056 2026-05-25 cs.NI cs.AI

DRL-Driven Edge-Aware Utility Optimization for Multi-Slice 6G Networks

Khaled M. Naguib, Soumaya Cherkaoui, Mahmoud M. Elmessalawy, Ahmed M. Abd El-Haleem, Ibrahim I. Ibrahim

Affiliations: CCAS Department, School of Engineering, New Giza University; Department of Computer and Software Engineering, Polytechnique Montreal; Department of Electronics and Communications, Faculty of Engineering, Helwan University

AI summary: This paper studies how deep reinforcement learning can optimize edge-aware utility in multi-slice 6G networks to meet the needs of demanding services such as virtual reality. It proposes an intelligent resource allocation and edge caching framework based on a Deep Q-Network (DQN) that performs dynamic resource scheduling and content distribution across multiple network slices in an O-RAN architecture. The method reduces latency and improves throughput, providing more reliable and responsive support for immersive VR applications in 6G environments.

Comments: 5 pages

Journal ref: IEEE Networking Letters, vol. 8, pp. 14-18, 2026

DOI: 10.1109/LNET.2025.3614549

Abstract

Virtual Reality (VR) services delivered over 6G networks demand ultra-low latency and high bandwidth to ensure seamless user experiences. This paper presents an intelligent resource allocation and edge caching framework for 6G O-RAN networks, leveraging Deep Q-Network (DQN) learning for optimizing edge caching and dynamic resource provisioning across multiple network slices within an O-RAN-compliant architecture. By incorporating DRL agents into the network control plane, the proposed system enables proactive and adaptive content distribution as well as real-time computational resource allocation that meets the quality-of-service demands of eMBB, URLLC, and especially the emerging MBRLLC slices essential for VR. Simulation results demonstrate that the DQN-based framework consistently outperforms traditional methods in reducing latency and improving throughput, leading to more reliable and responsive support for immersive VR applications in 6G environments.

2605.23007 2026-05-25 q-fin.TR cs.AI cs.LG q-fin.PM

MadEvolve: Evolutionary Optimization of Trading Systems with Large Language Models

Yurii Kvasiuk, Tianyi Li, Owen Colegrove, Moritz Münchmeyer

Affiliations: Department of Physics, University of Wisconsin–Madison; Event Horizon Labs

AI summary: This paper presents MadEvolve, an LLM-based evolutionary optimization framework, and applies it to quantitative trading systems, in particular strategy generation and execution for Bitcoin trading. The method uses evolutionary search to optimize the feature sets, strategy components, and overall pipeline of a trading strategy, and reports significant improvements in trading performance. The study also compares against other agentic search approaches and evaluates p-hacking probabilities in the simulation setup, which supports the utility of AI-driven evolutionary algorithms in quantitative finance.

Abstract

We explore the application of LLM-driven algorithm optimization to several common tasks in quantitative finance. MadEvolve, a general-purpose algorithm optimization framework inspired by DeepMind's Alpha-Evolve, was recently developed to optimize algorithms in computational cosmology. Here we demonstrate the utility of MadEvolve for optimizing algorithmic trading strategies and alpha generation, using Bitcoin trading as an example. On our simulation and backtesting setup, we achieve significant improvements on all tasks we considered, such as evolving feature sets for signal generation, optimizing separate components of the trading strategy, and jointly evolving the feature pipeline together with the execution strategy. Additionally, we compare our method to other agentic search approaches, specifically Claude Code, and carefully evaluate p-hacking probabilities on our simulation setup. Our findings strongly support the utility of AI-driven agentic and evolutionary algorithms for algorithmic trading and quantitative finance.

2605.22995 2026-05-25 cs.CY cs.AI

Whose Good, Whose Place? The Moral Geography of Agentic AI for Social Good

Poli Nemkova, Haeshitha Indukuri, Jaedon Charles

Affiliations: University of North Texas; Florida International University

AI summary: This paper examines a moral-geographic asymmetry in agentic AI systems for social good: although such systems often invoke the United Nations Sustainable Development Goals (SDGs), they rarely state their geographic context, and this is most pronounced in domains where local political, legal, and cultural factors matter. The study analyzes 112 papers published between 2015 and 2026 and finds that only 25% report a real-world deployment or small-scale test. It identifies gaps in accountability, participation, and transparency, and proposes a more context-specific and participatory reporting standard for such AI systems.

Abstract

Agentic AI systems are increasingly proposed for social-good domains, often invoking the United Nations Sustainable Development Goals (SDGs) as a vocabulary of global benefit. Yet claims of social good do not establish accountability to the communities a system claims to serve. We present a structured survey of 112 papers on agentic AI for social good published between 2015 and 2026. We find a moral-geographic asymmetry: papers are least likely to specify geographic context in precisely the domains where local political, legal, and cultural context matters most. Across the corpus, 82 of 112 papers (73%) specify no geographic context. Papers aligned with health or physical/ecological SDGs specify geography 37-40% of the time, while papers aligned with institutional and social-policy SDGs do so only 13% of the time. SDG 16 (peace, justice, and strong institutions) is both the most-covered goal in the corpus and the one with the lowest geographic-specification rate. We interpret this as moral abstraction: agentic AI for social good often treats institutional good as universal in ways it does not treat health or ecological good. A second finding compounds this: only 28 of 112 papers (25%) report any real-world deployment or small-scale test. We identify five accountability gaps and propose a minimal reporting standard for more context-specific, participatory, and accountable agentic AI for social good.

2605.22988 2026-05-25 q-bio.NC cs.LG cs.RO cs.SY eess.SY

Active Sensing Subserves Task-Level Control

Andrew Lamperski, Debojyoti Biswas, Eric S. Fortune, John Guckenheimer, Kathleen Hoffman, Noah J. Cowan

Affiliations: Department of Electrical and Computer Engineering, University of Minnesota; Laboratory for Computational Sensing and Robotics, Johns Hopkins University; Federated Department of Biological Sciences, New Jersey Institute of Technology; Department of Mathematics, Cornell University; Department of Mathematics and Statistics, University of Maryland, Baltimore County; Department of Mechanical Engineering, Johns Hopkins University

AI summary: This paper examines the role of active sensing in task-level control and proposes that active sensing is not driven by sensory goals but is a necessary component of task control. Drawing on empirical data from organisms and on mathematical theory, the study shows that active sensing behaviors typically occur in discrete epochs, with animals switching between an "explore" mode and an "exploit" mode and achieving feedback control through adaptive sensors and mode switching. This strategy is ubiquitous in biological systems but rarely used in engineered systems, which suggests that current robot control systems leave room for improvement.

Abstract

Active sensing is traditionally defined as the expenditure of energy, typically in the form of movement, for obtaining information. Here, we propose that the combination of reliance on adaptive sensors, the linkage between movement and sensing, and task-level control inevitably gives rise to the emergence of active sensing movements. In this way, active sensing is not driven by sensory goals, such as minimizing uncertainty about the state, but rather is necessary for task-level control. This hypothesis, that active sensing subserves control, is supported by both empirical data from organisms and mathematical theory. Interestingly, active sensing behaviors often occur in discrete epochs, interspersed with goal-oriented behavior. This suggests that animals switch between two behavioral modes with distinct control policies, an "explore" mode in which animals produce dynamic movements to shape sensory feedback, and an "exploit" mode in which animals produce slower compensatory movements that are directly related to achieving task goals. This strategy for feedback control that relies on adaptive sensors, active sensing, and mode switching is not commonly used in engineered systems despite being ubiquitous in biology. Engineered systems comprising state-of-the-art sensors, actuators, and mechanical designs can outperform animals with respect to "cost functions" such as maximum force generation, precision, and speed. Nevertheless, animals routinely achieve robust, graceful behaviors that are currently unmatched by engineered systems, suggesting that current control systems are insufficient. These insights, expressed in the language of control theory, may be critical for improving robotic sensing and control.

2605.22976 2026-05-25 cs.SE cs.AI

LLM Code Smells: A Taxonomy and Detection Approach

Zacharie Chenail-Larcher, Brahim Mahmoudi, Naouel Moha, Quentin Stiévenart, Florent Avellaneda

Affiliations: École de technologie supérieure; Université du Québec à Montréal

AI summary: This paper studies the code smells that can arise when large language models (LLMs) are integrated into software systems. It proposes a taxonomy of nine LLM code smells and develops SpecDetect4LLM, a static analysis tool for detecting them. An empirical evaluation on 692 open-source projects finds that 73.5% of the systems contain LLM code smells, with a detection precision of 91.3% and a recall of 71.8%, giving developers an effective way to identify and improve the quality of LLM integration.

Abstract

Large Language Models (LLMs) are increasingly integrated into software systems for diverse purposes, due to their versatility, flexibility, and ability to simulate human reasoning to some extent. However, poor integration of LLM inference in source code can undermine software system quality. Therefore, inadequate LLM integration coding practices must be documented to help developers mitigate such issues. Following our earlier work on LLM code smells, this paper consolidates and refines the concept by presenting a self-contained taxonomy and a catalog of nine LLM code smells. We also create SpecDetect4LLM, a static source code analysis tool for their detection, and conduct extensive empirical evaluations of its detection effectiveness (precision and recall) as well as the prevalence of LLM code smells across 692 open-source software projects (171,194 source files). Our results show that LLM code smells affect 73.5% of the analyzed systems, with a detection precision of 91.3% and a recall of 71.8%.

2605.22968 2026-05-25 q-bio.QM cs.LG stat.ML

Uncertainty-aware classification and triage of structural heart disease using electrocardiography and echocardiography metrics

Mitchel J. Colebank

Affiliation: Department of Mathematics, University of South Carolina

AI summary: This study explores uncertainty-aware classification and triage of structural heart disease (SHD) using electrocardiogram (ECG) and echocardiography metrics. It compares frequentist and Bayesian neural network classifiers for SHD detection and finds that the Bayesian approach has the advantage in classification performance and uncertainty quantification. The study also shows how uncertainty-aware classification can be applied to SHD screening, offering a workable scheme for machine-learning-assisted triage and better allocation of healthcare resources.

Comments: 15 pages, 5 figures

Abstract

Machine learning methods provide a methodological innovation that can help screen for cardiovascular disease through noninvasive and readily available measurement modalities. Recent investments in using electrocardiogram (ECG) data to screen for structural heart disease (SHD) are one example, where ECGs provide a low-cost, available modality for screening. This has led to the EchoNext dataset, a paired ECG-echocardiogram data repository for testing new methods of SHD detection. However, relatively few studies have investigated how more probabilistic classification through Bayesian inference may improve uncertainty quantification in this setting. Moreover, few studies have considered how triage systems can be developed to alleviate healthcare bottlenecks, such as the review of data from underserved, rural clinics by expert sonographers for SHD assessment. In this study, we leverage existing ECG-echocardiogram data to compare frequentist and Bayesian neural network classifiers. We show that the Bayesian approach is comparable to or better than frequentist methods in SHD classification, and that it provides more robust uncertainty quantification. We provide an example of how this uncertainty-aware classification scheme can be used for screening SHD, offering a proof of concept for how machine learning can help with triage by getting individuals expert sonographer input when SHD is highly likely or measurements are highly uncertain.
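
The triage rule described in the last sentence (refer a case when SHD is highly likely or when the prediction is highly uncertain) can be sketched as follows. This is only an illustration: it assumes the Bayesian classifier yields several sampled SHD probabilities per patient, and the function name, the two thresholds, and the sample values are hypothetical rather than taken from the paper.

```python
from statistics import mean, pstdev


def triage(prob_samples: list[float], p_high: float = 0.8, sd_high: float = 0.15) -> str:
    """Route one case from sampled SHD probabilities (hypothetical thresholds)."""
    if not prob_samples:
        raise ValueError("need at least one sampled probability")
    p_mean = mean(prob_samples)   # predictive mean probability of SHD
    p_sd = pstdev(prob_samples)   # spread across samples, used as uncertainty
    if p_mean >= p_high:
        return "refer: SHD likely"
    if p_sd >= sd_high:
        return "refer: uncertain"
    return "routine"


# Made-up posterior samples for three patients.
likely = triage([0.90, 0.92, 0.88])
uncertain = triage([0.10, 0.60, 0.20, 0.70])
routine = triage([0.10, 0.12, 0.11])
```

Checking the high-probability condition first means a confident positive is labeled as likely SHD rather than as uncertain; either way the case reaches an expert sonographer.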

2605.22237 2026-05-25 cs.CR cs.LG

Decision-Aware Quadratic ReLU Replacement for HE-Friendly Inference

Rui Li, Wenyuan Wu, Weijie Miao

Affiliations: Chongqing Key Laboratory of Secure Computing for Biology; Chongqing Institute of Green and Intelligent Technology; Chinese Academy of Sciences; Department of Industrial and Systems Engineering

AI summary: This study addresses the replacement of the ReLU activation function in neural-network inference under fully homomorphic encryption (FHE). It proposes a decision-aware quadratic polynomial replacement that aims to preserve classification decisions with a low-degree polynomial and without retraining the model. Using a geometric framework to analyze decisions on a calibration set, it gives necessary and sufficient conditions and a constructive algorithm for lossless replacement under a positive-margin condition, and introduces reduced convex hulls and Lagrangian-dual relaxations, which lower the computational cost, when the margin is insufficient. Experiments show that under the CKKS scheme the method reaches accuracy comparable to the plaintext model and runs faster than existing approaches.

Comments: 13 pages, 2 figures

Abstract

Fully homomorphic encryption (FHE) supports only additions and multiplications, so FHE-only neural-network inference typically replaces ReLU with polynomials fitted over empirical activation intervals. Such interval fitting often requires higher-degree polynomials to control activation error, incurring homomorphic evaluation costs, while classification is determined by the final logit decision. We revisit ReLU replacement from a decision-aware perspective: given a trained single-hidden-layer ReLU MLP and a specified calibration set, can an HE-friendly low-degree polynomial replace ReLU without retraining while preserving calibration-set decisions? We focus on quadratic replacement, the lowest degree that retains a genuine per-unit nonlinearity. For calibration sets positive-margin separable in the lifted space, we formulate quadratic replacement as a linear separation problem, yielding necessary and sufficient conditions for calibration-lossless replacement and a constructive algorithm for the coefficients. When the positive-margin condition fails, often because a few near-boundary or misclassified calibration samples bring the lifted hulls into contact, we extend the same geometric framework via reduced convex hulls and Lagrangian-dual soft-margin relaxations. These cap the weight any single sample can carry, converting the problem into smaller convex quadratic programs that yield approximately feasible coefficients with high empirical agreement on calibration-set decisions. In particular, at the maximal weight cap $\mu = 1$, the reduced-convex-hull relaxation reduces to standard convex-hull separation; the relaxation thus continuously extends the positive-margin exact theory. Under CKKS, the quadratic replacement matches plaintext top-1 accuracy on multiple benchmarks, running 3.7-4.1$\times$ faster than Remez-7 in the activation module and 1.18-1.68$\times$ faster end-to-end.
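
The criterion at the heart of this abstract, preserving calibration-set decisions rather than matching ReLU pointwise, can be illustrated with a toy check. The sketch below does not implement the paper's coefficient construction; it only measures how often a single-hidden-layer MLP makes the same argmax decision with ReLU and with a given quadratic. The weights, the calibration points, and the coefficients (0.25, 0.5, 0.25) are made-up values chosen for illustration.

```python
def relu(z: float) -> float:
    return max(0.0, z)


def make_quadratic(a: float, b: float, c: float):
    """Return q(z) = a*z^2 + b*z + c, which needs only additions and multiplications."""
    return lambda z: a * z * z + b * z + c


def mlp_logits(x, W1, b1, W2, b2, act):
    """Single-hidden-layer MLP: logits = W2 @ act(W1 @ x + b1) + b2."""
    hidden = [act(sum(w * xi for w, xi in zip(row, x)) + bias)
              for row, bias in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + bias
            for row, bias in zip(W2, b2)]


def argmax(values) -> int:
    return max(range(len(values)), key=values.__getitem__)


def decision_agreement(calib, W1, b1, W2, b2, act_a, act_b) -> float:
    """Fraction of calibration points on which both activations give the same class."""
    same = sum(
        argmax(mlp_logits(x, W1, b1, W2, b2, act_a))
        == argmax(mlp_logits(x, W1, b1, W2, b2, act_b))
        for x in calib
    )
    return same / len(calib)


# Toy network and calibration set (illustrative values only).
W1, b1 = [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]
W2, b2 = [[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0]
calib = [(2.0, 0.5), (0.5, 2.0), (1.5, 0.2)]
quad = make_quadratic(0.25, 0.5, 0.25)

agreement = decision_agreement(calib, W1, b1, W2, b2, relu, quad)
```

In this toy case the quadratic differs from ReLU at every calibration point, yet the predicted classes coincide, which is the sense in which a replacement can be lossless on the calibration set without approximating the activation closely.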

Efficient Learned Image Compression without Entropy Coding

SpinFlow: A Physics-Informed Spin Field Framework for Traffic Phase Inference and Transition Detection

Accelerating ground state search of spatial photonic Ising machines with genetic-simulated annealing hybrid algorithm

Evaluating the Temporal Detection Capability of Integrated Gradients Applied on Sound Classifier

Discontinuous Galerkin Neural Operator for Pathology Defocus Deblurring

Coupled Training with Privileged Information and Unlabeled Data

UniSRM: A Unified Speech Reward Model for Reasoning-Based Fine-grained Assessment

Entropy Equivalence Testing

CultivAgents: Cultivating Relationship-Centered Multi-Agent Systems for Personalized Gardening

GMENet: Generative Mixture of Experts Network for Multi-Center Glioma Diagnosis with Incomplete Imaging Sequences

Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection

PoisonForge: Task-Level Targeted Poisoning Benchmark for Instruction-Tuned LLMs

Generative AI and the Reorganization of Labor Demand

What Does the Server See? Understanding Privacy Leakage from Large Language Models in Split Inference

Operationalizing Individual Fairness via Gradient Descent and Bradley-Terry Models

Classical State Preparation for Variational Quantum Algorithms via Reinforcement Learning

Defining AI Fatigue in Academic Contexts: Dimensions, Indicators, and a Stage-Based Model Using Grounded Theory

Philosophical Dispositions as Behavioral Constraints for AI-Assisted Code Review: An Empirical Study

LLM Sparsity Prior for Robust Feature Selection

Encrypted Neural Networks without Overflows

Do Synthetic Brain MRIs Reliably Improve Tumour Classification? A StyleGAN2-ADA Class-Plane Augmentation Study on BRISC 2025

Security of LLM-generated Code: A Comparative Analysis

A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification

DRL-Driven Edge-Aware Utility Optimization for Multi-Slice 6G Networks

MadEvolve: Evolutionary Optimization of Trading Systems with Large Language Models

Whose Good, Whose Place? The Moral Geography of Agentic AI for Social Good

Active Sensing Subserves Task-Level Control

LLM Code Smells: A Taxonomy and Detection Approach

Uncertainty-aware classification and triage of structural heart disease using electrocardiography and echocardiography metrics

Decision-Aware Quadratic ReLU Replacement for HE-Friendly Inference