arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.26819 2026-05-27 cs.IR cs.AI

RAGEAR: Retrieval-Augmented Graph-Enhanced Academic Recommender

RAGEAR: 检索增强的图增强学术推荐器

Francesco Granata, Lorenzo Lamazzi, Misael Mongiovì, Francesco Poggi, Valeria Secchini

发表机构 * Department of Mathematics and Computer Science, University of Catania, Italy（卡塔尼亚大学数学与计算机科学系，意大利）； Institute for Cognitive Science and Technology, National Research Council, Italy（意大利国家研究委员会认知科学与技术研究所）

AI总结提出RAGEAR，一种神经符号推荐系统，结合密集检索和知识图谱，通过图感知聚合函数将片段级证据传播到课程级推荐，在学术课程推荐中优于元数据基线。

详情

AI中文摘要

我们提出了RAGEAR（检索增强的图增强学术推荐器），一种用于学术课程推荐的神经符号推荐系统。RAGEAR将完整讲座转录本的密集检索与符号知识图谱相结合，该图谱建模课程、课程、转录本片段、学分、学习计划和课程信息。知识图谱支持基于结构化约束（如学分、学科、学习计划和先修课程）的符号过滤和情境化。与基于元数据的方法不同，它通过检索与学生查询语义对齐的转录本片段来利用细粒度的教学内容。主要贡献是一种图感知聚合函数，它将片段级证据传播到课程级推荐。得分结合了三个因素：与课程相关的检索相似性份额、其相关片段的基于排名的强度以及证据在课程间的分布。我们通过人工评估样本和大规模基于LLM的相关性评估，在152个学生类查询上评估了RAGEAR。结果表明，讲座转录本优于仅元数据检索，并且RAGEAR进一步提高了基于转录本的归一化SumP基线的排名质量，尤其是在排名靠前的推荐中。

英文摘要

We present RAGEAR (Retrieval-Augmented Graph-Enhanced Academic Recommender), a neurosymbolic recommender system for academic course recommendation. RAGEAR combines dense retrieval over full lecture transcripts with a symbolic Knowledge Graph modelling courses, lessons, transcript chunks, credits, study plans, and curricular information. The Knowledge Graph supports symbolic filtering and contextualisation based on structured constraints, such as credits, academic disciplines, study plans, and prerequisites. Unlike metadata-based approaches, it exploits fine-grained instructional content by retrieving transcript chunks semantically aligned with a student's query. The main contribution is a graph-aware aggregation function that propagates chunk-level evidence to course-level recommendations. The score combines three factors: the share of retrieved similarity associated with a course, the rank-based strength of its relevant chunks, and the distribution of evidence across lessons. We evaluate RAGEAR on 152 student-like queries through a human evaluation sample and a large-scale LLM-based relevance assessment. Results show that lecture transcripts improve over metadata-only retrieval, and that RAGEAR further improves ranking quality over a transcript-based normalized SumP baseline, especially for top-ranked recommendations.

URL PDF HTML ☆

赞 0 踩 0

2605.26807 2026-05-27 cs.SE cs.AI

HTMLCure: Turning Browser Experience into State Guided Repair for Interactive HTML

HTMLCure：将浏览器体验转化为面向交互式HTML的状态引导修复

Jiajun Wu, Jian Yang, Tuney Zheng, Wei Zhang, Haowen Wang, Yihang Lou, Xianglong Liu

发表机构 * Beihang University（北京航空航天大学）； IQuest Research（IQuest研究院）； Peking University（北京大学）

AI总结提出HTMLCure框架，通过浏览器交互执行、状态感知诊断和闭环修复引擎，从大规模HTML页面中筛选并修复可修复页面，显著提升SFT数据质量和模型性能。

Comments 27 pages, 11 figures. Code: https://github.com/wuyuVerse/HTMLCure

详情

AI中文摘要

LLM现在可以生成完整的HTML页面，但其中许多页面仅在表面上正确：它们渲染一次，然后在滚动、悬停、点击、调整大小或游戏过程中失败。基于截图的评估可能遗漏这些失败，而过滤会丢弃许多仍然可修复的页面。我们引入了HTMLCure，一个浏览器体验框架，在系统与页面交互后评估HTML。评估器跨视口和交互状态执行页面，记录确定性的浏览器证据，并向VLM提供来自执行轨迹的精选关键帧，而非孤立截图。相同的状态信号驱动闭环修复引擎：HTMLCure诊断当前页面，选择特定状态的修复家族，再次运行每个候选页面，并导出质量清理后的页面用于SFT。在97K提示语料库上，这将直接可用的种子扩展为63703个质量清理页面的候选池，从中我们构建了最终的40K页面精炼SFT集。在相同骨干和训练方案下，HTMLCure-27B-Refined在HTMLBench-400上达到50.6分，确定性测试用例通过率为45.2%，与Kimi-K2.6和GPT-5.4等强参考行处于相同性能区间。在发布的MiniAppBench验证集上，它达到81.2的平均分，比原始27B SFT提高15.3分，接近强参考系统的水平。

英文摘要

LLMs can now produce full HTML pages, but many of those pages are only superficially correct: they render once, then fail under scroll, hover, click, resize, or gameplay. Evaluation from screenshots can miss these failures, and filtering discards many pages that are still repairable. We introduce HTMLCure, a browser experience framework that evaluates HTML after the system has interacted with it. The evaluator executes the page across viewports and interaction states, records deterministic browser evidence, and gives the VLM curated keyframes from the executed trajectory rather than isolated screenshots. The same state signal drives a closed loop repair engine: HTMLCure diagnoses the current page, chooses a state specific repair family, runs each candidate again, and exports quality cleared pages for SFT. On a 97K prompt corpus, this expands the directly usable seed into a candidate pool of 63703 quality cleared pages, from which we construct the final refined SFT set of 40K pages. Under the same backbone and training recipe, HTMLCure-27B-Refined reaches 50.6 on HTMLBench-400 with 45.2% deterministic test case pass, placing it in the same performance band as strong reference rows such as Kimi-K2.6 and GPT-5.4. On the released MiniAppBench validation split, it reaches 81.2 average, improving raw 27B SFT by 15.3 points and approaching the level of strong reference systems.

URL PDF HTML ☆

赞 0 踩 0

2605.26786 2026-05-27 cs.CY cs.AI cs.LG

Implementation of Big Data Analytics for Diabetes Management: Needs Assessment in the Rwanda Healthcare System

大数据分析在糖尿病管理中的应用：卢旺达医疗系统需求评估

Silas Majyambere, Tony Lindgren, Workneh Y. Ayele, Celestin Twizere

发表机构 * University of Rwanda（卢旺达大学）

AI总结本研究通过利益相关者研讨会评估卢旺达医疗系统采用大数据分析管理糖尿病的准备情况，并提出了一个基于可解释机器学习模型的实用框架。

详情

AI中文摘要

糖尿病是一种慢性代谢疾病，如果不及早诊断和管理，可能导致严重的健康问题。大数据分析和机器学习为分析大型健康数据集、支持早期发现和更好的治疗决策提供了实用工具。然而，它们在常规临床实践中的使用仍然有限。本研究考察了卢旺达医疗系统采用大数据分析管理糖尿病的准备情况。随着该国不断扩大电子病历和健康信息系统的使用，改善预测、监测和临床决策的新机遇随之出现。我们举办了一个为期五天的研讨会，涉及25名关键利益相关者，包括临床医生、数据管理员、政策制定者、医学研究人员、营养学家和技术提供商，以评估准备情况并识别现有差距。研究结果突出了大数据分析实施的潜力和主要挑战。基于这些结果，本文提出了一个实用的大数据分析框架，利用可解释的机器学习模型支持糖尿病管理策略。

英文摘要

Diabetes is a chronic metabolic disease that can lead to serious health problems if not diagnosed and managed early. Big Data Analytics (BDA) and machine learning offer practical tools for analyzing large health datasets and supporting early detection and better treatment decisions. However, their use in routine clinical practice is still limited. This study examines the readiness of Rwanda's healthcare system to adopt big data analytics for diabetes management. As the country continues to expand its use of electronic medical records and health information systems, new opportunities arise for improving prediction, monitoring, and clinical decision-making. A five-day workshop involving 25 key stakeholders, including clinicians, data managers, policymakers, medical researchers, nutritionists, and technology providers, was conducted to assess preparedness and identify existing gaps. The findings highlight both the potential and the main challenges of BDA implementation. Based on these results, the paper proposes a practical BDA framework to support diabetes management strategies using explainable machine learning models.

URL PDF HTML ☆

赞 0 踩 0

2605.26769 2026-05-27 cs.CY cs.AI

Generative artificial intelligence and the marginalization of minoritized knowledges in higher education: the case of disability

生成式人工智能与高等教育中少数群体知识的边缘化：以残疾为例

Fatiha Tali-Otmani

发表机构 * Université Toulouse Jean Jaurès-UMR EFTS（图卢兹让·雅克·儒勒大学-UMR EFTS）

AI总结研究通过教育科学、批判技术研究和残疾研究，揭示生成式人工智能如何通过以英语和西方为中心的训练数据集强化认知殖民性，导致残疾人群体的双重边缘化，并探讨研究者与机器混合以维护认知多样性的可能性及其结构性限制。

2605.26754 2026-05-27 cs.CR cs.AI

L2Rec：面向个性化推荐的LLM双视图理解

Pingjun Pan, Tingting Zhou, Peiyao Lu, Tingting Fei, Hongxiang Chen, Chuanjiang Luo

发表机构 * Netease Cloud Music（网易云音乐）

AI总结提出L2Rec方法，通过双视图个性化混合专家机制在参数层面统一行为与语义理解，实现端到端个性化推荐，实验证明优于现有方法。

Comments Accepted at SIGIR 2026

详情

DOI: 10.1145/3805712.3809943

AI中文摘要

将大型语言模型（LLM）适配于个性化推荐需要将其通用能力与用户特定偏好对齐，同时有效利用行为信号和语义信号。现有方法通常在输入层（例如，将行为嵌入注入令牌空间）或输出层（例如，独立编码器的对比对齐）整合这些信号，存在分布差距或缺乏端到端任务监督。在这项工作中，我们引入了L2Rec，它在LLM的参数层面统一了行为和语义理解。我们的关键洞察是，同一组Transformer参数可以作为两个视图的共享媒介：通过双视图个性化混合专家（DPMoE）机制应用视图特定的个性化低秩扰动，L2Rec使得单个LLM主干能够为每个用户产生互补的行为和语义适应，且表示层面的不对齐最小化。一个自适应跨视图融合模块进一步将双视图输出整合为统一的用户偏好。在四个数据集上的实验表明，L2Rec持续优于最先进的基线方法，并且在大型工业平台上的在线A/B测试验证了关键参与指标的显著改进。

英文摘要

Adapting large language models (LLMs) for personalized recommendation requires aligning their general-purpose capabilities with user-specific preferences while effectively leveraging both behavioral and semantic signals. Existing approaches typically integrate these signals at either the input level (e.g., injecting behavioral embeddings into the token space) or the output level (e.g., contrastive alignment of separate encoders), suffering from distribution gaps or lack of end-to-end task supervision. In this work, we introduce L2Rec, which unifies behavioral and semantic understanding at the parameter level of LLMs. Our key insight is that the same set of Transformer parameters can serve as a shared medium for both views: by applying view-specific, personalized low-rank perturbations via a Dual-view Personalized Mixture-of-Experts (DPMoE) mechanism, L2Rec enables a single LLM backbone to produce complementary behavioral and semantic adaptations for each user with minimal representation-level misalignment. An adaptive cross-view fusion module further integrates the dual-view outputs into a unified user preference. Experiments on four datasets show that L2Rec consistently outperforms state-of-the-art baselines, and online A/B testing on a large-scale industrial platform validates significant improvements in key engagement metrics.

URL PDF HTML ☆

赞 0 踩 0

2605.26713 2026-05-27 stat.ML cs.LG

打破认知陷阱：复合不确定性下的主动感知

Chayan Banerjee, Ethan Goan

发表机构 * School of Electrical Engineering and Robotics（电气工程与机器人学学院）

AI总结针对强化学习在安全关键领域中因状态-动力学耦合不确定性导致的失败，提出基于互信息的复合不确定性系数和主动信息寻求策略的适应性安全架构。

详情

AI中文摘要

在安全关键领域部署强化学习，从自动驾驶到医疗决策支持，受到系统遇到不熟悉条件时出现的失败的限制。我们认为，根本瓶颈不是单个挑战，如变化的动力学或不完整的观测，而是它们的协同交互，我们称之为认知陷阱：代理无法在不知道系统动力学的情况下估计其状态，也无法在没有准确状态信息的情况下学习动力学。在模拟运动中的概念验证实验表明，结合这些不确定性导致的失败远严重于单独挑战，性能下降77%，而单独效应相加为46%，展示了传统方法忽略的复合失败模式。这些方法采用被动的认知立场，无法解决这种耦合的不确定性。我们提出将安全重新定义为信息问题，引入一个适应性安全架构，围绕三个贡献构建：复合不确定性系数（κ），一种基于互信息的度量，量化状态-动力学耦合，可在线上计算而无需完整的联合信念推断；由MaxInfoRL目标驱动的信息寻求策略，主动探测系统动力学；以及随认知耦合上升而收紧的机制自适应安全约束。这种范式转变，从被动鲁棒性到主动感知，为在不确定性下运行、识别自身无知并战略性地采取行动解决它的决策系统提供了原则性路径。

英文摘要

Deploying reinforcement learning in safety critical domains, from autonomous vehicles to medical decision support, is constrained by failures arising when systems encounter unfamiliar conditions. We argue that the fundamental bottleneck is not individual challenges like changing dynamics or incomplete observations, but their synergistic interaction, which we term the Epistemic Trap: agents cannot estimate their state without knowing system dynamics, nor learn dynamics without accurate state information. Proof-of-concept experiments in simulated locomotion reveal that combining these uncertainties causes failures far worse than either challenge alone, a 77% performance degradation against the 46% by adding the individual effects, demonstrating compounding failure modes that conventional methods overlook. Such approaches adopt a passive epistemic stance that cannot resolve this coupled uncertainty. We propose reframing safety as an information problem, introducing an Adaptive Safety Architecture built around three contributions: the Compound Uncertainty Coefficient ($κ$), a mutual information based metric that quantifies state dynamics coupling and is computable online without full joint belief inference; information seeking policies governed by a MaxInfoRL objective that actively probe system dynamics; and regime-adaptive safety constraints that tighten as epistemic coupling rises. This paradigm shift, from passive robustness to active perception, offers a principled path toward decision making systems that operate under uncertainty, recognize their own ignorance, and act strategically to resolve it.

URL PDF HTML ☆

赞 0 踩 0

2605.26577 2026-05-27 eess.SY cs.AI cs.LG cs.SY math.OC

Bridging Control with Neural Network Verifier alpha-beta-CROWN: A Tutorial

桥接控制与神经网络验证器 alpha-beta-CROWN：教程

Haoyu Li, Xiangru Zhong, Hao Cheng, Bin Hu, Huan Zhang

发表机构 * Department of Computer Science（计算机科学系）； Department of Electrical and Computer Engineering（电气与计算机工程系）

AI总结本教程提出一个统一框架，通过将控制问题与神经网络验证器 α,β-CROWN 桥接，实现控制器属性的可扩展形式验证。

Comments ACC 2026 Tutorial

详情

AI中文摘要

基于学习的控制器合成方法因其高表达力和强经验性能而受到欢迎。然而，在自动驾驶、机器人技术和电力系统等安全关键场景中，仅凭经验性能是不够的，对控制器的稳定性、安全性等属性进行形式验证是非常可取的。不幸的是，许多先前的验证方法要么依赖于系统或证书的特定结构假设，难以在不同设置间迁移，要么在高维神经网络系统上可扩展性差。在本教程中，我们提出了一个统一框架，旨在通过将控制与最先进的神经网络验证器 $α,\!β$-CROWN（alpha-beta-CROWN）桥接来弥合这一差距。其核心是，$α,\!β$-CROWN 是一个通用的边界引擎，用于表示为计算图的非线性函数：给定一个输入域，它可以产生认证边界和非线性函数的显式线性松弛。这些认证边界本身对于可达性分析等任务很有用，并且它们为执行可满足性检查和优化的更复杂例程提供了基础。更具体地说，许多控制问题归结为验证状态域上的实值不等式（例如，李雅普诺夫理论）。因此，$α,\!β$-CROWN 通过计算紧边界并基于边界递归划分和剪枝子域，实现了这些条件的可扩展验证。得益于 GPU 并行化，该流程在对传统方法具有挑战性的验证和优化问题上展示了卓越的可扩展性。在本教程中，我们讨论了 $α,\!β$-CROWN 的基础知识，并介绍了其在各种控制相关任务中的应用。

英文摘要

Learning-based methods for synthesizing controllers have gained popularity due to their high expressiveness and strong empirical performance. However, in safety-critical scenarios such as autonomous driving, robotics, and power systems, empirical performance alone is insufficient, and formal verification of controller properties such as stability and safety is highly desirable. Unfortunately, many prior verification approaches are either tied to specific structural assumptions on the system or the certificate, making them difficult to transfer across settings, or suffer from poor scalability on higher-dimensional neural network systems. In this tutorial, we present a unified framework that aims to mitigate this gap via bridging control with the state-of-the-art neural network verifier $α,\!β$-CROWN (alpha-beta-CROWN). At its core, $α,\!β$-CROWN is a general-purpose bounding engine for nonlinear functions represented as computation graphs: given an input domain, it can produce certified bounds and explicit linear relaxation of the nonlinear function. These certified bounds are useful on their own for tasks such as reachability analysis, and they also provide the foundation for more complex routines that perform satisfiability checking and optimization. More specifically, many control problems reduce to verifying real-valued inequalities over a state domain (e.g., Lyapunov theory). Consequently, $α,\!β$-CROWN enables scalable verification of such conditions by computing tight bounds and recursively partitioning and pruning subdomains based on the bounds. Thanks to GPU parallelization, this pipeline demonstrates superior scalability on verification and optimization problems that are challenging for traditional approaches. In this tutorial, we discuss the basics of $α,\!β$-CROWN and introduce its application to various control-related tasks.

URL PDF HTML ☆

赞 0 踩 0

2605.26548 2026-05-27 cs.CR cs.LG

先设计，后编码：无模板的美观幻灯片生成

Zhiyao Cui, Chenxu Wang, Shuyue Hu, Yiqun Zhang, Wenqi Shao, Qiaosheng Zhang, Zhen Wang

发表机构 * School of Cybersecurity, Northwestern Polytechnical University（西北工业大学网络安全学院）； Shanghai Artificial Intelligence Laboratory（上海人工智能实验室）； Shanghai Innovation Institution（上海创新研究院）； Fudan University（复旦大学）

AI总结提出DeepSlides层次化幻灯片生成流程，通过解耦设计与实现、引入SlideDesign数据集和多智能体强化学习训练范式，在无模板条件下生成高质量幻灯片。

详情

AI中文摘要

自动生成演示幻灯片需要在严格的空间约束下协调叙事结构与页面级图形设计。对于这种结构化多模态任务，良好的设计流程对于确保幻灯片的最终质量至关重要。现有方法依赖固定模板或直接生成可执行代码，从而限制了LLM的创意布局设计能力，并绕过了关键的幻灯片页面设计步骤。为解决这些限制，本文(1)提出了一种层次化的幻灯片生成工作流DeepSlides，无需任何预定义模板或样式，系统化地组织幻灯片设计任务，将幻灯片页面设计与实现解耦；(2)引入了SlideDesign数据集，专门针对幻灯片生成任务定制；(3)提出了一种多智能体强化学习训练范式，并训练了一对模型SlideQwens，用于幻灯片设计和实现。实验结果表明，我们提出的框架在评估指标上优于基线方法，并在人类偏好评估中取得了优越性能。数据集和代码可在https://github.com/sxswz213/DeepSlides获取。

英文摘要

Producing presentation slides automatically entails coordinating narrative structure with page-level graphic design under strict spatial constraints. For such structured multimodal tasks, a well-organized design process is essential to ensure the final quality of slides. Existing approaches rely on fixed templates or directly emit executable code, thereby both limiting the creative layout-design capabilities of LLMs and bypassing the essential slide-page design step. To address these limitations, this paper (1) proposes a hierarchical slides generation workflow, DeepSlides, that systematically organizes slide design tasks without any predefined template or style, decoupling slide-page design from implementation; (2) introduces SlideDesign, a dataset tailored specifically for slides generation tasks; and (3) presents a multi-agent reinforcement learning training paradigm and trains a couple of models, SlideQwens, for slide design and implementation. Experimental results demonstrate that our proposed framework outperforms baseline methods on evaluated metrics and achieves superior performance in human preference evaluations. The dataset and code are available at https://github.com/sxswz213/DeepSlides.

URL PDF HTML ☆

赞 0 踩 0

2605.26429 2026-05-27 stat.ME cs.AI cs.LG stat.ML

Structure-Adaptive Conformal Inference for Large-Scale Out-of-Distribution Testing

面向大规模分布外检测的结构自适应共形推断

Rongyi Sun, Wenguang Sun, Zinan Zhao

发表机构 * Center for Data Science and School of Mathematical Sciences, Zhejiang University（数据科学中心和数学科学学院，浙江大学）

AI总结提出结构自适应共形q值(SCQ)和伪分数引导的直推式自动模型选择(P-TAMS)，在成对可交换性下实现结构化分布外检测的有限样本错误率控制、功效提升和可解释性增强。

2605.26424 2026-05-27 cs.IR cs.AI cs.LG

Uniboost: Global Coordination with Value Alignment for Fair and Efficient Traffic Allocation

Uniboost：基于价值对齐的全局协调实现公平高效的流量分配

Ge Fan, Nan Zhao, Kai Meng, Cong Luo, Yang Fu, Huiping Chu, Jialin Liu, Yuning Jiang, Bo Zheng

发表机构 * Taobao \& Tmall Group of Alibaba Hangzhou China ； Taobao \& Tmall Group of Alibaba Beijing China ； Taobao \& Tmall Group of Alibaba

AI总结提出Uniboost统一流量分配框架，通过后验价值对齐机制和独立线性提升范式，解决耦合分配、分数膨胀和可解释性问题，提升流量分配效率和推荐性能。

Comments accepted by SIGIR 2026

详情

AI中文摘要

随着互联网服务的快速发展，推荐系统已变得不可或缺。特别是混合（重排序）阶段在跨不同业务目标分配流量中起着关键作用。然而，现有方法常受限于耦合的分配方案、分数膨胀和缺乏可解释性。为应对这些挑战，我们提出Uniboost，一个统一的流量分配框架。Uniboost引入后验价值对齐机制，将抽象模型分数校准到具有明确业务语义的锚定指标，显著增强可解释性。此外，它采用独立的线性提升范式来解耦复杂的加权方案，实现每个计划贡献的精确归因。我们通过在线A/B测试和深入数据分析验证了Uniboost的有效性，展示了三个关键发现：1）降低加权分数的整体权重有效减轻了意外的业务干扰，产生更高效的微观流量分配策略；2）事后分析和聚合仪表板提供了直观的宏观洞察，指导整体流量分配机制的设计；3）提出的“有效完成分数”作为易于获取的后验指标，为内容推荐管道提供了可靠的锚点。综合来看，我们的实验表明，Uniboost不仅在微观层面提升了流量分配效率和推荐性能，还为系统迭代提供了宏观指导。因此，这项工作为大规模工业推荐系统提供了一种高效可控的流量调节解决方案。

英文摘要

With the rapid evolution of internet services, recommendation systems have become indispensable. In particular, the blending (re-ranking) stage plays a pivotal role in allocating traffic across diverse business objectives. However, existing approaches often suffer from coupled allocation plans, score inflation, and a lack of interpretability. To address these challenges, we propose Uniboost, a unified traffic allocation framework. Uniboost introduces a posterior value alignment mechanism that calibrates abstract model scores to anchor metrics with explicit business semantics, significantly enhancing interpretability. Furthermore, it employs an independent linear boosting paradigm to decouple complex weighting schemes, enabling precise attribution of each plan's contribution. We validate the effectiveness of Uniboost through online A/B tests and in-depth data analysis, demonstrating three key findings: 1) Reducing the overall weight of weighted scores effectively mitigates unintended business interference, yielding a more efficient micro-level traffic allocation strategy; 2) Post-hoc analyses and aggregated dashboards provide intuitive, macro-level insights that guide the design of the overall traffic allocation mechanism; 3) The proposed "Effective Completion Score" serves as an easily obtainable post-metric that offers a reliable anchor for content recommendation pipelines. Collectively, our experiments show that Uniboost not only improves traffic allocation efficiency and recommendation performance at the micro level but also provides macro-level guidance for system iteration. Thus, this work provides an efficient and controllable traffic regulation solution for large-scale industrial recommendation systems.

URL PDF HTML ☆

赞 0 踩 0

2605.26413 2026-05-27 stat.ME cs.AI cs.LG stat.ML

Confounder Detection via Treatment Intent: A New Observational Study Design

通过治疗意图进行混杂检测：一种新的观察性研究设计

Drago Plecko, Patrik Okanovic, Torsten Hoefler, Elias Bareinboim

发表机构 * UCLA（加州大学洛杉矶分校）； ETH Zurich（苏黎世联邦理工学院）； Columbia University（哥伦比亚大学）

AI总结提出一种通过询问治疗决策者比较配对单元来揭示未观测混杂因素的新研究设计，并在ICU数据中验证其有效性。

详情

AI中文摘要

理解干预的效果是科学进步的核心，随机对照试验（RCT）在许多应用领域被视为因果推断的金标准。然而，RCT成本高、耗时长，且常受伦理或实际限制，这促使我们需要能够从观察性数据中得出结论的因果方法。尽管此类数据收集规模日益扩大，但将其用于因果推断常因并非所有影响治疗分配和结果的变量都被观测到而受阻，这一问题称为未观测混杂。在本文中，我们介绍了一种称为通过治疗意图进行混杂检测的新研究设计。其思路是询问做出治疗决策的人类专家，并要求他们比较由原则性匹配策略提出的单元对，目的是引出解释治疗决策为何不同的未观测变量。我们为此类程序提供了理论基础，确定了此类研究设计可能引出未观测混杂因素的条件。基于这些新建立的基础，我们研究了重症监护病房（ICU）中干预的治疗效果。首先，我们展示了强烈表明ICU中收集的电子健康记录（EHR）存在未观测混杂的经验证据。通过使用临床文本笔记作为医生知识的代理并利用自然语言处理，我们在已知真实情况的半合成环境中为我们的方法提供了概念验证。

英文摘要

Understanding the effects of interventions is central to scientific progress, with randomized controlled trials (RCTs) regarded as the gold standard for causal inference in many applied fields. However, RCTs are costly, time-consuming, and often constrained by ethical or practical limitations, motivating the need for causal methods able to draw conclusions from observational data. While such data is collected at ever larger scale, making its use for causal inference is often hindered by the fact that not all variables affecting treatment allocation and the outcome are observed: an issue known as unobserved confounding. In this paper, we introduce a new study design called confounder detection via treatment intent. The idea is to query a human expert who makes treatment decisions, and ask them to compare pairs of units proposed by a principled matching strategy, with the goal of eliciting unobserved variables that explain why treatment decisions differ. We provide a theoretical basis for such a procedure, ascertaining conditions under which such a study design may elicit unobserved confounders. Building on this newly established foundations, we study treatment effects of interventions in the intensive care unit (ICU). First, we show empirical evidence strongly indicating that electronic health records (EHRs) collected in ICUs are subject to unobserved confounding. By using clinical text notes as a proxy for physicians' knowledge and leveraging natural language processing, we provide a proof of concept for our methodology in a semi-synthetic environment with a known ground truth.

URL PDF HTML ☆

赞 0 踩 0

2605.26409 2026-05-27 cs.CR cs.AI cs.LG

Jailbreak susceptibility prediction and mitigation via the behavioral geometry of models

通过模型的行为几何进行越狱易感性预测与缓解

Hayden Helm, Xiaodong Liu, Weiwei Yang

发表机构 * Microsoft Research（微软研究院）

AI总结本文通过形式化模型群体的行为几何，利用已评估和防御的模型，实现高效的易感性预测和防御迁移，在79个模型和100个系统配置上，易感性检测AUPRC达0.94且探针减少约98%，防御迁移性能优于同供应商分配。

详情

AI中文摘要

评估和缓解生成系统对越狱攻击的易感性对其安全部署至关重要。由于可部署系统的数量众多，对每种配置进行全面评估和优化是不切实际的。本文形式化了模型群体的行为几何，通过利用先前评估和防御过的模型，支持群体内高效的易感性预测和有效的防御迁移。我们将该框架应用于涵盖24个提供商的79个模型以及单个基础模型的100个系统配置。使用行为几何的简单方法在易感性检测中达到了0.94的AUPRC，与全面评估相比，探针数量减少了约98%。使用行为几何选择从哪个模型迁移优化后的防御，在无额外探针成本的情况下优于同供应商分配（+2%，p = 0.03），且一组三个模型足以覆盖整个群体。结果对超参数选择和评判者具有鲁棒性。

英文摘要

Evaluating and mitigating a generative system's susceptibility to jailbreak attacks is critical to its safe deployment. Given the number of deployable systems, full per-configuration evaluation and optimization is impractical. In this paper, we formalize the behavioral geometry of a population of models that, by leveraging previously evaluated and defended models, supports both efficient susceptibility prediction and effective defense transfer across a population. We apply the framework to 79 models spanning 24 providers and to 100 system configurations of a single base model. Simple methods that use the behavioral geometry reach an AUPRC of $0.94$ for susceptibility detection with $\approx98\%$ fewer probes relative to a full evaluation. Using the behavioral geometry to select which model to transfer an optimized defense from outperforms same-provider assignment ($+2\%$, $p = 0.03$) at no additional probe cost, with a set of three models sufficient to cover the population. Results are robust to hyperparameter selection and judge.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

RAGEAR: Retrieval-Augmented Graph-Enhanced Academic Recommender

HTMLCure: Turning Browser Experience into State Guided Repair for Interactive HTML

Implementation of Big Data Analytics for Diabetes Management: Needs Assessment in the Rwanda Healthcare System

Generative artificial intelligence and the marginalization of minoritized knowledges in higher education: the case of disability

Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control

MatFormBench: A Benchmarking Evaluation Framework for Target-Driven Materials Formulation

Measuring Prediction Uncertainty in Neural Cellular Automata

L2Rec: Towards Dual-View Understanding of LLMs for Personalized Recommendation

Transformers Can Learn Posterior Predictive Distributions In-Context

ProcCtrlBench: Evaluating Process-Level Defects and Control Preservation in LLM Coding Agents

Jacobian-Velocity Bounds for Deployment Risk Under Covariate Drift

Cryptographic Registry Provenance: Structural Defense Against Dependency Confusion in AI Package Ecosystems

Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection

Certified Purity for Cognitive Workflow Executors: From Static Analysis to Cryptographic Attestation

Certified Causal Attribution for Real-Time Attack Forensics in 6G Network Slicing

CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk

Sample Complexity of Policy Gradient for Log-Growth Control

Breaking the Epistemic Trap: Active Perception Under Compound Uncertainty

Bridging Control with Neural Network Verifier alpha-beta-CROWN: A Tutorial

SEC-bench Pro: Can Language Models Solve Long-Horizon Software Security Tasks?

ChainCaps: Composition-Safe Tool-Using Agents via Monotonic Capability Attenuation

DGLD: Domain-Gated Latent Diffusion for the Discovery of Novel Energetic Materials

StreamSplit: Continuous Audio Representation Learning via Uncertainty-Guided Adaptive Splitting

Foundations of a Time-Consistent Counterfactual Actuarial Runtime for Autonomous AI Agents

Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization

Design First, Code Later: Aesthetically Pleasing Template-Free Slides Generation

Structure-Adaptive Conformal Inference for Large-Scale Out-of-Distribution Testing

Uniboost: Global Coordination with Value Alignment for Fair and Efficient Traffic Allocation

Confounder Detection via Treatment Intent: A New Observational Study Design

Jailbreak susceptibility prediction and mitigation via the behavioral geometry of models