arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 1676
2605.15905 2026-05-18 cs.IR cs.AI

Generative Long-term User Interest Modeling for Click-Through Rate Prediction

生成长期用户兴趣建模用于点击通过率预测

Jiangli Shao, Kaifu Zheng, Hao Fang, Huimu Ye, Zhiwei Liu, Bo Zhang, Shu Han, Xingxing Wang

AI总结 本文提出GenLI模型,通过生成兴趣模块、行为检索模块和兴趣融合模块,提升CTR预测的准确性和效率,解决传统方法中长期兴趣建模不完整和效率低的问题。

详情
AI中文摘要

通过大规模历史用户行为建模长期用户兴趣可提升广告和推荐系统中点击通过率(CTR)预测性能。通常采用两阶段框架,其中通用搜索单元(GSU)首先检索目标物品的相关行为,精确搜索单元(ESU)通过定制注意力生成兴趣特征。然而,当前以目标为中心的GSU会忽略其他潜在用户兴趣,导致兴趣特征不完整和偏差。此外,GSU中的匹配基于检索过程依赖于目标物品与每个历史行为之间的成对相似度分数,这不仅使在线服务在用户行为增长时变得耗时,还忽略了用户行为间的交互信息。为解决这些问题,我们提出了一种名为GenLI的生成长期用户兴趣模型用于CTR预测。GenLI包括兴趣生成模块(IGM)、行为检索模块(BRM)和兴趣融合模块(IFM)。IGM生成多个兴趣分布以表示实时用户兴趣的不同方面,该模块是目标无关的,并且结合行为间的交互信息,确保兴趣特征的完整和多样化。BRM通过简单的查找操作选择相关行为,将加权每个行为的时间复杂度降低到O(1)。最后,IFM使用精细的门控机制生成兴趣特征。基于生成过程,GenLI提高了用户兴趣的多样性,避免了基于匹配的行为检索,实现了CTR预测在准确性和效率之间的更好平衡。

英文摘要

Modeling long-term user interests with massive historical user behaviors enhances click-through rate (CTR) prediction performance in advertising and recommendation systems. Typically, a two-stage framework is widely adopted, where a general search unit (GSU) first retrieves top-$k$ relevant behaviors towards the target item, and an exact search unit (ESU) generates interest features via tailored attention. However, current target-centered GSU would ignore other latent user interests, leading to incomplete and biased interest features. Additionally, the matching-based retrieval process in GSUs depends on the pairwise similarity score between target item and each historical behavior, which not only becomes time-consuming for online services as user behaviors continue to grow, but also overlooks the interaction information among user behaviors. To combat these problems, we propose a \textbf{Gen}erative \textbf{L}ong-term user \textbf{I}nterest model named GenLI for CTR prediction. GenLI consists of an interest generation module (IGM), a behavior retrieval module (BRM), and an interest fusion module (IFM). The IGM generates multiple interest distributions to indicate different aspects of real-time user interests, which is target-independent and incorporates interaction information among behaviors, ensuring complete and diverse interest features. The BRM selects related behaviors via a simple lookup operation, reducing the time complexity for weighting each behavior to $O(1)$. Finally, the IFM uses delicate gating mechanisms to generate interest features. Based on the generation process, GenLI improves the diversity of user interests and avoids complex matching-based behavioral retrieval, achieving a better balance between accuracy and efficiency for CTR prediction.

2605.15895 2026-05-18 eess.IV cs.CV

Layer Selection in Feature-Based Losses Affects Image Quality and Microstructural Consistency in Deep Learning Super-Resolution of Brain Diffusion MRI

基于特征的损失函数中层选择影响深度学习超分辨率中图像质量及微结构一致性

David Lohr, Rene Werner

AI总结 本研究探讨了基于特征的损失函数在深度学习超分辨率中对扩散信号一致性的影响,发现深层网络层会导致网格状伪影,而浅层网络层能保持图像与地面真实的一致性,尤其在9倍超分辨率下表现优异。

详情
AI中文摘要

高分辨率扩散MRI的临床应用受硬件限制和扫描时间阻碍,推动了计算超分辨率的发展。本研究探讨了基于特征的损失函数在深度学习超分辨率中保持扩散信号一致性的有效性。利用人类连接组计划的7T数据生成低分辨率和高分辨率扩散加权图像对,训练了UNets进行2D超分辨率。通过消融和隔离研究,评估了不同VGG16层用于特征损失与图像基L1基准的性能。更深层的层和其组合在超分辨率DWI中产生网格状伪影,这种伪影在扩散参数如定量和各向异性分数中持续存在。使用最浅层时没有此类伪影。该层的下游分析显示与地面真实高度一致,即使在9倍超分辨率下也是如此。图像SNR和使用的VGG16层深度调节伪影的出现和严重程度,要求在扩散MRI中谨慎选择贡献层。

英文摘要

Clinical application of high-resolution diffusion MRI is hindered by hardware limitations and prohibitive scan times, motivating computational super-resolution. This study investigates the efficacy of a feature-based loss function in preserving diffusion signal consistency in deep learning super-resolution. Using 7T data from the human connectome project to generate pairs of low- and high-resolution diffusion weighted images (DWI), we trained UNets for 2D super-resolution. Ablation and isolation studies evaluated different VGG16-layers for feature-based losses against an image-based L1 baseline. Deeper layers and combinations thereof resulted in grid-like artifacts in super-resolution DWIs, which persisted in diffusion parameters like quantitative and fractional anisotropy. No such artifacts were present when using the shallowest layer. Downstream analysis for this layer showed great consistency with the ground truth, even for 9-fold super-resolution. Image SNR and used VGG16-layer depths modulated artifact appearance and severity, mandating careful selection of contributing layers for application in and beyond diffusion MRI.

2605.15889 2026-05-18 cs.CR cs.LG

A Multi-Layer Cloud-IDS Pipeline with LLM and Adaptive Q-Learning Calibration

多层云入侵检测系统与LLM和自适应Q学习校准

Syed Waqas Ali, Ibrar Ali Shah, Farzana Zahid, Daniyal Munir, Hans D. Schotten

AI总结 本文提出一种多层云入侵检测系统,结合强化学习和LLM,通过自适应阈值和Chroma数据库提升检测性能,实验显示系统在准确率、精确率和召回率等方面表现优异。

详情
AI中文摘要

由于分层云架构、动态环境和未知攻击等因素,云安全成为重大关注点。入侵检测系统(IDS)通常在特定层运行,依赖机器学习模型,但实验效果好却难以在实际云部署中维持性能。本文实现了一个基于强化学习的自信度多级入侵检测系统,保护网络、主机和虚拟机三层。每层机器学习模型检测已知攻击模式,预测自信度区分可靠决策与不确定结果。在多闸门流程中,低自信事件通过学习阈值自信门(Gate-1),随后通过Chroma内存匹配门(Gate-2),未解决事件升级至大语言模型(LLM)进行语义分析和解释。最终在Gate-3使用校准的LLM自信度或加权融合回退,不确定事件保留在审查桶中以避免强制分类。生成的解释和确认知识存储在ChromaDB中以支持未来分析和再训练。该方法首先使用静态阈值评估,建立比较基准。结果表明,所提系统学习自适应阈值,减少LLM升级58.78%,降低成本同时保持强性能(88.68%准确率,85.29%精确率,84.72%召回率,85.00% F1)。网络和虚拟机层分别达到98.02%和97.08%准确率,证明了系统平衡且高效的检测能力。

英文摘要

Security in cloud computing has become a major concern due to several factors such as layered cloud architectures, dynamic environments, and exposure to unseen or zero-day attacks. Moreover, intrusion detection systems (IDS) typically operate at specific layers and rely heavily on machine learning models, which often perform well in experimental settings but fail to sustain performance in real cloud deployments. In this work, we implement a confidence-aware multilevel intrusion detection system using reinforcement learning tailored for cloud environments. The system secures three distinct layers: network, host, and hypervisor. Machine learning models at each layer detect known attack patterns, while prediction confidence distinguishes reliable decisions from uncertain outcomes. Within the multi-gate flow, low-confidence events pass through a learned-threshold confidence gate (Gate-1), followed by a Chroma memory-matching gate (Gate-2), with unresolved events escalated to a large language model (LLM) for semantic analysis and explanation. Final attack promotion at Gate-3 uses calibrated LLM confidence or weighted-fusion fallback, while uncertain events are retained in a review bucket to avoid forced classification. Generated explanations and confirmed knowledge are stored in ChromaDB to support future analysis and retraining. The approach is first evaluated using static thresholds, establishing a baseline for comparison. Results show that the proposed system learns adaptive thresholds and reduces LLM escalation by 58.78%, lowering cost while maintaining strong performance (88.68% accuracy, 85.29% precision, 84.72% recall, 85.00% F1). The network and hypervisor layers achieve 98.02% and 97.08% accuracy, demonstrating a balanced and efficient detection system.

2605.15881 2026-05-18 math.DS cs.AI physics.comp-ph

Symplectic Neural Operators for Learning Infinite Dimensional Hamiltonian Systems

辛神经算子用于学习无限维哈密顿系统

Yeang Makara, Yusuke Tanaka, Takashi Matsubara, Takaharu Yaguchi

AI总结 本文提出辛神经算子,用于解决无限维哈密顿系统建模与模拟中的计算与结构挑战,通过保持辛结构提升长期稳定性与能量行为。

详情
AI中文摘要

无限维哈密顿系统的建模与模拟是数学物理和工程中的核心问题,但对标准数据驱动架构提出了显著的计算和结构挑战。本文引入辛神经算子,一种设计用于保持哈密顿PDE内在辛结构的神经算子架构。我们对它们的辛性进行了理论表征,并基于辛结构保持与学习精度的结合,建立了严格的长期稳定性结果。对典型哈密顿PDE的数值实验验证了这一理论结果,并显示SNOs相比非结构保持神经算子表现出改进的能量行为。

英文摘要

The modeling and simulation of infinite-dimensional Hamiltonian systems are central problems in mathematical physics and engineering, however they pose significant computational and structural challenges for standard data-driven architectures. In this work, we introduce the Symplectic Neural Operator, a neural operator architecture designed to preserve the symplectic structure intrinsic to Hamiltonian PDEs. We provide a theoretical characterization of their symplecticity and establish a rigorous long-term stability result based on the combination of symplectic structure preservation and learning accuracy. Numerical experiments on canonical Hamiltonian PDEs corroborate this theoretical result and show that SNOs exhibit improved energy behavior compared with non-structure-preserving neural operators.

2605.15859 2026-05-18 cs.DS cs.LG math.ST stat.ML stat.TH

Complexity of Non-Log-Concave Sampling in Fisher Information

非对数凹分布采样中复杂性的研究

Sinho Chewi, Andre Wibisono

AI总结 研究非对数凹分布采样中相对信息量保证的查询复杂性,提出基于近端采样器的算法,利用受限高斯 oracle 实现,改进非对数凹采样的复杂性并提升对数凹采样的精度。

详情
AI中文摘要

我们研究了获得非对数凹分布采样相对 Fisher 信息保证的查询复杂性,该问题类似于优化中的近似 stationary 点寻找。我们的算法基于近端采样器,即 Langevin 扩散的隐式离散化,并需要实现称为受限高斯 oracle(RGO)的后向步骤。我们展示通过利用最近在 Rényi 散度中高精度对数凹采样的结果,可以得到近似 RGO 实现,当与近端采样器结合时,能够获得在相对 Fisher 信息中继承与对数凹采样相同维度依赖性的复杂性保证,并在非对数凹采样中改进先前工作。我们还展示了一个逆向减少,任何在非对数凹采样中相对 Fisher 信息的维度依赖性改进都将导致高精度对数凹采样中的维度依赖性改进。

英文摘要

We study the query complexity of obtaining a relative Fisher information guarantee for sampling from a log-smooth non-log-concave distribution; this is a sampling analog of finding an approximate stationary point in optimization. Our algorithm is based on the proximal sampler, which is an implicit discretization of the Langevin diffusion, and requires an implementation of the backward step known as the restricted Gaussian oracle (RGO). We show that by leveraging the recent results for log-concave sampling with high-accuracy guarantees in Rényi divergence, we can obtain an approximate RGO implementation that -- when used with the proximal sampler -- yields a complexity guarantee in relative Fisher information that inherits the same dimension dependence as log-concave sampling, and improves upon prior work for non-log-concave sampling. We also show a converse reduction that any improvement in the dimension dependence in relative Fisher information for non-log-concave sampling will yield an improved dimension dependence for high-accuracy log-concave sampling.

2605.15848 2026-05-18 cs.HC cs.CL

Conversations in Space: Structuring Non-Linear LLM Interactions on a Canvas

空间对话:在画布上构建非线性大语言模型交互结构

Rifat Mehreen Amin, Alperen Adatepe, Daniela Fernandes, Daniel Buschek, Andreas Butz

AI总结 本文提出CanvasConvo,通过将线性聊天转化为分支对话树,支持在画布上探索假设场景,提升LLM交互的非线性结构和探索效率。

详情
AI中文摘要

由大型语言模型(LLMs)驱动的对话界面广泛用于创意和分析,但其线性结构限制了替代方案的探索和长对话的管理。我们提出了CanvasConvo,一种将线性聊天转化为嵌入在空间画布中的分支对话树的对话界面概念。CanvasConvo允许用户通过直接从对话内容分支来探索假设场景,支持平行开发替代方向。这些分支在画布上可视化,同时与熟悉的聊天界面保持集成,允许用户在线性和非线性交互之间切换。支持时间线导航、自动标记和总结、以及上下文感知控制(例如目标、可重用提示)等功能,支持结构化交互和连续性。我们在5-7天的现场研究中评估了CanvasConvo,与24名参与者一起。我们的发现突显了非线性对话结构如何支持探索性工作流程和不同的LLM工作交互。

英文摘要

Conversational interfaces powered by large language models (LLMs) are widely used for ideation and analysis, yet their linear structure limits exploration of alternatives and management of long-running interactions. We present CanvasConvo, a conversational interface concept that transforms linear chat into a branching conversation tree embedded in a spatial canvas. CanvasConvo enables users to explore what-if scenarios by branching directly from conversational content, supporting parallel development of alternative directions. These branches are visualized on a canvas while remaining integrated with a familiar chat interface, allowing users to switch between linear and non-linear interaction. Features such as timeline-based navigation, automatic tagging and summarization, and context-aware controls (e.g., goals, reusable prompts) support structured interaction and continuity. We evaluated CanvasConvo in a 5-7 day field study with 24 participants. Our findings highlight how non-linear conversational structures support exploratory workflows and different interactions in LLM-based work.

2605.15832 2026-05-18 cs.PF cs.LG

Heuristic-Based Merging of HPC Traces to Extend Hardware Counter Coverage

基于启发式的HPC追踪合并以扩展硬件计数器覆盖率

Júlia Orteu Aubach, Fabio Banchelli, Marc Clascà Ramírez, Marta Garcia-Gasulla

AI总结 本文提出一种基于启发式的HPC追踪合并方法,通过分析MPI结构、时间和通信模式匹配计算突发,从而扩展硬件计数器覆盖率,用于性能预测和分析。

详情
AI中文摘要

本文扩展了一个用于预测高性能计算工作负载性能的框架,使用机器学习。性能建模的一个常见限制是同时收集的硬件计数器数量有限。为此,我们提出了一种基于启发式的策略,通过分析MPI结构、时间和通信模式来匹配多个运行中的计算突发,从而构建一个包含更广泛硬件特征的统一数据集,而无需使用多路复用。输出是一个新的合成追踪,包含所有合并的计数器,可用于HPC性能预测和传统性能分析。该方法已在MareNostrum5机器上验证,使用各种内核和实际应用。结果表明,合并后的计数器在不同应用中保持了可接受的准确性,并可直接用于在更丰富的特征空间上训练机器学习模型,而无需先进行计数器选择。

英文摘要

This work extends a framework for predicting the performance of High-Performance Computing (HPC) workloads using Machine Learning (ML). A common limitation in performance modeling is the restricted number of hardware counters that can be collected simultaneously. To address this, we propose a heuristic-based methodology to merge execution traces from multiple runs, each instrumented with a different set of hardware counters. Our approach matches computation bursts across executions by analyzing MPI structure, timing, and communication patterns. This process enables the construction of a unified dataset that includes a wider set of hardware features without relying on multiplexing. The output is a new synthetic trace with all merged counters, which can be used both for HPC performance prediction and for conventional performance analysis. The methodology has been validated on MareNostrum5 machine with a range of kernels and real applications. Results show that the merged counters maintain acceptable accuracy depending on the application, and can be directly used to train ML models on a richer feature space without prior counter selection.

2605.15816 2026-05-18 cs.GR cs.CV cs.LG

StippleDiffusion: Capacity-Constrained Stippling using Controlled Diffusion

StippleDiffusion:基于受控扩散的容量受限点绘制

Ofir Gilad, Aleksander Plocharski, Przemyslaw Musialski, Andrei Sharf

AI总结 本文提出一种基于扩散模型的点绘制方法,通过学习局部点分布先验和连续容量约束,实现高效且可微的点集生成,适用于任意目标密度。

Comments 12 pages, 10 figures

详情
AI中文摘要

点绘制模式,即局部密度跟踪目标图像的点集,传统上由逐密度迭代优化器生成,速度慢且非可微,每次新目标需重新运行。学习替代方法至今仅能处理无条件点生成;容量受限、图像条件化的点绘制仍无法实现。我们提出了首个基于扩散的采样器,能够在推理时同时满足学习的局部点分布先验和连续的图像定义容量约束。该方法基于最优传输网格点集扩散基础线程,构建在ControlNet分支上,条件于目标密度图和高分辨率图像。两种设计选择使组合可行:训练和推理限制在后期去噪阶段,初始化自密度加权拒绝样本;标准零卷积注入被替换为sigmoid门控1x1投影,以在强密度信号下保持基础模型的蓝噪声结构。单个训练检查点在推理时接受任意目标密度,可泛化至训练时未见过的点预算,并在输出点数几乎无关的时间内生成点集。在Icons-50基准测试中,我们的学习采样器在所有报告的指标上与逐密度优化基线持平,且保持端到端可微。

英文摘要

Stipple patterns, point sets whose local density tracks a target image, are traditionally produced by per-density iterative optimizers, which are slow, non-differentiable, and must be re-run from scratch for each new target. Learned alternatives have so far addressed only unconditional point generation; capacity-constrained, image-conditioned stippling has remained out of reach. We present the first diffusion-based sampler that simultaneously satisfies a learned local point-distribution prior and a continuous, image-defined capacity constraint at inference. The method is a ControlNet branch built on top of an optimal-transport-grid point-set diffusion baseline, conditioned on the target density map and a high-resolution image. Two design choices make the combination tractable: training and inference are restricted to the late-stage denoising regime, initialized from a density-weighted rejection sample, and the standard zero-convolution injection is replaced with a sigmoid-gated 1x1 projection that preserves the base model's blue-noise structure under hard density signals. A single trained checkpoint accepts arbitrary target densities at inference, generalizes to point budgets that were not seen during training, and produces stipples in time nearly independent of the output point count. On the Icons-50 benchmark, our learned sampler reaches parity with per-density-optimized baselines on every reported metric while remaining differentiable end-to-end.

2605.15815 2026-05-18 cs.SE cs.CL cs.MA

BootstrapAgent: Distilling Repository Setup into Reusable Agent Knowledge

BootstrapAgent: 将仓库设置提炼为可重用的代理知识

Sihan Fu, Oucheng Liu, Shiyuan Wang, Jin Shi, Chengkun Wei

AI总结 BootstrapAgent通过提炼仓库初始化过程中的启发式知识,生成可验证的代理合同,提升代码代理在 unfamiliar repositories 的任务成功率和效率。

Comments 19 pages, 9 figures, 6 tables

详情
AI中文摘要

代码代理越来越多地帮助开发者处理不熟悉的仓库,但每次任务都依赖于昂贵的前置条件:将仓库初始化为可使用的开发状态。这一过程需要大量的试错探索,但由此产生的知识——解决依赖关系、修复策略——被困在单次对话中,无法为未来的代理所用。因此,我们将仓库初始化视为一个可重用的启动知识问题,并引入BootstrapAgent,一个多代理框架,将初始化探索中发现的启发式方法提炼成持久、可验证的代理可消费的.bootstrap合同。通过证据提取、结构化规划、确定性Docker验证和基于跟踪的修复,BootstrapAgent生成涵盖环境设置、诊断检查、最小验证和累积修复知识的合同。我们进一步提出 warm repair with clean replay 来加速迭代调试而不牺牲冷启动可重现性,并提出 delta repair with sanity check 来防止奖励黑客。在三个基准测试中的实验表明,BootstrapAgent实现了92.9%的成功率,比基线高出超过10%,同时减少了下游代理的token使用量25.9%和构建时间22.3%。我们的代码可在https://github.com/Vossera/BootstrapAgent上获得。

英文摘要

Code agents increasingly help developers work with unfamiliar repositories, but every such task depends on a costly prerequisite: bootstrapping the repository into a usable development state. This process requires substantial trial-and-error exploration, yet the resulting knowledge--resolved dependencies, repair strategies--stays trapped in a single conversation, unavailable to future agents. We therefore formulate repository bootstrapping as a reusable startup knowledge problem and introduce BootstrapAgent, a multi-agent framework that distills the heuristics discovered during bootstrap exploration into a persistent, verifiable, agent-consumable .bootstrap contract. Through evidence extraction, structured planning, deterministic Docker-based verification, and trace-driven repair, BootstrapAgent generates a contract covering environment setup, diagnostic checks, minimal verification, and accumulated repair knowledge. We further propose warm repair with clean replay to accelerate iterative debugging without sacrificing cold-start reproducibility, and a delta repair with sanity check to prevent reward hacking. Experiments on three benchmarks show that BootstrapAgent achieves a 92.9% success rate, outperforming the baseline by over 10% while reducing downstream agent token usage by 25.9% and build time by 22.3%. Our code is available at https://github.com/Vossera/BootstrapAgent.

2605.15812 2026-05-18 cs.HC cs.AI

Toward Natural and Companionable Virtual Agents via Cross-Temporal Emotional Modeling

通过跨时间情感建模实现自然和陪伴型虚拟代理

Feier Qin, Xiao Li, Yi Zheng, Haibin Huang, Hanyao Wang, Xiaoyu Wang, Yan Lu, Yuan Zhang

AI总结 本文提出CTEM框架,通过链接长期行为历史与即时情感表达,提升虚拟代理的自然性和情感和谐度,实验显示在21天的真实场景中效果显著。

Comments 21 pages, published in CHI '26

详情
Journal ref
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26), ACM, 2026
AI中文摘要

最近基础模型的进步使对话代理旨在持续陪伴而非单纯任务完成。然而大多数代理仍无法支持自然、长期的陪伴式互动,导致体验显得片段化和不真实。我们主张当前代理忽视了跨时间建模的社会行为和内部情感:生成的行为很少影响代理的情感状态,而情感状态 seldom 形成后续行为。我们提出了跨时间情感建模(CTEM)框架,该框架将长期行为历史与即时情感表达联系起来。CTEM建立了一个闭环,过去的经验更新演化的心理状态;该状态调节即时互动;用户反馈不断修订记忆和心理状态,使反思和预期成为可能。我们将CTEM实例化为Auri,一个即时通讯平台上的陪伴代理,并报告了一项21天的真实场景研究,显示CTEM在感知自然性、连贯性和情感和谐度方面有所改进。

英文摘要

Recent advances in foundation models have enabled conversational agents that aim for sustained companionship rather than mere task completion. Yet most still remain unable to support natural, long-term companion-like interactions, resulting in experiences that feel episodic and inauthentic. We argue that current agents overlooked cross-temporal modeling of agents' social behaviors and internal emotions: generated behaviors rarely influence an agent's emotional state, and emotional states seldom shape subsequent behaviors. We present Cross-Temporal Emotion Modeling (CTEM), a framework that links long-term behavioral history to moment-to-moment emotional expression. CTEM establishes a closed loop where past experiences update an evolving emotional state; this state conditions immediate interactions; and user feedback continually revises both memory and emotional state, enabling reflection and anticipation. We instantiate CTEM as Auri, a companion agent on an instant-messaging platform, and report a 21-day in-the-wild study showing that CTEM shows improvements in perceived naturalness, coherence, and emotional harmony.

2605.15799 2026-05-18 cs.MA cs.RO

From Gridworlds to Warehouses: Adapting Lightweight One-shot Multi-Agent Pathfinding for AGVs

从网格世界到仓库:为AGVs适应轻量级一次性多智能体路径规划

Hiroki Nagai, Keisuke Okumura

AI总结 本文提出多智能体仓库路径规划(MAWPF),针对差分驱动AGVs的运动特性,引入四条约束条件,改进传统MAPF算法,通过实验验证PP和LNS2在多智能体场景下的不足,PIBT类方法在可扩展性上表现更优。

Comments To be presented at IJCAI 2026

详情
AI中文摘要

多智能体路径规划(MAPF)在一次性规划中是仓库自动化的核心组成部分,但传统方法通常假设四连通的2D网格,具有单位时间四方向移动。为填补现实差距并仍能用离散组合搜索跟踪,本文提出更实用的多智能体仓库路径规划(MAWPF),具有四个约束:(i)智能体动作受限于直线运动和原地旋转;(ii)旋转需要多步成本;(iii)考虑加速度和减速度;(iv)禁止跟随碰撞以防止追尾事故。为高效解决MAWPF,我们适应了代表性的次优MAPF算法-PP、LNS2、PIBT和LaCAM,并进行了全面的基准测试。我们的实验表明,PP和LNS2在智能体数量多的实例中表现不佳,而基于PIBT的方法在增加解决方案成本时具有更优的可扩展性。我们相信,这些构成了将经典网格世界MAPF适应到运营仓库设置的重要一步。

英文摘要

Multi-agent pathfinding (MAPF) under one-shot planning is a core component of warehouse automation, yet classical formulations typically assume four-connected 2D grids with unit-time moves in four directions. To fill reality gaps while still being trackable with discrete combinatorial search, this work proposes a more practical counterpart tailored to differential-drive AGVs. We term this multi-agent warehouse pathfinding (MAWPF), featured with four constraints: (i) agent actions are restricted to straight motion and in-place rotation; (ii) rotations require multi-step costs; (iii) acceleration and deceleration are considered, and; (iv) follower collisions are prohibited to prevent rear-end crashes. To solve MAWPF efficiently, we adapt representative suboptimal MAPF algorithms-PP, LNS2, PIBT, and LaCAM-and conduct comprehensive benchmarking. Our experiments reveal that PP and LNS2 struggle to solve instances with many agents, while PIBT-based approaches achieve preferable scalability with increased solution cost. We believe that these constitute an important step toward adapting classical gridworld MAPF to operational warehouse setups.

2605.15788 2026-05-18 cs.DC cs.LG

ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

ADAPT:一种自校准的前瞻性容器编排自动扩展器

Himanshu Singh Baghel

AI总结 ADAPT通过在线EWMA估计器动态调整冷启动时间,结合MPC优化副本数量,实现低于5%的SLA违规率,优于传统HPA和MPC+Prophet方案。

Comments 9 pages, 5 figures, 3 tables. Includes reproducible simulation framework for proactive Kubernetes autoscaling with adaptive cold-start estimation and MPC-based scaling. Source code and experiment configurations available at: https://github.com/Himanshu21035/autoscaling_research

详情
AI中文摘要

容器化工作负载的前瞻性自动扩展依赖于了解资源配置延迟,即从扩展决策到新容量准备就绪的时间。本文提出ADAPT(Adaptive Duration Approximation for Predictive Timing),一种在线EWMA估计器,实时跟踪冷启动时间。ADAPT将动态规划 horizon FH-OPT 输入模型预测控制器(MPC),在滑动窗口内优化副本数量。这些组件共同形成一个闭环前瞻性自动扩展设计,根据测量的资源配置延迟调整前瞻范围。在三种策略(MPC+LSTM、MPC+Prophet、HPA)和六个工作负载 archetype 上评估,MPC+LSTM 在所有工作负载上均实现低于5%的SLA违规率,相比之下,反应式HPA为7-19%,MPC+Prophet在双模交通情况下达到最高28.7%。

英文摘要

Proactive autoscaling for containerized workloads depends on knowing the provisioning delay, i.e., the time between a scaling decision and the moment new capacity is ready to serve traffic. In practice, this cold-start duration can vary substantially across environments and even across consecutive scale-out events. We present ADAPT (Adaptive Duration Approximation for Predictive Timing), an online EWMA estimator that tracks coldstart duration at runtime. ADAPT feeds a dynamic planning horizon, FH-OPT, into a Model Predictive Controller (MPC) that optimizes replica counts over a rolling window. Together, these components form a closed-loop proactive autoscaling design that adapts its lookahead based on measured provisioning delay. Evaluated across three policies (MPC+LSTM, MPC+Prophet, HPA) and six workload archetypes with five random seeds, MPC+LSTM achieves below 5% SLA violation on all workloads, compared with 7-19% for reactive HPA and up to 28.7% for MPC+Prophet on bimodal traffic.

2605.15108 2026-05-18 stat.ML cs.AI cs.IR cs.LG stat.ME

Logging Policy Design for Off-Policy Evaluation

为离线策略评估设计日志策略

Connor Douglas, Joel Persson, Foster Provost

AI总结 本文研究如何设计日志策略以最小化OPE误差,探讨了奖励与覆盖之间的根本权衡,并在不同信息场景下提出了最优策略。

详情
AI中文摘要

离线策略评估(OPE)利用不同日志策略收集的数据来估计目标策略(如推荐系统)的价值。它使高风险实验无需实时部署,但实际准确性严重依赖于用于计算估计值的数据收集日志策略。我们研究如何设计日志策略以最小化OPE误差。我们刻画了一个根本的奖励-覆盖权衡:将概率质量集中在高奖励动作上会减少方差,但可能错过目标策略可能采取的动作的信号。我们提出了一种统一的日志策略设计框架,并在目标策略和奖励分布已知、未知或部分通过先验或噪声估计可知的信息场景中推导出最优策略。我们的结果为公司选择多个候选推荐系统提供了可行指导。我们展示了在收集OPE数据时治疗选择的重要性,并在该目标是公司主要目标时描述了理论上最优的方法。我们还提炼了在操作约束防止实施理论最优的情况下选择日志策略的实用设计原则。

英文摘要

Off-policy evaluation (OPE) estimates the value of a target treatment policy (e.g., a recommender system) using data collected by a different logging policy. It enables high-stakes experimentation without live deployment, yet in practice accuracy depends heavily on the logging policy used to collect data for computing the estimate. We study how to design logging policies that minimize OPE error for given target policies. We characterize a fundamental reward-coverage tradeoff: concentrating probability mass on high-reward actions reduces variance but risks missing signal on actions the target policy may take. We propose a unifying framework for logging policy design and derive optimal policies in canonical informational regimes where the target policy and reward distribution are (i) known, (ii) unknown, and (iii) partially known through priors or noisy estimates at logging time. Our results provide actionable guidance for firms choosing among multiple candidate recommendation systems. We demonstrate the importance of treatment selection when gathering data for OPE, and describe theoretically optimal approaches when this is a firm's primary objective. We also distill practical design principles for selecting logging policies when operational constraints prevent implementing the theoretical optimum.

2605.14260 2026-05-18 stat.ML cs.LG

On the Burden of Achieving Fairness in Conformal Prediction

在符合预测中实现公平性的负担

Ziang Gao, Pengqi Liu, Archer Yi Yang, Mouloud Belbahri, Jesse C. Cresswell, Masoud Asgharian

AI总结 研究揭示了单一阈值校准在符合预测中隐藏的跨组异质性,证明了公平性定义之间的根本矛盾,并量化了不同校准策略的成本。

详情
AI中文摘要

符合预测通常使用单一池化阈值进行校准,但这种方法可能隐藏分数分布中的跨组异质性并扭曲各组的覆盖范围。我们通过分割符合校准下的总体分数分布研究了这一现象。首先,我们推导出一个守恒定律和下限,表明池化校准在跨组分位数异质性的尺度上不可避免地导致各组覆盖范围的扭曲。其次,我们证明了符合预测中两种主要公平性定义,即等覆盖和等集合大小,本质上存在根本矛盾。第三,我们量化了在不同策略之间转换的成本,这些策略分别处理各组或池化各组。在合成和真实数据上的实验验证了有限样本校准后的相同双向权衡。我们的结果表明,对于所研究的校准家族,校准选择不会消除跨组异质性;它决定了由此产生的扭曲出现在覆盖或大小维度中,为实际公平导向的校准选择提供了原理性的分析视角。

英文摘要

Conformal prediction is often calibrated with a single pooled threshold, but this can hide cross-group heterogeneity in score distributions and distort group-wise coverage. We study this phenomenon through the population score distributions underlying split conformal calibration. First, we derive a conservation law and lower bound showing that pooled calibration incurs irreducible group-wise coverage distortion at a scale set by cross-group quantile heterogeneity. Second, we demonstrate that the two leading fairness definitions for conformal prediction, Equalized Coverage and Equalized Set Size, are fundamentally in tension. Third, we quantify the cost of moving between policies which treat groups separately or pool them. Experiments on synthetic and real data confirm the same bidirectional trade-off after finite-sample calibration. Our results show that, for the policy families studied here, calibration choice does not remove cross-group heterogeneity; it determines whether the resulting distortion appears in the coverage or size dimension, providing a principled lens for analyzing fairness-oriented calibration choices in practice.

2605.12581 2026-05-18 cs.LO cs.AI cs.FL math.OC

Ensuring Logic in the Fog: Sound POMDP Synthesis with LTL Objectives

确保雾中的逻辑:带有LTL目标的可靠POMDP综合

Can Zhou, Yulong Gao, Pian Yu

AI总结 本文提出一种新的可靠奖励塑造机制,用于在部分可观测马尔可夫决策过程中实现LTL目标的合成,通过增强的蒙特卡洛规划框架提升在部分可观测环境中的导航能力。

Comments Accepted by IJCAI-ECAI 2026, the 35th International Joint Conference on Artificial Intelligence

详情
AI中文摘要

合成能够导航不确定环境并遵守复杂时间约束的自主代理仍然是基本挑战。虽然线性时序逻辑(LTL)提供了一种严格指定此类任务的语言,但部分可观测马尔可夫决策过程(POMDP)中验证LTL满足的固有不可判定性使得定量合成困难,尤其是在为近似求解器设计可靠奖励信号时。本文通过一种新颖且可靠的奖励塑造机制填补了这一空白,该机制动态生成基于信念的奖励,这些奖励基于已认证的LTL满足。通过将此机制整合到增强的蒙特卡洛规划框架中,我们使代理能够通过专注于最大化可验证成功的搜索过程来导航部分可观测性中的'雾'。实验表明,该方法不仅在现有求解器失败的场景中表现出色,而且在多样化的基准领域中保持了有效性和可扩展性。

英文摘要

Synthesising autonomous agents that can navigate uncertain environments while adhering to complex temporal constraints remains a fundamental challenge. While Linear Temporal Logic (LTL) provides a rigorous language for specifying such tasks, the inherent undecidability of qualitatively verifying LTL satisfaction in partially observable Markov decision processes renders quantitative synthesis difficult, especially when designing reliable reward signals for approximate solvers. In this paper, we bridge this gap with a novel, sound reward-shaping mechanism that dynamically generates belief-dependent rewards grounded in certified LTL satisfaction. By integrating this mechanism into an enhanced Monte Carlo Planning framework, we empower agents to navigate the `fog' of partial observability with a search process focused on maximising verifiable success. Our experiments demonstrate that this approach not only thrives in scenarios where existing solvers fail but also maintains effectiveness and scalability across diverse benchmark domains.

2605.12509 2026-05-18 cs.SI cs.AI cs.CE math.CO

Representing Higher-Order Networks: A Survey of Graph-Based Frameworks

表示高阶网络:基于图的框架综述

Takaaki Fujita, Florentin Smarandache

AI总结 本文综述了用于表示高阶网络的图基框架,探讨了多方式、分层、时间、多层、递归和张量交互等方法,旨在提供统一视角以比较不同模型并识别合适工具。

Comments 170 pages. Peer-Reviewed Book. Publisher: Neutrosophic Science International Association (NSIA) Publishing House. ISBN: 978-1-59973-881-9

详情
AI中文摘要

许多现实世界现象自然地通过图和网络建模。然而,经典图模型通常局限于成对交互,可能无法充分捕捉实践中更丰富的结构。高阶图形式化通过引入多方式、分层、时间、多层、递归和张量基的交互,从而提供更丰富的复杂系统表示。本书全面概述了可用于建模高阶网络的数学概念,回顾了基础概念、扩展框架和新引入的正式化,强调其结构原理、关系和建模作用。目的是提供一种统一的视角,帮助读者比较不同的高阶网络模型,并识别适用于理论研究和实际应用的合适工具。本书是第2.0版,主要包含新增概念以及对错别字和解释的修正和改进。

英文摘要

Many real-world phenomena are naturally modeled by graphs and networks. However, classical graph models are often limited to pairwise interactions and may not adequately capture the richer structures that arise in practice. Higher-order graph formalisms extend this framework by incorporating multiway, hierarchical, temporal, multilayer, recursive, and tensor-based interactions, thereby providing more expressive representations of complex systems. This book presents a comprehensive overview of mathematical notions that can be used to model higher-order networks. It surveys foundational concepts, extensional frameworks, and newly introduced formalisms, with an emphasis on their structural principles, relationships, and modeling roles. The aim is to provide a unified perspective that helps readers compare diverse higher-order network models and identify appropriate tools for theoretical study and practical applications. This book is Edition 2.0. It mainly includes the addition of several concepts, as well as corrections and improvements of typographical errors and explanations.

2605.10867 2026-05-18 cs.CR cs.AI cs.CV cs.LG cs.NI

BEACON: A Multimodal Dataset for Learning Behavioral Fingerprints from Gameplay Data

BEACON:一个用于从游戏数据中学习行为指纹的多模态数据集

Ishpuneet Singh, Gursmeep Kaur, Uday Pratap Singh Atwal, Guramrit Singh, Gurjot Singh, Maninder Singh

AI总结 BEACON数据集通过高精度运动技能和认知负荷,为行为生物特征的鲁棒性测试提供严格压力测试,支持连续认证、行为建模和多模态学习。

详情
AI中文摘要

在高风险数字环境中,连续认证需要具有细粒度行为信号的高质量数据集,但现有基准往往受限于规模小、单模态传感或缺乏同步环境上下文。为此,本文引入BEACON(行为认证与连续监控行为引擎),一个大规模多模态数据集,捕捉竞技Valorant游戏中的多样化技能层级。BEACON包含约430GB同步多模态数据(461GB总存储量,包括辅助Valorant配置捕获),来自79个会话的28名不同玩家,估计102.51小时的活跃游戏时间,包括高频鼠标动态、按键事件、网络数据包捕获、屏幕录制、硬件元数据和游戏内配置上下文。BEACON利用战术射击游戏固有的高精度运动技能和高认知负荷,使其成为评估行为生物特征鲁棒性的严格压力测试。该数据集允许在高保真的电子竞技环境中研究连续认证、行为建模、用户漂移和多模态表示学习。作者在Hugging Face和GitHub上发布数据集和代码,以创建可重复的基准,用于评估下一代行为指纹和安全模型。

英文摘要

Continuous authentication in high-stakes digital environments requires datasets with fine-grained behavioral signals under realistic cognitive and motor demands. But current benchmarks are often limited by small scale, unimodal sensing or lack of synchronised environmental context. To address this gap, this paper introduces BEACON (Behavioral Engine for Authentication & Continuous Monitoring), a large-scale multimodal dataset that captures diverse skill tiers in competitive Valorant gameplay. BEACON contains approximately 430 GB of synchronised modality data (461 GB total on-disk including auxiliary Valorant configuration captures) from 79 sessions across 28 distinct players, estimated at 102.51 hours of active gameplay, including high-frequency mouse dynamics, keystroke events, network packet captures, screen recordings, hardware metadata, and in-game configuration context. BEACON leverages the high precision motor skills and high cognitive load that are inherent to tactical shooters, making it a rigorous stress test for the robustness of behavioral biometrics. The dataset allows for the study of continuous authentication, behavioral profiling, user drift and multimodal representation learning in a high-fidelity esports setting. The authors release the dataset and code on Hugging Face and GitHub to create a reproducible benchmark for evaluating next-generation behavioral fingerprinting and security models.

2604.26578 2026-05-18 cs.SE cs.AI

Graph Construction and Matching for Imperative Programs using Neural and Structural Methods

基于神经方法和结构方法的命令式程序图构建与匹配

Arshad Beg, Diarmuid O'Donoghue, Rosemary Monahan

AI总结 本文提出通过神经和结构方法构建命令式程序图,实现跨语言和注释风格的图表示一致性,为语义丰富和近似图匹配提供基础。

Comments 20 Pages. Technical Report. Maynooth University, Ireland. Submitted on 29 April 2026

详情
AI中文摘要

重用验证制品需要识别程序及其规范的结构和语义相似性。本文聚焦图构建作为实现这一目标的基础步骤。我们提出一个管道,将命令式程序及其注释转换为带类型和属性的图。实验涵盖包含C与ACSL、Java与JML以及Dafny for C#的数据集。该管道整合了抽象语法树解析与从SentenceTransformer和CodeBERT等模型中获得的语义嵌入。这使生成的图表示能够捕捉结构关系和语义上下文。我们的结果表明,可以在不同语言和注释风格下构建一致的图表示。本文为未来语义丰富和近似图匹配的可扩展验证制品重用提供了实用基础。

英文摘要

Reusing verification artefacts requires identifying structural and semantic similarities across programs and their specifications. In this paper, we focus on graph construction as a foundational step toward this goal. We present a pipeline that converts imperative programs and their annotations into typed, attributed graphs. Our experiments cover datasets including C with ACSL, Java with JML, and Dafny for C\#. The pipeline integrates abstract syntax tree parsing with semantic embeddings derived from models such as SentenceTransformer and CodeBERT. This enables the generation of graph representations that capture both structural relationships and semantic context. Our results show that consistent graph representations can be constructed across different languages and annotation styles. This work provides a practical basis for future steps in semantic enrichment and approximate graph matching for scalable verification artefact reuse.

2604.09631 2026-05-18 cs.DC cs.AI

Hardware Utilization and Inference Performance of Edge Object Detection Under Fault Injection

边缘目标检测在故障注入下的硬件利用与推断性能

Faezeh Pasandideh, Mehdi Azarafza, Achim Rettberg

AI总结 研究通过故障注入测试评估了TensorRT优化的YOLO模型在边缘平台上的硬件行为,发现其在资源降级下保持稳定性能,为边缘推断可靠性提供硬件层面的视角。

详情
AI中文摘要

随着深度学习模型部署在资源受限的边缘平台,了解硬件在资源降级下的行为变得至关重要。本文系统地表征了在大规模故障注入测试下,TensorRT优化的YOLOv10s、YOLOv11s和YOLO2026n管道在NVIDIA Jetson Nano上的CPU负载、GPU利用率、RAM消耗、功耗、吞吐量和热行为。故障通过解耦框架合成,利用大型语言模型和潜在扩散模型。结果表明,两种任务和两种模型的推断引擎在资源降级下保持GPU占用稳定,温度上升受控,功耗在安全范围内,内存使用在初始暖机阶段后趋于一致释放模式。目标检测在内存和热行为上略有波动,但两者均得出结论:TensorRT管道在输入数据严重降级时仍表现良好。这些发现提供了模型可靠性的硬件层面视角,与边缘推断性能研究形成补充。

英文摘要

As deep learning models are deployed on resource constrained edge platforms in autonomous driving systems, reli able knowledge of hardware behavior under resource degradation becomes an essential requirement. Therefore, we introduce a systematic characterization of CPU load, GPU utilization, RAM consumption, power draw, throughput, and thermal behaviour of TensorRT-optimized YOLOv10s, YOLOv11s and YOLO2026n pipelines running on NVIDIA Jetson Nano under a large-scale fault injection campaign targeting both lane-following and ob ject detection tasks. Faults are synthesized using a decoupled framework that leverages large language models (LLMs) and latent diffusion models (LDMs), based on original data from our JetBot platform data collection. Results show that across both tasks and both models the inference engines keep GPU occupancy stable, temperature rise under control, and power consumption within safe limits, while memory usage settles into a consistent release pattern after the initial warm-up phase. Object detection tends to show somewhat more variability in memory and thermal behavior, yet both tasks point to the same conclusion: the TensorRT pipelines hold up well even when the input data is heavily degraded. These findings offer a hardware-level view of model reliability that sits alongside, rather than against, the broader body of work focused on inference performance at the edge.

2603.29617 2026-05-18 q-bio.NC cs.AI cs.CL

Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems

人类和人工神经系统的语言构造收敛表征

Pegah Ramezani, Thomas Kinfe, Andreas Maier, Achim Schilling, Patrick Krauss

AI总结 研究通过EEG验证人类神经活动对语言构造的表征,发现句末alpha波段出现构造特异性神经签名,与人工语言模型的构造表征模式相似,支持语言构造作为形式-意义映射的神经编码。

详情
AI中文摘要

理解大脑如何处理语言构造是认知神经科学和语言学的核心挑战。最近的计算研究表明,人工神经语言模型会自发发展出对论元结构构造(ASCs)的差异化表征,生成关于构造层面信息在处理过程中何时何地出现的预测。本研究通过脑电图(EEG)在人类神经活动中测试这些预测。十名母语英语者在听200个合成生成的句子时,这些句子涵盖四种构造类型(单及物、双及物、因果运动、结果性)。利用时频方法、特征提取和机器学习分类分析,发现构造特异性神经签名主要出现在句末位置,即论元结构完全歧义化的位置,并且最显著地出现在alpha波段。成对分类显示可靠区分,尤其是双及物和结果性构造之间,而其他对则有重叠。关键的是,这些效应的出现时间和相似性结构与基于循环和变压器的语言模型中的构造表征模式相似,其中构造性表征在整合处理阶段出现。这些发现支持语言构造作为神经编码的独立形式-意义映射的观点,与构造语法一致,并表明生物和人工系统在相似的表征解决方案上趋于一致。更广泛地说,这种趋同与学习系统在基础表征景观中发现稳定区域(最近称为柏拉图表征空间)的想法一致,该景观约束了高效语言抽象的出现。

英文摘要

Understanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscience and linguistics. Recent computational studies show that artificial neural language models spontaneously develop differentiated representations of Argument Structure Constructions (ASCs), generating predictions about when and how construction-level information emerges during processing. The present study tests these predictions in human neural activity using electroencephalography (EEG). Ten native English speakers listened to 200 synthetically generated sentences across four construction types (transitive, ditransitive, caused-motion, resultative) while neural responses were recorded. Analyses using time-frequency methods, feature extraction, and machine learning classification revealed construction-specific neural signatures emerging primarily at sentence-final positions, where argument structure becomes fully disambiguated, and most prominently in the alpha band. Pairwise classification showed reliable differentiation, especially between ditransitive and resultative constructions, while other pairs overlapped. Crucially, the temporal emergence and similarity structure of these effects mirror patterns in recurrent and transformer-based language models, where constructional representations arise during integrative processing stages. These findings support the view that linguistic constructions are neurally encoded as distinct form-meaning mappings, in line with Construction Grammar, and suggest convergence between biological and artificial systems on similar representational solutions. More broadly, this convergence is consistent with the idea that learning systems discover stable regions within an underlying representational landscape - recently termed a Platonic representational space - that constrains the emergence of efficient linguistic abstractions.

2603.25099 2026-05-18 cs.CE cs.AI

Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

大语言模型作为优化控制器:SIMP拓扑优化的自适应延续

Shaoliang Yang, Jun Wang, Yunsheng Wang

AI总结 本文提出利用大语言模型作为SIMP拓扑优化的在线自适应控制器,通过实时状态条件参数决策替代传统固定调度延续方法,提升优化效果。

Comments 32 pages, 11 figures

详情
AI中文摘要

我们提出一个框架,其中大语言模型(LLM)作为SIMP拓扑优化的在线自适应控制器,取代传统固定调度延续方法。在每次第k次迭代中,LLM接收结构化观察(当前合规性、灰度指数、停滞计数器、棋盘度量、体积分数和预算消耗),并通过直接数字控制接口输出惩罚指数p、投影锐度β、滤波半径r_min和移动限制δ的数值。硬灰度门防止过早二元化,元优化循环使用第二个LLM迭代来调整代理的调用频率和门阈值。我们对四个基线(固定、标准三场延续、专家启发法、仅调度消融)在三个二维问题(悬臂、MBB梁、L型支架)和两个三维问题(悬臂、MBB梁)上进行基准测试,所有问题均运行300次迭代。标准化的40次锐化尾部从最佳有效快照应用,使得合规性差异仅反映探索阶段。LLM代理在每个基准测试中均达到最低最终合规性:相对于固定基线,-5.7%至-18.1%,所有解决方案均为完全二进制。仅调度消融在三个问题中的两个上表现低于固定基线,确认LLM的实时干预(而非调度几何)驱动了增益。代码和再生产脚本将在发表时发布。

英文摘要

We present a framework in which a large language model (LLM) acts as an online adaptive controller for SIMP topology optimization, replacing conventional fixed-schedule continuation with real-time, state-conditioned parameter decisions. At every $k$-th iteration, the LLM receives a structured observation$-$current compliance, grayness index, stagnation counter, checkerboard measure, volume fraction, and budget consumption$-$and outputs numerical values for the penalization exponent $p$, projection sharpness $β$, filter radius $r_{\min}$, and move limit $δ$ via a Direct Numeric Control interface. A hard grayness gate prevents premature binarization, and a meta-optimization loop uses a second LLM pass to tune the agent's call frequency and gate threshold across runs. We benchmark the agent against four baselines$-$fixed (no-continuation), standard three-field continuation, an expert heuristic, and a schedule-only ablation$-$on three 2-D problems (cantilever, MBB beam, L-bracket) at $120\!\times\!60$ resolution and two 3-D problems (cantilever, MBB beam) at $40\!\times\!20\!\times\!10$ resolution, all run for 300 iterations. A standardized 40-iteration sharpening tail is applied from the best valid snapshot so that compliance differences reflect only the exploration phase. The LLM agent achieves the lowest final compliance on every benchmark: $-5.7\%$ to $-18.1\%$ relative to the fixed baseline, with all solutions fully binary. The schedule-only ablation underperforms the fixed baseline on two of three problems, confirming that the LLM's real-time intervention$-$not the schedule geometry$-$drives the gain. Code and reproduction scripts will be released upon publication.

2511.19931 2026-05-18 cs.IR cs.AI

LLM-EDT: Large Language Model Enhanced Cross-domain Sequential Recommendation with Dual-phase Training

LLM-EDT: 基于大语言模型的跨领域序列推荐增强方法与双阶段训练

Ziwei Liu, Qidong Liu, Wanyu Wang, Yejing Wang, Pengyue Jia, Tong Xu, Wei Huang, Chong Chen, Xiangyu Zhao

AI总结 本文提出LLM-EDT,通过双阶段训练策略解决跨领域序列推荐中的领域不平衡和过渡问题,引入可转移物品增强器和领域感知配置模块,提升推荐效果。

详情
AI中文摘要

跨领域序列推荐(CDSR)旨在通过整合多领域信息丰富用户-物品交互。尽管已有进展,领域不平衡和过渡问题阻碍了进一步发展。前者导致某一领域交互主导整体行为,难以捕捉其他领域特征;后者导致混合交互序列中难以捕捉用户跨领域偏好,影响特定领域下一项预测性能。大语言模型(LLMs)通过生成和编码能力部分缓解这些问题,但现有LLM增强的CDSR方法仍需改进。为此,我们提出LLM-EDT,通过可转移物品增强器减少无关噪声,双阶段训练策略增强领域特定线程的领域共享背景,以及领域感知配置模块总结用户偏好并自适应聚合生成综合用户画像。实验验证了LLM-EDT的有效性。

英文摘要

Cross-domain Sequential Recommendation (CDSR) has been proposed to enrich user-item interactions by incorporating information from various domains. Despite current progress, the imbalance issue and transition issue hinder further development of CDSR. The former one presents a phenomenon that the interactions in one domain dominate the entire behavior, leading to difficulty in capturing the domain-specific features in the other domain. The latter points to the difficulty in capturing users' cross-domain preferences within the mixed interaction sequence, resulting in poor next-item prediction performance for specific domains. With world knowledge and powerful reasoning ability, Large Language Models (LLMs) partially alleviate the above issues by performing as a generator and an encoder. However, current LLMs-enhanced CDSR methods are still under exploration, which fail to recognize the irrelevant noise and rough profiling problems. Thus, to make peace with the aforementioned challenges, we proposed an LLMs Enhanced Cross-domain Sequential Recommendation with Dual-phase Training ({LLM-EDT}). To address the imbalance issue while introducing less irrelevant noise, we first propose the transferable item augmenter to adaptively generate possible cross-domain behaviors for users. Then, to alleviate the transition issue, we introduce a dual-phase training strategy to empower the domain-specific thread with a domain-shared background. As for the rough profiling problem, we devise a domain-aware profiling module to summarize the user's preference in each domain and adaptively aggregate them to generate comprehensive user profiles. The experiments on three public datasets validate the effectiveness of our proposed LLM-EDT. To ease reproducibility, we have released the detailed code online at {https://anonymous.4open.science/r/LLM-EDT-583F}.

2511.15623 2026-05-18 cs.DB cs.AI cs.LO

Sufficient Explanations in Databases and their Connections to Database Repairs

数据库中的充分解释及其与数据库修复的关系

Leopoldo Bertossi, Nina Pardal

AI总结 研究数据库中充分解释的概念及其与数据库修复的联系,提出基于答案集程序计算充分解释和度量的方法。

详情
AI中文摘要

我们研究了充分解释的概念,以及用于查询回答的数据库元组的充分性度数作为归因分数。我们还探讨了充分解释与用于处理不一致数据库的数据库修复之间的联系,并与基于因果的必要解释相结合,获得新的计算结果。我们展示了如何使用答案集程序来指定充分解释并计算充分性度数。

英文摘要

We investigate the notion of sufficient explanation, and a sufficiency-degree as attribution score for database tuples in relation to query answering. We also investigate and exploit connections with database repairs as used for dealing with inconsistent databases; and with causality-based necessary explanations, obtaining new computational results. We show how to use answer-set programs to specify sufficient explanations and compute sufficiency-degrees.

2511.14482 2026-05-18 cs.DB cs.LG

Gradient-Based Join Ordering

基于梯度的连接顺序

Tim Schwabe, Maribel Acosta

AI总结 本文提出基于梯度的连接顺序方法,通过连续松弛和可微约束,在放松空间中寻找低成本计划,优于传统离散搜索方法。

详情
AI中文摘要

连接顺序是NP难问题,涉及选择数据库查询中最有效的连接顺序。传统方法将问题视为二叉树的离散组合搜索,但存在效果与效率的权衡。本文展示当成本模型可微时,查询计划可连续松弛为软邻接矩阵,结合可微约束确保计划有效性,从而在放松空间中通过梯度搜索低成本计划。使用图神经网络作为成本模型,证明该方法在两个不同图数据集上能获得与传统离散搜索相当甚至更低的成本,并且运行时间优于离散搜索算法。

英文摘要

Join ordering is the NP-hard problem of selecting the most efficient order in which to evaluate joins (conjunctive, binary operators) in a database query. Because query execution performance critically depends on this choice, join ordering lies at the core of query optimization. Traditional approaches cast this problem as a discrete combinatorial search over binary trees guided by a cost model, but they have trade-offs between effectiveness and efficiency. We show that when the cost model is differentiable, query plans can be continuously relaxed into a soft adjacency matrix that represents a superposition of plans. This continuous relaxation, combined with differentiable constraints that enforce plan validity, enables a gradient-based search for low-cost plans within this relaxed space. Using a Graph Neural Network as the cost model, we demonstrate that this gradient-based approach can find comparable and even lower-cost plans compared to traditional discrete search methods on two different graph datasets. Furthermore, we empirically show that the runtime of this approach scales better than discrete search algorithms. We believe this first step towards gradient-based join ordering can lead to more effective and efficient query optimizers in the future.

2510.06194 2026-05-18 hep-ex astro-ph.IM cs.CV

Overlap-aware segmentation for topological reconstruction of obscured objects

关注重叠的分割以重建被遮挡物体的拓扑结构

J. Schueler, H. M. Araújo, S. N. Balashov, J. E. Borg, C. Brew, F. M. Brunbauer, C. Cazzaniga, A. Cottle, D. Edgeman, C. D. Frost, F. Garcia, D. Hunt, M. Kastriotou, P. Knights, H. Kraus, A. Lindote, M. Lisowska, D. Loomba, E. Lopez Asamar, P. A. Majewski, T. Marley, C. McCabe, L. Millins, R. Nandakumar, T. Neep, F. Neves, K. Nikolopoulos, E. Oliveri, A. Roy, T. J. Sumner, E. Tilly, W. Thompson, M. A. Vogiatzi

AI总结 本文提出OASIS框架,通过加权损失函数优先处理重叠区域,提升被遮挡物体的像素强度和拓扑特征重建。在MIGDAL实验中,OASIS显著改善了低能电子轨迹的重建效果。

详情
AI中文摘要

重叠物体的分离在科学成像中是一个重大挑战。尽管深度学习分割-回归算法能预测像素强度,但通常平等对待所有区域,而非优先处理重叠区域。最近的实例分割进展表明,训练中加权重叠像素区域可改善重叠区域的分割边界预测,但此方法尚未扩展到分割回归。本文提出OASIS:一种新的分割-回归框架,其加权损失函数旨在训练期间优先处理物体重叠区域,从而从严重遮挡的物体中提取像素强度和拓扑特征。在MIGDAL实验中,OASIS被用于直接成像Migdal效应——一种罕见过程,其中电子发射由核散射诱导——在低气压光学时间投影室中。此设置是一个极端测试案例,因为重建目标是微弱的电子 recoil 轨迹,通常被数量级更亮的核 recoil 轨迹严重遮挡。与无权分割回归相比,我们证明OASIS的新型重叠区域目标损失函数权重是提高低能电子轨迹强度和拓扑重建的最重要训练权重。在八次训练活动中平均,我们进一步显示添加重叠目标权重可将这些低能电子的中位强度重建误差从-41.1%提高到-13.3%。这些性能提升证明OASIS是一种通用的方法,可用于恢复重叠主导区域的被遮挡信号。

英文摘要

The separation of overlapping objects presents a significant challenge in scientific imaging. While deep learning segmentation-regression algorithms can predict pixel-wise intensities, they typically treat all regions equally rather than prioritizing overlap regions where attribution is most ambiguous. Recent advances in instance segmentation show that weighting regions of pixel overlap in training can improve segmentation boundary predictions in regions of overlap, but this idea has not yet been extended to segmentation regression. We address this with Overlap-Aware Segmentation of ImageS (OASIS): a new segmentation-regression framework with a weighted loss function designed to prioritize regions of object-overlap during training, enabling extraction of pixel intensities and topological features from heavily obscured objects. We demonstrate OASIS in the context of the MIGDAL experiment, which aims to directly image the Migdal effect--a rare process where electron emission is induced by nuclear scattering--in a low-pressure optical time projection chamber. This setting poses an extreme test case, as the target for reconstruction is a faint electron recoil track which is often heavily-buried within the order(s)-of-magnitude brighter nuclear recoil track. Compared to unweighted segmentation regression, we demonstrate OASIS's novel overlap region-targeted loss function weight to be the single most important training weight for improving intensity and topological reconstructions of the low-energy electron tracks that tend to be most dominated by pixel overlap. Averaging over eight training campaigns, we further show the addition of overlap-targeted weights to improve median intensity reconstruction errors from -41.1% to -13.3% for these low-energy electrons. These performance gains demonstrate OASIS as a generalizable methodology for recovering obscured signals in overlap-dominated regions.

2510.01632 2026-05-18 q-bio.BM cs.AI

BioBlobs: Unsupervised Discovery of Functional Substructures for Protein Function Prediction

BioBlobs:无监督发现蛋白质功能预测的的功能子结构

Xin Wang, Kaiwen Shi, Carlos Oliver

AI总结 BioBlobs通过无监督方法发现蛋白质的功能子结构,利用端到端可微分框架压缩蛋白质为少量连贯子结构并预测功能,实现了对功能区域的候选识别。

详情
AI中文摘要

蛋白质功能由如催化三元组、结合口袋和结构模体等紧密子结构驱动,这些子结构仅占据蛋白质残基的小部分。然而,现有基于蛋白质编码器的流程并未在子结构层面建模,未能回答核心生物学问题:蛋白质的哪一部分负责其功能?我们引入了BioBlobs,一种编码器无关、端到端可微分的框架,能够将蛋白质压缩为少量连贯的子结构(blobs),并仅基于这些blobs预测功能,使得每个blob对应一个候选功能区域。在多样化的蛋白质功能预测任务和多种基于序列和结构的编码器上,BioBlobs在仅使用少量残基的情况下,匹配或超过了强大的基线模型。发现的blobs会根据任务调整其空间尺度,从局部催化位点到整个结构域。仅在蛋白质层面标签上训练,BioBlobs能够恢复M-CSA数据库中实验注释的催化位点,证明了无监督的功能子结构发现,并为未注释的整个蛋白质组的规模化功能位点发现开辟了道路。

英文摘要

Protein function is driven by cohesive substructures, such as catalytic triads, binding pockets, and structural motifs, that occupy only a small fraction of a protein's residues. Yet existing pipelines built on protein encoders do not model proteins at the substructure level, leaving the central biological question unanswered: which substructure of a protein is responsible for its function? We introduce BioBlobs, an encoder-agnostic, end-to-end differentiable framework that compresses a protein into a small set of cohesive substructures (blobs) and predicts function from these blobs alone, so that each blob corresponds to a candidate functional region. Across diverse protein function prediction tasks and multiple sequence- and structure-based encoders, BioBlobs matches or exceeds strong baselines while operating on only a small fraction of residues. The discovered blobs adapt their spatial scale to the task, ranging from local catalytic sites to entire structural domains. Trained only on protein-level labels, BioBlobs recovers experimentally annotated catalytic sites in the M-CSA database, demonstrating unsupervised functional substructure discovery and opening a path to large-scale functional site discovery across the unannotated proteome.

2509.07404 2026-05-18 math.OC cs.LG

Reinforcement learning for adaptive interior point methods in convex quadratic programming

强化学习用于凸二次规划中自适应内点方法

Jeremy Bertoncini, Alberto De Marchi, Matthias Gerdts, Simon Gottschalk

AI总结 本文提出利用强化学习优化内点法求解凸二次规划问题,通过调整双循环流程和控制参数提升求解效率,实验表明轻量训练后策略能有效泛化至不同问题类。

Comments 20 pages, 9 figures, 4 tables

详情
AI中文摘要

二次规划是现代非线性优化、控制和数据科学中的重要工具。尽管正则化方法在最少量假设下提供收敛保证,但它们通常表现出一阶方案典型的慢尾收敛特性,需要许多迭代才能获得高精度解。此外,超参数调优显著影响求解器性能,但如何找到合适的参数配置仍是一个悬而未决的研究问题。为解决这些问题,我们探索数据驱动方法如何加速求解过程。针对高精度解,我们专注于正则化内点求解器,并仔细处理其双循环流程和控制参数。我们将展示强化学习如何在促进求解器调优和加速优化过程方面做出重要贡献。数值实验表明,在轻量训练后,学习到的策略能有效泛化至不同问题类,具有不同维度。

英文摘要

Quadratic programming is a workhorse of modern nonlinear optimization, control, and data science. Although regularized methods offer convergence guarantees under minimal assumptions on the problem data, they can exhibit the slow tail-convergence typical of first-order schemes, thus requiring many iterations to achieve high-accuracy solutions. Moreover, hyperparameter tuning significantly impacts the solver performance but how to find an appropriate parameter configuration remains an elusive research question. To address these issues, we explore how data-driven approaches can accelerate the solution process. Aiming at high-accuracy solutions, we focus on a regularized interior-point solver and carefully handle its two-loop flow and control parameters. We will show that reinforcement learning can make a significant contribution to facilitating the solver tuning and to speeding up the optimization process. Numerical experiments demonstrate that, after a lightweight training, the learned policy generalizes well to different problem classes with varying dimensions.

2508.08431 2026-05-18 eess.IV cs.CV eess.SP

Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

基于几何建模的预处理算法用于超光谱图像的尺度校正以提升解混性能

Praveen Sumanasekara, Athulya Ratnayake, Buddhi Wijenayake, Keshawa Ratnayake, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath

AI总结 本文提出一种预处理算法,通过校正像素签名的尺度变化,提升超光谱解混性能,实验验证其在多种解混方法上的有效性,实现约50%的误差降低。

Comments 20 pages, 14 figures

详情
AI中文摘要

光谱变化显著影响超光谱解混算法的准确性和收敛性。许多方法处理复杂光谱变化,但因地形、光照和阴影导致的像素签名大规模畸变仍是主要挑战。这些变化通常会降低解混性能并使模型拟合复杂化。因此,校正这些变化可为实际GIS应用提供显著优势。本文提出了一种新的预处理算法,在解混前校正由尺度引起的光谱变化。通过估计并校正像素签名的尺度畸变,该算法生成具有最小尺度畸变的像素签名。由于这些尺度畸变(阻碍许多解混方法性能)在所提出方法的输出中被大大减少,解混算法的丰度估计显著提高。我们提供了一个严谨的数学框架来描述和校正尺度变化,并对所提算法进行了广泛的实验验证。此外,该算法的影响在多种最先进的解混方法上评估了两个合成和两个真实超光谱数据集。所提出的预处理步骤在这些方法上一致提高了性能,即使对于专门处理光谱变化的算法,也实现了约50%的误差降低。这表明尺度校正作为一种补充步骤,有助于更准确的解混,利用现有方法。该算法的通用性、一致影响和显著影响突显了其在实际超光谱解混管道中的潜力。实现代码将在发表时公开。

英文摘要

Spectral variability significantly impacts the accuracy and convergence of hyperspectral unmixing algorithms. Many methods address complex spectral variability; yet large-scale distortions to the scale of the observed pixel signatures due to topography, illumination, and shadowing remain a major challenge. These variations often degrade unmixing performance and complicate model fitting. Because of this, correcting these variations can offer significant advantages in real-world GIS applications. In this paper, we propose a novel preprocessing algorithm that corrects scale-induced spectral variability prior to unmixing. By estimating and correcting these distortions to the scale of the pixel signatures, the algorithm produces pixel signatures with minimal distortions in scale. Since these distortions in scale (which hinder the performance of many unmixing methods) are greatly minimized in the output provided by the proposed method, the abundance estimation of the unmixing algorithms is significantly improved. We present a rigorous mathematical framework to describe and correct for scale variability and provide extensive experimental validation of the proposed algorithm. Furthermore, the algorithm's impact is evaluated across a wide range of state-of-the-art unmixing methods on two synthetic and two real hyperspectral datasets. The proposed preprocessing step consistently improves the performance of these algorithms, achieving error reductions of around 50%, even for algorithms specifically designed to handle spectral variability. This demonstrates that scale correction acts as a complementary step, facilitating more accurate unmixing with existing methods. The algorithm's generality, consistent impact, and significant influence highlight its potential as a key component in practical hyperspectral unmixing pipelines. The implementation code will be made publicly available upon publication.

2506.22440 2026-05-18 cs.CY cs.LG cs.MA econ.GN q-fin.EC

From Model Design to Organizational Design: Complexity Redistribution and Trade-Offs in Generative AI

从模型设计到组织设计:生成AI中的复杂性再分配与权衡

Sharique Hasan, Alexander Oettl, Sampsa Samila

AI总结 本文提出GAS框架,分析大语言模型如何重塑组织与竞争策略,揭示生成AI中通用性、准确性与简洁性之间的权衡及复杂性再分配对管理挑战的影响。

详情
AI中文摘要

本文引入通用性-准确性-简洁性(GAS)框架,分析大语言模型如何重塑组织和竞争策略。我们认为,将AI视为简单输入成本降低忽略了两个关键动态:(a)通用性、准确性和简洁性之间的固有权衡;(b)复杂性在利益相关者间的再分配。尽管LLMs通过简单接口提供高通用性和准确性,这种用户端的简洁性掩盖了复杂性向基础设施、合规性和专业人员的转移。因此,GAS权衡并未消失,而是从用户转移到组织,带来新的管理挑战,尤其是在高风险应用中的准确性问题。我们主张,竞争优势不再来自单纯的AI采用,而是来自通过抽象层设计、流程对齐和互补专业知识掌握再分配的复杂性。本研究通过阐明可扩展认知如何再分配复杂性并重新定义技术整合的条件,推动了AI战略的发展。

英文摘要

This paper introduces the Generality-Accuracy-Simplicity (GAS) framework to analyze how large language models (LLMs) are reshaping organizations and competitive strategy. We argue that viewing AI as a simple reduction in input costs overlooks two critical dynamics: (a) the inherent trade-offs among generality, accuracy, and simplicity, and (b) the redistribution of complexity across stakeholders. While LLMs appear to defy the traditional trade-off by offering high generality and accuracy through simple interfaces, this user-facing simplicity masks a significant shift of complexity to infrastructure, compliance, and specialized personnel. The GAS trade-off, therefore, does not disappear but is relocated from the user to the organization, creating new managerial challenges, particularly around accuracy in high-stakes applications. We contend that competitive advantage no longer stems from mere AI adoption, but from mastering this redistributed complexity through the design of abstraction layers, workflow alignment, and complementary expertise. This study advances AI strategy by clarifying how scalable cognition relocates complexity and redefines the conditions for technology integration.

2504.18522 2026-05-18 stat.ML cs.LG

Extrapolation Guarantees for Perturbation Modeling Under the Additive Latent Shift Assumption

在加性潜在位移假设下对扰动建模的外推保证

Julius von Kügelgen, Jakob Ketterer, Michael Vollenweider, Michael Scholkemper, Xinwei Shen, Nicolai Meinshausen, Jonas Peters

AI总结 本文研究了在加性潜在位移假设下,通过扰动建模预测新扰动组合的分布,提出PDAE模型并证明了外推保证。

Comments Updated preprint with new material and empirical results; previous version presented at the ICLR'25 Workshop on Learning Meaningful Representations of Life

详情
AI中文摘要

我们考虑了建模如基因敲除等扰动对测量(如单细胞RNA计数)的影响问题。给定某些扰动的数据,我们旨在预测新扰动组合的测量分布。为此,我们假设扰动在合适但未知的嵌入空间中是加性的。我们将数据生成过程建模为潜在变量模型,其中扰动相当于潜在空间中的均值位移,并且可以加性组合。我们证明,在训练扰动足够多样时,表示和扰动效应可识别到正交变换为止,并利用此推导出对未见扰动的外推保证,这些未见扰动可表示为已见扰动的线性组合。为了从数据中估计模型,我们提出扰动分布自编码器(PDAE),该模型通过最大化真实与模拟扰动分布之间的分布相似性进行训练。训练后的模型可用于预测之前未见的扰动分布。为了支持我们的理论结果,我们通过模拟展示了PDAE能够准确预测未见但可识别的扰动效应,并在组合基因扰动数据上展示了该方法。

英文摘要

We consider the problem of modeling the effects of perturbations like gene knockouts on measurements such as single-cell RNA counts. Given data for some perturbations, we aim to predict the distribution of measurements for new combinations of perturbations. To address this challenging extrapolation task, we posit that perturbations act additively in a suitable, unknown embedding space. We formulate the data-generating process as a latent variable model, in which perturbations amount to mean shifts in latent space and can be combined additively. We then prove that, given sufficiently diverse training perturbations, the representation and perturbation effects are identifiable up to orthogonal transformation and use this to derive extrapolation guarantees for unseen perturbations that can be expressed as linear combinations of seen ones. To estimate the model from data, we propose the perturbation distribution autoencoder (PDAE), which is trained by maximizing the distributional similarity between true and simulated perturbation distributions. The trained model can then be used to predict previously unseen perturbation distributions. In support of our theoretical results, we demonstrate through simulations that PDAE can accurately predict the effects of unseen but identifiable perturbations, and showcase the method on combinatorial gene perturbation data.