arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.01372 2026-05-11 math.OC cs.LG

Robust Sublinear Convergence Rates for Iterative Bregman Projections

Gabriel Peyré

AI总结本文研究了在熵正则化框架下迭代Bregman投影方法的收敛速率问题，提出了一种通用的分析框架，证明了其对偶收敛速率为 $O(1/k)$，且常数项仅与熵正则化参数 $γ$ 线性相关，因而称为“鲁棒”收敛速率。该方法通过构造约束分割诱导的商范数下的原问题和对偶问题界，结合非扩张性分析，简化了收敛性证明。文章还基于该框架提出了一个新的图结构上的流-Sinkhorn算法，用于计算图上的Wasserstein-1距离，并给出了其计算复杂度的理论保证。

2601.22400 2026-05-11 quant-ph cs.AI

Spectral Filtering for Complex Linear Dynamical Systems

Elad Hazan, Annie Marsden

AI总结本文研究了具有扇形有界谱的复值线性动态系统（CLDS）的学习问题，这类系统广泛存在于信号处理、结构状态空间模型和量子系统中。作者提出了一种基于Slepian基的谱滤波方法，证明了系统的可学习性由一个与状态空间维度无关的有效维度所决定。该方法进一步推导出适用于CLDS序列预测的维度无关的遗憾界，为复杂动态系统的高效学习提供了理论保证。

2601.21951 2026-05-11 stat.ML cs.LG stat.CO

Diffusion Path Samplers via Sequential Monte Carlo

James Matthew Young, Paula Cordero-Encinar, Sebastian Reich, Andrew Duncan, O. Deniz Akyildiz

AI总结本文提出了一种基于扩散路径的采样方法，用于从仅知归一化常数的目标分布中进行采样。研究通过构建一条从简单基础分布到目标分布的扩散路径，并结合序贯蒙特卡洛方法，高效估计时间变化分布的得分函数和密度函数。为降低得分估计的方差，作者还设计了实用的控制变量调度策略，并将该框架应用于多种扩散路径模型，理论分析与实验结果均验证了方法的有效性。

2601.07247 2026-05-11 stat.ML cs.LG math.ST stat.ME stat.TH

Multi-environment Invariance Learning with Missing Data

Yiran Jia, Jelena Bradic

AI总结本文研究了在存在缺失数据的情况下如何进行多环境不变性学习，以提升模型的因果解释能力和预测鲁棒性。作者提出了一种基于不变性目标的估计方法，并建立了变量选择性质和$\ell_2$误差收敛率的非渐近理论保证，分析了缺失数据比例和插补模型质量对性能的影响。实验表明，即使在使用有偏插补模型的情况下，该方法仍能有效降低预测误差，展现出良好的实用价值。

Comments Added co-author

2512.19408 2026-05-11 math.NA cs.CE cs.NA cs.RO cs.SY eess.SY math.DS

Mixed formulation and structure-preserving discretization of Cosserat rod dynamics in a port-Hamiltonian framework

Philipp L. Kinon, Simon R. Eugster, Peter Betsch

AI总结本文提出了一种基于能量的非线性空间Cosserat杆动力学建模框架，适用于大位移和大旋转情况。该方法采用混合变量形式，独立处理位移、速度和应力变量，并通过引入方向量描述有限旋转，避免了奇点并保持质量矩阵恒定，最终形成一个具有二次能量泛函的无限维端口哈密顿系统。通过结构保持的有限元离散化，得到具有哈密顿结构的有限维系统，有利于设计能量-动量一致的积分方案，并自然地集成阻尼材料行为和非标准驱动方式，为计算力学中涉及有限旋转的问题提供了新的能量-动量一致建模方法。

Comments 39 pages, 16 figures

2512.14018 2026-05-11 cs.SE cs.AI

PerfCoder: Large Language Models for Interpretable Code Performance Optimization

Jiuding Yang, Shengyao Lu, Hongxuan Liu, Shayan Shirahmad Gale Bagi, Zahra Fazel, Tomasz Czajkowski, Di Niu

AI总结 PerfCoder 是一种专门用于生成高性能代码的大语言模型，旨在解决当前模型在代码性能优化方面能力不足的问题。该模型通过可解释的定制化优化策略，结合真实优化轨迹和人类注释进行微调，并利用运行时测量进行强化学习对齐，从而直接提出并应用针对性的性能改进方案。实验表明，PerfCoder 在代码性能基准 PIE 上显著优于现有模型，同时还能生成可解释的代码反馈，提升大模型在代码优化任务中的表现。

2512.05967 2026-05-11 cs.IR cs.AI cs.CL cs.LG

Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms

Francesco Granata, Francesco Poggi, Misael Mongiovì

AI总结在大型语言模型时代，检索增强生成（RAG）架构因其能基于可靠知识源生成文本而受到关注，但在专业领域中，仅依赖语义相似性的RAG系统常因术语歧义影响检索准确性。本文提出ELERAG，一种结合实体链接技术的增强型RAG架构，旨在提升教育问答系统的事实准确性，特别是在意大利语环境下。通过引入基于Wikidata的实体链接模块和混合重排序策略，实验表明ELERAG在专业领域数据集上显著优于传统方法，验证了领域适配的混合策略在提升教育类RAG系统事实精度中的有效性。

详情

DOI: 10.3390/bdcc10040120
Journal ref: Big Data and Cognitive Computing, 10(4), 120. 2026

英文摘要

In the era of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) architectures are gaining significant attention for their ability to ground language generation in reliable knowledge sources. Despite their effectiveness, RAG systems based solely on semantic similarity often fail to ensure factual accuracy in specialized domains, where terminological ambiguity can affect retrieval relevance. This study proposes ELERAG, an enhanced RAG architecture that integrates a factual signal derived from Entity Linking to improve the accuracy of educational question-answering systems in Italian. The system includes a Wikidata-based Entity Linking module and implements a hybrid re-ranking strategy based on Reciprocal Rank Fusion (RRF). To validate our approach, we compared it against standard baselines and state-of-the-art methods, including a Weighted-Score Re-ranking, a standalone Cross-Encoder and a combined RRF+Cross-Encoder pipeline. Experiments were conducted on two benchmarks: a custom academic dataset and the standard SQuAD-it dataset. Results show that, in domain-specific contexts, ELERAG significantly outperforms both the baseline and the Cross-Encoder configurations. Conversely, the Cross-Encoder approaches achieve the best results on the general-domain dataset. These findings provide strong experimental evidence of the domain mismatch effect, highlighting the importance of domain-adapted hybrid strategies to enhance factual precision in educational RAG systems without relying on computationally expensive models trained on disparate data distributions. They also demonstrate the potential of entity-aware RAG systems in educational environments, fostering adaptive and reliable AI-based tutoring tools.

URL PDF HTML ☆

赞 0 踩 0

2510.22944 2026-05-11 cs.CR cs.AI

Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

Bin Wang, YiLu Zhong, MiDi Wan, WenJie Yu, YuanBing Ouyang, Yenan Huang, Hui Li

AI总结本文研究了良性但表述不佳的提示对大型语言模型生成代码安全性的影响，提出了一个包含目标清晰度、信息完整性和逻辑一致性的提示质量评估框架，并构建了CWE-BENCH-PYTHON基准数据集。实验表明，提示规范性越低，生成的代码越不安全，而使用思维链和自我修正等高级提示技术可显著提升代码安全性。该研究强调提升用户提示质量是增强AI生成代码安全性的关键策略。

Comments Accepted for publication in Empirical Software Engineering (EMSE) Journal

2510.00322 2026-05-11 cs.CR cs.CC cs.DS cs.LG

Privately Estimating Black-Box Statistics

Günter F. Steinke, Thomas Steinke

AI总结本文研究如何在差分隐私框架下对任意黑盒函数进行统计估计。传统方法依赖于对估计器灵敏度的已知界限，而这些界限往往难以获取或较大。为此，作者提出了一种在数据统计效率和函数调用效率之间进行权衡的方案，并给出了该方法的近似最优性下界。

2509.08350 2026-05-11 physics.soc-ph cs.LG math.AT

Chordless cycle filtrations for dimensionality detection in complex networks via topological data analysis

Aina Ferrà Marcús, Robert Jankowski, Meritxell Vila Miñana, Carles Casacuberta, M. Ángeles Serrano

AI总结本文研究了如何通过拓扑数据分析揭示复杂网络中潜在的超球几何结构，并估计其维度。作者提出了一种基于无弦环（chordless cycle）的拓扑权重方案，结合代数拓扑和机器学习方法，构建了一个无需重新训练即可应用于真实网络的神经网络模型。该方法为复杂网络的隐藏几何结构揭示和低维嵌入提供了稳健有效的解决方案。

2508.10880 2026-05-11 cs.CR cs.AI cs.CL

Searching for Privacy Risks in LLM Agents via Simulation

Yanzhe Zhang, Diyi Yang

AI总结本文研究了基于大语言模型（LLM）的智能体在多轮交互中可能引发的隐私风险问题，提出了一种基于搜索的框架，通过模拟隐私关键的智能体交互过程，交替优化攻击与防御策略。该方法利用LLM作为优化器，迭代生成新的智能体指令，并通过多线程并行搜索与跨线程传播提高策略探索效率。研究发现，攻击策略从直接请求演变为复杂的伪装和伪造授权等手段，防御策略则从简单的规则限制发展为更强大的身份验证状态机，且所发现的攻击与防御策略具有跨场景和跨模型的泛化能力，为构建隐私感知的智能体提供了重要参考。

Comments ICLR 2026

2508.02001 2026-05-11 cs.NI cs.LG

Versatile yet Efficient Network Traffic Analysis: Offloading Network Foundation Model to SmartNIC

Chungang Lin, Xuying Meng, Tianyu Zuo, Weiyao Zhang, Meng Shen, Ruijie Zhao, Guanming Che, Ruiqi Meng, Ziyue Huang, Haitong Luo, Zhiwei Xu, Yujun Zhang

AI总结随着网络流量加密的普及，传统的基于大规模标注数据的流量分析方法面临挑战，而安全运维又要求在边缘进行低延迟分析。为解决灵活性与效率难以兼顾的问题，本文提出Nepco系统，将网络基础模型卸载到SmartNIC上，通过聚焦局部字节区域的高效建模和硬件友好的处理流程，实现了高灵活性与低延迟的统一。实验表明，Nepco在保持高性能的同时，将端到端延迟降低了328倍。

Comments Under review

详情

英文摘要

Pervasive encryption makes large-scale labeling infeasible for traffic analysis, while security operations demand edge analysis to avert service degradation and further vulnerabilities. These pressures have produced two disjoint research lines: 1) versatile analysis, via network foundation models for low label dependency, and 2) efficient analysis, via hardware offloading for low analysis latency. However, versatility and efficiency have appeared fundamentally incompatible to co-achieve, with prior work consistently sacrificing one for the other, yet we show that this incompatibility is a consequence of polarized design choices across the three components of traffic analysis systems, i.e., traffic processing, model architecture, and analysis execution. In response, we present Nepco, a versatile yet efficient network traffic analysis system that offloads network foundation models to SmartNIC. Our key observation is that discriminative traffic information is concentrated in localized byte regions, motivating versatile yet efficient localized byte-sequence modeling rather than inefficient global modeling. To exploit this without incurring the latency bottlenecks of complex encoding steps, we employ a hardware-friendly processing pipeline that directly embeds raw byte sequences. Crucially, to maintain versatility across diverse tasks, we propose a pattern-aware convolutional architecture equipped with dedicated scoring and gating mechanisms. By exploiting translation invariance, this design dynamically locates and extracts salient semantic signatures. We prototype Nepco on the Nvidia BlueField-3 SmartNIC with multiengine collaborative analysis execution. The experimental results demonstrate that Nepco achieves macro F1 competitive with the best performances achieved by 8 state-of-the-art network foundation models, while reducing end-to-end latency by 328x to the millisecond scale.

URL PDF HTML ☆

赞 0 踩 0

2506.04565 2026-05-11 cs.MA cs.CL

From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems

Jiayi Chen, Junyi Ye, Guiling Wang

AI总结本文综述了复合人工智能系统（CAIS），该系统通过集成大型语言模型与检索器、代理、工具等外部组件，克服了单一模型在记忆、推理、实时 grounding 和多模态理解等方面的局限性。文章提出了基于组件角色和调度策略的多维分类体系，分析了包括检索增强生成（RAG）、LLM 代理、多模态 LLM 和调度机制在内的四种基础范式，并总结了当前系统的设计权衡与评估方法，指出了可扩展性、互操作性等关键挑战及未来研究方向。

2505.11325 2026-05-11 stat.ME cs.AI cs.LG stat.CO stat.ML

Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

Thomas Nagler, David Rügamer

AI总结本文研究了如何为先验-数据拟合网络（PFNs）提供不确定性量化方法，这类网络在表格数据预测任务中表现出色但缺乏对预测结果的不确定性估计。作者提出了一种基于鞅后验的采样方法，能够在无需调参的情况下高效构建预测均值、分位数等估计的贝叶斯后验，并证明了该方法的收敛性。实验表明，该方法在多个模拟和实际数据集上表现出良好的效率和校准能力。

2504.12922 2026-05-11 math.OC cs.LG math.LO math.PR

An abstract effective convergence theorem for stochastic processes, with applications to stochastic approximation

Morenikeji Neri, Nicholas Pischke, Thomas Powell

AI总结本文提出了一种适用于满足弱超鞅条件的随机过程的通用收敛定理，能够在较高抽象层次上提供定量收敛保证，核心在于引入了一个通用模度函数 $τ$ 来刻画解的期望唯一性。该定理具有高度统一的收敛速率，仅依赖于少量数据。作者进一步将该结果作为统一框架，推导了包括 Robbins-Siegmund 定理、Dvoretzky 收敛定理和随机拟 Fejér 单调序列收敛在内的多个关键定理的定量版本，并探讨了其在随机逼近中的多种应用。

Comments 25 pages

2311.08433 2026-05-11 q-bio.QM cs.LG stat.AP

Clinical Characteristics and Laboratory Biomarkers in ICU-admitted Septic Patients with and without Bacteremia

Sangwon Baek, Seung Jun Lee

AI总结该研究旨在探讨重症监护病房内感染性休克患者中是否存在菌血症的临床特征和实验室生物标志物的预测价值。通过回顾性分析218例患者的临床数据，研究发现C反应蛋白（CRP）和降钙素原（PCT）对菌血症具有较好的预测能力，而结合PCT、胆红素、中性粒细胞与淋巴细胞比值（NLR）、血小板、乳酸、红细胞沉降率（ESR）和格拉斯哥昏迷评分（GCS）构建的多变量逻辑回归模型显著提升了预测准确性，AUC达到0.907。研究还发现菌血症与患者死亡率存在显著关联，表明这些生物标志物在临床诊断和预后评估中具有重要应用价值。

Comments This research is not complete

2309.01751 2026-05-11 eess.IV cs.CV physics.geo-ph

Multispectral Indices for Wildfire Management

Afonso Oliveira, João P. Matos-Carvalho, Filipe Moutinho, Nuno Fachada

AI总结随着野火发生频率和强度的增加，传统地面监测方法难以应对火势和环境的快速变化，亟需更先进的管理手段。本文通过文献综述和两个实际案例研究，探讨了多光谱遥感影像在野火管理中的应用，评估了多种多光谱指数在植被、水域和人工结构等关键环境特征提取中的效果。研究发现，NVDI、MNDWI和MSR等指数在分割和特征提取方面表现突出，为提升野火监测、风险评估和应急响应提供了有效支持。

Comments The peer-reviewed version of this paper is published in Frontiers in Remote Sensing at https://doi.org/10.3389/frsen.2026.1807451. This version is typeset by the authors and differs only in pagination and typographical detail

2204.05551 2026-05-11 math.OC cs.LG cs.SY eess.SY math.DS

Near-Optimal Distributed Linear-Quadratic Regulator for Networked Systems

Sungho Shin, Yiheng Lin, Guannan Qu, Adam Wierman, Mihai Anitescu

AI总结本文研究了在分布式控制设置中，去中心化程度与控制性能之间的权衡问题。通过引入基于图结构的 $κ$-分布式控制方法，使得每个智能体仅依赖于图中距离为 $κ$ 的状态信息进行决策，从而在去中心化程度与控制性能之间建立联系。研究发现，在一定的温和条件下，$κ$-分布式控制与集中式最优控制之间的性能差距随 $κ$ 的增大呈指数级衰减，表明适度去中心化的分布式控制即可实现接近最优的控制效果，为大规模网络系统的控制提供了有效的架构方案。

2005.06674 2026-05-11 math.OC cs.LG math.DS

On the Convergence of Overlapping Schwarz Decomposition for Nonlinear Optimal Control

Sen Na, Sungho Shin, Mihai Anitescu, Victor M. Zavala

AI总结本文研究了重叠施瓦茨分解算法在求解非线性最优控制问题中的收敛性质。该算法将时间域划分为多个重叠子域，并并行求解各子域上的子问题，通过更新子域边界处的对偶信息实现收敛。研究证明该算法具有局部线性收敛性，且收敛速度随重叠大小呈指数级提升，并建立了适用于二次规划的全局收敛性结果，为二阶优化算法中的施瓦茨方法提供了理论支持。实验表明，该方法在四旋翼飞行器路径规划和偏微分方程控制问题中表现出比ADMM更高的效率，且接近集中式求解器Ipopt的性能。

Comments 16 pages

详情

DOI: 10.1109/TAC.2022.3194087
Journal ref: IEEE Transactions on Automatic Control, 2022

英文摘要

We study the convergence properties of an overlapping Schwarz decomposition algorithm for solving nonlinear optimal control problems (OCPs). The algorithm decomposes the time domain into a set of overlapping subdomains, and solves all subproblems defined over subdomains in parallel. The convergence is attained by updating primal-dual information at the boundaries of overlapping subdomains. We show that the algorithm exhibits local linear convergence, and that the convergence rate improves exponentially with the overlap size. We also establish global convergence results for a general quadratic programming, which enables the application of the Schwarz scheme inside second-order optimization algorithms (e.g., sequential quadratic programming). The theoretical foundation of our convergence analysis is a sensitivity result of nonlinear OCPs, which we call "exponential decay of sensitivity" (EDS). Intuitively, EDS states that the impact of perturbations at domain boundaries (i.e. initial and terminal time) on the solution decays exponentially as one moves into the domain. Here, we expand a previous analysis available in the literature by showing that EDS holds for both primal and dual solutions of nonlinear OCPs, under uniform second-order sufficient condition, controllability condition, and boundedness condition. We conduct experiments with a quadrotor motion planning problem and a PDE control problem to validate our theory; and show that the approach is significantly more efficient than ADMM and as efficient as the centralized solver Ipopt.

URL PDF HTML ☆

赞 0 踩 0

2605.07517 2026-05-11 cs.IR cs.AI

LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

Giorgia Bolognesi, Claudio Estatico, Ulderico Fugacci, Isabella Mastroianni, Claudio Muselli, Luca Oneto

AI总结 LARAG 是一种面向超链接技术文档的检索增强生成（RAG）方法，旨在解决传统基于嵌入的检索器忽略文档中超链接结构的问题。该方法利用技术文档中已有的超链接关系，将其作为元数据编码到文档块中，从而实现基于局部相关性的图式检索。实验表明，LARAG 在保持高质量答案生成的同时，减少了检索的文档块数量和生成的 token 数量，提升了 RAG 系统的效率和准确性。

2605.07481 2026-05-11 cs.CR cs.AI

Vaporizer: Breaking Watermarking Schemes for Large Language Model Outputs

Jonathan Hong Jin Ng, Anh Tu Ngo, Anupam Chattopadhyay

AI总结本文研究了当前最先进的大型语言模型（LLM）输出水印方案，并评估了它们在面对多种语义保持的文本攻击时的有效性。作者提出了多种攻击策略，包括词汇替换、机器翻译和神经重述等，通过BERT分数、文本复杂度、语法错误和阅读难度等指标衡量攻击效果。实验表明，尽管不同水印方法的抗攻击能力有所差异，但大多数水印都可以在不显著影响语义的前提下被有效移除，揭示了现有水印系统的安全漏洞与改进方向。

2605.07472 2026-05-11 cs.CR cs.AI cs.MA

HBEE: Human Behavioral Entropy Engine -- Pre-Registered Multi-Agent LLM Simulation of Peer-Suspicion-Based Detection Inversion

Vickson Ferrel

AI总结该研究探讨了在多智能体模拟环境中，基于同行怀疑机制的内部威胁检测是否能够有效识别自适应型内鬼。研究通过预注册实验，对比了不同防御模式和对手类型下的检测效果，发现自适应型内鬼的行为反而降低了其被怀疑的程度，导致检测结果出现反转。研究还表明，传统基于行为分析的检测方法在面对自适应对手时可能失效，并公开了实验模拟器及相关数据以供进一步研究。

Comments 14 pages, 6 figures. Pre-registration document and full deviation log included in artifact

2605.07444 2026-05-11 cs.CE cs.AI

Accelerated and data-efficient flow prediction in stirred tanks via physics-informed learning

Mahdi Naderibeni, Liang Wu, David M. J. Tax

AI总结本文研究了在工业规模搅拌罐中利用物理信息学习方法加速并提高稳态流场预测效率的问题。通过生成基于雷诺平均纳维-斯托克斯方程（RANS）的稳态流场数据集，作者对比了纯数据驱动模型与引入物理约束的隐式神经网络模型的性能，发现随着训练数据量的增加预测误差逐步下降，但在数据量达到一定规模后收益递减。引入物理约束不仅提升了模型精度和训练稳定性，还改善了示踪剂传输行为，但同时也增加了训练复杂度。

2605.07422 2026-05-11 cs.SE cs.AI

Prompt Engineering Strategies for LLM-based Qualitative Coding of Psychological Safety in Software Engineering Communities: A Controlled Empirical Study

Moaath Alshaikh, Tasneem Alshaher, Ricardo Vieira, Beatriz Santana, Clelio Xavier, Jose Amancio, Glauco Carneiro, Julio Leite, Savio Freire, Manoel Mendonca

AI总结本研究探讨了如何通过提示工程策略提升基于大语言模型（LLM）对软件工程社区心理安全性的定性编码效果。通过对比三种LLM在零样本和多样本封闭编码策略下的表现，发现多样本提示在提升 Claude Haiku 的编码一致性方面效果显著，而其他模型则未表现出类似提升。研究还揭示了模型间稳定性差异及系统性预测偏差，为未来基于LLM的定性分析提供了实证指导。

Comments 9 pages, 5 figures. Accepted at the 1st International Workshop on Prompt Engineering for Software Engineering (PROMPT-SE 2026), co-located with the 30th International Conference on Evaluation and Assessment in Software Engineering (EASE 2026), Glasgow, Scotland, United Kingdom, June 9--12, 2026

2605.07417 2026-05-11 cs.AR cs.LG

Effective and Memory-Efficient Alternatives to ECC for Reliable Large-Scale DNNs

Mohammad Hasan Ahmadilivani, Marten Roots, Marco Restifo, Sven-Markus Loorits, Luca Di Mauro, Jaan Raik

AI总结随着深度学习模型在自动驾驶和数据中心等关键领域中的广泛应用，硬件故障对系统可靠性构成了严重威胁。本文研究了传统ECC在保护深度神经网络参数方面的局限性，并提出了两种高效且内存友好的替代方案——MSET和CEP。实验表明，这两种方法在不增加内存开销的情况下显著提升了大型卷积神经网络和视觉Transformer的可靠性，并在面积和延迟方面优于传统SECDED ECC方案。

Comments 7 pages, 7 figures, 3 tables. The paper is accepted at IEEE IOLTS'26

2605.07414 2026-05-11 cs.MA cs.AI cs.CR

OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing

Jianming Chen, Yawen Wang, Junjie Wang, Zhe Liu, Qing Wang, Fanjiang Xu

AI总结本文提出了一种名为OrchJail的工具调用型文生图代理的越狱方法，通过指导式模糊测试针对工具链的组合方式发起攻击。该方法利用高风险的工具调用模式，学习成功越狱案例中的因果关系，从而更高效地生成能够触发危险行为的提示词。实验表明，OrchJail在多个代表性模型上显著提升了越狱成功率和图像质量，同时降低了攻击成本，揭示了工具链编排作为新型安全漏洞的重要性。

2605.07389 2026-05-11 cs.SE cs.LG

Exploring CoCo Challenges in ML Engineering Teams: Insights From the Semiconductor Industry

A. Azamnouri, M. Haug, L. Woltmann, M. Fritz, J. Bogner, S. Wagner

AI总结本文探讨了在半导体行业中机器学习工程团队面临的协作与沟通（CoCo）挑战。研究通过访谈全球半导体公司的12位从业者，识别出16项常见的CoCo挑战，其中角色与责任不明确是最关键的问题。研究还总结了有效的实践与建议，揭示了在硬件驱动约束下，这类挑战与软件企业存在差异，为未来研究和工具开发提供了方向。

详情

英文摘要

The integration of machine learning (ML) into complex software systems has increased challenges in collaboration and communication (CoCo) of the teams building these systems. ML engineering (MLE) teams often involve diverse roles, ML engineers, data scientists, software engineers, and domain experts, each bringing unique goals, experiences, and jargon. These interdisciplinary dynamics can make it challenging to deploy, reproduce, and maintain ML-enabled systems over the long term. Previous studies have uncovered several CoCo challenges and practices, but most have focused on software-centric companies, leaving limited empirical understanding of how these dynamics unfold in hardware-centric contexts. In hardware-centric environments, CoCo challenges are shaped by additional constraints such as strict data governance, long development cycles, and tight coupling with physical processes, which amplify coordination complexity and reduce flexibility. To strengthen empirical understanding in such settings, we present a qualitative investigation of MLE teams within a global semiconductor company, where ML-enabled systems and manufacturing processes introduce additional complexity. We interviewed 12 practitioners regarding CoCo practices, tools, challenges, and approaches. Through analysis, we identified 16 recurring challenges, with unclear roles and responsibilities emerging as the most critical, and common practices and recommendations practitioners considered effective in mitigating CoCo problems. While grounded in a single organizational context, our findings align with known issues in interdisciplinary ML-enabled systems development, but also demonstrate how these challenges manifest differently under hardware-driven constraints. Our results highlight directions for future research and tool support to strengthen CoCo in MLE projects and ensure the success of ML-enabled systems.

URL PDF HTML ☆

赞 0 踩 0

2605.07385 2026-05-11 cs.GR cs.CV

Velocity-Space 3D Asset Editing

Hao Liu, Yuxuan Lin, Jingfeng Guo, Ruihang Chu, Junjie Wang, Ruotong Li, Yujiu Yang

AI总结该论文提出了一种名为VS3D的3D资产编辑方法，旨在实现对3D模型局部区域的精确编辑，同时保持其余部分不变。传统方法依赖外部机制实现局部性，而VS3D则在ODE采样器内部进行针对性干预，解决了身份泄露、编辑信号放大不足以及几何和材质阶段的身份拖拽等问题。VS3D框架无需训练和掩码，通过三个互补模块分别在编辑流程的不同阶段进行优化，提升了局部编辑的准确性和保真度。

2605.07354 2026-05-11 eess.SP cs.CV

Task-Oriented Communication for Human Action Understanding via Edge-Cloud Co-Inference

Jingyi Liu, Cheng Yuan, Lijun He, Jun Zhang, Jiawei Shao

AI总结随着智能感知技术的广泛应用，网络边缘对人类动作理解的准确性需求日益增长。为解决传统方法中视频数据传输带宽消耗大、延迟高和隐私泄露等问题，本文提出了一种基于边缘-云端协同的面向任务的通信框架TOAU。该方法通过单目姿态估计器提取视频中的关节坐标，并利用VQ-VAE将其编码为离散运动标记，仅传输少量的编码索引，极大降低了传输负载和延迟，同时保障了隐私安全。实验表明，该系统在保持动作理解精度的同时，显著提升了通信效率。

Comments 12 pages, 6 figures

2605.07314 2026-05-11 cs.IR cs.AI

DCGL: Dual-Channel Graph Learning with Large Language Models for Knowledge-Aware Recommendation

Xinchi Zou, Tongzhenzhi Su, Jianjun Li, Yuan Fu, Chang Liu, Zhiying Deng, Zhiwei Shen

AI总结本文提出了一种名为DCGL的双通道图学习框架，旨在提升知识感知推荐系统的性能。该方法通过解耦语义信息与用户行为模式，结合多级对比学习和动态融合机制，有效解决了知识图谱与大语言模型融合中的语义关系建模不足、嵌入融合干扰以及用户-物品交互频率差异等问题。实验表明，DCGL在多个真实数据集上优于现有方法，尤其在稀疏场景下表现出显著优势。

Comments Accepted by SIGIR 2026