arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2115
2606.17633 2026-06-17 cs.HC 新提交

AdaPT: Adaptive Lesson Plan Transformer for Cross-Regional and Differentiated Instruction

AdaPT:面向跨区域与差异化教学的适应性教案转换器

Yanjie Zhang, Jiajun Zhu, Minyu Wu, Huamin Qu, Sicheng Song

AI总结 提出AdaPT系统,利用大语言模型将现有教案转换为适应新区域和学生特征的内容,通过交互界面、结构化表示和解释机制支持教师迭代优化,促进教育公平。

详情
AI中文摘要

由于教育不平等,高质量的教案往往与不同教育环境的需求不匹配。教师通常会修改现有教案以适应新环境,但当前工具侧重于从头生成内容,增加了额外工作量。此外,在支持教师快速适应新学习特征方面仍存在关键缺口。为弥补这些缺口,我们提出AdaPT,一个利用大语言模型支持现有教案转换以用于跨区域和差异化教学的系统。AdaPT具有交互式界面,允许教师输入学生特征,提供结构化的教案表示,提供教案转换的解释,自动调整教案内容以适应新环境,并支持迭代的教师参与式优化。我们通过一项包含9名教师的用户研究和一项包含3名专家的评估对AdaPT进行了评估。结果表明,AdaPT支持教师的工作流程,并为促进教育公平提供了一条有前景的途径。

英文摘要

Due to educational inequality, high-quality lesson plans often mismatch the needs of disparate educational contexts. Teachers typically modify existing lesson plans to fit new contexts, but current tools instead focus on generating content from scratch, creating additional workload. Moreover, a critical gap remains in supporting teachers to quickly adapt to new learning profiles. To bridge these gaps, we present AdaPT, a system leverages LLMs to support transformation of existing lesson plans for cross-regional and differentiated instruction. AdaPT features an interactive interface that allows teachers to input student profiles, offers structured lesson representation, provides explanations for lesson-plan transformations, automatically adapts lesson content for new contexts, and supports iterative, teacher-in-the-loop refinement. We evaluated AdaPT through a user study with 9 teachers and an expert evaluation with 3 specialists. Results show that AdaPT supports workflows of teachers and offers a promising pathway toward promoting educational equity.

2606.17616 2026-06-17 cs.HC 新提交

Towards Speech Impairment Prediction in German-Speaking Individuals with Amyotrophic Lateral Sclerosis

针对德语肌萎缩侧索硬化症患者的言语障碍预测

Monica Gonzalez-Machorro, Ricarda von Heynitz, Justine Hanslmeier, Finja Grimm, Alexandra-Iulia Deac, Anne Gründel, Isabell Cordts, Björn Schuller

AI总结 本研究利用两种临床言语评分,通过交叉和个性化建模范式预测德语ALS患者的言语障碍,发现重复任务在预测言语相关生活质量方面表现最佳。

Comments Paper accepted at Interspeech 2026, Sydney, Australia

详情
AI中文摘要

肌萎缩侧索硬化症(ALS)是一种神经退行性疾病,常因延髓功能障碍影响言语。在本研究中,我们使用两种临床言语相关评分预测ALS患者的言语障碍。我们评估了横截面(跨说话人)和个性化(说话人内)建模范式,并分析了常见言语任务的效用,以促进ALS患者言语数据收集的标准化。对66名德语ALS患者的实验表明,重复任务(/da/-/da/、/da/-/ba/)在预测构音障碍患者生活质量问卷方面取得了最佳的横截面性能(一致性相关系数CCC=0.62),而说话人内设置达到了CCC=0.86。本研究是向德语ALS患者言语障碍预测迈出的初步一步,并突显了自动言语分析作为言语障碍评估支持工具的潜力。

英文摘要

Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease, often affecting speech due to bulbar dysfunction. In this study, we predict speech impairment in people with ALS (pwALS) using two clinical speech-related scores. We evaluate cross-sectional (across speakers) and personalised (within-speaker) modelling paradigms and analyse the utility of common speech tasks to contribute to the standardisation of speech data collection for pwALS. Experiments on a German-speaking cohort of 66 pwALS show that repetition tasks (/da/-/da/, /da/-/ba/) achieved the best cross-sectional performance (Concordance Correlation Coefficient (CCC) = 0.62) for predicting the Quality of Life in the Dysarthric Speaker questionnaire, while the within-speaker setting reached a CCC of 0.86. This study represents an initial step towards speech impairment prediction in German-speaking pwALS and highlights the potential of automated speech analysis as a supportive tool for speech impairment assessment.

2606.17613 2026-06-17 cs.DC 新提交

From GPU to Microcontroller: Online Ridge Regression for Edge-Deployable Traffic Prediction

从GPU到微控制器:面向边缘部署的在线岭回归交通预测

Suresh Purini, Archit Narwadkar, Deepak Gangadharan

AI总结 针对资源受限的边缘设备,提出用每传感器岭回归结合递归最小二乘在线自适应替代复杂神经网络,在PEMS基准上取得最优MAPE,并在ESP32微控制器上实现毫秒级推理。

Comments 8 pages, IEEE Intelligent Transportation Systems Conference, 2026

详情
AI中文摘要

最先进的交通流预测模型,包括图卷积网络和无图MLP,需要跨所有传感器的集中式GPU训练,这使得它们不适用于资源受限的智能交通部署。我们表明,这种复杂性在很大程度上是不必要的。对最近的无图模型GLMST的参数分析显示,将其内部嵌入维度从64减少到4会使MAPE下降不到一个百分点,这表明模型的有效容量远超任务所需。受此发现启发,我们完全用每传感器的岭回归(使用水平对齐的周期特征)结合递归最小二乘(RLS)进行在线自适应来替代神经架构。我们的方法每传感器仅需444个参数(比GLMST少80倍),并在测试时进行在线自适应,在四个PEMS基准中的三个上取得了最佳MAPE,在第四个上保持在1个百分点以内。由于每个传感器的模型是自包含的且仅涉及初等线性代数,整个流程(训练、推理和在线自适应)可在无GPU的边缘硬件上运行。ESP32微控制器(160 MHz,520 KB SRAM)在7.4秒内完成冷启动训练,每次预测和更新在2毫秒内完成且零堆分配;单个树莓派5核心在0.21秒内完成冷启动训练,每次预测和更新在0.26毫秒内完成。

英文摘要

State-of-the-art traffic flow forecasting models, including Graph Convolutional Networks and graph-less MLPs, require centralized GPU training across all sensors, making them impractical for resource-constrained intelligent transportation deployments. We show that much of this complexity is unnecessary. A parametric analysis of the recent graph-less model GLMST reveals that reducing its internal embedding dimension from 64 to 4 degrades MAPE by less than one percentage point, suggesting that the model's effective capacity far exceeds what the task requires. Motivated by this finding, we replace the neural architecture entirely with per-sensor Ridge regression using horizon-aligned periodic features, combined with Recursive Least Squares (RLS) for online adaptation. With only 444 parameters per sensor (80x fewer than GLMST) and test-time online adaptation, our method achieves the best MAPE on three of four PEMS benchmarks, and remains within one percentage point on the fourth. Because each sensor's model is self-contained and involves only elementary linear algebra, the entire pipeline (training, inference, and online adaptation) runs on edge hardware without a GPU. An ESP32 microcontroller (160 MHz, 520 KB SRAM) completes cold-start training in 7.4s and each predict-and-update in under 2ms with zero heap allocation; a single Raspberry Pi 5 core completes cold-start training in 0.21s and each predict-and-update in 0.26ms.

2606.17612 2026-06-17 cs.SE 新提交

PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices

PracRepair: 受人类调试实践启发的大语言模型赋能自动化程序修复

Yu Cheng, Zhongxin Liu, Zhenchang Xing, Chao Ni, Qing Huang, Xiaoxue Ren

AI总结 提出PracRepair框架,通过构建按需静态-动态上下文、进行问题驱动的故障诊断并迭代细化补丁,利用动态信息提升LLM在程序修复中的效果,在Defects4J和真实世界漏洞上取得最优性能。

详情
AI中文摘要

随着软件系统规模和复杂性的增长,调试和修复仍然成本高昂且耗时。大语言模型(LLM)推动了自动化程序修复(APR)的发展,但现有基于LLM的APR方法仍主要依赖静态或检索到的上下文、错误消息和粗粒度的验证结果。因此,它们未能充分利用动态信息来理解故障和修复,包括故障执行动态和补丁验证动态。然而,有效利用这些信息具有挑战性:故障执行轨迹庞大且嘈杂,原始静态-动态上下文缺乏自解释性,而补丁验证动态通常被简化为粗粒度的反馈。为应对这些挑战,我们提出PracRepair,一个完全自动化的基于LLM的APR框架,受人类调试实践启发。PracRepair从有缺陷的程序和故障执行中构建按需的静态-动态上下文,执行问题驱动的故障诊断以形成明确的修复假设,并使用验证诊断和轨迹级行为变化迭代地细化候选补丁。在Defects4J V1.2和V2.0上的实验结果表明,PracRepair持续优于最先进的基线方法。具体而言,在GPT-3.5下,PracRepair在Defects4J V1.2/V2.0上正确修复了139/136个错误,而在GPT-4o下进一步改进至162/171。此外,PracRepair有效泛化到真实世界漏洞(RWB),在多个基础模型上取得了最佳性能。

英文摘要

As software systems grow in scale and complexity, debugging and repair remain costly and time-consuming. Large language models (LLMs) have advanced automated program repair (APR), but existing LLM-based APR approaches still largely rely on static or retrieved context, error messages, and coarse-grained validation outcomes. As a result, they underutilize dynamic information for failure understanding and repair, including failure-execution dynamics and patch-validation dynamics. Effectively leveraging such information, however, is challenging: failure-execution traces are large and noisy, raw static-dynamic context is not self-explanatory, and patch-validation dynamics are often reduced to coarse feedback. To address these challenges, we propose \textsc{PracRepair}, a fully automated LLM-based APR framework inspired by human-like debugging practices. \textsc{PracRepair} constructs an on-demand static-dynamic context from buggy programs and failure executions, performs question-driven failure diagnosis to formulate explicit repair hypotheses, and iteratively refines candidate patches using validation diagnostics and trace-level behavioral changes. Experimental results on Defects4J V1.2 and V2.0 show that \textsc{PracRepair} consistently outperforms state-of-the-art baselines. Specifically, under GPT-3.5, \textsc{PracRepair} correctly fixes 139/136 bugs on Defects4J V1.2/V2.0, while under GPT-4o it further improves to 162/171. Moreover, \textsc{PracRepair} generalizes effectively to RWB (Real-World Bugs), achieving the best performance across multiple foundation models.

2606.17610 2026-06-17 cs.DL cs.CY 新提交

Beyond Citations: Comparing Scholarly, Policy, and Patent Impact Across the FT50 Journals

超越引用:比较FT50期刊的学术、政策和专利影响力

Arash Hajikhani, Yi Zhang, Mengjia Wu

AI总结 通过分析53种FT50期刊在学术引用、政策采纳和专利引用三个维度的表现,发现期刊影响力存在显著异质性,单一引用排名与多维排名仅中度相关,近半数期刊在纳入政策和专利指标后四分位变化。

Comments 28 pages, 9 figures, 1 table. Analysis of 53 FT50 and recently removed journals using citation, policy, and patent impact indicators. Submitted manuscript

详情
AI中文摘要

《金融时报》50强(FT50)期刊列表影响着全球商学院的招聘、晋升、认证和研究评估。然而,列表中的期刊通常被视为代表同质化的卓越层级。我们通过比较53种FT50及近期移除的期刊在三个不同影响力渠道的表现来检验这一假设:学术影响力(领域加权引用和可见性)、政策采纳以及通过专利引用实现的技术影响力。利用2005年至2019年间超过60,000篇出版物的面板数据,我们发现隐藏在二元FT50标签下的显著异质性。精英经济学期刊主导政策影响力,信息系统和市场营销期刊引领技术影响力,而许多高被引管理学期刊在学术界之外的影响力有限。引用、政策和专利指标在很大程度上是独立的影响力维度,仅基于引用的排名与多维排名仅中度相关。一旦纳入政策和专利指标,近半数期刊的四分位发生变化,表明仅基于学术引用的评估忽视了研究影响力的重要维度。尽管FT50仍被广泛用作期刊质量的二元分类,但我们的结果揭示了列表内部存在显著的影响力谱系,并表明期刊排名对影响力的定义和测量方式高度敏感。

英文摘要

The Financial Times 50 (FT50) journal list shapes hiring, promotion, accreditation, and research evaluation across business schools worldwide. Yet journals on the list are typically treated as if they represent a homogeneous tier of excellence. We test this assumption by comparing 53 FT50 and recently removed journals across three distinct impact channels: scholarly influence (field-weighted citations and visibility), policy uptake, and technological reach through patent citations. Using a panel of more than 60,000 publications from 2005 to 2019, we find striking heterogeneity hidden beneath the binary FT50 label. Elite economics journals dominate policy influence, information systems and marketing journals lead technological impact, while many highly cited management journals exhibit limited reach beyond academia. Citation, policy, and patent indicators behave as largely independent dimensions of impact, with a citation-only ranking correlating only moderately with a multidimensional ranking. Nearly half of all journals change quartile once policy and patent indicators are incorporated, demonstrating that assessments based solely on scholarly citations overlook important dimensions of research influence. While the FT50 remains widely used as a binary classification of journal quality, our results reveal a substantial within-list impact spectrum and show that journal rankings are highly sensitive to how impact is defined and measured.

2606.17593 2026-06-17 cs.SE 新提交

Why Model Credibility Isn't Enough: -Rethinking Trust in Simulation Architectures

为什么模型可信度不够?——重新思考仿真架构中的信任

Romain Barbedienne, Adeline Lanugue, Rim Kaddah, Julien Silande, Anthony Levillain, Cedric Leclerc, Maxime Hayet, Boussaad Soualmi, Cristian Maxim

AI总结 本文探讨仿真架构的可信度评估问题,综述装配可信度领域现状,比较敏感性分析、专家定性分析、AI可解释性和网络方法,并基于严谨性、泛化性和资源需求评估各方法优劣。

Comments Annual Congress of Japan Society of Automotive Engineers (JSAE), May 2026, Yokohama, Japan

详情
AI中文摘要

仿真模型的可信度是一个重要课题。已有多种方法试图量化仿真的可信度。然而,模型大多是在仿真架构中组装而成的。能否根据组成仿真架构的模型的可信度来评估该架构的可信度?本文旨在通过提供装配可信度领域当前最新技术的概述来解决这一问题。它将比较敏感性分析技术、专家定性分析、AI中的可解释性以及网络方法。最后,基于严谨性、泛化性和资源需求等标准对所提出的方法进行评估,将揭示每种方法的优缺点。

英文摘要

Credibility of a simulation model is an important topic. Several approaches try to quantify the credibility of simulation. However, models are mostly assembled within a simulation architecture. Can the credibility of a simulation architecture be assessed based on the credibility of the models that comprise it? This paper aims to address this issue by providing an overview of the current state of the art in the field of assembly credibility. It will compare sensitivity analysis techniques, qualitative analysis by experts, explainability in AI, and networks. Finally, an assessment of the proposed approaches, based on criteria such as rigor, generalization, and resource requirements, will reveal the strengths and weaknesses of each approach.

2606.17582 2026-06-17 cs.DB 新提交

Collaborative Large and Small Language Models for Accurate and Scalable Data Repair

协作式大小语言模型实现准确且可扩展的数据修复

Qian Chen, Jianwei Wang, Wenjie Zhang

AI总结 提出LasRepair框架,利用大语言模型作为指导者选择全局修复上下文,小语言模型作为校正者高效修复错误数据,并通过EM过程和置信度加权进一步提升修复质量。

Comments 14 pages, 11 figures

详情
AI中文摘要

我们研究数据修复问题,这是数据清洗中的关键任务,旨在纠正原始数据集中的错误条目以提高整体数据质量。尽管近期基于数据驱动的方法,特别是基于大型语言模型(LLM)的方法取得了显著性能,但我们观察到:(i) 它们直接在原始且低质量的上下文中修复数据,这可能损害学习信号;(ii) 它们直接使用不确定的模型输出作为修复结果,可能引入不可靠的修正并损害修复质量。受小型语言模型(SLM)的效率和LLM的能力的启发,并旨在解决上述局限性,我们提出了LasRepair,一个协作大小语言模型进行数据修复的框架。LasRepair使用LLM作为指导者,选择全局修复上下文来引导SLM。SLM作为校正者,使用选定的上下文更高效地修复错误数据。此外,为了进一步提高上下文质量,我们将LasRepair扩展为LasRepair+,它将数据修复公式化为期望最大化(EM)过程,交替进行E步(更新校正者参数)和M步(细化修复上下文)。此外,为了减轻模型不确定性,我们提出了LasRepair++,它使用列校准的模型置信度在更新校正者时降低不可靠修复行的权重,从而增强修复质量。理论分析和实证评估证明了我们方法的优越性。我们从理论上证明了EM风格过程和基于置信度的加权的有效性。在真实数据集上的实验表明,LasRepair++相比最强基线平均F1分数提高了18.1%。

英文摘要

We study the problem of data repair, a key task in data cleaning that corrects erroneous entries in raw datasets to improve overall data quality. Although recent data-driven methods, especially those based on large language models (LLMs), achieve remarkable performance, we observe that: (i) they directly repair data in the raw and low-quality context, which may compromise learning signals, and (ii) they directly use uncertain model outputs as repairs, potentially introducing unreliable corrections and compromising repair quality. Motivated by the efficiency of small language models (SLMs) and the capabilities of LLMs, and aiming to address the above limitations, we propose LasRepair, a framework that collaborates Large and small language models for data repair. LasRepair employs an LLM as an instructor, which selects a global repair context to guide the SLM. The SLM acts as a corrector, using the selected context to repair erroneous data more efficiently. Moreover, to further improve context quality, we extend LasRepair to LasRepair+, which formulates data repair as an Expectation-Maximisation (EM) procedure that alternates between an E-step for updating the corrector parameters and an M-step for refining the repair context. Furthermore, to mitigate model uncertainty, we propose LasRepair++, which uses column-calibrated model confidence to down-weight unreliable repaired rows when updating the corrector, thereby enhancing repair quality. Theoretical analysis and empirical evaluation demonstrate the superiority of our methods. We theoretically prove the effectiveness of the EM-style procedure and the confidence-based weighting. Experiments on real-world datasets show that LasRepair++~ achieves an average F1-score improvement of 18.1% over the strongest baseline.

2606.17578 2026-06-17 cs.DS 新提交

Exact Algorithms for Edge Deletion to Cactus Graphs and Weighted Variants

删除边到仙人掌图的精确算法及加权变体

Wenhao Song

AI总结 针对删除最少边使图变为连通仙人掌的问题,提出O*(2^n)时间复杂度的精确算法,并推广到有限不同代价的加权情形。

详情
AI中文摘要

我们研究删除边到仙人掌图的精确指数时间算法。给定连通图$G$,任务是删除最少边使得剩余生成图是连通仙人掌。Akhtar和Philip (IWOCA 2026)给出了无权重问题的$O^*(3^n)$时间算法,其中$n$是输入图的顶点数,$O^*(\cdot)$符号隐藏多项式因子。我们将这个界改进到$O^*(2^n)$时间和空间。更一般地,如果删除代价至多取$q$个不同的非负实数值,则加权问题可以在$O^*(2^n n^{O(q)})$时间和空间内解决。因此,每个固定数量的不同代价,特别是无权重情形,都有更快的精确算法。对于总权重为$W$的非负整数代价,我们得到一个$O^*(2^n(W+1))$伪多项式算法,而任意非负实数代价则有一个$O^*(3^n)$精确算法。

英文摘要

We study exact exponential-time algorithms for Edge Deletion to Cactus. Given a connected graph $G$, the task is to delete a minimum number of edges so that the remaining spanning graph is a connected cactus. Akhtar and Philip (IWOCA 2026) gave an $O^*(3^n)$-time algorithm for the unweighted problem, where $n$ is the number of vertices in the input graph and the $O^*(\cdot)$ notation hides polynomial factors. We improve this bound to $O^*(2^n)$ time and space. More generally, if the deletion costs take at most $q$ distinct nonnegative real values, then the weighted problem can be solved in $O^*(2^n n^{O(q)})$ time and space. Thus every fixed number of distinct costs, and in particular the unweighted case, admits a faster exact algorithm. For nonnegative integer costs of total weight $W$, we obtain an $O^*(2^n(W+1))$ pseudo-polynomial algorithm, while arbitrary nonnegative real costs admit an $O^*(3^n)$ exact algorithm.

2606.17575 2026-06-17 eess.SY cs.SY 新提交

Dynamic Analysis of Centralized Energy Storage Systems -- A Comparison between Grid-following and Grid-forming Controls

集中式储能系统的动态分析——电网跟随与电网形成控制的比较

Qiang Fu, Siqi Bu, Yang Wang, Mingyu Yan

AI总结 本文通过小信号稳定性分析,比较集中式储能系统中电网跟随与电网形成控制的动态特性,发现单一控制类型具有动态叠加特征,而混合控制需限制电网形成比例以避免模态共振。

Comments This paper has been accepted for publication in IEEE TRANS POWER SYSTEMS, 2026. The final version of record will be available via IEEE Xplore

详情
Journal ref
IEEE TRANS POWER SYSTEMS, 2026
AI中文摘要

本研究使用电网跟随(GFL)和电网形成(GFM)控制,特别关注双向功率流和多个储能系统(ESS),研究了集中式储能系统(CESS)的小信号稳定性。为了解决考虑全面GFL和GFM控制回路时CESS的复杂动态问题,通过关注主导振荡模式,使用虚拟阻尼方法简化了高阶动态。阻尼分析验证了使用单一类型控制(GFL或GFM)的CESS具有动态叠加特性。具体来说,随着ESS数量增加,GFM-CESS的阻尼改善,而GFL-CESS的阻尼下降。阻尼灵敏度表明,GFM-CESS的阻尼对双向功率流和所有控制回路更敏感,而GFL-CESS的阻尼对d轴控制回路更敏感。因此,GFM-CESS更适合大规模集成,但在功率反转显著的情况下受到限制。如果在CESS中混合使用GFL和GFM控制,应限制GFM-CESS的比例,以避免GFL-CESS和GFM-CESS之间的模态共振导致不稳定。这强调实施GFM-CESS需要考虑场景限制,而不是在混合集成条件下追求最大集成。通过模态分析和时域仿真验证了结论。

英文摘要

This study investigates the small-signal stability of centralized energy storage systems (CESSs) using grid-following (GFL) and grid-forming (GFM) controls, particularly focusing on bidirectional power flow and multiple energy storage systems (ESSs). To address the issue of complex dynamics in CESSs when comprehensive GFL and GFM control loops are considered, high-order dynamics are simplified using the virtual damping method by focusing on the dominant oscillation mode. Damping analysis verifies that CESSs using a single-type control (either GFL or GFM) have dynamic superimposition characteristics. Specifically, as ESS number increases, the damping of GFM-CESSs improves but that of GFL-CESSs decreases. The damping sensitivity shows that the damping of GFM-CESSs is more sensitive to bidirectional power flow and all control loops, whereas that of GFL-CESSs is more sensitive to d-axis control loop. Consequently, GFM-CESSs are preferred for large-scale integration but are limited in scenarios with significant power reversal. If GFL and GFM controls are hybridized in CESSs, the ratio of GFM-CESSs should be constrained to avoid instability from modal resonance between GFL-CESSs and GFM-CESSs. This highlights that implementing GFM-CESSs necessitates considering scenario limitations rather than pursuing maximal integration under hybrid integration conditions. The conclusions are validated through modal analysis and time-domain simulations.

2606.17573 2026-06-17 cs.OS cs.CR 新提交

Cordon: Semantic Transactions for Tool-Using LLM Agents

Cordon: 工具使用型LLM代理的语义事务

Zheng Chen, Hanqing Liu, Duling Xu, Dong Dong, Jialin Li, Bangzheng Pu, Jidong Zhai

AI总结 提出Cordon事务运行时系统,通过语义事务边界实现多步代理工作流的提交、回滚、恢复和审计,减少不可逆效应失败并暴露跨步违规。

详情
AI中文摘要

使用工具的LLM代理正在将计算单元从显式的人类指令转变为具有状态后果的模型驱动任务。然而,今天的代理运行时仍然将工具暴露为孤立的RPC。这种接口为运行时提供了便捷的集成点,但缺乏用于跨多步代理工作流进行提交、回滚、恢复和审计的任务范围执行边界。我们认为,这种不匹配需要运行时隔离边界,而不是另一个每次调用的护栏。本文介绍了Cordon,一个事务性运行时系统,用于在提交前暂存和验证不可逆的代理效应。语义事务是一个任务级执行边界,它将工具意图和运行时跟踪的结果谱系绑定到可逆的本地状态、暂存的外部效应、委托的权限和审计元数据。Cordon通过一个事务管理器实现这种抽象,该管理器跟踪派生结果对象,在影子状态中执行可逆突变,在效应发件箱中暂存面向外部的动作,并记录恢复元数据。然后,运行时在提交状态或释放外部效应之前验证组合的执行流程。我们在对抗性和良性工作流上的评估表明,Cordon暴露了现有防御措施遗漏的跨步违规。它还在保持良性任务完成的同时,以适度的批准和延迟开销减少了不可逆效应失败。

英文摘要

Tool-using LLM agents are shifting the unit of computation from explicit human-issued commands to model-driven tasks with stateful consequences. Yet today's agent runtimes still expose tools as isolated RPCs. This interface gives runtimes a convenient integration point, but it lacks a task-scoped execution boundary for commit, rollback, recovery, and audit across multi-step agent workflows. We argue that this mismatch calls for a runtime containment boundary rather than another per-call guardrail. This paper introduces Cordon, a transactional runtime system for staging and validating irreversible agent effects before commit. A semantic transaction is a task-level execution boundary that binds tool intents and runtime-tracked result lineage to reversible local state, staged external effects, delegated authority, and audit metadata. Cordon implements this abstraction with a transaction manager that tracks derived result objects, executes reversible mutations in shadow state, stages outward-facing actions in an effect outbox, and records recovery metadata. The runtime then validates the composed execution flow before it commits state or releases external effects. Our evaluation across adversarial and benign workflows shows that Cordon exposes cross-step violations missed by existing defenses. It also reduces irreversible-effect failures while preserving benign task completion with modest approval and latency overhead.

2606.17568 2026-06-17 eess.SY cs.SY 新提交

Instability Caused by Integration of IBRs under Strong Grid Connections -- A Practical Case Study on Large-scale Energy Storage Systems

强电网连接下IBR集成引起的失稳——大规模储能系统的实际案例研究

Qiang Fu, Siqi Bu, Zijun Bin, Peng Li, Tong Wang

AI总结 本文通过大规模储能系统案例,揭示强电网连接下逆变器基资源(IBR)因PCS间动态交互叠加导致振荡失稳,强调功能控制设计与规模规划的重要性。

Comments This paper has been accepted for publication in IEEE TRANS POWER SYSTEMS, 2026. The final version of record will be available via IEEE Xplore

详情
AI中文摘要

众所周知,逆变器基资源(IBRs)在弱电网连接下可能导致换流器驱动的稳定性问题。然而,随着IBR数量的增加,即使在强电网连接下也可能发生失稳。本文以大规模储能系统(ESSs)为例,展示了一个实际案例来证明这一结论。在本研究中,ESS在向所连接电力系统提供容性和感性无功支持(通过ESS功能控制回路实现)时,在d-q坐标系中诱发了频率为150 Hz的振荡。理论分析表明,在强电网连接下,随着ESS规模扩大,ESS的功率转换系统(PCSs)之间的动态相互作用可能叠加并增强,从而降低振荡阻尼并导致系统失稳。这表明ESS功能控制回路在向电力系统提供支持时也存在潜在的失稳风险,应仔细检查。最后,确定了减轻振荡的主要影响因素,并基于SIMULINK平台验证了结论。本文为即使在强电网条件下系统失稳提供了有价值的实际见解,强调了功能控制设计和IBR主导系统规模仔细规划的重要性。

英文摘要

It has been well known that inverter-based resources (IBRs) can lead to converter-driven stability issues under weak grid connections. However, as the number of IBRs increases, instabilities can also occur even under strong grid connections. A practical case is presented to demonstrate this conclusion, using large-scale energy storage systems (ESSs) as an example. In this study, the ESSs induce oscillations with a frequency of 150 Hz in the d-q coordinates while providing both capacitive and inductive reactive power support (achieved by ESS functional control loops) to the connected power system. Theoretical analysis reveals that under strong grid connections, the dynamic interactions among power conversion systems (PCSs) of ESSs can be superimposed and intensified as the ESS scale extends, which reduces oscillation damping and leads to system instability. This indicates that ESS functional control loops also have potential instability risks when providing supports to power systems, which should be carefully examined. Finally, major impact factors are identified to mitigate the oscillations, and the conclusions are validated based on the SIMULINK platform. This paper provides valuable practical insights into system instabilities even under strong grid conditions, emphasizing the importance of functional control design and careful planning of the scale for IBR-dominated systems.

2606.17565 2026-06-17 eess.SY cs.SY 新提交

Stability Analysis in Large-scale Centralized Bidirectional Inverter-based Stations Connected to Bulk Power Systems through AC and DC Connections

大规模集中式双向逆变器站通过交流和直流连接接入大电网的稳定性分析

Qiang Fu, Wenjuan Du, Siqi Bu, Haifeng Wang

AI总结 研究大规模双向逆变器站引起的次同步振荡稳定性问题,比较交流和直流连接的影响,发现直流连接可降低失稳风险,且参数调节更有效。

Comments Accepted for publication in IEEE TRANS POWER SYSTEMS, 2026

详情
Journal ref
IEEE TRANS POWER SYSTEMS, 2026
AI中文摘要

大量受控直流资源(如电池储能系统)通过双向逆变器连接到交流电力系统以满足功率平衡需求。本研究探讨了由大规模双向逆变器站(IBS)引起的次同步频率范围内的换流器驱动稳定性(CDS)问题。通过考察三个因素:受控直流资源的数量、功率流向以及逆变器的控制参数,比较了IBS的交流和直流连接对次同步振荡(SSO)的影响。对于交流连接,无论功率流向如何,随着受控直流资源数量的增加,IBS可能引发不稳定。为了保持稳定,计算了IBS的最大功率幅值。发现如果直流线路电阻远小于交流线路电抗,切换到直流连接可以降低这些失稳风险。此外,在直流连接下,调节控制参数的方法在改善与功率相关的临界稳定性方面更为有效。因此,直流-IBS更适用于高压输电。最后,在不同网络拓扑和系统规模下,连接有交流-和直流-IBS的电力系统中验证了这些结论。

英文摘要

Massive controlled DC resources (CDCRs), such as battery energy storage systems, are connected to AC power systems through bidirectional inverters for power balance requirements. This study investigates converter-driven stability (CDS) issues in the sub-synchronous frequency range caused by large-scale bidirectional inverter-based stations (IBSs). The impacts of the AC and DC connections of IBSs on subsynchronous oscillations (SSOs) are compared by examining three factors: the number of CDCRs, power flow direction, and control parameters of the inverters. For AC connections, IBSs may induce instability as the number of CDCRs increases, regardless of the power flow direction. To maintain stability, the maximum power amplitude of the IBS is calculated. It is found that switching to DC connections can reduce these instability risks if the DC line resistance is much less than the AC line reactance. Moreover, the method of tuning control parameters is demonstrated to be more effective in improving power-related critical stability under DC connections. Therefore, The DC-IBS is preferred for high-voltage transmission. Finally, the conclusions are validated in power systems connected with both AC- and DC-IBSs under various network topologies and system scales.

2606.17562 2026-06-17 cs.CR cs.SY eess.SY 新提交

Anywhere, Any-Stymie: Remote Activation of Trojan Malware on LiDAR with Modulated Signals

任意地点,任意干扰:利用调制信号远程激活LiDAR上的特洛伊恶意软件

R. Spencer Hallyburton, Miroslav Pajic

AI总结 本研究设计了一种嵌入LiDAR固件的特洛伊恶意软件,并利用光学触发通过调制信号远程激活,实现虚假目标注入和真实目标抑制,威胁自动驾驶安全。

详情
AI中文摘要

LiDAR传感器广泛应用于自主系统的3D感知和安全关键决策。我们识别了一个先前未探索的攻击面:嵌入LiDAR感知流程中的休眠恶意软件在正常操作期间保持不活跃,部署后可通过外部触发激活,无需在攻击时访问传感器硬件或网络。为实现这一威胁,我们设计了能够进行低级点云操作的恶意软件,并将其嵌入LiDAR固件。该恶意软件是在封闭的研究测试环境中,在供应商技术支持下开发的,而非利用固有的生产供应链漏洞。为了选择性触发攻击激活,我们设计并实现了一种光学触发器,通过向感知环境传递调制信号远程激活恶意软件。一旦触发,恶意软件执行实时点云操作,我们在静态和移动受害者平台上演示了虚假目标注入和真实目标抑制。我们的评估首先建立了攻击可行性,包括300英尺的静态操作和达到35英里/小时的记录驱动通过。然后,我们定量说明注入的人形伪影可以被最先进的3D目标检测器在语义上检测到。最后,我们在部署的战术自主车辆上展示了多种安全关键影响模式。这些结果共同强调了在整个LiDAR传感器开发和部署流程中加强完整性保证的必要性。

英文摘要

LiDAR sensors are widely deployed in autonomous systems for 3D perception and safety-critical decision-making. We identify a previously unexplored attack surface in which dormant malware embedded in the LiDAR sensing pipeline remains inactive during normal operation and can be externally triggered after deployment, without requiring access to sensor hardware or networking at attack time. To operationalize this threat, we design malware capable of low-level point-cloud manipulation and embed it into LiDAR firmware. This malware was developed in a closed research test environment with vendor technical support, rather than by exploiting an inherent production supply-chain vulnerability. To selectively trigger attack activation, we design and implement an optical trigger that remotely activates the malware by delivering a modulated signal into the sensing environment. Once triggered, the malware performs real-time point cloud manipulation, and we demonstrate false object injection and real object suppression on static and mobile victim platforms. Our evaluation first establishes attack feasibility, including static operation at 300~ft and recorded drive-by runs reaching 35~mph. We then illustrate quantitatively that injected person-like artifacts can remain semantically detectable by a state-of-the-art 3D object detector. Finally, we demonstrate multiple modes of safety-critical impact on a deployed tactical autonomous vehicle. Together, these results highlight the need for stronger integrity guarantees throughout the LiDAR sensor development and deployment pipeline.

2606.17533 2026-06-17 cs.CR 新提交

SNAS: A Multi-Layer Defense-in-Depth Architecture for Secure Egress in Sandboxed Workloads

SNAS:一种用于沙箱工作负载安全出口的多层纵深防御架构

Niranjan Kumar Sharma, S Muralidhar, Samy Boshra-Riad, Mike Halcrow, Yuxiong He, Nitya Kumar Sharma, Shawn Xia, Haowei Yu, Elliott Brossard, Derek Denny-Brown, Choden Konigsmark, Bhanu Prakash, Brandon Baker, Andong Zhan

AI总结 针对沙箱工作负载的外部连接需求,提出SNAS架构,结合eBPF包过滤、GENEVE覆盖网络和分布式出口代理,实现低开销的策略驱动出口控制,并已在Snowflake全区域部署。

Comments 10 pages, 7 figures. Accepted at the 53rd IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2026), June 23-26, 2026

详情
AI中文摘要

Snowpark通过在安全沙箱中执行用户自定义函数,支持Snowflake中的数据工程和AI/ML工作负载。许多此类工作负载需要外部连接以访问云API、外部数据库或特征存储,这带来了一个可靠性挑战:如何在保持严格多租户隔离和资源公平性的同时提供透明的网络访问。本文提出了Snowpark中的安全网络访问(SNAS),一种用于沙箱工作负载安全外部通信的生产架构。SNAS结合了扩展伯克利包过滤器(eBPF)包过滤、通用网络虚拟化封装(GENEVE)覆盖网络和分布式出口代理,以实现低开销的策略驱动出口控制。我们描述了SNAS的设计、部署和测量的生产行为,包括使用最早出发时间(EDT)算法的基于eBPF的带宽限制器、双层策略执行以及连接限制和端口耗尽的保护措施。SNAS已部署在所有Snowflake区域,支持大规模生产工作负载,包括PB级数据传输和延迟敏感的外部集成。

英文摘要

Snowpark enables data engineering and AI/ML workloads in Snowflake by executing user-defined functions in secure sandboxes. Many of these workloads require external connectivity to access cloud APIs, external databases, or feature stores, creating a dependability challenge: how to provide transparent network access while preserving strict multi-tenant isolation and resource fairness. This paper presents Secure Network Access in Snowpark (SNAS), a production architecture for secure external communication from sandboxed workloads. SNAS combines Extended Berkeley Packet Filter (eBPF) packet filtering, Generic Network Virtualization Encapsulation (GENEVE) overlay networks, and distributed egress proxies for policy-driven egress control with low overhead. We describe the design, deployment, and measured production behavior of SNAS, including an eBPF-based bandwidth limiter using the Earliest Departure Time (EDT) algorithm, dual-tier policy enforcement, and safeguards for connection limiting and port exhaustion. SNAS is deployed across all Snowflake regions and supports large-scale production workloads including petabyte-scale data transfer and latency-sensitive external integrations.

2606.17521 2026-06-17 cs.MM 新提交

DiffPC: Diffusion-Based Projector Photometric Compensation

DiffPC: 基于扩散的投影仪光度补偿

Yuxi Wang, Haibin Ling, Bingyao Huang

AI总结 提出一种基于扩散模型的光度补偿方法,将投影失真建模为环境相关加性噪声,通过扩散模型逐步去噪生成补偿图像,并设计融合光度与内容特征的噪声估计网络,在未知场景中取得更优视觉表现。

详情
AI中文摘要

投影仪光度补偿纠正由表面纹理、反射和环境光照引入的颜色失真。现有的深度学习方法通常需要专业的场景特定数据收集,且缺乏对感知质量的考虑。为解决这一限制,我们提出一种基于扩散的光度补偿方法,在光度与内容感知引导下重建补偿图像。具体地,我们首先将投影过程中引入的光度失真建模为环境相关的加性噪声,从而将光度补偿问题重新表述为具有物理约束的去噪任务。接着,我们引入一个扩散模型,通过遵循加性轨迹迭代去除噪声来生成补偿图像。最后,为准确估计每个时间步的噪声,通过分析投影和拍摄物理过程中导致失真的因素,我们设计了一个融合光度感知与内容条件特征的噪声估计网络。实验表明,我们的方法在未知场景中实现了优越的视觉性能,因此相较于先前方法展现出显著的实际优势。

英文摘要

Projector photometric compensation corrects color distortions introduced by surface texture, reflection, and ambient lighting. Existing deep learning-based methods usually require professional scene-specific data collection and lack consideration for perceptual quality. To address this limitation, we present a diffusion-based photometric compensation method that reconstructs compensation images under photometric and content-aware guidance. Specifically, we first model the photometric distortions introduced during projection as environment-dependent additive noise, thereby reformulating the photometric compensation problem as a denoising task with physical constraints. Next, we introduce a diffusion model, which generates compensation images by following an additive trajectory to iteratively remove the noise. Finally, to accurately estimate the noise at each timestep, by analyzing the factors that contribute to distortions in the physical process of projection and capturing, we design a noise estimation network that incorporates features of both photometry-aware and content conditions. Experiments show that our method achieves superior visual performance in unknown scenarios, thereby exhibiting significant practical advantages over prior methods.

2606.17518 2026-06-17 cs.DC 新提交

SpecGen: Accelerating Agentic Kernel Optimization with Speculative Generation

SpecGen: 利用推测生成加速智能体内核优化

Jihu Guo, Sitian Lu, Tenghui Ma, Wei Gao, Zhisheng Ye, Xingcheng Zhang, Dahua Lin

AI总结 针对智能体内核优化中生成延迟长、反馈不足和资源利用率低的问题,提出SpecGen系统,通过推测生成在推理过程中提前产生候选内核,并行验证剖析,动态调整资源,并利用远程KV缓存减少前缀重计算,显著降低端到端时间并提升内核加速比。

详情
AI中文摘要

智能体内核优化通过迭代生成、验证和剖析,利用推理LLM自动进行手动GPU内核调优,将优化任务转化为反馈引导的搜索。然而,我们的工作负载特征揭示了三个限制搜索效率的系统级低效问题:(1) LLM推理导致的长生成延迟,(2) 不充分的剖析反馈,(3) 验证/剖析资源利用不足。我们的关键洞察是,正在进行的推理生成在完成之前暴露了一个产生额外候选内核的窗口,允许系统在出现满意内核时提前终止推理。我们提出了SpecGen,一个具有推测生成的智能体内核优化系统。首先,SpecGen在推理轨迹中精心选择的触发点分叉出非推理生成以产生内核,增加每次迭代的候选内核数量。这些内核与正在进行的推理并行进行验证和剖析,增加剖析反馈,并在生成期间保持资源忙碌。当内核满足终止条件时,SpecGen提前终止推理生成以减少生成延迟。其次,SpecGen根据到达率动态重新分配验证和剖析GPU池,并优先处理请求,以减少突发推测生成负载下的剖析反馈延迟。此外,SpecGen利用验证/剖析GPU的空闲内存作为远程KV缓存存储,以消除在有限内存预算下推测生成的前缀重计算。在H200上使用两个推理LLM的实验表明,与三个基线系统相比,SpecGen减少了端到端时间,同时产生了更多的剖析反馈,提高了资源利用率,并在固定的时间和令牌预算下改善了内核加速比。

英文摘要

Agentic kernel optimization automates manual GPU kernel tuning via iterative generation, validation, and profiling with reasoning LLMs, casting the optimization task as feedback-guided search. However, our workload characterization reveals three system-level inefficiencies that limit search efficiency: (1) long generation latency due to LLM reasoning, (2) insufficient profiling feedback, and (3) underutilized validation/profiling resources. Our key insight is that the ongoing reasoning generation exposes a window for producing additional candidate kernels before it completes, allowing the system to terminate reasoning early once a satisfactory kernel appears. We present SpecGen, an agentic kernel optimization system with \emph{speculative generation}. First, SpecGen forks non-reasoning generations at well-chosen trigger points in the reasoning trace to yield kernels, increasing the candidate kernel count per iteration. These kernels are validated and profiled in parallel with the ongoing reasoning, increasing profiling feedback, and keeping resources busy during generation. When a kernel meets the termination criterion, SpecGen terminates the reasoning generation early to reduce the generation latency. Second, SpecGen dynamically reallocates validation and profiling GPU pools based on the arrival rate and prioritizes requests to reduce profiling feedback latency under bursty speculative generation load. Furthermore, SpecGen utilizes spare memory of the validation/profiling GPUs as remote KV cache storage to eliminate prefix recomputation of speculative generations under limited memory budget. Experiments with two reasoning LLMs on H200 show that SpecGen reduces end-to-end time over three baseline systems, while producing more profiling feedback, increasing resource utilization, and improving kernel speedup under a fixed time and token budget.

2606.17512 2026-06-17 cs.HC 新提交

MedEasy: Designing AI Standardized Patients for Clinical Consultation Training

MedEasy:为临床咨询培训设计AI标准化患者

Zhiqi Gao, Huarui Luo, Guo Zhu, Bingquan Zhang, Dongyijie Primo Pan, Yizhan Feng, Jiahuan Pei, Jie Li, Benyou Wang

AI总结 提出多智能体系统MedEasy,通过患者对话、临床操作、决策提交、文档记录和反馈组织虚拟患者练习,基于形成性研究设计分阶段工作流,评估表明学习者将其视为连贯的咨询环境。

详情
AI中文摘要

AI标准化患者正成为临床咨询专业培训的一种设置。本文介绍MedEasy,一个多智能体系统,通过患者对话、临床操作、决策提交、文档记录和反馈来组织虚拟患者练习。我们首先通过与12名临床年级医学生的访谈和三次协同设计工作坊进行了形成性研究。研究结果指导了分阶段工作流、结构化病例记录、行动相关发现和基于轨迹的回顾。然后,我们与另一组12名临床年级医学生进行了评估性用户研究,每位参与者完成两个平衡的病例。学习者将MedEasy解释为一个连贯的咨询环境。他们结合患者反应、检查发现、可用操作和反馈来判断所呈现的病例是否保持连贯。他们重视可重复的练习和记录的回顾,同时对缺失的操作和反馈标准提出质疑。本文为使用病例特定标准连接情境实践的AI支持专业培训系统提供了设计启示。

英文摘要

AI standardized patients are becoming a setting for professional training in clinical consultation. This paper presents MedEasy, a multi-agent system that organizes virtual-patient practice through patient dialogue, clinical actions, decision submission, documentation, and feedback. We first conducted a formative study with 12 clinical-year medical students through interviews and three co-design workshops. The findings informed a staged workflow, structured case records, action-contingent findings, and trajectory-based review. We then conducted an evaluative user study with a separate cohort of 12 clinical-year medical students, with each participant completing two counterbalanced cases. Learners interpreted MedEasy as a connected consultation environment. They used patient responses, examination findings, available actions, and feedback together to judge whether the represented case remained coherent. They valued repeatable practice and recorded review, while questioning missing actions and feedback criteria. The paper contributes design implications for AI-supported professional training systems that use case-specific standards to connect situated practice.

2606.17510 2026-06-17 cs.SE cs.SY eess.SY 新提交

OmniDroneX: An LLM-Assisted Holistic Drone-as-a-Service Ecosystem

OmniDroneX: 一种LLM辅助的全方位无人机即服务生态系统

I-Ling Yen, Akeem Mohammed, Farokh Bastani, San-Yih Hwang

AI总结 提出OmniDroneX统一无人机即服务生态系统,通过libUAV接口和PT-SOA抽象模型连接底层物理与高层任务,利用大语言模型辅助功能识别、服务组合和自然语言任务定义,支持多种组合技术以实现可扩展、自演进的无人机系统。

Comments Ongoing project paper that is going to be submitted to JointCloud Computing

详情
AI中文摘要

尽管无人机技术取得了快速进步,但由于无人机系统研究中的若干空白,当前部署仍然有限。为应对这些挑战,我们提出OmniDroneX,一个统一的无人机即服务生态系统,其中无人机从固定功能平台转变为动态可组合实体,可与外部基础设施集成以提供全方位能力。OmniDroneX通过统一的供应商无关接口(libUAV)和形式化的物理服务抽象模型(PT-SOA)连接底层物理原语与高层任务意图。一个核心创新是大语言模型(LLM)在OmniDroneX架构多层中的多样化应用。LLM用于辅助识别和形式化原始设备功能及抽象服务定义,支持自动化服务组合和工作流生成,并实现交互式自然语言任务规范与细化。OmniDroneX还包含了动态无人机系统中至关重要的多种组合技术类别,包括用于无人机能力增强的物理层组合,以及时空、功能、协作、异常感知和基于QoS的服务组合。总体而言,这些特性使OmniDroneX能够作为在复杂动态环境中运行的可扩展、有弹性和自演进的无人机生态系统的基础。

英文摘要

Despite rapid advances in UAV technologies, current deployments remain limited due to several gaps in UAV systems research. To address these challenges, we propose OmniDroneX, a unified Drone-as-a-Service ecosystem, in which drones are transitioned from fixed function platforms into dynamically composable entities that can be integrated with external infrastructures to offer omni-capabilities. OmniDroneX bridges low-level physical primitives with high-level mission intent through a unified vendor-agnostic interface (libUAV) and a formal physical-service abstraction model (PT-SOA). A core innovation is the diverse application of large language models (LLMs) across multiple layers of the OmniDroneX architecture. LLMs are used to assist in identifying and formalizing primitive device functions and abstract service definitions, supporting automated service composition and workflow generation, and enabling interactive, natural-language mission specification and refinement. OmniDroneX also incorporates important categories of composition techniques that are essential in dynamic UAV systems, including physical layer composition for drone capability augmentation, as well as spatiotemporal, functional, collaborative, exception-aware, and QoS-based service compositions. Collectively, these features allow OmniDroneX to serve as a foundation for scalable, resilient, and self-evolving UAV ecosystems operating in complex and dynamic environments.

2606.17509 2026-06-17 eess.SY cs.SY 新提交

Data-Driven Stabilizing Controller Design for Linear Infinite Networks

线性无限网络的数据驱动镇定控制器设计

Mahdieh Zaker, Andrii Mironchenko, Amy Nejati, Abolfazl Lavaei

AI总结 提出一种直接数据驱动方法,利用噪声污染的输入-状态轨迹和线性矩阵不等式,为未知线性时不变子系统构造eISS控制Lyapunov函数和镇定反馈控制器,并通过无限维小增益条件组合为全局控制器,实现无限网络的指数稳定性。

Comments This paper has been accepted at the 27th International Symposium on Mathematical Theory of Networks and Systems (MTNS)

详情
AI中文摘要

我们提出了一种直接数据驱动方法,用于由未知线性时不变子系统组成的无限网络的控制器综合。利用从每个子系统收集的一组噪声污染的输入-状态轨迹,并假设某些线性矩阵不等式成立,通过局部构造一个eISS控制Lyapunov函数以及一个指数输入-状态镇定反馈控制器,使每个子系统达到指数输入-状态稳定(eISS)。然后,我们在无限维空间中,在组合小增益条件下组合这些局部组件,以获得全局控制Lyapunov函数和相关的镇定控制器,确保无限网络的全局一致指数稳定性。该方法在一个具有未知动力学的物理案例研究中得到验证。

英文摘要

We propose a direct data-driven method for controller synthesis of infinite networks composed of unknown linear time-invariant subsystems. Using a single set of noise-corrupted input-state trajectories collected from each subsystem, and provided that certain linear matrix inequalities hold, each subsystem is rendered exponentially input-to-state stable (eISS) by locally constructing an eISS control Lyapunov function together with an exponentially input-to-state stabilizing feedback controller. We then compose these local components under a compositional small-gain condition in infinite-dimensional spaces to obtain a global control Lyapunov function and an associated stabilizing controller, ensuring uniform global exponential stability of the infinite network. The approach is validated on a physical case study with unknown dynamics.

2606.17481 2026-06-17 cs.NI cs.ET 新提交

RATIO: Redundancy-Controlled Stochastic Routing for Reliable Vehicular Multi-Hop Networking

RATIO: 用于可靠车载多跳网络的冗余控制随机路由

Lei Lei, Xudong Wang

AI总结 提出RATIO路由协议,通过构建加权有向无环图并采用基于模数的随机转发规则,实现连续可控的冗余度,在保证高可靠性的同时降低延迟和开销。

Comments 19 pages, 8 figures

详情
AI中文摘要

车载网络中可靠、低延迟的多跳数据传输需求日益增长,但由于高移动性和间歇性阻塞导致的频繁路由失效,这一目标仍具挑战性。基于冗余的路由通过多路径转发数据包来增强鲁棒性,但过度复制会加剧竞争并引入额外延迟,凸显了精细管理冗余-可靠性权衡的必要性。然而,传统的确定性多路径复制通常将数据包复制到整数个分支,使得冗余度难以调节并适应车载网络中时变的网络动态。为此,本文提出了冗余控制随机(RATIO)路由。对于每个活跃流,RATIO构建一个加权简化有向无环图(DAG)作为路由结构,其中边权重指定每链路转发概率。在分叉节点,允许总出向转发概率超过1,并采用基于模数的随机转发规则来保证可行转发,从而实现连续可控的冗余度。理想化的RATIO设计被形式化为一个负载最小化优化问题,受限于每流的及时可靠性和链路容量约束,但在时变无线动态下该问题通常是难解的。因此,开发了一种实用的启发式算法H-RATIO。H-RATIO通过取候选路径的并集构建紧凑的简化DAG,并通过局部评分和复制调整迭代优化转发概率。广泛的基于轨迹的SUMO/ns-3联合仿真表明,与基线相比,RATIO/H-RATIO始终实现最高的及时数据包投递率,同时提供显著更好的投递效率,尤其是在高负载场景下。

英文摘要

Reliable, low-latency multi-hop data delivery in vehicular networks is increasingly demanded, yet remains challenging due to frequent route failures caused by high mobility and intermittent blockage. While redundancy-based routing enhances robustness by forwarding packets over multiple paths, over-replication intensifies contention and introduces additional delay, highlighting the need to carefully managing redundancy--reliability trade-off. However, conventional deterministic multi-path replication typically duplicates packets to an integer number of branches, making the redundancy level hard to tune and adapt to time-varying network dynamics in vehicular networks. To this end, Redundancy-Controlled Stochastic (RATIO) routing is proposed in this paper. For each active flow, RATIO constructs a weighted reduced directed acyclic graph (DAG) as the routing structure, where edge weights specify per-link forwarding probabilities. At fork nodes, the aggregate outgoing forwarding probability is allowed to exceed one and a modulo-based stochastic forwarding rule is employed to guarantee feasible forwarding, thereby enabling continuously controllable redundancy. An idealized RATIO design is formulated as a load-minimizing optimization subject to per-flow timely-reliability and link-capacity constraints, but the problem is generally intractable under time-varying wireless dynamics. Accordingly, a practical heuristic, termed H-RATIO, is developed. H-RATIO constructs a compact reduced DAG by taking the union of candidate paths and optimizes forwarding probabilities via local scoring and replication-adjustment iterations. Extensive trace-driven SUMO/ns-3 co-simulations demonstrate that RATIO/H-RATIO consistently achieves the highest timely PDR compared to baselines, while providing substantially better delivery efficiency, especially under high-load scenarios.

2606.17470 2026-06-17 cs.CY cs.HC 新提交

Self-Efficacy and Favorability Shape Learning from Tutoring Systems and Paper Practice

自我效能感和偏好影响辅导系统与纸质练习的学习效果

Xinfei Cen, Vincent Aleven, Kenneth R. Koedinger, Conrad Borchers, Paulo F. Carvalho

AI总结 研究通过平衡被试内设计,发现低基线自我效能感的学生无论练习形式如何都获得更大学习收益,且对辅导系统的偏好与学习收益正相关,但纸质练习模式不同;基于智能辅导系统的练习并未显著提升自我效能感。

Comments Full research paper accepted at EC-TEL 2026

详情
AI中文摘要

动机因素如自我效能感和学生对练习的偏好程度在塑造学习中起着关键作用,尤其是在技术支持的环境中。然而,教育干预常常忽视这些因素如何与练习形式相互作用。本文考察了自我效能感和偏好对两种常见练习形式(纸质练习和基于系统的辅导练习)学习结果的影响。通过使用匹配问题集的平衡被试内设计,我们分离了练习形式的影响,同时建模了动机差异。结果表明,无论练习形式如何,基线自我效能感较低的学生获得了更大的学习收益。在基线自我效能感较低的学生中,对辅导系统的偏好越高,在辅导练习中的学习收益越大,而在纸质练习中模式不同。与纸质方法相比,基于智能辅导系统(ITS)的练习并未显著提高训练后的自我效能感。这些发现强调了根据学生动机特征定制练习形式的潜在价值,因为辅导和纸质练习的收益随基线自我效能感和偏好而变化。它们为未来研究如何更有效地将教学形式与学习者的动机需求对齐奠定了基础。

英文摘要

Motivational factors such as self-efficacy and how favorably students feel toward practice play a crucial role in shaping learning, particularly in technology-supported environments. Yet, educational interventions often overlook how these factors interact with practice format. This paper examines the influence of self-efficacy and favorability on learning outcomes across two common practice formats: paper-based and system-based tutoring practice. Using a counterbalanced within-subject design with matched problem sets, we isolate the effect of practice format while modeling motivational differences. Results indicate that students with lower baseline self-efficacy achieved greater learning gains regardless of practice format. Among students with lower baseline self-efficacy, greater favorability toward the tutor was associated with greater learning gains during tutor practice, whereas the pattern differed in paper-based practice. Intelligent Tutoring System (ITS)-based practice did not significantly improve post-training self-efficacy relative to paper-based methods. These findings underscore the potential value of tailoring practice format to students' motivational profiles, as the benefits of tutor- and paper-based practice varied with baseline self-efficacy and favorability. They lay the groundwork for future research on how instructional formats can be aligned more effectively with learners' motivational needs.

2606.17468 2026-06-17 cs.IR 新提交

RSRank: Learning Relevance from Representational Shifts

RSRank: 从表示偏移中学习相关性

Archit Gupta, Sai Sundaresan, Debabrata Mahapatra

AI总结 针对RAG系统中重排序依赖启发式阈值和语言模型logit信号的问题,提出基于表示偏移(RS)的相关性信号,通过轻量级训练框架学习映射,在零阈值下过滤无关内容,超越现有重排序器。

Comments Under Peer Review

详情
AI中文摘要

随着企业部署基于RAG的系统为用户查询提供有依据的响应,重排序已成为最终过滤步骤的关键组成部分,用于区分相关文档与分散注意力或不相关的文档。现有的重排序器通常依赖启发式阈值来实现最优过滤。此外,对于相关性评分,最先进的方法使用语言模型的logit信号,这些信号是为下一个词预测设计的,而非用于评估相关性。为了解决这些局限性,我们确定了一个原则性的相关性信号:当以文档为条件时,查询内部状态中引起的表示偏移(RS)。我们观察到,(a) 候选文档引起的RS与(b) 预言文档集引起的RS之间的对齐提供了相关性的稳健指标。基于这一见解,我们引入了一个轻量级训练框架,学习将RS映射到校准的相关性分数的投影。我们的训练目标在零阈值下自然过滤不相关内容,减少了对启发式调优的依赖。在多种检索数据集上,我们的方法相比最先进的重排序器取得了性能提升。

英文摘要

As enterprises deploy RAG-based systems to provide grounded responses to user queries, reranking has become a critical component for the final filtering step that separates relevant from distracting or irrelevant documents. Existing rerankers often rely on heuristic thresholds to achieve optimal filtering. Moreover, for relevance scoring, state-of-the-art methods use a language model's logit signals, which are designed for next-token prediction, not for assessing relevance. To address these limitations, we identify a principled signal for relevance: the representational shift (RS) induced in a query's internal state when conditioned on a document. We observe that the alignment between (a) RS induced by a candidate document and (b) RS induced by an oracle document-set provides a robust indicator of relevance. Building on this insight, we introduce a lightweight training framework that learns projections mapping RS to calibrated relevance scores. Our training objectives naturally filter irrelevant content at a zero threshold, reducing dependence on heuristic tuning. Across diverse retrieval datasets, our method delivers gains over SOTA rerankers.

2606.17458 2026-06-17 cs.CE 新提交

ICBCBench: An Industry Consortium Benchmark for Financial Deep Research

ICBCBench:面向金融深度研究的行业联盟基准

Weiya Li, Zhiwei Tang, Yizhou He, Chenghao Wang, Liang Feng, Xiao Sun, Dongrui Liu, Zichen Wen, Hu Wei, Jinghang Wang, Yi Luo, Li Guo, Linfeng Zhang

AI总结 针对金融领域深度研究代理评估标准缺失的问题,提出ICBCBench基准,采用客观任务与主观报告评估双轨范式,揭示当前模型在复杂推理、事实依据和报告质量上的显著差距。

Comments 33 pages, 14 figures. Preprint. Under review

详情
AI中文摘要

随着深度研究代理在金融等知识密集型领域的快速发展,建立可靠且与领域对齐的评估标准仍然是一个关键挑战。现有基准要么专注于封闭式问答,要么专注于开放式报告评估,未能共同捕捉实际工作流程中所需的检索-推理准确性和端到端研究质量。我们引入了ICBCBench,一个由联盟驱动的金融深度研究基准,与来自广泛金融机构和学术界的领域专家合作开发,涉及超过40个组织的50多位专家。它采用双轨范式,整合了具有可验证答案的客观任务和主观长篇报告评估,从而在专家对齐性、引用一致性和来源质量方面,实现对检索-推理准确性和端到端报告质量的互补评估。对最先进的深度研究代理和大语言模型的实验揭示了在复杂推理、事实依据和报告质量方面的显著差距,凸显了实现行业级性能的挑战。我们的数据集和评估框架可在以下网址获取:this https URL。

英文摘要

With the rapid advancement of Deep Research Agents in knowledge-intensive domains such as finance, establishing reliable and domain-aligned evaluation standards remains a critical challenge. Existing benchmarks focus on either closed-ended question answering or open-ended report evaluation, failing to jointly capture retrieval-reasoning accuracy and end-to-end research quality required in real-world workflows. We introduce ICBCBench, a consortium-driven benchmark for financial deep research, developed in collaboration with domain experts from a broad range of financial institutions and academia, involving over 50 experts across more than 40 organizations. It adopts a dual-track paradigm integrating objective tasks with verifiable answers and subjective long-form report evaluation, enabling complementary assessment of retrieval-reasoning accuracy and end-to-end report quality in terms of expert alignment, citation consistency, and source quality. Experiments on state-of-the-art DRAs and large language models reveal substantial gaps in complex reasoning, factual grounding, and report quality, highlighting the challenges of achieving industry-level performance. Our dataset and evaluation framework are available at https://github.com/DeepFin-Intelligence/ICBCBench.

2606.17421 2026-06-17 cs.CR 新提交

Bifrost: Hybrid TEE-FHE Inference for Privacy-Preserving Transformer and LLM Serving

Bifrost: 面向隐私保护Transformer和LLM服务的混合TEE-FHE推理架构

Chenghao Chen, Kailun Qin, Xiaolin Zhang, Chi Zhang, Dawu Gu

AI总结 提出Bifrost混合架构,利用CPU TEE处理非线性操作和状态刷新,FHE加密线性层委托给加速器,实现安全高效的LLM推理,相比纯FHE方案延迟降低9-53倍。

详情
AI中文摘要

云端托管的Transformer和大语言模型(LLM)推理会产生直接的机密性问题:用户提示可能包含敏感代码、业务数据、个人信息或受监管文档,而远程服务会将中间状态暴露给云端软件栈和加速器运行时。全同态加密(FHE)使加速器端的执行仅处理密文,但端到端的LLM推理仍然昂贵,因为线性层与非线性、缓存状态和刷新敏感操作交错在一起。CPU可信执行环境(TEE)可以原生执行这些操作,但仅靠CPU TEE无法定义不受信任的加速器应如何参与。我们提出Bifrost,一种混合TEE-FHE服务架构,其中秘密仅提供给经过认证的CPU TEE,而加速器、设备内存、驱动/运行时栈和主机软件均不在可信计算基内。Bifrost使用FHE作为安全委托机制,用于加速器支持的CKKS上的投影和前馈线性层,而非线性操作、注意力侧控制逻辑、KV状态转换以及解密再加密刷新在CPU TEE内执行。Bifrost+进一步采用预填充/解码拆分:提示侧KV状态在CPU TEE内构建,仅解码侧状态进入混合密文路径。在与Euston方法匹配的估计器风格比较中,Bifrost在GPT-2(1.5B)上将预计延迟降低9.25倍,在LLaMA 3(8B)上降低9.91倍。在直接CKKS/FHE部署中,Bifrost+在GPT-2(124M)上将TTFT降低14.6-45.8倍,在Qwen3(0.6B)上降低15.3-53.4倍。系统经验是选择性加密执行:仅在需要仅密文加速器委托时使用FHE,并将非线性、刷新和提示侧工作保留在CPU TEE内。

英文摘要

Cloud-hosted transformer and large language model (LLM) inference creates a direct confidentiality problem: user prompts may contain sensitive code, business data, personal information, or regulated documents, yet remote serving exposes intermediate state to the cloud software stack and accelerator runtime. Fully homomorphic encryption (FHE) keeps accelerator-side execution ciphertext-only, but end-to-end LLM inference remains expensive because linear layers are interleaved with non-linear, cache-state, and refresh-sensitive operators. CPU trusted execution environments (TEEs) can execute those operators natively, but a CPU TEE alone does not define how an untrusted accelerator should participate. We present Bifrost, a hybrid TEE-FHE serving architecture in which secrets are provisioned only to an attested CPU TEE, while the accelerator, device memory, driver/runtime stack, and host software remain outside the trusted computing base. Bifrost uses FHE as a secure delegation mechanism for projection and feed-forward linear layers on accelerator-backed CKKS, while non-linear operators, attention-side control logic, KV-state transitions, and decrypt-then-encrypt refresh execute inside the CPU TEE. Bifrost+ further applies a prefill/decode split: prompt-side KV state is built inside the CPU TEE, and only decode-side state enters the hybrid ciphertext path. In an estimator-style comparison matching Euston's methodology, Bifrost reduces projected latency by 9.25x on GPT-2 (1.5B) and 9.91x on LLaMA 3 (8B). In direct CKKS/FHE deployments, Bifrost+ reduces TTFT by 14.6-45.8x on GPT-2 (124M) and 15.3-53.4x on Qwen3 (0.6B). The systems lesson is selective encrypted execution: use FHE only where ciphertext-only accelerator delegation is required, and keep non-linear, refresh, and prompt-side work inside the CPU TEE.

2606.17415 2026-06-17 cs.GT 新提交

Pure or Unstable: A Generic Dichotomy for Strong Stackelberg Commitments

纯策略或不稳定:强Stackelberg承诺的通用二分法

Kamil Bulinski, Lang White, Hung Nguyen

AI总结 研究有限领导者-跟随者博弈中强Stackelberg均衡的稳定性,证明当领导者效用从连续分布中采样时,最优承诺几乎必然是纯策略且稳定,或混合策略且不稳定。

Comments 19 pages

详情
AI中文摘要

我们研究了当跟随者的最优反应对应是集值时,有限领导者-跟随者博弈中强Stackelberg均衡(SSE)的鲁棒性。虽然乐观的破平(有利于领导者)被普遍采用,但它可能依赖于刀刃上的无差异。我们形式化了一个稳定性概念:如果在领导者承诺的策略下,跟随者有一个严格降低领导者效用的替代最优反应,则该SSE是不稳定的。我们的主要结果建立了一个尖锐的通用二分法。固定跟随者的效用并从任意连续分布中采样领导者的效用,以概率1,最优Stackelberg承诺是唯一的,并且要么是(i)纯策略,要么是(ii)混合策略且不稳定。当两个玩家的效用都通用地采样时,这加强为:以概率1,唯一的最优承诺要么是纯策略且稳定,要么是混合策略且不稳定。这些定理补充了von Stengel和Zamir的经典通用值结果,表明即使乐观和悲观的领导者值在通用情况下一致,当最优性需要真正的随机化时,策略层面的SSE预测在通用情况下是脆弱的。我们进一步将此视角应用于Stackelberg满足博弈,通过反例反驳了先前工作中的猜想,并确定了该猜想仍然成立的条件。

英文摘要

We study the robustness of the Strong Stackelberg Equilibrium (SSE) in finite leader--follower games when the follower's best-response correspondence is set-valued. While optimistic tie-breaking (in the leader's favor) is commonly adopted, it can hinge on knife-edge indifferences. We formalize a stability notion: an SSE is unstable if, at the leader's committed strategy, the follower has an alternative best response that strictly reduces the leader's payoff. Our main results establish a sharp generic dichotomy. Fixing the follower's utility and sampling the leader's utility from any continuous distribution, with probability one the optimal Stackelberg commitment is unique and is either (i) pure, or (ii) mixed and unstable. When both players' utilities are sampled generically, this strengthens to: with probability one, the unique optimal commitment is either pure and stable or mixed and unstable. These theorems complement the classic generic-value result of von Stengel and Zamir by showing that even when optimistic and pessimistic leader values coincide generically, the strategy-level SSE prediction is generically fragile whenever optimality requires genuine randomization. We further apply this perspective to Stackelberg satisfaction games, disproving a conjecture from prior work via counterexamples and identifying conditions under which it nonetheless holds.

2606.17411 2026-06-17 cs.SI 新提交

Sender--Receiver Community Detection in Directed Networks via Node-Role-Constrained Edge Clustering

有向网络中基于节点角色约束边聚类的发送者-接收者社区检测

Duy Hieu Do

AI总结 提出TT-SR框架,为每个节点分配发送者和接收者角色,通过双层次规则优化角色分配,在保持可解释性的同时提升有向边社区恢复性能。

Comments Preprint, 25 pages

详情
AI中文摘要

有向社区检测具有挑战性,因为边方向编码了不对称的源-目标关系。大多数有向模块度和随机游走方法为每个顶点分配一个标签,而最近基于双模性的方法更自由地聚类有向边。我们提出TT-SR,一个介于这两种观点之间的双层发送者-接收者框架。每个顶点被分配一个发送者角色和一个接收者角色,每条有向边获得由其源节点的发送者角色和目标节点的接收者角色诱导的类型。因此,TT-SR比单标签顶点聚类更具表现力,同时比无限制边聚类更易解释。该方法从计数残差、平稳流、度修正和顺序得分视角生成候选发送者-接收者分配。候选分配通过局部角色更新进行优化,并由双层规则选择:度修正轮廓得分提供主要结构标准,而伯努利密度和顺序流得分仅作为次要排序信号。我们通过发送者-接收者模块度松弛证明了主要谱视角的合理性,并将度修正得分解释为基于似然的残差比较。在路径型、共块和有序流合成基准上的实验表明,TT-SR在三种规模设置下实现了最强或基本持平的最强边社区恢复。在度修正共块和有序流图上增益最为显著。真实网络诊断进一步表明,TT-SR与Email-Eu-core元数据良好对齐,并在未标记的有向网络上提取出强烈的发送者-接收者双社区摘要。

英文摘要

Directed community detection is challenging because edge directions encode asymmetric source-target relations. Most directed modularity and random-walk methods assign one label to each vertex, whereas recent bimodularity-based methods cluster directed edges more freely. We propose TT-SR, a Two-Tier Sender-Receiver framework that lies between these two viewpoints. Each vertex is assigned a sender role and a receiver role, and each directed edge receives the type induced by the sender role of its source and the receiver role of its target. Thus, TT-SR is more expressive than one-label vertex clustering while remaining more interpretable than unrestricted edge clustering. The method generates candidate sender-receiver assignments from count-residual, stationary-flow, degree-corrected, and order-score views. The candidates are refined by local role updates and selected by a two-tier rule: a degree-corrected profile score provides the primary structural criterion, while Bernoulli density and order-flow scores are used only as secondary ranking signals. We justify the main spectral views through sender-receiver modularity relaxations and interpret the degree-corrected score as a likelihood-based residual comparison. Experiments on pathway-type, co-block, and ordered-flow synthetic benchmarks show that TT-SR achieves the strongest or essentially tied strongest edge-community recovery across three scale settings. The gains are most pronounced on degree-corrected co-block and ordered-flow graphs. Real-network diagnostics further indicate that TT-SR aligns well with Email-Eu-core metadata and extracts strong sender-receiverbicommunity summaries on unlabeled directed networks.

2606.17390 2026-06-17 cs.CE 新提交

A Differentiable GPU-Accelerated Finite Element Framework for Inverse Characterization of Finite-Strain Anisotropic Plasticity

一种用于有限应变各向异性塑性逆向表征的可微分GPU加速有限元框架

Deepak Sharma, Itzel Salgado, Lu Huang, Hui-Ping Wang, Jian Cao

AI总结 提出基于JAX的可微分GPU加速有限元框架,通过并行化非线性FEM三大瓶颈实现高效正向模拟,并利用自动微分进行逆向表征,准确恢复各向异性屈服和硬化参数。

详情
AI中文摘要

我们提出了一个完全可微分、GPU加速的有限元框架,用于有限应变各向异性弹塑性材料的正向模拟和逆向表征。该框架基于JAX构建,通过并行化非线性FEM中的三个主要计算瓶颈:单元弱形式和切线刚度评估、全局稀疏矩阵组装以及稀疏线性求解,充分利用现代加速器架构。对于具有300万自由度的正向问题,JAX-FEM在单个NVIDIA H100 GPU上比24核CPU Abaqus基线实现了高达9.4倍的加速。自动微分应用于本构更新和求解器工作流程,为复杂本构模型提供一致的雅可比矩阵,无需手动推导,并为PDE约束的逆向分析提供精确梯度。与有限差分相比,JAX-AD梯度避免了步长敏感性,并以显著更低的计算成本提供所需的灵敏度。在逆向表征中,我们将信息丰富、拓扑优化的异质试件几何与全场位移数据相结合,以识别具有许多参数的复杂本构模型,否则需要多次常规实验才能表征。我们展示了在逐步具有挑战性的设置中准确恢复各向异性屈服和硬化参数,包括均匀和空间变化的材料属性。由此产生的基于AD的公式能够在高维参数空间中进行高效优化,而有限差分方法在计算上不可行。这些结果确立了可微分、GPU加速的有限元方法作为先进制造中模拟、表征和优化工作流程的高通量引擎的实用性。

英文摘要

We present a fully differentiable, GPU-accelerated finite element framework for forward simulation and inverse characterization of finite-strain anisotropic elastoplastic materials. Built on JAX, the framework exploits modern accelerator architectures by parallelizing the three major computational bottlenecks in nonlinear FEM: elemental weak-form and tangent-stiffness evaluation, global sparse matrix assembly, and sparse linear solution. For a large-scale forward problem with 3 million degrees of freedom, JAX-FEM on a single NVIDIA H100 GPU achieves up to 9.4$\times$ speed-up over a 24-core CPU Abaqus baseline. Automatic differentiation is applied through the constitutive update and solver workflow, providing consistent Jacobians for complex constitutive models without manual derivation and accurate gradients for PDE-constrained inverse analysis. Compared with finite differences, the JAX-AD gradients avoid step-size sensitivity and provide the required sensitivities at substantially lower computational cost. For inverse characterization, we combine information-rich, topology-optimized heterogeneous specimen geometries with full-field displacement data to identify complex constitutive models with many parameters that would otherwise require many conventional experiments to characterize. We demonstrate accurate recovery of anisotropic yield and hardening parameters in progressively challenging settings, including uniform and spatially varying material properties. The resulting AD-based formulation enables efficient optimization in high-dimensional parameter spaces where finite-difference approaches are computationally infeasible. These results establish differentiable, GPU-accelerated FEM as a practical high-throughput engine for simulation, characterization, and optimization workflows in advanced manufacturing.

2606.17387 2026-06-17 cs.SE 新提交

Supporting the Adoption of Privacy-Enhancing Technologies through Requirements Engineering

通过需求工程支持隐私增强技术的采用

Oleksandr Kosenkov, Vadym Honcharenko, Abhinava Singh, Volodymyr Spirin, Danica Vranjanin

AI总结 本文从需求工程视角分析隐私增强技术(PETs)在软件工程中采用面临的跨利益相关者和跨学科挑战,提出通过系统化处理工程、商业和法律视角来促进PETs采用。

Comments Accepted to the 34th International Requirements Engineering Conference (RE 2026), Montreal, Canada, from 17 to 21 August 2026

详情
AI中文摘要

近几十年来,隐私增强技术(PETs)已被视为在处理个人数据的软件系统中满足监管和用户隐私要求的一种手段。尽管有大量的研究努力、监管机构的支持、谷歌和微软等大型技术公司的贡献以及软件从业者日益增长的兴趣,PETs的实际采用仍然有限。现有研究一致指出了软件工程中PETs采用面临的反复出现的挑战,例如技术复杂性和培训不足。尽管正在进行研究努力,这些挑战在实践中很大程度上仍未解决。在这篇工业挑战论文中,我们采用一种实用的、需求工程(RE)驱动的视角,考察了多个利益相关者群体(PET开发者、集成者和采用者)以及不同学科视角(工程、法律和商业)下PET采用面临的挑战。我们认为,RE可以通过系统地处理隐私的互补工程、商业和法律观点来促进PETs的采用。忽视这些观点中的任何一个挑战(例如,PETs对软件架构的影响、其商业影响及其对法规遵从的贡献)都可能增加障碍,甚至导致实施失败。在实践中,在RE中明确指定这些观点可以实现利益相关者之间有意义的协调,从而更有效地在软件工程中实现PETs的好处。

英文摘要

In recent decades, privacy-enhancing technologies (PETs) have been recognized as a means of meeting regulatory and user privacy requirements in software systems that process personal data. Despite substantial research efforts, support from regulators, contributions by large technology companies such as Google and Microsoft, and growing interest among software practitioners, the practical adoption of PETs remains limited. Existing research consistently identifies recurring challenges to PETs adoption in SE, such as technical complexity and insufficient training. Despite ongoing research efforts, these challenges largely remain unresolved in practice. In this industrial challenge paper, we apply a practical, requirements engineering (RE)-driven perspective to examine challenges to PET adoption across multiple stakeholder groups (PET developers, integrators, and adopters) as well as across different disciplinary perspectives (engineering, law, and business). We argue that RE can facilitate the adoption of PETs by systematically addressing each of the complementary engineering, business, and legal viewpoints on privacy. Neglecting challenges in any of these viewpoints (e.g., the impact of PETs on software architecture, their business implications, and their contribution to regulatory compliance) can increase the impediments or even lead to implementation failure. In practice, explicit specification of these viewpoints within RE can enable meaningful coordination among stakeholders to more effectively realize the benefits of PETs in software engineering.

2606.17378 2026-06-17 cs.DC 新提交

RISE: Relay Inference and Online Scheduling for Efficient Edge-Device Collaborative Diffusion Model Services

RISE: 面向高效边缘-设备协同扩散模型服务的接力推理与在线调度

Zilan Huang, Zhiqing Tang, Hanshuai Cui, Tian Wang, Yuan Wu, Weijia Jia, Wei Zhao

AI总结 提出RISE方法,通过训练无关的接力机制利用模型家族共享潜空间,将边缘大模型与设备小模型结合,并采用上下文感知调度器优化质量与延迟权衡。

Comments to be published in IEEE ICWS 2026

详情
AI中文摘要

文本到图像扩散模型越来越多地部署在网络边缘,以服务于具有不同质量和延迟要求的异构工作负载。然而,现有的部署策略要么选择具有高保真度但高延迟的大型边缘端模型,要么选择速度较快但语义连贯性较差的轻量级设备端模型。此外,这些方法很少在不同大小的模型之间跨边缘服务器和用户设备分割去噪工作负载。为了弥合这一差距,我们提出了RISE,一种用于边缘-设备扩散模型服务的方法,该方法结合了接力推理与在线调度。受潜强度在模型切换后表现出最小偏差这一发现的驱动,RISE使用了一种训练无关的接力机制,利用模型家族内的共享潜空间:边缘端的大模型处理早期塑造语义结构的去噪步骤,然后将中间潜变量传递给设备端的小模型进行细节细化。为了将该机制部署为实际服务,一个上下文赌博机调度器根据提示复杂度、用户偏好、网络质量和实时节点负载选择最佳接力配置。在两个基准上的实验表明,RISE的接力机制在保持完整模型质量的同时实现了高达2.1倍的加速,其上下文感知调度器在混合工作负载下有效平衡了质量和延迟。

英文摘要

Text-to-image diffusion models are increasingly deployed at the network edge to serve heterogeneous workloads with diverse quality and latency requirements. However, existing deployment strategies choose either large edge-side models with high fidelity but high latency or lightweight device-side models that offer speed at the cost of semantic coherence. Moreover, these approaches rarely split the denoising workload between models of different sizes across edge servers and user devices. To bridge this gap, we propose RISE, a method for edge-device diffusion model services that combines relay inference with online scheduling. Driven by the finding that the latent intensity exhibits minimal deviation after a model handoff, RISE uses a training-free relay mechanism that exploits the shared latent space within a model family: the large model on the edge handles the early denoising steps that shape semantic structure, then passes the intermediate latent to a small device-side model for detail refinement. To deploy this mechanism as a practical service, a contextual bandit scheduler selects the best relay configuration based on prompt complexity, user preferences, network quality and real-time node loads. Experiments on two benchmarks show that RISE's relay mechanism achieves up to 2.1$\times$ speedup while preserving full-model quality, and its context-aware scheduler effectively balances quality and latency under mixed workloads.

2606.17374 2026-06-17 cs.LO cs.PL cs.SE 新提交

Verifying the Rust Standard Library

验证 Rust 标准库

Byron Cook, Remi Delmas, Zyad Hassan, Bart Jacobs, Ranjit Jhala, Rahul Kumar, Felipe R. Monteiro, Thanh Nguyen, Rebecca Rumbul, Michael Tautschnig, Celina Val, Carolyn Zech

AI总结 通过众包方式集成多种验证工具,对 Rust 标准库中的不安全代码进行静态验证,发现并修复未定义行为,展示了大规模验证的可行性与挑战。

Comments Published at 18th NASA Formal Methods Symposium (NFM 2026)

详情
Journal ref
In: Deshmukh, J., Havelund, K., Pinto, A. (eds) NASA Formal Methods. NFM 2026. Lecture Notes in Computer Science, vol 16622. Springer, Cham, pp. 415-435
AI中文摘要

Rust 的类型系统防止了许多类别的内存错误,但其标准库严重依赖不安全代码,这些代码的正确性通过测试(包括在 Miri 下的动态检查)来验证,但缺乏静态验证。我们提出了据我们所知,针对软件库所报告的最大规模验证活动:一个开放的、众包的努力,将互补的验证工具集成到一个从 Rust 标准库分支出来的验证仓库的持续集成中。我们分析了该活动的有效性,讨论了机器检查证明对于一部分未定义行为(例如越界访问、空指针和悬垂指针解引用以及使用未初始化内存)的实际价值,并将剩余障碍作为开放挑战呈现给形式化方法社区。

英文摘要

Rust's type system prevents many classes of memory errors, yet its standard library relies heavily on unsafe code whose correctness is validated through testing, including dynamic checks under Miri, but lacks static verification. We present what is, to the best of our knowledge, the largest verification campaign reported for a software library: an open, crowdsourced effort that integrates complementary verification tools into the continuous integration of a verification repository forked from the Rust standard library. We analyze the campaign's effectiveness, discuss the practical value of machine-checked proofs for a subset of undefined behaviors (e.g., out-of-bounds access, null and dangling pointer dereferences, and use of uninitialized memory), and frame the remaining obstacles as open challenges for the formal-methods community.