arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2115
2606.17986 2026-06-17 cs.CR 新提交

ShellGames: Speculative LLM-Driven SSH Deception

ShellGames: 基于LLM的推测性SSH欺骗

Umberto Salviati, Fabio De Gaspari, Mauro Conti, Luigi Vincenzo Mancini

AI总结 针对LLM在欺骗系统中缺乏持久状态、输出不一致等问题,提出ShellGames,结合思维链、记忆管理、推测执行等五种技术,在正确性、一致性、状态跟踪和鲁棒性上显著优于基线。

详情
AI中文摘要

网络欺骗和移动目标防御是有前景的策略,旨在通过增加不确定性来干扰对手。然而,与对手维持长期、可信的交互会话仍然是一个开放挑战。大型语言模型(LLM)为更动态的欺骗系统提供了有希望的路径,但存在关键限制,从根本上限制了其适用性,包括:缺乏持久状态、输出不一致、幻觉、延迟以及可能暴露欺骗的行为颠覆易感性。我们提出了ShellGames,一个基于LLM的SSH shell模拟器,旨在解决这些限制。ShellGames结合了五种互补技术:(i) 自动思维链和少样本学习以提高正确性;(ii) 内存管理以维持系统状态一致性;(iii) 推测性命令执行以减少响应延迟;(iv) 将复杂交互命令智能路由到沙盒环境;以及(v) 利用shell环境的受限输入输出域进行颠覆检测。为了进行系统评估,我们引入了一个标准化的基准测试协议和数据集,涵盖正确性、一致性、状态跟踪和鲁棒性任务。ShellGames在正确性上达到0.898的命令准确率(比基线高5.3个百分点),一致性上达到0.918的序列级准确率(高36个百分点),状态跟踪准确率0.98(高18.3个百分点),鲁棒性准确率0.95(高37个百分点)。一项有20名参与者的用户研究证实,ShellGames在自由探索下实现了与真实shell相当的真实感,并且在感知命令覆盖率上优于传统蜜罐。

英文摘要

Cyber deception and Moving Target Defense are promising strategies that aim to disrupt adversaries by increasing uncertainty. However, sustaining long-lived, credible interactive sessions with adversaries remains an open challenge. Large Language Models (LLMs) offer a promising path toward more dynamic deception systems, but suffer from key limitations that fundamentally limit their applicability, including: lack of persistent state, output inconsistencies, hallucinations, latency, and susceptibility to behavioral subversion that may reveal the deception. We propose ShellGames, an SSH shell simulator based on LLM designed to address these limitations. ShellGames combines five complementary techniques: (i) Automatic Chain-of-Thought and few-shot learning to improve correctness; (ii) memory management to maintain system state coherency; (iii) speculative command execution to reduce response latency; (iv) smart routing of complex interactive commands to a sandboxed environment; and (v) subversion detection leveraging the constrained input-output domain of shell environments. To enable systematic evaluation, we introduce a standardized benchmarking protocol and dataset spanning correctness, consistency, state tracking, and robustness tasks. ShellGames achieves $0.898$ command accuracy on correctness ($+5.3pp$ over baselines), $0.918$ sequence-level accuracy on consistency ($+36pp$), $0.98$ state tracking accuracy ($+18.3pp$), and $0.95$ accuracy on robustness ($+37pp$). A user study with $n=20$ participants confirms that ShellGames achieves realism comparable to a real shell under free exploration and outperforms traditional honeypots on perceived command coverage.

2606.17981 2026-06-17 cs.SE 新提交

Planning to Hammer: Difficulty-Aware Decomposition for Automating Rocq Proofs

规划锤击:面向自动化Rocq证明的难度感知分解

Ning Zhang, Nongyu Di, Zenan Li, Yuan Yao, Xiaoxing Ma

AI总结 提出Quarry框架,通过LLM规划证明分解并利用难度模型排序子目标,结合CoqHammer自动证明,在Rocq基准测试中成功率提升7%-13%。

Comments 26 pages, 8 figures; submitted to OOPSLA 2026

详情
AI中文摘要

随着AI生成代码的普及,形式化验证(特别是通过Rocq和Isabelle等交互式定理证明器)对于确保软件正确性变得越来越重要。然而,在这些证明器中生成机器检查的证明仍然是一个瓶颈。现有解决方案在证明自动化方面具有互补优势:大型语言模型(LLM)可以提出高级证明策略但缺乏局部严谨性,而CoqHammer等自动化策略可以可靠地处理许多局部目标但缺乏长期规划能力。为了结合两者优点,我们提出了Quarry,一个基于规划的证明合成框架,将证明规划与证明执行分离。具体来说,Quarry要求LLM主动提出多个带有任意子引理的证明分解,在Rocq中临时假设子引理进行类型检查,并使用基于证明状态的难度模型(估计锤子可解性)对候选方案进行排序。然后,它在有限预算内递归证明子引理,有效地将长证明转化为一系列锤子可解的义务序列。我们在SerAPI和CoqHammer之上实现了Quarry,并使用多个前沿LLM在多个基准测试上进行了评估。实验结果表明,基于规划的分解与可解性感知排序显著提高了自动化程度,同时保持了可预测的成本。在统一的10分钟墙钟预算下,Quarry在三个Rocq基准测试中的成功率比最强基线提高了7%到13%。这些结果表明,通过协调神经规划与符号执行(而非取代其中任何一个),可以实现可靠的证明自动化。

英文摘要

As AI-generated code proliferates, formal verification, particularly through interactive theorem provers such as Rocq and Isabelle, becomes increasingly important for ensuring software correctness. However, producing machine-checked proofs in such provers remains a bottleneck. Existing solutions bring complementary strengths to proof automation: large language models (LLMs) can propose high-level proof strategies but lack local rigor, while automated tactics such as CoqHammer can reliably discharge many local goals but lack long-range planning capabilities. To combine the best of both worlds, we present Quarry, a planning-based proof synthesis framework that separates proof planning from proof execution. Specifically, Quarry asks an LLM to actively propose multiple proof decompositions with arbitrary sublemmas, type-checks them in Rocq under temporarily admitted sublemmas, and ranks candidates using a proof-state-based difficulty model that estimates hammer solvability. It then recursively proves sublemmas within a bounded budget, effectively turning long proofs into sequences of hammer-solvable obligations. We implement Quarry on top of SerAPI and CoqHammer and evaluate it using multiple frontier LLMs across multiple benchmarks. The experimental results show that planning-based decomposition with solvability-aware ranking substantially improves automation while maintaining predictable cost. Under a uniform 10-minute wall-clock budget, Quarry improves over the strongest baseline by 7% to 13% in success rate across three Rocq benchmarks. These results demonstrate that reliable proof automation can be achieved by coordinating neural planning with symbolic execution rather than replacing either.

2606.17957 2026-06-17 cs.CR cs.CY cs.HC 新提交

Children Are Not the Enemy: Child-Fit Security as an Alternative to Bans and Surveillance

儿童不是敌人:作为禁令和监控替代方案的儿童适配安全

Kopo M. Ramokapane, Rui Huan, Zaina Dkaidek, Awais Rashid

AI总结 提出“儿童适配安全”设计范式,将儿童视为合法用户而非威胁,以儿童福祉、发展、隐私、安全、自主权和权利为核心安全需求,保护儿童及其参与。

Comments 14 pages, 2 figures, Paper Under review

详情
AI中文摘要

数字技术现已成为儿童学习、游戏、交流、身份形成和社会参与的核心。然而,儿童在线安全的主流方法往往依赖于遏制机制,包括禁令、年龄门控、家长控制、监控和屏幕时间限制。这些方法在特定情境下可能有用,但它们通常将儿童保护主要视为限制访问为成人设计的系统的问题。在本文中,我们认为这种框架对儿童的数字生活而言是不充分的,并且作为安全范式是不够的。我们提出儿童适配安全(Child-fit security),这是一种设计范式,其中可能被儿童使用的技术将儿童视为合法用户,而非需要排除的攻击者、需要修补的漏洞或需要管理的风险。在该范式中,儿童的福祉、发展、隐私、安全、自主权和权利成为核心安全需求。这将保护的重点从应用、账户和数据转移到儿童-系统关系上,意味着同时保护儿童及其参与。我们概念化儿童适配安全,将其与面向遏制的方法进行对比,定义其核心原则,并讨论其对安全设计的影响。最后,我们提出了一个使儿童适配安全可操作化的研究议程。

英文摘要

Digital technologies are now central to children's learning, play, communication, identity formation, and social participation. Yet dominant approaches to children's online safety often rely on containment mechanisms, including bans, age gates, parental controls, monitoring, and screen-time restrictions. These approaches can be useful in specific contexts, but they often frame child protection primarily as a problem of restricting access to systems designed for adults. In this paper, we argue that this framing is inadequate for children's digital lives and insufficient as a security paradigm. We propose Child-fit security, a design paradigm in which technologies likely to be used by children treat a child as legitimate users, not attackers to be excluded, vulnerabilities to be patched, or risks to be managed. In this paradigm, children's wellbeing, development, privacy, safety, agency, and rights become core security requirements. This shifts the focus of protection from apps, accounts, and data to the child-system relationship, which means protecting both the child and their participation. We conceptualise child-fit security, contrast it with containment-oriented approaches, define its core principles, and discuss its implications for security design. We conclude by presenting a research agenda for making child-fit security operational.

2606.17949 2026-06-17 cs.DC 新提交

RouteBalance: Fused Model Routing and Load Balancing for Heterogeneous LLM Serving

RouteBalance: 面向异构LLM服务的融合模型路由与负载均衡

Wei Da, Evangelia Kalyvianaki

AI总结 提出RouteBalance,通过融合模型路由与负载均衡为异构LLM服务实现质量、延迟和成本的三维联合优化,在13实例28GPU集群上达到最优前沿。

Comments 12 pages, 5 figures

详情
AI中文摘要

异构LLM服务栈将调度分为两个独立优化的层次:模型路由器根据质量和成本信号选择模型而忽略实例负载,服务负载均衡器优化队列而忽略质量。我们提出RouteBalance,一个服务感知的调度层,将两者融合为对具体模型实例的单一在线分配,联合权衡质量、延迟和成本。批处理进程内预测器栈和推算的实例状态使得联合决策在请求热路径上成本低廉(12 req/s时约32 ms)。在运行四种模型规模的13实例、28GPU异构集群上,单个部署的RouteBalance栈追踪了质量-成本-吞吐量三维前沿的上部区域。扫描一个权重向量即可达到最高的路由决策质量(DeepEval 0.419,比最强基线高0.013,95%置信区间[+0.005,+0.022];当第二个裁判对实际服务文本重新评分时排序保持不变),并在其成本优先角上达到与最便宜基线持平的每请求成本。在与我们构建的并发评分基线变体进行路由工程均衡后,其平衡预设以2.8秒和30 req/s提供服务,在高负载下领先增强版BEST-Route 2.6到4.1倍。(按发布方式部署这些路由器,每个请求一次串行评分调用,会使它们在高负载下崩溃23倍,这是单独隔离的部署架构效应,而非路由结果。)四臂隔离实验表明,收益来自于在模型选择时对延迟定价;学习到的预测器贡献了校准和SLO余量,而非主要前沿。代码:此 https URL

英文摘要

Heterogeneous LLM serving stacks split scheduling into two layers that optimize in isolation: model routers pick a model from quality and cost signals while ignoring instance load, and serving load balancers optimize queues while ignoring quality. We present RouteBalance, a serving-aware scheduling layer that fuses both into a single online assignment over concrete model instances, jointly trading off quality, latency, and cost. A batched in-process predictor stack and dead-reckoned instance state keep the joint decision cheap on the request hot path ($\approx$32 ms at 12 req/s). On a 13-instance, 28-GPU heterogeneous cluster serving four model sizes, a single deployed RouteBalance stack traces the upper region of the three-way quality-cost-throughput frontier. Sweeping one weight vector reaches both the highest routing-decision quality (DeepEval $0.419$, $+0.013$ over the strongest baseline, $95\%$ CI $[{+}0.005,{+}0.022]$; the ordering holds when a second judge re-scores the actually served text) and, at its cost-priority corner, per-request cost that ties the cheapest baseline. With router engineering equalized against concurrent-scoring baseline variants we build, its balanced preset serves at $2.8$ s and $30$ req/s, leading $2.6$ to $4.1\times$ ahead of enhanced BEST-Route at high load. (Deploying those routers as published, one serial scoring call per request, makes them collapse $23\times$ under load, a deployment-architecture effect we isolate separately, not the routing result.) A four-arm isolation shows the benefit follows from pricing latency at model-selection time; the learned predictors contribute calibration and SLO headroom rather than the headline frontier. Code: https://github.com/AKafakA/route-balance

2606.17925 2026-06-17 cs.GT 新提交

Parasitic Masquerade: Societal Scale Human-Machine Interaction

寄生伪装:社会尺度的人机交互

Jiejun Hu-Bolz, James Stovold

AI总结 通过图论平均场博弈模型,研究人机交互中寄生行为伪装成生产性学习的现象,发现信息流不对称和环境噪声可引发系统相变。

详情
AI中文摘要

本文通过将个体博弈论模型扩展到社会层面,拓展了近期在人机交互方面的研究。我们采用图论平均场博弈(GMFG)模型,模拟了共享环境中四组内部同质但外部异质的智能体之间的交互。结果表明,寄生行为可以伪装成生产性学习,知识分布和行为看似健康,但实际上是由机器耦合而非独立研究驱动的。为了检测这一点,我们测量了信息流方向和环境的信念熵,揭示出在所有场景中,人到机器的通道占主导地位,且在寄生状态下不对称性加剧。我们进一步证明,该系统存在共生的和寄生的均衡共存,环境噪声可以诱发一个临界点,使智能体越过认知成本障碍。这些涌现现象并非设计在任何单个智能体中,而是源于集体交互结构,强调了需要将人机社会学作为一个复杂系统整体进行研究。

英文摘要

This work extends recent developments in studying human--machine interaction by scaling from individual game-theoretic models to a societal-level model. We adopt a Graphon Mean-Field Game (GMFG) that models the interaction among four groups of internally-homogeneous but externally-heterogeneous agents in a shared environment. Our results show that parasitism can masquerade as productive learning, with knowledge distribution and actions appearing healthy while being driven by machine coupling rather than independent investigation. To detect this, we measure the direction of information flow and belief entropy of the environment, revealing that human to machine channel dominates across all scenarios, with the asymmetry intensifying under parasitism. We further demonstrate that the system exhibits coexisting mutualistic and parasitic equilibria, where environmental noise can induce a tipping point that shifts agents past the cognitive cost barrier. These emergent phenomena are not designed into any individual agent but arise from the collective interaction structure, underscoring the need to study the sociology of humans and machines holistically as a complex system.

2606.17921 2026-06-17 cs.MM 新提交

OlfactProfile: Profile-Conditioned Odor Prediction from Audiovisual Content

OlfactProfile: 基于用户嗅觉特征从视听内容预测气味

Zhengyu Lou, Bosheng Qin, Yanan Wang, Duanduan Yin, Wentao Ye, Yu Xin

AI总结 提出OlfactProfile框架,通过结构化场级用户嗅觉特征调制,实现从视听内容预测气味,优于基线模型和通用多模态大模型,在背景气味和情感气味预测上提升显著。

Comments 10 pages, 5 figures

详情
AI中文摘要

自动视频-气味匹配预测与视听内容对齐的香味,用于增强感官的媒体。现有方法通常将气味标签视为仅由场景内容决定,但气味判断也依赖于个体嗅觉特征,包括气味敏感性、对难闻气味的耐受性和情感偏好。忽略这一观察者上下文限制了当前系统预测与感知体验匹配的气味的能力。我们提出了OlfactProfile,一个用于从视听内容进行特征条件气味预测的框架。我们的结果表明,嗅觉特征并非默认有益:在匹配特征骨干网络的情况下,简单的特征拼接和统一特征调制会降低性能,而结构化的场级特征调制则持续改善预测。因此,关键挑战不仅在于观察者上下文是否可用,还在于如何将其整合到多模态推理中。为了研究这一设置,我们构建了一个视听基准,将时间对齐的气味注释与注释者的嗅觉偏好特征配对。它包含1,350个视频片段、一个99类气味词汇和三个语义气味轨道:前景气味、背景气味和情感气味。我们还提出了OAR(嗅觉感知路由),一种多模态融合模块,执行轨道感知的视听路由与场级特征调制,允许特征维度根据感知角色影响气味推理。实验表明,OlfactProfile优于监督基线和通用多模态大模型,在与气味专家的小规模人类比较中具有竞争力,并在无需任务特定微调的情况下改善了气味增强应用中的感知气味匹配。按轨道分析显示,在背景气味和情感气味上增益最强,这些领域观察者依赖的判断最为重要。

英文摘要

Automated video-odor matching predicts scents aligned with audiovisual content for scent-enhanced media. Existing methods usually treat odor labels as determined only by scene content, but odor judgment also depends on individual olfactory profiles, including scent sensitivity, tolerance to unpleasant odors, and affective preference. Ignoring this observer context limits current systems' ability to predict scents that match perceived experience. We present OlfactProfile, a framework for profile-conditioned odor prediction from audiovisual content. Our results show that olfactory profiles are not beneficial by default: with matched feature backbones, naive profile concatenation and uniform profile modulation can degrade performance, while structured field-wise profile conditioning consistently improves prediction. Thus, the key challenge is not merely whether observer context is available, but how it is integrated into multimodal reasoning. To study this setting, we construct an audiovisual benchmark pairing temporally aligned odor annotations with annotator olfactory preference profiles. It contains 1,350 video clips, a 99-class scent vocabulary, and three semantic odor tracks: Foreground Odor, Background Odor, and Emotion Odor. We also propose OAR (Olfactory-Aware Routing), a multimodal fusion module that performs track-aware audiovisual routing with field-wise profile modulation, allowing profile dimensions to influence odor reasoning according to perceptual role. Experiments show that OlfactProfile outperforms supervised baselines and general-purpose multimodal large models, is competitive with odor experts in a small human comparison, and improves perceived scent fit in scent-enhanced applications without task-specific fine-tuning. Per-track analysis shows that gains are strongest for Background Odor and Emotion Odor, where observer-dependent judgment is most important.

2606.17914 2026-06-17 eess.SY cs.SY 新提交

Three-phase model of unbalanced distribution networks with DERs

含分布式能源的不平衡配电网三相模型

S. Perna, C. Lillo, A. R. Di Fazio, M. Russo, G. M. Casolino, P. Varilone, P. Verde

AI总结 提出非近似的三相Dist3Flow支路潮流模型,用节点电压实虚部和功率流为状态变量,通过前后向回扫算法求解,适用于辐射状和闭环拓扑,经OpenDSS验证。

详情
AI中文摘要

经典的稳态配电网分析DistFlow方程无法捕捉由不对称线路、负载和分布式能源(DER)引起的三相系统固有失衡。本文将经典潮流(PF)方程扩展为严格的、非近似的三相公式,称为Dist3Flow。所提出的支路潮流模型(BFM)利用节点电压的实部和虚部以及有功和无功功率流作为状态变量。线路通过非线性前向和后向方程建模,而负载和DER分别通过ZIP模型和P-Q控制表示。通过在终端节点引入特定边界条件,该公式将PF分析推广到辐射状和闭环拓扑。通过使用前后向回扫(BFS)算法获得解。该方法在OpenDSS上针对各种配置进行了验证,考虑了开环和闭环拓扑,以及有无DER的情况。

英文摘要

Classical DistFlow equations for steady-state distribution network analysis fail to capture the inherent imbalances of three-phase systems arising from asymmetrical lines, loads, and distributed energy resources (DERs). This paper extends the classical power flow (PF) equations into a rigorous, non-approximated three-phase formulation, termed Dist3Flow. The proposed branch flow model (BFM) utilizes the real and imaginary components of nodal voltages and the active and reactive power flows as state variables. Lines are modelled by nonlinear forward and backward equations, while loads and DERs are represented via ZIP models and P-Q control, respectively. By incorporating specific boundary conditions at the terminal nodes, the formulation generalizes PF analysis to both radial and closed-ring topologies. The solution is obtained by using a backward/borward sweep (BFS) algorithm. The approach is validated against OpenDSS across various configurations, considering open-ring and closed-ring topologies with and without DERs.

2606.17913 2026-06-17 eess.SY cs.SY 新提交

Reducing Building Heat Demand Through Intelligent Control: A Comparative Simulation Study

通过智能控制减少建筑供暖需求:一项比较仿真研究

Ueli Schilt, Curtis Meister, Philipp Schuetz

AI总结 本研究通过比较两种不同优化目标的模型预测控制策略,发现以热舒适为导向的控制器比最小化供暖功率的控制器更能降低总热量消耗,同时保持高舒适度。

Comments 9 pages, 5 figures, 1 table. REHABEND 2026, 11th Euro-American Congress

详情
AI中文摘要

空间供暖仍然是建筑中的主要能源消耗者。虽然结构改造可以大幅减少需求,但通常成本高昂且耗时。作为替代方案,本研究探讨了智能供暖控制策略以较低投资和更快实施减少热量消耗的潜力。先前的研究表明,用模型预测控制器(MPC)替换传统的基于供暖曲线的控制器可以减少供暖能源需求。尽管大多数研究将MPC与传统控制进行比较,但本工作评估了两种具有不同控制目标的MPC策略,并量化了它们对室内温度跟踪和供暖需求的影响。基于ISO 52016-1在Python中开发了一个虚拟住宅建筑模型,以生成合成测量数据。使用该数据集对简化的电阻-电容(RC)模型进行参数化,并将其用作在MATLAB中实现的两种MPC策略的内部模型。这些策略仅在优化目标上有所不同:一种最小化二次供暖功率,而另一种优先考虑室内温度跟踪以实现热舒适。为期六天的模拟表明,两种策略都满足舒适性和系统约束,但在能源使用和温度变化方面存在差异。以舒适为导向的控制器实现了比最小化供暖功率的控制器更低的总热量消耗,这归因于二次目标函数中对高供暖速率的惩罚。结果证明了MPC设计中目标函数公式的重要性,并表明在不进行建筑围护结构改造的情况下,可以在实现较低供暖需求的同时保持高舒适度水平。

英文摘要

Space heating remains the dominant energy consumer in buildings. While structural retrofitting can substantially reduce demand, it is often costly and time-intensive. As an alternative, this study investigates the potential of intelligent heating control strategies to reduce heat consumption with lower investment and faster implementation. Previous studies have shown that replacing conventional heating-curve-based controllers with model predictive controllers (MPCs) can reduce heating energy demand. Whereas most studies compare MPC to conventional control, this work evaluates two MPC strategies with different control objectives and quantifies their impact on indoor temperature tracking and heating demand. A virtual residential building model was developed in Python based on ISO 52016-1 to generate synthetic measurement data. A simplified resistance-capacitance (RC) model was parametrised using this dataset and used as the internal model for two MPC strategies implemented in MATLAB. The strategies differ only in their optimisation objective: one minimises quadratic heating power, while the other prioritises indoor temperature tracking for thermal comfort. Simulations over six days show that both strategies satisfy comfort and system constraints, but differ in energy use and temperature variation. The comfort-oriented controller achieves lower total heat consumption than the controller minimising heating power, which is attributed to the penalisation of high heating rates in the quadratic objective function. The results demonstrate the importance of objective function formulation in MPC design and show that high comfort levels can be maintained while achieving lower heating demand without structural modifications to the building envelope.

2606.17873 2026-06-17 eess.SY cs.SY 新提交

Model-Free Control for Multi-Time Scale Dynamics of Grid-Connected Power Converters

并网功率变换器多时间尺度动态的无模型控制

Dewan Mahnaaz Mahmud, Vinu Thomas, Bogdan Marinescu

AI总结 针对并网功率变换器的多时间尺度动态,提出一种基于智能比例积分(iPI)的无模型控制方法,并在16kW实验平台上验证其有效性,展示了在二次电压控制中的应用优势。

详情
AI中文摘要

基于电力电子系统的控制器合成主要依赖于系统的数学模型,当实际系统复杂且数学模型无法捕捉其所有动态时,这便成为了一种限制。无模型控制通过使用一种特设的简单模型来弥补这一限制,该模型通过基于导数的高速率动态评估进行补偿。然而,将无模型控制策略应用于基于电力电子的多时间尺度动态系统是具有挑战性的,因为实现这种控制需要导数作用。并网功率变换器是这类系统的例子,但文献中尚未充分解决实验验证问题。本文介绍了包括硬件实现层面在内的此类控制的验证。合成了一种智能比例积分(iPI)控制器,并在16 kW实验测试台上进行了验证。这证明了该方法在并网功率变换器控制中的优势,其中包括它们在二次电压控制中的参与。

英文摘要

Controller synthesis in power electronics-based systems depends predominantly on the mathematical model of the system, which is a limitation when the actual system is complex and the mathematical model cannot capture all its dynamics. Model-free control addresses this limitation by using an ad-hoc simple model which is compensated by high-rate evaluation of dynamics in terms of their derivatives. However, application of the model-free control strategy to power electronics-based multi-time scale dynamical systems is challenging because of the derivative action needed to implement such control. Grid-connected power converters are examples of such systems, yet experimental validation has not been adequately addressed in the literature. This letter presents the validation of such control including the hardware implementation level. An intelligent proportional-integral (iPI) controller is synthesized and validated on a 16 kW experimental test bench. This proves the benefits of the approach in control of grid-connected power converters, among which their participation in the secondary voltage control.

2606.17860 2026-06-17 cs.DC cs.LO 新提交

An Epistemic Analysis of Random Coordinated Attack

随机协调攻击的认知分析

Sophia Knight, David Lehnherr, Sergio Rajsbaum

AI总结 针对随机协调攻击问题,提出一种概率认知逻辑框架,分析Varghese-Lynch算法,证明其下界紧致,并揭示信息水平与认知公式的对应关系。

详情
AI中文摘要

协调攻击问题建模了通过不可靠链路在有限时间内协调联合行动的挑战。它是第一个被证明不可解的分布式计算问题。其分析也揭示了共同知识(认知逻辑中的核心概念)的重要性。然而,据我们所知,可解的随机化版本的协调攻击尚未通过概率认知逻辑的视角进行研究,其中进程通过抛硬币产生随机性。我们提出了一个认知逻辑框架,用于研究执行有限轮次的随机化算法。该框架适用于协调攻击、近似一致和共识问题,并支持动态图模型:同步系统中可靠进程执行有限轮次,同时对手决定哪些消息丢失。我们的方法结合了动态网络的逻辑刻画和任务可解性技术,以及概率动态认知逻辑的思想。它受到Varghese和Lynch关于随机协调攻击的操作模型的启发。更广泛地说,由此产生的概率认知任务可解性概念为随机化分布式计算的认知研究提供了基础。利用该框架,我们从知识理论的角度分析了Varghese-Lynch算法,提供了对该算法及其下界的正式处理。作为副产品,我们加强了下界并证明其紧致性。证明依赖于不可区分性论证,表明在概率设置中关于知识的推理仍然至关重要。我们还形式化了Varghese和Lynch引入的信息水平概念,表明它对应于一个特定的认知公式。

英文摘要

The coordinated attack problem models the challenge of coordinating a joint action within a bounded time by communicating over unreliable links. It was the first distributed computing problem proven unsolvable. Its analysis also revealed the importance of common knowledge, a central concept in epistemic logic. However, the randomized version of coordinated attack, which is solvable, has not, to the best of our knowledge, been studied through the lens of probabilistic epistemic logic, where processes generate randomness by flipping coins. We present an epistemic logic framework for studying randomized algorithms that execute for a bounded number of rounds. The framework applies to coordinated attack, approximate agreement, and consensus, and supports dynamic graph models: synchronous systems in which reliable processes execute a bounded number of rounds while an adversary determines which messages are lost. Our approach combines techniques from the logical characterization of dynamic networks and task solvability with ideas from probabilistic dynamic epistemic logic. It is inspired by the operational model of Varghese and Lynch for randomized coordinated attack. More broadly, the resulting notion of probabilistic epistemic task solvability provides a foundation for the epistemic study of randomized distributed computation. Using this framework, we analyze the Varghese-Lynch algorithm from a knowledge-theoretic perspective, providing a formal treatment of the algorithm and its lower bound. As a byproduct, we strengthen the lower bound and show it is tight. The proof relies on indistinguishability arguments, demonstrating that reasoning about knowledge remains essential in the probabilistic setting. We also formalize the notion of information level introduced by Varghese and Lynch, showing that it corresponds to a specific epistemic formula.

2606.17853 2026-06-17 cs.NE 新提交

An Optimization Framework for Automated Assessment of Biological Plausibility of Spiking Neurons

脉冲神经元生物学合理性的自动化评估优化框架

Sven Nitzsche, Alexandru Ionita, Andreas Faust, Bogdan Ionescu, Juergen Becker

AI总结 提出一个开源框架,通过优化模型参数以复现生物典型放电模式,自动化评估脉冲神经元模型的生物学合理性,并在多个模型上验证有效性。

Comments Reviewed version published at the ECML-PKDD 2025 joint post-workshop proceeding in Springer Communications in Computer and Information Science

详情
AI中文摘要

生物学合理性是神经形态计算和脉冲神经网络中的一个关键概念,但其定义不一致且难以量化。在这项工作中,我们提出了一个用于脉冲神经元模型生物学合理性自动化评估的开源框架。我们的方法基于评估模型复现生物系统中观察到的典型神经元放电模式的能力,遵循Izhikevich提出的分类。通过将这些模式编码为目标函数并相应优化模型参数,我们的框架无需先验分析建模即可实现经验性评估。将神经元模型视为黑箱,该框架提供了一种实用且灵活的方法来表征其动态能力。我们在几个已建立的模型和一个先前未探索的自定义模型上展示了该框架的有效性。该框架使用Python实现,兼容PyTorch和Norse库,专为机器学习场景设计。它旨在作为系统研究生物学合理性与网络级性能指标(如准确性、能效、鲁棒性和适应性)之间关系的起点。

英文摘要

Biological plausibility is a key concept in neuromorphic computing and spiking neural networks, yet it remains inconsistently defined and difficult to quantify. In this work, we present an open-source framework for the automated assessment of biological plausibility in spiking neuron models. Our method builds on the idea of evaluating a model's ability to replicate canonical neuronal firing patterns observed in biological systems, following the classification proposed by Izhikevich. By encoding these patterns into objective functions and optimizing model parameters accordingly, our framework enables empirical assessment without requiring prior analytical modeling. Treating neuron models as black boxes, it provides a practical and flexible means of characterizing their dynamic capabilities. We demonstrate the effectiveness of the framework on several established models and a previously unexplored custom model. Implemented in Python and compatible with PyTorch and the Norse library, the framework is tailored for machine learning contexts. It is intended as a starting point for systematic research into the relationship between biological plausibility and network-level performance metrics such as accuracy, energy efficiency, robustness, and adaptability.

2606.17850 2026-06-17 cs.AR 新提交

CUTh-Solver: GPU-Accelerated Sparse Matrix Solver for High-Resolution Thermal Simulation of 3D ICs

CUTh-Solver:用于3D IC高分辨率热仿真的GPU加速稀疏矩阵求解器

Chenghan Wang, Zhen Zhuang, Shui Jiang, Siyuan Liang, Xiaoman Yang, Kai Zhu, Darong Huang, Luis Costero, Rongmei Chen, Tsung-Wei Huang, David Atienza, Tsung-Yi Ho

AI总结 针对3D IC高分辨率热仿真中稀疏矩阵求解的瓶颈,提出CUTh-Solver,通过压缩DIA存储格式、对角SpMV、高并行预处理和自适应混合精度策略,在GPU上实现高达25.8倍加速。

详情
AI中文摘要

粗粒度热仿真往往低估局部热问题,可能遗漏关键热点。因此,准确分析需要细粒度信息,这极大地增加了网格分辨率,从而增加了计算工作量。幸运的是,系数矩阵通常是稀疏的且具有规则稀疏模式,提供了优化机会。然而,GPU上现有的通用矩阵求解器很少利用这些领域特定属性,因此在数据存储、内存访问、并行性、计算效率和硬件利用率方面遇到瓶颈。因此,我们提出了CUTh-Solver,一个协同设计的GPU加速基于预条件共轭梯度(PCG)的稀疏求解器框架,用于高分辨率稳态和瞬态3D IC热仿真中出现的对称正定(SPD)系统。在数据存储方面,CUTh-Solver压缩对角(DIA)存储格式以消除冗余。为了优化内存访问,CUTh-Solver采用对角SpMV实现合并内存访问。我们进一步观察到并行性与预条件质量之间的关键冲突,因此采用高并行预条件策略。为了提高计算效率和硬件利用率,我们采用自适应细粒度混合精度策略,利用不同的浮点单元避免资源争用,在保证数值稳定性的同时提高吞吐量。实验结果表明,CUTh-Solver相比GPU加速的COMSOL Multiphysics 6.4实现了高达25.8倍加速,相比NVIDIA的原生通用库(AmgX、cuSPARSE、cuDSS)实现了超过3倍加速。消融研究验证了每种优化的单独贡献。代码可在以下网址获取:this https URL

英文摘要

Coarse-grained thermal simulation tends to underestimate localized thermal issues, potentially missing critical hotspots. Accurate analysis, therefore, demands fine-grained information, which dramatically increases grid resolution and thus computational workload. Fortunately, the coefficient matrices are often sparse with regular sparsity patterns, offering optimization opportunities. However, existing general-purpose matrix solvers on GPUs rarely exploit these domain-specific properties, thereby encountering bottlenecks in data storage, memory access, parallelism, computational efficiency, and hardware utilization. Therefore, we propose CUTh-Solver, a co-designed GPU-accelerated Preconditioned Conjugate Gradient (PCG)-based sparse solver framework for Symmetric Positive Definite (SPD) systems arising from high-resolution steady-state and transient 3D IC thermal simulation. For data storage, CUTh-Solver condenses the Diagonal (DIA) storage format to remove redundancy. To optimize the memory access, CUTh-Solver employs diagonal-wise SpMV to achieve coalesced memory access. We further observe a critical conflict between parallelism and preconditioning quality and thus adopt a high-parallelism preconditioning strategy. To improve computational efficiency and hardware utilization, we employ an adaptive fine-grained mixed-precision strategy that leverages diverse floating-point units to avoid resource contention, enhancing throughput without compromising numerical stability. Experimental results show that CUTh-Solver achieves up to 25.8x speedup over GPU-accelerated COMSOL Multiphysics 6.4 and over 3x speedup over NVIDIA's native general-purpose libraries (AmgX, cuSPARSE, cuDSS). Ablation studies validate the individual contribution of each optimization. The code is available at: https://github.com/Chenghan-Wang/CUTh-Solver

2606.17845 2026-06-17 cs.NI 新提交

UAV-CAS: A Calibrated Digital-Twin Dataset for Intrusion Detection in UAV Swarm Networks

UAV-CAS:用于无人机群网络入侵检测的校准数字孪生数据集

Sripath Mishra, Bharat Bhargava, Zizheng Liu, Shafkat Islam

AI总结 针对有线网络数据集训练的入侵检测系统在真实无人机群中性能急剧下降的问题,提出UAV-CAS数据集,通过四层校准管道生成大规模标记流数据,覆盖五种攻击族和九种协作攻击组合,验证了数据集的可学习性和攻击分类的挑战性。

Comments Repository URL: https://github.com/Sripathm2/Collaborative-UAV-Dataset, Dataset Link: https://dx.doi.org/10.21227/zgrg-z865

详情
AI中文摘要

基于有线网络基准训练的入侵检测系统(IDS)在真实无人机(UAV)群中性能急剧下降,因为移动性、波动的链路质量和去中心化路由重塑了流量分布。现有的无人机特定数据集也没有系统地改变这些条件,无法针对导致IDS失效的分布偏移进行训练或测试。我们提出了UAV-CAS,一个用于无人机网络入侵检测的大规模标记流数据集,由Containernet数字孪生生成,并针对AERPAW测试平台测量进行了系统校准。我们有一个四层校准管道,涵盖高度相关的路径损耗、任务特定的移动性、链路级性能链和端到端轨迹保真度。UAV-CAS包含来自1024种配置的99,492条流,涵盖五种攻击族(DoS、DDoS、黑洞、虫洞、重放)和九种协作攻击组合。多样性分析表明,高速率攻击与良性流量的分离程度比任何先前基准高出一个数量级,而隐蔽攻击则故意与良性流量混合。在十个基线IDS上,二分类攻击检测饱和于0.98以上,确认数据集是可学习的,而完整的攻击类别识别仍然困难——每类F1分数从接近零到0.82不等,对于隐蔽攻击则降至个位数。我们发布数据集、模拟器和校准数据,以支持可重复的无人机入侵检测研究。

英文摘要

Intrusion detection systems (IDS) trained on wired-network benchmarks degrade sharply in real-world unmanned aerial vehicle (UAV) swarms, where mobility, fluctuating link quality, and decentralized routing reshape traffic distributions. Existing UAV-specific datasets also do not systematically vary these conditions, leaving no way to train or test an IDS against the very shift that defeats it. We present UAV-CAS, a large-scale labeled flow dataset for UAV-network intrusion detection, generated by a Containernet digital twin that is systematically calibrated against AERPAW testbed measurements. We have a four-layer calibration pipeline spanning altitude-dependent path loss, mission-specific mobility, the link-level performance chain, and end-to-end trace fidelity. UAV-CAS comprises 99,492 flows drawn from 1,024 configurations that span five attack families (DoS, DDoS, blackhole, wormhole, replay) and nine collaborative attack compositions. A diversity analysis shows that high-rate attacks separate from benign traffic up to an order of magnitude more strongly than in any prior benchmark, while stealth attacks deliberately blend with benign traffic. Across ten baseline IDS, binary attack detection saturates above $0.98$, confirming the dataset is learnable, whereas full attack-class identification remains hard -- per-class $F_1$ ranges from near zero to $0.82$ and falls into the single digits for stealth attacks. We release the dataset, simulator, and calibration data to support reproducible UAV intrusion-detection research.

2606.17811 2026-06-17 cs.LO 新提交

UMB: A Unified Markov Binary Format for Probabilistic Model Checking (extended version)

UMB:一种用于概率模型检验的统一马尔可夫二进制格式(扩展版)

Roman Andriushchenko, Arnd Hartmanns, Joshua Jeppson, Sebastian Junges, Tobias Meggendorfer, David Parker, Tim Quatmann, Maximilian Weininger

AI总结 提出UMB格式,一种高效、可扩展的显式状态文件格式,用于表示多种概率系统,解决低层模型交换问题,已被主流工具采用并提供Python库支持。

详情
AI中文摘要

本文提出了统一马尔可夫二进制(UMB)格式,一种高效、可扩展且支持良好的显式状态文件格式,用于表示广泛的概率系统。UMB解决了以下问题:虽然概率模型检验工具通常支持常见的高级建模语言,但缺乏交换低层模型表示的有效机制。实践中,使用文本的、特定于工具的格式,阻碍了互操作性,并导致读写模型文件的开销很大。UMB基于通用的底层数学模型,并使用一小组位级原始数据结构进行编码,提供了一种简洁、统一且高效的解决方案。该格式已被主流工具采用,并附带一个方便的Python库,用于读取、操作、创建和验证模型,以及跨工具安装和持续验证的基础设施。我们报告了文件格式的效率以及它促成的新的实际用例。

英文摘要

This paper presents the unified Markov binary (UMB) format, an efficient, extensible, and well-supported explicit-state file format for representing a wide range of probabilistic systems. UMB addresses the problem that, while probabilistic model checking tools often support common high-level modelling languages, there is no effective mechanism for exchanging low-level model representations. In practice, textual, tool-specific formats are used, hampering interoperability and resulting in large overheads in writing and reading model files. UMB provides a clean, unified, and efficient solution, based on a general underlying mathematical model, and encoded using a small set of bit-level primitive data structures. The format has already been adopted by prominent tools and comes with a convenient Python library for reading, manipulating, creating, and validating models, plus infrastructure for cross-tool installation and continuous validation. We report on both the efficiency of the file format and the new practical use cases that it facilitates.

2606.17793 2026-06-17 cs.HC cs.DB 新提交

ARES: A Platform for Adaptive Role-Based Evaluation of Social Engineering Risks in Human--AI Games

ARES: 一种用于人类-人工智能游戏中基于角色的社会工程风险自适应评估平台

Roberto Daza, Javier Irigoyen, Ivan Lopez, Raquel Rodriguez-Carvajal, Laura Gomez, Julian Fierrez, Ruben Tolosana, Aythami Morales

AI总结 提出ARES平台,通过可控社交游戏审计LLM中介的社会决策中的自适应社会工程风险,支持人-人、人-AI和AI-AI设置,并收集多模态数据集以评估风险。

Comments 6 pages, 2 figures. Accepted at the International Carnahan Conference on Security Technology (ICCST 2026)

详情
AI中文摘要

本文介绍了ARES,一个平台和开放试点数据集,用于通过受控社交游戏审计LLM中介的社会决策中的自适应社会工程风险。ARES支持人-人、人-AI和AI-AI设置,结合了可配置的游戏模板、角色条件的LLM代理、心理学知情的参与者画像、结构化交互树以及同步的行为和生物特征采集、过滤和基于深度学习的特征提取。试点数据集来自15名参与者与角色条件的GPT-5.4代理在两个串联游戏(改编的囚徒困境和最后通牒游戏)中的互动。它包含340 GB的原始和处理过的多模态数据,涵盖六个流:交互日志、视频、屏幕录制、注视日志、智能手表信号以及游戏/问卷元数据。这些数据包括交互路径、书面理由、心理画像、主观反馈、感知对手身份、游戏结果以及衍生的行为、面部和注视特征。除了数据集,我们还提供了描述性分析来表征试点发布。严格的风险评估对于部署安全的AI系统至关重要,因为它能够识别和缓解漏洞,确保敏感数据的保护,并支持遵守社会不断发展的监管和伦理标准。

英文摘要

This work introduces ARES, a platform and open pilot dataset for auditing adaptive social engineering risks in LLM-mediated social decision-making through controlled social games. ARES supports human--human, human--AI, and AI--AI settings, combining configurable game templates, role-conditioned LLM agents, psychology-informed participant profiling, structured interaction trees, and synchronised behavioural and biometric acquisition, filtering, and deep-learning-based feature extraction. The pilot dataset was collected from 15 participants interacting with a role-conditioned GPT-5.4 agent in two concatenated games: an adapted Prisoner's Dilemma and an Ultimatum Game. It comprises 340 GB of raw and processed multimodal data across six streams: interaction logs, video, screen recordings, gaze logs, smartwatch signals, and game/questionnaire metadata. These data include interaction paths, written justifications, psychological profiles, subjective feedback, perceived counterpart identity, game outcomes, and derived behavioural, facial, and gaze features. Alongside the dataset, we provide descriptive analyses to characterise the pilot release. Rigorous risk evaluation is essential for the deployment of secure AI systems, as it enables the identification and mitigation of vulnerabilities, ensures the protection of sensitive data, and supports compliance with evolving regulatory and ethical standards in society.

2606.17789 2026-06-17 cs.HC 新提交

Mind Companion: An Embodied Conversational Agent for Process-Based Psychotherapy

Mind Companion: 一种用于基于过程心理治疗的具身对话代理

Sofie Kamber, Lukas Diebold, Pascal Riachi, Stella Brogna, Andrew Gloster, Rafael Wampfler

AI总结 提出Mind Companion,一种基于大语言模型的具身对话代理,通过多层级心理分析与过程治疗原则,实时分析客户陈述并生成回应,评估显示GPT-5.2在多个维度上优于人类治疗师。

详情
Journal ref
2026 IEEE 14th International Conference on Healthcare Informatics (ICHI), Minneapolis, MN, June 1-3, 2026, pp. 980-989
AI中文摘要

全球范围内获得循证心理治疗的机会仍然有限,即使在高收入地区也存在漫长的等待名单。最近大语言模型(LLM)的进展,在临床监督和安全机制设计下,为可扩展的心理健康支持提供了潜力。我们提出了Mind Companion,一种基于LLM的具身对话代理,将多层级心理分析与基于过程的治疗原则相结合。该系统对客户陈述进行实时分析,涵盖事实提取、心理灵活性过程检测、情绪识别和安全监控。分析结果存储供监督临床医生用于治疗规划。回应生成结合了来自循证治疗文献的检索增强生成和上下文感知提示。回应通过具身化虚拟角色以同步语音合成和动画传递。我们评估了三种LLM配置(GPT-4.1-mini、GPT-5.2、Claude Sonnet 4.5),与来自真实治疗会话的治疗师回应进行对比,使用自动LLM裁判评估和11位专业心理治疗师的专家评估。GPT-5.2在理解力、人际效能、协作和治疗一致性方面均获得高于人类治疗师回应的评分,证明了基于LLM的对话代理作为临床护理补充工具的可行性。

英文摘要

Access to evidence-based psychotherapy remains limited worldwide, with long waitlists even in high-income regions. Recent advances in large language models (LLMs) offer potential for scalable mental health support when designed with clinical oversight and safety mechanisms. We present Mind Companion, an LLM-based embodied conversational agent integrating multi-layered psychological analysis with process-based therapy principles. The system performs real-time analysis of client statements across fact extraction, psychological flexibility process detection, emotion recognition, and safety monitoring. Analysis results are stored for supervising clinicians to inform therapeutic planning. Response generation incorporates retrieval-augmented generation from evidence-based therapeutic literature and context-aware prompting. Responses are delivered through an embodied avatar with synchronized speech synthesis and animation. We evaluated three LLM configurations (GPT-4.1-mini, GPT-5.2, Claude Sonnet 4.5) against therapist responses from real therapy sessions using automated LLM-judge assessment and expert evaluation with 11 professional psychotherapists. GPT-5.2 achieved higher ratings than human therapist responses across understanding, interpersonal effectiveness, collaboration, and therapeutic alignment in both evaluations, demonstrating the feasibility of LLM-based conversational agents as tools to complement clinical care.

2606.17787 2026-06-17 cs.DC 新提交

LUMEN: Coordinated Failure Recovery for Distributed LLM Serving

LUMEN:分布式LLM服务的协调故障恢复

Zhang Cao, Shujie Han, Juncheng Zhang, Yuanming Ren, Yongkun Li, Patrick P. C. Lee

AI总结 针对分布式LLM服务中工作节点故障导致KV缓存丢失和请求重算的问题,提出LUMEN系统,通过负载感知的协调恢复策略(检查点放置、中断请求分配、容量恢复)显著提升服务与恢复时间。

详情
AI中文摘要

现代大语言模型(LLM)服务集群将推理请求分布到不同GPU上的多个工作进程中,但大规模下故障普遍存在。当工作进程故障时,集群同时丢失故障工作进程的GPU驻留键值(KV)缓存和服务容量,导致幸存工作进程在吸收重定向流量的同时从头重新运行中断的请求。现有容错系统要么从头重启中断请求,要么从固定邻居工作进程上的检查点恢复KV缓存,但这两种方法在未考虑当前集群负载的情况下路由恢复工作,并在模型重载期间使恢复工作进程空闲。我们提出LUMEN,一种容错LLM服务系统,将恢复视为跨三个决策点的负载感知协调问题:故障前的检查点放置、故障时的中断请求分配以及模型重载期间的服务容量恢复。我们通过原型实验和大规模模拟评估LUMEN,并展示了在服务时间和恢复时间上的显著改进。

英文摘要

Modern large language model (LLM) serving clusters distribute inference requests across multiple worker processes on different GPUs, but failures are prevalent at scale. When a worker fails, the cluster simultaneously loses the failed worker's GPU-resident key-value (KV) caches and serving capacity, leaving surviving workers to absorb the redirected traffic while re-running interrupted requests from scratch. Existing fault-tolerant systems either restart interrupted requests from scratch or restore KV caches from checkpoints stored on a fixed neighboring worker, but both approaches route recovery work without considering current cluster load and leave the recovering worker idle during model reload. We present LUMEN, a fault-tolerant LLM serving system that treats recovery as a load-aware coordination problem across three decision points: checkpoint placement before failures, interrupted-request distribution at failure time, and serving capacity restoration during model reload. We evaluate LUMEN using both prototype experiments and large-scale simulations and demonstrate significant improvements in serving and recovery times.

2606.17783 2026-06-17 cs.HC 新提交

Is It Real? Exploiting Virtual-Physical Discrimination Vulnerability in Mixed Reality

这是真的吗?利用混合现实中的虚实辨别漏洞

Xueyang Wang, Xihuan Yao, Yanming Xiu, Xin Yi, Maria Gorlatova, Hewu Li

AI总结 研究混合现实头显中用户无法区分虚拟与真实物体的漏洞,通过专家研讨和四项概念验证攻击(成功率85%-100%),揭示了攻击如何改变用户行为,并提出平台级溯源、交互门控和用户教育等防御措施。

Comments Accepted at the 2026 USENIX Symposium on Usable Privacy and Security (SOUPS 2026)

详情
AI中文摘要

消费级混合现实(MR)头显将虚拟内容以足够保真度无缝融合到物理环境中,用户可能无法区分虚拟物体和物理物体。我们将这种虚实辨别漏洞识别为一种可利用的安全原语。通过与12位来自网络安全和MR/HCI领域的专家进行推测性设计研讨会,我们开发了虚实混淆攻击的分类法,并在Apple Vision Pro上实现了四项概念验证攻击,在26名参与者参与的现实MR任务中进行了评估。所有四项攻击都改变了用户行为,成功率从85%到100%不等,产生了误导性交互、误判物体身份、有偏见的购买决策和改变的导航路径。值得注意的是,最成功的攻击也是最难被参与者主观评分检测到的。即使识别出虚拟内容的参与者在行为上仍然顺从,并且没有参与者将异常事件归因于对抗性原因。我们提出平台级溯源、交互门控和用户教育作为对策。

英文摘要

Consumer mixed reality (MR) headsets seamlessly blend virtual content into physical environments with sufficient fidelity that users may be unable to distinguish virtual objects from physical ones. We identify this virtual-physical discrimination vulnerability as an exploitable security primitive. Through speculative design workshops with 12 experts from cybersecurity and MR/HCI, we develop a taxonomy of virtual-physical confusion attacks and implement four proof-of-concept attacks on Apple Vision Pro, evaluating them with 26 participants in realistic MR tasks. All four attacks altered user behavior, with success rates ranging from 85% to 100%, producing misdirected interactions, misjudged object identities, biased purchasing decisions, and altered navigation paths. Notably, the most successful attacks were also the hardest to detect according to participants' subjective ratings. Even participants who recognized virtual content still complied behaviorally, and no participant attributed anomalous events to adversarial causes. We propose platform-level provenance, interaction gating, and user education as countermeasures.

2606.17746 2026-06-17 cs.NI 新提交

FlowCLIP: Contrastive Pretraining Using Domain Names for Encrypted Traffic Classification

FlowCLIP: 使用域名的对比预训练进行加密流量分类

Eun Hun Choi

AI总结 提出FlowCLIP框架,利用数据包侧信道特征(间隔、大小、方向)和CLIP对比目标对齐流量与域名表示,在QUIC流量数据集上跨周评估,优于基线方法。

详情
AI中文摘要

网络流量分类支持网站指纹识别、入侵检测和服务质量管理。然而,在现实部署条件下开发能够捕获稳定且可泛化的流量模式的方法仍然具有挑战性。我们引入了FlowCLIP,一个对比预训练框架,仅使用侧信道特征(数据包到达间隔时间、数据包大小和数据包方向)从加密流量中进行域名预测。FlowCLIP通过CLIP风格的对比目标将流量流表示与域名表示对齐,从而将原始域名作为文本监督。预训练的流量编码器随后被冻结,并通过线性探测在规范化的域名标签上进行评估。我们在一个基于时间协议的大规模QUIC流量数据集上评估FlowCLIP,其中模型在第1周的流量上训练,并在第2-4周的流量上评估。FlowCLIP在后续评估周中优于竞争性的机器学习基线,表明原始域名为学习可迁移的加密流量表示提供了文本监督信号。

英文摘要

Network traffic classification enables website fingerprinting, intrusion detection, and Quality of Service management. However, developing methods that capture stable and generalizable traffic patterns under realistic deployment conditions remains challenging. We introduce FlowCLIP, a contrastive pretraining framework for domain name prediction from encrypted traffic using only side-channel features: packet inter-arrival times, packet sizes, and packet directions. FlowCLIP uses raw domain names as textual supervision by aligning traffic flow representations with domain name representations through a CLIP-style contrastive objective. The pretrained traffic encoder is then frozen and evaluated through linear probing on canonicalized domain name labels. We evaluate FlowCLIP on a large-scale QUIC traffic dataset using a time-based protocol, where models are trained on Week 1 traffic and evaluated on traffic from Weeks 2-4. FlowCLIP outperforms competitive machine learning baselines across later evaluation weeks, suggesting that raw domain names provide a textual supervision signal for learning transferable encrypted traffic representations.

2606.17741 2026-06-17 eess.SY cs.HC cs.SY 新提交

A Wearable Multimodal Ultrasound+Inertial System for Real-Time Virtual Reality Interaction

用于实时虚拟现实交互的可穿戴多模态超声+惯性系统

Giusy Spacone, Sebastian Frey, Enzo Baraldi, Mattia Orlandi, Luca Benini, Andrea Cossettini

AI总结 提出基于前臂和上臂超声与惯性传感的完全可穿戴多模态接口,结合WULPUS平台和Unity VR环境,通过多模态学习实现手部姿态和前臂位置估计,在三个任务中在线成功率超88%。

Comments 8 pages, 8 figures, 3 tables

详情
AI中文摘要

A模式超声(US)是一种有前景的虚拟现实(VR)交互传感模态,因为它能够将肌肉活动映射为控制命令,同时保留可穿戴传感的优势。然而,现有方法在可穿戴性和交互复杂性方面仍面临限制,通常依赖外部硬件如摄像头。在这项工作中,我们提出了一种完全可穿戴的多模态接口,用于实时VR交互,基于来自前臂和上臂的并发US和惯性(加速度计)传感。该系统构建于WULPUS平台之上,并集成了一个端到端的软件框架,用于实时采集、可视化以及与基于Unity的VR环境通信。引入了一种多模态学习流水线,用于在二维空间中同时进行手部姿态和前臂位置估计。通过离线与在线实验对接口进行了评估,涉及五名受试者执行三项功能任务:圆柱体抓取(粗大运动)与搬运、弹珠捏取(精细运动)与搬运以及液体倾倒。对于离线实验,我们在多天内采集了5次采集会话,在手部姿态估计中实现了跨受试者的平均跨会话准确率80±6%,前臂位置估计为77±7%。在线验证仅需最少微调(5分钟),三项任务的成功率分别为92.0±16.0%、88.0±9.8%和96.0±8.0%。功耗仅为19.9 mW,我们的系统可在小型350 mAh锂聚合物电池上连续使用超过2.5天而无需充电,实现了真正可穿戴、多模态且功能有意义的VR交互。

英文摘要

A-mode ultrasound (US) is a promising sensing modality for Virtual Reality (VR) interaction, as it enables the mapping of muscular activity into control commands while retaining the benefits of wearable sensing. However, existing approaches still face limitations in terms of wearability and interaction complexity, often relying on external hardware such as cameras. In this work, we propose a fully wearable multimodal interface for real-time VR-interaction, based on concurrent US and inertial (accelerometry) sensing from the forearm and upper arm. The system is built on the WULPUS platform and integrates an end-to-end software framework for real-time acquisition, visualization, and communication with a Unity-based VR environment. A multimodal learning pipeline is introduced for concurrent hand pose and forearm position estimation in 2D space. The interface is evaluated through offline and online experiments with five subjects, during the execution of three functional tasks: cylinder grasping (gross motor) and relocation, marble pinching (fine motor) and relocation, and liquid pouring. For offline experiments, we collect 5 acquisition sessions across multiple days, achieving an average inter-session accuracy across subjects of 80$\pm$6\% for hand pose estimation and 77$\pm$7\% for forearm position estimation. Online validation with minimal fine-tuning (5 min) demonstrates success rates of 92.0$\pm$16.0\%, 88.0$\pm$9.8\%, and 96.0$\pm$8.0\% for the three tasks, respectively. With a power consumption of only 19.9~mW, our system enables more than 2.5 days of continuous use on a small 350 mAh LiPo battery without the need for recharge, enabling truly wearable, multimodal, and functionally meaningful VR interaction.

2606.17732 2026-06-17 cs.DS 新提交

Four-Cycle Counting in Low-Degeneracy Graph Streams

低退化度图流中的四环计数

Sebastian Lüderssen, Stefan Neumann, Pan Peng

AI总结 提出两种基于子图采样的算法,分别使用两遍和一遍流式扫描,在低退化度图上实现四环数量的(1+ε)近似,空间复杂度达到理论最优或接近最优。

Comments KDD 2026

详情
AI中文摘要

我们研究了在任意顺序边流给出的图中,对四环数量进行$(1+\varepsilon)$近似的问题。我们提出了两种基于采样诱导子图的新算法。第一个贡献是一个两遍算法,使用$\widetilde{O}(\kappa m / \sqrt{T})$空间,其中$m$是边数,$T$是四环数,$\kappa$是图的退化度。该算法改进了现有的理论界限,并且在常数退化度图上被证明是最优的,匹配已知的$\Omega(m/\sqrt{T})$下界(忽略低阶因子)。第二个贡献是一个一遍算法,当四环不是高度集中在单个节点、边或楔形周围时,该算法保持准确;这种结构性质在稀疏社交和协作网络中很常见。我们在各种真实世界图流上评估了这两种算法。两遍算法始终优于最先进的方法,使用更少的空间达到所需的精度。一遍算法在四环均匀分布时具有竞争力,与我们的理论分析一致。与最近的几项工作不同,我们的算法即使在非二分图(如社交网络)上也表现良好。

英文摘要

We study the problem of $(1+\varepsilon)$-approximating the number of four-cycles in graphs given as arbitrary order edge streams. We propose two new algorithms based on sampling induced subgraphs. Our first contribution is a two-pass algorithm that uses $\widetilde{O}(κm / \sqrt{T})$ space, where $m$ is the number of edges, $T$ is the number of four-cycles, and $κ$ is the graph's degeneracy. This algorithm improves upon existing theoretical bounds and is provably optimal for constant-degeneracy graphs, matching the known $Ω(m/\sqrt{T})$ lower bound up to lower-order factors. Our second contribution is a one-pass algorithm that remains accurate when four-cycles are not highly concentrated around individual nodes, edges, or wedges; this structural property is common in sparse social and collaboration networks. We evaluate both algorithms on a variety of real-world graph streams. The two-pass algorithm consistently outperforms state-of-the-art methods, using substantially less space to achieve a desired accuracy. The one-pass algorithm is competitive when four-cycles are evenly distributed, matching our theoretical analysis. Unlike several recent works, our algorithms perform well even on non-bipartite graphs such as social networks.

2606.17731 2026-06-17 cs.NE 新提交

Evolutionary Algorithms and Multi-Objective Minimum Spanning Trees with Limited Distinct Weight Values

进化算法与具有有限不同权值的多目标最小生成树

Narges Tavassoli Kejani, Andrew M. Sutton, Frank Neumann

AI总结 研究当边权取少量不同值时帕累托前沿的结构,基于此推导进化多目标算法的新运行时界,并通过实验验证。

详情
AI中文摘要

进化算法已广泛应用于多目标组合优化问题。尽管实际成功,但关于进化算法在多目标组合问题上运行时的理论结果相当有限。一个已被研究过的经典问题是多目标最小生成树问题,已获得计算帕累托前沿所有极值角点的运行时界。本文提供了当边权取少量不同值时帕累托前沿结构的更详细见解。基于这些见解,我们推导了进化多目标算法的新运行时结果,并通过实验研究补充了我们的理论结果。

英文摘要

Evolutionary algorithms have been used for a wide range of multi-objective combinatorial optimization problems. Despite practical success, theoretical results on the runtime of evolutionary algorithms for multi-objective combinatorial problems are rather limited. One classical problem that has been investigated is the multi-objective minimum spanning tree problem for which runtime bounds have been obtained to compute all extremal corner points of the Pareto front. With this paper, we provide some more detailed insights into the structure of the Pareto front when the edge weights take on a small number of distinct values. Based on these insights, we derive new runtime results for evolutionary multi-objective algorithms and complement our theoretical results with experimental investigations.

2606.17721 2026-06-17 cs.IR 新提交

Understanding and Debugging Failures in N-Gram-Based Generative Retrieval

理解和调试基于N-Gram的生成式检索中的失败

Richard Takacs, Adrian Bracher, Svitlana Vakulenko

AI总结 本文通过分类法、实证分析和可视化工具,系统研究了基于n-gram的生成式检索方法(如SEAL和MINDER)的失败模式,包括歧义文档ID、低标识符多样性和特定标识符的不成比例影响。

Comments Work in progress

详情
AI中文摘要

生成式检索(GR)是一种新兴的信息检索(IR)范式,其动机是日益强大的语言模型。在GR中,模型直接生成相关文档的标识符。虽然这些系统提供了独特的优势,但它们也引入了不同的失败机制。我们通过三个贡献探索这些失败模式:(1)我们基于GR文献提出了GR失败模式的分类法。(2)我们实证研究了GR子集——基于n-gram的方法,更具体地说,SEAL和MINDER中的失败。我们的分析揭示了常见问题,例如歧义文档ID、低标识符多样性以及特定标识符的不成比例影响。(3)我们引入了一个新的基于Web的工具,帮助IR社区分析生成的n-gram及其对最终排名的各自贡献,提供了一个直观的界面来识别这些GR方法出错的地方。

英文摘要

Generative Retrieval (GR) is an emerging Information Retrieval (IR) paradigm that is motivated by increasingly capable language models. In GR, a model directly generates identifiers for relevant documents. While these systems offer unique advantages, they also introduce distinct failure mechanisms. We explore these failure modes in three contributions: (1) We present a taxonomy of GR failure modes based on GR literature. (2) We empirically investigate failure in a subset of GR: ngram-based methods, more specifically, SEAL and MINDER. Our analysis reveals common issues, such as ambiguous docids, low identifier diversity, and the disproportionate impact of specific identifiers. (3) We introduce a new web-based tool that helps the IR community analyze generated ngrams and their respective contribution to the final ranking, providing an intuitive interface to identify where such GR methods go wrong.

2606.17716 2026-06-17 cs.NI 新提交

DPDS: A DPDK-Based Packet Delayer and Spacer

DPDS:基于DPDK的数据包延迟器与间隔器

Etienne Zink, Fabian Ihle, Michael Menth

AI总结 提出自适应延迟关联方法,在DPDK上实现高吞吐、零丢包的数据包延迟与间隔器DPDS,优于NetEm和MoonEm。

Comments This work has been submitted to the IEEE Open Journal of the Communications Society for possible publication

详情
AI中文摘要

本文解决了链路仿真中为数据包添加可变延迟的问题。朴素的方法要么增加过多延迟,要么导致数据包重排序,两者都不理想。我们开发了自适应延迟关联,高效地向数据包添加正相关延迟。它以平均延迟和标准差(抖动)作为输入,以及控制延迟动态的半衰期。我们研究了有无带宽限制下所得数据包延迟的准确性和动态性。据此,我们给出了半衰期的配置建议。我们在基于DPDK的数据包延迟器和间隔器(DPDS)中实现了自适应延迟关联,在硬件上测试其性能,并与广泛使用的链路仿真器NetEm以及最近开发的基于DPDK的仿真器MoonEm进行比较。DPDS在恒定延迟下以95 Gbit/s的零丢包吞吐量优于两者,在启用间隔功能时,对于3 ms抖动的可变延迟达到85 Gbit/s。此外,DPDS支持数据包重排序,恒定延迟和可变延迟下的零丢包吞吐量分别为73 Gbit/s和58 Gbit/s,还支持策略和两种丢包模型。

英文摘要

In this paper we tackle the problem of adding varying delay to packets for link emulation. Naive approaches either add more delay than desired or cause packet reordering, both of which are undesirable. We develop adaptive delay correlation, which adds positively correlated delays to packets efficiently. It takes a mean delay and standard deviation (jitter) as input, as well as a half-life period to control the delay dynamics. We investigate the accuracy and dynamics of the resulting packet delays with and without bandwidth limitation. As a result we give a recommendation for the configuration of the half-life period. We implement adaptive delay correlation in a DPDK-based packet delayer and spacer (DPDS), investigate its performance on hardware, and compare it with the widely used link emulator NetEm and the recently developed DPDK-based emulator MoonEm. DPDS outperforms both of them with a zero-loss throughput of 95 Gbit/s for constant delay and, with spacing enabled, 85 Gbit/s for varying delay with 3 ms jitter. Further, DPDS supports packet reordering with zero-loss throughputs of 73 Gbit/s and 58 Gbit/s for constant and varying delay, respectively, as well as policing and two packet loss models.

2606.17707 2026-06-17 cs.IR 新提交

Do Generative Recommenders Deepen the Information Cocoon? A Closed-Loop Simulation with LLM-powered User Simulators

生成式推荐器会加深信息茧房吗?基于LLM用户模拟器的闭环仿真

Jiyuan Yang, Gengxin Sun, Mengqi Zhang, Lingjie Wang, Yuanzi Li, Hongxi Cui, Xin Xin, Pengjie Ren

AI总结 提出闭环仿真框架RecLoop,利用LLM用户代理比较生成式与传统推荐器,发现生成式推荐器在暴露层面不易形成信息茧房,但反馈循环仍会导致编码空间集中,且茧房严重程度取决于分词策略和模型规模。

详情
AI中文摘要

推荐系统缓解了信息过载,但推荐与用户交互之间的重复反馈会强化现有偏好并缩小用户接触范围,形成信息茧房。虽然这一现象在传统序列推荐中已被广泛研究,但其对生成式推荐的影响尚不明确。通过用语义ID(SID)序列替换原子项目ID,生成式推荐器引入了一种不同的推荐机制,其在信息茧房形成中的作用尚未被理解。为了探究生成式推荐器是否会加深信息茧房,我们提出了RecLoop,一个由LLM驱动的用户代理的闭环仿真框架。我们在两个亚马逊数据集上,跨多个反馈周期比较了两种生成式推荐器和两种传统序列基线。除了标准的暴露层面指标,我们还引入了编码空间结构茧房,这是一个模型层面的指标,用于衡量生成的SID空间中的集中度。实验结果表明,生成式推荐器通常比传统基线更不易形成暴露层面的茧房,保持了更广泛的暴露多样性并减缓了跨用户同质化。然而,反馈循环仍可能导致生成的SID空间内出现集中。我们进一步发现,茧房严重程度强烈依赖于分词策略和模型规模:协同信号分词比语义分词产生更强的茧房效应,而更大的模型能保持更大的编码空间多样性,并更好地保留对利基内容的访问。这些发现表明,生成式推荐中的信息茧房不仅受推荐行为影响,还受项目分词和模型能力的影响。我们的代码可从此https URL获取。

英文摘要

Recommender systems alleviate information overload, yet repeated feedback between recommendations and user interactions can reinforce existing preferences and narrow users' exposure, forming information cocoons. While this phenomenon has been widely studied in traditional sequential recommendation, its impact on generative recommendation remains unclear. By replacing atomic item IDs with Semantic ID (SID) sequences, generative recommenders introduce a different recommendation mechanism whose role in information cocoon formation is not yet understood. To investigate whether generative recommenders deepen information cocoons, we propose \textsc{RecLoop}, a closed-loop simulation framework with LLM-driven user agents. We compare two generative recommenders and two traditional sequential baselines on two Amazon datasets across multiple feedback cycles. In addition to standard exposure-level metrics, we introduce \emph{Code-Space Structural Cocoon}, a model-level metric that measures concentration in the generated SID space. Experimental results show that generative recommenders are generally less prone to exposure-level cocoon formation than traditional baselines, preserving broader exposure diversity and slowing cross-user homogenization. However, feedback loops can still induce concentration within the generated SID space. We further find that cocoon severity depends strongly on tokenization strategy and model scale: collaborative-signal tokenization produces stronger cocoon effects than semantic tokenization, whereas larger models maintain greater code-space diversity and better retain access to niche content. These findings suggest that information cocoons in generative recommendation are shaped not only by recommendation behavior, but also by item tokenization and model capacity. Our code is available at https://github.com/Dregen-Yor/RecLoop.

2606.17703 2026-06-17 cs.SI 新提交

Minimizing Total Biharmonic Distance in Large Graphs via Link Recommendation

通过链接推荐最小化大型图中的总双调和距离

Xinna Zhou, Zhongzhi Zhang

AI总结 研究通过添加k条边最小化总双调和距离的问题,提出基于贪心算法和投影法、拉普拉斯求解器、凸包近似等技术的近线性时间算法,在真实数据集上验证了效率和有效性。

Comments This paper has been published in Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1

详情
AI中文摘要

总双调和距离,即网络中每对节点之间双调和距离的总和,是评估网络连通性和鲁棒性的关键指标。在本文中,我们研究了通过向给定图$G$添加$k$条不存在的边来最小化总双调和距离的问题,其中$k$为预算。该问题在计算上具有挑战性。我们证明了该问题的目标函数是单调的但不是超模的。为了解决这个问题,我们提出了时间复杂度为三次的简单贪心算法。为了缓解这些贪心算法的高时间复杂度,我们应用了几种技术,包括投影法、拉普拉斯求解器和凸包近似。这些技术将我们提出的算法的时间复杂度从三次降低到近线性,同时提供了误差保证。最后,在真实数据集上的大量实验证明了我们提出算法的效率和有效性。

英文摘要

The total biharmonic distance, which is the sum of the biharmonic distance between every pair of nodes in a network, is a key metric for evaluating network connectivity and robustness. In this paper, we study the problem of minimizing the total biharmonic distance by adding $k$ nonexistent edges for a given graph $G$ and budget $k$. The problem is computationally challenging. We show that the objective function of the problem is monotone but not supermodular. To solve this problem, we propose simple greedy algorithms with cubic time complexity. To mitigate the high time complexity of these greedy algorithms, we apply several techniques, including the projection method, the Laplacian solver, and convex hull approximation. These techniques reduce the time complexity of our proposed algorithms from cubic to nearly linear while providing error guarantees. Finally, extensive experiments on real datasets demonstrate both the efficiency and effectiveness of our proposed algorithms.

2606.17693 2026-06-17 cs.LO 新提交

Verifying LTL for Infinite State Systems via Termination Analysis

通过终止分析验证无限状态系统的LTL性质

Nils Lommen, Moritz Leven Rosarius, Jürgen Giesl

AI总结 提出框架MoAT,将无限状态系统的LTL模型检验归约为公平终止问题,并利用终止分析工具KoAT和LoAT进行验证,实验表明与现有工具性能相当。

Comments Presented at WST 2026, 8 pages, 3 figures, 1 table

详情
AI中文摘要

我们证明,现有的终止分析工具非常适合用于无限状态系统的LTL模型检验。为此,我们提出了一个框架MoAT,它采用著名的基于自动机的方法,将LTL模型检验问题归约为公平终止问题。为了证明或反驳公平终止,它在后端调用终止工具KoAT和LoAT。我们的实验表明,通过这种方式,MoAT在无限状态系统的LTL模型检验方面与现有最先进的工具性能相当。

英文摘要

We show that existing tools for termination analysis are extremely well suited for LTL model checking of infinite state systems. To this end, we present a framework MoAT which uses the well-known automata-based approach and reduces the LTL model checking problem to fair termination. To prove or disprove fair termination, it then calls the termination tools KoAT and LoAT in the backend. Our experiments show that in this way, MoAT is on par with existing state-of-the-art tools for LTL model checking of infinite state systems.

2606.17655 2026-06-17 cs.NI 新提交

Integration of 5G and Industrial Digital Models: A Case Study with AGVs

5G与工业数字模型的集成:以AGV为例的案例研究

J. Cañete-Martín, J. Gómez-Jerez, M. C. Lucas-Estañ, J. Gozálvez

AI总结 本文首次将5G数字模型作为资产管理壳(AAS)集成到工业数字模型中,通过OPC UA接口互联,以AGV案例评估5G通信对工业过程生产力和操作的影响。

详情
Journal ref
Proceedings of 2024 IEEE International Conference on Emerging Technologies and Factory Automation (IEEE ETFA 2024), September, 2024, Padova, Italy
AI中文摘要

5G是智能制造数字化的基础技术。智能制造依赖于在制造工厂实施之前使用数字模型优化工业过程。这些模型应考虑5G通信的影响,以充分设计和优化基于5G的工业过程。本文提出了工业数字模型与5G数字模型的首次集成,该5G数字模型作为5G系统的资产管理壳(AAS)实现。两个模型通过基于OPC UA的接口互连。我们使用一个用例评估集成模型的影响,其中自动导引车(AGV)从仓库运输材料到生产线。AGV定期通过5G交换位置以避免潜在碰撞。如果通信失败,AGV出于安全原因停止,直到可以保证可靠的5G连接。我们证明,通过集成5G和工业数字模型,可以计算并量化5G通信对工业过程操作和生产力的影响。这一结果凸显了将5G集成到工业数字模型中以实现联合设计和优化的重要性和必要性。

英文摘要

5G is a fundamental technology for the digitalization of smart manufacturing. Smart manufacturing relies on the use of digital models to optimize industrial processes before implementation on the manufacturing plants. These models should account for the impact of 5G communications to adequately dimension and optimize 5G-based industrial processes. This paper presents the first integration of industrial digital models with a 5G digital model, implemented as an Asset Administration Shell (AAS) of a 5G system. The two models are interconnected using an OPC UA-based interface. We evaluate the impact of the integrated model using a use case where Automated Guided Vehicles (AGVs) transport material from a warehouse to production lines. The AGVs periodically exchange their positions over 5G to avoid potential collisions. If the communications fail, the AGVs stop for safety reasons until a reliable 5G connection can be guaranteed. We demonstrate that, by integrating 5G and industrial digital models, it is possible to account for, and quantify, the impact of 5G communications on the operation and productivity of industrial processes. This result highlights the importance and necessity of integrating 5G into industrial digital models for their joint design and optimization.

2606.17654 2026-06-17 cs.NI 新提交

5G Network Architecture and Configuration Choices to Support Teleoperated Driving at Scale

5G网络架构与配置选择以支持大规模远程驾驶

M. C. Lucas-Estañ, B. Coll-Perales, M. I. Khan, J. Gozálvez, S. S. Avedisov, O. Altintas, M. Sepulcre

AI总结 本文证明MEC或边缘5G网络比集中式网络更适合支持大规模远程驾驶服务,并量化了不同架构和配置下同时远程操作多辆车所需的带宽。

详情
Journal ref
Proceedings of the 2024 IEEE 100th Vehicular Technology Conference (VTC2024-Fall), 7-10 October 2024, Washington DC, USA
AI中文摘要

远程驾驶(ToD)能够实现车辆的远程驾驶或控制。为此,车辆必须将视频流传输到ToD控制中心,以便远程操作员充分了解驾驶状况并安全控制车辆。5G(及未来)网络是部署ToD的基础,因为它们可以提供连接车辆和ToD控制中心所需的低延迟、可靠和宽带连接。然而,目前尚不清楚常见的5G网络架构和配置是否适合支持同时远程操作多辆对上行带宽要求高的车辆,因为当前网络主要配置用于支持移动宽带服务。本文证明,与集中式网络相比,MEC或基于边缘的5G网络更适合支持并扩展ToD服务,并量化了在各种5G网络架构和配置(包括不同双工模式和TDD帧结构)下同时远程操作多辆车所需的带宽。最后,研究表明,控制信道的配置有助于减轻视频馈送处理时间对支持并扩展ToD服务能力的影响。

英文摘要

Teleoperated driving (ToD) enables the remote driving or control of vehicles. For this purpose, vehicles must transmit video feeds to the ToD control center so that the remote operator is fully aware of the driving conditions and can safely control the vehicle. 5G (and beyond) networks are fundamental for the deployment of ToD as they can provide the low latency, reliable and broadband connection necessary to connect the vehicle and ToD control center. However, it is unclear whether common 5G network architectures and configurations are well-suited to support the simultaneous teleoperation of multiple vehicles with demanding uplink bandwidth, as current networks are mainly configured to support mobile broadband services. This paper demonstrates that MEC or edge-based 5G networks are better suited to support and scale the ToD service than centralized networks, and quantifies the bandwidth required to simultaneously teleoperate multiple vehicles under various 5G network architectures and configurations, including different duplexing modes and TDD frame structures. Finally, the study shows that the configuration of the control channels can help mitigate the impact that the processing time of the video feeds has on the capacity to support and scale the ToD service.

2606.17653 2026-06-17 cs.NI 新提交

Predictive Configured Grant Scheduling for Deterministic Wireless Communications

预测性配置授权调度用于确定性无线通信

Syed Morsleen Riaz, M. Carmen Lucas-Estañ, Baldomero Coll-Perales, Javier Gozalvez

AI总结 提出一种基于流量预测并考虑预测误差的预测性配置授权调度方案,通过预分配资源提高满足有界时延需求的能力,支持确定性服务并提升资源利用率。

详情
Journal ref
Proceedings of the 2025 IEEE 101st Vehicular Technology Conference (VTC2025-Spring), Oslo, Norway, 2025
AI中文摘要

未来无线网络必须增强其容量以维持确定性服务水平,并支持关键垂直领域新兴的时间敏感服务。保证有界延迟的能力在很大程度上依赖于高效的无线资源管理。配置授权(CG)调度可以通过预分配资源来减少延迟,但其有效性和效率在可变流量模式下会降低。本研究提出了一种新颖的预测性CG调度方案,该方案基于流量预测预分配资源,同时考虑预测不准确性。通过考虑这些不准确性,该方案显著提高了满足有界延迟要求的能力,这对于支持确定性服务水平至关重要。我们的评估表明,即使在具有不同需求的变体和混合流量场景下,所提出的方案也能显著增强支持确定性服务水平的能力,同时提高资源利用率。

英文摘要

Future wireless networks must enhance their capacity to sustain deterministic service levels and support emerging time-sensitive services in key verticals. The ability to guarantee bounded latencies heavily depends on efficient radio resource management. Configured Grant (CG) scheduling can reduce latency by pre-allocating resources, but its effectiveness and efficiency decrease under variable traffic patterns. This study presents a novel predictive CG scheduling scheme that pre-allocates resources based on traffic predictions while accounting for prediction inaccuracies. By considering these inaccuracies, the scheme significantly improves the ability to meet bounded latency requirements, which are essential for supporting deterministic service levels. Our evaluations show that the proposed scheme significantly enhances the capacity to support deterministic service levels while improving resource utilization, even in scenarios with variable and mixed traffic flows with diverse requirements.