arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.17732 2026-06-17 cs.DS 新提交

Four-Cycle Counting in Low-Degeneracy Graph Streams

低退化度图流中的四环计数

Sebastian Lüderssen, Stefan Neumann, Pan Peng

AI总结提出两种基于子图采样的算法，分别使用两遍和一遍流式扫描，在低退化度图上实现四环数量的(1+ε)近似，空间复杂度达到理论最优或接近最优。

Comments KDD 2026

详情

AI中文摘要

我们研究了在任意顺序边流给出的图中，对四环数量进行$(1+\varepsilon)$近似的问题。我们提出了两种基于采样诱导子图的新算法。第一个贡献是一个两遍算法，使用$\widetilde{O}(\kappa m / \sqrt{T})$空间，其中$m$是边数，$T$是四环数，$\kappa$是图的退化度。该算法改进了现有的理论界限，并且在常数退化度图上被证明是最优的，匹配已知的$\Omega(m/\sqrt{T})$下界（忽略低阶因子）。第二个贡献是一个一遍算法，当四环不是高度集中在单个节点、边或楔形周围时，该算法保持准确；这种结构性质在稀疏社交和协作网络中很常见。我们在各种真实世界图流上评估了这两种算法。两遍算法始终优于最先进的方法，使用更少的空间达到所需的精度。一遍算法在四环均匀分布时具有竞争力，与我们的理论分析一致。与最近的几项工作不同，我们的算法即使在非二分图（如社交网络）上也表现良好。

英文摘要

We study the problem of $(1+\varepsilon)$-approximating the number of four-cycles in graphs given as arbitrary order edge streams. We propose two new algorithms based on sampling induced subgraphs. Our first contribution is a two-pass algorithm that uses $\widetilde{O}(κm / \sqrt{T})$ space, where $m$ is the number of edges, $T$ is the number of four-cycles, and $κ$ is the graph's degeneracy. This algorithm improves upon existing theoretical bounds and is provably optimal for constant-degeneracy graphs, matching the known $Ω(m/\sqrt{T})$ lower bound up to lower-order factors. Our second contribution is a one-pass algorithm that remains accurate when four-cycles are not highly concentrated around individual nodes, edges, or wedges; this structural property is common in sparse social and collaboration networks. We evaluate both algorithms on a variety of real-world graph streams. The two-pass algorithm consistently outperforms state-of-the-art methods, using substantially less space to achieve a desired accuracy. The one-pass algorithm is competitive when four-cycles are evenly distributed, matching our theoretical analysis. Unlike several recent works, our algorithms perform well even on non-bipartite graphs such as social networks.

URL PDF HTML ☆

赞 0 踩 0

2606.17731 2026-06-17 cs.NE 新提交

Evolutionary Algorithms and Multi-Objective Minimum Spanning Trees with Limited Distinct Weight Values

进化算法与具有有限不同权值的多目标最小生成树

Narges Tavassoli Kejani, Andrew M. Sutton, Frank Neumann

AI总结研究当边权取少量不同值时帕累托前沿的结构，基于此推导进化多目标算法的新运行时界，并通过实验验证。

2606.17721 2026-06-17 cs.IR 新提交

Understanding and Debugging Failures in N-Gram-Based Generative Retrieval

理解和调试基于N-Gram的生成式检索中的失败

Richard Takacs, Adrian Bracher, Svitlana Vakulenko

AI总结本文通过分类法、实证分析和可视化工具，系统研究了基于n-gram的生成式检索方法（如SEAL和MINDER）的失败模式，包括歧义文档ID、低标识符多样性和特定标识符的不成比例影响。

Comments Work in progress

2606.17716 2026-06-17 cs.NI 新提交

DPDS: A DPDK-Based Packet Delayer and Spacer

DPDS：基于DPDK的数据包延迟器与间隔器

Etienne Zink, Fabian Ihle, Michael Menth

AI总结提出自适应延迟关联方法，在DPDK上实现高吞吐、零丢包的数据包延迟与间隔器DPDS，优于NetEm和MoonEm。

Comments This work has been submitted to the IEEE Open Journal of the Communications Society for possible publication

详情

AI中文摘要

本文解决了链路仿真中为数据包添加可变延迟的问题。朴素的方法要么增加过多延迟，要么导致数据包重排序，两者都不理想。我们开发了自适应延迟关联，高效地向数据包添加正相关延迟。它以平均延迟和标准差（抖动）作为输入，以及控制延迟动态的半衰期。我们研究了有无带宽限制下所得数据包延迟的准确性和动态性。据此，我们给出了半衰期的配置建议。我们在基于DPDK的数据包延迟器和间隔器（DPDS）中实现了自适应延迟关联，在硬件上测试其性能，并与广泛使用的链路仿真器NetEm以及最近开发的基于DPDK的仿真器MoonEm进行比较。DPDS在恒定延迟下以95 Gbit/s的零丢包吞吐量优于两者，在启用间隔功能时，对于3 ms抖动的可变延迟达到85 Gbit/s。此外，DPDS支持数据包重排序，恒定延迟和可变延迟下的零丢包吞吐量分别为73 Gbit/s和58 Gbit/s，还支持策略和两种丢包模型。

英文摘要

In this paper we tackle the problem of adding varying delay to packets for link emulation. Naive approaches either add more delay than desired or cause packet reordering, both of which are undesirable. We develop adaptive delay correlation, which adds positively correlated delays to packets efficiently. It takes a mean delay and standard deviation (jitter) as input, as well as a half-life period to control the delay dynamics. We investigate the accuracy and dynamics of the resulting packet delays with and without bandwidth limitation. As a result we give a recommendation for the configuration of the half-life period. We implement adaptive delay correlation in a DPDK-based packet delayer and spacer (DPDS), investigate its performance on hardware, and compare it with the widely used link emulator NetEm and the recently developed DPDK-based emulator MoonEm. DPDS outperforms both of them with a zero-loss throughput of 95 Gbit/s for constant delay and, with spacing enabled, 85 Gbit/s for varying delay with 3 ms jitter. Further, DPDS supports packet reordering with zero-loss throughputs of 73 Gbit/s and 58 Gbit/s for constant and varying delay, respectively, as well as policing and two packet loss models.

URL PDF HTML ☆

赞 0 踩 0

2606.17707 2026-06-17 cs.IR 新提交

Do Generative Recommenders Deepen the Information Cocoon? A Closed-Loop Simulation with LLM-powered User Simulators

生成式推荐器会加深信息茧房吗？基于LLM用户模拟器的闭环仿真

Jiyuan Yang, Gengxin Sun, Mengqi Zhang, Lingjie Wang, Yuanzi Li, Hongxi Cui, Xin Xin, Pengjie Ren

AI总结提出闭环仿真框架RecLoop，利用LLM用户代理比较生成式与传统推荐器，发现生成式推荐器在暴露层面不易形成信息茧房，但反馈循环仍会导致编码空间集中，且茧房严重程度取决于分词策略和模型规模。

详情

AI中文摘要

推荐系统缓解了信息过载，但推荐与用户交互之间的重复反馈会强化现有偏好并缩小用户接触范围，形成信息茧房。虽然这一现象在传统序列推荐中已被广泛研究，但其对生成式推荐的影响尚不明确。通过用语义ID（SID）序列替换原子项目ID，生成式推荐器引入了一种不同的推荐机制，其在信息茧房形成中的作用尚未被理解。为了探究生成式推荐器是否会加深信息茧房，我们提出了RecLoop，一个由LLM驱动的用户代理的闭环仿真框架。我们在两个亚马逊数据集上，跨多个反馈周期比较了两种生成式推荐器和两种传统序列基线。除了标准的暴露层面指标，我们还引入了编码空间结构茧房，这是一个模型层面的指标，用于衡量生成的SID空间中的集中度。实验结果表明，生成式推荐器通常比传统基线更不易形成暴露层面的茧房，保持了更广泛的暴露多样性并减缓了跨用户同质化。然而，反馈循环仍可能导致生成的SID空间内出现集中。我们进一步发现，茧房严重程度强烈依赖于分词策略和模型规模：协同信号分词比语义分词产生更强的茧房效应，而更大的模型能保持更大的编码空间多样性，并更好地保留对利基内容的访问。这些发现表明，生成式推荐中的信息茧房不仅受推荐行为影响，还受项目分词和模型能力的影响。我们的代码可从此https URL获取。

英文摘要

Recommender systems alleviate information overload, yet repeated feedback between recommendations and user interactions can reinforce existing preferences and narrow users' exposure, forming information cocoons. While this phenomenon has been widely studied in traditional sequential recommendation, its impact on generative recommendation remains unclear. By replacing atomic item IDs with Semantic ID (SID) sequences, generative recommenders introduce a different recommendation mechanism whose role in information cocoon formation is not yet understood. To investigate whether generative recommenders deepen information cocoons, we propose \textsc{RecLoop}, a closed-loop simulation framework with LLM-driven user agents. We compare two generative recommenders and two traditional sequential baselines on two Amazon datasets across multiple feedback cycles. In addition to standard exposure-level metrics, we introduce \emph{Code-Space Structural Cocoon}, a model-level metric that measures concentration in the generated SID space. Experimental results show that generative recommenders are generally less prone to exposure-level cocoon formation than traditional baselines, preserving broader exposure diversity and slowing cross-user homogenization. However, feedback loops can still induce concentration within the generated SID space. We further find that cocoon severity depends strongly on tokenization strategy and model scale: collaborative-signal tokenization produces stronger cocoon effects than semantic tokenization, whereas larger models maintain greater code-space diversity and better retain access to niche content. These findings suggest that information cocoons in generative recommendation are shaped not only by recommendation behavior, but also by item tokenization and model capacity. Our code is available at https://github.com/Dregen-Yor/RecLoop.

URL PDF HTML ☆

赞 0 踩 0

2606.17703 2026-06-17 cs.SI 新提交

Minimizing Total Biharmonic Distance in Large Graphs via Link Recommendation

通过链接推荐最小化大型图中的总双调和距离

Xinna Zhou, Zhongzhi Zhang

AI总结研究通过添加k条边最小化总双调和距离的问题，提出基于贪心算法和投影法、拉普拉斯求解器、凸包近似等技术的近线性时间算法，在真实数据集上验证了效率和有效性。

Comments This paper has been published in Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1

详情

DOI: 10.1145/3770854.3780204

AI中文摘要

总双调和距离，即网络中每对节点之间双调和距离的总和，是评估网络连通性和鲁棒性的关键指标。在本文中，我们研究了通过向给定图$G$添加$k$条不存在的边来最小化总双调和距离的问题，其中$k$为预算。该问题在计算上具有挑战性。我们证明了该问题的目标函数是单调的但不是超模的。为了解决这个问题，我们提出了时间复杂度为三次的简单贪心算法。为了缓解这些贪心算法的高时间复杂度，我们应用了几种技术，包括投影法、拉普拉斯求解器和凸包近似。这些技术将我们提出的算法的时间复杂度从三次降低到近线性，同时提供了误差保证。最后，在真实数据集上的大量实验证明了我们提出算法的效率和有效性。

英文摘要

The total biharmonic distance, which is the sum of the biharmonic distance between every pair of nodes in a network, is a key metric for evaluating network connectivity and robustness. In this paper, we study the problem of minimizing the total biharmonic distance by adding $k$ nonexistent edges for a given graph $G$ and budget $k$. The problem is computationally challenging. We show that the objective function of the problem is monotone but not supermodular. To solve this problem, we propose simple greedy algorithms with cubic time complexity. To mitigate the high time complexity of these greedy algorithms, we apply several techniques, including the projection method, the Laplacian solver, and convex hull approximation. These techniques reduce the time complexity of our proposed algorithms from cubic to nearly linear while providing error guarantees. Finally, extensive experiments on real datasets demonstrate both the efficiency and effectiveness of our proposed algorithms.

URL PDF HTML ☆

赞 0 踩 0

2606.17693 2026-06-17 cs.LO 新提交

Verifying LTL for Infinite State Systems via Termination Analysis

通过终止分析验证无限状态系统的LTL性质

Nils Lommen, Moritz Leven Rosarius, Jürgen Giesl

AI总结提出框架MoAT，将无限状态系统的LTL模型检验归约为公平终止问题，并利用终止分析工具KoAT和LoAT进行验证，实验表明与现有工具性能相当。

Comments Presented at WST 2026, 8 pages, 3 figures, 1 table

2606.17655 2026-06-17 cs.NI 新提交

Integration of 5G and Industrial Digital Models: A Case Study with AGVs

5G与工业数字模型的集成：以AGV为例的案例研究

J. Cañete-Martín, J. Gómez-Jerez, M. C. Lucas-Estañ, J. Gozálvez

AI总结本文首次将5G数字模型作为资产管理壳（AAS）集成到工业数字模型中，通过OPC UA接口互联，以AGV案例评估5G通信对工业过程生产力和操作的影响。

详情

DOI: 10.1109/ETFA61755.2024.10711119
Journal ref: Proceedings of 2024 IEEE International Conference on Emerging Technologies and Factory Automation (IEEE ETFA 2024), September, 2024, Padova, Italy

AI中文摘要

5G是智能制造数字化的基础技术。智能制造依赖于在制造工厂实施之前使用数字模型优化工业过程。这些模型应考虑5G通信的影响，以充分设计和优化基于5G的工业过程。本文提出了工业数字模型与5G数字模型的首次集成，该5G数字模型作为5G系统的资产管理壳（AAS）实现。两个模型通过基于OPC UA的接口互连。我们使用一个用例评估集成模型的影响，其中自动导引车（AGV）从仓库运输材料到生产线。AGV定期通过5G交换位置以避免潜在碰撞。如果通信失败，AGV出于安全原因停止，直到可以保证可靠的5G连接。我们证明，通过集成5G和工业数字模型，可以计算并量化5G通信对工业过程操作和生产力的影响。这一结果凸显了将5G集成到工业数字模型中以实现联合设计和优化的重要性和必要性。

英文摘要

5G is a fundamental technology for the digitalization of smart manufacturing. Smart manufacturing relies on the use of digital models to optimize industrial processes before implementation on the manufacturing plants. These models should account for the impact of 5G communications to adequately dimension and optimize 5G-based industrial processes. This paper presents the first integration of industrial digital models with a 5G digital model, implemented as an Asset Administration Shell (AAS) of a 5G system. The two models are interconnected using an OPC UA-based interface. We evaluate the impact of the integrated model using a use case where Automated Guided Vehicles (AGVs) transport material from a warehouse to production lines. The AGVs periodically exchange their positions over 5G to avoid potential collisions. If the communications fail, the AGVs stop for safety reasons until a reliable 5G connection can be guaranteed. We demonstrate that, by integrating 5G and industrial digital models, it is possible to account for, and quantify, the impact of 5G communications on the operation and productivity of industrial processes. This result highlights the importance and necessity of integrating 5G into industrial digital models for their joint design and optimization.

URL PDF HTML ☆

赞 0 踩 0

2606.17654 2026-06-17 cs.NI 新提交

5G Network Architecture and Configuration Choices to Support Teleoperated Driving at Scale

5G网络架构与配置选择以支持大规模远程驾驶

M. C. Lucas-Estañ, B. Coll-Perales, M. I. Khan, J. Gozálvez, S. S. Avedisov, O. Altintas, M. Sepulcre

AI总结本文证明MEC或边缘5G网络比集中式网络更适合支持大规模远程驾驶服务，并量化了不同架构和配置下同时远程操作多辆车所需的带宽。

详情

DOI: 10.1109/VTC2024-Fall63153.2024.10758026
Journal ref: Proceedings of the 2024 IEEE 100th Vehicular Technology Conference (VTC2024-Fall), 7-10 October 2024, Washington DC, USA

AI中文摘要

远程驾驶（ToD）能够实现车辆的远程驾驶或控制。为此，车辆必须将视频流传输到ToD控制中心，以便远程操作员充分了解驾驶状况并安全控制车辆。5G（及未来）网络是部署ToD的基础，因为它们可以提供连接车辆和ToD控制中心所需的低延迟、可靠和宽带连接。然而，目前尚不清楚常见的5G网络架构和配置是否适合支持同时远程操作多辆对上行带宽要求高的车辆，因为当前网络主要配置用于支持移动宽带服务。本文证明，与集中式网络相比，MEC或基于边缘的5G网络更适合支持并扩展ToD服务，并量化了在各种5G网络架构和配置（包括不同双工模式和TDD帧结构）下同时远程操作多辆车所需的带宽。最后，研究表明，控制信道的配置有助于减轻视频馈送处理时间对支持并扩展ToD服务能力的影响。

英文摘要

Teleoperated driving (ToD) enables the remote driving or control of vehicles. For this purpose, vehicles must transmit video feeds to the ToD control center so that the remote operator is fully aware of the driving conditions and can safely control the vehicle. 5G (and beyond) networks are fundamental for the deployment of ToD as they can provide the low latency, reliable and broadband connection necessary to connect the vehicle and ToD control center. However, it is unclear whether common 5G network architectures and configurations are well-suited to support the simultaneous teleoperation of multiple vehicles with demanding uplink bandwidth, as current networks are mainly configured to support mobile broadband services. This paper demonstrates that MEC or edge-based 5G networks are better suited to support and scale the ToD service than centralized networks, and quantifies the bandwidth required to simultaneously teleoperate multiple vehicles under various 5G network architectures and configurations, including different duplexing modes and TDD frame structures. Finally, the study shows that the configuration of the control channels can help mitigate the impact that the processing time of the video feeds has on the capacity to support and scale the ToD service.

URL PDF HTML ☆

赞 0 踩 0

2606.17653 2026-06-17 cs.NI 新提交

Predictive Configured Grant Scheduling for Deterministic Wireless Communications

预测性配置授权调度用于确定性无线通信

Syed Morsleen Riaz, M. Carmen Lucas-Estañ, Baldomero Coll-Perales, Javier Gozalvez

AI总结提出一种基于流量预测并考虑预测误差的预测性配置授权调度方案，通过预分配资源提高满足有界时延需求的能力，支持确定性服务并提升资源利用率。

详情

DOI: 10.1109/VTC2025-Spring65109.2025.11174458
Journal ref: Proceedings of the 2025 IEEE 101st Vehicular Technology Conference (VTC2025-Spring), Oslo, Norway, 2025

AI中文摘要

未来无线网络必须增强其容量以维持确定性服务水平，并支持关键垂直领域新兴的时间敏感服务。保证有界延迟的能力在很大程度上依赖于高效的无线资源管理。配置授权（CG）调度可以通过预分配资源来减少延迟，但其有效性和效率在可变流量模式下会降低。本研究提出了一种新颖的预测性CG调度方案，该方案基于流量预测预分配资源，同时考虑预测不准确性。通过考虑这些不准确性，该方案显著提高了满足有界延迟要求的能力，这对于支持确定性服务水平至关重要。我们的评估表明，即使在具有不同需求的变体和混合流量场景下，所提出的方案也能显著增强支持确定性服务水平的能力，同时提高资源利用率。

英文摘要

Future wireless networks must enhance their capacity to sustain deterministic service levels and support emerging time-sensitive services in key verticals. The ability to guarantee bounded latencies heavily depends on efficient radio resource management. Configured Grant (CG) scheduling can reduce latency by pre-allocating resources, but its effectiveness and efficiency decrease under variable traffic patterns. This study presents a novel predictive CG scheduling scheme that pre-allocates resources based on traffic predictions while accounting for prediction inaccuracies. By considering these inaccuracies, the scheme significantly improves the ability to meet bounded latency requirements, which are essential for supporting deterministic service levels. Our evaluations show that the proposed scheme significantly enhances the capacity to support deterministic service levels while improving resource utilization, even in scenarios with variable and mixed traffic flows with diverse requirements.

URL PDF HTML ☆

赞 0 踩 0

2606.17633 2026-06-17 cs.HC 新提交

AdaPT: Adaptive Lesson Plan Transformer for Cross-Regional and Differentiated Instruction

AdaPT：面向跨区域与差异化教学的适应性教案转换器

Yanjie Zhang, Jiajun Zhu, Minyu Wu, Huamin Qu, Sicheng Song

AI总结提出AdaPT系统，利用大语言模型将现有教案转换为适应新区域和学生特征的内容，通过交互界面、结构化表示和解释机制支持教师迭代优化，促进教育公平。

详情

AI中文摘要

由于教育不平等，高质量的教案往往与不同教育环境的需求不匹配。教师通常会修改现有教案以适应新环境，但当前工具侧重于从头生成内容，增加了额外工作量。此外，在支持教师快速适应新学习特征方面仍存在关键缺口。为弥补这些缺口，我们提出AdaPT，一个利用大语言模型支持现有教案转换以用于跨区域和差异化教学的系统。AdaPT具有交互式界面，允许教师输入学生特征，提供结构化的教案表示，提供教案转换的解释，自动调整教案内容以适应新环境，并支持迭代的教师参与式优化。我们通过一项包含9名教师的用户研究和一项包含3名专家的评估对AdaPT进行了评估。结果表明，AdaPT支持教师的工作流程，并为促进教育公平提供了一条有前景的途径。

英文摘要

Due to educational inequality, high-quality lesson plans often mismatch the needs of disparate educational contexts. Teachers typically modify existing lesson plans to fit new contexts, but current tools instead focus on generating content from scratch, creating additional workload. Moreover, a critical gap remains in supporting teachers to quickly adapt to new learning profiles. To bridge these gaps, we present AdaPT, a system leverages LLMs to support transformation of existing lesson plans for cross-regional and differentiated instruction. AdaPT features an interactive interface that allows teachers to input student profiles, offers structured lesson representation, provides explanations for lesson-plan transformations, automatically adapts lesson content for new contexts, and supports iterative, teacher-in-the-loop refinement. We evaluated AdaPT through a user study with 9 teachers and an expert evaluation with 3 specialists. Results show that AdaPT supports workflows of teachers and offers a promising pathway toward promoting educational equity.

URL PDF HTML ☆

赞 0 踩 0

2606.17616 2026-06-17 cs.HC 新提交

Towards Speech Impairment Prediction in German-Speaking Individuals with Amyotrophic Lateral Sclerosis

针对德语肌萎缩侧索硬化症患者的言语障碍预测

Monica Gonzalez-Machorro, Ricarda von Heynitz, Justine Hanslmeier, Finja Grimm, Alexandra-Iulia Deac, Anne Gründel, Isabell Cordts, Björn Schuller

AI总结本研究利用两种临床言语评分，通过交叉和个性化建模范式预测德语ALS患者的言语障碍，发现重复任务在预测言语相关生活质量方面表现最佳。

Comments Paper accepted at Interspeech 2026, Sydney, Australia

详情

AI中文摘要

肌萎缩侧索硬化症（ALS）是一种神经退行性疾病，常因延髓功能障碍影响言语。在本研究中，我们使用两种临床言语相关评分预测ALS患者的言语障碍。我们评估了横截面（跨说话人）和个性化（说话人内）建模范式，并分析了常见言语任务的效用，以促进ALS患者言语数据收集的标准化。对66名德语ALS患者的实验表明，重复任务（/da/-/da/、/da/-/ba/）在预测构音障碍患者生活质量问卷方面取得了最佳的横截面性能（一致性相关系数CCC=0.62），而说话人内设置达到了CCC=0.86。本研究是向德语ALS患者言语障碍预测迈出的初步一步，并突显了自动言语分析作为言语障碍评估支持工具的潜力。

英文摘要

Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease, often affecting speech due to bulbar dysfunction. In this study, we predict speech impairment in people with ALS (pwALS) using two clinical speech-related scores. We evaluate cross-sectional (across speakers) and personalised (within-speaker) modelling paradigms and analyse the utility of common speech tasks to contribute to the standardisation of speech data collection for pwALS. Experiments on a German-speaking cohort of 66 pwALS show that repetition tasks (/da/-/da/, /da/-/ba/) achieved the best cross-sectional performance (Concordance Correlation Coefficient (CCC) = 0.62) for predicting the Quality of Life in the Dysarthric Speaker questionnaire, while the within-speaker setting reached a CCC of 0.86. This study represents an initial step towards speech impairment prediction in German-speaking pwALS and highlights the potential of automated speech analysis as a supportive tool for speech impairment assessment.

URL PDF HTML ☆

赞 0 踩 0

2606.17613 2026-06-17 cs.DC 新提交

From GPU to Microcontroller: Online Ridge Regression for Edge-Deployable Traffic Prediction

从GPU到微控制器：面向边缘部署的在线岭回归交通预测

Suresh Purini, Archit Narwadkar, Deepak Gangadharan

AI总结针对资源受限的边缘设备，提出用每传感器岭回归结合递归最小二乘在线自适应替代复杂神经网络，在PEMS基准上取得最优MAPE，并在ESP32微控制器上实现毫秒级推理。

Comments 8 pages, IEEE Intelligent Transportation Systems Conference, 2026

详情

AI中文摘要

最先进的交通流预测模型，包括图卷积网络和无图MLP，需要跨所有传感器的集中式GPU训练，这使得它们不适用于资源受限的智能交通部署。我们表明，这种复杂性在很大程度上是不必要的。对最近的无图模型GLMST的参数分析显示，将其内部嵌入维度从64减少到4会使MAPE下降不到一个百分点，这表明模型的有效容量远超任务所需。受此发现启发，我们完全用每传感器的岭回归（使用水平对齐的周期特征）结合递归最小二乘（RLS）进行在线自适应来替代神经架构。我们的方法每传感器仅需444个参数（比GLMST少80倍），并在测试时进行在线自适应，在四个PEMS基准中的三个上取得了最佳MAPE，在第四个上保持在1个百分点以内。由于每个传感器的模型是自包含的且仅涉及初等线性代数，整个流程（训练、推理和在线自适应）可在无GPU的边缘硬件上运行。ESP32微控制器（160 MHz，520 KB SRAM）在7.4秒内完成冷启动训练，每次预测和更新在2毫秒内完成且零堆分配；单个树莓派5核心在0.21秒内完成冷启动训练，每次预测和更新在0.26毫秒内完成。

英文摘要

State-of-the-art traffic flow forecasting models, including Graph Convolutional Networks and graph-less MLPs, require centralized GPU training across all sensors, making them impractical for resource-constrained intelligent transportation deployments. We show that much of this complexity is unnecessary. A parametric analysis of the recent graph-less model GLMST reveals that reducing its internal embedding dimension from 64 to 4 degrades MAPE by less than one percentage point, suggesting that the model's effective capacity far exceeds what the task requires. Motivated by this finding, we replace the neural architecture entirely with per-sensor Ridge regression using horizon-aligned periodic features, combined with Recursive Least Squares (RLS) for online adaptation. With only 444 parameters per sensor (80x fewer than GLMST) and test-time online adaptation, our method achieves the best MAPE on three of four PEMS benchmarks, and remains within one percentage point on the fourth. Because each sensor's model is self-contained and involves only elementary linear algebra, the entire pipeline (training, inference, and online adaptation) runs on edge hardware without a GPU. An ESP32 microcontroller (160 MHz, 520 KB SRAM) completes cold-start training in 7.4s and each predict-and-update in under 2ms with zero heap allocation; a single Raspberry Pi 5 core completes cold-start training in 0.21s and each predict-and-update in 0.26ms.

URL PDF HTML ☆

赞 0 踩 0

2606.17612 2026-06-17 cs.SE 新提交

PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices

PracRepair: 受人类调试实践启发的大语言模型赋能自动化程序修复

Yu Cheng, Zhongxin Liu, Zhenchang Xing, Chao Ni, Qing Huang, Xiaoxue Ren

AI总结提出PracRepair框架，通过构建按需静态-动态上下文、进行问题驱动的故障诊断并迭代细化补丁，利用动态信息提升LLM在程序修复中的效果，在Defects4J和真实世界漏洞上取得最优性能。

详情

AI中文摘要

随着软件系统规模和复杂性的增长，调试和修复仍然成本高昂且耗时。大语言模型（LLM）推动了自动化程序修复（APR）的发展，但现有基于LLM的APR方法仍主要依赖静态或检索到的上下文、错误消息和粗粒度的验证结果。因此，它们未能充分利用动态信息来理解故障和修复，包括故障执行动态和补丁验证动态。然而，有效利用这些信息具有挑战性：故障执行轨迹庞大且嘈杂，原始静态-动态上下文缺乏自解释性，而补丁验证动态通常被简化为粗粒度的反馈。为应对这些挑战，我们提出PracRepair，一个完全自动化的基于LLM的APR框架，受人类调试实践启发。PracRepair从有缺陷的程序和故障执行中构建按需的静态-动态上下文，执行问题驱动的故障诊断以形成明确的修复假设，并使用验证诊断和轨迹级行为变化迭代地细化候选补丁。在Defects4J V1.2和V2.0上的实验结果表明，PracRepair持续优于最先进的基线方法。具体而言，在GPT-3.5下，PracRepair在Defects4J V1.2/V2.0上正确修复了139/136个错误，而在GPT-4o下进一步改进至162/171。此外，PracRepair有效泛化到真实世界漏洞（RWB），在多个基础模型上取得了最佳性能。

英文摘要

As software systems grow in scale and complexity, debugging and repair remain costly and time-consuming. Large language models (LLMs) have advanced automated program repair (APR), but existing LLM-based APR approaches still largely rely on static or retrieved context, error messages, and coarse-grained validation outcomes. As a result, they underutilize dynamic information for failure understanding and repair, including failure-execution dynamics and patch-validation dynamics. Effectively leveraging such information, however, is challenging: failure-execution traces are large and noisy, raw static-dynamic context is not self-explanatory, and patch-validation dynamics are often reduced to coarse feedback. To address these challenges, we propose \textsc{PracRepair}, a fully automated LLM-based APR framework inspired by human-like debugging practices. \textsc{PracRepair} constructs an on-demand static-dynamic context from buggy programs and failure executions, performs question-driven failure diagnosis to formulate explicit repair hypotheses, and iteratively refines candidate patches using validation diagnostics and trace-level behavioral changes. Experimental results on Defects4J V1.2 and V2.0 show that \textsc{PracRepair} consistently outperforms state-of-the-art baselines. Specifically, under GPT-3.5, \textsc{PracRepair} correctly fixes 139/136 bugs on Defects4J V1.2/V2.0, while under GPT-4o it further improves to 162/171. Moreover, \textsc{PracRepair} generalizes effectively to RWB (Real-World Bugs), achieving the best performance across multiple foundation models.

URL PDF HTML ☆

赞 0 踩 0

2606.17610 2026-06-17 cs.DL cs.CY 新提交

Beyond Citations: Comparing Scholarly, Policy, and Patent Impact Across the FT50 Journals

超越引用：比较FT50期刊的学术、政策和专利影响力

Arash Hajikhani, Yi Zhang, Mengjia Wu

AI总结通过分析53种FT50期刊在学术引用、政策采纳和专利引用三个维度的表现，发现期刊影响力存在显著异质性，单一引用排名与多维排名仅中度相关，近半数期刊在纳入政策和专利指标后四分位变化。

Comments 28 pages, 9 figures, 1 table. Analysis of 53 FT50 and recently removed journals using citation, policy, and patent impact indicators. Submitted manuscript

详情

AI中文摘要

《金融时报》50强（FT50）期刊列表影响着全球商学院的招聘、晋升、认证和研究评估。然而，列表中的期刊通常被视为代表同质化的卓越层级。我们通过比较53种FT50及近期移除的期刊在三个不同影响力渠道的表现来检验这一假设：学术影响力（领域加权引用和可见性）、政策采纳以及通过专利引用实现的技术影响力。利用2005年至2019年间超过60,000篇出版物的面板数据，我们发现隐藏在二元FT50标签下的显著异质性。精英经济学期刊主导政策影响力，信息系统和市场营销期刊引领技术影响力，而许多高被引管理学期刊在学术界之外的影响力有限。引用、政策和专利指标在很大程度上是独立的影响力维度，仅基于引用的排名与多维排名仅中度相关。一旦纳入政策和专利指标，近半数期刊的四分位发生变化，表明仅基于学术引用的评估忽视了研究影响力的重要维度。尽管FT50仍被广泛用作期刊质量的二元分类，但我们的结果揭示了列表内部存在显著的影响力谱系，并表明期刊排名对影响力的定义和测量方式高度敏感。

英文摘要

The Financial Times 50 (FT50) journal list shapes hiring, promotion, accreditation, and research evaluation across business schools worldwide. Yet journals on the list are typically treated as if they represent a homogeneous tier of excellence. We test this assumption by comparing 53 FT50 and recently removed journals across three distinct impact channels: scholarly influence (field-weighted citations and visibility), policy uptake, and technological reach through patent citations. Using a panel of more than 60,000 publications from 2005 to 2019, we find striking heterogeneity hidden beneath the binary FT50 label. Elite economics journals dominate policy influence, information systems and marketing journals lead technological impact, while many highly cited management journals exhibit limited reach beyond academia. Citation, policy, and patent indicators behave as largely independent dimensions of impact, with a citation-only ranking correlating only moderately with a multidimensional ranking. Nearly half of all journals change quartile once policy and patent indicators are incorporated, demonstrating that assessments based solely on scholarly citations overlook important dimensions of research influence. While the FT50 remains widely used as a binary classification of journal quality, our results reveal a substantial within-list impact spectrum and show that journal rankings are highly sensitive to how impact is defined and measured.

URL PDF HTML ☆

赞 0 踩 0

2606.17593 2026-06-17 cs.SE 新提交

Why Model Credibility Isn't Enough: -Rethinking Trust in Simulation Architectures

为什么模型可信度不够？——重新思考仿真架构中的信任

Romain Barbedienne, Adeline Lanugue, Rim Kaddah, Julien Silande, Anthony Levillain, Cedric Leclerc, Maxime Hayet, Boussaad Soualmi, Cristian Maxim

AI总结本文探讨仿真架构的可信度评估问题，综述装配可信度领域现状，比较敏感性分析、专家定性分析、AI可解释性和网络方法，并基于严谨性、泛化性和资源需求评估各方法优劣。

Comments Annual Congress of Japan Society of Automotive Engineers (JSAE), May 2026, Yokohama, Japan

2606.17582 2026-06-17 cs.DB 新提交

Collaborative Large and Small Language Models for Accurate and Scalable Data Repair

协作式大小语言模型实现准确且可扩展的数据修复

Qian Chen, Jianwei Wang, Wenjie Zhang

AI总结提出LasRepair框架，利用大语言模型作为指导者选择全局修复上下文，小语言模型作为校正者高效修复错误数据，并通过EM过程和置信度加权进一步提升修复质量。

Comments 14 pages, 11 figures

详情

AI中文摘要

我们研究数据修复问题，这是数据清洗中的关键任务，旨在纠正原始数据集中的错误条目以提高整体数据质量。尽管近期基于数据驱动的方法，特别是基于大型语言模型（LLM）的方法取得了显著性能，但我们观察到：(i) 它们直接在原始且低质量的上下文中修复数据，这可能损害学习信号；(ii) 它们直接使用不确定的模型输出作为修复结果，可能引入不可靠的修正并损害修复质量。受小型语言模型（SLM）的效率和LLM的能力的启发，并旨在解决上述局限性，我们提出了LasRepair，一个协作大小语言模型进行数据修复的框架。LasRepair使用LLM作为指导者，选择全局修复上下文来引导SLM。SLM作为校正者，使用选定的上下文更高效地修复错误数据。此外，为了进一步提高上下文质量，我们将LasRepair扩展为LasRepair+，它将数据修复公式化为期望最大化（EM）过程，交替进行E步（更新校正者参数）和M步（细化修复上下文）。此外，为了减轻模型不确定性，我们提出了LasRepair++，它使用列校准的模型置信度在更新校正者时降低不可靠修复行的权重，从而增强修复质量。理论分析和实证评估证明了我们方法的优越性。我们从理论上证明了EM风格过程和基于置信度的加权的有效性。在真实数据集上的实验表明，LasRepair++相比最强基线平均F1分数提高了18.1%。

英文摘要

We study the problem of data repair, a key task in data cleaning that corrects erroneous entries in raw datasets to improve overall data quality. Although recent data-driven methods, especially those based on large language models (LLMs), achieve remarkable performance, we observe that: (i) they directly repair data in the raw and low-quality context, which may compromise learning signals, and (ii) they directly use uncertain model outputs as repairs, potentially introducing unreliable corrections and compromising repair quality. Motivated by the efficiency of small language models (SLMs) and the capabilities of LLMs, and aiming to address the above limitations, we propose LasRepair, a framework that collaborates Large and small language models for data repair. LasRepair employs an LLM as an instructor, which selects a global repair context to guide the SLM. The SLM acts as a corrector, using the selected context to repair erroneous data more efficiently. Moreover, to further improve context quality, we extend LasRepair to LasRepair+, which formulates data repair as an Expectation-Maximisation (EM) procedure that alternates between an E-step for updating the corrector parameters and an M-step for refining the repair context. Furthermore, to mitigate model uncertainty, we propose LasRepair++, which uses column-calibrated model confidence to down-weight unreliable repaired rows when updating the corrector, thereby enhancing repair quality. Theoretical analysis and empirical evaluation demonstrate the superiority of our methods. We theoretically prove the effectiveness of the EM-style procedure and the confidence-based weighting. Experiments on real-world datasets show that LasRepair++~ achieves an average F1-score improvement of 18.1% over the strongest baseline.

URL PDF HTML ☆

赞 0 踩 0

2606.17578 2026-06-17 cs.DS 新提交

Exact Algorithms for Edge Deletion to Cactus Graphs and Weighted Variants

删除边到仙人掌图的精确算法及加权变体

Wenhao Song

AI总结针对删除最少边使图变为连通仙人掌的问题，提出O*(2^n)时间复杂度的精确算法，并推广到有限不同代价的加权情形。

2606.17575 2026-06-17 eess.SY cs.SY 新提交

Dynamic Analysis of Centralized Energy Storage Systems -- A Comparison between Grid-following and Grid-forming Controls

集中式储能系统的动态分析——电网跟随与电网形成控制的比较

Qiang Fu, Siqi Bu, Yang Wang, Mingyu Yan

AI总结本文通过小信号稳定性分析，比较集中式储能系统中电网跟随与电网形成控制的动态特性，发现单一控制类型具有动态叠加特征，而混合控制需限制电网形成比例以避免模态共振。

Comments This paper has been accepted for publication in IEEE TRANS POWER SYSTEMS, 2026. The final version of record will be available via IEEE Xplore

详情

Journal ref: IEEE TRANS POWER SYSTEMS, 2026

AI中文摘要

本研究使用电网跟随（GFL）和电网形成（GFM）控制，特别关注双向功率流和多个储能系统（ESS），研究了集中式储能系统（CESS）的小信号稳定性。为了解决考虑全面GFL和GFM控制回路时CESS的复杂动态问题，通过关注主导振荡模式，使用虚拟阻尼方法简化了高阶动态。阻尼分析验证了使用单一类型控制（GFL或GFM）的CESS具有动态叠加特性。具体来说，随着ESS数量增加，GFM-CESS的阻尼改善，而GFL-CESS的阻尼下降。阻尼灵敏度表明，GFM-CESS的阻尼对双向功率流和所有控制回路更敏感，而GFL-CESS的阻尼对d轴控制回路更敏感。因此，GFM-CESS更适合大规模集成，但在功率反转显著的情况下受到限制。如果在CESS中混合使用GFL和GFM控制，应限制GFM-CESS的比例，以避免GFL-CESS和GFM-CESS之间的模态共振导致不稳定。这强调实施GFM-CESS需要考虑场景限制，而不是在混合集成条件下追求最大集成。通过模态分析和时域仿真验证了结论。

英文摘要

This study investigates the small-signal stability of centralized energy storage systems (CESSs) using grid-following (GFL) and grid-forming (GFM) controls, particularly focusing on bidirectional power flow and multiple energy storage systems (ESSs). To address the issue of complex dynamics in CESSs when comprehensive GFL and GFM control loops are considered, high-order dynamics are simplified using the virtual damping method by focusing on the dominant oscillation mode. Damping analysis verifies that CESSs using a single-type control (either GFL or GFM) have dynamic superimposition characteristics. Specifically, as ESS number increases, the damping of GFM-CESSs improves but that of GFL-CESSs decreases. The damping sensitivity shows that the damping of GFM-CESSs is more sensitive to bidirectional power flow and all control loops, whereas that of GFL-CESSs is more sensitive to d-axis control loop. Consequently, GFM-CESSs are preferred for large-scale integration but are limited in scenarios with significant power reversal. If GFL and GFM controls are hybridized in CESSs, the ratio of GFM-CESSs should be constrained to avoid instability from modal resonance between GFL-CESSs and GFM-CESSs. This highlights that implementing GFM-CESSs necessitates considering scenario limitations rather than pursuing maximal integration under hybrid integration conditions. The conclusions are validated through modal analysis and time-domain simulations.

URL PDF HTML ☆

赞 0 踩 0

2606.17573 2026-06-17 cs.OS cs.CR 新提交

Cordon: Semantic Transactions for Tool-Using LLM Agents

Cordon: 工具使用型LLM代理的语义事务

Zheng Chen, Hanqing Liu, Duling Xu, Dong Dong, Jialin Li, Bangzheng Pu, Jidong Zhai

AI总结提出Cordon事务运行时系统，通过语义事务边界实现多步代理工作流的提交、回滚、恢复和审计，减少不可逆效应失败并暴露跨步违规。

详情

AI中文摘要

使用工具的LLM代理正在将计算单元从显式的人类指令转变为具有状态后果的模型驱动任务。然而，今天的代理运行时仍然将工具暴露为孤立的RPC。这种接口为运行时提供了便捷的集成点，但缺乏用于跨多步代理工作流进行提交、回滚、恢复和审计的任务范围执行边界。我们认为，这种不匹配需要运行时隔离边界，而不是另一个每次调用的护栏。本文介绍了Cordon，一个事务性运行时系统，用于在提交前暂存和验证不可逆的代理效应。语义事务是一个任务级执行边界，它将工具意图和运行时跟踪的结果谱系绑定到可逆的本地状态、暂存的外部效应、委托的权限和审计元数据。Cordon通过一个事务管理器实现这种抽象，该管理器跟踪派生结果对象，在影子状态中执行可逆突变，在效应发件箱中暂存面向外部的动作，并记录恢复元数据。然后，运行时在提交状态或释放外部效应之前验证组合的执行流程。我们在对抗性和良性工作流上的评估表明，Cordon暴露了现有防御措施遗漏的跨步违规。它还在保持良性任务完成的同时，以适度的批准和延迟开销减少了不可逆效应失败。

英文摘要

Tool-using LLM agents are shifting the unit of computation from explicit human-issued commands to model-driven tasks with stateful consequences. Yet today's agent runtimes still expose tools as isolated RPCs. This interface gives runtimes a convenient integration point, but it lacks a task-scoped execution boundary for commit, rollback, recovery, and audit across multi-step agent workflows. We argue that this mismatch calls for a runtime containment boundary rather than another per-call guardrail. This paper introduces Cordon, a transactional runtime system for staging and validating irreversible agent effects before commit. A semantic transaction is a task-level execution boundary that binds tool intents and runtime-tracked result lineage to reversible local state, staged external effects, delegated authority, and audit metadata. Cordon implements this abstraction with a transaction manager that tracks derived result objects, executes reversible mutations in shadow state, stages outward-facing actions in an effect outbox, and records recovery metadata. The runtime then validates the composed execution flow before it commits state or releases external effects. Our evaluation across adversarial and benign workflows shows that Cordon exposes cross-step violations missed by existing defenses. It also reduces irreversible-effect failures while preserving benign task completion with modest approval and latency overhead.

URL PDF HTML ☆

赞 0 踩 0

2606.17568 2026-06-17 eess.SY cs.SY 新提交

Instability Caused by Integration of IBRs under Strong Grid Connections -- A Practical Case Study on Large-scale Energy Storage Systems

强电网连接下IBR集成引起的失稳——大规模储能系统的实际案例研究

Qiang Fu, Siqi Bu, Zijun Bin, Peng Li, Tong Wang

AI总结本文通过大规模储能系统案例，揭示强电网连接下逆变器基资源（IBR）因PCS间动态交互叠加导致振荡失稳，强调功能控制设计与规模规划的重要性。

Comments This paper has been accepted for publication in IEEE TRANS POWER SYSTEMS, 2026. The final version of record will be available via IEEE Xplore

详情

AI中文摘要

众所周知，逆变器基资源（IBRs）在弱电网连接下可能导致换流器驱动的稳定性问题。然而，随着IBR数量的增加，即使在强电网连接下也可能发生失稳。本文以大规模储能系统（ESSs）为例，展示了一个实际案例来证明这一结论。在本研究中，ESS在向所连接电力系统提供容性和感性无功支持（通过ESS功能控制回路实现）时，在d-q坐标系中诱发了频率为150 Hz的振荡。理论分析表明，在强电网连接下，随着ESS规模扩大，ESS的功率转换系统（PCSs）之间的动态相互作用可能叠加并增强，从而降低振荡阻尼并导致系统失稳。这表明ESS功能控制回路在向电力系统提供支持时也存在潜在的失稳风险，应仔细检查。最后，确定了减轻振荡的主要影响因素，并基于SIMULINK平台验证了结论。本文为即使在强电网条件下系统失稳提供了有价值的实际见解，强调了功能控制设计和IBR主导系统规模仔细规划的重要性。

英文摘要

It has been well known that inverter-based resources (IBRs) can lead to converter-driven stability issues under weak grid connections. However, as the number of IBRs increases, instabilities can also occur even under strong grid connections. A practical case is presented to demonstrate this conclusion, using large-scale energy storage systems (ESSs) as an example. In this study, the ESSs induce oscillations with a frequency of 150 Hz in the d-q coordinates while providing both capacitive and inductive reactive power support (achieved by ESS functional control loops) to the connected power system. Theoretical analysis reveals that under strong grid connections, the dynamic interactions among power conversion systems (PCSs) of ESSs can be superimposed and intensified as the ESS scale extends, which reduces oscillation damping and leads to system instability. This indicates that ESS functional control loops also have potential instability risks when providing supports to power systems, which should be carefully examined. Finally, major impact factors are identified to mitigate the oscillations, and the conclusions are validated based on the SIMULINK platform. This paper provides valuable practical insights into system instabilities even under strong grid conditions, emphasizing the importance of functional control design and careful planning of the scale for IBR-dominated systems.

URL PDF HTML ☆

赞 0 踩 0

2606.17565 2026-06-17 eess.SY cs.SY 新提交

Stability Analysis in Large-scale Centralized Bidirectional Inverter-based Stations Connected to Bulk Power Systems through AC and DC Connections

大规模集中式双向逆变器站通过交流和直流连接接入大电网的稳定性分析

Qiang Fu, Wenjuan Du, Siqi Bu, Haifeng Wang

AI总结研究大规模双向逆变器站引起的次同步振荡稳定性问题，比较交流和直流连接的影响，发现直流连接可降低失稳风险，且参数调节更有效。

Comments Accepted for publication in IEEE TRANS POWER SYSTEMS, 2026

详情

Journal ref: IEEE TRANS POWER SYSTEMS, 2026

AI中文摘要

大量受控直流资源（如电池储能系统）通过双向逆变器连接到交流电力系统以满足功率平衡需求。本研究探讨了由大规模双向逆变器站（IBS）引起的次同步频率范围内的换流器驱动稳定性（CDS）问题。通过考察三个因素：受控直流资源的数量、功率流向以及逆变器的控制参数，比较了IBS的交流和直流连接对次同步振荡（SSO）的影响。对于交流连接，无论功率流向如何，随着受控直流资源数量的增加，IBS可能引发不稳定。为了保持稳定，计算了IBS的最大功率幅值。发现如果直流线路电阻远小于交流线路电抗，切换到直流连接可以降低这些失稳风险。此外，在直流连接下，调节控制参数的方法在改善与功率相关的临界稳定性方面更为有效。因此，直流-IBS更适用于高压输电。最后，在不同网络拓扑和系统规模下，连接有交流-和直流-IBS的电力系统中验证了这些结论。

英文摘要

Massive controlled DC resources (CDCRs), such as battery energy storage systems, are connected to AC power systems through bidirectional inverters for power balance requirements. This study investigates converter-driven stability (CDS) issues in the sub-synchronous frequency range caused by large-scale bidirectional inverter-based stations (IBSs). The impacts of the AC and DC connections of IBSs on subsynchronous oscillations (SSOs) are compared by examining three factors: the number of CDCRs, power flow direction, and control parameters of the inverters. For AC connections, IBSs may induce instability as the number of CDCRs increases, regardless of the power flow direction. To maintain stability, the maximum power amplitude of the IBS is calculated. It is found that switching to DC connections can reduce these instability risks if the DC line resistance is much less than the AC line reactance. Moreover, the method of tuning control parameters is demonstrated to be more effective in improving power-related critical stability under DC connections. Therefore, The DC-IBS is preferred for high-voltage transmission. Finally, the conclusions are validated in power systems connected with both AC- and DC-IBSs under various network topologies and system scales.

URL PDF HTML ☆

赞 0 踩 0

2606.17562 2026-06-17 cs.CR cs.SY eess.SY 新提交

Anywhere, Any-Stymie: Remote Activation of Trojan Malware on LiDAR with Modulated Signals

任意地点，任意干扰：利用调制信号远程激活LiDAR上的特洛伊恶意软件

R. Spencer Hallyburton, Miroslav Pajic

AI总结本研究设计了一种嵌入LiDAR固件的特洛伊恶意软件，并利用光学触发通过调制信号远程激活，实现虚假目标注入和真实目标抑制，威胁自动驾驶安全。

详情

AI中文摘要

LiDAR传感器广泛应用于自主系统的3D感知和安全关键决策。我们识别了一个先前未探索的攻击面：嵌入LiDAR感知流程中的休眠恶意软件在正常操作期间保持不活跃，部署后可通过外部触发激活，无需在攻击时访问传感器硬件或网络。为实现这一威胁，我们设计了能够进行低级点云操作的恶意软件，并将其嵌入LiDAR固件。该恶意软件是在封闭的研究测试环境中，在供应商技术支持下开发的，而非利用固有的生产供应链漏洞。为了选择性触发攻击激活，我们设计并实现了一种光学触发器，通过向感知环境传递调制信号远程激活恶意软件。一旦触发，恶意软件执行实时点云操作，我们在静态和移动受害者平台上演示了虚假目标注入和真实目标抑制。我们的评估首先建立了攻击可行性，包括300英尺的静态操作和达到35英里/小时的记录驱动通过。然后，我们定量说明注入的人形伪影可以被最先进的3D目标检测器在语义上检测到。最后，我们在部署的战术自主车辆上展示了多种安全关键影响模式。这些结果共同强调了在整个LiDAR传感器开发和部署流程中加强完整性保证的必要性。

英文摘要

LiDAR sensors are widely deployed in autonomous systems for 3D perception and safety-critical decision-making. We identify a previously unexplored attack surface in which dormant malware embedded in the LiDAR sensing pipeline remains inactive during normal operation and can be externally triggered after deployment, without requiring access to sensor hardware or networking at attack time. To operationalize this threat, we design malware capable of low-level point-cloud manipulation and embed it into LiDAR firmware. This malware was developed in a closed research test environment with vendor technical support, rather than by exploiting an inherent production supply-chain vulnerability. To selectively trigger attack activation, we design and implement an optical trigger that remotely activates the malware by delivering a modulated signal into the sensing environment. Once triggered, the malware performs real-time point cloud manipulation, and we demonstrate false object injection and real object suppression on static and mobile victim platforms. Our evaluation first establishes attack feasibility, including static operation at 300~ft and recorded drive-by runs reaching 35~mph. We then illustrate quantitatively that injected person-like artifacts can remain semantically detectable by a state-of-the-art 3D object detector. Finally, we demonstrate multiple modes of safety-critical impact on a deployed tactical autonomous vehicle. Together, these results highlight the need for stronger integrity guarantees throughout the LiDAR sensor development and deployment pipeline.

URL PDF HTML ☆

赞 0 踩 0

2606.17533 2026-06-17 cs.CR 新提交

SNAS: A Multi-Layer Defense-in-Depth Architecture for Secure Egress in Sandboxed Workloads

SNAS：一种用于沙箱工作负载安全出口的多层纵深防御架构

Niranjan Kumar Sharma, S Muralidhar, Samy Boshra-Riad, Mike Halcrow, Yuxiong He, Nitya Kumar Sharma, Shawn Xia, Haowei Yu, Elliott Brossard, Derek Denny-Brown, Choden Konigsmark, Bhanu Prakash, Brandon Baker, Andong Zhan

AI总结针对沙箱工作负载的外部连接需求，提出SNAS架构，结合eBPF包过滤、GENEVE覆盖网络和分布式出口代理，实现低开销的策略驱动出口控制，并已在Snowflake全区域部署。

Comments 10 pages, 7 figures. Accepted at the 53rd IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2026), June 23-26, 2026

详情

AI中文摘要

Snowpark通过在安全沙箱中执行用户自定义函数，支持Snowflake中的数据工程和AI/ML工作负载。许多此类工作负载需要外部连接以访问云API、外部数据库或特征存储，这带来了一个可靠性挑战：如何在保持严格多租户隔离和资源公平性的同时提供透明的网络访问。本文提出了Snowpark中的安全网络访问（SNAS），一种用于沙箱工作负载安全外部通信的生产架构。SNAS结合了扩展伯克利包过滤器（eBPF）包过滤、通用网络虚拟化封装（GENEVE）覆盖网络和分布式出口代理，以实现低开销的策略驱动出口控制。我们描述了SNAS的设计、部署和测量的生产行为，包括使用最早出发时间（EDT）算法的基于eBPF的带宽限制器、双层策略执行以及连接限制和端口耗尽的保护措施。SNAS已部署在所有Snowflake区域，支持大规模生产工作负载，包括PB级数据传输和延迟敏感的外部集成。

英文摘要

Snowpark enables data engineering and AI/ML workloads in Snowflake by executing user-defined functions in secure sandboxes. Many of these workloads require external connectivity to access cloud APIs, external databases, or feature stores, creating a dependability challenge: how to provide transparent network access while preserving strict multi-tenant isolation and resource fairness. This paper presents Secure Network Access in Snowpark (SNAS), a production architecture for secure external communication from sandboxed workloads. SNAS combines Extended Berkeley Packet Filter (eBPF) packet filtering, Generic Network Virtualization Encapsulation (GENEVE) overlay networks, and distributed egress proxies for policy-driven egress control with low overhead. We describe the design, deployment, and measured production behavior of SNAS, including an eBPF-based bandwidth limiter using the Earliest Departure Time (EDT) algorithm, dual-tier policy enforcement, and safeguards for connection limiting and port exhaustion. SNAS is deployed across all Snowflake regions and supports large-scale production workloads including petabyte-scale data transfer and latency-sensitive external integrations.

URL PDF HTML ☆

赞 0 踩 0

2606.17521 2026-06-17 cs.MM 新提交

DiffPC: Diffusion-Based Projector Photometric Compensation

DiffPC: 基于扩散的投影仪光度补偿

Yuxi Wang, Haibin Ling, Bingyao Huang

AI总结提出一种基于扩散模型的光度补偿方法，将投影失真建模为环境相关加性噪声，通过扩散模型逐步去噪生成补偿图像，并设计融合光度与内容特征的噪声估计网络，在未知场景中取得更优视觉表现。

详情

AI中文摘要

投影仪光度补偿纠正由表面纹理、反射和环境光照引入的颜色失真。现有的深度学习方法通常需要专业的场景特定数据收集，且缺乏对感知质量的考虑。为解决这一限制，我们提出一种基于扩散的光度补偿方法，在光度与内容感知引导下重建补偿图像。具体地，我们首先将投影过程中引入的光度失真建模为环境相关的加性噪声，从而将光度补偿问题重新表述为具有物理约束的去噪任务。接着，我们引入一个扩散模型，通过遵循加性轨迹迭代去除噪声来生成补偿图像。最后，为准确估计每个时间步的噪声，通过分析投影和拍摄物理过程中导致失真的因素，我们设计了一个融合光度感知与内容条件特征的噪声估计网络。实验表明，我们的方法在未知场景中实现了优越的视觉性能，因此相较于先前方法展现出显著的实际优势。

英文摘要

Projector photometric compensation corrects color distortions introduced by surface texture, reflection, and ambient lighting. Existing deep learning-based methods usually require professional scene-specific data collection and lack consideration for perceptual quality. To address this limitation, we present a diffusion-based photometric compensation method that reconstructs compensation images under photometric and content-aware guidance. Specifically, we first model the photometric distortions introduced during projection as environment-dependent additive noise, thereby reformulating the photometric compensation problem as a denoising task with physical constraints. Next, we introduce a diffusion model, which generates compensation images by following an additive trajectory to iteratively remove the noise. Finally, to accurately estimate the noise at each timestep, by analyzing the factors that contribute to distortions in the physical process of projection and capturing, we design a noise estimation network that incorporates features of both photometry-aware and content conditions. Experiments show that our method achieves superior visual performance in unknown scenarios, thereby exhibiting significant practical advantages over prior methods.

URL PDF HTML ☆

赞 0 踩 0

2606.17518 2026-06-17 cs.DC 新提交

SpecGen: Accelerating Agentic Kernel Optimization with Speculative Generation

SpecGen: 利用推测生成加速智能体内核优化

Jihu Guo, Sitian Lu, Tenghui Ma, Wei Gao, Zhisheng Ye, Xingcheng Zhang, Dahua Lin

AI总结针对智能体内核优化中生成延迟长、反馈不足和资源利用率低的问题，提出SpecGen系统，通过推测生成在推理过程中提前产生候选内核，并行验证剖析，动态调整资源，并利用远程KV缓存减少前缀重计算，显著降低端到端时间并提升内核加速比。

详情

AI中文摘要

智能体内核优化通过迭代生成、验证和剖析，利用推理LLM自动进行手动GPU内核调优，将优化任务转化为反馈引导的搜索。然而，我们的工作负载特征揭示了三个限制搜索效率的系统级低效问题：(1) LLM推理导致的长生成延迟，(2) 不充分的剖析反馈，(3) 验证/剖析资源利用不足。我们的关键洞察是，正在进行的推理生成在完成之前暴露了一个产生额外候选内核的窗口，允许系统在出现满意内核时提前终止推理。我们提出了SpecGen，一个具有推测生成的智能体内核优化系统。首先，SpecGen在推理轨迹中精心选择的触发点分叉出非推理生成以产生内核，增加每次迭代的候选内核数量。这些内核与正在进行的推理并行进行验证和剖析，增加剖析反馈，并在生成期间保持资源忙碌。当内核满足终止条件时，SpecGen提前终止推理生成以减少生成延迟。其次，SpecGen根据到达率动态重新分配验证和剖析GPU池，并优先处理请求，以减少突发推测生成负载下的剖析反馈延迟。此外，SpecGen利用验证/剖析GPU的空闲内存作为远程KV缓存存储，以消除在有限内存预算下推测生成的前缀重计算。在H200上使用两个推理LLM的实验表明，与三个基线系统相比，SpecGen减少了端到端时间，同时产生了更多的剖析反馈，提高了资源利用率，并在固定的时间和令牌预算下改善了内核加速比。

英文摘要

Agentic kernel optimization automates manual GPU kernel tuning via iterative generation, validation, and profiling with reasoning LLMs, casting the optimization task as feedback-guided search. However, our workload characterization reveals three system-level inefficiencies that limit search efficiency: (1) long generation latency due to LLM reasoning, (2) insufficient profiling feedback, and (3) underutilized validation/profiling resources. Our key insight is that the ongoing reasoning generation exposes a window for producing additional candidate kernels before it completes, allowing the system to terminate reasoning early once a satisfactory kernel appears. We present SpecGen, an agentic kernel optimization system with \emph{speculative generation}. First, SpecGen forks non-reasoning generations at well-chosen trigger points in the reasoning trace to yield kernels, increasing the candidate kernel count per iteration. These kernels are validated and profiled in parallel with the ongoing reasoning, increasing profiling feedback, and keeping resources busy during generation. When a kernel meets the termination criterion, SpecGen terminates the reasoning generation early to reduce the generation latency. Second, SpecGen dynamically reallocates validation and profiling GPU pools based on the arrival rate and prioritizes requests to reduce profiling feedback latency under bursty speculative generation load. Furthermore, SpecGen utilizes spare memory of the validation/profiling GPUs as remote KV cache storage to eliminate prefix recomputation of speculative generations under limited memory budget. Experiments with two reasoning LLMs on H200 show that SpecGen reduces end-to-end time over three baseline systems, while producing more profiling feedback, increasing resource utilization, and improving kernel speedup under a fixed time and token budget.

URL PDF HTML ☆

赞 0 踩 0

2606.17512 2026-06-17 cs.HC 新提交

MedEasy: Designing AI Standardized Patients for Clinical Consultation Training

MedEasy：为临床咨询培训设计AI标准化患者

Zhiqi Gao, Huarui Luo, Guo Zhu, Bingquan Zhang, Dongyijie Primo Pan, Yizhan Feng, Jiahuan Pei, Jie Li, Benyou Wang

AI总结提出多智能体系统MedEasy，通过患者对话、临床操作、决策提交、文档记录和反馈组织虚拟患者练习，基于形成性研究设计分阶段工作流，评估表明学习者将其视为连贯的咨询环境。

详情

AI中文摘要

AI标准化患者正成为临床咨询专业培训的一种设置。本文介绍MedEasy，一个多智能体系统，通过患者对话、临床操作、决策提交、文档记录和反馈来组织虚拟患者练习。我们首先通过与12名临床年级医学生的访谈和三次协同设计工作坊进行了形成性研究。研究结果指导了分阶段工作流、结构化病例记录、行动相关发现和基于轨迹的回顾。然后，我们与另一组12名临床年级医学生进行了评估性用户研究，每位参与者完成两个平衡的病例。学习者将MedEasy解释为一个连贯的咨询环境。他们结合患者反应、检查发现、可用操作和反馈来判断所呈现的病例是否保持连贯。他们重视可重复的练习和记录的回顾，同时对缺失的操作和反馈标准提出质疑。本文为使用病例特定标准连接情境实践的AI支持专业培训系统提供了设计启示。

英文摘要

AI standardized patients are becoming a setting for professional training in clinical consultation. This paper presents MedEasy, a multi-agent system that organizes virtual-patient practice through patient dialogue, clinical actions, decision submission, documentation, and feedback. We first conducted a formative study with 12 clinical-year medical students through interviews and three co-design workshops. The findings informed a staged workflow, structured case records, action-contingent findings, and trajectory-based review. We then conducted an evaluative user study with a separate cohort of 12 clinical-year medical students, with each participant completing two counterbalanced cases. Learners interpreted MedEasy as a connected consultation environment. They used patient responses, examination findings, available actions, and feedback together to judge whether the represented case remained coherent. They valued repeatable practice and recorded review, while questioning missing actions and feedback criteria. The paper contributes design implications for AI-supported professional training systems that use case-specific standards to connect situated practice.

URL PDF HTML ☆

赞 0 踩 0

2606.17510 2026-06-17 cs.SE cs.SY eess.SY 新提交

OmniDroneX: An LLM-Assisted Holistic Drone-as-a-Service Ecosystem

OmniDroneX: 一种LLM辅助的全方位无人机即服务生态系统

I-Ling Yen, Akeem Mohammed, Farokh Bastani, San-Yih Hwang

AI总结提出OmniDroneX统一无人机即服务生态系统，通过libUAV接口和PT-SOA抽象模型连接底层物理与高层任务，利用大语言模型辅助功能识别、服务组合和自然语言任务定义，支持多种组合技术以实现可扩展、自演进的无人机系统。

Comments Ongoing project paper that is going to be submitted to JointCloud Computing

详情

AI中文摘要

尽管无人机技术取得了快速进步，但由于无人机系统研究中的若干空白，当前部署仍然有限。为应对这些挑战，我们提出OmniDroneX，一个统一的无人机即服务生态系统，其中无人机从固定功能平台转变为动态可组合实体，可与外部基础设施集成以提供全方位能力。OmniDroneX通过统一的供应商无关接口（libUAV）和形式化的物理服务抽象模型（PT-SOA）连接底层物理原语与高层任务意图。一个核心创新是大语言模型（LLM）在OmniDroneX架构多层中的多样化应用。LLM用于辅助识别和形式化原始设备功能及抽象服务定义，支持自动化服务组合和工作流生成，并实现交互式自然语言任务规范与细化。OmniDroneX还包含了动态无人机系统中至关重要的多种组合技术类别，包括用于无人机能力增强的物理层组合，以及时空、功能、协作、异常感知和基于QoS的服务组合。总体而言，这些特性使OmniDroneX能够作为在复杂动态环境中运行的可扩展、有弹性和自演进的无人机生态系统的基础。

英文摘要

Despite rapid advances in UAV technologies, current deployments remain limited due to several gaps in UAV systems research. To address these challenges, we propose OmniDroneX, a unified Drone-as-a-Service ecosystem, in which drones are transitioned from fixed function platforms into dynamically composable entities that can be integrated with external infrastructures to offer omni-capabilities. OmniDroneX bridges low-level physical primitives with high-level mission intent through a unified vendor-agnostic interface (libUAV) and a formal physical-service abstraction model (PT-SOA). A core innovation is the diverse application of large language models (LLMs) across multiple layers of the OmniDroneX architecture. LLMs are used to assist in identifying and formalizing primitive device functions and abstract service definitions, supporting automated service composition and workflow generation, and enabling interactive, natural-language mission specification and refinement. OmniDroneX also incorporates important categories of composition techniques that are essential in dynamic UAV systems, including physical layer composition for drone capability augmentation, as well as spatiotemporal, functional, collaborative, exception-aware, and QoS-based service compositions. Collectively, these features allow OmniDroneX to serve as a foundation for scalable, resilient, and self-evolving UAV ecosystems operating in complex and dynamic environments.

URL PDF HTML ☆

赞 0 踩 0

2606.17509 2026-06-17 eess.SY cs.SY 新提交

Data-Driven Stabilizing Controller Design for Linear Infinite Networks

线性无限网络的数据驱动镇定控制器设计

Mahdieh Zaker, Andrii Mironchenko, Amy Nejati, Abolfazl Lavaei

AI总结提出一种直接数据驱动方法，利用噪声污染的输入-状态轨迹和线性矩阵不等式，为未知线性时不变子系统构造eISS控制Lyapunov函数和镇定反馈控制器，并通过无限维小增益条件组合为全局控制器，实现无限网络的指数稳定性。

Comments This paper has been accepted at the 27th International Symposium on Mathematical Theory of Networks and Systems (MTNS)

2606.17481 2026-06-17 cs.NI cs.ET 新提交

RATIO: Redundancy-Controlled Stochastic Routing for Reliable Vehicular Multi-Hop Networking

RATIO: 用于可靠车载多跳网络的冗余控制随机路由

Lei Lei, Xudong Wang

AI总结提出RATIO路由协议，通过构建加权有向无环图并采用基于模数的随机转发规则，实现连续可控的冗余度，在保证高可靠性的同时降低延迟和开销。

Comments 19 pages, 8 figures

详情

AI中文摘要

车载网络中可靠、低延迟的多跳数据传输需求日益增长，但由于高移动性和间歇性阻塞导致的频繁路由失效，这一目标仍具挑战性。基于冗余的路由通过多路径转发数据包来增强鲁棒性，但过度复制会加剧竞争并引入额外延迟，凸显了精细管理冗余-可靠性权衡的必要性。然而，传统的确定性多路径复制通常将数据包复制到整数个分支，使得冗余度难以调节并适应车载网络中时变的网络动态。为此，本文提出了冗余控制随机（RATIO）路由。对于每个活跃流，RATIO构建一个加权简化有向无环图（DAG）作为路由结构，其中边权重指定每链路转发概率。在分叉节点，允许总出向转发概率超过1，并采用基于模数的随机转发规则来保证可行转发，从而实现连续可控的冗余度。理想化的RATIO设计被形式化为一个负载最小化优化问题，受限于每流的及时可靠性和链路容量约束，但在时变无线动态下该问题通常是难解的。因此，开发了一种实用的启发式算法H-RATIO。H-RATIO通过取候选路径的并集构建紧凑的简化DAG，并通过局部评分和复制调整迭代优化转发概率。广泛的基于轨迹的SUMO/ns-3联合仿真表明，与基线相比，RATIO/H-RATIO始终实现最高的及时数据包投递率，同时提供显著更好的投递效率，尤其是在高负载场景下。

英文摘要

Reliable, low-latency multi-hop data delivery in vehicular networks is increasingly demanded, yet remains challenging due to frequent route failures caused by high mobility and intermittent blockage. While redundancy-based routing enhances robustness by forwarding packets over multiple paths, over-replication intensifies contention and introduces additional delay, highlighting the need to carefully managing redundancy--reliability trade-off. However, conventional deterministic multi-path replication typically duplicates packets to an integer number of branches, making the redundancy level hard to tune and adapt to time-varying network dynamics in vehicular networks. To this end, Redundancy-Controlled Stochastic (RATIO) routing is proposed in this paper. For each active flow, RATIO constructs a weighted reduced directed acyclic graph (DAG) as the routing structure, where edge weights specify per-link forwarding probabilities. At fork nodes, the aggregate outgoing forwarding probability is allowed to exceed one and a modulo-based stochastic forwarding rule is employed to guarantee feasible forwarding, thereby enabling continuously controllable redundancy. An idealized RATIO design is formulated as a load-minimizing optimization subject to per-flow timely-reliability and link-capacity constraints, but the problem is generally intractable under time-varying wireless dynamics. Accordingly, a practical heuristic, termed H-RATIO, is developed. H-RATIO constructs a compact reduced DAG by taking the union of candidate paths and optimizes forwarding probabilities via local scoring and replication-adjustment iterations. Extensive trace-driven SUMO/ns-3 co-simulations demonstrate that RATIO/H-RATIO consistently achieves the highest timely PDR compared to baselines, while providing substantially better delivery efficiency, especially under high-load scenarios.

URL PDF HTML ☆

赞 0 踩 0