arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.02960 2026-06-03 cs.SE

Many a Little Makes a Mickle: A Code-Centric Empirical Study of Data Minimization Principle in Android App Development

积少成多：以代码为中心的Android应用开发中数据最小化原则的实证研究

Dianshu Liao, Shidong Pan, Zhenchang Xing, Xiaoyu Sun

AI总结通过对1114个开源Android应用的实证研究，识别出五个数据处理阶段中的十种数据最小化场景，并基于9875个真实APK的大规模分析提炼出31条可操作的编码指南，同时发现LLM生成的代码会重现数据最小化风险实践，而纳入指南可消除这些问题。

详情

AI中文摘要

现代移动应用消耗大量数据以运行，引发了显著的隐私担忧和监管挑战。虽然先前的工作主要通过政策分析检测合规差距，但缺乏为开发者在代码层面实现隐私原则的可操作指导。在本文中，我们聚焦于数据最小化作为开发者可操作的原则，并研究其在Android应用中的实现。我们对1114个开源Android应用进行了形成性研究，识别出五个数据处理阶段中的十种重复出现的数据最小化场景。在此基础上，我们对9875个真实世界APK进行了大规模分析，并提炼出31条可操作的编码指南以支持隐私合规开发。我们进一步检查了Android开发中基于LLM的代码生成，发现最先进的模型一致地重现了数据最小化风险实践，表明它们继承并放大了真实世界代码中的模式。令人鼓舞的是，纳入我们的指南在所有评估模型中消除了这些问题。我们的工作倡导转向在代码级根本原因上响应隐私监管要求，从而在人类和AI辅助编程中实现更好的合规性。

英文摘要

Modern mobile applications consume large amounts of data to function, raising significant privacy concerns and regulatory challenges. While prior work has primarily focused on detecting compliance gaps through policy analysis, there remains a lack of actionable guidance for developers to implement privacy principles at the code level. In this paper, we focus on data minimization as a developer-operationalizable principle and investigate its realization in Android applications. We conduct a formative study on 1,114 open-source Android apps to identify ten recurring data minimization scenarios across five data-handling stages. Building on this, we perform a large-scale analysis of 9,875 real-world APKs and distill 31 actionable coding guidelines to support privacy-compliant development. We further examine LLM-based code generation in Android development and find that state-of-the-art models consistently reproduce data minimization-risky practices, indicating that they inherit and amplify patterns from real-world code. Encouragingly, incorporating our guidelines eliminates these issues across all evaluated models. Our work advocates a shift toward responding to privacy regulatory requirements at their code-level root causes, enabling better compliance in both human and AI-assisted programming.

URL PDF HTML ☆

赞 0 踩 0

2606.02950 2026-06-03 cs.ET

Powering An Ecosystem Of Pedagogical AI Agents: A Validation Strategy For A Unified Data Architecture

赋能教学型AI代理生态系统：统一数据架构的验证策略

Natalia Theodora, Ploy Thajchayapong, Ashok K. Goel

AI总结针对教学型AI代理生态系统中的数据架构验证问题，提出一种两阶段测试方法，确保功能多样性和实际可扩展性，并在大规模在线项目中成功处理超过270万条请求。

详情

AI中文摘要

人工智能在教育中的应用已从单一的整体式智能辅导系统演变为多样化的教学代理生态系统，包括对话助手、虚拟教练和自适应导师。这种转变需要统一且可扩展的数据架构来管理人类教师、学习者以及各种AI代理之间的复杂信息反馈循环。数据架构的设计、开发和部署随之引发了一个关键问题：验证。本文通过描述一种实用的验证策略来满足这一关键需求，该策略针对作为国家AI成人学习和在线教育研究所的AI增强成人学习数据架构一部分的高容量数据管道。我们的方法涉及两阶段测试方法，以确保功能多样性和实际可扩展性。首先，QA环境使用合成数据和真实数据的混合，验证来自学习者和代理交互产生的各种事件类型的功能正确性。随后，生产环境成功处理了来自大规模在线项目的真实事件数据，在21次成功运行中总计超过270万次生产请求。此验证过程揭示了数据隐私的关键见解，这是处理来自多个AI代理数据源的各种数据时的一个关键挑战。通过概述可复制的统一数据骨干测试策略，本研究为旨在构建和支持其自身异构AI驱动学习工具套件的机构和开发者提供了清晰的框架。关键词：教学代理，学习生态系统，数据架构，验证，可扩展性，学习分析。

英文摘要

The application of AI in education has evolved from monolithic intelligent tutoring systems to a diverse ecosystem of pedagogical agents, including conversational assistants, virtual coaches, and adaptive tutors. This shift requires a unified and scalable data architecture to manage the complex information feedback loops between human instructors, learners, and the varied AI agents. The design, development, and deployment of the data architecture in turn raises a critical issue of validation. This paper addresses this critical need by describing a practical validation strategy for a high-volume data pipeline developed as part of a data architecture for AI-augmented adult learning at the National AI Institute for Adult Learning and Online Education. Our approach involves a two-stage testing methodology to ensure both functional diversity and real-world scalability. First, the QA environment uses a blend of synthetic and real-world data to validate functional correctness across various event types produced from learner and agent interactions. Following this, the production environment successfully processed a total of over 2.7 million production requests across 21 successful runs carrying authentic event data from a large-scale online program. This validation process surfaced crucial insights into data privacy, a key challenge when handling varied data from multiple AI agent data sources. By outlining a replicable testing strategy for a unified data backbone, this research offers a clear framework for institutions and developers aiming to build and support their own heterogeneous suites of AI-powered learning tools. Keywords: Pedagogical Agents, Learning Ecosystems, Data Architecture, Validation, Scalability, Learning Analytics.

URL PDF HTML ☆

赞 0 踩 0

2606.02949 2026-06-03 eess.SY cs.SY

Power System CBFs

电力系统控制障碍函数

Abdallah Alalem B. Albustami, Ahmad F. Taha, Taylor T. Johnson

AI总结针对电力系统微分代数方程模型，提出一种支持动态和代数变量安全约束的控制障碍函数框架，实现实时安全滤波与离线形式化验证。

详情

AI中文摘要

控制障碍函数（CBFs）已成为安全关键控制系统中的标准工具。CBFs 将状态约束转化为实时控制条件，确保前向不变性（即一旦系统从安全区域开始，它将永远停留在该区域），并在安全受到威胁时最小程度地修改标称控制器。在电力系统中，基于 CBF 的方法已被提出用于频率和电压安全，但它们很大程度上与电力系统运行的三个关键特征脱节：捕获网络潮流约束的微分代数方程（DAE）模型、涉及代数变量（如母线电压）的安全规范，以及所得闭环系统的形式化验证。本文通过为电力系统 DAE 模型开发一个支持动态和代数变量安全约束的 CBF 框架来弥补这一差距。该框架通过一个优化层提供实时安全滤波，该优化层包裹现有控制器并最小程度地修改其指令以强制执行安全。此外，它通过基于离线可达性的安全运行证书提供形式化验证（即所有可接受轨迹满足规定安全约束的数学保证）。结果是一种统一的滤波与验证方法，用于在保持底层模型 DAE 结构的同时，强制执行和认证电力系统中的频率与电压安全。

英文摘要

Control barrier functions (CBFs) have become a standard tool in safety critical-control systems. CBFs convert state constraints into real time control conditions that certify forward invariance (meaning that once the system starts in a safe region, it remains there for all future times) and minimally modify a nominal controller only when safety is at risk. In power systems, CBF based methods have been proposed for frequency and voltage safety, but they largely remain disconnected from three key features that are central to power system operation: differential algebraic equation (DAE) models that capture network power flow constraints, safety specifications involving algebraic variables such as bus voltages, and formal verification of the resulting closed loop system. This paper closes this gap by developing a CBF framework for power system DAE models that supports safety constraints on both dynamic and algebraic variables. The framework provides real time safety filtering through an optimization layer that wraps around an existing controller and minimally modifies its command to enforce safety. In addition, it provides formal verification (i.e., a mathematical guarantee that all admissible trajectories satisfy the prescribed safety constraints) through an offline reachability based certificate of safe operation. The result is a unified filter and verify methodology for enforcing and certifying frequency and voltage safety in power systems while preserving the DAE structure of the underlying model.

URL PDF HTML ☆

赞 0 踩 0

2606.02934 2026-06-03 cs.CR cs.IT math.IT

Quantifying Side-Channel Leakage in Public Metrology Releases

量化公共计量发布中的侧信道泄漏

Faruk Alpay, Taylan Alpay

AI总结本文形式化并量化了公共科学和计量发布中侧信道泄漏的风险，提出了一种基于统计侧信道审计的框架，并导出了有限带宽传输泄漏定律。

Comments 30 pages, 7 figures, 8 tables; ancillary reproducibility package included

详情

AI中文摘要

公共科学和计量发布可能泄漏产生它们的隐藏设置。我们形式化并量化了这种风险，将其视为一种轮廓统计侧信道审计：发布映射暴露了功率谱密度（PSD）的有限带宽统计量，轮廓观察者在明确预算下训练标记模板谱，挑战发布从两个效用等价的配方中抽取，这些配方由一个受保护坐标分隔。平均PSD bin遵循伽马信道，当bin相关时，由协方差加权对数谱信道替代；这产生了精确的Kullback-Leibler散度、Chernoff指数、受保护比特优势界限以及有限训练、有限库、有限计算和模型失配校正。我们的主要结果是一个有限带宽传输泄漏定律：在消除幅度和模糊后，受保护的酸传输信息满足$I_{\lambda|\alpha,eta}(K) = (64/1225)\, w \lambda^{6} K^{9} + O(w \lambda^{8} K^{11})$，其中$K\lambda \ll 1$，这是一个具有闭式安全带的九阶指数。一个逐步协议将测量发布转化为这些数字，一个固定种子的可重复性包重新生成每个表格和图形。我们将该审计应用于筛选的极紫外（EUV）粗糙度谱作为模型条件案例研究，下一步是部署在测量发布上。

英文摘要

Public scientific and metrology releases can leak the hidden settings that produced them. We formalize and quantify this risk as a profiled statistical side-channel audit: a release map exposes finite-band statistics of a power spectral density (PSD), a profiled observer trains labeled template spectra under an explicit budget, and a challenge release is drawn from one of two utility-equivalent recipes separated by a protected coordinate. Averaged PSD bins follow a gamma channel, replaced by a covariance-weighted log-spectrum channel when the bins are correlated; this yields exact Kullback-Leibler divergences, Chernoff exponents, protected-bit advantage bounds, and finite-training, finite-library, finite-compute, and model-mismatch corrections. Our headline result is a finite-band transport-leakage law: after amplitude and blur are eliminated, the protected acid-transport information obeys $I_{λ|α,β}(K) = (64/1225)\, w λ^{6} K^{9} + O(w λ^{8} K^{11})$ for $Kλ\ll 1$, a ninth-order exponent with a closed-form safe band. A step-by-step protocol turns a measured release into these numbers, and a fixed-seed reproducibility package regenerates every table and figure. We instantiate the audit on screened extreme-ultraviolet (EUV) roughness spectra as a model-conditioned case study, with deployment on measured releases the next step.

URL PDF HTML ☆

赞 0 踩 0

2606.02933 2026-06-03 cs.HC

Characterization and Effects of CS2 Learning with GenAI, Visualization, and Human Support

CS2学习中的生成式AI、可视化与人类支持的特征及影响

Quinton Yong, Anthony Estey, Miguel Nacenta

AI总结通过混合方法研究，比较二年级算法课程中学生在生成式AI、算法可视化和人类辅导三种学习支持下的交互模式与学习效果，发现生成式AI虽提升自我效能但学习效果较差，而人类辅导效果最佳。

Comments Accepted at the ACM Conference on International Computing Education Research (ICER 2026)

详情

DOI: 10.1145/3765964.3811663

AI中文摘要

生成式AI（GenAI）正成为学生和教师广泛采用的学习支持工具，因为它提供了个性化辅导和支架式学习等好处。然而，最近的研究强调了潜在的缺点，如过度依赖和元认知问题，尤其是在新手程序员中。大多数先前的工作集中在入门编程课程上，关于GenAI负面影响的潜在机制以及当学生学习更高级的计算机科学概念时这些发现是否具有普遍性的重要问题仍然存在。为了弥补这一差距，我们进行了一项混合方法研究，比较了学生在二年级算法课程中与GenAI以及两种传统学习支持——算法可视化（AV）和人类实时辅导（LT）的交互。十二名学生参加了三个90分钟的学习环节，重点关注排序、树和图算法。我们记录了注视和交互数据，每个环节结束时进行测试以评估他们对主题的概念理解。我们的分析分类了参与者在问题解决过程中何时寻求帮助，并比较了三种学习支持下的交互模式。尽管GenAI相比实时辅导在自我效能方面产生了更大的提升，但它与显著较低的学习成果相关。我们发现参与者没有有效使用算法可视化，在使用GenAI学习高级主题时面临使用障碍，并且实时辅导产生了最高的学习成果。

英文摘要

Generative AI (GenAI) is becoming a widely adopted learning support tool for both students and instructors, as it offers benefits such as personalized tutoring and scaffolded learning. However, recent research highlights potential drawbacks such as overreliance and metacognitive issues, especially in novice programmers. Most prior work focuses on introductory programming courses, and important questions remain about the underlying mechanisms behind the negative effects of GenAI and if findings can be generalized when students learn more advanced computer science concepts. To address this gap, we conducted a mixed-methods study comparing student interactions with GenAI to two traditional learning supports in a second-year algorithms course: algorithm visualization (AV) and human live tutoring (LT). Twelve students participated in three 90-minute study sessions focusing on sorting, tree, and graph algorithms. We recorded gaze and interaction data, and each session concluded with a test assessing their conceptual understanding of the topic. Our analysis classifies when during the problem-solving process participants sought help, and compares the interaction patterns across the three learning supports. Although GenAI produced a larger increase in self-efficacy compared to live tutoring, it was associated with noticeably lower results in learning outcomes. We found that participants did not use algorithm visualizations effectively, faced usage barriers when using GenAI to learn advanced topics, and that live tutoring yielded the highest learning outcomes.

URL PDF HTML ☆

赞 0 踩 0

2606.02916 2026-06-03 cs.DC

GreenGNN: Energy-Aware Windowed Communication Optimization for Distributed GNN Training

GreenGNN：面向分布式GNN训练的能耗感知窗口化通信优化

Arefin Niam, Tevfik Kosar, M. S. Q. Zulkar Nine

AI总结提出GreenGNN系统，通过窗口化通信优化和离线模拟器选择窗口大小，在分布式GNN训练中减少通信能耗并提升吞吐量。

详情

AI中文摘要

大规模图神经网络（GNN）训练通常需要分布式集群，因为图结构和特征张量不再适合单个节点的内存。在基于采样的训练中，每个小批量扩展成一个跨分区的感受野，每轮触发数千次远程特征获取。这浪费能量有两个主要原因：每个小RPC支付固定的启动和协议成本，并且GPU在等待远程特征时继续消耗大量基线功率。我们提出GreenGNN，一种能耗感知的分布式GNN训练系统，通过利用邻居采样的突发性、短时时间局部性来减少通信能量。GreenGNN将训练分组为W个连续小批量的窗口，将每个窗口的热特征暂存到本地缓存，并将来自每个分区所有者的远程请求合并为少量批量传输。这摊销了跨多个特征的RPC开销，同时为缓存未命中保留按需路径。由于窗口大小控制通信摊销与热集陈旧性之间的权衡，GreenGNN使用离散事件模拟器离线选择W，该模拟器使用混合能量模型重放确定性的一轮访问轨迹。我们在DGL上实现GreenGNN，并在4节点GPU集群上使用基准数据集进行评估。在各种数据集和批量大小下，与基线相比，GreenGNN将系统总能量降低27-43%，同时端到端吞吐量提升高达3.9倍。GPU能量下降36-71%，这是由于更少的RPC启动和更低的GPU停顿时间。

英文摘要

Large-scale graph neural network (GNN) training often requires distributed clusters because graph structure and feature tensors no longer fit in a single node's memory. In sampling-based training, each mini-batch expands into a receptive field that spans partitions and triggers thousands of remote feature fetches per epoch. This wastes energy for two main reasons: each small RPC pays a fixed initiation and protocol cost, and GPUs continue drawing substantial baseline power while waiting for remote features. We present GreenGNN, an energy-aware distributed GNN training system that reduces communication energy by exploiting the bursty, short-lived temporal locality of neighbor sampling. GreenGNN groups training into windows of W consecutive mini-batches, stages each window's hot features in a local cache, and merges remote requests from each partition owner into a small number of bulk transfers. This amortizes RPC overhead across many features while preserving an on-demand path for cache misses. Because window size controls the trade-off between communication amortization and hot-set staleness, GreenGNN selects W offline using a discrete-event simulator that replays a deterministic one-epoch access trace with a hybrid energy model. We implement GreenGNN on DGL and evaluate it on a 4-node GPU cluster with benchmark datasets. Across datasets and batch sizes, GreenGNN reduces total system energy by 27--43% relative to baseline while improving end-to-end throughput by up to 3.9x. GPU energy drops by 36--71%, driven by fewer RPC initiations and lower GPU stall time.

URL PDF HTML ☆

赞 0 踩 0

2606.02905 2026-06-03 cs.DL

Speaker Mining -- FAIR Data on Public Broadcasts for Question Answering

Speaker Mining -- 面向问答的公共广播FAIR数据

Tim Wittenborg, Omar Imad Remmo, Claudia Frick, Lena John, Oliver Karras, Sören Auer

AI总结提出一个可扩展的FAIR数据策展框架，整合ZDF档案、维基百科和维基数据，自动消歧并构建知识图谱，支持基于SPARQL的问答，覆盖8436个规范人物及23527次出场。

Comments 17 pages, 5 figures, submitted to TPDL 2026

详情

AI中文摘要

公共广播处于公民话语的中心：传统的电视脱口秀，以及新兴的播客和网络视频形式，捕捉并引导我们社会的注意力，塑造公民如何接触政治、科学和社会问题。然而，对这些形式的系统性甚至简单分析面临类似的挑战：嘉宾和内容元数据稀缺、短暂、碎片化且不标准化。所进行的研究和回答的问题基于广泛、费力但孤立的数据策展工作，这些工作仅捕捉了相关领域的一小部分。本文旨在通过一个面向公共广播中FAIR数据策展的可扩展框架来解决这一问题。在15个广播节目上评估，该管道将ZDF档案PDF、this http URL和维基数据聚合为一个统一的知识图谱。来自这三个来源的31,817个候选嘉宾提及中，17,729个可以自动消歧，另外5,958个通过使用OpenRefine进行64小时的手动协调完成。结果发布在this http URL并链接到维基数据，支持基于性别、年龄、职业或机构隶属关系的SPARQL问答，覆盖8,436个规范人物，在6,469个对齐的剧集中出现23,527次。我们的迭代经验表明，正确消歧和去重来自异构源的演讲者数据需要可持续基础设施的专门努力。为了使公共广播上的可扩展和可靠问答对每个人都可访问，我们建议促进关联开放数据的潜力：推进像本文这样的对齐和利用方法，特别是向众包开发和策展方向发展，同时也推动公共广播服务提供商提供更多FAIR数据接口。

英文摘要

Public broadcasts are at the center of civic discourse: Traditional television talk shows, alongside emerging podcast and web video formats, capture and guide the attention of our societies, shaping how citizens encounter politics, science, and societal issues. Yet, systematic or even simple analyses of these formats face similar challenges: guest and content metadata are scarce, fleeting, fragmented, and not standardized. Research conducted and questions answered are based on extensive, laborious, yet isolated data-curation efforts that capture only a fraction of the relevant landscape. This work seeks to address this issue using a scaling-oriented framework for FAIR data curation in public broadcasting. Evaluated on 15 broadcasting programs, the pipeline aggregates ZDF Archive PDFs, fernsehserien.de, and Wikidata into a unified knowledge graph. Of the 31,817 candidate guest mentions from these three sources, 17,729 could be automatically disambiguated, further 5,958 via 64 hours of manual reconciling using OpenRefine. Results are published at speakermining.wikibase.cloud and linked to Wikidata, enabling SPARQL-based question answering based on gender, age, occupation, or institutional affiliation across 8,436 canonical persons with 23,527 appearances in 6,469 aligned episodes. Our iterative experience reveals that correctly disambiguating and deduplicating speaker data from heterogeneous sources demands dedicated effort on sustainable infrastructure. For scalable and reliable question answering on public broadcasts to be accessible to everyone, we recommend fostering the potential of linked open data: Advancing alignment and utilization approaches like this work, particularly towards crowdsourced development and curation, but also more FAIR data interfaces from public broadcast service providers.

URL PDF HTML ☆

赞 0 踩 0

2606.02878 2026-06-03 eess.SY cs.SY

Package-Embedded Coupled Inductor Arrays for High-Performance Computing Power Delivery

用于高性能计算供电的封装嵌入式耦合电感阵列

Rami Rasheedi, Salma Abdelzaher, Inna Partin-Vaisband

AI总结提出一种包含封装嵌入式电感拓扑和电感岛方法的新型供电框架，通过紧密耦合的螺旋方形电感阵列实现高电感密度（250 nH/mm²）和电流密度（10 A/mm²），在40A负载下效率提升高达11.04%。

Comments 11 page, 13 figures, 7 tables, accepted for publication in IEEE Transactions on Components, Packaging, and Manufacturing Technology (T-CPMT), Special Section on Vertical Power Delivery for Next-Generation Advanced Packaging Systems

详情

AI中文摘要

引入了一种新型供电框架，包括封装嵌入式电感拓扑和电感岛方法，以最大化垂直供电（VPD）中的电感密度和电流密度。该框架利用多个多相转换器（高性能计算系统中的常见策略）来提高效率和可扩展性。所提出的拓扑采用紧密耦合的螺旋方形电感阵列，共享一个公共磁棒，服务于工作在相同转换相位的多个转换器。该阵列经过优化以最大化耦合并最小化转换损耗，实现了优异的电感密度（250 nH/mm²）和电流密度（10 A/mm²）。在系统层面，电感岛方法将供电网络划分为多个岛，每个岛专用于一个转换器相位并为负载电流的一部分供电，从而实现可扩展和高效的分配。为了验证该框架，在ANSYS Maxwell 3D和Mechanical中设计和仿真了电感阵列，在2A负载电流、6V输入和10MHz开关频率下，平均品质因数为23.6，效率为97.4%。从ANSYS提取电感阵列网表，并在Cadence Virtuoso中与分布式双相电源转换系统协同设计，确保无源和有源组件的联合优化。与使用非耦合电感的类似转换器相比，协同设计的转换器在40A负载下平均效率提升5.65%，最高达11.04%，展示了该方法的实际优势。

英文摘要

A novel power delivery framework, comprising a package-embedded inductor topology and an inductance-island methodology, is introduced to maximize both inductance and current densities in vertical power delivery (VPD). The framework leverages multiple multi-phase converters, a common strategy in high-performance computing systems, to enhance efficiency and scalability. The proposed topology employs an array of tightly coupled spiral square inductors sharing a common magnetic rod, serving multiple converters operating in the same conversion phase. The array is optimized to maximize coupling and minimize conversion losses, achieving superior inductance and current densities of 250 nH/mm^2 and 10 A/mm^2, respectively. At the system level, the inductance-island methodology partitions the power delivery network into multiple islands, each dedicated to a converter phase and supplying a portion of the load current, thereby enabling scalable and efficient distribution. To validate the framework, the inductor array is designed and simulated in ANSYS Maxwell 3D and Mechanical, exhibiting an average quality factor of 23.6 and efficiency of 97.4% at 2 A load current, 6 V input, and 10 MHz switching frequency. The inductor array netlist is extracted from ANSYS and co-designed in Cadence Virtuoso with a distributed dual-phase power conversion system, ensuring joint optimization of passive and active components. The co-designed converter achieves a significant efficiency gain of 5.65% on average and up to 11.04% at 40 A load over a similar converter with uncoupled inductors, demonstrating the practical benefits of the approach.

URL PDF HTML ☆

赞 0 踩 0

2606.02869 2026-06-03 cs.CE

ZOAF: Towards Efficient Zeroth-Order Optimization for Analog/RF Circuit Design

ZOAF：面向模拟/射频电路设计的高效零阶优化方法

Liyan Tan, Yequan Zhao, Jinming Lu, Ben F. Jamroz, Ari Feldman, Zheng Zhang

AI总结提出零阶模拟/射频框架（ZOAF），通过少量黑盒仿真恢复梯度下降方向，结合混合ZO调度、准随机多起点和滑动窗口监测等技术，在模拟/射频电路优化中实现更优的中值最终值和鲁棒性，并将仿真调用次数减少1.3-3.8倍。

Comments Preprint. Under review

详情

AI中文摘要

电路优化是模拟/射频集成电路设计中不可或缺的一步。由于无法访问仿真器源代码以及实现伴随方法的技术障碍，经典的基于梯度的快速优化方法通常不可行。因此，基于代理的黑盒优化在实践中被广泛使用；然而，构建代理模型成本高昂且对超参数敏感，而群体启发式方法在严格的仿真调用预算下往往收敛缓慢且评估次数多。为了解决这些限制，我们提出了零阶模拟/射频框架（ZOAF），该框架从少量黑盒电路仿真中恢复梯度下降方向，结合了基于梯度的优化和黑盒优化的优势。我们还采用了多种无代理技术来提高效率和准确性，包括：(1) 混合ZO调度方法，在随机方向ZO（用于预算高效的探索）和坐标方向ZO（用于精确的后期细化）之间切换；(2) 一次性准随机多起点以集中评估；(3) 滑动窗口监测器，触发提前停止和盒投影更新以保持可行性。在三个不同的电路图上评估，ZOAF始终优于最先进的基线，在每个报告的性能指标上实现了最佳中值最终值——在22参数两级放大器上中值峰值优势达一个数量级——同时在不同种子下具有最鲁棒的最差情况行为，并将收敛所需的仿真调用次数减少了1.3-3.8倍。代码在此https URL公开。

英文摘要

Circuit optimization is an indispensable step in analog/RF IC design. Classical fast gradient-based optimization methods are typically infeasible due to lack of access to simulator source code and the technical barriers to implementing adjoint methods. Therefore, surrogate-based black-box optimization is widely used in practice; however, it can be costly to build and sensitive to hyperparameters, whereas population heuristics often suffer from slow convergence and large evaluation counts under tight simulator-call budgets. To address these limitations, we propose the Zeroth-Order Analog/RF Framework (ZOAF), which recovers gradient-descent directions from a small number of black-box circuit simulations, combining the benefits of both gradient-based optimization and black-box optimization. We also employ several surrogate-free techniques to improve the efficiency and accuracy, including (1) a hybrid ZO scheduling method that switches between random-direction ZO for budget-efficient exploration and coordinate-wise ZO for accurate late-stage refinement, (2) one-shot quasi-random multi-start to focus evaluations, and (3) a sliding-window monitor that triggers early stops and box-projected updates to maintain feasibility. Evaluated on three distinct schematics, ZOAF consistently outperforms state-of-the-art baselines, achieving the best median final value on every reported figure of merit -- with up to an order-of-magnitude advantage in median peaking on the 22-parameter two-stage amplifier -- together with the most robust worst-case behavior across seeds, while reducing simulator calls to convergence by $1.3$--$3.8\times$. Code is publicly available at https://github.com/LiyanTan111/ZOAF.

URL PDF HTML ☆

赞 0 踩 0

2606.02839 2026-06-03 cs.CR cs.CY cs.HC

Human Factors in Cybersecurity in Icelandic Small and Medium-sized Enterprises

冰岛中小企业中网络安全的人为因素

Goda Cicėnaitė, Thomas Welsh, Helmut Neukirchen

AI总结通过调查冰岛130个公共和私营部门组织，从管理视角识别出人为因素（如培训不足、招聘问题、安全文化薄弱和资源限制）是网络安全的主要挑战，并提出针对性缓解建议。

Comments To be published in 17th EAI International Conference on Digital Forensics & Cyber Crime, 8 - 10 September 2026, Reykjavík, Iceland

详情

AI中文摘要

由于数字系统融入现代生活以及不稳定的地缘政治格局，网络威胁在社会各个方面日益增加。技术因素是一场持续的军备竞赛；然而，来自人为和社会因素的威胁面仍然存在，常常为恶意行为者提供绕过复杂技术安全控制的手段。理解技术演进背景下的人为因素对于确保安全控制的有效性至关重要。本研究呈现了对冰岛公共和私营部门组织（包括关键基础设施提供商）网络安全挑战的调查结果（N=130）。从管理角度来看，人为因素被强烈指出是其组织安全的挑战和障碍。这些挑战包括缺乏充分的培训或意识、招聘问题、不良的网络安全文化以及时间和/或财务资源限制。基于这些发现，得出了缓解人为因素威胁的建议，包括：优先进行针对性培训而非通用培训以减少员工疲劳，为财务受限的组织提供外部政府支持，以及通过围绕共同责任的建设性沟通建立强大的网络安全文化。

英文摘要

Cybersecurity threats are increasing in all aspects of society due to the integration of digital systems into modern-day life and a volatile geo-political landscape. Technical factors are an ongoing arms race; however, the threat surface from human and social factors is still present, often providing malicious actors the means to bypass complex technical security controls. Understanding human factors in light of technical evolution is essential to ensure security controls remain effective. This study presents the results of a survey on cybersecurity challenges within public and private sector organisations, including critical infrastructure providers, in Iceland (N = 130). From the management perspective, human factors were strongly noted as challenges and barriers to their organisations' security. These challenges include a lack of adequate training or awareness, hiring issues, poor cybersecurity culture, and time and/or financial resource constraints. Based on these findings, recommendations for mitigating threats from human factors are derived. These include: prioritising targeted over generic training to reduce employee fatigue, external government support for financially constrained organisations, and building a strong cybersecurity culture through constructive communication around shared responsibilities.

URL PDF HTML ☆

赞 0 踩 0

2606.02836 2026-06-03 cs.AR

Fast Transformer Inference on ARM-Based HMPSoCs

基于ARM的HMPSoC上的快速Transformer推理

Hang Xu, Yixian Shen, Thanassis Giannetsos, Anuj Pathania

AI总结针对ARM边缘设备缺乏Transformer推理支持的问题，在ARM计算库中实现新内核，并提出CPU-GPU协同推理方法，实现最高3倍加速和额外15.72%延迟降低。

Comments Accepted at ISVLSI 2026

详情

AI中文摘要

Transformer模型为机器学习任务设定了新的性能标准。然而，在资源受限的边缘设备上部署它们以实现无云、片上Transformer推理仍然具有挑战性。ARM计算库（ARM-CL）框架在基于ARM的边缘设备上提供低延迟CNN推理，但缺乏对Transformer推理的支持。在这项工作中，我们在ARM-CL中实现了几个新的Transformer内核以支持原生Transformer执行。与基于ARM的嵌入式板上的最先进CPU/GPU实现相比，我们扩展的ARM-CL实现了高达三倍的Transformer推理加速。此外，为边缘设备供电的异构多处理器系统芯片（HMPSoC）同时提供嵌入式CPU和GPU。我们引入了协作式CPU-GPU Transformer推理，它在CPU上执行内存密集型操作，同时利用GPU进行高度可并行化、计算密集型操作。这种协作执行以最小开销实现，与ARM-CL上最佳单处理器推理相比，进一步将Transformer推理延迟降低高达15.72%。

英文摘要

Transformer models have set new performance standards for machine learning (ML) tasks. However, their resource-intensive deployment on resource-constrained edge devices for cloud-free, on-chip transformer inference remains challenging. The ARM Compute Library (ARM-CL) framework provides low-latency CNN inference on ARM-based edge devices but lacks support for transformer inference. In this work, we implement several new transformer kernels in ARM-CL to support native transformer execution. Our extended ARM-CL achieves up to three times faster transformer inference compared to state-of-the-art CPU/GPU implementations on an ARM-based embedded board. Furthermore, heterogeneous multi-processor system-on-chips (HMPSoCs) powering edge devices provide both embedded CPUs and GPUs. We introduce cooperative CPU-GPU transformer inference, which executes memory-intensive operations on the CPU while utilizing the GPU for highly parallelizable, compute-intensive operations. This cooperative execution, implemented with minimal overhead, further reduces transformer inference latency by up to 15.72% compared to the best single-processor inference on ARM-CL.

URL PDF HTML ☆

赞 0 踩 0

2606.02820 2026-06-03 eess.SY cs.SY

Fairness as an Investment: Dynamic Participation and Long-Run Profit in Virtual Power Plants

公平性作为投资：虚拟电厂中的动态参与与长期利润

Liudong Chen, Bolun Xu

AI总结本文提出在虚拟电厂运营中纳入公平性约束可激励消费者参与，从而提升聚合商的长期盈利能力，并通过动态聚合框架和松弛增强分配机制验证了公平性对参与度和利润的影响。

详情

AI中文摘要

我们表明，将公平性约束纳入虚拟电厂（VPP）运营可以激励消费者参与，从而提高聚合商的长期盈利能力。VPP依赖异质消费者的持续参与来提供多种电网服务，这些服务的时间和频率通常不确定。因此，消费者提供灵活性的意愿和能力随时间演变，形成了过去参与与未来资源可用性之间的动态联系。我们开发了一个动态聚合框架，研究服务分配中的公平性如何影响未来参与和长期盈利能力。通过将当前调度决策与未来资源可用性联系起来，我们表明更公平的分配可以增强消费者参与度，扩大总可用性，并在高价格和高需求事件中创造额外价值。为了平衡公平性和运营效率，我们引入了一种松弛增强的分配机制，该机制保留了公平性带来的大部分参与收益，同时避免了服务采购的不必要减少。我们推导了由此产生的可用性增益超过重新分配短期成本的条件，并使用挪威的真实消费者行为和电力市场数据验证了该方法。

英文摘要

We show that incorporating fairness constraints into virtual power plant (VPP) operations can incentivize consumer participation and thus improve the aggregator's long-run profitability. VPPs rely on sustained participation from heterogeneous consumers to provide a variety of grid services whose timing and frequency are often uncertain. As a result, consumers' willingness and ability to provide flexibility evolve over time, creating a dynamic link between past participation and future resource availability. We develop a dynamic aggregation framework to study how fairness in service allocation affects future participation and long-run profitability. By linking current dispatch decisions to future resource availability, we show that fairer allocations can strengthen consumer engagement, expand aggregate availability, and create additional value during high-price and high-demand events. To balance fairness and operational efficiency, we introduce a slack-augmented allocation mechanism that preserves most of the participation benefits from fairness while avoiding unnecessary reductions in service procurement. We derive conditions under which the resulting availability gains outweigh the short-run cost of redistribution and validate the approach using real-world consumer behavior and electricity market data from Norway.

URL PDF HTML ☆

赞 0 踩 0

2606.02813 2026-06-03 cs.GT cs.MA cs.SI physics.soc-ph

Democracy on Rugged Landscapes: Phase Transitions in Optimal Voting Rules

崎岖景观上的民主：最优投票规则的相变

Joshua Nunley

AI总结通过NK适应度景观模型研究集体治理，发现直接民主下最优投票方法随景观复杂度发生尖锐相变，并引入代议制民主模型分析代表性对复杂度依赖结构的影响。

Comments 8 pages, 3 figures. Submitted to ALIFE 2026

详情

AI中文摘要

法律和制度通过与公民多样化环境的复杂互动塑造个人结果，然而不同投票方法如何驾驭这种耦合景观仍知之甚少。我们将集体治理建模为NK适应度景观上的优化，其中共享位（法律）通过投票更新，而个体位（个人特征）保持不变。交叉依赖参数$\alpha$控制立法效果对个人情况的依赖程度。我们比较了八种标准投票方法和一个广义评分族，景观崎岖度$K \in \{1,\ldots,20\}$和$\alpha \in [0,1]$，每种配置运行1000次。在直接民主下，最优投票方法随景观复杂度发生尖锐相变：基数评分投票在平滑景观上占优，序数评分（$p=0.35$）在低到中等崎岖度下占优，波达计数在广泛中间范围内占优，STAR投票在最高复杂度下占优。一个双参数经验公式将$(K, \alpha)$平面简化为单一复杂度轴以便可视化。波达计数在参数空间大部分区域实现最高平均适应度和最低方差。我们进一步引入由身份权重$\beta$和候选人自利性$p_{\mathrm{self}}$参数化的代议制民主模型。即使在有利条件下，代表性重塑了复杂度依赖结构：基数评分投票在大多数制度下占优，而简单多数制在$\beta$高且$p_{\mathrm{self}}$低到中等时成为最优方法。

英文摘要

Laws and institutions shape individual outcomes through complex interactions with citizens' diverse circumstances, yet how different voting methods navigate this coupled landscape remains poorly understood. We model collective governance as optimization on NK fitness landscapes, where shared bits (laws) are updated by voting while individual bits (personal traits) remain fixed. A cross-dependency parameter $α$ controls how legislation's effects depend on individual circumstances. We compare eight standard voting methods and a generalized scoring family across landscape ruggedness $K \in \{1,\ldots,20\}$ and $α\in [0,1]$ with 1000 runs per configuration. Under direct democracy, the optimal voting method undergoes sharp phase transitions as a function of landscape complexity: cardinal score voting dominates on smooth landscapes, ordinal scoring with $p=0.35$ at low-to-moderate ruggedness, Borda count across a wide middle range, and STAR voting at the highest complexity. A two-parameter empirical formula reduces the $(K, α)$ plane to a single complexity axis for visualization. Borda count achieves the highest mean fitness and lowest variance across most of the parameter space. We further introduce a representative democracy model parameterized by identity weight $β$ and candidate self-interest $p_{\mathrm{self}}$. Representation reshapes the complexity-dependent structure even under favorable conditions: cardinal score voting dominates across most regimes, with plurality emerging as the top method at high $β$ and low-to-moderate $p_{\mathrm{self}}$.

URL PDF HTML ☆

赞 0 踩 0

2606.02804 2026-06-03 cs.SE

Report on the Designing Accountable Software Systems Workshop

设计可问责软件系统研讨会报告

Catherine Albiston, Travis Breaux, Kat Dearstyne, Jane Cleland-Huang, Serge Egelman, Joan Feigenbaum, Lu Feng, Max Lindquist, Stephen Miner, Ruzica Piskac, Sarah Santos, Jordan Schmerge, Anmol Singhal, Maria Smith, Daniel Weitzner, Christopher Yoo

AI总结本报告总结了2024年11月举办的“设计可问责软件系统研讨会”的成果，该研讨会由美国国家科学基金会资助，汇聚了政府、学术界和工业界的利益相关者，通过小组讨论、特邀演讲和分组会议，探讨了软件系统问责制的维度、概念模型、法律要求来源、法律要求在软件中的操作化、证据保存要求以及影响问责结构成败的挑战和背景因素，并提出了未来研究方向。

详情

AI中文摘要

设计可问责软件系统研讨会（DASS）于2024年11月召开，得到了美国国家科学基金会的支持，旨在吸引来自政府、学术界和工业界的广泛当前和未来利益相关者，共同探讨软件系统问责制这一跨学科主题。在为期两天的会议中，与会者参与了一系列小组讨论、特邀演讲和分组会议，内容涵盖：（1）问责制的维度，包括法律合规以及商业和社会方面的因素与驱动因素；（2）实现问责制所需的各种结构的概念模型；（3）影响软件的法律要求的来源；（4）法律要求在软件中的操作化；（5）保存进行调查所需证据的要求；以及（6）软件之外影响某些问责结构成功而其他结构失败的一系列挑战和背景因素。该研讨会以协作知识系统化的方式进行，最终形成了若干研究方向。研究结果包括明确可问责组织内定义和职责的重要性，这可能会影响研究问责制的人员是否做出限制研究结果普遍性的假设。此外，还确定需要进一步研究如何改进将问责结构转化为软件设计过程的方法，同时加强与立法者、监管者、企业高管和系统开发者等利益相关者的互动。最后，一个关键发现是，像DASS这样的研究项目对跨学科团队提出了高要求：既包括团队组建和维持方面，也包括涵盖研究方法、研究传播和职业发展的跨学科学习的特定需求。

英文摘要

The Workshop on Designing Accountable Software Systems (DASS) was convened in November 2024 with support from the U.S. National Science Foundation to engage a wide range of current and future stakeholders from government, academia, and industry on the cross-disciplinary topic of accountability in software systems. Over two days, attendees engaged in a series of panels, invited talks, and breakout sessions covering: (1) the dimensions of accountability, including legal compliance as well as business and societal aspects and drivers; (2) a conceptual model of the various structures needed to realize accountability; (3) the sources of legal requirements that affect software; (4) the operationalization of legal requirements in software; (5) the requirements to preserve evidence needed to conduct investigations; and (6) a range of challenges and contextual factors beyond software that affect why some accountability structures succeed, while others fail. The workshop was conducted as a collaborative systematization of knowledge that culminated in several research directions. The findings include the importance of clarifying definitions and responsibilities within accountable organizations, which can affect whether those researching accountability are making assumptions that limit the generalizability of findings. Further research was also identified as needed to study the ways to improve the translation of accountability structures into the software design process while improving engagement with stakeholders, such as legislators, regulators, business executives and system developers. Finally, a key finding was the high demands that DASS-like research projects place on interdisciplinary teams: both in terms of team formation and sustainment, as well as, the specific demands of cross-disciplinary learning that covers both research methods, research dissemination, and career development.

URL PDF HTML ☆

赞 0 踩 0

2606.02784 2026-06-03 cs.DB

LAANN: I/O-Aware Look-Ahead Search for Disk-Based Approximate Nearest Neighbor Search

LAANN: 面向磁盘近似最近邻搜索的I/O感知前瞻搜索

Dingyi Kang, Juncheng Yang, Bingzhe Li

AI总结提出LAANN系统，通过协同优化CPU计算和I/O访问，采用前瞻搜索、优先级I/O-CPU流水线和轻量级内存图索引，显著提升磁盘ANNS的吞吐量和延迟。

Comments 13 pages, 14 figures

详情

AI中文摘要

近似最近邻搜索（ANNS）是大规模检索、推荐和AI系统中的基本原语。随着向量数据集增长到数十亿甚至数万亿项，基于磁盘的ANNS系统通过将向量数据和索引结构存储在存储系统上来处理这种规模，但其查询性能仍然受I/O延迟主导。现有的基于磁盘的ANNS系统主要优化I/O效率或将I/O与计算重叠，但它们将CPU计算和I/O访问视为基本独立的组件。这种分离错过了一个关键机会：在做出I/O决策之前，有选择地处理已缓存在内存中的候选者可以减少不必要的磁盘访问并提高搜索质量。然而，利用这一机会具有挑战性，因为过多的计算可能延迟关键的I/O操作，而选择不当的计算则收益甚微，可能增加整体查询延迟。在本文中，我们提出了LAANN，一种基于磁盘的ANNS系统，通过协同优化CPU计算和I/O访问，使图搜索显式地感知I/O。LAANN结合了三种技术：前瞻搜索，它跨查询阶段调整搜索策略以平衡I/O减少和及时的I/O发出；优先级I/O-CPU流水线，它利用I/O等待时间根据候选者对即将到来的I/O决策的预期影响来处理缓存在内存中的候选者；以及快速轻量级的内存图索引，它提供高质量的初始候选者以加速收敛并减少磁盘访问。在百万和十亿规模数据集上的实验表明，LAANN显著优于最先进的基于磁盘的ANNS系统。在Recall@10 = 0.9时，LAANN实现了1.41倍至4.66倍的吞吐量提升，29%至79%的延迟降低，以及1.59倍至6.34倍的I/O操作减少。

英文摘要

Approximate nearest neighbor search (ANNS) is a fundamental primitive in large-scale retrieval, recommendation, and AI systems. As vector datasets grow to billions or even trillions of items, disk-based ANNS systems have emerged to handle this scale by storing vector data and index structures on storage systems, but their query performance remains dominated by I/O latency. Existing disk-based ANNS systems primarily optimize I/O efficiency or overlap I/O with computation, but they treat CPU computation and I/O access as largely separate components. This separation misses a critical opportunity: selectively processing candidates already cached in memory before making I/O decisions can reduce unnecessary disk accesses and improve search quality. However, exploiting this opportunity is challenging because excessive computation can delay critical I/O operations, while poorly chosen computation provides little benefit, potentially increasing overall query latency. In this paper, we present LAANN, a disk-based ANNS system that makes graph search explicitly I/O-aware by co-optimizing CPU computation and I/O access. LAANN combines three techniques: look-ahead search, which adapts the search strategy across query stages to balance I/O reduction and timely I/O issuance; a priority I/O-CPU pipeline, which uses I/O waiting time to process candidates cached in memory according to their expected impact on upcoming I/O decisions; and a fast lightweight in-memory graph index, which provides high-quality initial candidates to accelerate convergence and reduce disk accesses. Experiments on million- and billion-scale datasets demonstrate that LAANN substantially outperforms state-of-the-art disk-based ANNS systems. At Recall@10 = 0.9, LAANN achieves 1.41x-4.66x higher throughput, 29%-79% lower latency, and 1.59x-6.34x fewer I/O operations.

URL PDF HTML ☆

赞 0 踩 0

2606.02763 2026-06-03 cs.HC

InquiryBits: Sharing AI Conversation Traces to Support Collaboration Within Trust Boundaries

InquiryBits: 在信任边界内共享AI对话轨迹以支持协作

Caitlin Morris, Pattie Maes

AI总结提出InquiryBits系统，通过共享AI对话的最小化摘要，在可配置的信任边界内支持团队协作，研究发现信任边界比信息粒度更影响分享意愿。

Comments 7 pages, 3 figures

2606.02757 2026-06-03 eess.SY cs.SY

Corridor Design and Separation Definition in Advanced Air Mobility: Systematic Literature Review

先进空中交通中的走廊设计与间隔定义：系统文献综述

Evgenii Vinogradov, Debashisha Mishra, Mariam Ali Askar Alobeidli, Jamal Khaled Al Ali, Ahmed Saleh Alshehhi, Jennifer Simonjan, Enrico Natalizio

AI总结本文通过系统文献综述，识别了先进空中交通中走廊设计、运营管理和间隔标准的不足，并提出了统一的框架和分类法以支持安全高效的eVTOL运营。

详情

DOI: 10.1109/OJITS.2026.3686509
Journal ref: IEEE Open Journal of Intelligent Transportation Systems, vol. 7, pp. 1151-1180, 2026

AI中文摘要

先进空中交通（AAM）利用电动垂直起降（eVTOL）飞行器来解决城市拥堵和排放问题。然而，走廊设计、运营管理和间隔标准在安全高密度运营方面仍研究不足。本文应用系统综述和荟萃分析首选报告项目（PRISMA）指南，系统回顾了IEEE Xplore和Web of Science中2010年至2024年的相关文献。采用背景、干预、机制和结果（CIMO）框架指导研究问题的制定。经过筛选2,039篇期刊和会议论文，最终有62篇文章符合纳入标准。研究结果揭示了缺乏综合的走廊设计方法、运营策略有限以及对传统航空标准的依赖。为弥补这些不足，提出了统一的走廊设计和间隔定义框架及分类法，为未来研究和运营框架提供参考，以支持城市环境中安全高效的eVTOL运营部署。

英文摘要

Advanced Air Mobility (AAM) uses electric vertical take-off and landing (eVTOL) vehicles to address urban congestion and emissions. However, corridor design, operation management, and separation standards remain underexamined for safe high-density operations. This paper applies the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to systematically review relevant literature from IEEE Xplore and Web of Science, focusing on publications from 2010 to 2024. A Context, Intervention, Mechanism, and Outcome (CIMO) framework guided the development of research questions. After screening 2,039 journal and conference papers, 62 articles met the inclusion criteria. The findings reveal a lack of integrated corridor design approaches, limited operational strategies, and reliance on standards originally designed for conventional aviation. A unified corridor design and separation definition frameworks and taxonomies are proposed to address these shortcomings, informing future investigations and operational frameworks for safe, efficient eVTOL operation deployment in urban settings.

URL PDF HTML ☆

赞 0 踩 0

2606.02752 2026-06-03 cs.DS

Online K-d tree for approximate neighborhood search in data streams

数据流中近似邻域搜索的在线K-d树

Eduardo V. L. Barboza, Robert Sabourin, Rafael M. O. Cruz

AI总结提出一种在线K-d树算法，支持数据流上的动态更新，并适配Canberra距离以保持结构不变性，实现高效近似邻域搜索，在准确率损失较小的情况下显著提升处理速度。

Comments Paper accepted to the ICPRAI 2026

详情

AI中文摘要

k-最近邻（kNN）算法长期以来广泛应用于机器学习（ML）应用中。然而，使用该算法时的主要问题是邻域搜索所需的计算成本，这可能使其在大规模应用中不可行。优化算法，如K-d树，在此类场景中成为一种选择。在数据流下，维护K-d树的属性可能具有挑战性，因为它需要动态插入和删除节点。这些操作可能使维护树的平衡和不变量变得困难。此外，传统的K-d树最初是为基于闵可夫斯基的距离函数设计的。在这项工作中，我们描述了一种在线K-d树及其对Canberra距离的适配，该树支持数据流上的动态更新，同时保留高效遍历所需的结构不变量。实验分析表明，在线K-d树算法在数据流下实现了更快的处理时间，并且适配Canberra距离实现了有效的子树剪枝，这通过平均准确率的微小损失和每秒处理实例数的显著提升得以证明。我们的实现可在我们的GitHub仓库中找到。

英文摘要

The k-Nearest Neighbors (kNN) algorithm has long been widely used in Machine Learning (ML) applications. However, the main concern when using it is the computational cost required for neighborhood search, which can make it unfeasible for large-scale applications. Optimization algorithms, such as the K-d tree, become an option in such scenarios. Under data streams, it can be challenging to maintain the properties of the K-d tree, as it requires inserting and deleting nodes on the fly. These operations can make maintaining the tree's balance and invariants difficult. Additionally, traditional K-d trees were initially designed for Minkowski-based distance functions. In this work, we describe an Online K-d tree and its adaptation to the Canberra distance that supports dynamic updates over data streams while preserving the structural invariants required for efficient traversal. Experimental analysis demonstrates that the Online K-d tree algorithm achieves faster processing time under data streams, and that adapting to the Canberra distance enabled effective subtree pruning, as evidenced by a minor loss in average accuracy and a substantial gain in instances processed per second. Our implementation can be found in our GitHub repository

URL PDF HTML ☆

赞 0 踩 0

2606.02674 2026-06-03 cs.CR

Cross-Vendor Sola ISPM Benchmark: Evaluating Agentic AI for Federated Identity Security Reasoning

跨厂商Sola ISPM基准测试：评估面向联邦身份安全推理的智能体AI

Eden Yavin, Gal Engelberg, Konstantin Koutsyi, Leon Goldberg, Gal Baron

AI总结针对多云和SaaS平台中跨厂商身份安全配置错误与权限提升路径的评估缺失，提出包含50个数据驱动任务的跨厂商Sola ISPM基准测试，并构建评估框架，实验表明结构化关系上下文将答案正确性相对提升约34%，探索查询减少约70%。

Comments 22 pages, 4 figures, 8 tables, 2 appendices

详情

AI中文摘要

多云和SaaS平台的快速普及已将身份安全态势管理（ISPM）转变为根本性的跨厂商挑战：关键配置错误和权限提升路径日益跨越多个身份提供商、基础设施层和认证系统，而这些系统从未设计为互操作。现有评估侧重于孤立的单平台环境，无法评估AI智能体是否能够跨这些碎片化边界进行推理。为填补这一空白，我们引入了跨厂商Sola ISPM基准测试，这是一个包含50个数据驱动任务的生产级基准测试，需要跨八个集成企业平台（包括AWS、Okta、Azure AD和Google Workspace）进行多跳实体解析和跨系统关联。我们还贡献了一个评估框架，不仅衡量最终答案的正确性，还衡量证据基础、结构连接保真度、检索质量和SQL等价性。我们使用三种前沿LLM，在五种上下文配置（从无注入元数据到完整模式、图和检索上下文）下评估Sola AI智能体。结果表明，结构化关系上下文将答案正确性相对提升约34%，并将所有测试模型的探索查询减少约70%，其中最大的收益来自跨厂商图拓扑。我们的发现表明，前沿LLM具有显著的潜在安全推理能力，但可靠的跨厂商身份分析从根本上受限于实体解析和证据基础的显式关系上下文的可用性。在完整上下文下，最佳配置实现了78%的答案正确性，同时将完全失败率降至4%。

英文摘要

The rapid proliferation of multi-cloud and SaaS platforms has transformed Identity Security Posture Management (ISPM) into a fundamentally cross-vendor challenge: critical misconfigurations and privilege escalation paths increasingly span multiple identity providers, infrastructure layers, and authentication systems never designed to interoperate. Existing evaluations focus on isolated single-platform environments and provide no means to assess whether an AI agent can reason across these fragmented boundaries. To address this gap, we introduce the Cross-Vendor Sola ISPM Benchmark, a production-grade benchmark of 50 data-grounded tasks requiring multi-hop entity resolution and cross-system correlation across eight integrated enterprise platforms including AWS, Okta, Azure AD, and Google Workspace. We also contribute an evaluation framework measuring not only final answer correctness but also evidentiary grounding, structural join fidelity, retrieval quality, and SQL equivalence. We evaluate the Sola AI Agent across five context configurations - from no injected metadata to full schema, graph, and retrieval context - using three frontier LLMs. Results show that structured relational context improves answer correctness by approximately 34% relatively and reduces exploration queries by approximately 70% across all tested models, with the largest gains driven by cross-vendor graph topology. Our findings indicate that frontier LLMs possess substantial latent security reasoning capability, but reliable cross-vendor identity analysis is fundamentally constrained by the availability of explicit relational context for entity resolution and evidentiary grounding. Under full context, the best configuration achieves 78% answer correctness while reducing complete failure to 4%.

URL PDF HTML ☆

赞 0 踩 0

2606.02668 2026-06-03 cs.CR cs.HC

What You Approve Is What Executes: Consent Integrity for Black-Box LLM Agents

你批准即执行：黑盒LLM代理的同意完整性

Xiaoqi Weng

AI总结针对黑盒LLM代理的批准对话框存在摘要伪造漏洞，本文引入同意完整性概念，通过可信中介确保人类看到的动作与真实执行的动作一致，并分析了该机制的局限性与权衡。

Comments Preprint. IEEE conference format. Proof-of-concept; artifact at https://github.com/zjnbwxq/agentguard-ci

详情

AI中文摘要

编码代理将重要操作置于人类参与的批准对话框之后，但对话框由代理自身叙述：人类批准代理编写的摘要。Lies-in-the-Loop (LITL) 攻击表明摘要是可伪造的，因此被攻破的代理可以显示良性的描述而执行不同的操作。本文通过将所见即所签（WYSIWYS）和可信路径属性引入代理批准通道，命名了缺失的属性——同意完整性：向人类展示的操作必须由边界处的可信中介根据真实操作渲染，而非代理的叙述，且路径不可被代理伪造，并与实际执行的操作绑定。与经典WYSIWYS相比有两个不同点：渲染者是敌手，边界真实情况是低级事件，必须在不信任代理的情况下解码。由于没有解码器是完备的，可实现的目标是相对于分析器的：分析器无法分类的内容将被标记为不可检查而非静默批准。原型实现了分析器、渲染器和执行绑定；完全中介和可信路径被指定但假定存在，未实现。在GTFOBins（一个包含1330个可信工具滥用的独立语料库）上，原型静默通过了10.0%（每个实例都通过可信工具）；在tldr（28798个正常使用命令）上，标记了87.0%为不可检查。这两个独立测量界定了设计的核心张力：限制静默通过的可信列表也是导致过度提示的原因，而仅边界中介可以沿该前沿移动但无法逃脱。贡献在于属性、机制以及在该前沿上的诚实定位，而非已解决的防御。

英文摘要

Coding agents gate consequential actions behind a human-in-the-loop approval dialog, but the dialog is narrated by the agent itself: the human approves a summary the agent writes. The Lies-in-the-Loop (LITL) attack shows that summary is forgeable, so a compromised agent can show a benign description while a different action runs. This paper names the missing property, Consent Integrity, by importing What You See Is What You Sign (WYSIWYS) and the trusted-path property into the agent approval channel: the action shown to the human must be rendered by a trusted mediator from the real action at the boundary, not the agent's narration, over a path the agent cannot spoof, and bound to the exact action that executes. Two twists distinguish it from classical WYSIWYS: the renderer is the adversary, and the boundary ground truth is a low-level event that must be decoded without trusting the agent. Since no decoder is complete, the realizable target is analyzer-relative: whatever the analyzer cannot classify is surfaced as uninspectable rather than silently approved. A prototype implements the analyzer, renderer, and bind-to-execution; total mediation and the trusted path are specified but assumed, not implemented. On GTFOBins, an independent corpus of 1330 trusted-tool abuses, the prototype silently passes 10.0% (every instance through a trusted tool); on tldr, 28,798 normal-usage commands, it marks 87.0% uninspectable. These two independent measurements bracket the design's central tension: the trust list that bounds silent passes is the same one that drives over-prompting, and a boundary-only mediator can move along that frontier but not escape it. The contribution is the property, the mechanism, and an honest position on that frontier, not a solved defense.

URL PDF HTML ☆

赞 0 踩 0

2606.02651 2026-06-03 cs.PL cs.LO cs.SE

From Rocq to Metal: A Pipeline for Formally Verified Microcontroller Firmware

从Rocq到Metal：形式化验证微控制器固件的流水线

Valentin Bergeron, Karolina Gorna

AI总结本文提出Encore!虚拟机，通过延续传递风格运行Rocq提取的Scheme代码，实现微控制器固件的形式化验证，并利用大语言模型辅助策略合成自动证明固件正确性。

2606.02627 2026-06-03 cs.CE cs.DC cs.GR physics.flu-dyn

Streami: An MPI Data-Parallel Library to Compute Field Lines on GPUs

Streami: 一个用于在GPU上计算场线的MPI数据并行库

Stefan Zellmann, Milan Jaros, Andrea Paris, Ingo Wald, Tatiana von Landesberger

AI总结提出Streami，一个可扩展的GPU加速库，用于在高性能计算机上计算流体流场中的场线，支持后处理或原位分析，并与现有MPI应用交互。

2606.02586 2026-06-03 cs.GR

Fewer, Better Frames: A Compute-Normalized Proof of Concept for Coherence-First World-Model Rendering with Model-Guided FSR4 Frame Generation

更少但更好的帧：一种计算归一化的概念验证，采用一致性优先的世界模型渲染与模型引导的FSR4帧生成

Paweł Katarzyński

AI总结本文提出一种一致性优先的渲染策略，通过生成高上下文锚定帧（15 FPS）并利用潜在增量运动引导和合成深度重建至30 FPS，在有限推理预算下比原生30 FPS基线更长时间保持场景几何、物体身份和深度层次，并通过LPIPS等指标验证了其有效性。

Comments 19 pages, 8 figures, independent systems proof of concept

详情

AI中文摘要

世界模型通常通过原生帧率来评估，但更高的标称帧率可能会牺牲长程场景稳定性。本文报告了一个独立的概念验证，使用Overworld的Waypoint-1.5系列和WorldEngine运行时，在Windows回退栈上通过ONNX Runtime + DirectML和FSR4 DX12桥接实现。测试的一致性优先分支以15 FPS的呈现时间线节奏生成高上下文锚定帧，并通过潜在增量运动引导和合成深度重建呈现至30 FPS。将其与在相同种子、路线、控制脚本、目标呈现时长和局部时间缩放机制下原生生成约30 FPS的低上下文节奏优先基线进行比较。在森林、剑、沙漠和雪地场景中，一致性优先分支更长时间地保持了路径几何、物体身份、大轮廓和深度分层，而基线则更早出现亮度漂移和几何畸变。轻量级时间指标和配对视频支持了视觉比较，LPIPS在所有测试场景中均偏向一致性优先分支。这里的计算归一化意味着大致匹配相同GPU、相同时间尺度的运行点，而非精确的FLOP对等或实测实时吞吐量。一个单独的更重的剑场景探测表明局部非单调性：更多的上下文和去噪并未自动提升质量。这些结果支持一致性优先分配作为有限推理预算下的实用概念验证策略，而非一个完整的实时渲染器。

英文摘要

World models are often evaluated by native frame cadence, but higher nominal frame rate can trade away long-horizon scene stability. This article reports an independent proof of concept implemented using Overworld's Waypoint-1.5 family and WorldEngine runtime on a Windows fallback stack with ONNX Runtime + DirectML and an FSR4 DX12 bridge. The tested coherence-first branch generates higher-context anchor frames at a 15 FPS presentation-timeline cadence and reconstructs presentation to 30 FPS using latent-delta motion guidance and synthesized depth. It is compared against a lower-context cadence-first baseline that generates about 30 FPS natively under the same seed, route, control script, target presentation duration, and local time-scaling regime. Across forest, sword, desert, and snow scenes, the coherence-first branch preserves path geometry, object identity, large silhouettes, and depth layering longer, while the baseline degrades earlier into brightness drift and geometric distortion. Lightweight temporal metrics and paired videos support the visual comparison, with LPIPS favoring the coherence-first branch across all tested scenes. Here compute-normalized means approximately matched same-GPU, same-timescale operating points, not exact FLOP parity or measured realtime throughput. A separate heavier sword-scene probe suggests local non-monotonicity: more context and denoising did not automatically improve quality. These results support coherence-first allocation as a practical proof-of-concept strategy under limited inference budget, not as a finished realtime renderer.

URL PDF HTML ☆

赞 0 踩 0

2606.02585 2026-06-03 cs.CE

An improved PINN framework integrating localized collocation scheme and PIKF

一种融合局部配点方案与物理信息核函数的改进PINN框架

Qiang Xi, Wenzhi Xu, Mario Cvetkovic, Dragan Poljak, Timon Rabczuk, Zhuojia Fu

AI总结提出LPIKFNN框架，通过局部配点方案和物理信息核函数构建损失函数，避免自动微分，提高高阶导数和高波数问题的计算效率与精度。

详情

AI中文摘要

我们提出了一种局部物理信息核函数神经网络（LPIKFNN），这是一种基于物理信息核函数（PIKF）的改进物理信息神经网络（PINN）。在LPIKFNN框架中，局部配点方案将物理量离散化在局部域内，其中物理场表示为PIKF的线性组合。基于这种表示，训练多层感知机以迭代学习物理量。为了克服传统PINN在高阶导数和高波数问题中的计算挑战，LPIKFNN使用PIKF和局部配点方案构建损失函数，而不是依赖自动微分。因此，在迭代训练中执行控制方程所需的高成本导数评估被消除，从而显著提高了计算效率和训练性能。此外，将PIKF纳入损失函数使得所提出的LPIKFNN在具有高度振荡物理场的高波数问题中显著提高了计算精度。为了克服物理信息核函数神经网络（PIKFNN）在异质问题中的计算瓶颈，LPIKFNN引入了局部配点方案，消除了对全局PIKF的依赖，从而在全局PIKF不可用时实现准确预测。通过一系列基准研究，包括高波数问题、高阶导数问题、非线性问题、异质问题和基于电位的逆肌电图，证明了所提出的LPIKFNN的可行性和准确性。LPIKFNN获得的数值预测与可用的解析解和实验测量结果高度一致。

英文摘要

We propose a localized physics-informed kernel function neural network (LPIKFNN), which is an improved physics-informed neural network (PINN) based on physics-informed kernel function (PIKF). In the LPIKFNN framework, the localized collocation scheme discretizes the physical quantities within the local domain, where the physical field is represented as a linear combination of PIKFs. Based on this representation, the multilayer perceptron is trained to iteratively learn the physical quantities. To overcome the computational challenges of conventional PINN in higher-order derivative and high wavenumber problems, the LPIKFNN constructs the loss function using the PIKF and a localized collocation scheme rather than relying on automatic differentiation. As a result, the costly derivative evaluations required to enforce governing equations during iterative training are eliminated, leading to significantly improved computational efficiency and training performance. Moreover, incorporating PIKFs into the loss function enables the proposed LPIKFNN to significantly improve computational accuracy in high-wavenumber problems characterized by highly oscillatory physical fields. To overcome the computational bottleneck of the physics-informed kernel function neural network (PIKFNN) in heterogeneous problems, the LPIKFNN introduces a localized collocation scheme that removes reliance on global PIKFs, enabling accurate predictions where global PIKFs are unavailable. The feasibility and accuracy of the proposed LPIKFNN are demonstrated through a series of benchmark studies, including high wavenumber problems, higher-order derivative problems, nonlinear problems, heterogeneous problems, and potential-based inverse electromyography. The numerical predictions obtained by LPIKFNN show excellent agreement with available analytical solutions and experimental measurements.

URL PDF HTML ☆

赞 0 踩 0

2606.02583 2026-06-03 cs.CE

A complete simulation framework for stone degradation on 3D real geometries

一种用于3D真实几何体上石材退化的完整模拟框架

Silvia Preda, Gabriella Bretti, Francesco Freddi, Bruno Nazarena, Matteo Semplice

AI总结提出一种完整工作流，结合摄影测量获取几何、水平集表示和PDE数值离散，预测艺术品中大理石硫酸盐化等石材退化现象。

2606.03987 2026-06-03 math.CO math.MG

Kusner's conjecture: Exact values and linear bounds

Kusner猜想：精确值与线性界

Hong-Jun Ge, Zixiang Xu, Yang Zhou

AI总结本文证明了当2≤p≤4时Kusner猜想成立，并给出了更一般区间上的几乎线性上界，同时改进了环面T^n上等距集问题的上界。

Comments 47 pages

详情

AI中文摘要

1983年，Kusner猜想在$\mathbb{R}^{n}$中，对于$\ell_{p}$度量，最大的等距集在$1<p<\infty$时基数为$n+1$，在$p=1$时基数为$2n$。该猜想仅在孤立情况$p=2$和$p=4$下被证明，并在$1<p<2$时被证伪。最佳的一般上界$O_p(n^{ rac{2p+2}{2p-1}})$归功于Alon和Pudlák的著名工作[GAFA, 2003]。我们的主要贡献包括：(1) 对于每个维度$n\ge 1$，当$2\le p\le 4$时，我们证明了Kusner猜想。更一般地，对于每个整数$k\ge 0$和每个$p\in[4k+2,4k+4]$，在$\mathbb{R}^{n}$中具有$\ell_{p}$度量的每个等距集的基数至多为$(2k+1)n+1$。在互补区间$p\in(4k,4k+2)$且$p\geq 1$上，我们得到几乎线性界$O_p(n\log n)$。(2) 我们还考虑了环面$\mathbb{T}^n$上的类似问题，该问题最近由Alon提出，其中循环距离使得问题比$\mathbb R^n$中复杂得多。我们证明了对于$1\le p\le 2$的几乎线性界$O_p(n\log n)$，以及对于每个固定实数$p>2$的$O_p(n^{ rac{3}{2}- rac{1}{p}})$，改进了Alon对所有有限$p\ge 1$的界$O_p(n^{2+ rac{2}{\lfloor p floor}})$。

英文摘要

In 1983, Kusner conjectured that the largest equilateral set in $\mathbb{R}^{n}$ with metric $\ell_{p}$ has cardinality $n+1$ when $1<p<\infty$ and $2n$ when $p=1.$ This conjecture was proved only in the isolated cases $p=2$ and $p=4$, and was disproved when $1<p<2$. The best general upper bound $O_p(n^{\frac{2p+2}{2p-1}})$ is due to the celebrated work of Alon and Pudlák~[GAFA, 2003]. Our main contributions include: (1) We prove Kusner's conjecture for every dimension $n\ge 1$ when $2\le p\le 4$. More generally, for every integer $k\ge 0$ and every $p\in[4k+2,4k+4]$, every equilateral set in $\mathbb{R}^{n}$ with metric $\ell_p$ has cardinality at most $(2k+1)n+1$. On the complementary intervals $p\in(4k,4k+2)$ with $p\geq 1$, we obtain the almost linear bound $O_p(n\log n)$. (2) We also consider the analogous problem on the torus $\mathbb{T}^n$, recently initiated by Alon, where the cyclic distance makes the problem substantially more delicate than in $\mathbb R^n$. We prove the almost linear bound $O_p(n\log n)$ for $1\le p\le 2$ and $O_p(n^{\frac{3}{2}-\frac{1}{p}})$ for every fixed real $p>2$, improving Alon's bounds $O_p(n^{2+\frac{2}{\lfloor p\rfloor}})$ for all finite $p\ge 1$.

URL PDF HTML ☆

赞 0 踩 0

2606.03983 2026-06-03 math.DG math.CA

Cylindrical generalized Ricci solitons in three dimensions

三维柱对称广义Ricci孤立子

Miguel Pino Carmona

AI总结构造了一个具有SO(2)×R对称性的完整非紧三维光滑稳态梯度广义Ricci孤立子的显式双参数族，提供了Podestà和Raffero最近发现的球对称孤立子的柱对称对应。

Comments 20 pages. This work is part of the author's PhD thesis

2606.03970 2026-06-03 math.NT

Some new results on determinants and permanents

行列式和积和式的一些新结果

Bo Jiang, Zhi-Wei Sun

AI总结本文通过数论方法证明了关于行列式和积和式的多个猜想，包括模素数平方剩余和模n的整除性质。

Comments 8 pages

2606.03953 2026-06-03 math.OC

Introduction to stochastic gradient methods

随机梯度方法导论

Simon Weissmann

AI总结本文介绍一阶优化方法，重点研究随机梯度方法的收敛理论，包括确定性梯度方法和随机梯度下降的收敛性分析，以及方差缩减等高级主题。

2606.03941 2026-06-03 math.DS

Rate distortion dimension of Gibbs measures for functions depending on the first two coordinates on the full shift of Ahlfors regular spaces

Ahlfors正则空间全移位上依赖于前两个坐标的函数的Gibbs测度的率失真维数

Kanji Inui, Mao Shinoda

AI总结本文研究了Ahlfors正则空间全移位上依赖于前两个坐标的函数的Gibbs测度的率失真维数，并建立了基于率失真维数的变分原理，展示了经典设置中不存在的新现象。

详情

AI中文摘要

遍历理论中移位空间的研究已超越经典框架，但从遍历理论角度讨论Kolmogorov-Sinai熵的扩展仍有空间。另一方面，率失真维数最近在平均维数理论中引起关注，因为它在通常熵无穷大的“大”空间中的动力系统上表现得像Kolmogorov-Sinai熵。基于这些背景，我们研究了乘积空间上的Gibbs测度与基于率失真维数的变分原理之间的联系：我们在具体设置中具体计算了Gibbs测度的率失真维数，它满足基于率失真维数的热力学形式的最简单情况：拓扑熵最大测度的扩展。值得注意的是，结果显示了经典设置中不存在的新现象。我们还在更具体的设置下讨论了另一个变分原理。

英文摘要

The study on shift spaces in ergodic theory has been beyond the classical setting, but there is a room to discuss an extension of the Kolmogorov-Sinai entropy in the ergodic theoretical point of view. On the other hand, the rate distortion dimension recently attracted attention in mean dimension theory because it behaves like the Kolmogorov-Sinai entropy on dynamical systems in the ``large" spaces in which the usual entropies is in general infinite. According to these background, we investigate the connection between the Gibbs measure on the product spaces and the variational principle based on the rate distortion dimension: we concretely calculate the rate distortion dimension of the Gibbs measure on the concrete setting and it satisfies the simplest case of thermodynamical formalism based on the rate distortion dimension: the extension of the maximal measure of topological entropy. Remark that the result shows a new phenomenon which does not hold in the classical setting. We also discuss another variational principle under more concrete settings.

URL PDF HTML ☆

赞 0 踩 0