arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.18633 2026-06-18 cs.MA 新提交

PersonalPlan: Planning Multi-Agent Systems for Personalized Programming Learning

PersonalPlan: 面向个性化编程学习的多智能体系统规划

Zhiyuan Wen, Jiannong Cao, Peng Gao, Haochen Shi, Wengpan Kuan, Bo Yuan, Xiuxiu Qi

AI总结提出PersonalPlan，一种两阶段多智能体规划器，通过分层SFT和奖励自适应GRPO生成可执行、个性化且具有教学支架的计划，在MAP-PPL数据集上优于现有方法。

详情

AI中文摘要

有效的编程教育需要针对不同学习者背景进行个性化教学。然而，虽然基于LLM的多智能体系统（MAS）擅长复杂规划，但现有规划器通常缺乏轮廓基础（profile-grounding）和教学支架（pedagogical scaffolding），从而削弱了个性化编程学习。为填补这一空白，我们首先引入\textbf{MAP-PPL}（\textbf{M}ulti-\textbf{A}gent \textbf{P}lans for \textbf{P}ersonalized \textbf{P}rogramming \textbf{L}earning），这是一个基于轮廓的多智能体规划数据集，包含来自1,730个Stack Overflow问题组和2,738个学习者轮廓的3,043个查询-轮廓-计划实例。每个计划指定了智能体、子任务、可执行步骤和先决依赖关系。然后，我们提出\textbf{PersonalPlan}，一个两阶段MAS规划器，首先使用独立的LoRA适配器进行分层SFT，用于轮廓感知的任务分解和步骤依赖规划，然后应用奖励自适应GRPO，鼓励模型生成可执行、个性化且具有教学支架的计划。在MAP-PPL上进行的广泛实验，将PersonalPlan与前沿LLM、通用MAS框架和智能体规划器进行比较，证明了其优越性。仅使用8B和32B变体，PersonalPlan在计划可执行性、个性化和教学质量方面达到了最先进水平，有效协调了MAS进行智能体-学生交互。

英文摘要

Effective programming education requires personalized instruction adapted to diverse learner backgrounds. However, while LLM-based multi-agent systems (MAS) excel at complex planning, existing planners often lack profile-grounding and pedagogical scaffolding, thereby undermining personalized programming learning. To fill in the gap, we first introduce \textbf{MAP-PPL} (\textbf{M}ulti-\textbf{A}gent \textbf{P}lans for \textbf{P}ersonalized \textbf{P}rogramming \textbf{L}earning), a profile-conditioned multi-agent planning dataset with 3{,}043 query--profile--plan instances from 1{,}730 Stack Overflow question groups and 2{,}738 learner profiles. Each plan specifies agents, subtasks, executable steps, and prerequisite dependencies. Then, we propose \textbf{PersonalPlan}, a two-stage MAS planner that first performs hierarchical SFT with separate LoRA adapters for profile-aware task decomposition and step dependency planning, then applies a Reward-Adaptive GRPO to encourage the model to generate executable, personalized, and pedagogically scaffolded plans. Extensive experiments on MAP-PPL comparing PersonalPlan against frontier LLMs, generic MAS frameworks, and agentic planners demonstrate its superiority. With only 8B and 32B variants, PersonalPlan achieves state-of-the-art plan executability, personalization, and pedagogical quality, effectively orchestrating MAS for agent-student interactions.

URL PDF HTML ☆

赞 0 踩 0

2606.18600 2026-06-18 cs.DC 新提交

ShuntServe: Cost-Efficient LLM Serving on Heterogeneous Spot GPU Clusters

ShuntServe: 异构竞价型GPU集群上的成本高效LLM服务

Seungwoo Jeong, Moohyun Song, Juhyun Park, Kyungyong Lee

AI总结提出ShuntServe系统，通过屋顶线模型估计性能和动态规划优化模型放置，在异构竞价型GPU集群上最大化吞吐量，结合输出保留迁移与共享张量存储实现容错，相比基线吞吐量提升1.42倍，成本效率提升31.9%以上。

Comments 18 pages, 16 figures, 5 tables

详情

AI中文摘要

随着大语言模型（LLM）服务的广泛采用，在云环境中为这些模型提供服务的GPU资源成本已成为关键问题。竞价实例相比按需实例可节省高达90%的成本，但其频繁中断和有限可用性对连续LLM服务构成重大挑战。特别是GPU竞价实例的可用性比基于CPU的实例更低且更不稳定，使得依赖单一GPU类型的同构集群容易受到关联故障的影响。跨多种GPU类型的异构集群可以通过利用不同竞价池的互补可用性模式来解决这一问题，然而现有的LLM服务系统是为同构环境设计的，在异构GPU上部署时会遇到负载不均衡的问题。本文提出了ShuntServe，一个用于异构竞价型GPU集群的成本高效LLM服务系统。ShuntServe采用基于屋顶线模型的分析性服务性能估计器和基于动态规划的模型放置优化器，联合确定节点配置、并行化策略和层分配，以最大化跨异构GPU的吞吐量。为了增强使用竞价实例时的容错能力，ShuntServe将输出保留的请求迁移与通过共享张量存储的并发初始化相结合，通过重叠替换节点准备与持续服务来最小化迁移停机时间。在由L4、A10G和L40S GPU组成的异构AWS集群上对Llama-3.1-70B和Qwen3-32B的评估表明，ShuntServe的吞吐量比最先进的基线高出1.42倍和1.35倍，并且与按需实例相比，在离线服务和在线服务中分别实现了31.9%和31.2%的成本效率提升。

英文摘要

As large language model (LLM) services become widely adopted, the cost of GPU resources for serving these models in cloud environments has emerged as a critical concern. Spot instances offer up to 90% cost savings over on-demand instances, but their frequent interruptions and limited availability pose significant challenges for continuous LLM serving. GPU spot instances, in particular, exhibit lower and more volatile availability than CPU-based instances, making homogeneous clusters that depend on a single GPU type vulnerable to correlated failures. Heterogeneous clusters spanning multiple GPU types can address this by leveraging complementary availability patterns across diverse spot pools, yet existing LLM serving systems are designed for homogeneous environments and suffer from load imbalance when deployed on heterogeneous GPUs. This paper presents ShuntServe, a cost-efficient LLM serving system for heterogeneous spot GPU clusters. ShuntServe employs a roofline model-based analytical serving performance estimator and a dynamic programming-based model placement optimizer that jointly determines node configuration, parallelization strategy, and layer assignment to maximize throughput across heterogeneous GPUs. To enhance fault tolerance when using spot instances, ShuntServe combines output-preserving request migration with concurrent initialization via a shared tensor store, minimizing migration downtime by overlapping replacement node preparation with ongoing serving. Evaluation on Llama-3.1-70B and Qwen3-32B with a heterogeneous AWS cluster of L4, A10G, and L40S GPUs shows that ShuntServe achieves 1.42x and 1.35x higher throughput than state-of-the-art baselines and attains 31.9% and 31.2% cost efficiency improvements over on-demand instances for offline and online serving, respectively.

URL PDF HTML ☆

赞 0 踩 0

2606.18593 2026-06-18 cs.HC cs.CY 新提交

"The New Era of Tech-Enabled Traceability": Tensions between the FDA's Data Governance Vision and the Lived Realities of Food Producers

“技术赋能可追溯性的新时代”：FDA的数据治理愿景与食品生产者的现实困境之间的张力

Soonho Kwon, Catherine Wieczorek, Heidi Biggs, Shellye Suttles, Tammi S. Etheridge, Annabel Rothschild, Shaowen Bardzell

AI总结研究美国FDA食品追溯规则如何将农业食品利益相关者转化为数据劳工，通过分析1198条公众评论揭示数据收集、基础设施和文化实践中的三大矛盾。

详情

DOI: 10.1145/3817012

AI中文摘要

美国食品药品监督管理局（FDA）的《食品追溯规则》要求农业食品供应链利益相关者（包括农民、渔民、零售工人等）从2026年1月起维护详细的跟踪记录。通过该规则，FDA设想了一个“技术赋能可追溯性的新时代”，其中标准化、协调一致的跟踪数据作为基础公共卫生基础设施，能够更快速地识别和移除可能受污染的食物，最终降低食源性疾病的风险。尽管这一愿景令人期待，但我们观察到，该规则通过强制要求严格的数据收集、格式化和报告要求，将农业食品利益相关者重新配置为数据劳工。在本文中，我们研究了这种重新配置所产生的张力和负担。以数据女性主义为视角，关注数据驱动的政策实施如何不成比例地加重缺乏基础设施和财务能力的小规模、资源不足的利益相关者的负担，我们分析了针对该拟议规则提交至http://www.regulations.gov的1198条公众评论。我们的定性文档分析揭示了三个关键张力：（1）利益相关者在被重新配置为数据工作者时所经历的个人劳动、财务和教育负担；（2）由于基础设施限制、文化背景和特定生产实践，数据跟踪变得不可行的情况；（3）该规则旨在提供的灵活性因其模糊性反而引入了困惑和负担的实例。

英文摘要

The U.S. Food and Drug Administration (FDA)'s Food Traceability Rule requires agri-food supply chain stakeholders (stakeholders)--including farmers, fishers, retail workers, and others--to maintain detailed tracking records beginning in January 2026. Through this Rule, the FDA envisions a "New Era of Tech-Enabled Traceability," in which standardized, harmonized tracking data serve as a foundational public health infrastructure, enabling more rapid identification and removal of potentially contaminated food and ultimately reducing the risk of foodborne illness. Despite this promising vision, we observe that the Rule reconfigures agri-food stakeholders into data laborers by mandating stringent data collection, formatting, and reporting requirements. In this paper, we examine the tensions and burdens that arise from such reconfiguration. Leveraging Data Feminism as an orientation to attend to how data-driven policy implementation disproportionately burdens smaller, under-resourced stakeholders who lack the infrastructural and financial capacity to comply, we analyze 1,198 public comments submitted to Regulations.gov in response to the proposed Rule. Our qualitative document analysis reveals three key tensions: (1) the individual labor, financial, and educational burdens stakeholders experience as they are reconfigured into data workers; (2) moments where data tracking becomes infeasible due to infrastructural limitations, cultural contexts, and situated production practices; and (3) instances where the Rule's intended flexibility instead introduces confusion and burden due to its ambiguity.

URL PDF HTML ☆

赞 0 踩 0

2606.18569 2026-06-18 cs.CG 新提交

Tangent Spheres and Integer Distances

切球与整数距离

David Eppstein

AI总结将Erdős-Anning定理推广到任意维双曲空间，并首次给出欧氏空间维度大于2时整数距离点集大小的定量界，证明基于切球图的双子图引理。

Comments 6 pages, 4 figures. To appear at the Canadian Conference on Computational Geometry (CCCG 2026)

详情

AI中文摘要

二进制输入的ReLU网络的深度下界

Neil Krishnan, Elchanan Mossel

AI总结针对二进制输入和实值输出的ReLU网络，构造了一个深度n+1、宽度常数的函数族，证明任何深度d、宽度w的精确计算网络需满足w^d = Ω(2^n)，即深度d = o(n/log n)时宽度不能为n的多项式。

Comments The authors explicitly reserves all rights in this work. No permission is granted for the reproduction, storage, or use of this document for the purpose of training artificial intelligence systems or for text and data mining (TDM), including but not limited to the generation of embeddings, summaries, or synthetic derivatives

详情

AI中文摘要

我们研究了具有离散（布尔）输入和实值输出的ReLU网络中深度的作用，补充了两个已有的研究方向。对于布尔输入，在$\mathsf{AC}^0$中证明了显著的深度分离结果，但使用阈值（$\mathsf{TC}^0$）或ReLU门时，深度分离仅针对深度二与三建立。另一方面，对于{\em实值}函数和ReLU网络，Telgarsky（2016）构造了一个简单的单变量函数类，在更高深度上建立了分离。本文旨在为$\{0,1\}^n$上的ReLU网络建立全深度分离。我们通过展示一个显式的函数族来实现这一点，该函数族可由深度$n+1$、宽度常数的ReLU网络精确计算，而任何深度$d$、宽度$w$的ReLU网络要精确计算该函数必须满足$w^d = \Omega(2^n)$；特别地，没有深度$d = o(n/\log n)$的网络可以用$n$的多项式宽度计算它。我们注意到，我们的下界依赖于\emph{精确、无限精度}计算，因为输出的指数精度截断可由多项式大小的$\mathsf{TC}^0$电路计算。

英文摘要

We study the role of depth in ReLU networks with discrete (Boolean) inputs and real-valued outputs, complementing two established lines of work. For Boolean inputs, striking depth separation results were proven for $\mathsf{AC}^0$ but with threshold ($\mathsf{TC}^0$) or ReLU gates depth separation is only established for depth two vs. three. On the other hand, for {\em real-valued} functions and ReLU networks, Telgarsky's (2016) constructed a simple one variable class of functions which establishes separation at higher depths. In this paper we are interested to establish an all-depths depth separation for ReLU networks on $\{0,1\}^n$. We do so by exhibiting an explicit family of functions computable exactly by a ReLU network of depth $n+1$ and constant width, such that any ReLU network of depth $d$ and width $w$ computing the function exactly must satisfy $w^d = Ω(2^n)$; in particular, no network of depth $d = o(n/\log n)$ can compute it with width polynomial in $n$. We note that our lower bound relies on \emph{exact, infinite-accuracy} computation as an exponential precision truncation of the output is computable by a polynomial-size $\mathsf{TC}^0$ circuit.

URL PDF HTML ☆

赞 0 踩 0

2606.18511 2026-06-18 cs.HC 新提交

Stitching the Divide: Investigating Mixed Reality as a Bridge Between Paper-Based and Digital Artifacts in UI/UX Design

缝合鸿沟：探究混合现实作为UI/UX设计中纸质与数字人工制品之间的桥梁

Abidullah Khan, Jinghui Cheng

AI总结通过访谈和概念探针研究，发现混合现实能实现连续混合设计工作流、减少手动重建、支持空间锚定工作区及实时跨媒介协作，并推导出未来MR系统的四个设计维度。

Comments Accepted to the ACM Graphics Interface Conference, 2026

详情

AI中文摘要

UI/UX设计师同时使用纸质和数字人工制品，但缺乏将两者无缝集成的工具。混合现实（MR）为结合两种设计环境的优势提供了尚未充分探索的机会。为考察这些机会，我们首先对19名专业UI/UX设计师进行了访谈，了解他们当前使用纸质和数字人工制品的经验。受访谈见解的启发和指导，我们组织了九次概念探针用户研究会议，设计师在其中使用结合了纸质和数字原型制作过程的MR探针，并头脑风暴MR在UI/UX设计中的潜力。我们发现，参与者重视MR在实现连续混合设计工作流、减少手动重建、支持空间锚定工作区以及促进实时跨媒介协作方面的作用。他们还设想了未来具有AI辅助、更丰富的交互和动态内容以及能够在统一环境中管理多样化设计人工制品的MR工具。根据这些发现，我们推导出未来MR系统的四个设计维度，这些系统可能实现更流畅、更具创造性和协作性的设计实践。

英文摘要

UI/UX designers work with both paper-based and digital artifacts but lack tools that seamlessly integrate the two. Mixed Reality (MR) offers under-explored opportunities to combine the strengths of both design environments. To examine these opportunities, we first conducted interviews with 19 professional UI/UX designers to understand their current experiences using paper and digital artifacts. Motivated and informed by the interview insights, we organized nine conceptual-probe user study sessions in which designers engaged with a MR-probe that combined paper and digital prototyping processes and brainstormed MR's potential in UI/UX design. We found that participants valued MR for enabling continuous hybrid design workflows, reducing manual reconstruction, supporting spatially anchored workspaces, and facilitating real-time cross-medium collaboration. They also envisioned future MR tools with AI assistance, richer interactive and dynamic content, and the ability to manage diverse design artifacts within a unified environment. From these findings, we derive four design dimensions for future MR systems that could enable more fluid, creative, and collaborative design practices.

URL PDF HTML ☆

赞 0 踩 0

2606.18497 2026-06-18 cs.CR 新提交

Ghost Vectors: Soft-Deleted Embeddings Remain Reconstructible in HNSW Vector Databases

幽灵向量：HNSW向量数据库中软删除的嵌入仍然可重构

Chandranil Chakraborttii, Jackeline García Alvarado, Sitora Abdulofizova, Shivanshu Dwivedi

AI总结研究揭示HNSW向量数据库的软删除机制存在安全漏洞，被标记删除的向量仍可通过存储层恢复，并提出基于加密密钥轮换的防护方案。

Comments 13 pages, 5 figures, 12 tables. Prepared for submission

详情

AI中文摘要

检索增强生成（RAG）使大型语言模型能够访问外部和私有语料库，以生成事实性、领域特定的响应。现代RAG流水线使用分层可导航小世界（HNSW）向量数据库进行高效的相似性搜索。当用户请求数据删除时，系统通常仅将记录标记为已删除，而嵌入在磁盘上物理保持不变。这种软删除操作在GDPR第17条和HIPAA等数据擦除和保留要求下引发了合规性问题。对三种HNSW实现的分析证实，通过访问存储层的原始索引文件（绕过API访问），已删除的向量在物理上仍然可恢复。使用无需领域特定微调的Vec2Text反演模型，我们在多个真实世界数据集和数据模态上展示了这一漏洞。在维基百科在世人物数据集（BLP）上，我们成功恢复了25.5%的精确人名和46.4%的地理位置（ROUGE-L 0.185）。在高度结构化的敏感数据（NIH Synthea数据集）上，患者年龄和性别标记的恢复率达到100%（ROUGE-L 0.290）。在软删除的图像嵌入上，我们在组织病理学切片上展示了100%的组织分类（p=1.02e-07），在人脸嵌入上top-1身份恢复率达到99%（p<0.01）。本工作引入了Epoch密钥轮换，即加密向量并在删除时丢弃密钥。Epoch密钥轮换将观察到的PII恢复降至0%，并在2.5毫秒内完成500个已删除向量的处理（约0.005毫秒/记录）。此外，它还生成ECDSA签名的加密证明，作为删除事件的可审计记录。

英文摘要

Retrieval-augmented generation (RAG) allows large language models to access external and private corpora for factual, domain-specific responses. Modern RAG pipelines use hierarchical navigable small world (HNSW) vector databases for efficient similarity search. When a user requests data deletion, the systems typically only mark the record as deleted, leaving the embedding on disk physically unchanged. This soft-delete operation raises compliance concerns under data-erasure and retention requirements such as GDPR Article 17 and HIPAA. Analysis on three HNSW implementations confirms that deleted vectors remain physically recoverable by accessing the raw index files at the storage layer, bypassing API access. Using the Vec2Text inversion model without domain-specific fine-tuning, we show this vulnerability on multiple real-world datasets and data modalities. On Wikipedia biographical living persons dataset (BLP), we successfully recover 25.5% of exact person names and 46.4% of geographic locations (ROUGE-L 0.185). Recovery reaches 100% for both patient age and gender markers (ROUGE-L 0.290) on highly structured, sensitive data (NIH Synthea dataset). On soft-deleted image embeddings, we show 100% tissue classification on histopathology patches (p=1.02e-07) and top-1 identity recovery reaches 99% on facial embeddings (p<0.01). This work introduces Epoch Key Rotation, which encrypts vectors and discards the key upon deletion. Epoch key rotation reduces observed PII recovery to 0% and completes in 2.5 ms for 500 deleted vectors (approximately 0.005 ms/record). Additionally, it generates an ECDSA-signed cryptographic proof as an auditable record of the deletion event.

URL PDF HTML ☆

赞 0 踩 0

2606.18483 2026-06-18 cs.DC 新提交

通过跨层约束发现深度学习流水线中的编译器-平台交互错误

Yuxin Qiu, Jiyuan Wang, Ronak Badhe, Ben Limpanukorn, Miryung Kim, Qian Zhang

AI总结提出一种自动化框架XCheck，通过提取全栈约束生成测试模型，发现编译器与硬件平台交互导致的错误，并在三个编译器上发现2034个错误案例。

详情

AI中文摘要

人工智能的日益部署需要鲁棒的深度学习编译器，如TVM和ONNX-MLIR。这些编译器以高级AI模型为输入，通过多层变换降低它们，并将其专门化到不同的硬件。测试此类编译器具有独特的挑战性，因为正确性取决于嵌入在整个编译栈中的隐式约束。现有的测试方法主要采用类型约束来限制输入模型生成，因此强调类型验证并监控编译崩溃或覆盖率增益。这种关注忽略了由编译和执行环境之间的交错效应引起的编译器-平台交互错误。在这项工作中，我们提出了一个可扩展的自动化DL编译器测试框架，用于同时(1)发现编译器-平台交互错误和(2)实现行为等价划分。我们的关键见解是，这些错误是由跨编译通道和硬件平台的交互引起的违反假设导致的。因此，我们超越了约束输入生成，并推导出全栈约束。我们的方法分为三步。首先，我们设计了一种自动化方法来提取全栈约束，这些约束共同指导模型生成并表征编译行为。其次，我们优先考虑暴露交互敏感行为的约束，以便我们生成的模型能够执行深度编译逻辑。第三，我们通过自动插入断言来监控覆盖率或通过/失败信号遗漏的不同编译症状，从而实现行为等价划分。我们在三个广泛使用的DL编译器上评估了我们的工具XCheck，发现了2034个揭示错误的案例，包括内存溢出、整数溢出以及根源于编译器-平台交互的静默意外编译。

英文摘要

The growing deployment of artificial intelligence (AI) necessitates robust deep learning (DL) compilers, such as TVM and ONNX-MLIR. These compilers take as input high-level AI models, lower them through multi-layer transformations, and specialize them to diverse hardware. Testing such compilers is uniquely challenging as correctness depends on implicit constraints embedded throughout the compilation stack. Existing testing approaches largely take type constraints to restrict input model generation and therefore emphasize type validation and monitor compilation crashes or coverage gains. This focus overlooks compiler-platform interaction bugs that arise from interleaved effects across compilation and execution environments. In this work, we propose a scalable, automated DL compiler testing framework for, in tandem, (1) finding compiler-platform interaction bugs and (2) enabling behavior equivalence partitioning. Our key insight is that these bugs are caused by violated assumptions arising from interactions across compilation passes and hardware platforms. Therefore, we move beyond constraining input generation and derive full-stack constraints. Our approach is three-fold. First, we design an automated approach to extract full-stack constraints that jointly guide model generation and characterize compilation behaviors. Second, we prioritize constraints that expose interaction-sensitive behaviors, so our generated models are capable of exercising deep compilation logic. Third, we enable behavior equivalence partitioning by automatically inserting assertions to monitor distinct compilation symptoms that coverage or pass/fail signals miss. We evaluated our tool, XCheck, on three widely-used DL compilers and found 2,034 bug-revealing cases, including memory overflows, integer overflows, and silent unexpected compilations that were rooted in compiler-platform interactions.

URL PDF HTML ☆

赞 0 踩 0

2606.18417 2026-06-18 cs.CE 新提交

Enhancing neural network extrapolation in thermo-fluid systems using steady-state solutions

利用稳态解增强热流体系统中的神经网络外推能力

Sanjeeb Poudel, Teeratorn Kadeethum, Sanghyun Lee

AI总结针对耗散PDE系统，提出一种稳态信息嵌入的神经网络表示，将解分解为稳态分量和瞬态修正，直接嵌入渐近行为，无需额外惩罚项，显著提升时间外推能力。

详情

AI中文摘要

时间相关偏微分方程（PDE）出现在许多工程系统中，包括热流体应用。对此类系统的经典数值模拟在长时间动力学中可能变得计算昂贵，因为它们通常需要受稳定性、精度或非线性求解器约束的时间步长进行顺序时间积分。尽管科学机器学习为逼近PDE解提供了替代方案，但标准神经网络近似在训练时间区间外进行外推时通常会退化。在这项工作中，我们针对解松弛到平稳平衡的耗散PDE系统提出了一种稳态信息神经网络表示。所提出的ansatz将解分解为稳态分量和由时间相关衰减曲线调制的瞬态修正。当衰减曲线在长时间消失且瞬态修正保持有界时，该表示将收敛到指定稳态直接嵌入到架构中，而不是通过额外的惩罚项来强制执行。这使得网络能够学习瞬态动力学，同时保持正确的渐近行为。我们在物理信息神经网络（PINN）框架内实现了该方法，并使用SOAP优化器训练所得模型。该方法在一系列物理和几何复杂度递增的问题上进行了评估，范围从一维热方程到方腔顶盖驱动不可压缩Navier-Stokes流、方腔自然对流以及全三维共轭传热问题。数值结果表明，与未明确强制执行渐近条件的架构相比，稳态信息架构显著改善了训练区间之外的时间外推。

英文摘要

Time-dependent partial differential equations (PDEs) arise in many engineering systems, including thermo-fluid applications. Classical numerical simulations of such systems can become computationally expensive for long-time dynamics because they typically require sequential time integration with time steps constrained by stability, accuracy, or nonlinear solvers. Although scientific machine learning provides an alternative for approximating PDE solutions, standard neural network approximations often degrade when extrapolated beyond the training time interval. In this work, we propose a steady-state-informed neural network representation for dissipative PDE systems whose solutions relax toward a stationary equilibrium. The proposed ansatz decomposes the solution into a steady-state component and a transient correction modulated by a time-dependent decay profile. When the decay profile vanishes at long time and the transient correction remains bounded, the representation embeds convergence to the prescribed steady state directly into the architecture, rather than enforcing it through an additional penalty term. This allows the network to learn the transient dynamics while preserving the correct asymptotic behavior. We implement the approach within a physics-informed neural network (PINN) framework and train the resulting model using the SOAP optimizer. The method is evaluated on a sequence of problems of increasing physical and geometric complexity, ranging from the one-dimensional heat equation to incompressible Navier-Stokes flow in a lid-driven cavity, natural convection in a square cavity, and a full three-dimensional conjugate heat transfer problem. The numerical results show that the steady-state-informed architecture substantially improves temporal extrapolation beyond the training interval compared with architectures that do not explicitly enforce the asymptotic condition.

URL PDF HTML ☆

赞 0 踩 0

2606.18416 2026-06-18 eess.SY cs.SY 新提交

Constellation-Level Power Allocation for LEO Space-Based Solar Power

LEO天基太阳能的星座级功率分配

Mustafa Alhassan, Amjad Iqbal, Peng Hu

AI总结提出LEO SBSP系统模型，通过24小时仿真评估Walker 4×5星座的功率分配，发现峰值功率1.986 MW，每站平均40-75 kW，功率密度低于ICNIRP限值。

详情

AI中文摘要

天基太阳能（SBSP）近期作为利用天基基础设施提供持续清洁能源的有吸引力的技术进步重新受到关注。然而，低地球轨道（LEO）卫星星座用于SBSP的潜力在很大程度上仍未探索，缺乏详细的基于仿真的研究。在本文中，我们引入了一个新颖的LEO SBSP系统模型，并对高度450 km的Walker $4\ imes 5$ LEO SBSP星座进行了24小时系统级仿真，在贪婪分配策略下将2.45 GHz微波功率波束传输到八个地面站（GS）。该模型包括轨道传播、日食周期、卫星功率链、Goubau-Brown波束耦合、ITU-R P.618大气衰减和星载电池动力学。结果证实，传输的峰值直流功率达到1.986 MW，而服务站的每站点平均传输功率在40到75 kW之间。八个地面站中有两个在运行期间未获得服务，因为在贪婪策略下，它们的过境排名始终低于同一时刻的竞争链路。整流天线处的入射峰值功率密度（PD）保持在3.35-5.72 W/m²范围内，低于国际非电离辐射防护委员会（ICNIRP）的公众暴露限值。对于此高度的20颗卫星Walker LEO星座，每站实际传输功率为50-100 kW，整流天线应按照约5 W/m²的运行入射功率密度设计，而不是按照地球静止轨道（GEO）时代的100 W/m²额定值设计。

英文摘要

Space-based solar power (SBSP) has recently gained renewed attention as an appealing technological advancement for providing continuous clean energy using space-based infrastructure. However, the potential of low-Earth orbit (LEO) satellite constellations for SBSP remains largely unexplored and lacks detailed simulation-based studies. In this paper, we introduce a novel LEO SBSP system model and conduct a 24-hour system-level simulation of a Walker $4\times 5$ LEO SBSP constellation at an altitude of 450\,km, beaming 2.45\,GHz microwave power to eight ground stations (GSs) under a greedy allocation policy. The model includes orbital propagation, eclipse cycles, the satellite power chain, Goubau--Brown beam coupling, ITU-R P.618 atmospheric attenuation, and onboard battery dynamics. The results confirm that the peak DC power delivered reaches 1.986\,MW, while the mean per-site delivery at the served GS ranged from 40 to 75\,kW. Two of the eight GSs received no service during the run, as their passes were consistently ranked lower under the greedy policy than competing links at the same step. The incident peak power density (PD) at the rectenna remained within the 3.35--5.72\,W/m\textsuperscript{2} range, below the International Commission on Non-Ionizing Radiation Protection (ICNIRP) general-public exposure limit. For a 20-satellite Walker LEO at this altitude, realistic per-site delivery is 50--100 kW, and the rectenna should be sized to the operational incident PD of order 5,W/m\textsuperscript{2} rather than to a Geostationary Earth Orbit (GEO)-era 100,W/m\textsuperscript{2} rating.

URL PDF HTML ☆

赞 0 踩 0

2606.18405 2026-06-18 cs.CR 新提交

探索统计变点检测技术在 Mozilla 性能异常检测中的应用

Mohamed Bilel Besbes, Gregory Mierzwinski, Suhaib Mujahid, Philipp Leitner, Alexander Serebrenik, Dave Hunt, Diego Elias Costa

AI总结本文针对 Mozilla 性能异常检测中高误报和漏报问题，评估了 25 种变点检测方法和 15 种集成方法，基于人工标注的真实数据集发现集成投票策略在 F1 分数上提升 11%，并已集成到 Mozilla 系统。

详情

AI中文摘要

软件性能回归可能带来严重的业务后果，因此自动检测成为现代持续集成流水线的关键组成部分。在 Mozilla，性能异常检测由 Perfherder 处理，这是 Mozilla 的性能工程管理系统，它基于 Student's T 检验方法在每天数百次代码变更中标记回归。然而，我们对 Mozilla 一年性能数据的初步分析显示，12.5% 生成的警报组是误报，而约 6.8% 的警报组包含自动系统遗漏的回归。本文提出了一项实证研究，评估了 25 种变点检测（CPD）方法和 15 种集成方法作为 Mozilla 当前方法的替代方案。我们构建了一个包含 174 个性能时间序列的真实数据集，由 11 位 Mozilla 性能工程师手动标注，代表了性能工程领域首批从业者标注的 CPD 基准之一。我们的结果表明，虽然离线和混合 CPD 方法比 Mozilla 方法提高了召回率，但代价是精度大幅降低。集成投票策略缓解了这种权衡，并提供了更一致的性能，使 F1 分数提高了 11%。我们通过从业者调查验证了实验结果，并报告了将最佳方法集成到 Mozilla 性能工程系统中的经验教训。

英文摘要

Software performance regressions can have significant business consequences, making automated detection a critical component of modern continuous integration pipelines. At Mozilla, performance anomaly detection is handled by Perfherder, Mozilla's performance engineering management system that relies on a Student's T-test-based approach to flag regressions across hundreds of daily code changes. However, our preliminary analysis of one year of Mozilla performance data reveals that 12.5% of generated alert groups are false positives, while approximately 6.8% of them contain regressions missed by the automated system. This paper presents an empirical study evaluating 25 change-point detection (CPD) methods and 15 ensemble approaches as alternatives to Mozilla's current method. We construct a ground-truth dataset of 174 performance time series manually annotated by eleven Mozilla performance engineers, representing one of the first practitioner-annotated CPD benchmarks for performance engineering. Our results show that while offline and hybrid CPD methods improve recall over Mozilla's method, they do so at a high cost to precision. Ensemble voting strategies alleviate this trade-off and offer more consistent performance, resulting in 11% improvement in the F1-score. We validate the experimental results through a practitioner survey and report on lessons learned from integrating the best methods into Mozilla's performance engineering system.

URL PDF HTML ☆

赞 0 踩 0

2606.18320 2026-06-18 cs.CR 新提交

TopVenues: A Reproducible Corpus and Tooling Substrate for Cybersecurity Literature Reviews

TopVenues：一个可复现的网络安全文献综述语料库与工具基础

Sidnei Barbieri, Ágney Lopes Roth Ferraz, Lourenço Alves Pereira Júnior

AI总结提出TopVenues开源系统，通过DBLP元数据骨架和API构建版本化语料库，实现网络安全文献综述的可复现基础，支持高效检索和可重复测量。

详情

AI中文摘要

网络安全文献综述需要一个可复现的分母：协议在筛选和综合开始前包含的论文集合。如今，该分母通常从出版商门户、书目索引和学术应用程序接口（API）重建，而这些接口的覆盖范围、格式和查询语义随时间变化。本文提出TopVenues，一个开源系统，将语料库构建实现为版本化的研究工件。TopVenues声明一个会议和年份范围，使用DBLP计算机科学书目（DBLP）作为元数据主干，通过开放的学术API和特定出版商的提取器丰富记录的摘要和BibTeX条目，并将结果存储在单调的SQLite快照中，可通过命令行界面（CLI）、Web界面以及用于综述工作流的导出路径访问。2026年5月的快照包含来自2017年至2026年11个网络安全来源的9,925篇论文，摘要覆盖率达99.86%，BibTeX覆盖率达99.99%；全文语料库的关键词搜索在31毫秒内完成，一个250个测试的套件验证了数据完整性不变量。固定的分母还实现了可重复测量：在我们的范围内，四个顶级安全会议2024年至2025年的论文中有29.2%以arXiv预印本形式出现，中位发表前时间为五个月，而先前作者记录过滤器在90%召回率下对后续出现在同一会议集中的预印本进行筛选时，实现了16.5倍的精度提升。TopVenues通过使语料库本身可执行、可检查和可引用，将语料库构建与可审计的网络安全测量联系起来。该工件可在以下网址获取：this https URL。

英文摘要

Cybersecurity literature reviews require a reproducible denominator: the set of papers that a protocol includes before screening and synthesis begin. Today, that denominator is often reconstructed from publisher portals, bibliographic indices, and scholarly application programming interfaces (APIs) whose coverage, formats, and query semantics change over time. This paper presents TopVenues, an open-source system that materializes corpus construction as a versioned research artifact. TopVenues declares a venue and year scope, uses DBLP Computer Science Bibliography (DBLP) as the metadata spine, enriches records with abstracts and BibTeX entries via open scholarly APIs and publisher-specific extractors, and stores the results in a monotonic SQLite snapshot, accessible via a command-line interface (CLI), a web interface, and export paths for review workflows. The May 2026 snapshot contains 9,925 papers from 11 cybersecurity sources over 2017 to 2026, with 99.86% abstract coverage and 99.99% BibTeX coverage; keyword search over the full corpus completes in under 31 ms, and a 250-test suite validates the data-integrity invariants. The fixed denominator also enables repeatable measurement: 29.2% of 2024 to 2025 papers from the four top-ranked security conferences in our scope appear as arXiv preprints, with a median of five months before publication, and a prior-author-track-record filter yields a 16.5x precision gain at 90% recall for triaging preprints that later appear in the same venue set. TopVenues links corpus construction to auditable cybersecurity measurement by making the corpus itself executable, inspectable, and citable. The artifact is available at https://github.com/sidneibarbieri/topVenues.

URL PDF HTML ☆

赞 0 踩 0

2606.18314 2026-06-18 cs.CG 新提交

Repair Entropy in Dynamic Geometric Nearest-Neighbour Structures

动态几何最近邻结构中的修复熵

Faruk Alpay, Bugra Kilictas

AI总结针对小运动下的精确最近邻维护问题，提出基于修复前沿熵的自适应策略，在O(|F_t| log N)时间内修复失效证书，并验证了2400种运动场景下的有效性。

Comments 10 pages, 2 figures, 2 tables; code and dataset provided as ancillary files

详情

AI中文摘要

我们研究小运动下精确最近邻维护的动态几何数据结构。对每个点，我们存储一个由最近邻和两个最小邻近距离组成的证书，间隙为$c_i=d^i_2-d^i_1$。三角不等式给出一个尖锐的有效性半径：在最大位移为$\varepsilon$的一步后，每个满足$c_i>4\varepsilon$的证书仍然有效，因此所有可能的失效被限制在修复前沿$F_t$内。我们引入修复前沿熵$H(F_t)$，即失效证书在索引单元上的归一化香农熵，作为选择事件驱动修复、批量修复或完全重建的工作负载描述符。由此产生的维护规则在单元占用有界的情况下，仅以$O(|F_t|\log N)$时间修复前沿，而完全重建代价为$\Theta(N)$；此外，熵为事件驱动修复所触及的前沿单元数量提供下界，并改变了经验上的修复-重建交叉点。我们在$d\in\{2,3\}$中评估了十种运动族，$N$高达16,000，使用精确的平铺GPU预言机和GPU网格重建作为真实值和竞争者。在2400个标记的转换中，有效性规则没有遗漏任何无效证书，低压前沿通常通过增量修复更便宜，而相同大小的扩散前沿对于事件驱动修复更昂贵，但对于批量修复则不然。发布的数据集记录了前沿几何、证书审计、每种策略的时间以及最佳策略标签。

英文摘要

We study dynamic geometric data structures for exact nearest-neighbour maintenance under small motions. For each point we store a certificate consisting of its nearest neighbour and the two smallest neighbour distances, with clearance $c_i=d^i_2-d^i_1$. A triangle-inequality argument gives a sharp validity radius: after a step of maximum displacement $\varepsilon$, every certificate with $c_i>4\varepsilon$ remains valid, so all possible failures are confined to a repair frontier $F_t$. We introduce repair-frontier entropy $H(F_t)$, the normalized Shannon entropy of failed certificates over index cells, as a workload descriptor for choosing between event-driven repair, batched repair, and full rebuild. The resulting maintenance rule repairs only the frontier in $O(|F_t|\log N)$ time under bounded cell occupancy, while a full rebuild costs $Θ(N)$; moreover, entropy lower-bounds the number of frontier cells touched by event-driven repair and shifts the empirical repair-rebuild crossover. We evaluate ten motion families in $d\in{2,3}$, with $N$ up to $16,000$, using an exact tiled GPU oracle and a GPU grid rebuild as ground truth and competitor. Across $2400$ labelled transitions, the validity rule misses no invalid certificate, low-pressure frontiers are usually cheaper to repair incrementally, and diffuse frontiers of the same size are more expensive for event-driven repair but not for batched repair. The released dataset records frontier geometry, certificate audits, per-strategy times, and best-strategy labels.

URL PDF HTML ☆

赞 0 踩 0

2606.18297 2026-06-18 cs.DB 新提交

HyDRA: 通过共聚类实现无损超图摘要

Giulia Preti, Aris Anagnostopoulos, Francesco Bonchi

AI总结提出HyDRA，首个无损加权超图摘要框架，通过共聚类思想设计贪心算法，结合增量更新策略，实现80-93%的存储压缩，并支持直接查询和加速下游任务。

详情

AI中文摘要

超图是表示高阶交互的强大表示形式，但其规模和复杂性带来了显著的数据管理和分析挑战。虽然摘要技术广泛用于简化简单图，但超图的无损摘要仍未得到探索。我们引入了HyDRA，这是首个用于加权超图无损摘要的正式框架。在我们的框架中，摘要是一个由超节点（节点组）和超超边（超边组）组成的新加权超图，并配有一个用于精确重建的校正表。通过建立与共聚类的概念联系，我们设计了一种高效、无参数的贪心算法，该算法迭代地合并节点和超边聚类，以最小化一种新颖的存储感知代价函数。HyDRA采用增量更新策略，以避免每一步中校正表的昂贵重新计算。大量实验表明，我们的方法在存储成本上实现了显著降低（在某些设置中，根据超图特征，降低80-93%）。由于生成的摘要本身是超图，可以直接查询，为各种连通性和中心性查询提供快速且准确的近似答案，并加速诸如影响力最大化等下游任务。

英文摘要

Hypergraphs are a powerful representation for higher-order interactions but their scale and complexity pose significant data management and analysis challenges. While summarization techniques are widely used to distill simple graphs, lossless summarization for hypergraphs remains unexplored. We introduce HyDRA, the first formal framework for lossless summarization of weighted hypergraphs. In our framework, a summary is a new weighted hypergraph composed of supernodes (groups of nodes) and superhyperedges (groups of hyperedges), paired with a correction table for exact reconstruction. By establishing a conceptual link to co-clustering, we design an efficient, parameter-free greedy algorithm that iteratively merges node and hyperedge clusters to minimize a novel storage-aware cost function. HyDRA employs an incremental update strategy to prevent the costly recomputation of the correction table at each step. Extensive experiments demonstrate that \our achieves a substantial reduction in storage cost (80-93% in some settings, depending on the hypergraph characteristics). Because the resulting summaries are themselves hypergraphs, they can be queried directly, providing fast and accurate approximate answers for various connectivity and centrality queries, and accelerating downstream tasks such as influence maximization.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

PersonalPlan: Planning Multi-Agent Systems for Personalized Programming Learning

ShuntServe: Cost-Efficient LLM Serving on Heterogeneous Spot GPU Clusters

"The New Era of Tech-Enabled Traceability": Tensions between the FDA's Data Governance Vision and the Lived Realities of Food Producers

Tangent Spheres and Integer Distances

Principal Component Analysis and Power Indices

Wind-Resilient Trajectory Optimization for UAV-BS Networks: TD3 for Continuous Service Availability

The Gate Is Only as Honest as Its Contracts: ContractGuard for the Contract Layer of Risk-Aware Causal Gating

Co-evolution of the global research collaboration network and the performance of nations in science and technology

Confident yet Concerned: Inconsistencies in Computing Students' Attitudes on Cybersecurity

Depth Lower Bounds for ReLU Networks with Binary Inputs

Stitching the Divide: Investigating Mixed Reality as a Bridge Between Paper-Based and Digital Artifacts in UI/UX Design

Ghost Vectors: Soft-Deleted Embeddings Remain Reconstructible in HNSW Vector Databases

Flexible Distributed Particle Filtering for the Internet of Things via Aggregate Computing

Designing L5: A Permacomputing Approach to Creative Coding

Understanding the "Airport" Censorship Circumvention Ecosystem in China

A Critical Discourse Analysis of Gender Representation in Software Engineering Education Videos on YouTube

Finding Compiler-Platform Interaction Bugs in Deep Learning Pipelines via Cross-Layer Constraints

Enhancing neural network extrapolation in thermo-fluid systems using steady-state solutions

Constellation-Level Power Allocation for LEO Space-Based Solar Power

Evaluating the Effectiveness of LLMs in Aiding Compliance Testing of PKCS#1-v1.5

CloakLM: Obfuscating GPU Memory Layout to Mitigate Model Ex-filtration for Serving

When Mobile Crowdsourcing Meets Queueing Systems: Human-in-the-Loop Learning

Exploring Statistical Change Point Detection Techniques for Performance Anomaly Detection at Mozilla

TopVenues: A Reproducible Corpus and Tooling Substrate for Cybersecurity Literature Reviews

Repair Entropy in Dynamic Geometric Nearest-Neighbour Structures

From Embedded Properties to Trait Nodes: A Design Method for Identifying Reusable Metadata in Property Graph Schemas

Beyond the Algorithm: Professional Experiences and Perceptions of AI Bias

RELIANCE: Curating and Evaluating Reproductive Health Information on Social Media

Joint Discovery of Graph Structure and Dynamics in Stochastic Interacting Particle Systems

HyDRA: Lossless Hypergraph Summarization via Co-Clustering