arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2606.19746 2026-06-19 cs.DC 新提交

SAC: Disaggregated KV Cache System for Sparse Attention LLMs with CXL

SAC: 面向稀疏注意力LLM的基于CXL的解耦KV缓存系统

Ruiyang Ma, Teng Ma, Junru Li, Hantian Zha, Xuchun Shang, Qingda Hu, Zheng Liu, Xinjun Yang, Tao Ma, Guojie Luo

AI总结针对稀疏注意力模型在长上下文推理中全量KV缓存传输导致的瓶颈，提出基于CXL按需获取top-k KV条目的解耦缓存系统SAC，相比RDMA方案吞吐提升2.1倍、TTFT降低9.7倍。

详情

AI中文摘要

LLM向长上下文推理的扩展将主要服务系统瓶颈从计算转移到内存容量。传统针对密集注意力模型的解决方案依赖基于RDMA的解耦内存池，在解码前从远程存储粗粒度地获取整个前缀KV缓存到本地内存。然而，这种方法对于新兴的稀疏注意力模型本质上是低效的。尽管解码过程中只有一小部分KV条目是活跃的，这些系统仍然将完整的KV缓存获取到本地，导致严重的传输瓶颈和本地内存浪费。为了解决这个问题，我们提出了SAC，第一个针对稀疏注意力模型优化的高效解耦KV缓存系统。通过利用Compute Express Link (CXL)的低延迟、缓存行粒度的加载/存储语义，SAC在推理过程中按需仅获取所需的top-k KV条目。在使用SGLang对DeepSeek-V3.2的评估中，与基于RDMA的基线相比，SAC实现了2.1倍的吞吐量提升、9.7倍的TTFT降低和1.8倍的TBT降低，确立了基于CXL的解耦作为新兴稀疏注意力模型的优越基础设施。

英文摘要

The scaling of LLMs toward long-context inference has shifted the primary serving system bottleneck from computation to memory capacity. Traditional solutions for dense attention models rely on RDMA-based disaggregated memory pools, which perform coarse-grained fetching of the entire prefix KV cache from remote storage to local memory before decoding. However, this approach is fundamentally inefficient for emerging sparse attention models. While only a small fraction of KV entries are active during decoding, these systems still fetch the full KV cache locally, leading to severe transmission bottlenecks and local memory wastage. To address this, we propose SAC, the first efficient disaggregated KV cache system optimized for sparse attention models. By leveraging the low-latency, cache-line granularity load/store semantics of Compute Express Link (CXL), SAC fetches only the required top-k KV entries on demand during inference. Evaluations on DeepSeek-V3.2 using SGLang show that SAC achieves 2.1x higher throughput, 9.7x lower TTFT, and 1.8x lower TBT compared to RDMA-based baselines, establishing CXL-based disaggregation as the superior infrastructure for emerging sparse attention models.

URL PDF HTML ☆

赞 0 踩 0

2606.19745 2026-06-19 cs.HC 新提交

Designing for Interconnected Islamic Learning: A Qualitative Study of Muslim Women's Experiences with Qur'an, Hadith, and Seerah Apps

设计互联的伊斯兰学习：穆斯林女性使用古兰经、圣训和先知传记应用的质性研究

Ishrat Jahan Easha, Nabil Mosharraf Hossain, Araf Mohammad Mahbub, Fairoze Bint Abu Hassan, Zunaid Aslam, Yemin Sajid, Riasat Islam

AI总结通过访谈穆斯林女性，发现她们在数字工具中阅读古兰经、圣训和先知传记时面临上下文分离的张力，提出分层语境性概念，强调在可信、可选且不打断阅读的前提下提供跨文本语境。

Comments 27 pages, 1 figure, 3 tables. Submitted to the International Journal of Human-Computer Interaction

详情

AI中文摘要

伊斯兰学习通常依赖于同时阅读古兰经、圣训和先知传记，然而数字工具通常将这些资源分散在不同的应用、屏幕和搜索路径中。我们通过从在线伊斯兰学习社区招募的五名穆斯林女性的半结构化访谈，将此视为人机交互问题。参与者描述了一个反复出现的张力：她们希望在阅读时获得古兰经-圣训-先知传记的上下文，但仅当上下文扩展是可信的、可选的且不打断阅读时。通过性别化数字宗教、认知信任和无缝学习的视角解读访谈，我们识别出关于上下文理解、真实性、界面杂乱、学习模式和指导特征的五个主题。我们引入分层语境性作为该领域的HCI解释：上下文扩展必须与解释责任、虔诚流动以及跨设备和学习强度的连续性相平衡。

英文摘要

Islamic learning often depends on reading the Qur'an, Hadith, and Seerah together, yet digital tools typically separate these sources across apps, screens, and search pathways. We examine this as a human-computer interaction problem through five semi-structured interviews with Muslim women recruited from an online Islamic learning community. Participants described a recurring tension: they wanted Qur'an-Hadith-Seerah context at the point of reading, but only when contextual expansion remained trustworthy, optional, and did not interrupt reading. Interpreting the interviews through gendered digital religion, epistemic trust, and seamless learning, we identify five themes concerning contextual understanding, authenticity, interface clutter, study modes, and guidance features. We introduce layered contextuality as an HCI account of this domain: contextual expansion must be balanced with interpretive accountability, devotional flow, and continuity across devices and study intensities.

URL PDF HTML ☆

赞 0 踩 0

2606.19703 2026-06-19 cs.HC 新提交

Vibe Coding for Visualization Implementation: An Empirical Study of Practices and Challenges

Vibe Coding 用于可视化实现：实践与挑战的实证研究

Zhengyu Sun, Xiaolin Wen, Fengjie Wang, Can Liu, Yi Lai, Christophe Hurter, Yong Wang

AI总结通过16名参与者的实证研究，探讨用户使用AI驱动的自然语言交互工具生成可视化时的实践（提示、评估、迭代）和挑战。

Comments 5 pages, 2 figures. Short paper under review

2606.19692 2026-06-19 cs.CR cs.DB cs.IR 新提交

When Global Gating Is Enough: Admission-Time Hubness Control in Anisotropic Vector Retrieval Systems

当全局门控足够：各向异性向量检索系统中的准入时间枢纽性控制

Prashant Kumar Pathak, Tarun Kumar Sharma

AI总结针对检索增强生成中向量枢纽性引发的投毒风险，提出准入时间控制方法，通过哨兵查询评分隔离枢纽文档，全局门控在多个数据集上达到高召回率和低误报率。

详情

AI中文摘要

向量枢纽性（少数点成为许多查询的最近邻）在检索增强生成（RAG）中造成投毒风险：一个注入的文档可能影响不相关的请求。现有防御使用周期性反向k近邻扫描，存在暴露窗口和重复的全语料库工作。我们研究准入时间控制，根据哨兵查询对每个候选文档评分，并在插入前隔离类似枢纽的文档。在两个10万文档语料库、五个编码器以及不相交的攻击者和防御者查询集上，全局门控在决定性嵌入空间点达到召回率1.0（有效范围内>=0.92），在HotFlip攻击上达到0.91 +/- 0.07，对一般文档的误报率为1%。每主题门控没有提供可靠的好处，这与各向异性耦合局部和全局可见性一致。阈值是增量维护的，插入成本与语料库大小无关，删除成本摊销。在HNSW上，准入增加约3.1%的摄入延迟，评分在10^6向量上保持平坦，近似索引下1.2%的决策翻转，不涉及攻击。来源信息补充了门控对自然或紧密领域枢纽的处理。

英文摘要

Vector hubness, where a few points become nearest neighbors of many queries, creates a poisoning risk in retrieval-augmented generation (RAG): one injected document can influence unrelated requests. Existing defenses use periodic reverse-kNN scans, leaving an exposure window and repeated corpus-wide work. We study admission-time control, scoring each candidate against sentinel queries and quarantining hub-like documents before insertion. Across two 100,000-document corpora, five encoders, and disjoint attacker and defender query sets, a global gate achieves recall 1.0 at the decisive embedding-space point (>=0.92 across the effective range) and 0.91 +/- 0.07 on HotFlip attacks, with 1% false positives on general documents. A per-topic gate provides no reliable benefit, consistent with anisotropy coupling local and global visibility. Thresholds are maintained incrementally, with corpus-size-independent insertion cost and amortized deletion cost. On HNSW, admission adds about 3.1% to ingestion latency, scoring remains flat to 10^6 vectors, and 1.2% of decisions flip under approximate indexing, none involving attacks. Provenance complements the gate for natural or tight-domain hubs.

URL PDF HTML ☆

赞 0 踩 0

2606.19689 2026-06-19 cs.HC 新提交

Syndesmoscope: The Power of Invariant Plots\\Linked to Traditional Network Views

Syndesmoscope: 不变图的力量与传统网络视图的关联

Matt Oddo, Indira Sowy, Stephen Kobourov, Tamara Munzner

AI总结提出Syndesmoscope系统，通过结合不变图（如kSnakes）与传统网络视图，利用跳蛙和跳房子交互揭示单一视图无法呈现的网络模式。

详情

AI中文摘要

传统的网络表示，如节点-链接视图和邻接矩阵，根据底层布局或排序算法可能显示出截然不同的视觉模式。相比之下，不变图对于相同的输入拓扑始终呈现相同的视觉模式；然而，研究者对其探索不足，且未将其集成到可视化系统中。我们提出了Syndesmoscope，一个用于网络探索的交互式系统，它并置了同一网络的多个视图。窗格显示一个熟悉的力导向视图，以及三个基于图论属性的可解释几何布局窗格：密集-稀疏梯度、测地偏心率和谱二分。作为次要贡献，我们引入了kSnakes，一种基于密度分解的新不变图。Syndesmoscope支持两种关键交互：跳蛙，即不同可解释视觉模式之间的链接高亮；以及跳房子，即通过底层拓扑扩展数据选择的基于跳的遍历。通过在72个不同网络组成的语料库上的使用场景，我们展示了这些交互如何揭示单一视图无法访问的网络模式。在线演示见此URL。

英文摘要

Traditional network representations, such as node-link views and adjacency matrices, can show dramatically different visual patterns, depending on the underlying layout or seriation algorithm. In contrast, invariant plots consistently surface the same visual pattern for the same input topology; yet researchers have underexplored them and have not integrated them into visualization systems. We present Syndesmoscope, an interactive system for network exploration that juxtaposes multiple views of the same network. Panes show a familiar a force-directed view alongside three panes with interpretable geometric layouts based on graph-theoretic properties: dense-sparse gradient, geodesic eccentricity, and spectral bisection. As a secondary contribution, we introduce kSnakes, a new invariant plot based on density decomposition. Syndesmoscope supports two key interactions: leapfrogging, or linked highlighting between different and interpretable visual patterns; and hopscotching, or hop-based traversal that extends data selections through the underlying topology. Through usage scenarios across a corpus of 72 diverse networks, we demonstrate how these interactions reveal network patterns inaccessible through any single view alone. Live demo available at https://syndesmoscope.vercel.app/.

URL PDF HTML ☆

赞 0 踩 0

2606.19686 2026-06-19 cs.PL 新提交

Effect Systems as Abstract Interpretations

效应系统作为抽象解释

Colin S. Gordon

AI总结本文通过将效应量词嵌入抽象域，并从事件发生角度恢复效应量词的一般形式，建立了抽象解释与一般效应系统之间的形式关系。

Comments Draft short paper

2606.19680 2026-06-19 cs.CE 新提交

ImProNCDE: Impulse-Corrected Neural Controlled Differential Equations with Prototype Learning for Longitudinal Prognosis Prediction

ImProNCDE：基于原型学习的脉冲校正神经控制微分方程用于纵向预后预测

Hao Wang, Yupeng Xu, Jinghao Lin, Shuchang Ye, Yige Peng, Jinman Kim, Kun Liu, Lei Bi

AI总结提出ImProNCDE框架，通过残差脉冲校准捕捉病理突变，并利用原型引导轨迹稳定器减少长期误差累积，在眼科纵向预后预测中超越现有方法。

Comments 12 pages, 5 figures

详情

AI中文摘要

纵向眼科影像分析是眼科疾病预后预测的关键步骤。然而，AI辅助预后模型面临随访序列稀疏、不规则采样和不完整的挑战。尽管先进的预后建模方法，尤其是基于神经控制微分方程（NCDE）的方法，为稀疏和不规则的纵向数据提供了原则性的连续时间框架，但在临床随访建模中仍有两个主要问题未解决。首先，标准NCDE的平滑潜在动力学与治疗干预、病灶复发或长随访间隔引起的突然病理变化不匹配。其次，长时间跨度的数值积分会累积误差，导致不稳定的潜在轨迹和弱化的类别区分。为解决这些挑战，我们提出了ImProNCDE，一种带有原型学习的脉冲校正NCDE框架，用于纵向眼科预后预测。为了捕捉平滑潜在动力学之外的突然病理变化，ImProNCDE引入了残差脉冲校准（RIC），在就诊时间注入基于残差的脉冲校正，并在观测偏离连续预测时重新校准潜在状态。为了进一步减轻长时间跨度的误差累积，我们引入了原型引导轨迹稳定器（PTS），旨在将潜在轨迹吸引到可学习的预后原型，以减少类别重叠，最终提高长期稳定性。在多个私人和公共纵向眼科数据集（总计超过1206个样本）上的实验表明，ImProNCDE优于专注于序列建模的现有最先进方法。

英文摘要

Longitudinal ophthalmic imaging analysis is an essential step for prognosis prediction in ophthalmic diseases. However, AI-assisted prognosis models are challenged by follow-up sequences, which tend to be sparse, irregularly sampled, and incomplete. Although advanced prognosis modeling methods, especially for the methods based on neural controlled differential equations (NCDEs), provide a principled continuous-time framework for sparse and irregular longitudinal data. Unfortunately, two major concerns remain unsolved in clinical follow-up modeling. First, the smooth latent dynamics of standard NCDEs is poorly matched to abrupt pathological changes induced by therapeutic intervention, lesion recurrence, or long follow-up gaps. Second, numerical integration over long horizons can accumulate errors, which will produce unstable latent trajectories and weakened class discrimination. To address these challenges, we propose ImProNCDE, an impulse-corrected NCDE framework with prototype learning for longitudinal ophthalmic prognosis prediction. To capture abrupt pathological changes beyond smooth latent dynamics, ImProNCDE introduces Residual Impulse Calibration (RIC), which injects residual-based impulse corrections at visit times and then recalibrates the latent state when observations deviate from continuous predictions. To further mitigate error accumulation over long horizons, we introduce a Prototype-guided Trajectory Stabilizer (PTS), which aims to attract latent trajectories toward learnable prognosis prototypes to reduce class overlap and which ultimately improves long-horizon stability. Experiments on multiple private and public longitudinal ophthalmic datasets (totalling over 1206 samples) show that ImProNCDE outperforms existing SOTA methods focusing on sequence modeling.

URL PDF HTML ☆

赞 0 踩 0

2606.19654 2026-06-19 cs.CR cs.SE 新提交

PUFFERDOS: Efficient and Effective Attack String Generation for Regular Expression Denial of Service Vulnerabilities

PUFFERDOS：针对正则表达式拒绝服务漏洞的高效攻击字符串生成

Shangzhi Xu, Ziqi Ding, Xiao Cheng, Yuekang Li, Nan Sun, Benjamin Turnbull, Shuangxiang Kan, Siqi Ma

AI总结提出PUFFERDOS方法，通过定义三种脆弱模式并利用合成技术与组合符号执行，生成在现实长度预算内且经程序验证有效的ReDoS攻击字符串。

Comments Accepted by S&P'26

详情

AI中文摘要

ReDoS攻击构成了一类关键的资源耗尽漏洞。在此类攻击中，攻击者利用正则表达式引擎的病态最坏情况执行行为，诱导高度不对称的计算工作负载，最终耗尽系统资源并降低服务可用性。为了保护系统免受ReDoS攻击，研究人员提出了许多检测技术，这些技术通过生成攻击字符串来模拟攻击过程，以便在早期开发阶段主动利用ReDoS漏洞并促进修复。现有技术大致分为两类：搜索病态正则表达式结构的静态分析，以及合成候选攻击字符串的动态探索方法。然而，生成的攻击字符串通常不适用于实际利用，因为它们往往假设不切实际的输入长度预算，并且未在程序级别验证攻击的有效性和效率。因此，许多生成的字符串在应用于实际程序时无法触发易受攻击的正则表达式，进一步限制了其实用性。为了解决这些不足，我们引入了一种有效且高效的攻击字符串生成器PUFFERDOS，旨在合成在现实长度预算内可行且经程序级别验证的攻击输入，从而实现对实际程序中ReDoS漏洞的有效利用。具体来说，我们首先基于观察和形式化验证定义了三种脆弱模式。根据这些模式，PUFFERDOS采用合成技术生成攻击字符串，然后通过针对ReDoS的组合符号执行对字符串进行细化和验证，以确保现实世界中的可利用性。

英文摘要

ReDoS attacks constitute a critical class of resource-exhaustion vulnerabilities. In such attacks, adversaries exploit the pathological worst-case execution behavior of regular expression (regex) engines to induce highly asymmetric computational workloads, ultimately exhausting system resources and degrading service availability. To protect systems against ReDoS attacks, numerous detection techniques have been proposed that simulate the attack process by generating attack strings to proactively exploit ReDoS vulnerabilities at the early development stage and facilitate remediation. Existing techniques broadly fall into two classes: static analyses that search for pathological regex structures, and dynamic exploration methods that synthesize candidate attack strings. However, the generated attack strings are often impractical for real-world exploitation because they usually assume unrealistic input-length budgets and do not validate the effectiveness and efficiency of the attack at the program level. Therefore, many generated strings fail to trigger vulnerable regexes when applied to real-world programs, further limiting the practical utility. To address these shortcomings, we introduce an effective and efficient attack string generator, PUFFERDOS, designed to synthesize attack inputs that are both feasible within realistic length budgets and validated at the program level, enabling effective exploitation of ReDoS vulnerabilities in real-world programs. Specifically, we first define three vulnerable patterns based on our observation and formal verification. According to the patterns, PUFFERDOS conducts a synthesis technique to generate attack strings, and then refines and validates the strings with ReDoS-specific compositional concolic execution to guarantee real-world exploitability.

URL PDF HTML ☆

赞 0 踩 0

2606.19644 2026-06-19 cs.SE 新提交

Prompt Quality and Pull Request Outcomes: A Stage-Based Empirical Study of LLM-Assisted Development

提示质量与拉取请求结果：基于阶段的LLM辅助开发实证研究

Richard Sserunjogi, Daniel Ogenrwot, John Businge

AI总结通过分析265个开发者与ChatGPT的交互，研究提示结构（上下文、具体性、验证）对LLM辅助开发中代码生成、采纳和集成深度的影响，发现不同维度在不同阶段有不同作用。

Comments 48 pages, 2 figures

详情

AI中文摘要

大型语言模型（LLM）驱动的工具（如ChatGPT）越来越多地用于协作软件工程工作流，但提示结构如何影响下游拉取请求（PR）结果尚不清楚。先前的研究主要考察对话帮助性、生产力或粗粒度的采用指标，对提示结构在协作集成行为中的作用理解不足。我们分析了来自开源拉取请求中自我承认的ChatGPT使用的265个手动验证的开发者-ChatGPT交互。基于先前关于开发者面向工件和提示工程的研究，我们使用三个维度操作化提示结构：上下文、具体性和验证。我们首先评估LLM辅助注释是否能可靠地再现人类对提示结构的判断，发现在不同维度和工作流上下文中存在显著差异。具体性与人类判断的一致性最稳定；上下文被LLM系统性地低估；验证仍然难以一致评估，这促使采用人类-LLM混合注释策略。使用这个经过验证的框架，我们然后检查提示结构如何影响AI辅助PR工作流中的可操作代码生成、代码采纳和集成深度。具体性和上下文与可操作代码生成关联最强；验证成为代码采纳的主要预测因子；集成深度与上下文关联最强。总体而言，我们的发现表明，提示特征在AI辅助软件工程工作流中表现出不同的、阶段依赖的影响，通过上下文基础、任务具体性和可评估性线索影响下游采纳和集成。

英文摘要

Large language model (LLM)-powered tools such as ChatGPT are increasingly used in collaborative software engineering workflows, yet little is known about how prompt structure influences downstream pull request (PR) outcomes. Prior studies primarily examine conversational helpfulness, productivity, or coarse-grained adoption metrics, leaving the role of prompt structure in collaborative integration behavior insufficiently understood. We analyze 265 manually validated developer-ChatGPT interactions derived from self-admitted ChatGPT usage in open-source pull requests. Building on prior research on developer-facing artifacts and prompt engineering, we operationalize prompt structure using three dimensions: Context, Specificity, and Verification. We first evaluate whether LLM-assisted annotation can reliably reproduce human judgments of prompt structure, finding substantial variation across dimensions and workflow contexts. Specificity shows the most stable agreement with human judgments; Context is systematically under-scored by the LLM; and Verification remains difficult to assess consistently, motivating a hybrid human-LLM annotation strategy. Using this validated framework, we then examine how prompt structure influences actionable code generation, code adoption, and integration depth across AI-assisted PR workflows. Specificity and Context are most strongly associated with actionable code generation; Verification emerges as the primary predictor of code adoption; and integration depth is most strongly associated with Context. Overall, our findings show that prompt characteristics exert distinct, stage-dependent effects across AI-assisted software engineering workflows, influencing downstream adoption and integration through contextual grounding, task specificity, and evaluability cues.

URL PDF HTML ☆

赞 0 踩 0

2606.19620 2026-06-19 cs.CR 新提交

G-Lox: Group-Adaptive, Privacy-Preserving Bridge Distribution with Two-Party Computation

G-Lox: 基于两方计算的组自适应、隐私保护桥分发

Baigang Chen, Nicholas Hopper

AI总结提出G-Lox桥分发系统，通过两方安全计算实现隐藏的组级自适应分配，保护分发者盲性，支持阻塞报告、传输感知重分配和隐私保护组分裂。

详情

AI中文摘要

我们提出G-Lox（组自适应Lox），一种桥分发系统，在保持Lox风格分发者盲性的同时，实现隐藏的、有状态的组级自适应。G-Lox将自适应分配逻辑置于双服务器隐私墙之后，因此没有单个服务器能学习组标识符或组到桥的分配。私有状态访问和状态相关更新使用双服务器DPF/FSS协议和安全两方计算，支持阻塞报告、传输感知重分配和隐私保护组分裂。我们通过系统测量和策略模拟评估G-Lox。在我们的C++/EMP实现中，基于真实TCP套接字，私有状态访问的客户端可见开销较低：在状态大小高达2^16时，每次迭代的通信量保持在低KiB范围。在M=1024时，客户端发送1,968字节，接收1,280字节，每次迭代完成约0.25秒。针对特定组阻塞和女巫枚举的模拟表明，在保持广泛发行的系统中，G-Lox相比类似Lox和rBridge的基线提高了鲁棒性。

英文摘要

We present G-Lox (group-adaptive Lox), a bridge-distribution system that preserves Lox-style distributor blindness while enabling hidden, stateful group-level adaptation. G-Lox places adaptive assignment logic behind a two-server privacy wall, so no single server learns group identifiers or group-to-bridge assignments. Private state access and state-dependent updates use two-server DPF/FSS protocols and secure two-party computation, supporting blockage reporting, transport-aware reassignment, and privacy-preserving group splitting. We evaluate G-Lox through system measurements and policy simulation. In our C++/EMP implementation over real TCP sockets, private state access has low client-visible overhead: across state sizes up to 2^16, communication remains in the low-KiB range per iteration. At M=1024, the client sends 1,968 bytes, receives 1,280 bytes, and completes an iteration in about 0.25 s. Simulations with group-specific blocking and Sybil enumeration show that G-Lox improves robustness over Lox- and rBridge-like baselines among systems that maintain broad issuance.

URL PDF HTML ☆

赞 0 踩 0

2606.19618 2026-06-19 cs.GT 新提交

Joint-task truthfulness of the DMI mechanism

DMI机制的联合任务真实性

Rafael Frongillo

AI总结研究DMI机制在联合任务策略下的真实性，证明当其他代理使用一致策略时，真实报告仍是贝叶斯-纳什均衡，但无限制时主导真实性和知情真实性均失效。

2606.19609 2026-06-19 cs.HC cs.GR 新提交

Building Drift: Documenting On-Site Construction Adaptations Across Material Lifecycles

建筑漂移：记录跨材料生命周期的现场施工适应

Ritik Batra, Martin Tamke, Tom Svilans, Jan Hüls, Amritansh Kwatra, Steven J. Jackson, Thijs Roumen, Mette Ramsgaard Thomsen

AI总结提出“建筑漂移”概念，通过案例研究建立分类法，并开发Pentimento工具，利用视频和3D高斯泼溅记录现场适应，促进再生材料循环利用。

Comments In submission

详情

AI中文摘要

在建筑循环经济中，再生材料承载着先前使用生命，并将在未来建筑中拥有后生命。然而，使用此类材料会引入不可预测性，需要现场即兴发挥，这使得其再利用难以记录和跨建筑生命周期规模化。没有记录，使用再生材料进行施工所需的现场适应使得合作者、评估者和继承者缺乏继续、评估和再利用材料所需的信息。我们将通过这些适应导致物理状态与数字模型的集体偏差称为“建筑漂移”。通过一个案例研究——在森林中建造的再生木材亭子ReShelter，我们开发了一个建筑漂移分类法，以表征跨建筑生命周期的集体偏差：照料场地、寻找契合、解读材料、标记测量和跨社区协调。为了将我们的建筑漂移分类法付诸实践，我们提出了Pentimento，一个利用视频记录和3D高斯泼溅在空间、时间和语义上表示与设计模型相关的现场适应的文档工具。Pentimento使每个利益相关者能够以降低材料再利用障碍的方式浏览材料历史。这些贡献共同为支持再生材料施工所必需的现场即兴发挥的计算工具开辟了路径，从而实现更可持续的回收、修复和再利用循环。

英文摘要

In a circular economy for construction, reclaimed materials carry prior lives of use and go on to have post-lives in future buildings. Yet working with such materials introduces unpredictability that requires on-site improvisation, making their reuse challenging to document and scale across building lifetimes. Without documentation, the on-site adaptations that make construction with reclaimed materials possible leave collaborators, evaluators, and inheritors without the information they need to continue, assess, and reuse materials. We call the collective deviation of the physical state from the digital model through these adaptations "building drift." Through a case study, ReShelter, a reclaimed timber pavilion constructed in the forest, we develop a taxonomy for building drift that characterizes the collective deviation across building lifetimes: Tending the Site, Foraging for Fit, Interpreting the Material, Marking Measurements, and Coordinating Across Communities. To put our taxonomy for building drift into practice, we present Pentimento, a documentation tool that leverages video documentation and 3D Gaussian Splatting to spatially, temporally, and semantically represent on-site adaptations in relation to the designed model. Pentimento enables each stakeholder to navigate material histories in ways that reduce barriers to material reuse. Together, these contributions open pathways towards computational tools that support the on-site improvisation essential to construction with reclaimed materials, enabling more sustainable cycles of recovery, repair, and reuse.

URL PDF HTML ☆

赞 0 踩 0

2606.19599 2026-06-19 eess.SY cs.SY econ.EM 新提交

Ramping Procurement and Bid-Cost Recovery in Real-Time Market

实时市场中的爬坡采购与投标成本回收

Cong Chen, Valentina Norambuena, Lang Tong

AI总结研究净需求不确定下与经济调度协同优化的爬坡采购，分析单间隔与多间隔协同优化设计，提出评估发电机利润、消费者支付、投标成本回收和运营效率的分析框架，并比较三种定价机制。

Comments 4 figures

详情

AI中文摘要

我们研究了净需求不确定下与经济调度协同优化的爬坡采购。我们考察了电网运营商实施的两种灵活爬坡产品设计：单间隔和多间隔协同优化。两者都依赖于滚动窗口随机优化，包含绑定和咨询间隔决策。我们开发了分析框架来评估发电机利润、消费者支付、投标成本回收（BCR）和运营效率。特别是，净需求不确定性可能导致发电机补偿不足，需要歧视性BCR。虽然运营效率对能量和爬坡价格不变，但生产者利润和消费者支付关键取决于定价。我们研究了节点边际定价（LMP）和两种统一定价：最大调度成本定价（MDCP）和最大时间节点边际定价（MTLMP）。在市场外BCR下，LMP产生歧视性能量价格，而MDCP消除BCR，MTLMP在大多数情况下也是如此。这一性质使我们能够在MDCP下为价格接受型发电机建立真实投标激励。我们的分析突出了单间隔和多间隔协同优化与定价设计之间的权衡：在高预测不确定性和中等爬坡需求下，单间隔能量-爬坡协同优化具有优势，而当净需求预测相对准确且爬坡需求具有挑战性时，多间隔协同优化更优。基于CAISO和ERCOT数据的实证结果表明，与LMP相比，MDCP和MTLMP增加了生产者利润且BCR可忽略，但以消费者支付增加为代价。

英文摘要

We study ramping procurement co-optimized with economic dispatch under net-demand uncertainty. We examine two flexible ramp product designs implemented by grid operators: single-interval and multi-interval co-optimization. Both rely on rolling-window stochastic optimization with binding and advisory interval decisions. We develop analytical frameworks to evaluate generator profits, consumer payments, bid cost recovery (BCR), and operational efficiency. In particular, net-demand uncertainty may lead to generator under-compensation, requiring discriminatory BCR. While operational efficiency is invariant to energy and ramp prices, producer profits and consumer payments depend critically on pricing. We examine locational marginal pricing (LMP) and two uniform pricing: maximum dispatch cost pricing (MDCP) and maximum temporal locational marginal pricing (MTLMP). With out-of-market BCR, LMP yields discriminatory energy prices, whereas MDCP eliminates BCR and MTLMP does so in most cases. This property enables us to establish truthful bidding incentives for price-taking generators under MDCP. Our analysis highlights trade-offs between single- and multi-interval co-optimization and pricing designs: single-interval energy-ramp co-optimization is advantageous under high forecast uncertainty and moderate ramping requirements, whereas multi-interval co-optimization is superior when net-demand forecasts are relatively accurate and ramp needs are challenging. Empirical results on CAISO and ERCOT data show that MDCP and MTLMP increase producer profits with negligible BCR, albeit at the expense of higher consumer payments relative to LMP.

URL PDF HTML ☆

赞 0 踩 0

2606.19576 2026-06-19 cs.DB cs.DC 新提交

REMOP: REmote-Memory-aware OPerator Optimization

REMOP: 远程内存感知的算子优化

Shiquan Zhang, Yunhao Mao, Yuqiu Zhang, Gengrui Zhang, Jeyhun Karimov, Hans-Arno Jacobsen

AI总结针对远程内存环境下查询处理中数据传输轮次过多的问题，提出REMOP框架，通过轮次感知的算子内内存策略优化内存溢出执行，在DuckDB中实现三种算子，减少高达97%的传输轮次和48%的算子运行时间。

Comments 14 pages, 13 figures, 9 tables. Preprint, under review

详情

AI中文摘要

远程和分离内存层扩展了分析数据库引擎的有效内存容量，但也重塑了内存溢出查询处理的成本结构。当算子溢出到本地DRAM之外时，将页面移动到远程内存既会产生数据传输时间，也会产生每次传输的固定往返延迟。经典的算子分析和缓冲区分配启发式方法主要通过最小化总I/O量来针对磁盘溢出。在远程内存下，这些策略可能不是最优的，因为它们可能触发过多的传输轮次。我们提出了REMOP，一个远程内存感知的算子优化框架，它使用传输轮次感知的算子内内存策略来改善内存预算紧张下的内存溢出执行。REMOP将传输轮次数引入延迟成本模型，并推导出算子特定的缓冲区划分策略，在DuckDB中为阻塞嵌套循环连接、外部归并排序和外部哈希连接实例化了该方法。我们在双节点计算-内存测试平台上的评估表明，在溢出密集的微基准测试中，REMOP减少了高达97%的传输轮次和高达48%的算子运行时间，并将溢出TPC-H和TPC-DS查询的平均运行时间分别降低了22.7%和26.4%。

英文摘要

Remote and disaggregated memory tiers expand the effective memory capacity of analytical database engines, but they also reshape the cost structure of out-of-memory query processing. When an operator spills beyond local DRAM, moving pages to remote memory incurs both data-transfer time and a fixed round-trip latency per transfer. Classical operator analyses and buffer-allocation heuristics primarily target disk spilling by minimizing total I/O volume. Under remote memory, these strategies can be suboptimal because they may trigger excessive transfer rounds. We present REMOP, a remote-memory-aware operator optimization framework that uses transfer-round-aware intra-operator memory policies to improve out-of-memory execution under tight memory budgets. REMOP introduces the number of transfer rounds into the latency cost model and derives operator-specific buffer-partitioning strategies, instantiating the approach for blocked nested-loop join, external merge sort, and external hash join in DuckDB. Our evaluation on a two-node compute-memory testbed shows that REMOP reduces transfer rounds by up to 97% and operator runtime by up to 48% on spill-heavy microbenchmarks, and lowers the average runtime of spilling TPC-H and TPC-DS queries by 22.7% and 26.4% end-to-end.

URL PDF HTML ☆

赞 0 踩 0

2606.19570 2026-06-19 cs.HC 新提交

Code as Anchor, Memory and Metaphor as Support: Learner Experiences with Multi-View Visualizations

代码作为锚点，记忆与隐喻作为支持：学习者对多视图可视化的体验

Naaz Sibia, Jessica Wen, Amber Richardson, Yashika Jain, Khushi Malik, Bogdan Simion, Carolina Nobre, Angela Zavaleta Bernuy, Andrew Petersen, Michael Liut

AI总结通过眼动追踪和访谈，研究19名CS1/CS2学生在多视图可视化工具中的行为，发现学生主要关注代码，忽视隐喻视图，受能动性、表征适配和合法性因素影响。

Comments Pre-Print of a paper to be published at the International Computing Education Research (ICER) conference 2026

详情

DOI: 10.1145/3765964.3811662

AI中文摘要

程序可视化被广泛用于支持新手程序员，但学生经常忽视或抵制精心设计的视觉支架。关于多重外部表征（MERs）的研究提供了协调视图的认知设计原则，但对于什么因素影响学习者对可用表征的参与度知之甚少。我们对19名已完成CS1和CS2的本科生进行了一项被试内研究。学生使用一个多表征探针（包含同步的代码、记忆和隐喻视图）和Python Tutor，在作用域、while循环和链表任务中完成出声思考任务、反思性访谈和基于摄像头的视线追踪。视线分析显示，尽管有可用的视觉支架，学生将近一半的时间专注于代码。没有先前经验的学生更强烈地以代码为锚点，并且很少参与隐喻视图。访谈确定了影响选择性参与的三个因素：能动性（学生寻求控制认知努力而非简单减少）、表征适配（相同设计在不同情境下感觉有帮助或令人不知所措）以及合法性（一些学生避免他们认为幼稚或不够严谨的隐喻支架）。这些发现表明，计算教育中的多表征工具需要关注情感和社会因素以及认知设计。实际考虑包括将可视化定位为验证工具、提供可切换的抽象级别以及通过框架设计传达学科合法性。更广泛地说，这些主题有助于解释为什么认知上合理的可视化工具可能无法吸引它们旨在帮助的学生。

英文摘要

Program visualizations are widely used to support novice programmers, yet students often ignore or resist well-designed visual scaffolds. Research on multiple external representations (MERs) offers cognitive design principles for coordinating views, but less is known about what shapes learners' engagement with available representations. We conducted a within-subjects study with 19 undergraduates who had completed CS1 and CS2. Students completed think-aloud tasks, reflective interviews, and webcam-based gaze tracking while using a multi-representational probe with synchronized code, memory, and metaphor views, and Python Tutor, across scope, while loops, and linked lists. Gaze analysis showed that students spent nearly half their time focused on code despite available visual scaffolds. Students without prior experience anchored even more heavily in code and engaged minimally with metaphor views. Interviews identified three factors shaping selective engagement: agency, as students sought control over cognitive effort rather than simply having it reduced; representational fit, as identical designs differed in whether they felt helpful or overwhelming; and legitimacy, as some students avoided metaphorical scaffolds they perceived as childish or insufficiently rigorous for university-level work. These findings suggest that multi-representational tools in computing education require attention to affective and social factors alongside cognitive design. Practical considerations include positioning visualizations as verification instruments, offering toggleable abstraction levels, and framing tools to signal disciplinary legitimacy. More broadly, the themes help explain why cognitively sound visualization tools may fail to engage the students they are designed to help.

URL PDF HTML ☆

赞 0 踩 0

2606.19556 2026-06-19 cs.CE 新提交

A hybrid sharp-diffuse interface approach to accurately model melt pool dynamics with rapid evaporation in laser-based processing of metals

一种混合锐利-扩散界面方法，用于精确模拟激光加工金属中伴随快速蒸发的熔池动力学

Nils Much, Andreas Koch, Christoph Meier, Magdalena Schreter-Fleischhacker

AI总结提出混合锐利-扩散界面方法，结合锐利界面传热模型和扩散界面多相流模型，精确模拟激光加工中蒸发驱动的熔池热流体动力学，精度比纯扩散模型高一个数量级。

Journal ref Computer Methods in Applied Mechanics and Engineering 457, 119023, 2026

详情

DOI: 10.1016/j.cma.2026.119023

AI中文摘要

在激光加工金属（如激光束焊接或激光粉末床熔融增材制造）中，熔池动力学的预测模拟需要精确解析熔-气界面的热流体动力学相互作用。这里，蒸发诱导的反冲压力和温度相关的表面张力控制着流动。由于这些机制对界面温度敏感（通常呈指数关系），可靠的预测需要高精度的传热模型。流行的扩散界面公式模糊了激光-金属相互作用中典型的极端热梯度，导致界面温度误差，从而严重降低界面力预测和熔池动力学的精度。我们提出了一种混合锐利-扩散界面方法，用于高保真模拟伴随快速蒸发的熔池热流体动力学。传热问题采用锐利界面非拟合有限元（CutFEM）公式表示，能够精确预测温度场。多相流问题具有大密度比和复杂界面动力学特征，通过稳健的基于水平集的一流体扩散界面有限元公式精确捕捉。通过将锐利界面温度扩展到窄界面区域，在扩散界面流动框架内评估温度相关的界面力，实现了一致耦合。在实际相关基准测试中，锐利界面热模型表现出二阶空间收敛性，使得有限元尺寸比扩散界面方法大两个数量级，同时保持1%精度。在一个代表激光-金属相互作用的耦合热流体动力学新基准测试中，混合方法在同一网格上比纯扩散界面模型精确一个数量级。

英文摘要

Predictive simulation of melt pool dynamics in laser-based processing of metals, e.g., laser beam welding or laser powder bed fusion additive manufacturing, requires accurate resolution of thermo-hydrodynamic interactions at the melt-gas interface. Here, evaporation-induced recoil pressure and temperature-dependent surface tension govern the flow. Because these mechanisms depend sensitively, often exponentially, on the interface temperature, reliable predictions demand highly accurate heat transfer models. Popular diffuse-interface formulations smear the extreme thermal gradients as typical for laser-metal interactions, leading to interface temperature errors that critically degrade the accuracy of interface force predictions and melt pool dynamics. We present a hybrid sharp-diffuse interface approach for high-fidelity modelling of melt pool thermo-hydrodynamics with rapid evaporation. The heat transfer problem is represented using a sharp-interface unfitted finite element (CutFEM) formulation, enabling accurate prediction of the temperature field. The multi-phase flow problem, characterized by large density ratios and complex interface dynamics, is accurately captured using a robust level-set-based one-fluid diffuse-interface finite element formulation. Consistent coupling is achieved by extending the sharp-interface temperature into a narrow interface region to evaluate temperature-dependent interface forces within the diffuse-interface flow framework. In practically relevant benchmarks, the sharp-interface thermal model exhibits second-order spatial convergence, enabling finite element sizes two orders of magnitude larger than the diffuse-interface approach for 1 accuracy. In a novel coupled thermo-hydrodynamic benchmark representative of laser-metal interactions, the hybrid approach is one order of magnitude more accurate than a purely diffuse-interface model on the same mesh. Robu

URL PDF HTML ☆

赞 0 踩 0

2606.19537 2026-06-19 cs.MA cs.DC 新提交

Mesh Inference: A Formal Model of Collective Intelligence Without a Center

网格推理：无中心集体智能的形式模型

Hongwei Xu

AI总结提出网格推理形式模型，通过耦合自由能实现无中心多智能体协作推理，证明收敛唯一性、识别完备性和观测唯一性，并分析线性高斯情况下的延迟代价。

Comments 21 pages, 2 figures

详情

AI中文摘要

我们提出了网格推理的形式模型：一群独立智能体，每个持有私有状态，仅交换被接纳的、类型化的观测，在没有中央协调者且无智能体暴露的情况下，推导出任何一个智能体单独无法得出的结论。没有智能体共享权重、梯度或隐藏状态，且智能体可能跨越不同的团队、网络和组织。受“询问模型是能量最小化推理”这一观察的启发，我们将网格建模为每个智能体局部松弛的耦合自由能。我们证明，单一的接纳/发射策略控制三个性质。首先，对于任何对称或非对称的接纳，网格推理收敛到唯一答案，因为耦合总是M-矩阵。其次，它是识别完备的：当贡献视图是载波连通时，它精确推导出集中式最优解。第三，它是仅观测的：没有节点传输其内部状态，且机密性是识别的对偶。内容寻址谱系是唯一的全局侧信道。在线性高斯情况下，每个推导出的答案都是确定的，因此等于集中式最优解，延迟为O(diam^2)，这是移除中心所付出的代价。这样的推导是无中心学习循环的一个环节，我们将其形式化为架构而非证明。我们提出的开放问题是，询问何时能改善集体而非破坏它：非线性闭包是推导出升级的答案还是自信的错误。据我们所知，这是网格推理的第一个形式模型。

英文摘要

We present a formal model of mesh inference: how a population of independent agents, each holding private state and exchanging only admitted, typed observations, derives a conclusion none of them holds alone, with no central coordinator and no agent exposed. No agent shares weights, gradients, or hidden state, and the agents may span different teams, networks, and organizations. Motivated by the observation that asking a model is energy-minimizing inference, we model the mesh as a coupled free energy that each agent relaxes locally. We show that a single admission/emission policy governs three properties. First, mesh inference converges to a unique answer for any admission, symmetric or not, because the coupling is always an M-matrix. Second, it is identification-complete: it derives the centralized optimum exactly when the contributing views are carrier-connected. Third, it is observation-only: no node transmits its internals, and confidentiality is the dual of identification. Content-addressed lineage is the only global side-channel. In the linear-Gaussian regime every derived answer is determined, hence equal to the centralized optimum, at O(diam^2) latency, the measured price of removing the center. One such derivation is one turn of a center-free learning loop, which we formalize as architecture rather than prove. The open problem we state is when asking improves the collective rather than corrupting it: whether the non-linear closure derives an upgraded answer or a confident error. To our knowledge, this is the first formal model of mesh inference.

URL PDF HTML ☆

赞 0 踩 0

2606.19532 2026-06-19 cs.LO 新提交

Vancomycert: A Certified Neuro-Symbolic Drug Delivery System (Case Study)

Vancomycert: 一种经过认证的神经符号药物递送系统（案例研究）

Alistair Sirman, Fleur Conway, Jessica Ciupa, Gusts Gustavs Grīnbergs, Ekaterina Komendantskaya, Thai Son Hoang, Michael Rawson, Alessandro Bruni, Vaishak Belle, Michael John Williams

AI总结针对抗生素给药神经网络控制器的形式化验证问题，提出一种结合监督学习和定理证明的方法，确保无限时域内自动给药不超过治疗上限。

详情

AI中文摘要

自主决策的神经网络控制器在网络物理系统中已得到广泛应用，但在安全关键的医疗环境中，其部署仍未得到充分验证。本文提出了一种用于抗生素给药神经网络控制器形式化验证的方法和案例研究，其动机源于系统必须在无限时间范围内同时具备适应性和可证明安全性的挑战。我们构建了一个简化但临床可解释的模型，用于跟踪药物浓度、体温和白细胞计数。万古霉素被选为代表性抗生素，广泛用于严重感染，但治疗窗口狭窄，超治疗浓度有肾毒性风险，而亚治疗剂量可能导致治疗失败。我们使用合成的临床医生式给药数据训练了一个监督式神经网络控制器。我们建立了输入-输出安全属性的形式化验证，特别验证了神经网络的一个属性，该属性意味着无限时域证明自动给药从未超过超治疗边界。该系统的属性在Rocq中使用Vehicle交互式定理证明器后端进行证明，以集成不同的证明系统。最终结果是一个验证流水线，允许各种治疗方法，同时为每个特定患者保持安全性。

英文摘要

Neural network controllers for autonomous decision-making are well-established in cyber-physical systems, yet their deployment in safety-critical healthcare settings remains largely unverified. This paper presents a methodology and case study for the formal verification of a neural network controller for antibiotic dosing, motivated by the challenge of systems that must be simultaneously adaptive and provably safe across unbounded time horizons. We construct a simplified yet clinically-interpretable model that tracks drug concentration, body temperature, and white blood cell count. Vancomycin is selected as a representative antibiotic, widely prescribed for severe infections yet carrying a narrow therapeutic window, where supratherapeutic concentrations risk nephrotoxicity and subtherapeutic dosing risks treatment failure. A supervised neural network controller is trained on synthetic clinician-style dosing data. We establish formal verification of input-output safety properties, specifically verifying a property of a neural network that implies an infinite-horizon proof that automated dosing never exceeds the supratherapeutic boundary. This system property is proven in Rocq using the Vehicle interactive theorem prover back-end to integrate the different proof systems. The end result is a verification pipeline that allows for a wide variety of treatment approaches whilst maintaining safety for each specific patient.

URL PDF HTML ☆

赞 0 踩 0

2606.19529 2026-06-19 cs.DC 新提交

The Sheaf Laplacian: A Topological Framework for Data Fusion and Consensus in Distributed Sensing Networks

层拉普拉斯算子：分布式传感网络中数据融合与共识的拓扑框架

Manuel Hernández, Eduardo Sánchez-Soto

AI总结提出层理论作为传统图模型的替代，利用层拉普拉斯算子实现异构分布式传感网络中的数据融合与共识。

2606.19526 2026-06-19 cs.AR 新提交

SPINE: A Fault Injection Profiler for Quantized Neural Networks under Accumulated Faults

SPINE: 面向累积故障下量化神经网络的故障注入分析器

Nathan Guimarães, Ian Kersz, Leonardo R. Gobatto, Fabio Benevenuti, Michael G. Jordan, Antonio Carlos S. Beck, Fernanda L. Kastensmidt, Jose Rodrigo Azambuja

AI总结提出GDB驱动的分析框架SPINE，通过向边缘CPU目标二进制注入累积权重位翻转，生成逐层故障特征，无需重训练或修改代码，指导选择性加固策略。

Comments ACM/IEEE/SBC/SBMICRO Symposium on Integrated Circuits and Systems Design 2026

2606.19520 2026-06-19 eess.SY cs.SY 新提交

ev-flow: A Reproducible, NHTS-Grounded Generator of Synthetic Plug-in Electric Vehicle Charging Behavior for Eight U.S. Regions

ev-flow: 一个可复现的、基于NHTS的合成插电式电动汽车充电行为生成器，适用于美国八个地区

Bertrand Travacca

AI总结提出ev-flow开源Python包，基于2017年全国家庭旅行调查数据，通过九阶段流水线生成美国八个地区的合成插电式电动汽车充电行为，填补了美国本土化、NHTS驱动的充电行为生成工具空白。

Comments 20 pages

详情

AI中文摘要

电动汽车并网研究需要大量具有行为真实性的个体充电档案，但实际充电遥测数据稀缺且受隐私限制，现有的开源生成器要么基于非美国出行调查校准，要么忽略了驱动总需求的区域、季节和设备异质性。我们提出\texttt{ev-flow}（导入名\texttt{pev\_synth}），一个MIT许可的开源Python包，基于2017年全国家庭旅行调查（NHTS）微观数据和区域销售组合模型，为美国八个区域生成合成插电式电动汽车充电行为。一个确定性的九阶段流水线（M1-M9）将每辆车从调查记录转换为带时间戳的充电档案：它将调查的人日拼接成捐赠者匹配的365天出行日历，并带有温度依赖的冬季能量提升；从已发表的SPEECh K=16高斯混合参数化中采样行为插电开始时间；评估三层伯努利插电模型；传播连续时间荷电状态账本，并带有明确的PHEV汽油续航扩展项；将插电状态栅格化为15分钟和小时网格。该包生成住宅和工作场所档案类型，并附有描述性EVSE品牌和连接器丰富信息；每个输出均以UTC存储、时区感知，并可从单个主种子实现比特可复现。验证运行器将生成的分布与已发表的边界进行比较，并根据文献出处对每个偏差进行分类：参考的\texttt{bay\_area}住宅档案在21项适用检查中汇总为11项通过、0项未解释失败、6项已解释失败和4项已解释跳过。\texttt{ev-flow}填补了美国本土、基于NHTS的空白，与欧洲生成器（如emobpy和VencoPy）以及充电模拟器（如datafev和ACN-Sim）互补。

英文摘要

Electric-vehicle grid-integration studies need large, behaviorally realistic populations of individual charging profiles, but real charging telemetry is scarce and privacy-restricted, and the existing open generators are calibrated to non-U.S. mobility surveys or flatten the regional, seasonal, and equipment heterogeneity that drives aggregate demand. We present \texttt{ev-flow} (import name \texttt{pev\_synth}), an open-source, MIT-licensed Python package that generates synthetic plug-in electric vehicle charging behavior for eight U.S. regions, grounded in 2017 National Household Travel Survey (NHTS) microdata and regional sales-mix models. A deterministic nine-stage pipeline (M1--M9) carries each vehicle from survey records to a time-stamped charging profile: it stitches survey person-days into donor-matched 365-day travel calendars with a temperature-dependent winter energy uplift, samples behavioral plug-in start times from the published SPEECh K=16 Gaussian-mixture parameterization, evaluates a three-layer Bernoulli plug-in model, propagates a continuous-time state-of-charge ledger with an explicit PHEV gasoline range-extension term, and rasterizes plug status to 15-minute and hourly grids. The package generates residential and workplace profile types with descriptive EVSE brand and connector enrichment; every output is UTC-stored, timezone-aware, and bit-reproducible from a single master seed. A validation runner compares the generated distributions against published bounds and classifies every divergence with literature provenance: the reference \texttt{bay\_area} residential profile rolls up to 11 PASS, 0 unexplained FAIL, 6 explained failures, and 4 explained skips across 21 applicable checks. \texttt{ev-flow} fills a U.S.-focused, NHTS-grounded niche complementary to European generators such as emobpy and VencoPy and to charging simulators such as datafev and ACN-Sim.

URL PDF HTML ☆

赞 0 踩 0

2606.19519 2026-06-19 cs.DC 新提交

A Topos-Theoretic Interpretation of Blockchain Systems: Sheaves of Consensus and the Logic of Decentralized Truth

区块链系统的拓扑学解释：共识层与去中心化真理的逻辑

Manuel Hernández, Eduardo Sánchez-Soto

AI总结本文提出用拓扑论（层范畴理论）作为区块链系统的数学语言，将共识过程建模为局部一致性到全局真理的构造，超越传统有限状态机模型。

2606.19514 2026-06-19 cs.HC 新提交

LLM-Mediated Human-AI Interaction in Search and Rescue: Impact of Expertise on Attentional Allocation

LLM介导的人机交互在搜索与救援中的应用：专业知识对注意力分配的影响

Elahe Oveisi, Hemanth Manjunatha

AI总结本研究通过模拟搜索救援任务，比较有无大语言模型（LLM）指导的条件，结合眼动追踪和行为分析，发现LLM提升任务效率但未增加总救援人数，并揭示了注意力-指导权衡，其中专业知识调节了用户对AI的依赖模式。

详情

AI中文摘要

人机团队（HAT）越来越多地涉及在复杂任务中提供实时、上下文感知指导的AI系统。虽然此类系统可以提高性能，但其有效性取决于它们如何塑造人类认知和行为。特别是，AI辅助可能引入认知需求，并影响注意力、规划以及与任务环境的交互，其效果可能因专业知识水平而异。本研究在模拟搜索救援（SAR）环境中调查这些机制。我们比较了两种LLM（大语言模型）指导条件和无LLM基线条件下的人类表现，并在多个层面分析交互，包括任务绩效、眼动测量和规划行为。眼动追踪提供了对注意力分配和与AI指导交互的细粒度洞察，而行为测量则捕捉用户如何随时间构建和调整其决策。结果表明，LLM指导提高了任务效率（更高的奖励和每步受害者数），但并未增加总救援人数。眼动数据揭示了注意力-指导权衡，视觉资源转移到聊天界面，同时瞳孔大小变异性增加。专业知识调节了这种效应：新手表现出被动AI依赖，而专家通过持续的环境扫描维持“验证循环”。这些发现表明，LLM介导的团队效能取决于操作员将AI指导与地面实况交叉引用以保持态势感知的能力。

英文摘要

Human-AI teaming (HAT) increasingly involves AI systems that provide real-time, context-aware guidance in complex tasks. While such systems can improve performance, their effectiveness depends on how they shape human cognition and behavior. In particular, AI assistance can introduce cognitive demands and influence attention, planning, and interaction with the task environment, with effects that can vary across levels of expertise. This work investigates these mechanisms in a simulated search and rescue (SAR) environment. We compare human performance under two LLM (Large Language Model)-guided conditions and a no-LLM baseline, and analyze interaction at multiple levels, including task performance, eye-tracking measures, and planning behavior. Eye tracking provides fine-grained insight into attention allocation and interaction with AI guidance, while behavioral measures capture how users structure and adapt their decisions over time. Results indicate that LLM guidance enhanced task efficiency (higher rewards and victims-per-step) but did not increase total victims saved. Eye-tracking data revealed an attention-guidance trade-off, with visual resources shifting to the chat interface alongside increased pupil size variability. Expertise moderated this effect: novices exhibited passive AI reliance, whereas experts maintained a "verification loop" through persistent environmental scanning. These findings suggest that LLM-mediated teaming efficacy depends on the operator's ability to cross-reference AI guidance with ground truth to maintain situational awareness.

URL PDF HTML ☆

赞 0 踩 0

2606.19458 2026-06-19 cs.IR 新提交

MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

MonaVec: 一种面向边缘和离线AI系统的免训练嵌入式向量搜索内核

Oğuzhan Yenen

AI总结提出MonaVec，一种无需训练、数据无关的嵌入式向量搜索内核，通过随机哈达玛变换和预计算Lloyd-Max量化实现4位压缩，在边缘和离线场景下提供确定性结果，支持单文件部署。

Comments 27 pages, 11 figures. Code and artifacts: https://github.com/mona-hq/monavec (PyPI: monavec; crates.io: monavec-core). Zenodo: doi:10.5281/zenodo.20559587

详情

AI中文摘要

我们提出MonaVec，一种确定性的嵌入式向量搜索内核，适用于边缘和离线AI场景——即服务器基础设施、网络连接和训练数据均不可用的环境。现有的向量搜索系统假设存在持久化服务器、千兆字节RAM或对语料库进行训练；而MonaVec则针对SQLite的部署模式：一个文件、一次函数调用、随处运行。其量化核心默认免训练且数据无关：随机哈达玛变换（RHDH）将任意输入分布调整至N(0,1)，因此预计算的Lloyd-Max表可将数据量化至4位（缩小8倍），无需学习码本或数据遍历。索引持久化为单个.mvec文件，其中嵌入的ChaCha20旋转种子使得结果在不同架构间可重现，并在同一构建内字节一致——这是并行构建图库无法提供的确定性保证。在语义嵌入（AG News，45K x 1024维BGE-M3，余弦相似度）上，MonaVec 4位BruteForce在27 MB内达到0.960 Recall@10，在召回率上领先float32 FAISS-IVF和8位usearch，同时以峰值吞吐量换取字节一致的确定性。单次全局标准化（fit()）将相同的数据无关流程扩展到对幅度敏感的L2数据，可选的IvfFlat和HNSW后端将其扩展到百万向量语料库。MonaVec使用纯Rust实现，并带有Python绑定和运行时SIMD调度（AVX-512/AVX2/NEON/scalar）。它面向设备端RAG、离线代理和嵌入式检索——即SQLite在关系数据领域占据的细分市场：一个文件、一次调用、随处运行。

英文摘要

We present MonaVec, a deterministic, embedded vector-search kernel for edge and offline AI -- settings where server infrastructure, network connectivity, and training data are all unavailable. Existing vector-search systems assume a persistent server, gigabytes of RAM, or a training pass over the corpus; MonaVec instead targets the deployment profile of SQLite: one file, one function call, runs anywhere. Its quantization core is training-free by default and data-oblivious: a Randomized Hadamard Transform (RHDH) conditions any input distribution toward N(0,1), so precomputed Lloyd-Max tables quantize to 4 bits (8x smaller) with no learned codebook and no data pass. The index persists as a single .mvec file whose embedded ChaCha20 rotation seed makes results reproducible across architectures and byte-identical within a build -- a determinism guarantee that parallel-build graph libraries cannot offer. On semantic embeddings (AG News, 45K x 1024-dim BGE-M3, cosine), MonaVec 4-bit BruteForce reaches 0.960 Recall@10 in 27 MB -- leading float32 FAISS-IVF and 8-bit usearch on recall -- while trading peak throughput for byte-identical determinism. A single-pass global standardization (fit()) extends the same data-oblivious pipeline to magnitude-sensitive L2 data, and optional IvfFlat and HNSW backends carry it to million-vector corpora. MonaVec is implemented in pure Rust with Python bindings and runtime SIMD dispatch (AVX-512/AVX2/NEON/scalar). It targets on-device RAG, offline agents, and embedded retrieval -- the niche SQLite occupies for relational data: one file, one call, runs anywhere.

URL PDF HTML ☆

赞 0 踩 0

2606.19409 2026-06-19 cs.SE cs.PL 新提交

OpenRath: Session-Centered Runtime State for Agent Systems

OpenRath: 面向会话的代理系统运行时状态

Fukang Wen, Zhijie Wang, Ruilin Xu

AI总结针对代理系统运行时状态碎片化问题，提出以Session为核心的一等运行时抽象，支持分支、检查、重放、后端感知和组合，使fork、merge和replay成为显式运行时操作。

详情

AI中文摘要

现代代理系统常常遭受碎片化的运行时状态：对话记录、工具效果、内存事件、工作区放置、分支来源和重放证据被分别记录，难以检查或重现。OpenRath通过一个类似PyTorch的编程模型来解决这个问题，适用于多代理、多会话系统。这里的类比涉及中心一等运行时抽象的角色，而非张量计算。其核心抽象是Session，即在代理和工作流之间传递的运行时值。Session是可分支、可检查、可重放、后端感知且可组合的。它记录对话片段、沙箱放置、谱系元数据、令牌使用、待处理工作和工具证据，同时定义内存交互进入运行时记录的位置。由于此状态由程序执行中使用的同一值携带，fork、merge和replay成为显式的运行时操作，而非从外部痕迹重建的状态。OpenRath进一步定义了Sandbox、Tool、Agent、Memory、Workflow和Selector，其中Selector将控制流转化为运行时路由的决策。本报告介绍了编程模型、架构、审计里程碑和证据协议。其主张仅限于受控的运行时属性，而广泛的定量比较、实时提供者质量、可选后端可用性和内存质量留待后续评估。核心论点是Session为代理系统提供了一个一等运行时值，用于可审计的组合。

英文摘要

Modern agent systems often suffer from fragmented runtime state: transcripts, tool effects, memory events, workspace placement, branch provenance, and replay evidence are recorded separately and become difficult to inspect or reproduce. OpenRath addresses this issue with a PyTorch-like programming model for multi-agent, multi-session systems. The analogy concerns the role of a central first-class runtime abstraction, not tensor computation. Its core abstraction is Session, the runtime value passed between agents and workflows. A Session is branchable, inspectable, replayable, backend-aware, and composable. It records conversation chunks, sandbox placement, lineage metadata, token usage, pending work, and tool evidence, while defining where memory interactions enter the runtime record. Since this state is carried by the same value used in program execution, fork, merge, and replay become explicit runtime operations rather than states reconstructed from external traces. OpenRath further defines Sandbox, Tool, Agent, Memory, Workflow, and Selector, with Selector turning control flow into runtime-routed decisions. This report presents the programming model, architecture, audited milestones, and evidence protocol. Its claims are limited to controlled runtime properties, while broad quantitative comparisons, live-provider quality, optional-backend availability, and memory quality are left for follow-on evaluation. The central thesis is that Session provides agent systems with a first-class runtime value for auditable composition.

URL PDF HTML ☆

赞 0 踩 0

2606.19401 2026-06-19 cs.CC 新提交

The Complexity of Auditing Disclosure-Robust Defeasible Explanations

审计披露鲁棒的可废止解释的复杂性

Haoyang Li

AI总结研究在增量披露下保持鲁棒的最小解释核心的复杂性，发现验证鲁棒核心为coNP完全，寻找大小不超过θ的鲁棒核心为Σ₂ᵖ完全，并给出了精确审计的复杂度景观。

Comments 11 pages, 4 figures; full proofs in appendix

详情

AI中文摘要

一个形式化解释用子集最小充分理由来认证一个预测。然而，在增量披露下，证据逐字段到达，通常充分的理由可能被后续信息推翻。我们研究在所有允许的后续披露下仍然充分的最小理由核心；其大小为鲁棒半径。我们将一个可废止分类器编译成一个显式的边界图谱，包含入口锚点和出口击败者，并描绘了审计它的复杂性（所有陈述均以图谱大小衡量）。预测和常驻锚点通过对图谱的多项式时间扫描读取，无需迭代不动点计算；一个理由的击败者前沿通过扫描并子集最小化其上的击败者获得。但验证一个理由核心是鲁棒的是coNP完全的，而判断是否存在大小不超过θ的鲁棒核心是Σ₂ᵖ完全的——一个四格P/coNP完全/NP完全/Σ₂ᵖ完全的景观，其中接受情况（A(t)=1）达到多项式层次第二层。最小认证披露的判定版本是NP完全的；其优化版本在排除无击败者世界的数量上具有固定参数可解性，而一般击败者情况未解决。在标准表格数据集上的深度受限决策树的精确审计中，采用故意小的布尔抽象，控制参数处于小参数范围（鲁棒核心在低个位数），因此在这些审计立方体中精确鲁棒审计是可处理的；在从我们的归约构建的对抗实例上，困难性显现，鲁棒核心大小为Θ(n)。据我们所知，这是针对披露鲁棒形式化解释的第一个Σ₂ᵖ完全审计查询。

英文摘要

A formal explanation certifies a prediction with a subset-minimal sufficient reason. Under incremental disclosure, however, evidence arrives field by field, and a normally sufficient reason can be overturned by later information. We study the smallest reason core that remains sufficient under all admissible later disclosures; its size is the robustness radius. We compile a defeasible classifier into an explicit boundary atlas of entry anchors and exit defeaters, and chart the complexity of auditing it (all statements are in the atlas size). Prediction and standing anchors are read by polynomial-time scans of the atlas, without iterative fixpoint computation; a reason's defeater frontier is obtained by scanning and subset-minimizing the defeaters above it. But verifying that a reason core is robust is coNP-complete, and deciding whether a robust core of size at most theta exists is $Σ_2^p$-complete -- a four-cell P / coNP-complete / NP-complete / $Σ_2^p$-complete landscape, with the accepted (A(t)=1) case reaching the second level of the polynomial hierarchy. The decision version of minimal certified disclosure is NP-complete; its optimization version is fixed-parameter tractable in the number of excluded worlds without defeaters, with the general-defeater case open. On exact audits of depth-limited decision trees over standard tabular datasets under a deliberately small Boolean abstraction, the governing parameters fall in a small-parameter regime (robust cores in the low single digits), so exact robust auditing is tractable in these audited cubes; on adversarial instances built from our reductions the hardness bites, with robust cores of size Theta(n). To our knowledge this is the first $Σ_2^p$-complete audit query for disclosure-robust formal explanations.

URL PDF HTML ☆

赞 0 踩 0

2606.19395 2026-06-19 cs.SE 新提交

DevOps and General Developers: Insights from Stack Overflow's 2023 Survey

DevOps 与普通开发者：来自 Stack Overflow 2023 年调查的见解

Hasan Abdulla, Fatema AlJazeeri, Fawzi AlBalooshi, Jaflah Al-Ammary

AI总结通过分析 Stack Overflow 2023 年调查数据，比较 DevOps 专家与普通开发者在工具、技术、方法论和人口统计上的差异，发现两者角色互补，工具偏好无显著差异。

Comments 17 pages, 11 tables, research paper based on the 2023 Stack Overflow Developer Survey data analysis

详情

AI中文摘要

目的：调查 DevOps 专家和普通软件开发者在当前软件开发环境中不同的角色，考察他们在工具、技术、方法论和人口统计方面的不同使用情况。此外，区分这两个专业群体在该领域的独特贡献和挑战。设计/方法论/方法：研究采用定量方法分析 Stack Overflow 2023 年开发者调查数据。重点比较 DevOps 专家和普通开发者在技术偏好、人口统计信息和专业经验方面的差异，突出关键趋势和差异。数据分析使用 Python 的 Pandas 库进行。发现：研究表明，DevOps 专家和普通开发者在工具和技术偏好上没有显著差异，突出了他们的互补角色。DevOps 专家和普通开发者都使用 Docker 和 Kubernetes 等工具，强调效率和自动化。而普通开发者根据不同的角色需求使用多样化的工具，人口统计趋势显示普通开发者更年轻，DevOps 专业人员处于职业生涯中期。这一年龄范围反映了 DevOps 经验的增长，两个群体都在适应技术行业不断发展的远程和混合工作模式。实际意义：这项研究提供了对软件开发中动态角色的视角，强调了 DevOps 日益增长的重要性。它是学术和行业专业人士了解软件开发角色不断演变的宝贵资源。原创性/价值：这项研究填补了现有文献中关于软件开发角色动态演变的重要空白。

英文摘要

Purpose: To investigate the distinct roles of DevOps specialists and general software developers, examining their varying use of tools, technologies, methodologies, and demographics in the current software development environment. In addition, to differentiate these two professional groups regarding their unique contributions and challenges in the field. Design/Methodology/Approach: The research uses a quantitative approach to analyze data from the Stack Overflow 2023 Developer Survey. It focuses on a comparative analysis of technological preferences, demographic information, and professional experiences between DevOps specialists and general developers, highlighting key trends and differences. The data analysis was conducted using Python's Pandas library for data analysis. Findings: The research indicates no significant difference in the tool and technology preferences between DevOps specialists and general software developers, highlighting their complementary roles. DevOps specialists and general software developers use tools like Docker and Kubernetes, emphasizing efficiency and automation. While general developers employ diverse tools for various role demands, demographic trends show younger general developers and mid-career DevOps professionals. This age range reflects growing experience in DevOps, and both groups are adapting to remote and hybrid work models in the evolving tech industry. Practical Implications: This research offers perspectives on the dynamic roles within software development, emphasizing the growing importance of DevOps. It is a valuable resource for academic and industry professionals to understand the evolving dynamics in software development roles. Originality/Value: This research fills a significant gap in the existing literature regarding the evolving dynamics of software development roles.

URL PDF HTML ☆

赞 0 踩 0

2606.19394 2026-06-19 cs.FL 新提交

On Epimorphisms of Hypergraphic Automata and Input Symbol Semigroups

超图自动机及其输入符号半群的满态射

Jasem Hamoud

AI总结本文刻画了泛超图自动机及其输入符号半群的满态射，引入了弱和强两种超图满态射概念，并证明它们在p*-超图子类中一致，给出了三元组成为满态射的充要条件。

Comments 13 pages, 2 figures

详情

AI中文摘要

超图自动机是状态集和输出符号集为超图且在转移函数和输出函数作用下保持不变的自动机。此类自动机构成的范畴中，泛吸引对象称为泛超图自动机；其输入符号半群是映射代数，其性质与自动机自身的代数结构紧密相关。本文建立了泛超图自动机及其输入符号半群的满态射的完整刻画。核心贡献是引入了超图的两种不同满态射概念（弱和强），并证明这些概念通常不同，但对于重要的$p^*$-超图子类必然一致，该子类包括状态超图和输出超图为射影平面或仿射平面的自动机。主要结果给出了三元组$(f, \mathbb{P}_s, g)$成为泛超图自动机满态射的充要条件，用状态超图和输出超图上的分量映射表示。

英文摘要

Hypergraphic automata are automata whose state sets and output symbol sets are hypergraphs invariant under the actions of the transition and output functions. Universally attracting objects in the category of such automata are called universal hypergraphic automata; their semigroups of input symbols are algebras of mappings whose properties are tightly linked to the algebraic structure of the automata themselves. This paper establishes a complete characterisation of epimorphisms of universal hypergraphic automata and of their semigroups of input symbols. A central contribution is the introduction of two distinct notions of epimorphism for hypergraphs including weak, strong and the proof that these notions diverge in general but necessarily coincide for the important subclass of $p^*$-hypergraphs, which includes automata whose state hypergraphs and output hypergraphs are projective or affine planes. The main results give necessary and sufficient conditions for a triple $(f, \mathbb{P}_s, g)$ to be an epimorphism of universal hypergraphic automata, expressed in terms of the component maps on the state and output hypergraphs.

URL PDF HTML ☆

赞 0 踩 0

2606.19342 2026-06-19 cs.SE 新提交

Supporting Design Decisions in Rule-Based Model Transformations

支持基于规则的模型转换中的设计决策

Dejan Stojimirovic, Sinisa Neskovic

AI总结提出一种将设计决策显式建模并分离于规则转换实现的方法，通过决策、绑定、配置三个模型实现灵活性和可重用性，并建立形式化框架证明其性质。

详情

AI中文摘要

模型驱动工程依赖模型转换来自动从源模型推导目标模型或源代码。然而，控制源元素如何映射到目标制品的设计决策通常嵌入在转换源代码中，限制了灵活性、可重用性和可追溯性。本文提出一种在基于规则的模型转换中显式建模和管理设计决策的方法。通过三种机制将设计决策与转换实现分离。首先，决策模型独立于任何源建模语言捕获设计决策及其选项，针对给定转换领域。其次，绑定模型将这些决策连接到给定源建模语言中的特定元模型概念，使得相同设计知识能够在相似语言的转换中重用。第三，配置模型记录每个适用源模型元素所选的特定选项，并自动预选默认值。执行时，转换规则中的可变点根据配置选择动态解析。跟踪模型记录应用了哪些规则和选项来生成每个目标元素。我们建立了一个形式化数学框架，定义了基于可变性的转换的核心概念，并证明了所得转换的关键性质。这些形式化概念在一个由四个相互关联的制品模型组成的实用架构中实现。我们通过为现有的嵌入式领域特定语言扩展可变性支持来演示实际可行性，并用一个完整的ER到关系转换示例说明该方法。

英文摘要

Model Driven Engineering relies on model transformations to automate the derivation of target models or source code from source models. However, the design decisions that govern how source elements are mapped to target artifacts typically remain embedded in the transformation source code, limiting flexibility, reusability, and traceability. This paper proposes an approach to explicitly model and manage design decisions in rule-based model transformations. Design decisions are separated from transformation implementation through three mechanisms. First, a decision model captures design decisions and their options independently of any source modeling language for a given transformation domain. Second, a binding model connects these decisions to specific metamodel concepts in a given source modeling language, enabling reuse of the same design knowledge across transformations from similar languages. Third, a configuration model records the specific option chosen for each applicable source model element, with defaults pre-selected automatically. During execution, variability points in transformation rules are resolved dynamically according to configured choices. A trace model records which rules and options were applied to produce each target element. We establish a formal mathematical framework that defines the core concepts of variability-based transformations and proves key properties of the resulting transformations. The formal concepts are realized in a practical architecture of four interconnected artifact models. We demonstrate practical feasibility by extending an existing embedded domain-specific language for model-driven engineering with variability support and illustrate the approach with a complete ER-to-Relational transformation example.

URL PDF HTML ☆

赞 0 踩 0

2606.20413 2026-06-19 eess.SP cs.IT math.IT 新提交

Hybrid TRP-UE Sensing for Enhanced Target Localization

混合TRP-UE感知用于增强目标定位

Necati Kagan Erkek, Marco Di Renzo, Arman Shojaeifard, Yasser Mestrah, Remun Koirala, Mohammad Heggo, Kunjan Shah

AI总结提出一种混合TRP-UE感知机制，利用UE辅助感知提升网络感知性能，在室内工厂等复杂传播环境下显著改善目标定位精度。

Comments 6 pages

AI 大模型

视觉与机器人

科学与医疗

SAC: Disaggregated KV Cache System for Sparse Attention LLMs with CXL

Designing for Interconnected Islamic Learning: A Qualitative Study of Muslim Women's Experiences with Qur'an, Hadith, and Seerah Apps

Vibe Coding for Visualization Implementation: An Empirical Study of Practices and Challenges

When Global Gating Is Enough: Admission-Time Hubness Control in Anisotropic Vector Retrieval Systems

Syndesmoscope: The Power of Invariant Plots\\Linked to Traditional Network Views

Effect Systems as Abstract Interpretations

ImProNCDE: Impulse-Corrected Neural Controlled Differential Equations with Prototype Learning for Longitudinal Prognosis Prediction

PUFFERDOS: Efficient and Effective Attack String Generation for Regular Expression Denial of Service Vulnerabilities

Prompt Quality and Pull Request Outcomes: A Stage-Based Empirical Study of LLM-Assisted Development

G-Lox: Group-Adaptive, Privacy-Preserving Bridge Distribution with Two-Party Computation

Joint-task truthfulness of the DMI mechanism

Building Drift: Documenting On-Site Construction Adaptations Across Material Lifecycles

Ramping Procurement and Bid-Cost Recovery in Real-Time Market

REMOP: REmote-Memory-aware OPerator Optimization

Code as Anchor, Memory and Metaphor as Support: Learner Experiences with Multi-View Visualizations

A hybrid sharp-diffuse interface approach to accurately model melt pool dynamics with rapid evaporation in laser-based processing of metals

Mesh Inference: A Formal Model of Collective Intelligence Without a Center

Vancomycert: A Certified Neuro-Symbolic Drug Delivery System (Case Study)

The Sheaf Laplacian: A Topological Framework for Data Fusion and Consensus in Distributed Sensing Networks

SPINE: A Fault Injection Profiler for Quantized Neural Networks under Accumulated Faults

ev-flow: A Reproducible, NHTS-Grounded Generator of Synthetic Plug-in Electric Vehicle Charging Behavior for Eight U.S. Regions

A Topos-Theoretic Interpretation of Blockchain Systems: Sheaves of Consensus and the Logic of Decentralized Truth

LLM-Mediated Human-AI Interaction in Search and Rescue: Impact of Expertise on Attentional Allocation

MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

OpenRath: Session-Centered Runtime State for Agent Systems

The Complexity of Auditing Disclosure-Robust Defeasible Explanations

DevOps and General Developers: Insights from Stack Overflow's 2023 Survey

On Epimorphisms of Hypergraphic Automata and Input Symbol Semigroups

Supporting Design Decisions in Rule-Based Model Transformations

Hybrid TRP-UE Sensing for Enhanced Target Localization