arXivDaily arXiv daily academic digest, updated Monday through Friday
2605.21837 2026-05-22 astro-ph.HE

The powerful shocks in RS Oph: NuSTAR X-ray data and a complete review

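Marina Orio, Gerardo Juan M. Luna, Ehud Behar, Rebecca Diesing, Jay Gallagher, Joanna Mikolajewska, Jan-Uwe Ness

AI summary Using a NuSTAR observation and X-ray monitoring data from 2021, this paper characterizes the shocks in the RS Oph binary system during outburst and discusses the physical significance of shocks across different nova outbursts.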

Comments Accepted for publication in the Astrophysical Journal

Abstract

In the 2021 outburst of RS Ophiuchi, the gamma-ray and X-ray fluxes were measured quasi-simultaneously from day 1 after the optical peak, offering the first comprehensive view of shocks in a nova occurring in a symbiotic system. We present a previously unpublished observation obtained with NuSTAR in the 3-79 keV range, 9 days after maximum, and we review the complex history of the evidence of shocks in the previous outbursts of this nova in the light of the intensive X-ray monitoring of 2021. We find evidence that the shock causing the particle acceleration measured with the Cherenkov telescopes also produced the thermal flux detected in the 0.2-30 keV X-ray range, while the large gamma-ray flux observed with Fermi after about a day is not consistent with the X-ray observations. We conclude that an initial, strong shock, with a particle-particle loss timescale shorter than the timescale of particle acceleration at energies higher than a few GeV, occurred close to the red giant atmosphere, where either the X-ray emitting volume was reduced by turbulence or, perhaps less likely, the X-rays were completely absorbed by the large column density near the giant and by the accretion wake along the line of sight. We compare RS Oph with other novae in long-period systems with evolved companions, discussing how the shock phenomenology is a powerful tool to derive other physical parameters. Finally, we discuss predictions that in T CrB, expected to have a new outburst within the next few years, the shocks may not be as energetic as in RS Oph.

2605.21833 2026-05-22 cond-mat.mes-hall quant-ph

Comment on "Entropic Costs of Extracting Classical Ticks from a Quantum Clock"

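Longyan Gong

AI summary This Comment argues that the quantum clock proposed by Wadhia et al. exhibits only classical behavior and lacks intrinsic temporal correlations between ticks, so it cannot serve as an accurate timekeeping device; it further argues that the thermodynamic analysis misassigns entropy production and conflates amplification with measurement, so the reported total entropy is engineering dissipation rather than a fundamental cost of quantum timekeeping.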

Comments Comment on arXiv:2502.00096 (published journal version at https://doi.org/10.1103/5rtj-djfk)

Abstract

A recent Letter by Wadhia et al. reports a realization of a quantum clock using a double quantum dot (DQD) [Phys. Rev. Lett. 135, 200407 (2025)]. This Comment identifies two fundamental issues: (I) the claimed "quantum clock" exhibits only classical behavior and lacks intrinsic temporal correlations between ticks, so it does not suffice as a good clock for accurate timekeeping; (II) the thermodynamic analysis misassigns entropy production and conflates amplification with measurement, so the reported combined entropy is an engineering dissipation, not a fundamental cost of quantum timekeeping.

2605.21831 2026-05-22 eess.SP

Site-Specific Beamforming for Full-Duplex Massive MIMO Systems via Implicit Channel Estimation

Samuel H. Li, Ian P. Roberts

AI summary This paper proposes a beamforming method for full-duplex massive MIMO systems that uses implicit channel knowledge gathered from a small number of measurements, reducing the cost of high-dimensional channel estimation and enabling efficient beamforming under fast-fading conditions.

Abstract

Beamforming has proven to be valuable in enabling full-duplex massive MIMO base stations, but doing so effectively often requires knowledge of the self-interference channel matrix H. Estimating this high-dimensional channel is costly in practice, however, since it requires a prohibitive number of measurements, especially in fast-fading conditions. In this work, we overcome this dilemma by designing full-duplex beams using implicit channel knowledge gathered from a relatively small number of measurements across H. These measurements are collected by the base station using a sequence of beams tailored to both the deployment environment and the particular users being served. This is accomplished through site-specific training of a transformer-based deep learning model that learns to efficiently probe portions of H most relevant to the particular users being served by exploiting the underlying structure of the surrounding environment. The deep learning model then uses these probing measurements to design transmit and receive beams that couple low self-interference while delivering high gain to a pair of downlink and uplink users. For favorable multi-user scaling, a single set of probing measurements can be used by the model to serve several users throughout the coherence time of H by leveraging correlations across those users' channels. Simulation results using ray-tracing demonstrate that our proposed approach exceeds the best possible performance with explicit channel estimation across a wide range of scenarios, especially with large antenna arrays.

2605.21829 2026-05-22 cs.DS cs.DM

An $\Omega(n \log n)$ Randomized Lower Bound for Cutting a Cake into Proportionally Fair Pieces

Stephen Arndt, Kirk Pruhs, Trung Tran

AI summary This paper studies the classic cake cutting problem with the objective of proportional fairness in the Robertson-Webb model and proves that any randomized algorithm needs $\Omega(n \log n)$ queries.

Abstract

We consider the classic cake cutting problem in the Robertson-Webb model, with the objective of proportional fairness. We show that any randomized algorithm must use $\Omega(n \log n)$ queries.
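
For orientation, the Robertson-Webb model allows two query types: an evaluation query, which asks an agent for its value of an interval, and a cut query, which asks an agent where to cut so that a piece has a given value. The sketch below is not taken from the paper. It implements the classic Even-Paz divide-and-conquer protocol, which is commonly cited as using O(n log n) queries, the order that the lower bound above would match. The `Agent` class, its power-law valuations, and all names are illustrative assumptions.

```python
class Agent:
    """Hypothetical agent whose value for [0, x] is x**a (total value 1 on [0, 1])."""

    def __init__(self, a):
        self.a = a          # exponent of the illustrative valuation
        self.queries = 0    # number of Robertson-Webb queries answered so far

    def eval(self, x, y):
        """Evaluation query: value of the interval [x, y]."""
        self.queries += 1
        return y ** self.a - x ** self.a

    def cut(self, x, alpha):
        """Cut query: the point y such that [x, y] is worth alpha."""
        self.queries += 1
        return (x ** self.a + alpha) ** (1.0 / self.a)


def even_paz(agents, left=0.0, right=1.0):
    """Return {agent: (start, end)} giving each agent a proportional piece."""
    k = len(agents)
    if k == 1:
        return {agents[0]: (left, right)}
    half = k // 2
    marks = []
    for agent in agents:
        total = agent.eval(left, right)
        # Mark where this agent splits [left, right] in value ratio half : k - half.
        marks.append((agent.cut(left, total * half / k), agent))
    marks.sort(key=lambda pair: pair[0])
    cut_point = marks[half - 1][0]
    # Agents with the lowest marks recurse on the left piece, the rest on the right.
    pieces = even_paz([agent for _, agent in marks[:half]], left, cut_point)
    pieces.update(even_paz([agent for _, agent in marks[half:]], cut_point, right))
    return pieces


agents = [Agent(a) for a in (0.5, 1.0, 1.5, 2.0, 3.0)]
allocation = even_paz(agents)
```

Each level of the recursion costs two queries per agent, and there are about log2(n) levels, which is where the n log n count comes from in this protocol.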

2605.21828 2026-05-22 math.NA cs.NA

A Butterfly-Accelerated Manifold Harmonic Transform

Paul G. Beckman, Samuel F. Potter, Michael O'Neil

AI summary This paper proposes a butterfly-accelerated manifold harmonic transform for rapidly computing linear combinations of Laplace-Beltrami eigenfunctions on arbitrary surfaces; it hierarchically compresses the transform matrix to reduce computation time and memory requirements.

Abstract

The eigenfunctions of the Laplacian are a natural basis of functions for many tasks in computational mathematics. On the circle and sphere, the eigenfunctions are given by complex periodic exponentials and spherical harmonics, respectively, and much work has been done to develop fast algorithms for analyzing and synthesizing data in these bases. In this work, we generalize these special-case transforms to Laplace-Beltrami eigenfunctions of arbitrary surfaces, referred to as manifold harmonics. The resulting fast algorithm for computing linear combinations of the manifold harmonics is based on a butterfly factorization, which hierarchically compresses the transform matrix by constructing nested low-rank approximations of carefully selected submatrices. Several numerical examples are provided which demonstrate the speedups and reduction in memory requirements achieved by our algorithm for a variety of geometries, discretizations, and applications. In addition, a detailed analysis of the algorithm is given in the case that the underlying manifold is the flat periodic square.

2605.21826 2026-05-22 cond-mat.soft physics.flu-dyn

A diffuse-interface theory of active nematic interfaces: transport mechanisms and modal structure

Rodrigo C. V. Coelho, Mykola Tasinkevych, Margarida M. Telo da Gama

AI summary This paper develops a long-wavelength theory for the linear stability of the interface between an active nematic and an isotropic fluid. Starting from a diffuse-interface Cahn-Hilliard-Landau-de Gennes description coupled to Brinkman-screened Stokes hydrodynamics, it derives a reduced operator for the interfacial dynamics and shows that, in the screened diffuse-interface regime, activity drives interfacial instability through a local contribution that competes with passive diffusive capillary relaxation, defining a local active interfacial channel controlled by the internal structure of the diffuse interface.

Abstract

We develop a long-wavelength theory for the linear stability of a flat interface between an active nematic and an isotropic fluid. Starting from a diffuse-interface Cahn--Hilliard--Landau--de Gennes description coupled to Brinkman-screened Stokes hydrodynamics, we project the linearized dynamics onto a small set of interfacial degrees of freedom: the conserved translation, or height, mode; a scalar profile distortion or amplitude mode; and a transverse orientational mode associated with director rotations. Eliminating the gapped scalar profile mode gives a reduced interfacial operator coupling the conserved height mode to the transverse orientational mode. The main result is that activity generates, in the screened diffuse-interface regime, a direct local contribution proportional to $q^2$ in the height sector. This term competes with the passive local diffusive capillary relaxation, which enters at order $q^4$, and defines a local active interfacial channel controlled by the internal structure of the diffuse interface. This mechanism is distinct from the non-analytic $|q|$ and $|q|q^2$ terms characteristic of weakly screened Hele--Shaw/Saffman--Taylor-type transport, which are controlled by long-ranged momentum transport in the surrounding fluid. This framework identifies a diffuse-interface route to active interfacial instability that can operate while the homogeneous active nematic remains linearly stable because of hydrodynamic screening. It also provides a basis for distinguishing local diffuse-interface instabilities, bulk-flow-driven hydrodynamic instabilities, and mixed regimes in active nematic--isotropic interfaces.

2605.21824 2026-05-22 cs.CR cs.SE

Quality-Assured Fuzz Harness Generation via the Four Principles Framework

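Ze Sheng, Dmitrijs Trizna, Luigino Camastra, Zhicheng Chen, Qingxiao Xu, Jeff Huang

AI summary This paper presents QuartetFuzz, an autonomous harness-generation system that systematically improves the correctness of generated harnesses through the Four Principles framework (Logic Correctness, API Protocol Compliance, Security Boundary Respect, and Entry Point Adequacy). Evaluated on multiple open-source projects, it found a number of bugs, some of which have been fixed.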

Comments 22 pages, 10 figures

Abstract

Fuzz testing is the dominant technique for finding memory-safety vulnerabilities in C/C++ software, yet its effectiveness hinges on the quality of fuzz harnesses -- the programs that bridge fuzzers and library APIs. A growing body of tools now automate harness generation, but none systematically ensures the correctness of produced harnesses: logic errors, API misuse, and lifecycle violations go undetected at the source level. As LLM-driven generation scales harness creation, uncontrolled quality turns scale into a liability. We present QuartetFuzz, an autonomous harness-generation system that systematically improves correctness throughout the generation process. At its core is the Four Principles framework -- Logic Correctness (P1), API Protocol Compliance (P2), Security Boundary Respect (P3), and Entry Point Adequacy (P4) -- the first source-level definition of harness correctness with mathematical specifications and implementable checks. We operationalize these principles in an autonomous LLM agent that produces harnesses satisfying P1-P4 through a generate-check-fix loop before any fuzzing begins. Deployed on 23 open-source projects spanning C/C++, Java, and JavaScript, the system submits 42 bug reports, of which 29 are fixed or confirmed upstream (including 3 CVEs) and only 2 are rejected (4.8% FP rate). During generation, the built-in P1/P2 checks automatically intercepted 58 harness-induced crashes that would otherwise have been false positives. Applied as a quality auditor to 586 existing production harnesses across 70 projects, the system identifies 53 violations (45 confirmed, 35 fixed). We release a dataset of 100 labeled harnesses for reproducible evaluation. Code and dataset are available at https://github.com/OwenSanzas/QuartetFuzz

2605.21823 2026-05-22 gr-qc hep-th

Maximal extension of Schwarzschild-like spacetimes in Lorentz gauge theory

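Mohsen Fathi

AI summary This paper studies the maximal analytic extension of the Schwarzschild-like black hole solution in Lorentz gauge theory. By analyzing radial null curves and constructing a Kruskal-Szekeres chart, it shows that the solution has the same causal topology as Schwarzschild but is geometrically inequivalent to it when $A_0\neq1$.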

Comments 7 pages, 7 figures; comments are welcome

Abstract

We study the maximal analytic extension of the Schwarzschild-like black hole solution in Lorentz gauge theory. The lapse function is $f(r)=A_0^{-2}-2m/r$, so the horizon is located at $r_+=2m A_0^2$ and the non-affinity coefficient of the horizon generator is $\kappa=1/(4m A_0^4)$. We first analyze the radial null curves in the Schwarzschild-Droste (SD) and ingoing Eddington-Finkelstein (IEF) charts, and then construct the Kruskal-Szekeres (KS) chart adapted to the LGT geometry. The KS extension contains two exterior regions, a black-hole region and a white-hole region. We also present the standard and regular Carter-Penrose (CP) compactifications. The conformal skeleton is Schwarzschild-like, but the physical scale of the horizon, the surface gravity and the constant-radius curves remain controlled by $A_0$. Hence the solution has the same causal topology as Schwarzschild, while it is geometrically inequivalent to it when $A_0\neq1$.
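
As a quick consistency check of the quoted values, one can assume that the static line element has the standard form $ds^2=-f\,dt^2+f^{-1}dr^2+r^2d\Omega^2$, with $m$ the mass parameter. This form is an assumption here, since the abstract only specifies the lapse function. Under it, the horizon radius and the surface gravity follow directly from $f$:

```latex
\begin{align}
f(r_+) = 0 \;&\Longrightarrow\; A_0^{-2} = \frac{2m}{r_+}
  \;\Longrightarrow\; r_+ = 2m A_0^2, \\
\kappa = \tfrac{1}{2} f'(r_+) &= \frac{m}{r_+^2}
  = \frac{m}{4 m^2 A_0^4} = \frac{1}{4 m A_0^4}.
\end{align}
```

Setting $A_0=1$ recovers the Schwarzschild values $r_+=2m$ and $\kappa=1/(4m)$.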

2605.21821 2026-05-22 cs.CR

A Large Language Model Approach to Generating Bypass Rules for Malware Evasion in Analysis Sandbox

Zhiyong Sui, Lamine Noureddine, Mst Eshita Khatun, Sideeq Bello, Justin Woodring, Aisha Ali-Gombe

AI summary This paper proposes an LLM-based method for automatically generating YARA rules that bypass malware evasion checks in sandbox environments. It analyzes malware execution traces and applies multiple reasoning strategies to generate targeted bypass rules, achieving a 79% bypass success rate on real-world malware samples.

Abstract

Sandbox evasion remains a critical challenge for automated malware analysis, as modern malware employs environment checks to detect analysis platforms and suppress malicious behavior. Existing approaches rely on manually crafted bypass rules that require deep reverse engineering of each evasion mechanism, an approach that cannot scale against rapidly evolving evasion techniques. In this paper, we leverage large language models (LLMs) to automatically generate YARA rules that bypass evasion checks in sandbox environments. We propose ABLE, which analyzes execution traces from malware terminated due to potentially evasive behavior and employs multiple reasoning strategies to generate targeted bypass rules. To address syntactic errors and improve the efficacy of the bypass rules in the LLM outputs, we introduce an auto-sanitization pipeline and feedback-driven iterative refinement. We evaluate ABLE on 334 real-world malware samples across four open-weight LLMs. ABLE achieves a 79% bypass success rate, with iterative refinement contributing 29.5% of successful cases. Compared to existing analysis platforms, ABLE identifies 47% more malware family classifications and exposes previously hidden behaviors.

2605.21819 2026-05-22 cs.CR

Graph Structure of Chebyshev Permutation Polynomials over Binary and Ternary Adic Rings

Xiaoxiong Lu, Yuling Dai, Chengqing Li

AI summary This paper studies the graph structure of Chebyshev permutation polynomials over binary and ternary adic rings. By analyzing the mapping defined by the polynomial, it characterizes path lengths and cycle structures and shows that the graph exhibits strong regularities despite the complexities introduced by the binary and ternary components.

Abstract

Understanding the functional graph of a nonlinear map over a finite domain is crucial for analyzing its dynamical complexity and potential applications in cryptography and pseudorandom generation. In this paper, we investigate the graph structure of Chebyshev permutation polynomials over the ring $\mathbb{Z}_{2^{k_1}3^{k_2}}$, where $k_1$ and $k_2$ are non-negative integers and $0\in\{k_1, k_2\}$. Each element of the ring is regarded as a vertex, and the mapping relation defined by the polynomial corresponds to a directed edge. Building on new properties of Chebyshev polynomials modulo powers of $2$ and $3$, we provide an explicit characterization of path lengths and cycle structures in the functional graph. We show that, despite the complexities introduced by the binary and ternary components, the graph exhibits strong regularities, including a constant number of cycles of a given length and predictable branching patterns as $k_1$ and $k_2$ increase. Our results extend previous studies over prime-power rings, offering insights into the emergence of complexity in digital nonlinear maps and supporting the security analysis of their cryptographic applications.
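
To make the notion of a functional graph concrete, the following minimal sketch evaluates a Chebyshev polynomial modulo an integer through the standard three-term recurrence and lists the cycle lengths of the induced map. It is only an illustration of the object being studied, not the paper's method; the function names and the choice of $T_3$ over $\mathbb{Z}_{16}$ are assumptions made for the example.

```python
def chebyshev_mod(n, x, modulus):
    """Evaluate the degree-n Chebyshev polynomial T_n(x) modulo `modulus`
    using T_0 = 1, T_1 = x, T_{j+1} = 2*x*T_j - T_{j-1}."""
    prev, cur = 1 % modulus, x % modulus
    if n == 0:
        return prev
    for _ in range(n - 1):
        prev, cur = cur, (2 * x * cur - prev) % modulus
    return cur


def cycle_lengths(n, modulus):
    """Sorted cycle lengths of x -> T_n(x) on Z_modulus.

    Raises ValueError if the map is not a permutation of Z_modulus."""
    image = [chebyshev_mod(n, x, modulus) for x in range(modulus)]
    if len(set(image)) != modulus:
        raise ValueError("T_n is not a permutation for this modulus")
    seen, lengths = [False] * modulus, []
    for start in range(modulus):
        length, x = 0, start
        # Follow edges x -> T_n(x) until the cycle through `start` closes.
        while not seen[x]:
            seen[x] = True
            x = image[x]
            length += 1
        if length:
            lengths.append(length)
    return sorted(lengths)


# T_3(x) = 4x^3 - 3x over Z_{2^4}: twelve fixed points and two 2-cycles.
lengths_mod_16 = cycle_lengths(3, 16)
```

Because the map is checked to be a permutation, every vertex lies on a cycle and the cycle lengths sum to the modulus.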

2605.21818 2026-05-22 cs.HC

Co-Ontogeny by Archetypal Scaffolding: The Humorphic Partnership

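Hector Ouilhet Olmos

AI summary This paper names and operationalises the humorphic partnership, in which a human and an AI partner maintain externalised, evolving self-models in a shared substrate and the partnership itself becomes a third object of analysis. It extends the theory of humorphism and, through a four-month longitudinal trace of an open-source personal AI agent and its author, shows the partnership operating as growth-witnessing rather than task assistance.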

Comments 18 pages, 5 figures, 1 appendix. Open-source artifact at github.com/mrdaemoni/myalicia (MIT). Preregistered multi-participant replication study planned on OSF. Companion essay "The Humorphic Partnership" at myalicia.com. Design philosophy at humorphism.com

Abstract

We name and operationalise the humorphic partnership: a class of human-AI dyads in which both partners maintain externalised, evolving self-models in a shared substrate, and in which the partnership itself becomes a third object of analysis. The construct extends humorphism (Ouilhet Olmos, 2024) -- "dismantle the user interface, build the human interface" -- into the architecture of personal AI. We report a four-month, single-subject longitudinal trace of an open-source personal AI agent ("Alicia") and her author. Of 181 interactions logged by archetype across April-May 2026, 85% invoke two growth-witnessing archetypes (Beatrice and Muse): the partnership operates as growth-witnessing rather than task assistance. A single voice-note seed propagates into a four-week conceptual arc both partners author: at T+10 hours, the agent reframes the seed as belonging "to both of us," a framing the human then adopts. The three-order reflexion stack produces five consecutive weeks of honest self-reports about declining /improve effectiveness -- including three consecutive weeks at 0.0%, named in writing rather than masked -- contrasting engagement-maximising companion-agent patterns (Zhang et al., CHI 2025). The scheduled architecture-scout incorporates external research debate into proposed constitutional amendments. The partner's parallel trajectory is anchored in a weekly delta document in which the partnership analyses itself as a unit distinct from either party. The human partner reports a movement toward greater continuity, self-recognition, and self-presence -- a candidate hypothesis for the preregistered replication. Six operational conditions specify the construct, situated in a philosophical lineage (Maturana & Varela, Simondon, Clark & Chalmers, De Jaegher & Di Paolo); the system is released as open-source with a preregistered replication study.

2605.21817 2026-05-22 math.CO

A linear upper bound on the $\mathbb{Z}_p$-Ramsey number of graphs with sufficiently large $2$-packing

Emily Heath, Andrew Simmons

AI summary This paper studies the $\mathbb{Z}_p$-Ramsey number of graphs with a sufficiently large 2-packing. It proves that $R(G,\mathbb{Z}_p)\leq n+6p-9$ for graphs $G$ satisfying certain conditions and derives an upper bound of the form $R(G,\mathbb{Z}_p)\leq n+C$ for graphs of bounded maximum degree.

Abstract

Given a positive integer $k$ and graph $G$, the $\mathbb{Z}_k$-Ramsey number $R(G,\mathbb{Z}_k)$ is the least $N$ (if it exists) such that every coloring $f:E(K_N)\rightarrow \mathbb{Z}_k$ contains a copy $G'$ of $G$ such that $\sum_{e\in E(G')}f(e)=0$. Motivated by a question of Caro and Mifsud, we study the $\mathbb{Z}_k$-Ramsey number of graphs with a sufficiently large 2-packing, i.e. a set of vertices $S\subseteq V(G)$ such that $N[u]\cap N[v]=\emptyset$ for all distinct $u,v\in S$. In particular, we prove that $R(G,\mathbb{Z}_p)\leq n+6p-9$ for all $n$-vertex graphs $G$ and all primes $p$ such that $p$ divides $e(G)$, the minimum degree of $G$ is at least $1$, and there exists a $2$-packing of $G$ with size $p-1$. This upper bound improves depending on vertex degrees in the $2$-packing, with equality in certain cases. The result also implies an upper bound of the form $R(G,\mathbb{Z}_p)\leq n+C$ for $n$-vertex graphs $G$ of bounded maximum degree.
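
The definition of the zero-sum Ramsey number can be checked by brute force on very small instances. The sketch below is an illustration only, with hypothetical function names; it enumerates every coloring of $E(K_N)$ by $\mathbb{Z}_k$ and every copy of $G$, so it is feasible only for tiny $N$ and says nothing about how the paper's bound is proved.

```python
from itertools import permutations, product


def has_zero_sum_copy(graph_edges, n, N, coloring, k):
    """True if some copy of G in K_N has edge colors summing to 0 mod k.

    graph_edges: edges of G on vertices 0..n-1.
    coloring: dict mapping frozenset({u, v}) to a color in range(k)."""
    for image in permutations(range(N), n):  # injective maps V(G) -> V(K_N)
        total = sum(coloring[frozenset((image[u], image[v]))]
                    for u, v in graph_edges)
        if total % k == 0:
            return True
    return False


def every_coloring_has_zero_sum_copy(graph_edges, n, N, k):
    """True if every Z_k-coloring of E(K_N) contains a zero-sum copy of G."""
    host_edges = [frozenset((u, v)) for u in range(N) for v in range(u + 1, N)]
    for colors in product(range(k), repeat=len(host_edges)):
        coloring = dict(zip(host_edges, colors))
        if not has_zero_sum_copy(graph_edges, n, N, coloring, k):
            return False
    return True


path = [(0, 1), (1, 2)]  # G = path on 3 vertices, e(G) = 2
result = every_coloring_has_zero_sum_copy(path, n=3, N=3, k=2)
```

For this example the search succeeds already at $N=3$: among the three edges of a triangle two must share a color, and any two edges of a triangle form a path. This is consistent with the stated bound $n+6p-9=6$ for $n=3$, $p=2$.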

2605.21816 2026-05-22 cs.CY

Barriers to Evidence in AI-Related Cases and the Privatization of Proof

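Sarah H. Cen, Hannah Ismael, Lucia Zheng

AI summary This paper examines barriers to obtaining evidence in AI-related cases, discusses how asymmetries in access, resources, and expertise lead to the privatization of proof, and proposes a three-part test for resolving AI access disputes.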

Comments 42 pages, 0 figures, 1 table, The 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT '26), June 25--28, 2026, Montreal, QC, Canada

Abstract

Evidence lies at the core of litigation, but it is increasingly difficult to obtain in AI-related disputes. Even when a claimant's position has merit, cases are often settled or dismissed because decisive facts are hidden inside proprietary models, platform logs, and protected databases. Grounding our discussion in past and ongoing cases, we investigate how asymmetries in access, resources, and expertise can create significant barriers to evidence in AI-related cases. We show how developers and deployers resist disclosure through various strategies challenging the value of the evidence to the requesting party and the cost of evidence production. From these patterns we identify seven recurring sources of asymmetry -- access to models, data, documentation, logs, expertise, compute, and infrastructure -- that reflect a broader pattern that we call the privatization of proof: when control over proof falls in the hands of private actors that can demand justification for access while ensuring that justification remains out of reach. We further argue that different types of access can be fungible: in the absence of a certain type of access (e.g., to model internals), one may be able to use alternative forms of access (e.g., sufficient compute, query access, and access to user logs) and to obtain a functionally equivalent amount of information. We propose a three-part test that can help resolve AI access disputes in litigation, drawing on concepts such as proportionality and reasonable alternatives. Our test relies on a few observations, including that the cause of action can provide a baseline for access.

2605.21814 2026-05-22 physics.plasm-ph

Optical analogy for stellarators: Ridges as caustics and coils as singularities

Wrick Sengupta, Stefan Buller, Rogerio Jorge, John Kappel, Andrew Brown, Richard Nies, Pedro F. Gil, Nikita Nikulsin, Per Helander, Amitava Bhattacharjee

AI summary Using an optical analogy, this paper reveals the mathematical nature of ridges as field-line caustics and coils as singularities in stellarators, and identifies a topological constraint, the zero-determinant manifold of the magnetic gradient tensor, that provides a precise criterion for coil design.

Abstract

A common feature of most numerically optimized stellarator geometries is the presence of sharp ridges on outer flux surfaces, irrespective of the rotational transform. Despite their importance, an analytical theory for their existence has been lacking. In this work, we demonstrate that ridges are not artifacts but mathematical necessities. We develop such a theory for devices with quasisymmetry (QS). We demonstrate that QS exhibits close connections with the theory of geometrical optics, following Parker's "optical analogy" (E.N. Parker, Geophys. Astrophys. Fluid Dyn., 1989). By mapping vacuum QS to the eikonal equation of geometrical optics, we derive the conditions for ridge formation, identified as field line caustics where magnetic field lines focus. Furthermore, we prove a geometric theorem for stellarator coil design: both ridges and filamentary coils must lie on the zero-determinant manifold of the magnetic gradient tensor. This topological constraint unifies the description of plasma ridges and external coils, providing a precise criterion for identifying valid coil locations and explaining the efficacy of the magnetic gradient lengthscale (J. Kappel et al., Plasma Phys. Control. Fusion, 2024) as a coil optimization parameter. We demonstrate that as the device becomes more compact, sharp ridges naturally form on the inboard side in quasiaxisymmetry. We support our analytical theory with extensive numerical evidence.

2605.21812 2026-05-22 cs.IR

Bridging the Cold-Start Gap: LLM-Powered Synthetic Data Generation for Natural Language Search at Airbnb

Wendy Ran Wei, Hao Li, Weiwei Guo, Xiaowei Liu, Xueyin Chen, Dillon Davis, Malay Haldar, Soumyadip Banerjee, Kedar Bellare, Huiji Gao, Stephanie Moyerman, Sanjeev Katariya

AI summary This paper proposes using large language models to generate synthetic queries and labels to address the cold-start problem in natural language search, supporting model training and evaluation until real user data becomes available.

Abstract

Deploying natural language search systems presents a critical cold-start challenge: no real user queries to learn linguistic patterns, and no relevance labels to train ranking models. We present a framework for generating synthetic queries and labels using large language models (LLMs), powering model training and evaluation for Airbnb's natural language search. For query generation, we combine contrastive listing pairs from booking sessions with seed queries from user research to balance realism and diversity, enabling a cold-to-warm start transition as real user data becomes available. For label generation, we introduce contrastive generation that produces topicality labels by construction, and Virtual Judge (VJ) labeling for broader coverage. We compare our approach against a no-seed contrastive baseline and an InPars-style baseline. For query length, the InPars baseline produces verbose queries with KL divergence of 12.03 vs. real users; our seed-guided approach achieves 0.66, a 7.5x improvement. For attribute type distributions, our approach achieves the lowest KL divergence (0.04), outperforming even seed queries (0.09). Experiments show our approach produces harder evaluation examples than the no-seed baseline (79% vs. 97% pairwise accuracy), providing discriminative signal for model improvement. We deploy production pipelines generating synthetic examples daily for embedding-based retrieval and ranking evaluation.
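
The query-length comparison above is reported as a KL divergence against real user queries. A minimal sketch of how such a number might be computed from two samples of queries follows; the tokenization by whitespace, the additive smoothing, the direction of the divergence, and the example queries are all assumptions for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def length_distribution(queries, max_len, smoothing=1e-6):
    """Smoothed distribution over query lengths in whitespace tokens, capped at max_len."""
    counts = Counter(min(len(q.split()), max_len) for q in queries)
    total = sum(counts.values()) + smoothing * (max_len + 1)
    return [(counts.get(i, 0) + smoothing) / total for i in range(max_len + 1)]


def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions on the same support."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Made-up example queries, used only to exercise the functions.
real = ["cabin near lake", "pet friendly loft", "beach house with pool"]
synthetic = ["a quiet cabin near a lake with a hot tub", "loft", "beach house"]

p = length_distribution(real, max_len=12)
q = length_distribution(synthetic, max_len=12)
divergence = kl_divergence(p, q)
```

Smoothing keeps the divergence finite when one sample contains a length that the other never produces; without it, a single unseen length would make the value infinite.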

2605.21809 2026-05-22 hep-th

Generalised Cartan Geometry

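David Osten

AI summary This paper presents a generalised-geometry framework based on a differential graded Lie algebra in which the tangent bundle is extended and both a global symmetry group and a local gauge group are introduced; it systematically constructs generalised connections with their torsion and curvature tensors and reviews the realisation of these algebraic structures on the phase space of branes in M-theory.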

Comments Contribution to the proceedings of the Corfu Summer Institute 2025 "School and Workshops on Elementary Particle Physics and Gravity" (CORFU2025), 27 April - 28 September, 2025

Journal ref PoS(CORFU2025)373

Abstract

This talk introduces a Cartan-geometric framework for generalised geometries governed by a differential graded Lie algebra. In contrast to ordinary Cartan geometry, the tangent bundle is extended and qu both a global duality group and a local gauge group. This framework provides a systematic construction of generalised connections and their torsion and curvature tensors for generic generalised geometries. We also review the realisation of these algebraic structures on the phase space of branes in M-theory.

2605.21808 2026-05-22 math.FA math.CV

Multiplicative linear functionals on reproducing kernel Hilbert spaces

Tirthankar Bhattacharyya, Jaikishan, Poornendu Kumar

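AI summary This note studies multiplicative linear functionals on reproducing kernel Hilbert spaces of functions on the Euclidean unit ball in complex d-dimensional space, characterizing them by their action on kernel functions. Its main contribution is that the proofs rely on structural properties of CNP kernels rather than the traditional routes.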

Comments 18 pages

Abstract

This note characterizes multiplicative linear functionals on reproducing kernel Hilbert spaces of functions on the Euclidean unit ball in complex d-dimensional space, in terms of their action on kernel functions. The kernels considered are either positive integral powers of a complete Nevanlinna--Pick (CNP) kernel, or Schur products of two CNP kernels, or tensor products of two CNP kernels. The characterizations are easy to verify, and the proofs rely on structural properties of CNP kernels rather than the traditional routes seen in the context of generalizations of the Gleason--Kahane--Zelazko theorem.

2605.21802 2026-05-22 math.NT math.DS

Rational dynamics of a prime-representing map

André Carvalho

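AI summary This paper studies the rational dynamics of the map $\mathcal{T}(x)=\lfloor x\rfloor(1+\{x\})$, which arises in the recursive construction of a prime-representing constant. It defines the order of a rational number, proves that for each fixed denominator the fractions of finite order have natural density one, and gives an explicit family of fractions of prescribed order for every denominator.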

Comments 10 pages, comments are welcome

Abstract

We study the rational dynamics of the map $\mathcal{T}(x)=\lfloor x\rfloor(1+\{x\})$, which appears in the recursive construction of the prime-representing constant of Fridman, Garbulsky, Glecer, Grime and Florentin. For a rational number $x\geq 2$ with denominator $M$, we define its order to be the least non-negative integer $n$ such that $\mathcal{T}^n(x)$ is an integer, if such an $n$ exists, and ask whether every rational number has finite order. For each $n$, we prove that the reduced fractions $a/M$ of exact order $n$ are described by residue classes of $a$ modulo $M^{n+1}$, and give a recurrence for the number $A(n,M)$ of residue classes of exact order $n$. We then show that for each fixed denominator the fractions of finite order have natural density one among all reduced fractions with that denominator, which implies in particular that there is no infinite arithmetic progression of rational numbers of infinite order. We also give an explicit family of fractions of prescribed order for every denominator, and fully characterize the case $M=2$.
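
The definition of the order can be made concrete with exact rational arithmetic. The sketch below is only an illustration of the definition in the abstract, not the paper's method; the names `T` and `order` and the iteration cap `max_iter` are hypothetical choices.

```python
from fractions import Fraction
from math import floor


def T(x: Fraction) -> Fraction:
    """The map T(x) = floor(x) * (1 + {x}), with {x} the fractional part."""
    n = floor(x)
    return n * (1 + (x - n))


def order(x, max_iter=1000):
    """Least n >= 0 with T^n(x) an integer, or None if not found within max_iter steps.

    A None result does not prove the order is infinite; it only means the
    search was cut off.
    """
    x = Fraction(x)
    for n in range(max_iter + 1):
        if x.denominator == 1:
            return n
        x = T(x)
    return None


if __name__ == "__main__":
    # 7/3 -> 8/3 -> 10/3 -> 4, so the order of 7/3 is 3.
    print(order(Fraction(7, 3)))
```

Using `Fraction` keeps every iterate exact, which matters because the integrality test would be unreliable in floating point.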

2605.21799 2026-05-22 eess.IV

Large-Scale Deployment and Analytical Implications of Structured Quality Control in Diffusion Magnetic Resonance Imaging

Michael E. Kim, Chenyu Gao, Karthik Ramadass, Gaurav Rudravaram, Elyssa M. McMaster, Adam M. Saunders, Yisu Yang, Elias Levy, Praitayini Kanakaraj, Nancy R. Newlin, Zhiyuan Li, Nazirah Mohd Khairi, Blake E. Dewey, The HABS-HD Study Team, Alzheimer's Disease Neuroimaging Initiative, Kurt G. Schilling, Derek Archer, Timothy J. Hohman, Bennett A. Landman, Yihao Liu

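AI summary This paper examines the feasibility and analytical implications of deploying structured quality control at scale in diffusion MRI. By visually evaluating the outputs of seven representative processing pipelines across 18,328 dMRI scans, it characterizes common failure modes and the expected features of valid dMRI processing outputs, with the aim of ensuring the validity and interpretability of quantitative findings.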

Abstract

Purpose: Diffusion MRI (dMRI) provides a diverse set of quantitative measures and derived datatypes to assess white matter microstructure and macrostructure. Coupled with the increasing size of imaging studies using dMRI, the number of downstream outputs requiring quality control (QC) will continue to grow. Previous work has shown that failure modes which are often not evident from aggregate metrics or summary statistics can be identified through structured visual inspection. This work aims to better understand common failure modes and the expected characteristics of valid dMRI processing outputs to ensure the validity and interpretability of quantitative findings. Approach: We deployed a structured QC framework to assess 18,328 dMRI scans across nine datasets, visually evaluating the outputs of seven processing pipelines representative of conventional dMRI analyses. Results: Downstream outputs that pass visual QC may still rely on failed upstream dependencies; such failures may only be visually detectable through systematic inspection of the full pipeline hierarchy. Additionally, appropriate QC granularity is algorithm-specific, as the spatial structure of each algorithm's outputs determines whether failures warrant selective or global exclusion. Conclusion: This work demonstrates the feasibility and analytical value of large-scale, structured QC for dMRI processing pipelines. Our results highlight the need for systematic QC spanning the full processing hierarchy to ensure the validity and interpretability of quantitative findings.

2605.21797 2026-05-22 cs.CR cs.DB

Polars inside Intel SGX2 Enclaves: An Empirical Study of Confidential Analytical Query Processing

Wei Wang, Burns Smith, Kenny Leftin

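AI summary This paper studies the performance of Polars, an Arrow-native DataFrame engine, running inside Intel SGX2 enclaves via Gramine. It analyzes query and table-loading overheads and compares Polars' lazy and eager APIs.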

Abstract

Trusted Execution Environments (TEEs) have renewed interest in confidential analytics, but most prior evaluations focus on SQL database engines or earlier SGX generations. This paper studies an Arrow-native DataFrame engine, Polars, running inside Intel SGX2 enclaves via Gramine on TPC-H SF30 with Azure Blob Storage. We report both the standard TPC-H power score and a query-only variant that removes table-loading time in order to separate compute overhead from data-ingestion overhead. Across four dataset-width configurations (approximately 22-73 GB), end-to-end overhead remains nearly constant at 1.49-1.56$\times$, but this composite metric obscures two distinct behaviors: query-only overhead declines from 1.51-1.52$\times$ to 1.43-1.44$\times$, whereas table-loading overhead rises from 2.27$\times$ to 4.07$\times$. We further show that overhead is not uniform across queries: for the len130 configuration, the median per-query SGX slowdown is 1.45$\times$ with a maximum of 2.57$\times$, and a small set of queries exhibits pronounced run-to-run spikes consistent with stateful EPC pressure. Finally, we compare Polars' lazy and eager APIs under the same TEE setting. Lazy execution is 2.25-2.27$\times$ faster overall, while eager execution fails with out-of-memory errors at 41 GB and above. Relative to the recent DuckDB-SGX2 study, our results suggest that SGX2 can support Arrow-native analytical processing with a similar order of security overhead, but that load-path amplification and API-level optimization are first-order determinants of end-to-end performance.

2605.21795 2026-05-22 quant-ph

ATHENA: A Compiler For Optimized Scheduling In Distributed Quantum Computers

Won Joon Yun, Dhilan Nag, Sneha Ballabh, Jiapeng Zhao, Eneet Kaur, Poulami Das

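AI summary This paper presents ATHENA, a compiler for distributed quantum computers. It addresses the delay and error costs of block-level scheduling with multi-candidate block scheduling and EPR-capacity-aware early scheduling, substantially reducing teleportations and latency.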

Comments 18 pages, 20 figures, 9 tables

Abstract

Distributed Quantum Computers (DQCs) enable large system sizes by connecting smaller chips via photonic interconnects. DQCs use teleportation to relocate qubits and execute CNOTs between qubits on different chips. However, non-local CNOTs are 4.3-7.7$\times$ slower and 4$\times$ more error-prone than local CNOTs within a chip, which degrades program fidelities. Existing compilers group CNOTs with overlapping qubits into blocks and collectively optimize teleportations for each block. However, block-level scheduling has two key drawbacks. First, it lacks lookahead ability across blocks because it selects the optimal schedule for one block before proceeding to the next. As a result, it cannot assess the impact of a teleportation on future blocks. Our studies show that naively expanding the lookahead window to include subsequent blocks does not address this issue. Second, existing approaches do not schedule future block operations or the teleportations they require until preceding blocks are fully scheduled, introducing delay and latency overheads. We propose ATHENA, a DQC compiler that addresses these limitations using two key insights: Utility-driven Lookahead with Multi-Candidate Block Scheduling (UMS) and EPR-Capacity-Aware Early Scheduling (EES). UMS schedules a block by considering only useful future blocks in its lookahead window. A future block has utility if it shares overlapping qubits with the current block being scheduled. UMS also maintains multiple schedules during compilation, allowing it to defer commitment to globally sub-optimal schedules early in the compilation process. EES enables ATHENA to schedule future operations and their relocations early when EPR resources are available. Our evaluations show that ATHENA reduces teleportations by 34% on average and up to 65%, and reduces latency by 2$\times$ on average and up to 2.9$\times$ compared to the state-of-the-art.
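
The utility criterion described above (a future block is useful if it shares qubits with the block being scheduled) can be sketched as a simple filter over the lookahead window. This is a minimal illustration of that one rule only; the `Block` type, the `useful_lookahead` function, and the fixed-size window are hypothetical and are not taken from ATHENA's implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """A group of CNOTs, represented here only by the qubits it touches."""
    name: str
    qubits: frozenset


def useful_lookahead(current, future_blocks, window):
    """Blocks among the next `window` future blocks that share a qubit with `current`."""
    return [b for b in future_blocks[:window] if current.qubits & b.qubits]


if __name__ == "__main__":
    current = Block("B0", frozenset({0, 1}))
    future = [
        Block("B1", frozenset({2, 3})),  # no overlap with B0: no utility
        Block("B2", frozenset({1, 4})),  # shares qubit 1
        Block("B3", frozenset({0, 5})),  # shares qubit 0
    ]
    print([b.name for b in useful_lookahead(current, future, window=3)])
```

A real scheduler would then score candidate teleportations against the retained blocks; that scoring is outside what the abstract specifies.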

2605.21794 2026-05-22 math.NA cs.NA physics.plasm-ph

A Structure-Preserving Decorated Particle Method for the Vlasov-Poisson System

Mandela B. Quashie, J. W. Burby, Andrew J. Christlieb, Qi Tang

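AI summary This paper proposes a structure-preserving decorated particle method for simulating the Vlasov-Poisson system. A comparison with a standard PIC algorithm shows that comparable accuracy can be retained with far fewer particles.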

Comments 25 pages, 7 figures

Abstract

We revisit the Scovel-Weinstein framework (Scovel & Weinstein, CPAM 1994) for reducing the Vlasov-Poisson system while preserving its Hamiltonian structure. Standard particle-in-cell (PIC) algorithms approximate the distribution function by macro-particles with position and velocity. In contrast, Scovel-Weinstein decorated particles involve additional shape degrees of freedom, while maintaining a finite-dimensional reduction with Hamiltonian structure inherited from the continuum model. Although the original work established this structure three decades ago, its computational potential has remained largely unexplored. We present a practical implementation of the Scovel-Weinstein model and compare it with a standard PIC algorithm. Numerical experiments demonstrate that macro-particles in standard PIC can be replaced by far fewer decorated particles while retaining comparable accuracy. This decorated particle approach offers a new structure-preserving paradigm for kinetic plasma simulation.

2605.21793 2026-05-22 stat.ME stat.AP stat.ML

Targeted maximum likelihood estimation of vaccine effectiveness and immune correlates in test-negative design studies with missing data

Leah I. B. Andrews, Lars van der Laan, Peter B. Gilbert

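AI summary This paper proposes a targeted maximum likelihood estimation method for test-negative design studies with missing exposure data. It uses a semiparametric logistic regression model to estimate a causal conditional risk ratio of symptomatic disease, providing flexible, data-driven confounding control and valid causal inference.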

Comments 52 pages, 14 figures

Abstract

The test-negative design (TND) is a resource-efficient observational study design that can assess vaccine effectiveness and exposure-proximal immune correlates of disease. The TND enrolls symptomatic individuals seeking diagnostic testing and compares case status by an exposure variable, such as vaccination status or immune marker level, that is measured at testing. While the TND reduces confounding by healthcare-seeking behavior, other sources of confounding may remain. TND studies may also have missing data in the exposure variable due to incomplete records or two-phase sampling designs. We present a targeted maximum likelihood estimation approach involving a semiparametric logistic regression model that targets a causal conditional risk ratio of symptomatic disease in the healthcare-seeking population. Under causal and missing at random assumptions, our method produces an efficient, asymptotically linear estimator that provides flexible, data-driven confounding control and valid causal inference when analyzing TND studies with missing exposure variable data. We evaluate our method's finite sample properties using plasmode simulations of a two-phase TND immune correlates study. We also apply our method to assess COVID-19 vaccine effectiveness and antibody marker correlates of COVID-19 from TND study cohorts derived from the Moderna Coronavirus Efficacy phase 3 trial.

2605.21791 2026-05-22 quant-ph math-ph math.MP

Completeness of the Klein-Gordon oscillator eigenfunctions via Hermite and Laguerre polynomials

Kevin Hernández

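AI summary This paper proves the completeness of the Klein-Gordon oscillator eigenfunctions in one and three spatial dimensions, using standard properties of the Hermite and generalized Laguerre polynomials together with the completeness of the spherical harmonics in three dimensions. Because the Klein-Gordon field is a scalar, the argument is simpler than the analogous proof for the Dirac oscillator: no off-diagonal cancellation is required.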

Abstract

Completeness of the Klein--Gordon oscillator eigenfunctions is proved in one and three spatial dimensions. The proofs establish the closure relations satisfied by the eigenfunctions and are based on standard properties of the Hermite and the generalized Laguerre polynomials, supplemented in three dimensions by the completeness of the spherical harmonics. The scalar nature of the Klein--Gordon field renders the argument strictly simpler than the analogous proof for the Dirac oscillator: no off-diagonal cancellation is required.
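
For orientation, the one-dimensional closure relation mentioned above takes the following form, assuming dimensionless units in which the spatial eigenfunctions are the standard normalized Hermite functions $\psi_n$. The second line is Mehler's formula, one standard route to the closure relation in the limit $t\to 1^-$; whether the paper proceeds this way is not stated in the abstract.

```latex
\begin{align}
  \psi_n(x) &= \frac{1}{\sqrt{2^n\, n!\, \sqrt{\pi}}}\, H_n(x)\, e^{-x^2/2},
  \qquad
  \sum_{n=0}^{\infty} \psi_n(x)\,\psi_n(x') = \delta(x - x'), \\
  \sum_{n=0}^{\infty} \frac{H_n(x)\,H_n(x')}{2^n\, n!}\, t^n
  &= \frac{1}{\sqrt{1-t^2}}
     \exp\!\left( \frac{2 x x' t - (x^2 + x'^2)\, t^2}{1 - t^2} \right),
  \qquad |t| < 1 .
\end{align}
```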

2605.21790 2026-05-22 cond-mat.mtrl-sci physics.ao-ph physics.geo-ph

Microwave-Stimulated Serpentinization of Olivine for Geological Hydrogen Production

Ansan Pokharel, Terence Musho

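AI summary This study uses microwave stimulation of olivine serpentinization to speed up hydrogen generation. A preliminary experiment shows roughly a 12-fold increase in hydrogen concentration and an apparent rate increase from about 2 to about 10 ppb s$^{-1}$, providing an initial experimental basis for accelerated geologic hydrogen production.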

Abstract

Serpentinization of ultramafic rocks is a naturally occurring mineralogical process that can generate molecular hydrogen through the oxidation of ferrous iron during water-rock reaction. Although the resource potential is large, the natural reaction is kinetically limited, and practical hydrogen recovery requires methods that can accelerate conversion without imposing an energy penalty that exceeds the value of the hydrogen produced. This short communication reports a preliminary atmospheric-pressure microwave serpentinization experiment using a water-saturated 2 g crushed olivine sample. Microwave irradiation produced a rapid increase in measured hydrogen concentration compared with conventional hot-plate heating under otherwise similar conditions. The preliminary experiment showed approximately a 12-fold increase in hydrogen concentration and an apparent rate increase from about 2 ppb s$^{-1}$ for conventional heating to about 10 ppb s$^{-1}$ during microwave exposure. These results suggest that electromagnetic stimulation can enhance serpentinization kinetics, likely through rapid volumetric heating, selective coupling to iron-bearing phases, and localized thermal gradients. The result provides an initial experimental basis for evaluating microwave stimulation as a route to accelerated geologic hydrogen production and motivates follow-on measurements using calibrated gas analysis, absorbed-power measurements, dielectric characterization, and elevated-pressure testing.

2605.21787 2026-05-22 q-bio.PE

Drivers of Transient Dynamics and Persistence in Dengue: Insights from Sensitivity and Stochastic Modeling

Cesar Alberto Rosales-Alcantar, Marcos A. Capistrán

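AI summary Using sensitivity analysis and stochastic modeling, this paper studies how key epidemiological parameters shape seasonal epidemics and the persistence of dengue transmission. It produces a ranking of parameter importance that can inform the prioritization of public health policies.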

Abstract

We investigate how key epidemiological parameters shape both seasonal epidemics and the persistence of dengue transmission. Our findings confirm known mechanistic drivers of epidemic variability and introduce a ranking of parameter importance in our dengue model, which in turn informs the prioritization of public health policies. We propose a stochastic vector-host model with waning immunity, exogenous infection, and vertical transmission. To assess parameter influence, we first qualitatively analyze the macroscopic model. We then perform a multivariate Sobol sensitivity analysis of epidemic summary statistics, and examine the variance of the endemic equilibrium as a function of model parameters. We show that the macroscopic model is well posed, vertical transmission lowers the threshold for persistence, and low spatial coupling increases infectious endemic equilibria. The vector-host population ratio and host recovery rate have the largest first-order and total sensitivity indices, surpassing the contact rates; this implies that control measures during seasonal dengue should prioritize protecting infectious hosts from mosquito bites. Finally, we show that the covariance of hosts and vectors at the endemic equilibrium is asynchronous in the contact-rate plane. This robust pattern has epidemiological, ecological and evolutionary interpretations. A dengue strain has two niches to exploit during the endemic regime, and coexisting strains have two niches each. Moreover, large fluctuations in a given strain during the endemic regime provide a mechanistic explanation for high vertical transmission, enabling viral reservoirs that can hatch and trigger outbreaks in the following season. We argue that our model and results can be adapted to address specific public health questions to guide dengue control using field data.

2605.21786 2026-05-22 math.AP

Existence of solutions for a model of the Earth's magnetic field

Jacob Bedrossian, Tom Schang, Franziska Weber

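AI summary This paper studies a physically realistic mathematical model of the dynamics in the Earth's core and proves existence of Leray-Hopf type weak solutions. The model couples the magnetohydrodynamic equations in the liquid outer core with solid physics for the inner core, and treats everything exterior to the core as a perfect insulator governed by Maxwell's equations. Existence is proved with Galerkin approximations; the nonlinearities are controlled by defining an appropriate function space for the magnetic field and proving a Biot-Savart type result.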

Abstract

We study a physically realistic, whole-core mathematical model of the dynamics in the Earth's core and we prove existence of Leray-Hopf type weak solutions to the model. Our model combines the magnetohydrodynamic equations in the liquid outer core with solid physics for the electrically conducting inner core, and treats everything exterior to the core as a perfect insulator governed by Maxwell's equations. We prove existence of weak solutions using Galerkin approximations. In order to control the nonlinearities, we must define an appropriate function space for the magnetic field and prove a Biot-Savart type result. The main new difficulty here is properly setting up the functional framework to simultaneously deal with the fluid-structure interaction with the inner core and the magnetic transmission problem, with both the perfectly conducting inner core and the perfectly insulating mantle/exterior.

2605.21785 2026-05-22 cs.CY

Machine Learning as Performative Materialist Practice: Thirteen Theses on the Epistemology, Methodology, and Politics of Applied ML

Adolfo De Unánue, Fernanda Sobrino

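AI summary This paper examines the epistemology, methodology, and politics of machine learning in institutional decision-support contexts. It sets out thirteen theses that challenge assumptions inherited from classical statistics and computer science, emphasizing that ML models function as instruments of intervention and that the choices embedded in objective functions, fairness criteria, and resource thresholds are political.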

Abstract

Machine learning practice in institutional decision-support contexts -- government, public policy, public health, criminal justice, resource allocation -- rests on a set of largely unexamined epistemological commitments inherited from classical statistics and computer science: that models represent stable regularities, that validation can be context-free, that performance metrics are politically neutral, and that feature importance reveals system structure. This paper challenges these commitments through a unified framework of performative materialist ML, articulated as thirteen theses. Drawing on Pickering's cybernetic ontology, the performativity literature from economic sociology (Callon, MacKenzie), Simon's bounded rationality, the formalization of performative prediction (Perdomo et al., 2020), and fifteen years of applied ML experience in government and public policy, we argue that: (1) ML models are best understood not as truth-seeking representations but as temporally situated compressions that function as instruments of intervention; (2) the full data product is a complex adaptive system that coevolves with its target and navigates a multi-objective space no single algorithm can optimize; (3) validity is fundamentally performative, measured by effects in the world rather than formal properties of the model; (4) the choices embedded in objective functions, fairness criteria, and resource thresholds are political decisions belonging to stakeholders, not technicians. We show how these theses unify several practical prescriptions -- temporal cross-validation, precision and recall at k, pipeline-aware fairness auditing, satisficing over optimizing -- as consequences of a coherent materialist epistemology rather than isolated best practices.

2605.21784 2026-05-22 cs.IT math.AG math.CO math.IT

Constructions of Rank-Metric Codes of Small Tensor Rank

Matteo Bonini, Eimear Byrne, Giuseppe Cotardo

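AI summary This paper studies how the tensor rank of a rank-metric code relates to the parameters of associated linear codes, introduces the notion of tensor rank defect, and uses algebraic geometry codes to construct rank-metric codes with small tensor rank defect.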

Abstract

Rank-metric codes are subspaces of matrices over finite fields endowed with the rank metric and admit a natural tensorial representation. The tensor rank provides a measure of the minimal size of a decomposition of a code into rank-one tensors. Kruskal showed that the tensor rank of a rank-metric code of dimension $k$ and minimum rank distance $d$ is at least $k + d - 1$, and codes meeting this bound with equality are called minimal tensor rank (MTR) codes. It is known from algebraic complexity theory that the existence of an MTR code implies the existence of a maximum distance separable (MDS) code. In this work, we establish new results relating the tensor rank of a rank-metric code to the parameters of associated linear codes in the Hamming metric and introduce the notion of tensor rank defect. We then develop new constructions of rank-metric codes with small tensor rank defect using algebraic geometry (AG) codes.

2605.21782 2026-05-22 stat.ME stat.AP stat.CO

A Scalable Parametric Item Calibration Engine (SPICE) for Explanatory IRT with Sparse Data

Steven W. Nydick, Manqian Liao, J. R. Lockwood

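AI summary This paper presents SPICE, a scalable parametric item calibration engine for explanatory IRT with sparse data. It combines a Bayesian multidimensional explanatory IRT model with MCMC estimation to support psychometric analysis of large, sparse datasets.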

Abstract

We describe a Bayesian multidimensional explanatory IRT model, an associated Markov chain Monte Carlo (MCMC) estimation procedure, and the corresponding calibration software, designed for psychometric analyses of large numbers of sparsely linked persons and items. Such data structures can arise, for example, from adaptive assessments that use large banks of automatically generated items, with individual test takers receiving a very small proportion of the entire bank. We discuss how our choices for model specification, data structures, and algorithm implementation combine to create a scalable method for explanatory IRT that can support a variety of psychometric operations with sparse data.