arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2057
2605.30164 2026-06-01 math.QA math-ph math.MP

Monodromy free Schrödinger operators and affine $\mathfrak{sl}_2$ master functions

无单值Schrödinger算子与仿射$\mathfrak{sl}_2$主函数

Andrei Grigorev, Evgeny Mukhin

AI总结 研究形如$L=\partial_x^2-u(x)$的Fuchs型微分算子,使得对所有$\lambda\in\mathbb{C}$算子$L+\lambda P(x)$无单值,证明这些算子由${\widehat{\mathfrak{sl}}_2}$主函数的临界点群体得到,并展示临界点的复制过程对应于算子$P^{-1}(x)L$的Darboux变换,从而在$P(x)=x^k$情形下给出所有此类算子的分类。

Comments Latex 22 pages

详情
AI中文摘要

给定一个非零多项式$P(x)$,我们研究形如$L=\partial_x^2-u(x)$的Fuchs型微分算子,使得对所有$\lambda\in\mathbb{C}$,算子$L+\lambda P(x)$无单值。我们证明所有这样的算子都可以从${\widehat{\mathfrak{sl}}_2}$主函数的临界点群体得到。此外,我们展示了临界点的复制过程对应于算子$P^{-1}(x)L$的Darboux变换。作为结果,我们在$P(x)=x^k$的情形下得到了所有具有这些性质的算子$L$的分类。

英文摘要

Given a non-zero polynomial $P(x)$, we study Fuchsian differential operators of the form $L=\partial_x^2-u(x)$ such that for all $λ\in\mathbb{C}$ the operator $L+λP(x)$ is monodromy free. We prove that all such operators are obtained from populations of critical points of ${\widehat{\mathfrak{sl}}_2}$ master functions. Moreover, we show that the reproduction procedure of critical points corresponds to a Darboux transformation of operator $P^{-1}(x)L$. As a result, we obtain a classification of all operators $L$ with such properties in the case of $P(x)=x^k$.

2605.30019 2026-06-01 cs.AR

elasticAI.explorer: Towards a Unified End-to-End Framework for Hardware-Aware Neural Architecture Search

elasticAI.explorer:面向硬件感知神经架构搜索的统一端到端框架

Natalie Maman, Florian Hettstedt, Andreas Erbslöh, Gregor Schiele

AI总结 提出基于Optuna的弹性AI探索器框架,通过YAML搜索空间规范、硬件代码生成和Docker交叉编译,实现硬件在环的神经架构搜索。

Comments 6 pages, 2 figures, IEEE CODASSCA 2026 (submitted)

详情
AI中文摘要

神经架构搜索(NAS)已成为在特定任务和硬件约束下自动设计神经网络的重要方法。然而,许多现有NAS框架将搜索空间定义、模型实现和部署流程紧密耦合,使得扩展到新硬件平台和自定义算子变得困难。本文提出elasticAI.explorer,一个基于Optuna构建的可扩展Python框架,用于硬件感知NAS。该框架引入基于YAML的搜索空间规范,在采样过程中动态转换为可执行的神经网络模型。该方法支持逐层、基于单元和层次化搜索空间,同时保持优化和部署的统一接口。除了架构生成,该框架还集成了硬件特定代码生成、基于Docker的交叉编译工具链以及设备端基准测试二进制文件的自动创建,实现了硬件在环的NAS工作流。系统进一步提供可扩展的评估器,用于FLOPs、参数数量和延迟估计。elasticAI.explorer旨在减少嵌入式AI部署的工程开销,并加速面向异构加速器平台的硬件感知NAS研究。

英文摘要

Neural Architecture Search (NAS) has become an important approach for automatically designing neural networks under task-specific and hardware-specific constraints. However, many existing NAS frameworks tightly couple search space definitions, model implementations, and deployment pipelines, making extension to new hardware platforms and custom operators difficult. In this paper, we present the elasticAI.explorer, an extensible Python framework for hardware-aware NAS built on top of Optuna. The framework introduces a YAML-based search space specification that dynamically translates into executable neural network models during sampling. The approach supports layer-wise, cell-based, and hierarchical search spaces while maintaining a unified interface for optimization and deployment. Beyond architecture generation, the framework integrates hardware-specific code generation, Docker-based cross-compilation toolchains, and automated creation of on-device benchmarking binaries, enabling hardware-in-the-loop NAS workflows. The system further provides extensible evaluators for FLOPs, parameter count, and latency estimation. The elasticAI.explorer aims to reduce the engineering overhead of embedded AI deployment and accelerate research on hardware-aware NAS for heterogeneous accelerator platforms

2605.29994 2026-06-01 cs.AR

Precomputed 1D-CNNs for Atrial Fibrillation Detection on Tiny Smart Sensor Systems

用于微小智能传感器系统上房颤检测的预计算一维卷积神经网络

Lukas Einhaus, Natalie Maman, Julian Hoever, Andreas Erbslöh, Gregor Schiele

AI总结 本文提出一种基于分组卷积的预计算一维卷积神经网络块及其超参数选择算法,在MIT-BIH心电图数据集上实现房颤检测,仅用2844个LUT即可达到95%的F1分数。

Comments 8 pages, 6 figures, IEEE SMARTCOMP 2026 (accepted)

详情
AI中文摘要

一维卷积神经网络在微小智能传感器系统的时间序列分析中扮演关键角色,例如生物信号分析、预测性维护或结构健康监测。基于查找表的预计算已成为一种有趣的优化技术,用于在FPGA上实现此类神经网络。其核心思想是预计算神经网络层的所有可能输出,并将其直接存储在FPGA的查找表中。这使得网络具有极高的资源效率和超低延迟,但可扩展性较差。先前的工作探索了使用深度可分离卷积来提高可扩展性。在本文中,我们推广了这种方法,考虑了其他形式的分组卷积。基于此,我们提出了一种新型卷积块和一种指导该块超参数选择的算法。我们使用MIT-BIH数据库(心电图记录)在医学时间序列数据集上评估了我们的方法,用于预测房颤。生成的硬件加速器足够小,可以部署在AMD Spartan 7 S15上。它们实现了高达95%的F1分数,同时仅需要2,844个LUT,且无需DSP或BRAM。

英文摘要

1D-CNNs play a crucial role for time-series analysis on tiny smart sensor systems, e.g. for biosignal analysis, predictive maintenance, or structural health monitoring. LUTbased precomputation has emerged as an interesting optimization technique to implement such neural networks on FPGAs. The core idea is to precompute all possible outputs of a neural network layer and store them directly in the lookup tables of the FPGAs. This enables highly resource-efficient networks with ultra-low latency but suffers from poor scalability. Previous work has explored using depthwise-separable convolutions to improve scalability. In this paper, we generalize this approach to consider additional forms of grouped convolutions. Based on this, we propose a novel type of convolutional block and an algorithm to guide the choice of hyper parameters for this block. We evaluate our approach on a medical time-series dataset for predicting atrial fibrillation using the MIT-BIH database (ECG recordings). The resulting hardware accelerators are small enough to be deployed on an AMD Spartan 7 S15. They achieve a F1-Score of up to 95% while only requiring 2,844 LUTs and no DSPs or BRAM.

2605.29991 2026-06-01 math.CA math.CV

Complex spectrum of the partial theta function

部分theta函数的复谱

Boris Shapiro

AI总结 研究部分theta函数Θ(q,x)的复谱,证明单位圆上每点都是谱的聚点,并在固定子盘内通过截断-牛顿过程计算离散谱值,结合径向约定进行数值单值性实验。

Comments 24 pages, 6 figures. Abstract changed and a figure added

详情
AI中文摘要

我们研究部分theta函数\[ Θ(q,x)=\sum_{j=0}^{\infty}q^{j(j+1)/2}x^j, \qquad |q|<1 \]的复谱,其中谱值是使得\(Θ(q,\cdot)\)有重零点的参数。由于该函数仅在\(|q|<1\)内定义,所有谱值严格位于单位圆盘内部;边界点\(|q|=1\)仅作为谱的聚点出现。本文结合两种互补的观点。在单位圆附近,我们证明\(|q|=1\)上的每一点都是谱的聚点;证明使用了截断的显式谱因子、Jacobi三重积以及单位根附近的边界窗口提升论证。在固定子盘内(以\(|q|\leq 0.8\)为例),真实谱是局部有限的,必须小心地与截断和Jensen多项式的更大分支轨迹区分开。我们给出一种截断种子牛顿过程,生成候选谱值的离散列表,解释有限逼近中的焦散/逃逸根机制,并记录使用径向约定的数值单值性实验:对于谱点\(q_*\),在小圆上的点\(0.1q_*/|q_*|\)处标记根,然后沿直线径向段延拓至\(q_*\)。该约定在圆盘内给出碰撞标签的一致集合,从基点\(-0.1\)处理负实谱值,并导出径向单值性的初步有理方向启发式方法。

英文摘要

We study the complex spectrum of the partial theta function \[ Θ(q,x)=\sum_{j=0}^{\infty}q^{j(j+1)/2}x^j, \qquad |q|<1, \] where a spectral value is a parameter for which \(Θ(q,\cdot)\) has a multiple zero. Since the function is defined here only for \(|q|<1\), all spectral values are strictly inside the unit disk; boundary points on \(|q|=1\) occur only as accumulation points of the spectrum. The paper combines two complementary points of view. Near the unit circle we prove that every point of \(|q|=1\) is an accumulation point of the spectrum; the proof uses explicit spectral factors of truncations, the Jacobi triple product, and a boundary-window lifting argument near roots of unity. Inside a fixed subdisk, illustrated for \(|q|\leq 0.8\), the true spectrum is locally finite and must be separated carefully from the much larger branch loci of truncations and Jensen polynomials. We give a truncation-seeded Newton procedure which produces a discrete list of candidate spectral values, explain the caustic/escaping-root mechanism in finite approximants, and record numerical monodromy experiments using a radial convention: for a spectral point \(q_*\), roots are labelled at the point \(0.1q_*/|q_*|\) on the small circle and then continued along the straight radial segment to \(q_*\). This convention gives a coherent set of collision labels in the disk, treats negative real spectral values from the base point \(-0.1\), and leads to a preliminary rational-direction heuristic for radial monodromy.

2605.29947 2026-06-01 math.AG math.NT

Lift-independence problem in the $P$-adic Simpson correspondence for curves

曲线的 $P$-进 Simpson 对应中的提升独立性问题

Xiangyu Pan, Jiahong Yu

AI总结 研究曲线情形下 $p$-进 Simpson 对应中 Higgs 丛与 $v$-丛的等价性是否依赖于提升的选择,证明了小秩半稳定提升独立 Higgs 丛的 Higgs 场为零,并构造了非零 Higgs 场的半稳定提升独立 Higgs 丛。

Comments 29 pages. Some typos are corrected. Comments are welcome!

详情
AI中文摘要

设 $X$ 是完备代数闭 $p$-进域 $\mathbf C$ 上的真光滑刚性解析簇。固定 $\exp$ 的一个延拓 $\mathrm{Exp}$。Faltings(在曲线情形)和 Heuer 证明了 $X$ 在 $\mathbf{B}_{ m dR}^+/t^2$ 上的任意提升 $\widetilde X$ 诱导了 $X_{\mathrm{\acute{e}t}}$ 上 Higgs 丛范畴与 $X_v$ 上 $v$-丛范畴之间的等价。本文旨在研究当 $X$ 是亏格 $g\geqslant 2$ 的曲线时,该等价如何依赖于提升 $\widetilde X$ 的选择。更精确地说,我们称一个 Higgs 丛是提升独立的,如果它在 $p$-进 Simpson 对应下对于任意提升 $\widetilde X$ 总对应同一个 $v$-丛,然后我们将证明: (1) 存在某个 $r(g)\geqslant \sqrt{g-1}$,使得任意秩 $r\leqslant r(g)$ 的半稳定提升独立 Hitchin-小 Higgs 丛的 Higgs 场为零。 (2) 总存在一个次数为 $0$ 且具有非零 Higgs 场的半稳定 Higgs 丛是提升独立的。

英文摘要

Let $X$ be a proper smooth rigid analytic variety over a complete algebraically closed field $p$-adic field $\mathbf C$. Fix an continuation $\mathrm{Exp}$ of $\exp$. Faltings (in the curve case) and Heuer showed that any lifting $\widetilde X$ of $X$ over $\mathbf{B}_{\rm dR}^+/t^2$ induces an equivalence bewteen the category of Higgs bundles on $X_{\mathrm{\acute{e}t}}$ and the category of $v$-bundles on $X_v$. In this paper, we aim to study how the equivalence depends on the choice of such a lifting $\widetilde X$ when $X$ is a curve of genus $g\geqslant 2$. More precisely, we call a Higgs bundle lift-independent if it always corresponds to the same $v$-bundle under $p$-adic Simpson correspondence with respect to any lifting $\widetilde X$ and then we will show that (1) There exists some $r(g)\geqslant \sqrt{g-1}$ such that any semistable lift-independent Hitchin-small Higgs bundle of rank $r\leqslant r(g)$ has zero Higgs field. (2) There always exists a semistable Higgs bundle of degree $0$ with non-zero Higgs field that is lift-independent.

2605.29929 2026-06-01 physics.atom-ph quant-ph

Restoring Velocity Immunity via Dynamic Mirror Compensation in a Large-Area Dual-Atom-Interferometer Gyroscope

通过动态镜面补偿恢复大面积双原子干涉仪陀螺仪的速度免疫性

Jie Gu, Yin-fei Mao, Zhan-Wei Yao, An-qing Zhang, Si-Bin Lu, Shao-kang Li, Min Jiang, Xiao-Li Chen, Min Ke, Xi Chen, Run-Bing Li, Jin Wang, Ming-Sheng Zhan

AI总结 提出并演示了一种动态镜面补偿方案,通过主动旋转拉曼镜面消除科里奥利效应破坏的速度免疫性,在21.1 cm²干涉面积的双原子干涉仪陀螺仪上实现40倍速度依赖性降低和0.13 ppm的标度因子稳定性。

Comments 6 pages, 5 figures

详情
AI中文摘要

我们提出并演示了一种动态镜面补偿方案,用于恢复大面积双原子干涉仪陀螺仪的速度免疫性。在理想的马赫-曾德尔构型中,相位偏移对原子速度固有免疫,但这一性质被地球自转通过科里奥利效应破坏。我们通过在脉冲序列期间主动旋转拉曼镜面来抵消时间相关的角度偏移,从而克服这一问题。该实现依赖于一个可解耦的校准-补偿链来移除旋转引起的时间相关项。该方案在一个干涉面积为21.1 cm²的双原子干涉仪陀螺仪上得到验证。补偿后,相位对原子速度的依赖性降低了40倍,速度对标度因子稳定性的贡献评估为0.13 ppm。该传感器实现了1.3×10⁻⁸ rad/s/Hz^{1/2}的旋转灵敏度和在4500秒积分时间下1.9×10⁻¹⁰ rad/s的稳定性,以及在一次地震事件中高达459的共模噪声抑制比。这项工作消除了原子干涉仪陀螺仪中标度因子稳定化的一个关键障碍,为其在惯性导航和地球物理学中的应用铺平了道路。

英文摘要

We propose and demonstrate a dynamical mirror compensation scheme to restore velocity immunity in a large-area dual-atom-interferometer gyroscope. In an ideal Mach-Zehnder configuration, the phase shift is inherently immune to atomic velocity, but this property is broken by the Earth's rotation via the Coriolis effect. We overcome this by actively rotating the Raman mirrors during the pulse sequence to cancel the time-dependent angular offset. The implementation relies on a decouplable calibration-compensation chain to remove rotation-induced time-dependent terms. The scheme is validated on a dual-atom-interferometer gyroscope with an interference area of 21.1 cm^2. After compensation, the phase's dependence on atomic velocity is reduced 40-fold, and the velocity contribution to scale-factor stability is evaluated to be 0.13 ppm. The sensor achieves a rotation sensitivity of 1.3\times10^{-8} rad/s/Hz^{1/2} and a stability of 1.9\times10^{-10} rad/s at 4500 s integration, together with a common-mode noise rejection ratio of up to 459, demonstrated in a seismic event. This work removes a key obstacle to scale-factor stabilization in atom-interferometer gyroscopes and paves the way for their applications in inertial navigation and geophysics.

2605.29808 2026-06-01 eess.IV physics.app-ph

Absorption and Phase-Contrast Microtomography Using Direct X-ray Detection With COTS CMOS Sensors

使用商用CMOS传感器直接X射线检测的吸收和相位衬度显微断层成像

Damian L. Corzi, Jose Lipovetzky, Fabricio Alcalde Bessia, German Mato, Andres Cicuttin, Maria L. Crespo, Martin Perez, Mariano Gomez Berisso

AI总结 提出一种基于商用CMOS图像传感器作为直接探测器的高分辨率X射线显微断层成像系统,通过动态平场校正和锥束几何实现吸收衬度与传播相位衬度成像,以低成本替代同步辐射或纳米焦点系统。

Comments 8 pages, 15 figures

详情
AI中文摘要

本工作展示了一种高分辨率X射线显微断层成像系统,该系统使用商用现货(COTS)CMOS图像传感器作为直接探测器,依靠传感器的固有分辨率实现无光学元件的断层重建。系统采用锥束几何中的微焦点X射线源,实现吸收衬度和基于传播的相位衬度成像。动态平场校正算法减轻了长时间采集过程中辐射引起的退化,有助于克服消费级硬件的局限性。该装置提供3.9微米至5.2微米的体素尺寸。相位衬度可显示传统放射摄影无法检测的软组织边界。与同步辐射或纳米焦点系统相比,我们的解决方案更简单、成本更低,并且避免了复杂的光学器件或慢速扫描。COTS CMOS传感器似乎是实验室规模高分辨率显微断层成像的可行替代方案。

英文摘要

This work presents a high-resolution X-ray microtomography system that uses commercial off-the-shelf (COTS) CMOS image sensors as direct detectors, relying on the sensor s intrinsic resolution to achieve tomographic reconstructions without optical components. The system employs a microfocus X-ray source in cone-beam geometry, enabling both absorption-contrast and propagation-based phase-contrast imaging. A dynamic flat-field correction algorithm mitigates radiation-induced degradation during long acquisitions, helping to overcome limitations of consumer-grade hardware. The setup provides voxel sizes from 3.9 micron to 5.2 micron. Phase contrast visualizes soft tissue boundaries that would be undetectable by conventional radiography. Compared to synchrotron or nanofocus systems, our solution is simpler, lower-cost, and avoids complex optics or slow scans. COTS CMOS sensors appear as a viable alternative for laboratory-scale high-resolution microtomography.

2605.29772 2026-06-01 cs.NI

ARIADNE: AI-RAN Informed Link Adaptation in Digital Twin Network Environments

ARIADNE:数字孪生网络环境中基于AI-RAN的链路自适应

Maria Tsampazi, Neagin Neasamoni Santhi, Nicole Perrotta, Falko Dressler, Tommaso Melodia

AI总结 提出在线强化学习模块ARIADNE,集成于SIONNA框架实现链路自适应,在频谱效率上比行业标准和最先进方法分别提升11%和20%,并揭示RL学习的MCS选择策略与OLLA不同。

Comments 6 pages, 9 fugures

详情
AI中文摘要

人工智能驱动的无线接入网络(RAN)已引起工业界和学术界的广泛关注。同时,数字孪生为基于AI/机器学习的先进AI-RAN研究提供了安全的实验环境。通过在RAN部署前测试在线算法,数字孪生降低了与物理现场测试相关的成本和安全风险。在本文中,我们提出ARIADNE,一个基于在线强化学习(RL)的模块,它与SIONNA无缝集成,负责执行链路自适应。我们探索了不同的设计选择,并展示了ARIADNE如何在频谱效率上分别比行业标准和最先进方法提升高达11%和20%。最后,我们表明RL学习了一种调制与编码方案(MCS)选择策略,该策略偏离了外环链路自适应(OLLA),根据配置表现出更保守或更激进的行为,这一趋势通过离线训练第五代(5G)空中(OTA)测量数据得到进一步证实。

英文摘要

Artificial Intelligence (AI)-powered Radio Access Network (RAN) networks have attracted significant attention from both industry and academia. Meanwhile, Digital Twins offer a safe playground for experimenting with AI/Machine Learning (ML)-based solutions for advanced AI-RAN research. By enabling the testing of online algorithms before deployment on the RAN, they reduce costs and safety risks associated with physical field testing. In this article, we propose ARIADNE, an online Reinforcement Learning (RL)-based module that seamlessly integrates with SIONNA and is tasked with performing link adaptation. We explore different design choices and demonstrate how ARIADNE can surpass industry-standard and state-of-the-art methods by achieving up to 11% and 20% improvements in Spectral Efficiency, respectively. Finally, we show that RL learns a Modulation and Coding Scheme (MCS) selection strategy that diverges from Outer Loop Link Adaptation (OLLA), exhibiting either more conservative or more aggressive behavior depending on the configuration, a trend further corroborated by training offline on 5th generation (5G) over-the-air (OTA) measurements.

2605.29755 2026-06-01 cs.IR

Rec-Distill: An Industrial Distillation Pipeline for Large-Scale Recommendation Models

Rec-Distill:面向大规模推荐模型的工业蒸馏流水线

Haoran Ding, Wenlin Zhao, Yuchen Jiang, Juren Li, Jie Zhu, Xinchun Li, Yishujie Zhao, Yi Zhang, Ao Qiao, Jianhui Dong, Cheng Chen, Ziyan Gong, Deping Xie, Peng Xu, Zikai Wang, Yuwei Wang, Huizhi Yang, Zhe Chen, Yuchao Zheng

AI总结 针对大规模推荐模型在工业部署中效率与性能的权衡问题,提出Rec-Distill蒸馏流水线,通过解耦训练、黑盒蒸馏、去偏机制和混合批流流水线,将大教师模型的知识高效迁移至轻量学生模型,在真实场景中蒸馏迁移率超60%。

详情
AI中文摘要

大规模推荐模型在缩放定律下展现出巨大的潜在收益,但这些收益在工业推荐系统中难以实现,因为实际部署需要具有严格服务效率和延迟保证的轻量级模型。这造成了离线模型缩放与在线部署之间的根本差距。在这项工作中,我们提出了Rec-Distill,一种工业蒸馏流水线,将大规模推荐建模的性能收益转移到高效的推理服务模型中。Rec-Distill通过解耦训练、黑盒蒸馏、去偏机制以及面向动态推荐环境的混合批流流水线,将大教师模型缩放与学生端迁移优化相结合。在真实平台的多个推荐和广告场景中,我们的框架将教师模型扩展到240亿稠密参数和2万行为序列长度,同时使轻量级学生模型能够恢复教师收益的很大一部分,在最佳设置下蒸馏迁移率超过60%。大量的离线和在线实验进一步表明,这些迁移的收益在工业约束下持续转化为可衡量的业务改进。这些结果表明,Rec-Distill为将大规模推荐模型蒸馏为可部署、成本高效的推理服务系统提供了一个实用框架,同时也为未来将推荐模型扩展到更大规模建立了一条可靠路径。

英文摘要

Large recommendation models have demonstrated substantial potential gains under scaling laws, yet these gains are difficult to realize in industrial recommendation systems because real-world deployment requires lightweight models with strict serving efficiency and latency guarantees. This creates a fundamental gap between offline model scaling and online deployment. In this work, we present Rec-Distill, an industrial distillation pipeline that transfers the performance gains of large-scale recommendation modeling to efficient serving models. Rec-Distill combines large-teacher scaling with student-side transfer optimization through decoupled training, black-box distillation, debiasing mechanism, and a hybrid batch-streaming pipeline for dynamic recommendation environments. Across multiple recommendation and advertising scenarios on real-world platforms, our framework scales teacher models up to 24B dense parameters and 20K behavior sequence length, while enabling lightweight students to recover a substantial portion of teacher gains, with distillation transferability exceeding 60% in the best setting. Extensive offline and online experiments further show that these transferred gains consistently translate into measurable business improvements under industrial constraints. These results demonstrate that Rec-Distill provides a practical framework for distilling large-scale recommendation models into deployable, cost-efficient serving systems, while also establishing a reliable path toward scaling recommendation models to even larger regimes in the future.

2605.29752 2026-06-01 cs.PF cs.AR cs.DC

From Roofline to Ruggedness: Decomposing and Smoothing the GEMM Performance Landscape

从Roofline到Ruggedness:分解和平滑GEMM性能景观

Aditya Chatterjee

AI总结 针对GPU上GEMM性能的锯齿状波动(相邻问题吞吐量差异达30%),提出性能粗糙度分析框架,通过最佳动态分块选择和基于动态规划的填充分割优化器,将粗糙度降低70%并提升平均吞吐量30%。

详情
AI中文摘要

相邻的GEMM问题在N上仅差128步,在相同GPU上吞吐量可相差30%。这种普遍存在的性能粗糙度——对Roofline分析和峰值FLOPs直觉不可见,但对每个非峰值工作负载占主导——是本文的主题。 我们提出性能粗糙度分析作为Roofline的补充分析框架:不将GPU性能总结为标量界限,而是将完整的多维性能表面作为研究对象,将其纹理分解为可归因于机制的组件,并将软件可移除的贡献与硬件绑定的贡献分开。该框架直接类比深度学习损失景观——一个连续量(理想时间2MNK / 计算吞吐量峰值)因与离散硬件基板(分块、子组、缓存行、DRAM通道)的交互而变得粗糙。 我们通过32768个配置的扫描(M, N, K)属于{128, ..., 4096}^3,将框架应用于Intel Battlemage(Arc B580, sycl-tla)上的BF16 NN(无转置)GEMM。峰值在非正方形形状M=3840, N=2048, K=4096且默认分块大小时为110.8 TFLOPs;初始景观粗糙度为每128步16.8 TFLOPs,而理想值为2.0。一个两阶段软件栈——(i) 最佳六种动态分块选择和(ii) 一种新颖的基于动态规划的填充分割优化器,具有O(1)运行时查找——将粗糙度降低70%并将平均吞吐量提升30%。跨分块实验表明,残余锯齿周期与软件分块大小完全成比例,排除了缓存集冲突,并将剩余方差归因于四个硬件绑定来源(每内核基础开销、波前量化、DPAS原子几何和GDDR6通道哈希交互)。

英文摘要

Adjacent GEMM problems that differ by a single 128-element step in N can show 30% different throughput on the same GPU. This pervasive performance ruggedness - invisible to roofline analysis and peak-FLOPs intuition, yet dominant for every non-peak workload - is the subject of this paper. We propose performance ruggedness analysis as an analytical framework complementary to roofline: rather than summarizing GPU performance with a scalar bound, treat the full multidimensional performance surface as the object of study, decompose its texture into mechanism-attributable components and separate software-removable contributions from hardware-bound ones. The framing is directly analogous to deep-learning loss landscapes - a continuous quantity (the idealized time 2MNK / compute_throughput_peak) made rugged by interaction with discrete hardware substrates (tiles, sub-groups, cache lines, DRAM channels). We apply the framework to BF16 NN (no transpose) GEMM on Intel Battlemage (Arc B580, sycl-tla) via a 32,768-configuration sweep (M, N, K) belongs to {128, ..., 4096}^3. The peak is 110.8 TFLOPs at the non-square shape M=3840, N=2048, K=4096 with the default tile size; the initial landscape roughness is 16.8 TFLOPs per 128-step against an ideal of 2.0. A two-stage software stack - (i) best-of-six dynamic tile selection and (ii) a novel dynamic-programming based padding-and-splitting optimizer with O(1) runtime lookup - reduces roughness by 70% and raises mean throughput by 30%. Cross-tile experiments establish that the residual sawtooth period scales exactly with software tile size, ruling out cache set conflicts and attributing the remaining variance to four hardware-bound sources (per-kernel base overhead, wave quantization, DPAS atom geometry and GDDR6 channel-hash interactions).

2605.29699 2026-06-01 physics.app-ph

Decoupling heat and electricity: A thermal invisible gateway

解耦热与电:一种热隐形通道

Jiahao Li, Fei Sun, Yichao Liu, Yawen Qi, Qin Liao, Jianpu Yang, Zhiru Xie

AI总结 通过设计基于主动热超表面的热隐形通道,利用宏观结构解耦热与电的传输,实现了近零热导率与金属级电导率的共存。

Comments 19 pages, 4 figures

详情
AI中文摘要

Wiedemann-Franz定律将电导率和热导率耦合在一起,使得同时实现高电导率和低热导率成为一项重大挑战。为了克服这一难题,我们设计了一种基于主动热超表面(ATMS)的热隐形通道,该通道解耦了热和电的传输路径。该结构以铜为基底,采用哑铃形桥接结构,通过定向补偿抑制热流,同时允许电导无阻碍通过。室温实验表明,其有效热导率低于10^-3 W m^-1 K^-1(接近零,类似空气绝缘),而电导率高达2.8×10^7 S m^-1(金属级别)。与传统的材料改性方法不同,我们的工作利用宏观结构设计打破了内在耦合,为片上互连和可穿戴电子等应用提供了有前景的解决方案。

英文摘要

The Wiedemann-Franz law couples electrical and thermal conductivity, making high electrical conduction with low thermal conduction a major challenge. To overcome this, we designed an active thermal metasurface (ATMS) - based thermal invisible gateway that decouples thermal and electrical paths. Built on a copper substrate with a dumbbell-shaped bridge, the structure suppresses heat flow via directional compensation while allowing unimpeded electrical conduction. Room-temperature experiments show an effective thermal conductivity below 10^-3 W m^-1 K^-1 (near zero, air-like insulation) and an electrical conductivity up to 2.8x10^7 S m^-1 (metal-level). Unlike conventional material-modification approaches, our work uses macroscopic structural design to break the intrinsic coupling, offering a promising solution for applications like on-chip interconnects and wearable electronics.

2605.29644 2026-06-01 astro-ph.HE astro-ph.IM

Enhanced All-Distance Equi-Zenith Angle Method for Cosmic-Ray Anisotropy Measurement

用于宇宙射线各向异性测量的增强型全距离等天顶角方法

Dong-Xu Sun, Dan Li, Wei Liu, Qiang Yuan, Yi-Qing Guo, Hong-bo Hu

AI总结 提出一种增强型全距离等天顶角方法,通过同时测量多个时间框架的各向异性并从数据中直接确定探测效率,克服了探测器效率不稳定带来的挑战,实现了高精度宇宙射线各向异性测量。

Comments 16 pages, 7 figures

详情
AI中文摘要

长期观测表明,能量低于约1 PeV的宇宙射线各向异性相对强度保持在0.1%以下。测量如此微弱的信号在数据分析中构成重大挑战,需要仔细去除仪器和大气伪影。全距离等天顶角方法被广泛用于提取宇宙射线各向异性,因为它能有效抑制仪器和大气引起的瞬时变化。然而,探测器效率的不稳定性使得用该方法精确测量各向异性变得困难。在这项工作中,我们提出了一种用于宇宙射线各向异性测量的增强型全距离等天顶角方法。与之前的实现不同,我们的改进方法能够同时测量多个时间框架的各向异性,并允许直接从数据中确定探测效率。这一特性使得该方法特别适用于探测器阵列无法长期稳定运行的应用场景,从而能够实现高精度的各向异性测量。此外,当数据不跨越完整的回归年时,我们的增强方法也是可行的。

英文摘要

Long-term observations indicate that the relative intensity of cosmic-ray anisotropy remains below $0.1\%$ for energies less than $\sim 1$ PeV. Measuring such faint signals poses a significant challenge in data analysis, requiring careful removal of instrumental and atmospheric artifacts. The all-distance equi-zenith angle method is widely employed to extract cosmic-ray anisotropies, as it effectively suppresses the instantaneous variations arising from the instrument and atmosphere. \textcolor{black}{However, instability in the detector efficiency makes precise measurements of anisotropy challenging with this method.} In this work, we present an enhanced all-distance equi-zenith angle method for cosmic-ray anisotropy measurement. Unlike previous implementations, our improved approach enables the simultaneous measurement of anisotropies over multiple time frames and allows the detection efficiency to be determined directly from the data. This feature makes the method especially suitable for applications where the detector array does not operate with long-term stability\textcolor{black}{, and thus allows for the measurement of anisotropy with high-precision}. Moreover, our enhanced method is also feasible when the data do not span complete tropical years.

2605.29603 2026-06-01 stat.ME

Learning study similarity to investigate heterogeneity in meta-analysis using LLMs and triplet loss

利用大语言模型和三重损失学习研究相似性以探究荟萃分析中的异质性

Kanella Panagiotopoulou, Harald Binder, Theodoros Evrenoglou

AI总结 提出一种结合大语言模型与深度度量学习的框架,通过三重损失训练嵌入模型,在研究层面推断相似性并聚类,以在荟萃分析前识别同质子群,减少异质性并提高估计精度。

Comments 17 pages, 4 figures

详情
AI中文摘要

观察性研究的荟萃分析通常显示研究间存在显著异质性,限制了合并估计的可解释性。元回归可用于探索异质性,但往往难以处理多个效应修饰因子。我们提出一种新颖框架,将大语言模型与深度度量学习相结合,在荟萃分析前推断研究层面的相似性。研究层面的临床和方法学特征由大语言模型处理,生成研究三元组(锚点、相似、不相似)。这些三元组通过将每项研究视为锚点,并与其他研究对进行比较,以识别每次实例中与锚点最相似的研究。然后,三元组用于训练三重损失的嵌入模型,这是一种深度学习方法,学习一个嵌入空间,其中临床和方法学相似的研究聚集在一起。我们将该框架应用于一个包含58项观察性研究的荟萃分析数据集,比较早产儿和足月儿儿童的认知结果。随后,我们在识别出的研究聚类内拟合荟萃分析模型,并将结果与整体分析进行比较。结果提示三个聚类,其中两个保留了相当大的研究间异质性。剩余聚类包含最同质的研究组,与整体分析相比,显示出更极端的合并效应估计和更窄的预测区间。这项工作提出了一种新颖方法,通过在模型拟合前纳入研究特征来探索荟萃分析中的异质性。通过将研究信息转化为相似性空间,该框架识别出同质子群,并支持在异质性真实世界证据中进行更精确的推断。

英文摘要

Meta-analyses of observational studies often show substantial between-study heterogeneity, limiting the interpretability of pooled estimates. Meta-regression can be used to explore heterogeneity, but it is often underpowered to handle multiple effect modifiers. We propose a novel framework that integrates large language models (LLMs) with deep metric learning to infer study-level similarity prior to meta-analysis. Study-level clinical and methodological characteristics were processed by an LLM to generate study triplets (anchor, similar, dissimilar). These triplets were constructed by treating each study as an anchor and comparing it with pairs of other studies to identify, in each instance, the study most similar to the anchor. Then, the triplets were used into an embedding model trained with triplet loss; a deep learning approach that learns an embedding space where clinically and methodologically similar studies are clustered together. We apply our framework to a meta-analysis dataset of 58 observational studies comparing cognitive outcomes between preterm- and term-born children. Subsequently, we fit meta-analysis models within the identified study clusters and compare the results with those of the overall analysis. Results suggested three clusters two of which retained considerable between-study heterogeneity. The remaining cluster comprised the most homogeneous group of studies and exhibited a more extreme pooled effect estimate together with a narrower prediction interval compared with the overall analysis. This work presents a novel approach for exploring heterogeneity in meta-analysis by incorporating study characteristics prior to model fitting. By transforming study information into a similarity space, the framework identifies coherent subgroups and supports more precise inference in heterogeneous real-world evidence.

2605.28373 2026-06-01 astro-ph.IM

FARSim: a fast RF-chain-aware trigger-screening surrogate for radio detection of ultra-high-energy cosmic rays

FARSim:一种用于超高能宇宙射线无线电探测的快速射频链感知触发筛选代理

Xin Xu, Pengfei Zhang, Fufu Yang, Pengxiong Ma, Ramesh Koirala, Chao Zhang

AI总结 提出FARSim代理框架,通过复用ZHAireS参考足迹库重建地面无线电发射并估计触发相关可观测值,实现快速布局筛选,验证显示计算效率显著提升且波形重建中位R²达0.986。

Comments 34 pages, 23 figures

详情
AI中文摘要

无线电阵列为探测超高能宇宙射线产生的大气簇射提供了可扩展的途径,但当每个能量、到达方向、核心位置和触发配置都通过完整的无线电模拟进行评估时,候选布局的触发研究会非常耗时。我们提出了FARSim,一个快速的代理框架,它通过复用简化的ZHAireS参考足迹库来重建地面无线电发射并估计触发相关的可观测值。该方法结合了矢量地磁和电荷过剩场分解、地磁角度和能量缩放、几何投影、基于轮廓的核心采样以及事件率积分。我们针对专用的ZHAireS模拟验证了重建的场足迹和触发区域,并量化了通过用快速足迹查询替换重复的完整簇射模拟所获得的计算增益。我们进一步将峰值场代理扩展到时域电场合成,通过将预测的三分量峰值场矢量与几何依赖的归一化脉冲模板相结合。将这些迹线通过射频链响应传播,可以实现电压域阈值和L1触发诊断。对于此处考虑的验证样本,在真实峰值幅度下测试时,时域扩展在2112个保留的ZHAireS迹线上达到了中位矢量波形R²为0.986。因此,FARSim旨在作为阵列布局和触发研究的快速、物理感知筛选层;绝对曝光预测和探测器调试仍然是完整端到端模拟的职责。

英文摘要

Radio arrays provide a scalable route to detecting extensive air showers from ultra-high-energy cosmic rays, but trigger studies for candidate layouts are expensive when every energy, arrival direction, core position and trigger configuration is evaluated with full radio simulations. We present FARSim, a fast surrogate framework that reuses a reduced library of ZHAireS reference footprints to reconstruct ground-plane radio emission and to estimate trigger-relevant observables. The method combines vector geomagnetic and charge-excess field decomposition, geomagnetic-angle and energy scaling, geometrical projection, contour-based core sampling and event-rate integration. We validate the reconstructed field footprints and trigger regions against dedicated ZHAireS simulations, and quantify the computational gain obtained by replacing repeated full shower simulations with fast footprint queries. We further extend the peak-field surrogate to time-domain electric-field synthesis by combining the predicted three-component peak-field vector with geometry-dependent normalized pulse templates. Propagating these traces through an RF-chain response enables voltage-domain threshold and L1-trigger diagnostics. For the validation samples considered here, the time-domain extension reaches a median vector-waveform R^2 of 0.986 over 2112 held-out ZHAireS traces when tested at the true peak amplitude. FARSim is therefore intended as a rapid, physics-informed screening layer for array-layout and trigger studies; absolute exposure predictions and detector commissioning remain the role of full end-to-end simulations.

2605.25090 2026-06-01 cs.IT math.CO math.IT

Improved Johnson-type Bounds for Insertion-Deletion Codes

插入-删除码的改进Johnson型界

Yulin Yang

AI总结 通过将每个局部列表编码为二进制常重码,改进了Hayashi和Yasunaga关于插入-删除码的Johnson型界,并利用McEliece-Rodemich-Rumsey-Welch界得到渐近速率界。

详情
AI中文摘要

我们通过将每个局部列表编码为二进制常重码,改进了Hayashi和Yasunaga关于插入-删除码的Johnson型界。得到的局部列表大小界在足够大的字母表下是紧的。将McEliece--Rodemich--Rumsey--Welch界应用于该常重公式,得到在非平凡范围内严格优于Yasunaga的Elias型界的渐近速率界。

英文摘要

We improve upon the Johnson-type bound of Hayashi and Yasunaga for insertion-deletion codes by encoding each local list into a binary constant-weight code. The resulting local list-size bound is tight for sufficiently large alphabets. Applying the McEliece--Rodemich--Rumsey--Welch bound to this constant-weight formulation yields an asymptotic rate bound that strictly improves on Yasunaga's Elias-type bound in the nontrivial range.

2605.17550 2026-06-01 hep-th

Krylov Correlators in $\mathfrak{sl}(2,\mathbb R)$ Models: Exact Results and Holographic Complexity

Krylov 关联函数在 $\mathfrak{sl}(2,\mathbb R)$ 模型中的精确结果与全息复杂度

Eleonora Alfinito, Matteo Beccaria

AI总结 本文通过推导具有 $\mathfrak{sl}(2,\mathbb{R})$ 或 Heisenberg-Weyl 对称性的量子系统中 Krylov 关联函数的精确结果,并将其应用于复杂度-动量对应关系,展示了某些不同时间的 Krylov 速度算子的时序关联函数与落入 AdS$_3$ BTZ 黑洞的粒子的径向动量组合成正比,从而推广了原始的复杂度-动量关系。

Comments 33 pages. v4: more references

详情
AI中文摘要

在全息对偶中,复杂度-动量对应关系将落入永恒黑洞的质点动量的增加与对偶边界态的 Krylov 复杂度的增长率联系起来,这一猜想在半经典层次上对 AdS$_3$ 中的 BTZ 黑洞已得到精确验证。我们通过考虑边界高阶 Krylov 复杂度和编码扩展量子态涨落及时间关联的 Krylov 关联函数,探讨了该对应关系的可能扩展。为此,我们推导了具有 $\mathfrak{sl}(2,\mathbb{R})$ 或 Heisenberg-Weyl 对称性的量子系统中 Krylov 关联函数的精确结果,并将其应用于复杂度-动量对应关系。我们证明,某些不同时间的两个或更多 Krylov 速度算子的时序关联函数与落入 AdS$_3$ BTZ 黑洞的粒子在这些时刻的径向动量组合成正比。这代表了原始复杂度-动量关系推广的第一步。

英文摘要

In holography, the complexity--momentum correspondence relates the increasing momentum of a point particle falling into an eternal black hole to the rate of growth of the Krylov complexity of the dual boundary state, a conjecture established exactly for the BTZ black hole in AdS$_{3}$ at the semiclassical level. We examine possible extensions of the correspondence by considering boundary higher Krylov complexities and Krylov correlators encoding fluctuations and temporal correlations of the spreading quantum state. To this end, we derive exact results for Krylov correlators in quantum systems with $\mathfrak{sl}(2,\mathbb{R})$ or Heisenberg-Weyl symmetry and apply them to the complexity--momentum correspondence. We show that certain out-of-time-ordered correlators of two or more Krylov speed operators at different times are proportional to combinations of the proper radial momenta of a particle falling into the BTZ black hole in AdS$_{3}$, evaluated at those times. This represents a first step in the generalization of the original complexity--momentum relation.

2603.02730 2026-06-01 cs.IR

APAO: Adaptive Prefix-Aware Optimization for Generative Recommendation

APAO:面向生成式推荐的自适应前缀感知优化

Yuanqing Yu, Yifan Wang, Weizhi Ma, Zhiqiang Guo, Min Zhang

AI总结 针对生成式推荐中训练与推理不一致的问题,提出自适应前缀感知优化框架,通过前缀级损失和自适应最差前缀优化策略,提升模型在束搜索下的候选保留能力。

Comments Accepted by KDD'26

详情
AI中文摘要

生成式推荐最近作为一种有前景的序列推荐范式出现。它将任务表述为自回归生成过程,根据用户交互历史预测下一个项目的标记。现有的生成式推荐模型通常使用交叉熵损失等标记级似然目标进行训练,同时在推理过程中使用束搜索来生成排序候选。然而,这导致了根本性的训练-推理不一致:标准训练假设真实标记始终可用,而束搜索在推理过程中会剪枝低概率分支,导致当正确项目的前缀得分较低时,该项目被过早丢弃。为了解决这个问题,我们提出了自适应前缀感知优化(APAO)框架,该框架引入前缀级优化损失,以更好地将训练目标与推理设置对齐。此外,我们设计了一种自适应最差前缀优化策略,在训练过程中动态关注最脆弱的前缀,从而增强模型在束搜索约束下保留正确候选的能力。我们提供了理论分析来证明我们框架的有效性和效率。大量实验表明,APAO 持续缓解了训练-推理不一致,并提高了生成式推荐骨干网络的性能。源代码可在 https://github.com/yuyq18/APAO 公开获取。

英文摘要

Generative recommendation has recently emerged as a promising paradigm for sequential recommendation. It formulates the task as an autoregressive generation process, predicting tokens of the next item conditioned on user interaction histories. Existing generative recommendation models are typically trained with token-level likelihood objectives such as cross-entropy loss, while employing beam search during inference to generate ranked candidates. However, this leads to a fundamental training-inference inconsistency: standard training assumes ground-truth tokens are always available, while beam search prunes low-probability branches during inference, causing the correct item to be prematurely discarded when its prefixes receive low scores. To address this issue, we propose the Adaptive Prefix-Aware Optimization (APAO) framework, which introduces prefix-level optimization losses to better align the training objective with the inference setting. Furthermore, we design an adaptive worst-prefix optimization strategy that dynamically focuses on the most vulnerable prefixes during training, thereby enhancing the model's ability to retain correct candidates under beam search constraints. We provide theoretical analyses to demonstrate the effectiveness and efficiency of our framework. Extensive experiments show that APAO consistently alleviates the training-inference inconsistency and improves performance across generative recommendation backbones. The source code is publicly available at https://github.com/yuyq18/APAO.

2512.03116 2026-06-01 stat.ME stat.ML

Assessing Extrapolation of Peaks Over Thresholds with Martingale Testing

使用鞅检验评估超过阈值的峰值的外推能力

Joseph de Vilmarest, Olivier Wintenberger

AI总结 针对极端降水事件概率估计问题,提出基于极值理论的单变量阈值峰值模型,并创新性地使用鞅检验评估外推能力以无偏选择高分位数水平。

详情
AI中文摘要

我们提出了EVA2025数据挑战赛的获胜策略,该挑战旨在估计极端降水事件的概率。这些事件在数据集中最多出现一次,使得挑战本质上是极端值的外推问题。鉴于极端事件的稀缺性,我们认为采用简单、稳健的建模方法至关重要。我们采用单变量模型而非多变量模型,并使用极值理论对超过阈值的峰值进行建模。具体而言,我们拟合指数分布来模拟目标变量在高分位数(经季节调整后)之上的超出量。我们方法的新颖之处在于使用鞅检验来评估该过程的外推能力,并以无偏方式选择高分位数的水平。尽管该方法存在若干局限性,但我们相信将外推视为一种博弈为极值分析中的其他无偏方法打开了大门。

英文摘要

We present the winning strategy for the EVA2025 Data Challenge, which aimed to estimate the probability of extreme precipitation events. These events occurred at most once in the dataset making the challenge fundamentally one of extrapolating extreme values. Given the scarcity of extreme events, we argue that a simple, robust modeling approach is essential. We adopt univariate models instead of multivariate ones and model Peaks Over Thresholds using Extreme Value Theory. Specifically, we fit an exponential distribution to model exceedances of the target variable above a high quantile (after seasonal adjustment). The novelty of our approach lies in using martingale testing to evaluate the extrapolation power of the procedure and to agnostically select the level of the high quantile. While this method has several limitations, we believe that framing extrapolation as a game opens the door to other agnostic approaches in Extreme Value Analysis.

2510.25658 2026-06-01 physics.comp-ph physics.chem-ph

Optical excitations in nanographenes from the Bethe-Salpeter equation and time-dependent density functional theory: absorption spectra and spatial descriptors

纳米石墨烯中的光学激发:基于Bethe-Salpeter方程和含时密度泛函理论的吸收光谱与空间描述符

Maximilian Graml, Jan Wilhelm

AI总结 通过实现GW-BSE方法并应用于纳米石墨烯,研究其光学吸收光谱和激发尺寸,与实验高度一致,并指出含时密度泛函理论无法同时准确描述激发尺寸和光谱。

详情
Journal ref
Phys. Rev. B 113, 205152 (2026)
AI中文摘要

GW加Bethe-Salpeter方程(GW-BSE)形式是计算分子、纳米结构和晶体材料激发能和光学光谱的成熟方法。我们在CP2K代码中实现了GW-BSE,并在标准有机分子测试集上验证了该实现,与参考数据高度一致,激发能的平均绝对误差低于3 meV。然后,我们研究了长度递增的纳米石墨烯的光学光谱,与实验高度一致。我们进一步计算了最低光学活性激发的尺寸,随着长度增加收敛到约7.6 Å。与使用不同精确交换比例的泛函的含时密度泛函理论比较表明,没有一种泛函能同时再现GW-BSE的激发尺寸和光学光谱,这强调了多体方法对于准确描述纳米结构中电子激发的必要性。

英文摘要

The GW plus Bethe-Salpeter equation (GW-BSE) formalism is a well-established approach for calculating excitation energies and optical spectra of molecules, nanostructures, and crystalline materials. We implement GW-BSE in the CP2K code and validate the implementation for a standard organic molecular test set, obtaining excellent agreement with reference data, with a mean absolute error in excitation energies below 3 meV. We then study optical spectra of nanographenes of increasing length, showing excellent agreement with experiment. We further compute the size of the excitation of the lowest optically active excitation which converges to about 7.6 $Å$ with increasing length. Comparison with time-dependent density functional theory using functionals of varying exact-exchange fraction shows that none reproduce both the size of the excitation and optical spectra of GW-BSE, underscoring the need for many-body methods for accurate description of electronic excitations in nanostructures.

2509.15931 2026-06-01 hep-lat hep-ex hep-ph

A Unified Neural-Network Framework for Nucleon Imaging from Numerical Simulations of QCD

基于QCD数值模拟的核子成像统一神经网络框架

Min-Huan Chu, Krzysztof Cichy, Martha Constantinou, Paweł Sznajder, Jakub Wagner

AI总结 提出统一神经网络方法,同时拟合动量空间和坐标空间数据,从格点QCD矩阵元中提取部分子分布函数和广义部分子分布,实现核子层析成像。

Comments 52 pages, 21 figures

详情
AI中文摘要

部分子分布编码了强子内部夸克和胶子的动量空间结构及其广义形式下的空间层析成像,强子是可见物质的构建块。我们提出了一种统一的神经网络方法,通过同时拟合两种互补输入——通过已知动量空间和坐标空间形式论匹配到物理量的数据——直接从通过晶格量子色动力学(QCD)数值模拟计算的矩阵元中学习这些分布。利用两种方法的数据稳定了提取过程,并减轻了单独使用任一方法可能产生的偏差。我们在受控模拟数据上验证了该方法,并将其应用于格点QCD矩阵元以提取部分子分布函数(PDFs)。我们展示了这种方法在确定物理量方面的优势。我们进一步将该框架扩展到零斜率的广义部分子分布,并在相同的神经网络参数化下演示了核子层析成像。我们的结果为从欧几里得关联子中提取部分子分布提供了一种可适应且系统可改进的方法。它可以纳入极化、额外通道以及来自当前和未来设施(如电子-离子对撞机)的实验约束。

英文摘要

Parton distributions encode the momentum-space structure and, in their generalizations, the spatial tomography of quarks and gluons inside hadrons, the building blocks of visible matter. We present a unified neural-network approach that learns these distributions directly from matrix elements calculated via numerical simulations of quantum chromodynamics (QCD) on the lattice by fitting two complementary inputs simultaneously: data matched to physical quantities via known momentum-space and coordinate-space formalisms. Utilizing data from both methods stabilizes the extraction and mitigates biases that can arise when either is used alone. We validate the method on controlled mock data and apply it to lattice-QCD matrix elements to extract parton distribution functions (PDFs). We show benefits of such an approach for determining the physical quantities. We further extend the framework to zero-skewness generalized parton distributions and demonstrate nucleon tomography within the same neural-network parameterization. Our results provide an adaptable and systematically improvable approach for extracting partonic distributions from Euclidean correlators. It can incorporate polarization, additional channels, and future experimental constraints from current and future facilities, such as the Electron-Ion Collider.

2502.15467 2026-06-01 quant-ph

Entanglement corner dependence in two-dimensional systems: A tensor network perspective

二维系统中的纠缠角落依赖性:张量网络视角

Noa Feldman, Moshe Goldstein

AI总结 通过无限投影纠缠对态(iPEPS)的几何结构,研究离散格点上纠缠熵的角落依赖性,并建立与连续场论预测的直接联系。

Comments 6 pages, 7 figures, comments are welcome

详情
AI中文摘要

在连续量子场论中,边界具有尖锐角落的子系统的纠缠熵表现出普遍的角落依赖性贡献。我们通过离散化的视角研究这一贡献,并证明这种角落依赖性自然地从离散格点上的无限投影纠缠对态(iPEPS)的几何结构中涌现。通过严格的计数论证,我们表明iPEPS表示的键维数表现出一个角落依赖项,与有能隙连续系统中的预测项相匹配。关键的是,我们发现这种对应关系仅在平均所有可能的格点方向和原点位置时出现,揭示了正确离散化连续系统的基本要求。我们的结果为纠缠角落定律提供了几何理解,并建立了连续场论预测与离散张量网络表示结构之间的直接联系。我们将分析扩展到规范不变系统,其中二分边界穿过的格点角落贡献了一个额外的角落依赖项。这些发现为连续和离散量子系统之间的纠缠关系提供了新的见解。

英文摘要

In continuous quantum field theories, the entanglement entropy of a subsystem with sharp corners on its boundary exhibits a universal corner-dependent contribution. We study this contribution through the lens of lattice discretization, and demonstrate that this corner dependence emerges naturally from the geometric structure of infinite projected entangled pair states (iPEPS) on discrete lattices. Using a rigorous counting argument, we show that the bond dimension of an iPEPS representation exhibits a corner-dependent term that matches the predicted term in gapped continuous systems. Crucially, we find that this correspondence only emerges when averaging over all possible lattice orientations and origin positions, revealing a fundamental requirement for properly discretizing continuous systems. Our results provide a geometric understanding of entanglement corner laws and establish a direct connection between continuum field theory predictions and the structure of discrete tensor network representations. We extend our analysis to gauge-invariant systems, where lattice corners crossed by the bipartition boundary contribute an additional corner-dependent term. These findings offer new insights into the relationship between entanglement in continuous and discrete quantum systems.

2309.08735 2026-06-01 math.GT math.GR

On groups with Schottky set boundary

关于具有Schottky集边界的群

Peter Haïssinsky, Luisa Paoluzzi, Genevieve Walsh

AI总结 研究边界为Schottky集的相对双曲群对,并刻画其边界Schottky集的关联图具有1或2个分支的群。

Comments 31 pages

详情
Journal ref
Math. Z., 313 (2026), no. 2, Paper No. 28
AI中文摘要

我们研究边界为Schottky集的相对双曲群对。我们刻画了那些边界中Schottky集的关联图具有1个或2个分支的群。

英文摘要

We study relatively hyperbolic group pairs whose boundaries are Schottky sets. We characterize the groups that have boundaries where the Schottky sets have incidence graphs with 1 or 2 components.

2605.29276 2026-06-01 astro-ph.CO

A Designer's Guide to Lunar Far-Side Interferometer Array: Power Spectrum Measurement and Cosmological Constraints from the Dark Ages

月球背面干涉仪阵列设计指南:黑暗时代的功率谱测量与宇宙学约束

Yuewei Wen, Bin Yue, Yidong Xu, Furen Deng, Chen Zhang, Fengquan Wu, Xuelei Chen

AI总结 本文通过量化月球背面干涉仪阵列的设计要求,预测了利用21厘米功率谱测量约束暴胀模型的能力,并分析了热噪声对高红移和小尺度模式的侵蚀效应。

Comments 39 pages, 15 figures

详情
AI中文摘要

宇宙黑暗时代中性氢的21厘米发射线可以作为宇宙学模型和早期宇宙物理的有力探针。本文为月球背面干涉仪阵列的设计要求提供了定量预测,旨在测量21厘米功率谱并通过谱指数运行$α_s$约束暴胀模型。在黑暗时代,较大的坍缩天体尚未形成,使得线性扰动理论在比当前大尺度结构或CMB巡天所能达到的更小尺度上仍然有效。我们首先通过量化迷你暗晕对21厘米信号的贡献来验证这一线性假设。然后,我们建立了一个通用且灵活的分析框架,用于由任意数量的台站或子阵列组成的干涉仪的基线密度分布。结合现实噪声模型,我们确定了达到探测阈值所需的配置,并证明将总收集面积分配到多个台站可以将可调感兴趣小尺度上的功率谱信噪比提高多达两个数量级。接着,我们展示月球阵列需要至少约30,000个探测到的傅里叶模式才能实现对暴胀的约束$σ(α_s) = 0.034$,这一结果可与Planck 2018结果竞争,并能够区分不同的暴胀情景。我们定量解释了热噪声如何严重侵蚀高红移和小尺度上的模式——这些尺度在文献中先前被认为黑暗时代观测容易达到——并讨论了尽管存在这一限制,黑暗时代观测作为新的独立探针的前景。

英文摘要

The 21-cm emission line from neutral hydrogen during the cosmic Dark Ages can be a powerful probe of cosmological models and early universe physics. This work provides a quantitative forecast for the design requirements of a lunar far-side interferometer array aimed at measuring the 21-cm power spectrum and constraining inflationary models through the running of the spectral index $α_s$. During the Dark Ages, larger collapsed objects have not yet formed, allowing linear perturbation theory to remain valid down to much smaller scales than is possible in current large-scale structure or CMB surveys. We first validate this linearity assumption by quantifying the contribution of minihalos to the 21-cm signal. We then establish a generalized and flexible analytical framework for the baseline density distribution of interferometers that may consist of an arbitrary number of stations or sub-arrays. Incorporating a realistic noise model, we determine the configurations necessary to reach the detection threshold and demonstrate that distributing the total collecting area into multiple stations can improve the signal-to-noise ratio of the power spectrum at a tunable small scale of interest by up to two orders of magnitude. We then show that a lunar array requires at least $\sim30,000$ probed Fourier modes to achieve a constraint on inflation of $σ(α_s) = 0.034$, a result competitive with the Planck 2018 results and capable of distinguishing between different inflationary scenarios. We quantitatively explain how thermal noise severely erodes modes at high redshifts and small scales -- scales previously considered easily accessible to Dark Ages observations in the literature -- and discuss the prospects for Dark Ages observations as a new and independent probe despite this limitation.

2605.29085 2026-06-01 eess.SP

Dimming Space-Time Code (DSTC) for Visible Light Communication with Semi-Blind Detection

用于可见光通信的调光空时码(DSTC)与半盲检测

Igor S. C. Rodrigues, Leandro R. Ximenes, André L. F. de Almeida

AI总结 针对CSK可见光通信中的调光问题,提出一种调光空时码(DSTC),通过结构化调光矩阵控制时域功率变化,并设计基于张量的半盲接收机,在保持色度同时降低训练开销,实现分集增益和误码率降低。

详情
AI中文摘要

可见光通信(VLC)为无线数据传输和照明提供了统一框架,但其实际部署需要同时满足通信和照明约束的传输方案。在色移键控(CSK)系统中,调光仍然是一个具有挑战性且尚未充分探索的问题,因为必须在改变感知色度的情况下控制平均光功率。本文提出了一种用于基于CSK的VLC系统的调光空时码(DSTC),其中结构化的调光矩阵在满足物理可行性、颜色保持和可识别性条件的同时,引入受控的时域功率变化。开发了两种接收机架构:导频辅助的迫零(ZF)接收机和基于张量的半盲PARAFAC接收机,后者仅使用一个训练时隙即可联合估计信道和传输符号。仿真结果表明,与传统的CSK相比,所提出的DSTC提供了分集增益和显著的误码率降低,而基于张量的接收机通过减少训练开销提高了频谱效率,在大规模MIMO配置中尤其有益。

英文摘要

Visible light communication (VLC) provides a unified framework for wireless data transmission and illumination, but its practical deployment requires transmission schemes that jointly satisfy communication and lighting constraints. In color-shift keying (CSK) systems, dimming remains a challenging and underexplored problem because the average optical power must be controlled without altering the perceived chromaticity. This paper proposes a dimming space-time code (DSTC) for CSK-based VLC systems, where a structured dimming matrix introduces controlled temporal power variations while satisfying physical feasibility, color preservation, and identifiability conditions. Two receiver architectures are developed: a pilot-assisted zero-forcing (ZF) receiver and a tensor-based semi-blind PARAFAC receiver that jointly estimates the channel and transmitted symbols using only one training time slot. Simulation results show that the proposed DSTC provides diversity gains and substantial BER reductions with respect to conventional CSK, while the tensor-based receiver improves spectral efficiency by reducing training overhead, with particular benefits in large-scale MIMO configurations.

2605.29053 2026-06-01 eess.SY cs.SY

Grid Capacity Expansion under Data Centers and Electrified Manufacturing Large Loads

数据中心和电气化制造大负荷下的电网容量扩展

Jiyong Lee, Melody Agustin, Joanne Langsdorf, Erhan Kutanoglu, Michael Baldea, Ilias Mitrai

AI总结 针对数据中心和电气化制造带来的大负荷,提出多周期电网容量扩展模型,优化发电、储能和输电投资,并通过ERCOT系统七年的案例分析验证模型有效性。

详情
AI中文摘要

在本文中,我们考虑数据中心和电气化制造带来的新兴大负荷下的电网扩展问题。我们开发了一个多周期电网容量扩展模型,以确定发电、储能和输电容量的最优投资方案,同时考虑每小时电力调度,从而满足电力需求并最小化总规划与运营成本。我们还提出了一种关于大负荷需求空间分布的新建模方法。该模型用于分析一个遵循ERCOT系统关键特征的合成电网在七年规划期内的扩展,负荷来自数据中心和电气化炼油,到规划期末分别占总年电力需求的17.5%和4.7%。最优投资政策使发电容量增加83.6%,并利用了太阳能和储能的短建设周期以及火电机组的运行灵活性。最后,敏感性分析表明,电网资产的建设时间显著影响投资时机、发电技术组合和输电容量扩展。所提出的建模框架具有通用性,可扩展到其他电网系统,从而能够探索不同的需求情景、政策假设和区域特征。

英文摘要

In this paper, we consider the expansion of power grids under emerging large loads from data centers and electrified manufacturing. We develop a multi-period grid capacity expansion model to determine optimal investment profiles for power generation, storage, and transmission capacity while accounting for hourly power dispatch, such that electricity demand is satisfied and the total planning and operation cost is minimized. We also propose a new modeling approach regarding the spatial distribution of demand from large loads. The model is used to analyze the expansion of a synthetic grid that follows key characteristics of the ERCOT system over a seven-year planning horizon, under loads from data centers and electrified oil refining, which account for 17.5% and 4.7% of total annual electricity demand by the end of the planning horizon. The optimal investment policy leads to an 83.6% increase in generation capacity and exploits the short construction times of solar and storage as well as the operational flexibility of thermal generators. Finally, sensitivity analysis reveals that the construction time of grid assets substantially impacts investment timing, generation technology mix, and transmission capacity expansion. The proposed modeling framework is general and can be extended to other grid systems, enabling the exploration of diverse demand scenarios, policy assumptions, and regional characteristics.

2605.29034 2026-06-01 cond-mat.str-el

Topological superconductivity from Abelian fractional Chern insulators

阿贝尔分数量子陈绝缘体中的拓扑超导

Taige Wang

AI总结 利用U(3)部分子理论,研究了ν=1/3分数量子陈绝缘体中的超导配对,发现了三种不同拓扑超导相,并统一描述了邻近的电荷密度波态。

Comments 5 pages, 1 figure. (v2) Added Discussion

详情
AI中文摘要

Laughlin任意子流体能否成为拓扑超导体?我们利用U(3)红外部分子理论,对ν=1/3分数量子陈绝缘体(FCI)回答了这个问题。三个电荷为e/3的组分构成电子,而三个组分对形成规范不变的电荷为2e的库珀对。由此产生的超导体共享普通的电荷-2e扇区,但在中性色响应上有所不同,从而得到具有母体Laughlin U(1)_3序的SC*、c_-=3/2的手性拓扑超导体,以及c_-=3的强配对任意子超导体。同一框架组织了邻近的σ_{xy}=0电荷密度波(CDW)态。在公度填充下,正常态以及c_-=3和c_-=3/2的超导后裔自然与周期三的密度序背景相关联,而SC*分支可以保持微观平移不变性。在非公度填充下,电荷子金属可以配对进入相同的c_-=3/2拓扑超导体。FCI、重现CDW和手性超导在U(3)部分子理论中统一。

英文摘要

Can a Laughlin anyon fluid become a topological superconductor? We answer this question for a $ν=1/3$ fractional Chern insulator (FCI) using a $U(3)$ infrared parton theory. Three charge-$e/3$ constituents form the electron, while three constituent pairs form a gauge-invariant charge-$2e$ Cooper pair. The resulting superconductors share an ordinary charge-$2e$ sector but differ in their neutral color response, giving an SC$^\ast$ with the parent Laughlin $U(1)_3$ order, a chiral topological superconductor with $c_-=3/2$, and a strong-pairing anyon superconductor with $c_-=3$. The same framework organizes a nearby $σ_{xy}=0$ charge density wave (CDW) state. At commensurate filling, the normal state and the $c_-=3$ and $c_-=3/2$ superconducting descendants are naturally tied to a period-three density order background, while the SC$^\ast$ branch can preserve microscopic translations. Away from commensuration, a chargon metal can pair into the same $c_-=3/2$ topological superconductor. The FCI, reentrant CDW, and chiral superconductivity are unified in the $U(3)$ parton theory.

2605.28973 2026-06-01 math.NT

Asymptotic formulas for sums of elements from a multiplicative group

乘法群中元素和的渐近公式

Jan-Hendrik Evertse, Kálmán Győry, Lajos Hajdu, Florian Luca, László Remete

AI总结 研究数域K上秩为r的有限生成子群Γ中元素线性组合的绝对指数高度不超过X的计数渐近公式,并推广到线性递归序列。

详情
AI中文摘要

设$K$为数域,$k\geq 2$为整数,$(K^*)^k$为$K^*$的$k$重直积(坐标乘法),$\Gamma$为$(K^*)^k$中秩为$r$的有限生成子群。进一步,设$H(\alpha)$为代数数$\alpha$的绝对指数高度。固定非零元$a_1,\ldots , a_k\in K$。我们给出了当$X\to\infty$时,满足$H(a_1x_1+\cdots +a_kx_k)\leq X$且$a_1x_1+\cdots +a_kx_k$的任何非空子和不等于零的$\mathbf{x}=(x_1,\ldots , x_k)\in\Gamma$的个数的渐近公式。通过相同的证明方法,我们得到了当$X\to\infty$时,满足$H(u_n)\leq X$的非负整数$n$的个数的渐近公式,其中$\{ u_n\}$是线性递归序列。

英文摘要

Let $K$ be a number field, $k\geq 2$ an integer, $(K^*)^k$ the $k$-fold direct product of $K^*$ with coordinatewise multiplication, and $Γ$ a finitely generated subgroup of rank $r$ of $(K^*)^k$. Further, let $H(α)$ denote the absolute exponential height of an algebraic number $α$. Fix non-zero elements $a_1,\ldots , a_k\in K$. We give asymptotic formulas for the number of $\mathbf{x}=(x_1,\ldots , x_k)\inΓ$ with $H(a_1x_1+\cdots +a_kx_k)\leq X$ as $X\to\infty$ such that no non-empty subsum of $a_1x_1+\cdots +a_kx_k$ vanishes. By the same method of proof, we obtain an asymptotic formula as $X\to\infty$ for the number of non-negative integers $n$ with $H(u_n)\leq X$, where $\{ u_n\}$ is a linear recurrence sequence.

2605.28875 2026-06-01 quant-ph

Quantum and Thermal Properties of the Klein-Gordon Inverted Harmonic Oscillator with Physical Applications

Klein-Gordon反谐振子的量子与热性质及其物理应用

Kevin Hernández, Mustapha Maamache

AI总结 通过辛相空间旋转将Klein-Gordon反谐振子映射为有效谐振子,导出自由能、熵和热关联函数的闭式表达式,并应用于宇宙学暴胀、黑洞视界和凝聚态相变。

详情
AI中文摘要

我们为受反谐振势$-{1\over2} m^2ω^2 x^2$作用的Klein-Gordon标量场的量子与热性质建立了一个系统框架。从非厄米动量替换$P \to P - mωx$出发,我们采用辛相空间旋转$V = \exp\!\left[-\tfracπ{8}(xp+px)\right]$将系统映射到一个解析可处理的有效谐振子,该谐振子在$xe^{iπ/4}$处求值。这使我们能够定义良定义的配分函数$Z(β,ω,m)$,并导出自由能、熵和热关联函数的闭式表达式。然后我们将该框架应用于三个物理场景:(i) 宇宙学暴胀期间的标量场涨落,(ii) 黑洞视界附近的量子场,以及(iii) 凝聚态中二级相变附近的序参量动力学。我们的结果统一了文献中先前分散的结果,并为不稳定量子系统的有限温谱密度和纠缠熵提供了新预测。

英文摘要

We develop a systematic framework for the quantum and thermal properties of a Klein-Gordon scalar field subject to an inverted harmonic potential $-{1\over2} m^2ω^2 x^2$. Starting from a non-Hermitian momentum substitution $P \to P - mωx$, we employ a symplectic phase-space rotation $V = \exp\!\left[-\tfracπ{8}(xp+px)\right]$ to map the system onto an analytically tractable effective harmonic oscillator evaluated at $xe^{iπ/4}$. This allows us to define a well-regulated partition function $Z(β,ω,m)$ and derive closed-form expressions for the free energy, entropy, and thermal correlation functions. We then apply this framework to three physical settings: (i) scalar field fluctuations during cosmological inflation, (ii) quantum fields near black-hole horizons, and (iii) order-parameter dynamics near second-order phase transitions in condensed matter. Our results unify previously scattered results in the literature and provide new predictions for the finite-temperature spectral density and entanglement entropy of unstable quantum systems.

2605.28759 2026-06-01 cond-mat.supr-con cond-mat.mes-hall

Absolute measurement of penetration depth of superconducting thin films using microwave stripline resonators

利用微波带状线谐振器绝对测量超导薄膜的穿透深度

Arghya Dutta, Ajeet Salunke, Mahesh Poojary, Vivas Bagwe, Sangita Bose, Pratap Raychaudhuri

AI总结 本文提出一种结合谐振器测量与COMSOL有限元电磁模拟的混合方法,通过不同厚度NbN薄膜及翻转薄膜结构,实现宽温度范围内超导薄膜穿透深度的绝对测量。

Comments References updated in this version

详情
AI中文摘要

超导微带谐振器利用动能电感探测电动力学,是研究超导薄膜微波频率特性的灵敏工具。然而,从这些测量中提取绝对超导穿透深度仍然具有挑战性。在这项工作中,我们提出了一种混合方法,通过将谐振器测量与COMSOL Multiphysics中的有限元电磁模拟相结合,在宽温度范围内确定穿透深度的绝对值。我们通过从不同厚度的薄膜制造谐振器并提取NbN薄膜的穿透深度来演示这种方法。此外,我们通过采用翻转薄膜几何结构,将该技术扩展到具有较低临界温度的材料。通过将样品放置在NbN谐振器上方,中间用薄Mylar介电层隔开,我们创建了一个耦合结构,其中样品穿透深度的变化会改变谐振频率。这种非破坏性方法为表征各种超导薄膜的穿透深度提供了一个可靠、高灵敏度的平台。

英文摘要

Superconducting microstrip resonators, which leverage kinetic inductance to probe electrodynamics, are sensitive tools for studying superconducting thin films at microwave frequencies. However, extracting the absolute superconducting penetration depth from these measurements remains challenging. In this work, we present a hybrid method to determine the absolute value of penetration depth over a wide temperature range by combining resonator measurements with finite-element electromagnetic simulations in COMSOL Multiphysics. We demonstrate this approach by extracting the penetration depth of NbN films by fabricating resonators from films of various thicknesses. Furthermore, we extend the technique to materials with lower critical temperatures by employing a flip-film geometry. By placing a sample above a NbN resonator, separated by a thin Mylar dielectric, we create a coupled structure where changes in the sample's penetration depth shift the resonant frequency. This non-destructive method provides a reliable, high-sensitivity platform for characterizing the penetration depth of diverse superconducting thin films.

2605.27807 2026-06-01 nucl-th quant-ph

Light nuclear scattering from neural quantum states

基于神经量子态的轻核散射

Scott Lawrence, Yukari Yamauchi

AI总结 提出一种利用神经量子态研究少体核散射的方法,无需时间演化,通过薛定谔方程的稳定最小原理提供截面和分波振幅的保守不确定性,并用于研究真实核力下的中子-氘核弹性与非弹性散射。

Comments 16 pages, 4 figures; fixed typos and display of references

详情
AI中文摘要

我们提出了一种通过神经量子态研究少体核散射的方法,无需时间演化。最近发展的一族薛定谔方程的稳定最小原理为以这种方式计算的截面和分波振幅提供了保守的不确定性。我们使用该方法研究了具有现实核两体力的中子-氘核弹性与非弹性散射。

英文摘要

We present a method of studying few-body nuclear scattering by means of neural quantum states, without requiring time-evolution. A recently developed family of stable minimum principles for Schrodinger's equation provides conservative uncertainties on cross sections and partial wave amplitudes computed in this way. We use this method to study both elastic and inelastic neutron-deuteron scattering with realistic nuclear two-body forces.