arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2062
2508.20326 2026-06-01 stat.ML cs.LG math.OC

Stochastic Gradients under Nuisances

干扰下的随机梯度

Facheng Yu, Ronak Mehta, Alex Luedtke, Zaid Harchaoui

AI总结 本文研究目标函数依赖于未知干扰参数的学习问题中随机梯度算法的非渐近收敛性,证明在Neyman正交性等条件下经典算法仍可收敛,并提出近似正交化更新变体以在非正交情形下达到类似收敛率。

Comments Published at NeurIPS 2025

详情
AI中文摘要

随机梯度优化是从经典监督学习到现代自监督学习等多种场景的主要学习范式。我们考虑目标函数依赖于未知干扰参数的学习问题的随机梯度算法,并建立非渐近收敛保证。我们的结果表明,虽然干扰的存在会改变最优值并扰乱优化轨迹,但在适当条件下(如Neyman正交性),经典随机梯度算法仍可能收敛。此外,即使不满足Neyman正交性,我们证明一种具有近似正交化更新(通过近似正交化梯度预言)的算法变体也能达到类似的收敛率。讨论了来自正交统计学习/双机器学习以及因果推断的例子。

英文摘要

Stochastic gradient optimization is the dominant learning paradigm for a variety of scenarios, from classical supervised learning to modern self-supervised learning. We consider stochastic gradient algorithms for learning problems whose objectives rely on unknown nuisance parameters, and establish non-asymptotic convergence guarantees. Our results show that, while the presence of a nuisance can alter the optimum and upset the optimization trajectory, the classical stochastic gradient algorithm may still converge under appropriate conditions, such as Neyman orthogonality. Moreover, even when Neyman orthogonality is not satisfied, we show that an algorithm variant with approximately orthogonalized updates (with an approximately orthogonalized gradient oracle) may achieve similar convergence rates. Examples from orthogonal statistical learning/double machine learning and causal inference are discussed.

2508.11911 2026-06-01 math.NA cs.LG cs.NA physics.comp-ph

Reduced-order modeling of Hamiltonian dynamics based on symplectic neural networks

基于辛神经网络的哈密顿动力学降阶建模

Yongsheng Chen, Wei Guo, Qi Tang, Xinghui Zhong

AI总结 提出一种数据驱动的辛诱导降阶建模框架,通过统一端到端神经架构同时发现潜空间和学习动力学,确保降阶模型精确保持辛结构,提升长期稳定性和保真度。

详情
AI中文摘要

我们为高维哈密顿系统引入了一种新颖的数据驱动辛诱导降阶建模框架,该框架在单个端到端神经架构中统一了潜空间发现和动力学学习。编码器-解码器由Henon神经网络构建,并可增加线性SGS-反射层,从而在全相空间和潜相空间之间产生精确的辛映射。潜动力学由作为HenonNet实现的辛流映射推进。这种统一的神经架构确保在降阶水平上精确保持底层辛结构,显著增强所得ROM的保真度和长期稳定性。我们通过在典型哈密顿系统上的全面数值实验验证了该方法。结果表明,该方法具有准确的轨迹重建能力、训练时间范围之外的鲁棒预测性能以及精确的哈密顿量保持。这些有希望的结果强调了我们的辛ROM框架在广泛科学和工程学科中复杂动力系统的有效性和潜在适用性。

英文摘要

We introduce a novel data-driven symplectic induced-order modeling (ROM) framework for high-dimensional Hamiltonian systems that unifies latent-space discovery and dynamics learning within a single, end-to-end neural architecture. The encoder-decoder is built from Henon neural networks (HenonNets) and may be augmented with linear SGS-reflector layers. This yields an exact symplectic map between full and latent phase spaces. Latent dynamics are advanced by a symplectic flow map implemented as a HenonNet. This unified neural architecture ensures exact preservation of the underlying symplectic structure at the reduced-order level, significantly enhancing the fidelity and long-term stability of the resulting ROM. We validate our method through comprehensive numerical experiments on canonical Hamiltonian systems. The results demonstrate the method's capability for accurate trajectory reconstruction, robust predictive performance beyond the training horizon, and accurate Hamiltonian preservation. These promising outcomes underscore the effectiveness and potential applicability of our symplectic ROM framework for complex dynamical systems across a broad range of scientific and engineering disciplines.

2508.04457 2026-06-01 stat.ML cs.LG

Benchmarking Uncertainty and its Disentanglement in multi-label Chest X-Ray Classification

多标签胸部X光分类中的不确定性及其解缠基准测试

Simon Baur, Wojciech Samek, Jackie Ma

AI总结 本研究使用MIMIC-CXR-JPG数据集,对多标签胸部X光分类任务中的13种不确定性量化方法进行基准测试,评估了卷积和Transformer架构,并扩展了三种方法到多标签设置,揭示了不同方法和架构在不确定性估计和解缠认知与偶然不确定性方面的优缺点。

详情
AI中文摘要

可靠的不确定性量化对于医疗影像中可信赖的决策和AI模型的部署至关重要。虽然先前的工作已经探索了神经网络在合成或定义良好的数据设置(如自然图像分类)中使用信息论方法量化预测、认知和偶然不确定性的能力,但其在真实医学诊断任务中的适用性仍未得到充分探索。在本研究中,我们使用MIMIC-CXR-JPG数据集为多标签胸部X光分类提供了广泛的不确定性量化基准。我们评估了基于卷积(ResNet)和基于Transformer(Vision Transformer)架构的13种不确定性量化方法,涵盖广泛的任务。此外,我们将证据深度学习、HetClass神经网络和深度确定性不确定性扩展到多标签设置。我们的分析提供了对不确定性估计有效性以及解缠认知和偶然不确定性能力的见解,揭示了方法和架构特定的优势和局限性。

英文摘要

Reliable uncertainty quantification is crucial for trustworthy decision-making and the deployment of AI models in medical imaging. While prior work has explored the ability of neural networks to quantify predictive, epistemic, and aleatoric uncertainties using an information-theoretical approach in synthetic or well defined data settings like natural image classification, its applicability to real life medical diagnosis tasks remains underexplored. In this study, we provide an extensive uncertainty quantification benchmark for multi-label chest X-ray classification using the MIMIC-CXR-JPG dataset. We evaluate 13 uncertainty quantification methods for convolutional (ResNet) and transformer-based (Vision Transformer) architectures across a wide range of tasks. Additionally, we extend Evidential Deep Learning, HetClass NNs, and Deep Deterministic Uncertainty to the multi-label setting. Our analysis provides insights into uncertainty estimation effectiveness and the ability to disentangle epistemic and aleatoric uncertainties, revealing method- and architecture-specific strengths and limitations.

2507.17026 2026-06-01 stat.ML cs.LG

Conformal C2ST: Turning weak classifiers into strong two-sample tests

Conformal C2ST:将弱分类器转化为强双样本检验

Vansh Bansal, Tianyu Chen, James G. Scott

AI总结 本文提出基于共形预测的C2ST变体,使任意弱分类器都能产生精确有限样本p值,实现可控第一类错误和温和退化的检验功效,并应用于神经后验估计验证。

详情
AI中文摘要

双样本检验问题是统计学和机器学习中的一项基本任务,旨在判断来自潜在分布$p$和$q$的两组样本是否实际上同分布(即$p=q$)。一种流行且直观的方法是分类器双样本检验(C2ST),其中训练一个分类器来区分来自$p$和$q$的样本。然而,尽管C2ST简单,其可靠性依赖于接近贝叶斯最优的分类器,这一要求很少满足且难以验证。这引发了一个重要的开放问题:弱分类器是否仍能用于双样本检验?我们证明答案是肯定的。基于Hu和Lei(2024)的工作,我们分析了C2ST的两种共形变体,它们将任何训练好的分类器(即使是弱的、有偏的或过拟合的)的分数转化为精确的有限样本p值。我们建立了共形C2ST的两个关键理论性质:(i)有限样本第一类错误控制,以及(ii)非平凡的功效,该功效随训练分类器误差的增加而温和退化。结果是,即使是表现不佳的分类器也能产生强大且可靠的双样本检验。这一通用框架在贝叶斯推断中找到了强大的应用,特别是在验证神经后验估计(NPE)模型时,其中比较学习到的后验近似$q(θ\mid y)$与真实后验$p(θ\mid y)$的任务可以表述为双样本检验。实验上,共形C2ST在此任务的广泛基准测试中优于经典判别检验。我们的结果确立了共形C2ST作为一种实用、理论基础的诊断工具。

英文摘要

The two-sample testing problem, a fundamental task in statistics and machine learning, seeks to determine whether two sets of samples, drawn from underlying distributions $p$ and $q$, are in fact identically distributed (i.e. whether $p=q$). A popular and intuitive approach is the classifier two-sample test (C2ST), where a classifier is trained to distinguish between samples from $p$ and $q$. Yet despite simplicity of the C2ST, its reliability hinges on access to a near-Bayes-optimal classifier, a requirement that is rarely met and difficult to verify. This raises a major open question: can a weak classifier still be useful for two-sample testing? We show that the answer is a definitive yes. Building on the work of Hu and Lei (2024), we analyze two conformal variants of the C2ST that convert the scores from any trained classifier -- even if weak, biased, or overfit -- into exact, finite-sample p-values. We establish two key theoretical properties of the conformal C2ST: (i) finite-sample Type-I error control, and (ii) non-trivial power that degrades gently in tandem with the error of the trained classifier. The upshot is that even poorly performing classifiers can yield powerful and reliable two-sample tests. This general framework finds a powerful application in Bayesian inference, particularly for validating Neural Posterior Estimation (NPE) models, where the task of comparing a learned posterior approximation $q(θ\mid y)$ to the true posterior $p(θ\mid y)$ can be framed as a two-sample test. Empirically, the Conformal C2ST outperforms classical discriminative tests across a wide range of benchmarks for this task. Our results establish the conformal C2ST as a practical, theoretically grounded diagnostic tool.

2411.19463 2026-06-01 cs.SE cs.AI

Understanding the Fundamental Design Decisions of Retrieval-Augmented Generation Systems

理解检索增强生成系统的基本设计决策

Shengming Zhao, Yuchen Shao, Yuheng Huang, Jiayang Song, Zhijie Wang, Chengcheng Wan, Lei Ma

AI总结 本文通过系统实验,研究了RAG部署中的三个关键决策(是否部署、检索量、知识集成方式),揭示了任务和模型依赖的优化策略,为实践者提供基于证据的指导。

详情
Journal ref
ACM Transactions on Software Engineering and Methodology (TOSEM), 2026
AI中文摘要

检索增强生成(RAG)已成为增强大型语言模型(LLM)能力的关键技术。然而,实践者在做出RAG部署决策时面临重大挑战。尽管现有研究优先考虑算法创新,但在理解决定RAG成功的基本工程权衡方面仍存在系统性空白。我们首次对三个通用的RAG部署决策进行了全面研究:是否部署RAG、检索多少信息以及如何有效集成检索到的知识。通过在三个LLM和六个数据集(涵盖问答和代码生成任务)上的系统实验,我们揭示了关键见解:(1)RAG部署必须高度选择性,即使有完美文档,可变召回阈值和失败模式也会影响多达12.6%的样本。(2)最优检索量表现出任务依赖性:问答任务呈现通用模式(5-10个文档最优),而代码生成需要针对场景的优化。(3)知识集成有效性取决于任务和模型特性,代码生成从提示方法中显著受益,而问答任务改进甚微。这些发现表明,通用的RAG策略是不够的。有效的RAG系统需要基于任务特性和模型能力的上下文感知设计决策。我们的分析为实践者提供了基于证据的指导,并为原则性RAG部署建立了基础见解。我们的代码、数据和工件公开于https://github.com/ShengmingZ/RAG_Benchmark_Code_QA。

英文摘要

Retrieval-Augmented Generation (RAG) has emerged as a critical technique for enhancing large language model (LLM) capabilities. However, practitioners face significant challenges when making RAG deployment decisions. While existing research prioritizes algorithmic innovations, a systematic gap persists in understanding fundamental engineering trade-offs that determine RAG success. We present the first comprehensive study of three universal RAG deployment decisions: whether to deploy RAG, how much information to retrieve, and how to integrate retrieved knowledge effectively. Through systematic experiments across three LLMs and six datasets spanning question answering and code generation tasks, we reveal critical insights: (1) RAG deployment must be highly selective, with variable recall thresholds and failure modes affecting up to 12.6\% of samples even with perfect documents. (2) Optimal retrieval volume exhibits task-dependent behavior QA tasks show universal patterns (5-10 documents optimal) while code generation requires scenario-specific optimization. (3) Knowledge integration effectiveness depends on task and model characteristics, with code generation benefiting significantly from prompting methods while question answering shows minimal improvement. These findings demonstrate that universal RAG strategies prove inadequate. Effective RAG systems require context-aware design decisions based on task characteristics and model capabilities. Our analysis provides evidence-based guidance for practitioners and establishes foundational insights for principled RAG deployment. Our code, data and artifacts are publicly available at https://github.com/ShengmingZ/RAG_Benchmark_Code_QA.

2506.12060 2026-06-01 cs.CR cs.AI cs.CY

Organizational Adaptation to Generative AI in Cybersecurity

组织对生成式人工智能在网络安全中的适应

Christopher Nott

AI总结 本研究通过分析2022至2025年的25项研究,采用定性方法探讨组织如何通过修改框架和混合操作流程适应生成式AI,发现成熟基础设施、监管压力和人力资本投资是成功的关键,同时指出攻防能力不平衡等挑战。

Comments 38 pages, 1 table, 1 figure Revised title, abstract, and formatting for journal submission, corrected heading numbers, no substantive changes in content

详情
AI中文摘要

网络安全组织正在通过修改框架和混合操作流程来适应生成式AI的整合,其成功受到现有安全成熟度、监管要求以及人力和基础设施投资的影响。本定性研究采用系统文档分析和比较案例研究方法,考察了2022至2025年间25项研究如何记录威胁建模框架的组织适应,揭示了从传统基于签名的系统向AI能力框架的转变,涉及三种主要模式:用于安全应用的LLM集成、用于风险检测和响应自动化的GenAI框架,以及用于威胁狩猎和匹配的AI/ML集成。拥有成熟基础设施的组织,尤其是在金融和关键基础设施领域,通过结构化治理、专门的AI团队和稳健的事件响应流程表现出更高的准备度,其中中央银行和金融机构在监管压力下引领适应工作。成功整合需要人工监督自动化系统、关注数据质量和可解释性,以及特定行业的治理,尽管在隐私保护、偏见减少、人员培训和对抗性防御方面仍存在持续困难。进攻性和防御性GenAI能力之间的显著不平衡为安全规划带来了战略担忧。研究结果为网络安全专业人员提供了可操作的见解,并强调了在管理AI增强威胁时采用适应性方法、伦理框架和员工发展的必要性。

英文摘要

Cybersecurity organizations are adapting to GenAI integration through modified frameworks and hybrid operational processes, with success influenced by existing security maturity, regulatory requirements, and investments in human capital and infrastructure. This qualitative research employs systematic document analysis and comparative case study methodology to examine how 25 studies from 2022 to 2025 document organizational adaptation of threat modeling frameworks, revealing a shift away from traditional signature-based systems toward AI-capable frameworks across three primary patterns: LLM integration for security applications, GenAI frameworks for risk detection and response automation, and AI/ML integration for threat hunting and matching. Organizations with mature infrastructures, particularly in finance and critical infrastructure, demonstrate higher readiness through structured governance, dedicated AI teams, and robust incident response processes, with central banks and financial institutions leading adaptation efforts under regulatory pressure. Successful integration requires human oversight of automated systems, attention to data quality and explainability, and sector-specific governance, though ongoing difficulties with privacy protection, bias reduction, personnel training, and adversarial defense persist. Notable imbalances between offensive and defensive GenAI capabilities create strategic concerns for security planning. The findings offer actionable insights for cybersecurity professionals and underscore the need for adaptive approaches, ethical frameworks, and staff development when managing AI-enhanced threats.

2506.03779 2026-06-01 quant-ph cs.LG stat.ML

Position: Quantum Kernel Machines Should Move Beyond Scalar-Valued Kernels to Realize Their Potential

立场:量子核机器应超越标量值核以实现其潜力

Hachem Kadri, Joachim Tomasi, Yuka Hashimoto, Sandrine Anthoine

AI总结 本文主张量子核机器应转向算子值核等更富表达力的框架,以利用纠缠和非交换结构处理复杂结构化预测问题,并通过初步概念验证展示其优势。

详情
Journal ref
ICML 2026
AI中文摘要

基于量子力学原理构建的量子核函数已成为量子机器学习的核心。最近的研究表明,当从经典数据学习时,量子核无法提供显著的计算或统计优势,这削弱了最初对量子核机器的热情。然而,该领域的大多数研究都集中在标准分类或回归设置中的标量值核上,而经典核方法在这些设置中已经高效且有效,留给量子核改进的空间很小。在这篇立场论文中,我们认为该领域的进展需要超越标量值核,转向更富表达力的核框架。标量值核缺乏充分利用纠缠等内在量子资源所需的自由度,并且不足以处理经典学习方法难以应对的复杂学习任务。基于算子值核学习和$C^*$-代数核表示的最新进展,我们提出了一条设计能够利用纠缠和非交换结构来处理复杂结构化预测问题的量子核的路线图。为了支持这一观点,我们展示了一个初步的概念验证,说明量子算子值核公式如何揭示标量值核方法难以访问的结构依赖性。这一焦点的转移可能为新一代量子核机器及其潜在优势的更忠实探索开辟道路。

英文摘要

Quantum kernel functions built using quantum-mechanical principles and have emerged as a centerpiece of quantum machine learning. The initial enthusiasm for quantum kernel machines has been tempered by recent studies suggesting that quantum kernels could not offer significant computational or statistical advantages when learning from classical data. However, most of the research in this area has been devoted to scalar-valued kernels in standard classification or regression settings for which classical kernel methods are efficient and effective, leaving very little room for improvement with quantum kernels. In this position paper, we argue that progress in this field requires moving beyond scalar-valued kernels toward more expressive kernel frameworks. Scalar-valued kernels lack the degrees of freedom necessary to fully exploit intrinsically quantum resources such as entanglement and are not rich enough to deal with complex learning tasks where classical learning methods struggle. Building on recent advances in operator-valued kernel learning and $C^*$-algebraic kernel representations, we propose a roadmap for designing quantum kernels capable of leveraging entanglement and non-commutative structures to tackle complex structured prediction problems. To support this viewpoint, we present an initial proof-of-concept illustrating how quantum operator-valued kernel formulations can reveal structural dependencies that remain difficult to access for scalar-valued kernel methods. This shift in focus could open a pathway toward a new generation of quantum kernel machines and a more faithful exploration of their potential advantages.

2411.13865 2026-06-01 cs.IR cs.AI cs.CL cs.LG

Breaking Information Cocoons: A Hyperbolic Framework for Balancing Exploration and Exploitation in Recommender Systems

打破信息茧房:推荐系统中平衡探索与利用的双曲框架

Qiyao Ma, Menglin Yang, Mingxuan Ju, Tong Zhao, Neil Shah, Rex Ying

AI总结 提出双曲框架HERec,通过语义增强的层次机制和自动层次聚类,在推荐系统中平衡探索与利用,有效缓解信息茧房。

Comments Accepted to KDD 2026. Code: https://github.com/Martin-qyma/HERec

详情
AI中文摘要

现代推荐系统常常形成信息茧房,限制用户接触多样化内容。核心挑战在于平衡内容探索与利用,同时允许用户调整推荐偏好。理想情况下,这种平衡可以通过层次表示来捕捉,其中深度搜索促进利用,广度搜索促进探索。然而,现有方法面临两个基本限制:欧几里得方法难以捕捉层次结构,而双曲方法尽管在层次建模上表现优越,但缺乏对用户和物品画像的语义理解,且未能提供平衡探索与利用的原则性机制。为解决这些问题,我们提出HERec,一个在推荐系统中有效平衡探索与利用的双曲框架。我们的框架引入两项关键创新:(1)语义增强的层次机制,直接在双曲空间中将丰富的文本描述与协同信息对齐。理论梯度分析表明,这种对齐有效利用了底层双曲流形结构,从而更准确地建模用户和物品;(2)通过优化Dasgupta代价的自动层次聚类机制,无需预定义超参数即可发现层次结构,实现用户可调节的探索-利用权衡。大量实验表明,HERec持续优于欧几里得和双曲基线,在效用指标上提升高达5.49%,多样性指标提升11.39%,有效缓解了信息茧房。

英文摘要

Modern recommender systems often create information cocoons, restricting users' exposure to diverse content. The central challenge is to balance content exploration and exploitation while allowing users to adjust their recommendation preferences. Ideally, this balance can be captured with a hierarchical representation, where depth search facilitates exploitation and breadth search enables exploration. However, existing approaches face two fundamental limitations: Euclidean methods struggle to capture hierarchical structures, while hyperbolic methods, despite their superior hierarchical modeling, lack semantic understanding of user and item profiles and fail to provide a principled mechanism for balancing exploration and exploitation. To address these challenges, we propose HERec, a hyperbolic framework that effectively balances exploration and exploitation in recommender systems. Our framework introduces two key innovations: (1) a semantic-enhanced hierarchical mechanism that aligns rich textual descriptions with collaborative information directly in hyperbolic space. Theoretical gradient analysis demonstrates that this alignment effectively leverages the underlying hyperbolic manifold structure, resulting in more accurate modeling of users and items; (2) an automatic hierarchical clustering mechanism by optimizing Dasgupta's cost, which discovers hierarchical structures without requiring predefined hyperparameters, enabling user-adjustable exploration-exploitation trade-offs. Extensive experiments demonstrate that HERec consistently outperforms both Euclidean and hyperbolic baselines, achieving up to 5.49% improvement in utility metrics and 11.39% increase in diversity metrics, effectively mitigating information cocoons.

2504.10564 2026-06-01 q-bio.QM cs.LG q-bio.BM

FLOWR: Flow Matching for Structure-Aware De Novo, Interaction- and Fragment-Based Ligand Generation

FLOWR: 用于结构感知的从头、基于相互作用和片段的配体生成的流匹配

Julian Cremer, Ross Irwin, Alessandro Tibo, Jon Paul Janet, Simon Olsson, Djork-Arné Clevert

AI总结 提出FLOWR框架,通过结合连续和分类流匹配与等变最优传输,并利用高效蛋白口袋条件化,实现三维配体的生成与优化,在有效性、姿态精度和相互作用恢复上超越现有方法,推理速度提升高达70倍。

详情
AI中文摘要

我们介绍了FLOWR,一个新颖的基于结构的框架,用于三维配体的生成和优化。FLOWR将连续和分类流匹配与等变最优传输相结合,并通过高效的蛋白口袋条件化增强。与FLOWR一起,我们提出了SPINDR,一个精心策划的数据集,包含配体-口袋共晶复合物,专门用于解决现有数据质量问题。实证评估表明,FLOWR在PoseBusters有效性、姿态精度和相互作用恢复方面超越了当前最先进的基于扩散和流的方法,同时提供了显著的推理加速,性能提升高达70倍。此外,我们引入了FLOWR:multi,一个高精度的多用途模型,允许针对性地采样符合预定义相互作用谱和化学子结构的新配体,用于基于片段的设计,无需重新训练或任何重采样策略。

英文摘要

We introduce FLOWR, a novel structure-based framework for the generation and optimization of three-dimensional ligands. FLOWR integrates continuous and categorical flow matching with equivariant optimal transport, enhanced by an efficient protein pocket conditioning. Alongside FLOWR, we present SPINDR, a thoroughly curated dataset comprising ligand-pocket co-crystal complexes specifically designed to address existing data quality issues. Empirical evaluations demonstrate that FLOWR surpasses current state-of-the-art diffusion- and flow-based methods in terms of PoseBusters-validity, pose accuracy, and interaction recovery, while offering a significant inference speedup, achieving up to 70-fold faster performance. In addition, we introduce FLOWR:multi, a highly accurate multi-purpose model allowing for the targeted sampling of novel ligands that adhere to predefined interaction profiles and chemical substructures for fragment-based design without the need of re-training or any re-sampling strategies

1709.08894 2026-06-01 stat.ML cs.LG

On the regularization of Wasserstein GANs

关于Wasserstein GANs的正则化

Henning Petzka, Asja Fischer, Denis Lukovnikov

AI总结 本文研究Wasserstein GANs中Lipschitz约束的正则化方法,通过理论分析和实验证明使用较弱的正则化项优于权重裁剪。

Comments Published as a conference paper at ICLR 2018. * Henning Petzka and Asja Fischer contributed equally to this work (11 pages +13 pages appendix)

详情
AI中文摘要

自生成对抗网络(GANs)发明以来,它们已成为学习建模真实(未标记)数据分布的一种流行方法。训练过程中的收敛问题通过Wasserstein GANs得以克服,后者通过不同的度量最小化模型与经验分布之间的距离,但由此在优化问题中引入了Lipschitz约束。在神经网络可建模的函数类上强制Lipschitz约束的一种简单方法是权重裁剪。有人提出,可以通过在损失函数中添加正则化项来改进训练,该正则化项惩罚判别器(作为网络输入的函数)的梯度偏离1。我们提出了理论论据,说明为什么使用较弱的正则化项来强制Lipschitz约束更可取。这些论据得到了在玩具数据集上的实验结果的支持。

英文摘要

Since their invention, generative adversarial networks (GANs) have become a popular approach for learning to model a distribution of real (unlabeled) data. Convergence problems during training are overcome by Wasserstein GANs which minimize the distance between the model and the empirical distribution in terms of a different metric, but thereby introduce a Lipschitz constraint into the optimization problem. A simple way to enforce the Lipschitz constraint on the class of functions, which can be modeled by the neural network, is weight clipping. It was proposed that training can be improved by instead augmenting the loss by a regularization term that penalizes the deviation of the gradient of the critic (as a function of the network's input) from one. We present theoretical arguments why using a weaker regularization term enforcing the Lipschitz constraint is preferable. These arguments are supported by experimental results on toy data sets.

2605.31602 2026-06-01 cond-mat.str-el hep-th math.QA

Twin Algebras: Condensable Algebras beyond Anyons

孪生代数:超越任意子的可凝聚代数

Yuhan Gai, Sakura Schafer-Nameki, Alison Warman

AI总结 本文引入孪生可凝聚代数概念,在群论拓扑序中通过不同机制构造无限族孪生代数,揭示其描述具有相同基态空间但不等价序参量的不同对称相,并用于构造无隐藏对称性破缺的相变。

Comments 37 pages, 3 ancillary files

详情
AI中文摘要

在2+1维非手征拓扑序中,可凝聚代数刻画有隙边界条件和界面。应用于对称拓扑场论时,它们允许对称有隙相的分类,并对可能的相变施加严格约束。可凝聚代数不仅由其底层任意子集合(这些任意子终结于边界或界面)指定,还由其代数结构指定。我们引入孪生可凝聚代数概念,它们具有相同的任意子分解,但代数结构不等价。我们重新审视了$\mathcal{Z}( ext{Vec}_G^ω)$中的可凝聚代数分类,即有限群$G$带反常$ω$的群论拓扑序。在此背景下,我们能够识别出由不同机制(如子群数据、SPT上循环和对称性作用)产生的孪生代数。特别地,我们从所谓的Gassmann三元组构造了无限族孪生代数实例,并展示了约化拓扑序尽管具有相同的任意子内容却不等价的情形。物理上,孪生代数描述具有同构基态空间但不等价序参量的不同对称相。这种孪生相从不表现出相对自发对称性破缺,可用于构造无隐藏对称性破缺的相变,这些相变本质上超越了朗道相变。

英文摘要

Condensable algebras in 2+1d non-chiral topological orders characterize gapped boundary conditions and interfaces. Applied to the Symmetry Topological Field Theory, they allow classification of symmetric gapped phases and impose sharp constraints on possible phase transitions. A condensable algebra is specified not only by its underlying set of anyons, which end on the boundary or interface, but also by its algebra structure. We introduce the concept of twin condensable algebras, which have the same anyon decomposition, but inequivalent algebra structure. We revisit the classification of condensable algebras in $\mathcal{Z}(\text{Vec}_G^ω)$, i.e. in group-theoretical topological orders for finite groups $G$ with anomaly $ω$. In this context we are able to identify twin algebras that arise from different mechanisms, such as subgroup data, SPT cocycles, and symmetry actions. In particular, we construct infinite families of examples of twins from so-called Gassmann triples, and exhibit cases in which the reduced topological orders are inequivalent despite having identical anyon content. Physically, twin algebras describe distinct symmetric phases that have isomorphic spaces of ground states, but inequivalent order parameters. Such twin phases never exhibit relative spontaneous symmetry breaking, and can be used to construct phase transitions without hidden symmetry breaking, which are intrinsically beyond Landau transitions.

2605.31601 2026-06-01 cond-mat.str-el hep-th math.CT quant-ph

Twin Phases: Phase Transitions Without Hidden Symmetry Breaking

孪生相:无隐藏对称性破缺的相变

Alison Warman, Yuhan Gai, Sakura Schafer-Nameki

AI总结 本文提出对称性S下的孪生相概念,即不等价相,其序参量属于S下的同一广义荷,并证明这类相之间的直接相变不涉及隐藏对称性破缺,以1+1维反常有限群对称性为例展示内在超越朗道相变。

Comments 5 pages + appendices and ancillary data file

详情
AI中文摘要

我们引入了对称性$\mathcal{S}$下的孪生相概念,定义为不等价相,其序参量是$\mathcal{S}$下同一广义荷的一部分。这类孪生相之间的稳定、直接相变永远不会是自发对称性破缺相变,即使在(部分)规范初始对称性$\mathcal{S}$之后也是如此:它们是没有隐藏对称性破缺的相变。我们以1+1维中的(反常)有限群对称性为例说明这一点,该对称性展示了这种内在超越朗道的相变。

英文摘要

We introduce the concept of twin phases for a symmetry $\mathcal{S}$, defined as inequivalent phases, whose order parameters are part of the same generalized charge under $\mathcal{S}$. Stable, direct transitions between such twin phases are never spontaneous-symmetry-breaking transitions, even after (partially) gauging the initial symmetry $\mathcal{S}$: they are phase transitions without hidden symmetry breaking. We illustrate this with an (anomalous) finite group symmetry in 1+1d, which exhibits such intrinsically beyond Landau transitions.

2605.31600 2026-06-01 astro-ph.CO hep-ph hep-th

Gravitational Waves from hybrid defects as probe of Flavor symmetry breaking: Machine-Learning Approach

来自混合缺陷的引力波作为味对称性破缺的探针:机器学习方法

Anish Ghoshal, Ilia Gogoladze, Amit Tiwari

AI总结 提出由宇宙弦束缚的畴壁网络产生随机引力波背景信号,源于规范U(1)_F味对称性自发破缺及后续离散Z_2对称性破缺,通过机器学习代理区分混合缺陷与其他缺陷的引力波谱。

Comments 39 pages, 190 figures + References

详情
AI中文摘要

我们提出了一种新的可能性:由宇宙弦束缚的畴壁网络会产生随机引力波背景信号,该信号源于规范$U(1)_F$味对称性的自发破缺以及随后容纳暗物质的离散$Z_2$对称性的破缺。对于高$U(1)_F$破缺能标,在即将到来的引力波探测器(包括LISA、ET和SKA)中可以探测到由弦束缚壁网络产生的引力波谱。与标准宇宙弦情况相比,该引力波信号在微赫兹到赫兹的频率范围内表现出独特的红外频率斜率。我们开发了一种可能的策略,通过使用基于多层感知器的机器学习代理(基于全数值处理得到的谱进行训练)进行精确计算,来区分和表征混合缺陷与其他缺陷(如稳定宇宙弦)的引力波谱。然后将其用于探测器特定信噪比计算中的快速推断,使过程快速高效。我们还讨论了引力波搜索与实验室味观测量之间可能的互补性。

英文摘要

We present a novel possibility that a network of domain walls bounded by cosmic strings generates a stochastic gravitational wave background (SWGB) signal originating from the spontaneous breaking of a gauged $U(1)_F$ flavor symmetry and the subsequent breaking of discrete $Z_2$ symmetry that accommodates dark matter. The gravitational wave (GW) spectrum produced by the string-bounded-wall network can be detected for high $U(1)_F$ breaking scales in forthcoming GW detectors including LISA, ET and SKA. The GW signal exhibits a distinctive frequency slope, in the infrared, compared to the standard cosmic-string case, in the frequency range between micro-hertz and hertz. We develop a possible strategy to distinguish and characterize GW spectrum of the hybrid defect from from other defects, such as stable cosmic strings, via employing the exact calculation with a machine-learning surrogate, based on a multilayer perceptron (MLP), trained on spectra obtained from the full numerical treatment. This is then used for rapid inference in the detector-specific signal-to-noise ratio (SNR) computation which also makes the process fast and efficient. We also discuss some possible complementarity between GW searches and Flavor observables in the laboratory.

2605.31599 2026-06-01 math.ST stat.TH

Normal approximations in nonparametric empirical Bayes

非参数经验贝叶斯中的正态近似

Jiafeng Chen, Nabarun Deb, Nikolaos Ignatiadis

AI总结 本文通过理论分析证明,在非参数经验贝叶斯中,非参数最大似然估计和相关筛法的去噪遗憾由精确正态性下的速率加上中心极限定理近似质量项控制,且该近似仅需边际平均成立,无需高维正态近似。

Comments 53 pages

详情
AI中文摘要

经验贝叶斯分析通常将潜在参数的噪声测量建模为正态分布,并非正式地引用中心极限定理(CLT)来证明其合理性。本文将该启发式论证置于更坚实的分析基础上。我们证明,非参数最大似然估计(NPMLE)和相关筛法的去噪遗憾由精确正态性下的速率加上反映CLT近似质量的项控制。CLT仅需对每个坐标边际成立,且仅需平均意义,无需高维正态近似。我们识别出两个渐近区域,其中正态近似足够且经验贝叶斯先验保持信息性,并证明我们的保证对依赖性和方差估计具有鲁棒性。

英文摘要

Empirical Bayes analyses routinely model noisy measurements of latent parameters as normal, justifying this by an informal appeal to the central limit theorem (CLT). This paper puts this heuristic appeal on firmer analytical grounds. We show that the denoising regret of the nonparametric maximum likelihood estimator (NPMLE) and related sieve methods is controlled by the rate attained under exact normality, plus a term reflecting the quality of the CLT approximation. The CLT need only hold marginally for each coordinate, and moreover only on average, without needing high-dimensional normal approximations. We identify two asymptotic regimes in which the normal approximation is adequate and the empirical Bayesian prior remains informative, and we show that our guarantees are robust to dependence and to variance estimation.

2605.31592 2026-06-01 cond-mat.quant-gas quant-ph

Floquet Engineering of Quantum Transport through two Driven Impurities

通过两个驱动杂质的量子输运的Floquet工程

Vincenzo Bruno, Corinna Kollath, Roberta Citro, Ameneh Sheikhan

AI总结 研究通过周期驱动两个杂质的一维介观通道中的量子输运,利用Floquet工程揭示Fabry-Perot腔模与Fano干涉导致的丰富输运现象,并展示束缚态和准束缚态的可控相干俘获机制。

详情
AI中文摘要

Floquet工程通过周期驱动物理参数提供了操纵量子态的强有力工具。在这项工作中,我们研究了通过介观一维通道中两个周期驱动杂质的量子输运。通过将含时哈密顿量映射为有效的多通道散射问题,我们揭示了由Fabry-Perot腔模与Fano干涉相互作用产生的丰富输运现象。我们证明了杂质间距离作为关键控制参数,允许形成连续谱中的束缚态(BICs)。此外,我们识别了准BICs,即具有有限寿命的极窄共振,可以通过驱动幅度动态调谐。我们展示了这些态能够实现鲁棒的相干俘获机制,使系统从完美透明或反射切换到具有巨大Wigner时间延迟的强局域化。我们的结果提示了可调延迟线和量子存储器的可能应用,并在冷原子背景下具有可行的实验实现。

英文摘要

Floquet engineering offers powerful tools to manipulate quantum states by periodically driving physical parameters. In this work, we investigate the quantum transport through two periodically driven impurities in a mesoscopic one-dimensional channel. By mapping the time-dependent Hamiltonian into an effective multichannel scattering problem, we unveil a rich landscape of transport phenomena arising from the interplay between Fabry-Perot cavity modes and Fano interference. We demonstrate that the inter-impurity distance acts as a critical control parameter, allowing for the formation of Bound States in the Continuum (BICs). Furthermore, we identify Quasi-BICs, extremely narrow resonances with finite lifetimes, that can be dynamically tuned by the drive amplitude. We show that these states enable a robust coherent trapping mechanism, allowing the system to switch from perfect transparency or reflection to strong localization with giant Wigner time delays. Our results suggest possible applications for tunable delay lines and quantum memories, with feasible experimental realizations in the context of cold atoms.

2605.31588 2026-06-01 hep-th math-ph math.GT math.MP math.QA

Two roles of Alexander in two Kashaev phases

两个Kashaev相中Alexander的两个角色

Dmitry Galakhov, Alexei Morozov

AI总结 本文通过Chern-Simons理论的双重缩放极限(Kashaev极限),揭示了Alexander多项式在经典A-多项式和Jones多项式微扰展开中的两个相反角色,并指出其一致性源于量子A-多项式的特殊形式以及准经典极限中两个不同分支(相)的共存。

Comments 20 pages, 3 figures

详情
AI中文摘要

复兴理论的关键特征是非微扰行为的模糊性,这反映在积分轮廓的不同选择或Ward恒等式的多个解的存在中。这一点通过考虑精确可解模型得到了很好的说明,其中突出的例子是Chern-Simons理论。其重要篇章——应能直接推广到任意Yang-Mills理论——是在大表示和小耦合的双重缩放极限下考虑Wilson平均。出于历史原因,我们称之为Kashaev极限。它拥有一个基于准经典/WKB近似的自然解释,然而这有些特殊,从而为旧故事提供了新视角。关键点是Alexander多项式$Δ$以两个看似相反的角色出现:经典$A$-多项式与$Δ$有公共根,而Jones多项式在微扰展开中趋向于$Δ^{-1}$。一致性由量子$A$-多项式的特殊形式提供,而这一谜团的解决是准经典极限中两个不同分支(相)的共存——一个具有非平凡经典作用,另一个具有消失的经典作用。前者导致经典$A$-多项式和双曲体积,后者导致逆Alexander多项式。

英文摘要

The crucial feature of resurgence theory is the ambiguity of non-perturbative behavior, reflected either in the different choices of integration contours or in the existence of several solutions to Ward identities. This is well illustrated by considering exactly solvable models, of which the prominent example is Chern-Simons theory. Its important chapter, which should have a direct generalization to arbitrary Yang-Mills, is the consideration of Wilson averages in the double-scaling limit of large representation and small coupling. For historical reasons, we call it a Kashaev limit. It possesses a natural interpretation in terms of quasiclassical/WKB approximation, which is, however, somewhat peculiar and thus sheds new light on the old story. The crucial point is the appearance of Alexander polynomials $Δ$ in two seemingly opposite roles: the classical $A$-polynomials have common roots with $Δ$, while Jones polynomials tend to $Δ^{-1}$ in the perturbative expansion. The consistency is provided by the peculiar form of the quantum $A$-polynomial, and the resolution of the puzzle is the co-existence of two different branches (phases) in the quasiclassical limit -- with non-trivial and with vanishing classical actions. The first leads to classical $A$-polynomials and hyperbolic volumes, the second -- to inverse Alexanders.

2605.31587 2026-06-01 math.AP

A conditional Lagrangian clock barrier at the $C^{1,\frac{1}{3}}$ threshold for axisymmetric Euler without swirl

轴对称无旋Euler方程在$C^{1,\frac{1}{3}}$阈值处的条件拉格朗日时钟屏障

Ovidiu-Neculai Avadanei

AI总结 研究轴对称无旋三维不可压缩Euler方程在$C^{1,\alpha}\cap L^2$初始速度下的解,通过拉格朗日时钟机制在$\alpha\geq\frac{1}{3}$时建立超临界/临界屏障,并证明在特定条件下变形梯度的最小奇异值不会在有限时间内坍缩。

Comments 21 pages

详情
AI中文摘要

我们考虑三维不可压缩Euler方程的轴对称无旋解,初始速度在$C^{1,\alpha}\cap L^2$中,其中$\alpha\in\left[\frac{1}{3},1\right)$。受Shkoller在$C^{1,\frac{1}{3}}$阈值以下有限时间爆破的拉格朗日时钟-驱动框架启发,我们引入初始数据的相干拉格朗日类和条件解,对于这些解,相同的时钟机制在$\alpha\geq\frac{1}{3}$时产生超临界/临界屏障,其中$\alpha>\frac{1}{3}$时屏障真正耗尽,而在临界端点$\alpha=\frac{1}{3}$时产生指数界。在一般情况下,我们根据变形梯度的最小奇异值制定了一个矩阵时钟准则,并表明在尖尾、Dini相干、近场兼容和有界横向畸变假设下,该奇异值不能在有限时间内坍缩。在轴上情况下,该准则简化为标量时钟不等式$\displaystyle \dot{J}(t)\gtrsim -B(t)J(t)-CJ(t)^{3\alpha}$,这排除了$\alpha\geq\frac{1}{3}$时Shkoller型时钟坍缩。这些结果并未扩大已知的Lorentz空间全局正则类。相反,它们特别识别了与Shkoller在$\alpha>\frac{1}{3}$情况下的亚临界爆破机制对偶的超临界拉格朗日障碍。

英文摘要

We consider axisymmetric no-swirl solutions to the three-dimensional incompressible Euler equations, with initial velocity in $C^{1,α}\cap L^2$, where $α\in\left[\frac{1}{3},1\right)$. Motivated by Shkoller's Lagrangian clock-and-driver framework for finite-time blow-up below the $C^{1,\frac{1}{3}}$ threshold, we introduce coherent Lagrangian classes of initial data and conditional solutions for which the same clock mechanism yields a supercritical/critical barrier when $α\geq\frac{1}{3}$, with a genuinely depleted barrier for $α>\frac{1}{3}$ and an exponential bound at the critical endpoint $α=\frac{1}{3}$. In the general case, we formulate a matrix-clock criterion in terms of the smallest singular value of the deformation gradient and show that, under cusp-tail, Dini coherence, near-field compatibility, and bounded transverse-distortion hypotheses, this singular value cannot collapse in finite time. In the on-axis case, the criterion reduces to the scalar clock inequality $\displaystyle \dot{J}(t)\gtrsim -B(t)J(t)-CJ(t)^{3α}$, which rules out Shkoller-type clock collapse for $α\geq\frac{1}{3}$. These results do not enlarge the known Lorentz-space global regularity classes. Rather, they in particular identify the supercritical Lagrangian obstruction dual to Shkoller's subcritical blow-up mechanism in the case $α>\frac{1}{3}$.

2605.31585 2026-06-01 gr-qc math.AP

Stability and instability of torus-symmetric Einstein spacetimes with square-integrable connection

具有平方可积联络的环面对称爱因斯坦时空的稳定性与不稳定性

Bruno Le Floch, Philippe G. LeFloch

AI总结 研究T2对称下爱因斯坦方程的整体演化,引入平方可积联络系数,建立非微扰整体存在性与稳定性理论,并证明非线性稳定性与不稳定性定理。

Comments 133 pages

详情
AI中文摘要

我们研究T3上T2对称下爱因斯坦方程的整体演化问题,允许真空、标量场和可压缩流体物质模型,由包括等温流体和多方流体在内的通用状态方程控制。在这种对称性下,我们首次获得了联络系数仅为平方可积的非微扰整体存在性与稳定性理论,该理论同时允许脉冲引力波和激波。在面积规范下,我们引入新的流体和几何变量,并将爱因斯坦-欧拉系统重新表述为带有约束和熵结构的一阶非线性平衡律系统。所得公式表现出双曲性、零形式、熵流、散度-旋度结构、最大值原理和时空估计。这引出了驯服爱因斯坦-欧拉流的概念,其中基本几何和流体变量是平方可积的(有限能量),而次要变量是绝对连续的(或更一般地,有界变差)。在这种非微扰和弱正则性设定下,即使Weyl曲率沿类时超曲面集中为Dirac质量,且Ricci曲率仅为可积,方程仍然有意义。我们的主要结果是面积叶层的整体存在性定理、良好准备初始数据的非线性稳定性定理以及几何振荡数据的非线性不稳定性定理,后者对应力-能量张量产生测度修正。在未来收缩区域,面积叶层达到几何奇点,其中T3空间片的体积退化为零。在非真空Gowdy对称和真空环面对称情况下,面积函数一般达到零。在未来膨胀区域,面积叶层是完整的。

英文摘要

We study the global evolution problem for the Einstein equations under T2 symmetry on T3, allowing vacuum, scalar-field, and compressible-fluid matter models, governed by a general equation of state including isothermal and polytropic fluids. Under this symmetry, we obtain the first non-perturbative, global existence and stability theory with connection coefficients being merely square-integrable, which allows both impulsive gravitational waves and shock waves. In areal gauge, we introduce new fluid and geometric variables and reformulate the Einstein-Euler system as a first-order system of nonlinear balance laws with constraints and an entropy structure. The resulting formulation exhibits hyperbolicity, null forms, entropy currents, div-curl structure, maximum principles, and spacetime estimates. This leads to a notion of tame Einstein-Euler flow for which the essential geometric and fluid variables are square-integrable (finite energy), and the secondary variables are absolutely continuous (or, more generally, of bounded variation). In this non-perturbative and weak regularity setting, the equations remain meaningful even when the Weyl curvature concentrates into Dirac masses along timelike hypersurfaces, and the Ricci curvature remains only integrable. Our main results are a global existence theorem for areal foliations, a nonlinear stability theorem for well-prepared initial data, and a nonlinear instability theorem for geometrically oscillatory data, the latter producing measure corrections to the stress energy tensor. In the future-contracting regime, the areal foliation reaches a geometric singularity where the volume of T3 spatial slices degenerates to zero. The areal function reaches zero generically in the non-vacuum Gowdy-symmetric and vacuum torus-symmetric cases. In the future-expanding regime, the areal foliation is complete.

2605.31582 2026-06-01 physics.chem-ph cond-mat.str-el

Richardson-Gaudin states of non-zero seniority III: The Perfect-Pairing limit

非零辛格数的Richardson-Gaudin态 III:完美配对极限

Paul A. Johnson

AI总结 本文通过简化Richardson-Gaudin态得到完美配对参考态及其低激发态,在保持数值精度的同时大幅降低计算成本,并展示了二阶Epstein-Nesbet微扰修正与完全活性空间自洽场方法相当的质量。

详情
AI中文摘要

强关联电子可以通过按未配对电子数分组的Slater行列式的组态相互作用以指数成本处理。本系列的前两篇论文表明,基于Richardson-Gaudin态构建的单参考方法以多项式成本给出了类似质量的结果。在本工作中,这些态被大幅简化,得到完美配对态作为参考态及其低激发态。这些态更简单,计算成本大幅降低,且数值精度没有损失。价电子的二阶Epstein-Nesbet微扰修正与完全活性空间自洽场方法质量相当。

英文摘要

Strongly correlated electrons can be treated with a configuration interaction of Slater determinants grouped by number of unpaired electrons with exponential cost. The first two papers in this series demonstrated that single reference methods built from Richardson-Gaudin states gave results of similar quality at polynomial cost. In this contribution, the states are simplified substantially yielding the perfect-pairing state as a reference along with its low-lying excitations. The states are much simpler, the computational cost is substantially reduced, and there is no sacrifice in numerical accuracy. Second-order Epstein-Nesbet perturbative corrections for the valence electrons are similar in quality to the complete active space self-consistent field.

2605.31579 2026-06-01 eess.SP cs.IT math.IT math.ST stat.TH

Functional Multi-Target Detection via Bispectrum Inversion

基于双谱反演的功能性多目标检测

Anna Little, Daniel Sanz-Alonso, Mikhail Sweeney, Ruiyi Yang

AI总结 针对含未知平移的多目标检测问题,提出基于自相关分析的无初始化恢复算法,通过去偏三阶经验自相关估计双谱,并利用频率推进或Kotlarski反卷积公式恢复信号,证明非渐近恢复保证。

详情
AI中文摘要

本文发展了多目标检测的功能性理论,其中从包含信号多个未知平移的单个含噪观测中恢复紧支撑信号。我们的公式允许连续、非网格平移和相关平稳高斯过程噪声,超越了先前工作中常见的离散、网格对齐、白噪声模型。我们分析了两种基于自相关分析的无初始化恢复算法;特别地,两种算法首先通过去偏三阶经验自相关估计信号的双谱。然后利用功能性频率推进方案或Kotlarski型反卷积公式从估计的双谱中恢复信号。对于两种算法,我们在无带限假设下证明了紧支撑信号的非渐近恢复保证。得到的误差界依赖于信号的光滑性和双谱估计的精度,后者由噪声特性和信号出现次数决定。数值实验验证了我们的理论,并展示了在低信噪比条件下的准确恢复。

英文摘要

This paper develops a functional theory for multi-target detection, where a compactly supported signal is recovered from a single noisy observation containing many unknown translations of the signal. Our formulation allows continuous, off-grid translations and correlated stationary Gaussian process noise, extending beyond the discrete, grid-aligned, white-noise models common in prior work. We analyze two uninitialized recovery algorithms based on autocorrelation analysis; in particular, both algorithms first estimate the signal's bispectrum via a debiased third-order empirical autocorrelation. The signal is then recovered from the estimated bispectrum using either a functional frequency marching scheme or a Kotlarski-type deconvolution formula. For both algorithms, we prove non-asymptotic recovery guarantees for compactly supported signals without bandlimiting assumptions. The resulting error bounds depend on the smoothness of the signal and the accuracy of bispectrum estimation, with the latter governed by the noise characteristics and the number of signal occurrences. Numerical experiments validate our theory and demonstrate accurate recovery in low-SNR regimes.

2605.31578 2026-06-01 hep-ph nucl-th

Deeply bound dibaryon $d^*(2380)$ from meson-exchange saturation $ΔΔ$ effective field theory

深度束缚的双重子 $d^*(2380)$:来自介子交换饱和 $ΔΔ$ 有效场理论

Prin Sawasdipol, Chinadanai Bubpatate, Daris Samart

AI总结 本文提出了一种 RG 改进的有效场论框架,用于描述深度束缚的双重子 $d^*(2380)$,通过介子交换饱和接触相互作用,在 $(J,I)=(3,0)$ ${}^7S_3$ 道中实现了 $ΔΔ$ 束缚态,并解释了其与实验值的偏差。

Comments 7 pages, 2 tables

详情
AI中文摘要

我们为深度束缚的双重子 $d^*(2380)$ 提出了一种 RG 改进的有效场论框架,该双重子是 $(J,I)=(3,0)$ ${}^7S_3$ 道中的 $ΔΔ$ 束缚态。其束缚动量 $γ\simeq 320$ MeV 给出 $γ/m_π\simeq 2.3$,表明需要重新组织超出形式无π EFT 的短程动力学。我们将大 $N_c$ 约束的无π接触势匹配到介子交换饱和接触相互作用,其中 $σ,ρ,ω$ 动力学在强子尺度 $m_V$ 上被积掉,得到受控展开参数 $γ/m_V\simeq 0.42$。将接触耦合归一化到氘核,并代入唯象的 CD-Bonn 耦合,得到 $B_{ΔΔ}\simeq 96$ MeV。与实验值 $B_{\rm exp}=84$ MeV 的 $\simeq 14\%$ 偏差处于 $\mathcal{O}(1/N_c^2)\simeq 11\%$ 修正的自然大小,这证实了与围绕有限程强子尺度组织的受控 EFT 展开的兼容性。因此,通过本文中的 EFT 重组,观测到的 $d^*(2380)$ 极点从虚态转变为束缚态。

英文摘要

We propose an RG-improved effective-field-theory framework for the deeply bound dibaryon $d^*(2380)$, a $ΔΔ$ bound state in the $(J,I)=(3,0)$ ${}^7S_3$ channel. Its binding momentum $γ\simeq 320$ MeV gives $γ/m_π\simeq 2.3$, indicating the need to re-organize the short-range dynamics beyond a formal pionless EFT. We match the large-$N_c$-constrained pionless contact potential to a meson-exchange-saturated contact interaction in which the $σ,ρ,ω$ dynamics are integrated out at the hadronic scale $m_V$, yielding the controlled expansion parameter $γ/m_V\simeq 0.42$. Normalizing the contact coupling to the deuteron and substituting the phenomenological CD-Bonn couplings gives $B_{ΔΔ}\simeq 96$ MeV. The $\simeq 14\%$ discrepancy from $B_{\rm exp}=84$ MeV is of the natural size of $\mathcal{O}(1/N_c^2)\simeq 11\%$ corrections to the $NN$ potential, confirming compatibility with a controlled EFT expansion organized around the finite-range hadronic scale. As a result, the observed $d^*(2380)$ pole emerges from the virtual state to bound state by using the EFT re-organization in this work.

2605.31574 2026-06-01 cs.HC

Can Generative AI help people navigate Radical Moral Disagreements? The CONSIDER prototype

生成式AI能否帮助人们应对根本性道德分歧?CONSIDER原型

William Hohnen-Ford, Sarah Chen, Kathryn B. Francis, Madeline G. Reinecke, Ilina Singh, David Lyreskog

AI总结 本文提出CONSIDER原型,一种基于大型语言模型的一对一AI工具,通过结构化分歧帮助用户澄清价值观,以应对根本性道德分歧。

Comments 25 pages, 1 figure, 2 tables. Submitted manuscript

详情
AI中文摘要

根本性道德分歧(RMDs)是高度两极分化的话题,在日常生活中日益受到审查,越来越多的证据表明这种两极分化对公众心理健康造成了可衡量的成本。为了应对这些挑战,一些研究者提出使用大型语言模型(LLMs)来支持更民主的 deliberation 和更好的道德推理。然而,现有工具由于RMDs的激烈和分裂特性,难以帮助人们有效应对。本文介绍了CONSIDER,一种用于RMD导航的一对一AI工具原型。借鉴密尔关于分歧的认识论价值的观点,CONSIDER旨在通过与对立的LLM生成观点进行结构化分歧来澄清价值观。我们描述了CONSIDER的设计逻辑,并分析了此类工具可能带来的潜在风险,以指导未来的发展。

英文摘要

Radical Moral Disagreements (RMDs) are highly polarising topics that are increasingly censored in everyday life, with growing evidence suggesting that this polarisation carries measurable costs to public mental health. To address these challenges, some researchers have proposed Large Language Models (LLMs) as a means to support more democratic deliberation and better moral reasoning. Yet existing tools are poorly calibrated to help people navigate RMDs, because of their intense and divisive characteristics. This paper introduces CONSIDER, a prototype for a one-to-one AI tool for RMD navigation. Drawing on Mill's account of the epistemic value of disagreement, CONSIDER aims at value clarification through structured disagreement with an opposing LLM-generated opinion. We describe CONSIDER's design logic and analyse potential risks posed by such tools to guide future development.

2605.31573 2026-06-01 math.OC

The Value of Temporary Control for the M/M/1 Queue

M/M/1队列的临时控制价值

Odysseas Kanavetas, Camiel M. P. Koopmans, Floske M. Spieksma

AI总结 研究M/M/1队列中一次性临时服务速率控制选项的价值,通过价值迭代算法近似期望总节省成本,并分析最优策略结构和Blackwell最优性。

Comments Extended version of "The Value of Temporary Control for the M/M/1 Queue"

详情
AI中文摘要

本文考虑M/M/1队列的一次性临时服务速率控制选项。在采取该选项后,在一个指数分布的单次周期内,可以使用两种服务速率。一旦失去服务速率控制,系统将以固定服务速率$μ$继续运行。目标是最小化持有成本和服务成本之和。我们根据起始状态或起始分布,近似采取一次性选项的期望总节省成本。使用具有$M$-均匀几何递归的价值迭代算法,我们提出了近似期望总节省未来成本以及期望总节省折现未来成本的方法。此外,我们获得了关于最优策略结构和强Blackwell最优性的理论结果。最后通过数值方法将模型应用于各种实例。

英文摘要

In this article, a one-off option for temporary service rate control for the M/M/1 queue is considered. After taking this option, during a single exponentially distributed period, two service rates are available for use. Once service rate control is lost, the system continues with a fixed service rate $μ$. The objective is to minimise the sum of holding costs and service costs. We approximate the expected total saved cost by taking the one-off option, depending on the starting state or starting distribution. Using the Value Iteration algorithm with $M$-uniform geometric recurrence, we present methods to approximate the expected total saved future cost, as well as the expected total saved discounted future cost. Furthermore, we obtain theoretical results on the structure of optimal policies and strong Blackwell optimality. The paper is concluded by numerically applying the methods to various instances of the model.

2605.31571 2026-06-01 physics.chem-ph

All-Electron Relativistic Fully Self-Consistent $GW$ Study of Heteronuclear Actinide-Containing Diatomics

全电子相对论完全自洽$GW$研究含锕系杂核双原子分子

Jacob Adamski, Vibin Abraham, Dominika Zgid

AI总结 采用全电子完全自洽$GW$方法结合精确二分量相对论形式,计算含铀双原子分子(UC、UN、UO、UF)的电离能、电子亲和能、垂直脱附能、平衡结构和振动频率,验证其准确性并揭示自旋-轨道耦合对UF电子附着能的关键影响。

Comments 13 pages, 2 figures

详情
AI中文摘要

完全自洽$GW$(sc$GW$)近似提供了一种与起始点无关的格林函数方法,与高级波函数方法相比,具有有利的成本-精度平衡。本文通过精确二分量(X2C)形式引入相对论效应,对含铀双原子分子(UC、UN、UO和UF)进行了全电子sc$GW$研究。我们评估了绝热电离能以及电子附着和脱附能量(AEA和VDE),以及平衡结构和谐振动频率,并评估了它们对基组选择和相对论处理的敏感性。我们发现sc$GW$给出的电离能和振动性质与实验和高精度理论估计非常一致。对于AEA和VDE,弥散基组对于收敛至关重要。UF是标量相对论方法的一个特别具有挑战性的案例,因为其电子附着能和垂直脱附能受到自旋-轨道耦合的强烈影响,凸显了变分二分量处理的必要性。这些结果确立了全电子X2C-sc$GW$作为精确锕系分子能量学和光谱学的实用途径,并激励了未来对更大含铀系统的应用。

英文摘要

The fully self-consistent $GW$ (sc$GW$) approximation provides a Green's-function approach that is starting-point independent and offers a favorable cost-to-accuracy balance compared to high-level wavefunction methods. Here, we present an all-electron sc$GW$ study of uranium-containing diatomics (UC, UN, UO, and UF), incorporating relativistic effects through the exact two-component (X2C) formalism. We evaluate adiabatic ionization energies as well as electron-attachment and detachment energetics (AEA and VDE), together with equilibrium structures and harmonic vibrational frequencies, and we assess their sensitivity to basis-set choice and relativistic treatment. We find that sc$GW$ yields ionization energies and vibrational properties in very good agreement with experiment and high-accuracy theoretical estimates. For AEA and VDE, diffuse basis sets are essential for convergence. UF is a particularly challenging case for scalar relativistic methods because its electron-attachment and vertical detachment energies are strongly affected by spin--orbit coupling, highlighting the need for a variational two-component treatment. These results establish all-electron X2C-sc$GW$ as a practical route for accurate actinide-molecule energetics and spectroscopy and motivate future applications to larger uranium-containing systems.

2605.31570 2026-06-01 math-ph math.MP

Variational theory of Cosserat arches and affine tensors

Cosserat 拱与仿射张量的变分理论

Géry de Saxcé

AI总结 利用仿射张量形式重新审视螺旋理论,引入共动量和动量张量,并通过 Ehresmann 联络证明欧拉-庞加莱方程意味着动量张量平行输运,应用于刚体运动及 Cosserat 拱的静力学与动力学。

Comments 32 pages

详情
AI中文摘要

我们的目的是在仿射张量形式下重新审视螺旋理论,引入共动量和动量张量。欧拉-庞加莱方程的目标应用是力学问题,例如刚体运动或与动量张量概念相关的 Cosserat 拱的静力学和动力学。利用仿射标架主丛上的 Ehresmann 联络框架,我们证明了欧拉-庞加莱方程意味着动量张量是平行输运的。

英文摘要

Our purpose is to revisit the screw theory in light of the affine tensor formalism, introducing the co-momentum and momentum tensors. Our target-applications of the Euler-Poincaré equation are problems of mechanics such as the motion of the rigid body or the statics and the dynamics of Cosserat arches, in relation to the concept of momentum tensor. Using the framework of Ehresmann connections on the principal bundle of affine frames, we show that the Euler-Poincaré equation means that the momentum tensor is parallel-transported.

2605.31568 2026-06-01 math.PR

Lipschitz continuity of the time constant for continuum percolation

连续渗流时间常数的Lipschitz连续性

Karoline Dubin, Christian Gorski

AI总结 研究布尔模型连续渗流中时间常数关于强度的Lipschitz连续性,通过改编Can、Nakajima和Nguyen的论证方法证明。

Comments 17 pages, 0 figures

详情
AI中文摘要

我们考虑连续渗流的布尔模型,其中点由泊松点过程放置在$\mathbb{R}^d$中,距离不超过1的点对通过边连接。时间常数是远距离连通点对的化学距离(即图距离)与欧几里得距离之比的极限。Yao、Chen和Guo建立了超临界状态下时间常数的存在性。我们证明,在临界强度以上,时间常数是强度的Lipschitz连续函数。证明改编了Can、Nakajima和Nguyen最近的一个论证到连续设定中。

英文摘要

We consider the Boolean model of continuum percolation, where points are placed in $\mathbb{R}^d$ by a Poisson point process and pairs of points with distance at most 1 are connected by an edge. The time constant is the limiting ratio of the chemical distance (i.e. graph distance) to the Euclidean distance for pairs of distant connected points. Yao, Chen, and Guo established the existence of a time constant in the supercritical regime. We show that above the critical intensity, the time constant is a Lipschitz continuous function of the intensity. The proof adapts a recent argument of Can, Nakajima, and Nguyen to the continuous setting.

2605.31567 2026-06-01 stat.ME stat.AP

Addressing errors in multiple variables using generalized raking and cumulative probability models

使用广义分层和累积概率模型解决多变量中的错误

Eric S. Kawaguchi, Chun Li, Frank E. Harrell, Pamela A. Shaw, Thomas Lumley, Bryan E. Shepherd

AI总结 本研究针对电子健康记录等常规收集数据中的错误,提出使用广义分层结合累积概率模型来校准验证子样本权重,从而减少偏差并提高估计效率。

详情
AI中文摘要

常规收集的数据,如电子健康记录(EHR)数据,经常用于生物医学研究,但这些数据容易出错,可能会使研究结果产生偏差。在记录的子样本中验证数据可以减少偏差,并且通过将整个队列中可用的易错数据和子样本中可用的验证数据纳入分析,可以提高估计的效率。整合这两种数据源的一种方法是使用广义分层,它利用整个队列中的易错数据来校准验证抽样权重。受一项关于孕期母亲体重增加的EHR研究(带有验证子样本)的启发,我们开发并说明了累积概率模型(CPMs)的广义分层技术。CPMs是稳健的、基于秩的半参数模型,适用于连续、有序或混合类型的结果数据。我们为CPMs开发了高效的广义分层估计量,评估了它们相对于竞争方法的性能,并在一个检查与孕期体重增加相关因素的研究中展示了广义分层与CPMs的实用性和优势。

英文摘要

Routinely collected data, such as electronic health record (EHR) data, are frequently used for biomedical research, but these data are prone to errors, which can bias study findings. Validating data in subsamples of records can reduce bias, and the efficiency of estimates can be improved by incorporating in analyses both the error-prone data available on the entire cohort and the validated data available on the subsample. One approach to incorporate both data sources is with generalized raking, which calibrates validation sampling weights using error-prone data from the entire cohort. Motivated by an EHR study of maternal weight gain during pregnancy with a validation subsample, we develop and illustrate generalized raking techniques for cumulative probability models (CPMs). CPMs are robust, rank-based and semiparametric models for continuous, ordinal, or mixed type outcome data. We develop efficient generalized raking estimators for CPMs, evaluate their performance relative to competing methods, and demonstrate the utility and strengths of generalized raking with CPMs in a study that examines factors associated with weight gain during pregnancy.

2605.31566 2026-06-01 astro-ph.HE

Insights on the Gamma-Ray Bursts variability in their cosmological rest frame

关于伽马射线暴在宇宙学静止参考系中的变异性见解

Giovanni Della Casa, Fabrizio Fiore, Giuseppe Dilillo, Simonetta Puccetti, Andrea Vacchi

AI总结 通过分析伽马射线暴光变曲线中的最短变时标(尤其针对有红移测量的暴),研究其与中心引擎物理特性及谱参数(如各向同性能量和峰值能量)的关系。

Comments 12 pages, 18 figures Submitted to A&A. Revised version

详情
AI中文摘要

伽马射线暴的时间轮廓可以极其多变,从持续几秒的单脉冲到发生在数十甚至数百秒内的多个叠加脉冲。每个伽马射线暴光变曲线中显示的变异性可能是产生这些剧烈现象的中心引擎活动的结果,也可能是由于更大距离处的磁重联活动。这项工作的目的是寻找GRB光变曲线中隐藏的最短变时标,特别关注那些有红移测量的暴,时标短至几毫秒。然后将这种变异性与中心引擎的物理特性联系起来,并展示其与暴的谱参数(如各向同性能量和峰值能量)关系的证据。鉴于未来一代具有更高时间分辨率的卫星将使我们能够探索微秒区域的可能变异性,这项研究更加重要。

英文摘要

Gamma-ray bursts temporal profile can be extremely variable, going from a single pulse of a few seconds duration to multiple superimposed pulses occurring over tens or even hundreds of seconds. The variability displayed in the lightcurve of each gamma-ray burst can be the result of the activity taking place in the central engine that generates these violent phenomena, as well as due to magnetic reconnection activities at larger distances. The objective of this work is to find the shortest variability hidden in the lightcurves of the GRBs, with particular focus for the ones with measured redshift, on timescales as short as few milliseconds. This variability will then be related to physical characteristics of the central engine, and evidences of its relation with the spectral parameters of the burst, such as the isotropic energy and peak energy, will be presented. This research is even more relevant in view of the future generation of satellites with improved timing resolution, that will allow us to explore the possible variability in the microsecond region.

2605.31565 2026-06-01 math.OC math.DS

A derivative-free particle method for optimization in Hilbert spaces

希尔伯特空间中优化的无导数粒子方法

Hui Huang, Hicham Kouhkouh

AI总结 提出一种在可分离希尔伯特空间中的随机相互作用粒子系统及其平均场形式,证明了动力学适定性并分析了共识机制,在目标泛函适当假设下推导出长时间收敛到极小点的保证,从而将该方法推广到无限维优化问题。

详情
AI中文摘要

我们在可分离希尔伯特空间中引入了一个随机相互作用粒子系统及其相关的平均场公式。该模型保留了经典基于共识的优化的特征共识驱动结构,同时考虑了无限维动力学的分析挑战。我们建立了所提出动力学的适定性,并分析了相关的共识机制。此外,我们在目标泛函的适当假设下推导了收敛保证,显示了动力学在长时间区域内向极小点的集中。这将该方法的适用性扩展到一大类无限维优化问题。另外,我们研究了与数值实现相关的有限粒子系统,并提出了一种实用算法。

英文摘要

We introduce a stochastic interacting particle system in separable Hilbert spaces together with its associated mean-field formulation. The model is shown to retain the characteristic consensus-driven structure of classical Consensus-Based Optimization, while accounting for the analytical challenges of infinite-dimensional dynamics. We establish well-posedness of the proposed dynamics and analyze the associated consensus mechanism. Furthermore, we derive convergence guarantees under suitable assumptions on the objective functional, showing concentration of the dynamics toward the minimizer in the long-time regime. This extends the applicability of the method to a broad class of infinite-dimensional optimization problems. In addition, we study the corresponding finite-particle system relevant for numerical implementation and propose a practical algorithm.

2605.31560 2026-06-01 cs.CE cond-mat.mtrl-sci physics.app-ph physics.chem-ph

Can dents and gouges compromise the structural integrity of hydrogen transport pipelines?

凹痕和沟槽会危及氢气输送管道的结构完整性吗?

R. Das, B. Bezensek, E. Martínez-Pañeda

AI总结 通过实验和氢脆模型研究,发现氢气不会显著增加凹痕和沟槽缺陷的损伤严重性,除非在特定条件下氢气逸出被完全阻止。

详情
AI中文摘要

将天然气管道改造用于氢气输送需要了解外部缺陷(如凹痕和沟槽)在氢气暴露下如何影响结构完整性。为此,我们将实验与一种针对大塑性应变场景的新型氢脆模型相结合,该模型整合了:(i) 多陷阱氢传输,(ii) 有限应变塑性,以及 (iii) 依赖于氢和三轴度的损伤定律。模型的每个组成部分均通过X65管道钢的实验验证:(i) 氢渗透,(ii) 全尺寸管道压痕,以及 (iii) 在不同氢和三轴度水平下的力学测试。验证后的模型用于研究被动(在氢气暴露前压痕)和主动(在氢气存在下压痕)凹痕和沟槽。结果表明,氢气不会显著增加这些缺陷的损伤严重性,除非在内部加压且存在预先存在的被动凹痕和沟槽的管道外表面,氢的逸出被完全阻止。

英文摘要

Repurposing natural gas pipelines for hydrogen transport requires understanding how external defects, like dents and gouges, affect structural integrity under H$_2$ exposure. To address this, we combine experiments with a new hydrogen embrittlement model aimed at large plastic straining scenarios, which integrates: (i) multi-trap hydrogen transport, (ii) finite-strain plasticity, and (iii) a hydrogen- and triaxiality-dependent damage law. Each constituent of the model is validated with experiments on X65 pipeline steel: (i) hydrogen permeation, (ii) full-scale pipe-indentation, and (iii) mechanical testing at different hydrogen and triaxiality levels. The validated model is used to study \textit{passive} (indent before H$_2$ exposure) and \textit{active} (indent with H$_2$) dents and gouges. Results reveal that hydrogen does not significantly increase the damage severity of those defects, unless hydrogen egress is completely precluded at the outer surface of a pipeline that is being pressurised internally and contains a pre-existing \textit{passive} dent with a gouge.