arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.30327 2026-05-29 cs.LG cs.AI cs.CL math.ST stat.ML stat.TH

Reasoning with Sampling: Cutting at Decision Points

基于采样的推理:在决策点进行裁剪

Felix Zhou, Anay Mehrotra, Quanquan C. Liu

AI总结 提出Entropy-Cut Metropolis-Hastings算法,利用基础模型的下一词元熵作为代理识别关键决策点并重新采样,从而高效地从幂分布中采样以增强推理能力,在多个基准上超越基线和RL训练模型。

详情
AI中文摘要

前沿推理模型是通过对基础语言模型进行强化学习后训练而产生的。最近的研究对此提出了挑战,表明从基础模型分布的锐化版本(即所谓的幂分布)中采样,无需额外训练、精心策划的数据集或验证器,就能产生可比的推理能力。然而,使这种方法实用化需要高效地从幂分布中采样。采样器需要“混合”到幂分布,这需要在目标分布的模态之间移动;直观地说,例如尝试不同的推理策略。先前工作中提出的采样器反复在当前推理轨迹中均匀随机选择一个“裁剪”位置,并从该位置开始重新采样后缀。然而,推理轨迹通常包含少数关键决策(例如,证明策略或算法的选择),我们观察到均匀选择的裁剪往往重写局部细节,而不是重新审视决策点。我们引入了一种算法(Entropy-Cut Metropolis-Hastings),该算法使用基础模型的下一词元熵作为代理来识别关键决策点,并从这些位置重新采样。我们通过实验验证了熵跳变是决策点的有用代理,并在一个风格化的推理模型中证明了我们的方法的混合时间与轨迹中的决策数量成比例,而不是与可能大得多的词元数量成比例。在MATH500、HumanEval、GPQA Diamond和AIME26上,我们的方法始终优于基线和RL训练模型。

英文摘要

Frontier reasoning models are produced by posttraining base language models with reinforcement learning. Recent work has challenged this by showing that sampling from a sharpened version of the base model's distribution, a so-called power distribution, elicits comparable reasoning without additional training, curated datasets, or verifiers. However, making this method practical requires efficiently sampling from the power distribution. A sampler needs to "mix" to the power distribution, which necessitates moving between modes of the target distribution; intuitively, e.g., trying different reasoning strategies. The samplers proposed in prior works repeatedly select a "cut" position in the current reasoning trace uniformly at random and resample the suffix from that position onward. However, reasoning traces typically contain a few consequential decisions (e.g., the choice of proof strategy or algorithm), and we observe that a uniformly chosen cut tends to rewrite local details rather than revisit decision points. We introduce an algorithm (Entropy-Cut Metropolis-Hastings) that uses the base model's next-token entropy as a proxy to identify key decision points and resample from those positions. We empirically verify that entropy jumps are a useful proxy for decision points and, in a stylized model of reasoning, prove that our method's mixing time scales with the number of decisions in a trace rather than with the number of tokens, which can be much larger. Across MATH500, HumanEval, GPQA Diamond, and AIME26, our method consistently improves over baselines and RL-trained models.

2605.30321 2026-05-29 math.PR math.ST stat.TH

A Bayesian Proof and Interpretation of Talagrand's Majorizing Measure Theorem

Talagrand 优势测度定理的贝叶斯证明与解释

Ilias Zadik

AI总结 本文通过贝叶斯方法,利用高斯加性模型的两个面积恒等式,比较最大似然估计与贝叶斯最优估计,给出了 Talagrand 优势测度定理下界的简洁证明。

详情
AI中文摘要

在本文中,我们给出了 Talagrand 著名的优势测度定理(MMT)的一个简短贝叶斯证明。虽然 MMT 的上界方向相对直接地遵循标准论证,但下界方向被广泛认为是更困难的部分,并且已有几种不同的证明。与以往的方法不同,我们的证明不依赖于现有的高斯过程下界技术,也不依赖于组合、几何或编码理论构造。相反,我们从高斯加性模型的两个面积恒等式推导出下界。我们证明,有限集的高斯宽度是最大似然估计(MLE)的积分均方误差,而积分最小均方误差(MMSE)大于 Fernique-Talagrand 泛函,相差一个通用常数。然后,只需比较 MLE 与贝叶斯最优估计,即可直接证明 MMT 的困难方向。

英文摘要

In this paper, we give a short Bayesian proof of Talagrand's celebrated majorizing-measure theorem (MMT). While the upper-bound direction of MMT follows relatively directly from standard arguments, the lower-bound direction is widely regarded as the more difficult part and has received several distinct proofs. Unlike previous approaches, our proof does not rely on existing Gaussian processes lower bounds techniques, nor on combinatorial, geometric, or coding-theoretic constructions. Instead, we derive the lower bound from two area identities for Gaussian additive models. We show that the Gaussian width of a finite set is the integrated mean-squared error of the maximum-likelihood estimator (MLE), while the integrated minimum mean-squared error (MMSE) is larger than the Fernique-Talagrand functional, up to a universal constant. Simply then comparing the MLE with Bayes-optimal estimation gives a direct proof of the hard direction of MMT.

2605.30319 2026-05-29 stat.ML cs.AI cs.DS cs.LG math.ST stat.TH

Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion

通过矩阵补全改进异质性处理效应估计的保证

Anay Mehrotra, Phuc Tran, Van H. Vu, Manolis Zampetakis

AI总结 针对面板数据中的异质性处理效应估计问题,提出一种基于矩阵补全的简单高效估计器,在低秩假设下实现行向$\ell_2$误差$ ilde{O}(\sqrt{1/n + n/m^2})$,并首次建立了低秩逼近的行向$\ell_2$扰动界。

详情
AI中文摘要

现代因果推断的一个核心目标是估计异质性处理效应,以回答诸如“干预如何影响每个单元”的问题,而不仅仅是平均效应。我们研究面板数据下的该问题,其中我们观察到$n$个单元在$m$个时间点上的数据,且处理分配未知且非均匀。该设置中的数据自然表示为所有单元-时间处理效应的矩阵。估计异质性处理效应可以表示为对该矩阵中每一行平均值的良好估计。这使我们能够将问题表述为矩阵补全,在自然低秩假设下可解。然而,现有的矩阵补全保证不足以得到估计异质性处理效应所需的每行保证的有意义界;粗略地说,它们仅适用于估计平均处理效应界,正如最近一系列工作所示。我们给出一个简单、计算高效的估计器,在不知道倾向性且标准低秩和正则性假设下,实现行向$\ell_2$误差$ ilde{O}(\sqrt{ rac{1}{n} + rac{n}{m^2}})$。在技术上,我们的分析首次建立了低秩逼近的尖锐行向$\ell_2$扰动界,补充了现有的谱、Frobenius和逐元素扰动理论。

英文摘要

A central goal of modern causal inference is estimating heterogeneous treatment effects to answer questions like "how does an intervention affect each unit," rather than only on average. We study this problem with panel-data where we observe $n$ units across $m$ times under unknown, non-uniform treatment assignments. The data in this setting is naturally represented as a matrix of all unit--time treatment effects. Estimating heterogeneous treatment effects can then be expressed as obtaining a good estimation of each row's average in this matrix. This allows us to formulate the problem as matrix completion, which can be solved under natural low-rankness assumptions. However, existing matrix-completion guarantees are not powerful enough to get meaningful bounds for the per-row guarantee required for estimating the heterogeneous treatment effect; roughly speaking, they are only useful for estimating average treatment effect bounds, as also illustrated in a recent line of work. We give a simple, computationally efficient estimator that, without knowledge of the propensities and under standard low-rankness and regularity assumptions, achieves a row-wise $\ell_2$ error of $\tilde{O}(\sqrt{\frac{1}{n} + \frac{n}{m^2}})$. Technically, our analysis establishes the first sharp row-wise $\ell_2$-perturbation bound for low-rank approximation, complementing existing spectral-, Frobenius-, and entrywise perturbation theory.

2605.30300 2026-05-29 math.DG

Invariant statistical connections on the multivariate centered Gaussian model and their moduli spaces

多元中心高斯模型上的不变统计联络及其模空间

Hideyuki Ishi, Hikozo Kobayashi, Takayuki Okuda

AI总结 本文研究多元中心高斯模型上关于Fisher度量的不变统计联络,引入齐次黎曼流形上不变统计联络的模空间,并明确刻画了GL(n,R)和等距群不变统计联络(特别关注对偶平坦情形)及其模空间。

详情
Comments
40 pages. Comments are welcome!
AI中文摘要

我们研究了装备Fisher度量$g^F$的零均值多元正态分布空间$\mathcal{N}_0^n$(多元中心高斯模型)上的不变统计联络。通过从范畴论观点出发的两种自然等价关系,我们在齐次黎曼流形上引入了不变统计联络的模空间,并将这一框架应用于$(\mathcal{N}_0^n, g^F)$。我们明确确定了$GL(n,\mathbb{R})$-不变和$\mathrm{Isom}(\mathcal{N}_0^n, g^F)$-不变的统计联络,特别关注对偶平坦情形,并描述了相应的模空间。

英文摘要

We study invariant statistical connections on the space $\mathcal{N}_0^n$ of zero-mean multivariate normal distributions (the multivariate centered Gaussian model) equipped with the Fisher metric $g^F$. We introduce moduli spaces of invariant statistical connections on homogeneous Riemannian manifolds via two natural equivalence relations arising from a categorical viewpoint, and apply this framework to $(\mathcal{N}_0^n, g^F)$. We explicitly determine the $GL(n,\mathbb{R})$-invariant and $\mathrm{Isom}(\mathcal{N}_0^n, g^F)$-invariant statistical connections, with particular emphasis on the dually flat case, and describe the corresponding moduli spaces.

2605.30299 2026-05-29 math.PR math-ph math.MP

On reversing the Simon-Lieb inequality in high-dimensional percolation

高维渗流中Simon-Lieb不等式的反转

Romain Panis, Bruno Schapira

AI总结 研究高维Bernoulli渗流,证明Simon-Lieb不等式存在部分反转,并应用于证明$φ_{p_c}(S)$的一致有界性及近临界估计。

详情
Comments
35 pages, 6 figures
AI中文摘要

我们在维度${d>6}$的$\mathbb Z^d$上研究Bernoulli渗流。我们证明了van den Berg-Kesten不等式的一个经典推论(在Ising模型背景下常被称为Simon-Lieb不等式)存在部分反转。作为主要应用,我们证明了由Duminil-Copin和Tassion (Comm.\ Math.\ Phys., 2016)引入的量$φ_{p_c}(S)$在所有$S\subset \mathbb Z^d$上一致有界。这一部分反转进一步为若干关键结果提供了简短且自包含的路径,包括两点函数的近临界估计和临界单臂概率的尖锐界。

英文摘要

We study Bernoulli percolation on $\mathbb Z^d$ in dimensions ${d>6}$. We prove that a classical consequence of the van den Berg-Kesten inequality, often referred to as the Simon-Lieb inequality in the context of the Ising model, admits a partial reversal. As a main application, we show that the quantity $φ_{p_c}(S)$, introduced by Duminil-Copin and Tassion (Comm.\ Math.\ Phys., 2016), is uniformly bounded over all $S\subset \mathbb Z^d$. This partial reversal further yields a short and self-contained route to several key results, including near-critical estimates on the two-point function and sharp bounds on the critical one-arm probability.

2605.30298 2026-05-29 math.AG

Cohomology of the Moduli Stacks of Real Vector Bundles on Type I Real Algebraic Curves

第一类实代数曲线上实向量丛模空间的同调

Luca Dal Molin, Frank Neumann

AI总结 研究第一类实代数曲线上固定秩和度的实向量丛模空间,并用示性类确定其模2上同调代数。

详情
Comments
20 pages, 6 figures, v1
AI中文摘要

我们研究第一类实代数曲线上固定秩和度的实向量丛的模空间,并用示性类确定其模2上同调代数。

英文摘要

We study the moduli stacks of real vector bundles of fixed rank and degree on a type I real algebraic curve and determine its mod 2 cohomology algebra in terms of characteristic classes.

2605.28786 2026-05-29 math.FA

A quantum harmonic analysis approach to nonlinear time-frequency concentration

非线性时频集中的量子调和分析方法

Erling A. T. Svela, S. Ivan Trapasso

AI总结 本文利用量子调和分析技术,研究Cohen类时频分布的非线性集中问题,给出优化子存在的充分条件及反例,并推广到算子相空间表示和双相空间表示。

详情
Comments
47 pages
AI中文摘要

我们研究了Cohen类时频分布的非线性集中问题。利用量子调和分析(QHA)的最新技术,我们提供了正面和负面的结果,例如以“窗口算子”形式存在的优化子的充分条件,以及上确界永远无法达到的显式例子。我们还研究了窗口算子的结构性质,特别是产生弱连续集中泛函的算子,以及非线性集中问题允许优化子的算子,也超越了海森堡表示。然后我们考虑推广到算子相空间表示的集中问题研究。我们通过量子卷积考虑广义Husimi分布,以及它们在Hilbert--Schmidt算子和密度算子上的优化问题。最后,我们考虑双相空间上的算子表示,遵循量子时频分析的精神,并给出了关于Weyl符号的完整解。

英文摘要

We study nonlinear concentration problems for time-frequency distributions in the Cohen class. Using recent techniques from quantum harmonic analysis (QHA) we provide both positive and negative results, such as sufficient conditions for the existence of optimizers in terms of the ``window operator'' and explicit examples where the supremum is never attained. We also study the structural properties of window operators, in particular operators that yield weakly continuous concentration functionals and operators for which the nonlinear concentration problem admits an optimizer, also beyond the Heisenberg representation. We then consider generalizations to the study of concentration problems for phase space representations of operators. We consider generalized Husimi distributions via quantum convolution, and their optimization problem when optimizing over Hilbert--Schmidt and density operators. Lastly, we consider representations of operators on double phase space, in the spirit of quantum time-frequency analysis, and give a full solution in terms of the Weyl symbols.

2605.28746 2026-05-29 math.OC cs.AI cs.NE

Preference-Shaped Expected Hypervolume and R2 Improvement: Exact Computation and Monotonicity

偏好形状的期望超体积和R2改进:精确计算与单调性

Michael T. M. Emmerich

AI总结 本文研究了贝叶斯多目标优化中偏好形状的期望改进准则,精确计算了超体积和R2指标的期望改进,并分析了其单调性和几何特性。

详情
Comments
17 pages; Changes v1 (added strict Pareto compliance proof, removed missing figure references and redundant graphics section, added Liang et al 2026 citation in outlook. Improved figures and language
AI中文摘要

本文研究了贝叶斯多目标优化中偏好形状的期望改进准则。我们考虑了两个常用于类似算法目的但几何性质不同的指标族。超体积指标基于一个反乌托邦参考点,测量目标空间中的支配体积。R2指标基于一个乌托邦点,通过加权Tchebycheff标量化包络评估近似集。本文的目的是明确哪些偏好变换保留了精确计算、Pareto兼容性和单调性,哪些变换改变了底层几何。在超体积方面,我们通过Deng表示重新审视了经典的EHVI,在期望坐标中制定了乘积密度加权的EHVI,讨论了基于锥的EHVI作为线性锥变换后的普通EHVI,并将这些情况与截断EHVI区分开来,后者可能违反方差单调性。在R2方面,我们证明精确积分R2改进通常不是普通的目标空间加权超体积。障碍是低维的:Lebesgue密度超体积无法看到Tchebycheff标量化仍能检测到的某些边界贡献。然后我们证明精确积分R2改进恰好是一个标量化空间体积,即当前标量化包络与参考包络之间的Tchebycheff阴影的测度。该表示产生了离散R2的有限和ER2I算法、精确积分R2的求积方法,以及一个成就空间高斯代理公式,其中ER2I是标量高斯期望改进的积分。

英文摘要

This paper studies preference-shaped expected improvement criteria for Bayesian multiobjective optimization. We consider two indicator families which are often used for similar algorithmic purposes, but which are geometrically different. The hypervolume indicator is based on a dystopian reference point and measures dominated volume in objective space. The R2 indicator is based on a utopian point and evaluates approximation sets through weighted Tchebycheff scalarization envelopes. The purpose of the paper is to make precise which preference transformations preserve exact computation, Pareto compatibility, and monotonicity properties, and which transformations change the underlying geometry. On the hypervolume side, we revisit canonical EHVI through the Deng representation, formulate product-density weighted EHVI in desirability coordinates, discuss cone-based EHVI as ordinary EHVI after a linear cone transformation, and separate these cases from truncated EHVI, where variance monotonicity may fail. On the R2 side, we prove that exact integral R2 improvement is not, in general, an ordinary objective-space weighted hypervolume. The obstruction is lower-dimensional: Lebesgue-density hypervolume cannot see certain boundary contributions that Tchebycheff scalarizations still detect. We then show that exact integral R2 improvement is exactly a scalarization-space volume, namely the measure of the Tchebycheff shadow between the incumbent scalarization envelope and the reference envelope. This representation yields finite-sum ER2I algorithms for discrete R2, quadrature methods for exact integral R2, and an achievement-space Gaussian surrogate formulation in which ER2I is an integral of scalar Gaussian expected improvements.

2605.01103 2026-05-29 quant-ph math-ph math.GT math.MP math.SG

On Quantum Indeterminacy

论量子不确定性

Maurice de Gosson

AI总结 本文通过相空间凸几何和辛拓扑方法,提出量子不确定性的几何表述,将标准不确定度不等式作为必然结果导出,揭示不确定性为相空间的结构性质。

详情
Comments
First version
AI中文摘要

我们提出了一种量子不确定性的几何表述,从中标准不确定度不等式作为必然结果出现。我们的方法基于相空间中的凸几何和辛拓扑方法,不依赖于方差或协方差等统计描述符。相反,我们将经验位置和动量数据与凸体相关联,这些凸体的相互关系编码了量子力学的基本约束。核心工具是h-极对偶和辛容量,它们提供了可容许相空间配置的内在、无坐标界限。在此框架内,Robertson-Schrödinger不等式自然地作为更深层几何和拓扑原理的体现出现。这一观点表明,量子不确定性主要不是统计现象,而是由辛协变性支配的相空间的结构性质。因此,这些结果为不确定性原理提供了统一且概念上透明的基础。

英文摘要

We introduce a geometric formulation of quantum indeterminacy from which the standard uncertainty inequalities emerge as necessary consequences. Our approach is based on convex geometry in phase space and on methods from symplectic topology, and does not rely on statistical descriptors such as variances or covariances. Instead, we associate to empirical position and momentum data with convex bodies whose mutual relations encode the fundamental constraints of quantum mechanics. The central tools are h-polar duality and symplectic capacities, which provide intrinsic, coordinate-free bounds on admissible phase-space configurations. Within this framework, the Robertson-Schrodinger inequalities arise naturally as manifestations of deeper geometric and topological principles. This perspective suggests that quantum indeterminacy is not primarily a statistical phenomenon, but rather a structural property of phase space governed by symplectic covariance. The results thus provide a unified and conceptually transparent foundation for the uncertainty principle.

2510.08535 2026-05-29 stat.ML cs.LG math.PR

Permutation-Invariant Spectral Learning via Dyson Diffusion

通过戴森扩散的置换不变谱学习

Tassilo Schwarz, Cai Dieball, Constantin Kogler, Renaud Lambiotte, Arnaud Doucet, Aljaž Godec, George Deligiannidis

AI总结 提出戴森扩散模型,利用随机矩阵理论从分析上提取扩散过程的谱特性,将归纳偏置从架构转移到动力学,实现置换不变的谱学习,准确学习图谱并超越现有图扩散模型。

详情
AI中文摘要

扩散模型是生成建模的核心,并已通过扩散邻接矩阵表示适应于图。对于具有$n$个节点的图,存在多达$n!$个这样的表示,这一挑战仅通过使用置换等变学习架构得到部分缓解。尽管计算效率高,现有的图扩散模型难以区分某些图族及其谱,除非图数据被增强以特定的特征。这一缺陷源于在学习架构中强制执行归纳偏置。在这项工作中,我们利用随机矩阵理论从分析上提取扩散过程的谱特性,从而将大部分归纳偏置从架构推入动力学。在此基础上,我们引入了戴森扩散模型,该模型采用戴森布朗运动来捕捉邻接矩阵上Ornstein-Uhlenbeck过程的谱动力学。此外,以谱动力学为条件,我们制定了一个李群扩散,适当地建模剩余的自由度。引人注目的是,由此产生的学习问题在李代数层面上变为置换不变的。我们证明,戴森扩散模型能够准确学习图谱,并优于现有的图扩散模型。

英文摘要

Diffusion models are central to generative modeling and have been adapted to graphs by diffusing adjacency matrix representations. The challenge of having up to $n!$ such representations for graphs with $n$ nodes is only partially mitigated by using permutation-equivariant learning architectures. Despite their computational efficiency, existing graph diffusion models struggle to distinguish certain graph families and their spectra, unless graph data are augmented with ad hoc features. This shortcoming stems from enforcing the inductive bias within the learning architecture. In this work, we leverage random matrix theory to analytically extract the spectral properties of the diffusion process, allowing us to push most of the inductive bias from the architecture into the dynamics. Building on this, we introduce the Dyson Diffusion Model, which employs Dyson's Brownian motion to capture the spectral dynamics of an Ornstein-Uhlenbeck process on the adjacency matrix. Furthermore, conditioned on the spectral dynamics, we formulate a Lie group diffusion, appropriately modeling the remaining degrees of freedom. Strikingly, the resulting learning problem becomes permutation invariant at the Lie algebra level. We demonstrate that the Dyson Diffusion Model learns graph spectra accurately and outperforms existing graph diffusion models.

2506.00231 2026-05-29 math-ph math.MP quant-ph

Detecting screens modeled by Schrödinger operators that generate $C_0$ contraction semigroups

检测由生成 $C_0$ 压缩半群的薛定谔算子建模的屏幕

Lawrence Frolov

AI总结 本文利用边界四元组理论参数化所有生成元扩展薛定谔哈密顿量的 $C_0$ 压缩半群,证明此类演化由沿边界 $\\partial \\\Omega$ 的线性吸收边界条件生成,并结合 Werner 的工作证明检测时间存在 Born 规则且几乎必然在有限时间内发生。

详情
AI中文摘要

考虑一个非相对论性量子粒子,其波函数 $\\\psi$ 位于有界 $C^2$ 区域 $\\\Omega\\\subset \\\mathbb{R}^n$ 中,并假设探测器沿边界 $\\\partial \\\Omega$ 放置。假设探测过程不可逆,其机制与时间无关且是硬的,即探测仅沿边界 $\\\partial \\\Omega$ 发生。在这些条件下,Tumulka 非正式地论证了 $\\\psi$ 的动力学必须由弱解薛定谔方程的 $C_0$ 压缩半群控制,并提议通过在 $\\\partial \\\Omega$ 处施加与时间无关的局部吸收边界条件来建模探测器。在本文中,我们应用新发现的边界四元组理论来参数化所有生成元扩展薛定谔哈密顿量的 $C_0$ 压缩半群,并证明了 Tumulka 断言的一个变体:所有这样的演化都是由沿 $\\\partial \\\Omega$ 对 $\\\psi$ 施加线性吸收边界条件产生的。我们将此结果与 Werner 的工作相结合,表明每个 $C_0$ 压缩半群自然地允许沿 $\\\partial \\\Omega$ 的探测时间的 Born 规则,并且我们证明如果探测器已沿 $\\\partial \\\Omega$ 处处放置,则探测几乎必然在有限时间内发生。

英文摘要

Consider a non-relativistic quantum particle with wave function $ψ$ in a bounded $C^2$ region $Ω\subset \mathbb{R}^n$, and suppose detectors are placed along the boundary $\partial Ω$. Assume the detection process is irreversible, its mechanism is time independent and also hard, i.e., detections occur only along the boundary $\partial Ω$. Under these conditions Tumulka informally argued that the dynamics of $ψ$ must be governed by a $C_0$ contraction semigroup that weakly solves the Schrödinger equation and proposed modeling the detector by a time-independent local absorbing boundary condition at $\partial Ω$. In this paper, we apply the newly discovered theory of boundary quadruples to parameterize all $C_0$ contraction semigroups whose generators extend the Schrödinger Hamiltonian, and prove a variant of Tumulka's claim: all such evolutions are generated by the placement of a linear absorbing boundary condition on $ψ$ along $\partial Ω$. We combine this result with the work of Werner to show that each $C_0$ contraction semigroup naturally admits a Born rule for the time of detection along $\partial Ω$, and we prove that a detection will almost surely occur in finite time if detectors have been placed everywhere along $\partial Ω$.

2503.24022 2026-05-29 math.ST stat.ML stat.TH

Wasserstein KL-divergence for Gaussian distributions

高斯分布的Wasserstein KL散度

Adwait Datar, Nihat Ay

AI总结 提出基于Wasserstein几何的高斯分布KL散度新版本(WKL散度),证明其与样本空间几何一致,且狄拉克测度的WKL散度正比于两点间距离平方。

详情
AI中文摘要

我们引入了一种基于Wasserstein几何的高斯分布KL散度新版本,称为WKL散度。我们证明该版本与样本空间${\Bbb R}^n$的几何一致。特别地,我们可以评估集中在两个点上的狄拉克测度的WKL散度,结果发现它正比于这两点之间的平方距离。

英文摘要

We introduce a new version of the KL-divergence for Gaussian distributions which is based on Wasserstein geometry and referred to as WKL-divergence. We show that this version is consistent with the geometry of the sample space ${\Bbb R}^n$. In particular, we can evaluate the WKL-divergence of the Dirac measures concentrated in two points which turns out to be proportional to the squared distance between these points.

2605.30286 2026-05-29 math.OC

Proper efficiency results in vector optimisation in real linear-topological spaces based on vectorial penalisation

基于向量惩罚的实线性拓扑空间中向量优化的真有效结果

Paul Schmölling, Christian Günther, Christiane Tammer, Elisabeth Köbis

AI总结 本文利用向量惩罚方法,在目标函数满足锥凸性假设下,研究约束与无约束向量优化问题的真有效解集之间的关系。

详情
AI中文摘要

本文研究目标函数作用于实线性拓扑空间之间的约束向量优化问题。我们的目的是在目标函数满足某些锥凸性假设下,利用向量惩罚方法,研究约束和无约束向量优化问题的真有效解集之间的关系。

英文摘要

In this paper, we are dealing with constrained vector optimisation problems where the objective function acts between real linear-topological spaces. Our aim is to study the relationships between the sets of properly efficient solutions to constrained and unconstrained vector optimisation problems under certain cone convexity assumptions on the objective function using a vectorial penalisation approach.

2605.30279 2026-05-29 math.AG math.NT

A note on Azumaya algebras and one-forms

关于Azumaya代数与一次微分形式的注记

Siqing Zhang

AI总结 本文证明若光滑簇X存在非闭的整体一次微分形式,则其Frobenius扭转X'上由晶体微分算子诱导的非分裂Azumaya代数在X'的某个一次覆盖上仍不分裂,回答了Sasha Petrov的问题。

详情
AI中文摘要

光滑簇X上的晶体微分算子在其Frobenius扭转X'的余切丛上诱导了一个非分裂的Azumaya代数。在某些情况下,该Azumaya代数限制到X'的有限覆盖上时会分裂。在这篇短注中,我们证明,只要X存在非闭的整体一次微分形式,就存在X'的一次覆盖使得该Azumaya代数在其上不分裂,从而回答了Sasha Petrov的问题。

英文摘要

The crystalline differential operators on a smooth variety X give rise to a non-split Azumaya algebra over the cotangent bundle of the Frobenius twist X'. In some cases, this Azumaya algebra splits when restricted to finite covers of X'. In this short note, we show that, whenever X has a non-closed global one-form, there is a degree one cover of X' on which the Azumaya algebra does not split, answering a question of Sasha Petrov.

2605.30272 2026-05-29 math.NA cs.NA

IGA-ODIL: Optimizing DIscretre robust Loss with Isogeometric Analysis to solve forward and inverse problems faster using machine learning tools

IGA-ODIL: 利用等几何分析优化离散鲁棒损失,借助机器学习工具更快求解正反问题

Maciej Paszyński, Tomasz Służalec

AI总结 提出IGA-ODIL框架,用B样条基函数替代神经网络参数化,结合等几何分析与鲁棒变分残差最小化,实现稀疏雅可比矩阵和高效高斯-牛顿优化,在多个基准问题上相比PINNs和CRVPINNs实现数量级加速。

详情
Comments
Physics-informed neural networks, Isogeometric analysis, Residual minimization, Gauss--Newton methods, Scientific machine learning, PDE-constrained optimization
AI中文摘要

物理信息神经网络(PINNs)将偏微分方程的解表述为神经网络参数化上的残差最小化问题。尽管高度灵活,但使用随机梯度下降算法的现代变体优化PINNs代价高昂。另一方面,使用高斯-牛顿方法迭代计算PINN参数化面临收敛困难、雅可比矩阵稠密以及条件数差等问题,限制了二阶优化方法的有效性。在这项工作中,我们引入了IGA-ODIL,一个基于样条的残差最小化框架,结合了优化离散损失(ODIL)、鲁棒变分残差最小化和等几何分析(IGA)的思想。与PINNs的神经网络参数化不同,未知解由光滑的B样条基函数表示,从而产生稀疏结构的雅可比矩阵和高效的高斯-牛顿优化。我们还基于加权Gram算子推导了鲁棒残差公式,使损失函数与真实误差相关。所得系统继承了经典有限元和等几何方法的局部性、稀疏性和逼近理论性质,同时保留了科学机器学习的残差学习理念。所提出的方法在多个基准问题上进行了评估,包括泊松方程、对流主导的对流-扩散方程、具有高度振荡解的亥姆霍兹问题、非线性Allen-Cahn方程以及逆亥姆霍兹参数识别。数值实验表明,与PINNs和CRVPINNs相比,该方法在保持高精度和鲁棒性的同时实现了数量级的加速。

英文摘要

Physics-informed neural networks (PINNs) formulate the solution of partial differential equations as residual minimization problems over neural network parameterizations. Although highly flexible, optimization of PINNs using modern variants of Stochastic Gradient Descent algorithms is expensive. On the other hand, iterative computation of PINN parameterization using the Gauss-Newton method suffers from convergence difficulties, dense Jacobian structures, and poor conditioning that limit the effectiveness of second-order optimization methods. In this work, we introduce IGA-ODIL, a spline-based residual minimization framework combining ideas from Optimizing DIscrete Loss (ODIL), robust variational residual minimization, and Isogeometric Analysis (IGA). Instead of neural-network parameterizations of PINNs, the unknown solution is represented by smooth B-spline basis functions, leading to sparse structured Jacobians and efficient Gauss--Newton optimization. We also derive robust residual formulations based on weighted Gram operators, making the loss function related with the true error. The resulting systems inherit locality, sparsity, and approximation-theoretic properties of classical finite element and isogeometric methods while preserving the residual-learning philosophy of scientific machine learning. The proposed methodology is evaluated on several benchmark problems, including Poisson equations, convection-dominated advection--diffusion equations, Helmholtz problems with highly oscillatory solutions, nonlinear Allen--Cahn equations, and inverse Helmholtz parameter identification. Numerical experiments demonstrate orders-of-magnitude speedups compared with PINNs and CRVPINNs while maintaining high accuracy and robustness.

2605.30266 2026-05-29 math.ST stat.TH

Wasserstein Least Squares: A Canonical Regression Method for Probability Distributions

Wasserstein最小二乘法:概率分布的规范回归方法

Uriel Martínez León, Jonathan Niles-Weed

AI总结 本文提出Wasserstein最小二乘回归方法,从凸分析角度证明其是欧几里得最小二乘在概率分布空间上的规范扩展,并在模板变形模型下实现n^{-1/2}的估计速率,应用于人口统计学数据分析。

详情
AI中文摘要

我们对Wasserstein最小二乘问题进行了数学和统计分析,这是一种针对向量值协变量和分布值响应的回归方法。我们的提议与其他分布回归方法形成对比,因为它具有直接基于随机变量的解释,是经典随机效应模型的非参数类比。在数学方面,我们采用Lavenant (2024)的策略,从凸分析的角度证明Wasserstein最小二乘是欧几里得最小二乘在概率分布空间上的规范扩展;这一观点引出了Wasserstein最小二乘问题的多边缘和对偶公式,扩展了Wasserstein重心类似的理论。我们在模板变形模型下对Wasserstein最小二乘问题进行了统计分析,令人惊讶地表明,估计可以达到n^{-1/2}的速率。作为特例,我们获得了Wasserstein重心估计的改进速率,这比Ahidar-Coutrix、Le Gouic和Paris (2020)建立的速率呈指数级改进。最后,我们提出了一种启发式粒子方法用于Wasserstein最小二乘,并利用它对来自RAND健康与退休研究的大规模人口统计学数据进行了新颖的分析。

英文摘要

We perform a mathematical and statistical analysis of the Wasserstein least squares problem, a regression method for vector-valued covariates and distribution-valued responses. Our proposal contrasts with other distributional regression methods by having a direct interpretation in terms of random variables, as a nonparametric analogue of the classic random-effects model. On the mathematical side, we use a strategy of Lavenant (2024) to show that Wasserstein least squares is the canonical extension of Euclidean least squares to the space of probability distributions from the perspective of convex analysis; this viewpoint gives rise to multimarginal and dual formulations of the Wasserstein least squares problem, extending a similar theory for Wasserstein barycenters. We perform a statistical analysis of the Wasserstein least squares problem under the template deformation model, showing, surprisingly, that estimation is possible at the n^{-1/2} rate. As a special case, we obtain improved rates of estimation for Wasserstein barycenters, which are an exponential improvement over those established by Ahidar-Coutrix, Le Gouic and Paris (2020). Finally, we propose a heuristic particle method for Wasserstein least squares and use it to conduct a novel analysis of large-scale demographic data from the RAND Health and Retirement Study.

2605.30262 2026-05-29 math.CT math.AC math.AG

Semi-Bousfield classes and nonmonotone perversities

半Bousfield类与非单调perversities

Dolors Herbera, Michal Hrbek, Giovanna Le Gros

AI总结 本文在刚性紧生成张量三角范畴中引入半Bousfield类,证明其统一了Bousfield类和紧生成张量兼容t-结构,并应用于Noetherian方案的无界导出范畴,将分层双射扩展至非单调perversities。

详情
Comments
43 pages, comments are welcome
AI中文摘要

在刚性紧生成张量三角范畴的一般性中,我们通过相对于固定合理$t$-结构的正次数张量积的消失来引入半Bousfield类。我们证明半Bousfield类提供了Bousfield类和紧生成张量兼容$t$-结构的共同推广。然后我们特化到Noetherian方案$X$的无界导出范畴$\mathcal{D}_{\mathrm{qc}}(X)$,并证明分层双射自然地扩展为一个赋值,该赋值将$X$上的(不一定单调的)perversity映射到$\mathcal{D}_{\mathrm{qc}}(X)$中的半Bousfield类。如果$X$是正则的,该赋值构成整个半Bousfield格的分层,而在奇异情形下,其像恰好由那些来自有限Tor维数对象的半Bousfield类组成。将此双射限制到单调perversities上,恢复了Dubey和Sahoo(arXiv:2204.05015)最近关于紧生成张量兼容$t$-结构的分类。

英文摘要

In the generality of a rigidly-compactly generated tensor triangulated category, we introduce semi-Bousfield classes in terms of the vanishing of the tensor product in positive degrees with respect to a fixed reasonable $t$-structure. We show that semi-Bousfield classes provide a common generalisation of Bousfield classes and compactly generated tensor-compatible $t$-structures. Then we specialise to the setting of the unbounded derived category $\mathcal{D}_{\mathrm{qc}}(X)$ of a Noetherian scheme $X$ and show that the stratification bijection naturally extends to an assignment which takes a (not necessarily monotone) perversity on $X$ to a semi-Bousfield class in $\mathcal{D}_{\mathrm{qc}}(X)$. If $X$ is regular, this assignment constitutes a stratification of the whole semi-Bousfield lattice, while in the singular case, its image consists precisely of those semi-Bousfield classes arising from objects of finite Tor-dimension. Restricting this bijection to monotone perversities recovers the recent classification of compactly generated tensor-compatible $t$-structures of Dubey and Sahoo, (arXiv:2204.05015).

2605.30254 2026-05-29 math.DG math.SP

Lower bounds for the low Steklov eigenvalues

低 Steklov 特征值的下界

Tirumala Chakradhar, Bruno Colbois, Asma Hassannezhad

AI总结 针对具有 b 个边界分支的紧致连通可定向黎曼流形,通过迹不等式和边界环的电阻解释,获得了低 Steklov 特征值(σ_k, 1≤k≤b-1)的几何下界,揭示了内部几何对低特征值的影响。

详情
Comments
Comments are welcome
AI中文摘要

对于一个具有 $b$ 个边界分支的紧致、连通、可定向黎曼流形,我们获得了低 Steklov 特征值(即 $σ_k$, $1\le k\le b-1$)的几何下界。我们的结果补充了早期仅适用于 $k\ge b$ 且依赖于边界附近几何的结果,展示了内部几何如何影响低特征值。我们的结果还在负曲率受控流形的情形下给出了低 Steklov 特征值的下界,从而通过另一种证明恢复了该背景下的类似结果。主要结果的证明基于将 Steklov 特征值与包含边界环的流形连通子区域的 Neumann 特征值联系起来的迹不等式。该不等式中出现的几何系数由边界环的电阻(可解释为电学量)的显式公式给出。

英文摘要

For a compact, connected, orientable Riemannian manifold with $b$ boundary components, we obtain geometric lower bounds for the low Steklov eigenvalues, namely $σ_k$, $1\le k\le b-1$. Our results complement earlier results, which apply only to $σ_k$ with $k\ge b$ and depend on the geometry near the boundary, by showing how the interior geometry influences the low eigenvalues. Our result also yields lower bounds for the low Steklov eigenvalues in the setting of pinched negatively curved manifolds, thus recovering similar results in that context through an alternative proof. The proof of the main result is based on the trace inequality relating the Steklov eigenvalue to the Neumann eigenvalues of the connected subdomains of the manifold containing a boundary collar. The geometric coefficient appearing in this inequality is given by an explicit formula in terms of a quantity that can be interpreted as the electrical resistance of the boundary collar.

2605.30243 2026-05-29 math.DS math.PR

Energetic characterisation of transient clustering dynamics in aggregation-diffusion systems

聚集-扩散系统中瞬态聚集动力学的能量表征

Nathalie Wehlitz, Richard Scherzer, Carsten Hartmann, Stefanie Winkelmann

AI总结 本文从能量角度研究非局部聚集-扩散系统中的瞬态聚集动力学,利用Wasserstein梯度流结构分析聚集与扩散的热力学竞争,并通过数值实验揭示交替主导的瞬态机制及聚集观测量的局限性。

详情
Comments
19 pages, 9 figures
AI中文摘要

我们从能量角度研究了非局部聚集-扩散系统中的瞬态聚集动力学。从随机相互作用粒子系统出发,我们研究了环面上相关的宏观McKean-Vlasov方程,并利用其Wasserstein梯度流结构来分析相互作用驱动的聚集与熵驱动的扩散之间的热力学竞争。通过对局部吸引相互作用核的数值实验,我们识别出沿着收敛到固定平衡的轨迹上交替出现的聚集主导和扩散主导的瞬态区域。这些动力学可以解释为一种非单调的聚集行为。此外,我们证明了聚集观测量(如密度峰值高度)仅部分耦合到潜在的能量机制,因此不能唯一地表征相关的宏观输运动力学。我们的结果强调了变分结构不仅用于平衡分析,而且作为理解相互作用粒子系统中瞬态聚集现象框架的作用。

英文摘要

We investigate transient clustering dynamics in nonlocal aggregation-diffusion systems from an energetic perspective. Starting from a stochastic interacting particle system, we study the associated macroscopic McKean-Vlasov equation on the torus and exploit its Wasserstein gradient-flow structure to analyse the thermodynamic competition between interaction-driven aggregation and entropy-driven diffusion. Through numerical experiments for locally attractive interaction kernels, we identify alternating aggregation- and diffusion-dominated transient regimes along trajectories converging to fixed equilibria. These dynamics can be interpreted as a form of non-monotone clustering behaviour. Moreover, we demonstrate that clustering observables, such as the density peak height, are only partially coupled to the underlying energetic mechanisms and therefore do not uniquely characterise the relevant macroscopic transport dynamics. Our results highlight the role of the variational structure not only for equilibrium analysis, but also as a framework for understanding transient clustering phenomena in interacting particle systems.

2605.30228 2026-05-29 math.AP

Eigenvalue bounds for quantum dot Dirac operators

量子点Dirac算子的特征值界

Joaquim Duran

AI总结 利用量子点Dirac算子与∂̄-Robin Laplacian的联系,通过最小正特征值的图形关系,推导出上下界转换方法,并给出仅依赖于区域几何量的新特征值界,特别对凸薄区域得到Faber-Krahn型不等式。

详情
Comments
20 pages, 3 figures
AI中文摘要

我们利用量子点Dirac算子与$\overline\partial$-Robin Laplacian之间的联系。首先,我们发现它们的最小正特征值之间存在一个图形关系,这使我们能够推导出将(上界和下界)从一个转换到另一个的方法。作为一个应用,我们为量子点Dirac算子的特征值提供了新的上下界,这些界仅依赖于底层区域的几何量。特别地,我们获得了凸薄区域的一些Faber-Krahn型不等式。

英文摘要

We exploit the connection between quantum dot Dirac operators and $\overline\partial$-Robin Laplacians. First, we find a graphical relation between their smallest positive eigenvalues, which allows us to deduce a recipe for translating bounds (from above and below) from one to the other. As an application, we provide new upper and lower bounds for the eigenvalues of the quantum dot Dirac operators, which depend only on geometric quantities of the underlying domain. In particular, we obtain some Faber-Krahn type inequalities for convex thin domains.

2605.30223 2026-05-29 math.AG

Hodge numbers of moduli of principal bundles on a curve

曲线上主丛模空间的Hodge数

Chiu-Chu Melissa Liu, Florent Schaffhauser

AI总结 通过证明关于双变量收敛幂级数的反演定理,得到了半稳定主丛模栈的Hodge-Poincaré级数的闭公式,并计算了所有主丛模栈的Hodge结构随曲线周期矩阵的变化。

详情
Comments
34 pages
AI中文摘要

我们证明了一个关于某些双变量收敛幂级数族所满足的递归公式的反演定理。这些幂级数由光滑射影曲线$X$上度$d \in \pi_1 G$的主$G$-丛的Harder-Narasimhan类型索引,其中$G$是连通的复约化群。作为一个应用,我们得到了度$d$的半稳定主$G$-丛模栈的Hodge-Poincaré级数的闭公式。我们还计算了$X$上所有主$G$-丛模栈的Hodge结构的变化,作为该曲线周期矩阵的函数。

英文摘要

We prove an inversion theorem for recursive formulas satisfied by certain families of converging power series in two variables. These power series are indexed by the Harder-Narasimhan types of principal $G$-bundles of degree $d \in π_1 G$ on a smooth projective curve $X$, where $G$ is a connected complex reductive group. As an application, we obtain a closed formula for the Hodge-Poincaré series of moduli stacks of semistable principal $G$-bundles of degree $d$. We also compute the variation of Hodge structure of the moduli stack of all principal $G$-bundles over $X$, as a function of the period matrix of that curve.

2605.30216 2026-05-29 hep-ph hep-th math-ph math.MP

HyperPrecision: A Mathematica package for High-Precision Numerical Evaluation of Multivariate Hypergeometric Functions

HyperPrecision: 用于多元超几何函数高精度数值计算的 Mathematica 包

Sumit Banik, Souvik Bera

AI总结 提出 HyperPrecision Mathematica 包,通过自动构造 Pfaffian 系统并沿一维等高线求解常微分方程,实现 Horn 型多元超几何函数及其 Laurent 展开的高精度数值计算。

详情
Comments
49 pages, 7 figures, code repository: https://github.com/HyperPrecision/HyperPrecision
AI中文摘要

本文介绍了 HyperPrecision,一个用于通用 Horn 型多元超几何函数及其在小参数 $ε$ 下 Laurent 展开的高精度数值计算的 Mathematica 包。这类函数广泛出现在物理学和数学中,应用范围从量子场论和弦理论到数论和统计学。然而,它们的高精度数值计算仍然具有挑战性,因为其定义级数仅在受限域内收敛,而解析延拓到这些域之外通常是非平凡的。HyperPrecision 通过自动构造给定超几何函数的 Pfaffian 偏微分方程组,并将其限制在变量空间中连接起点和目标点的一维等高线上来解决这个问题。由此得到的常微分方程通过 Frobenius 方法求解,边界条件由定义级数解析确定。我们通过评估常见的多元超几何函数(包括 Appell $F_1$、$F_2$、$F_3$ 和 $F_4$ 函数,Horn $G$ 和 $H$ 级数,以及 Lauricella $F_A$、$F_B$、$F_C$ 和 $F_D$ 函数),并考虑在角度积分、费曼积分以及宇宙学和全息关联函数中的应用,来说明该包的使用。

英文摘要

In this paper, we present HyperPrecision, a Mathematica package for high-precision numerical evaluation of general Horn-type multivariate hypergeometric functions and their Laurent expansions in a small parameter $ε$. Such functions appear widely in physics and mathematics, with applications ranging from quantum field theory and string theory to number theory and statistics. Their high-precision numerical evaluation, however, remains challenging, since their defining series converge only in restricted domains and analytic continuation beyond these domains is, in general, non-trivial. HyperPrecision addresses this problem by automatically constructing the Pfaffian system of partial differential equations for a given hypergeometric function and restricting it to a one-dimensional contour in the space of variables connecting the starting to the target point. The resulting ordinary differential equation is then solved by the Frobenius method, with boundary conditions fixed analytically by the defining series. We illustrate the use of the package by evaluating commonly occurring multivariate hypergeometric functions, including the Appell $F_1$, $F_2$, $F_3$, and $F_4$ functions, the Horn $G$- and $H$-series, and the Lauricella $F_A$, $F_B$, $F_C$, and $F_D$ functions, as well as by considering applications to angular integrals, Feynman integrals, and cosmological and holographic correlators.

2605.30199 2026-05-29 math-ph gr-qc math.MP

The Continuum Limit Analysis of Causal Fermion Systems for Curved Spacetimes

弯曲时空因果费米子系统的连续极限分析

Patrick Fischer, Felix Finster

AI总结 本文在全局双曲时空中从代数量子场论出发构造因果费米子系统,通过准自由Hadamard态的单粒子密度算子识别费米子投影子,并发展连续极限分析,证明因果作用原理的Euler-Lagrange方程等价于耦合的Einstein-Dirac方程。

详情
Comments
39 pages
AI中文摘要

我们从代数量子场论的框架出发,为全局双曲时空构造因果费米子系统。费米子投影子被识别为准自由Hadamard态的单粒子密度算子。通过一个与图无关的$i\varepsilon$-正规化方案,紫外正规化被内置于费米子投影子中。在全局双曲时空中发展了连续极限分析。结果表明,在此设置下,因果作用原理的Euler-Lagrange方程成立当且仅当耦合的Einstein-Dirac方程成立。

英文摘要

We construct the causal fermion system for globally hyperbolic spacetimes starting in the framework of algebraic quantum field theory. The fermionic projector is identified with the one-particle density operator of a quasi-free Hadamard state. The ultraviolet regularization is built into the fermionic projector via a chart-independent $i\varepsilon$-regularization scheme. The continuum limit analysis is developed in globally hyperbolic spacetimes. It is shown that the Euler-Lagrange equations of the causal action principle are satisfied in this setup if and only if the coupled Einstein-Dirac equations hold.

2605.30197 2026-05-29 math.DS

Connection of hypocoercivity and hypocontractivity via the $θ$-methods

通过 $\theta$-方法连接超耗散性和超收缩性

Anton Arnold, Stefan Egger

AI总结 本文探讨了线性演化方程的超耗散性(指数衰减和传播子范数的短时衰减)如何通过 $\theta$-方法(特别是 $\theta \neq 1/2$)传递到离散化,并证明隐式离散化($\theta \in (1/2,1]$)是收缩的而非超收缩的,而 $\theta \in [0,1/2)$ 的离散化在时间步长足够小时是收缩的。

详情
Comments
7 pages
AI中文摘要

近期文献表明,线性演化方程的超耗散性(特别是其指数衰减和传播子范数的尖锐短时衰减)通过中点法则传递到其离散化。本文讨论了这一联系对于(其他)$\theta$-方法,即对于 $\theta\neq\frac12$ 的情况。结果表明,任何隐式离散化($\theta\in (\frac12,1]$,对应于超耗散连续时间演化方程)是收缩的,而不仅仅是超收缩的——与中点法则形成对比。对于耗散连续时间演化方程,$\theta\in [0,\frac12)$ 的离散化在时间步长足够小时是收缩的。

英文摘要

Recent literature shows that hypocoercivity properties of linear evolution equations (in particular their exponential decay and the sharp short time decay of their propagator norm) carry over to their discretization via the midpoint rule. This note discusses this connection for the (other) $θ$-methods, i.e.\ for $θ\ne\frac12$. It is shown that any implicit discretization with $θ\in (\frac12,1]$ (pertaining to a hypocoercive continuous-time evolution equation) is contractive, and not only hypocontractive -- in contrast to the midpoint rule. For a coercive continuous-time evolution equation, a discretization with $θ\in [0,\frac12)$ is contractive for time steps small enough.

2605.30191 2026-05-29 math.FA

On $L^p$-spaces of functions with values in locally convex spaces

关于取值于局部凸空间的函数的$L^p$空间

Matthieu F. Pinaud, Humberto Prado

AI总结 研究取值于局部凸空间的Lusin可测函数,探讨非可度量化情形下的点态极限病理现象,并建立基于该可测性概念的$L^p$空间的逼近与稠密性结果。

详情
AI中文摘要

我们研究取值于局部凸空间的Lusin可测函数。特别地,我们考察Lusin可测函数序列的点态极限行为,并展示在非可度量化设定中出现的病态现象。此外,我们建立了基于这种可测性概念构造的$L^p$空间的逼近和稠密性结果,包括Hausdorff局部凸空间中简单函数的稠密性以及通过二进逼近得到的收敛结果。

英文摘要

We study Lusin-measurable functions with values in locally convex spaces. In particular, the behavior of pointwise limits of sequences of Lusin-measurable functions and exhibit pathological phenomena arising in the nonmetrizable setting. Moreover, we establish approximation and density results for $L^p$-spaces constructed with this notion of measurability, including the density of simple functions in Hausdorff locally convex spaces and convergence results obtained through dyadic approximations.

2605.30186 2026-05-29 math.FA math.LO math.SP

Spectral embedding through weak* limit of finite-dimensional approximations

通过有限维逼近的弱*极限的谱嵌入

Fabrice Nonez

AI总结 本文通过经典分析论证,无需非标准技术或超积,重新证明了谱嵌入定理:任何稠定对称算子可通过将Hilbert空间嵌入到L2空间中的乘法算子来延拓。

详情
Comments
37 pages
AI中文摘要

本文旨在研究一个过程,该过程给出了谱嵌入定理的另一种证明:任何稠定对称算子可以通过将Hilbert空间嵌入到L2空间中的乘法算子来延拓。此外,该过程旨在用于特定算子,其中可能找到自然的谱嵌入或等价关系。该过程先前已在arXiv:2411.06281和arXiv:2511.18189中通过非标准技术引入。我们的贡献在于通过经典分析论证重新表述该理论,而不使用非标准技术或超积。

英文摘要

The scope of this text is to study a process that induces another proof of the Spectral Embedding Theorem: that any densely defined symmetric operator can be extended by a multiplication operator through an embedding of the Hilbert space into an $L_2$ space. Furthermore, that process is meant to be used for specific operators, where natural spectral embeddings or equivalences may be found. That process has previously been considered in arXiv:2411.06281 and in arXiv:2511.18189, where it has been introduced through nonstandard techniques. Our contribution aims to be the reformulation of the theory through classical analysis arguments, without the use of nonstandard techniques nor ultraproducts.

2605.30181 2026-05-29 math.NA cs.NA math.OC

Generalized matrix nearness problems II

广义矩阵逼近问题 II

Rongbiao Thomas Wang, Chi-Kwong Li, Lek-Heng Lim

AI总结 研究广义矩阵逼近问题,通过引入仿射项、克罗内克积和任意正交不变范数进行扩展,提出适用于任意Schatten范数的迭代算法并证明全局收敛。

详情
Comments
23 pages, 4 figures
AI中文摘要

给定矩阵$A$,矩阵逼近问题寻求一个$X$,在$X$的各种约束下最小化$\lVert A - X\rVert$,从而最接近$A$。广义矩阵逼近问题则使用三个给定矩阵$A,B,C$和$\lVert A - BXC\rVert$代替$\lVert A - X\rVert$。我们从三个方面扩展了先前对该问题的研究:引入仿射项,以多种方式用克罗内克积替换矩阵乘积,并将Frobenius范数推广到任意正交不变范数。我们将以闭式解解决其中几个问题。对于其余问题,我们开发了一种适用于任何Schatten范数的迭代算法,并证明无论初始点如何,该算法都收敛到全局最小值。此外,该算法纯粹依赖于数值线性代数,并且特别地不计算任何显式梯度或次梯度。在此过程中,我们还将证明对于秩约束的广义矩阵逼近问题,不存在Mirsky型定理。

英文摘要

Given a matrix $A$, a matrix nearness problem seeks an $X$ that most closely approximates $A$ in the sense of minimizing $\lVert A - X\rVert$ under a variety of constraints on $X$. A generalized matrix nearness problem seeks the same but with three given matrices $A,B,C$ and $\lVert A - BXC\rVert$ in place of $\lVert A - X\rVert$. We extend previous studies of the latter problem in three directions: incorporating an affine term, replacing matrix product by Kronecker product in various manners, and generalizing Frobenius norm to any orthogonally invariant norm. We will solve several of these in closed form. For the rest, we develop an iterative algorithm that works for any Schatten norm, proving that it converges to a global minimizer regardless of the initial point. In addition, the algorithm relies purely on numerical linear algebra, and notably does not compute any explicit gradients or subgradients. Along the way, we will also show that there is no Mirsky-type theorem for rank constrained generalized matrix nearness problems.

2605.30176 2026-05-29 math-ph gr-qc math.MP

The Fermionic Signature Operator in the Reissner-Nordström Geometry in Horizon-Penetrating Coordinates

Reissner-Nordström 几何中穿透视界坐标下的费米子签名算子

Felix Finster, Christoph Krpoun

AI总结 研究在穿透视界坐标下Reissner-Nordström几何中Dirac方程至柯西视界,通过质量分解定理给出时空内积的协变表示,计算费米子签名算子和费米子通量算子的谱并证明它们是有界对称算子,构造费米子投影态并证明其满足Hadamard条件,最后给出费米子通量算子的物理解释。

详情
Comments
25 pages, LaTeX, one figure
AI中文摘要

我们研究了在穿透视界坐标下Reissner-Nordström几何中直至柯西视界的Dirac方程。证明了一个质量分解定理,该定理给出了时空内积的协变表示,自然涉及费米子签名算子和费米子通量算子。我们计算了它们的谱,并证明两者都是大质量Dirac方程解空间$\mathcal{H}_m$上的有界对称算子。构造了相应的费米子投影态,并证明其满足Hadamard条件。最后,我们给出了费米子通量算子的一些物理解释。

英文摘要

We study the Dirac equation in the Reissner-Nordström geometry in horizon-penetrating coordinates up to the Cauchy horizon. A mass decomposition theorem is proved, which gives a covariant representation of the spacetime inner product that naturally involves the fermionic signature operator and the fermionic flux operator. We compute their spectra and show that both are bounded symmetric operators on the solution space $\mathcal{H}_m$ of the massive Dirac equation. The corresponding fermionic projector state is constructed and shown to satisfy the Hadamard condition. Lastly, we give some physical interpretations of the fermionic flux operator.

2605.30163 2026-05-29 math.GR

A finitely presented group of non-uniform exponential growth

一个有限呈现的非均匀指数增长群

Roman Sauer, Eduard Schesler

AI总结 通过Thompson群V,首次构造了有限呈现群和单群中具有非均匀指数增长的例子。

详情
Comments
17 pages
AI中文摘要

我们提供了第一个有限呈现的,以及第一个简单的,具有非均匀指数增长的群的例子。该例子由Thompson群V给出。

英文摘要

We provide the first example of a finitely presented, and the first example of a simple, group of non-uniform exponential growth. The example is given by Thompson's group V.

2605.30153 2026-05-29 stat.ML cs.IT cs.LG math.IT math.ST stat.TH

Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

扩散模型在学习低维多模态分布时具有统计最优性

Jingda Wu, Changxiao Cai

AI总结 本文证明扩散模型在学习支撑在低维子空间并集上的分布时,样本复杂度仅依赖于内在维度,达到近最优的1-Wasserstein误差率,无需光滑性或有界密度假设。

详情
Comments
accepted to ICML 2026
AI中文摘要

基于分数的扩散模型在学习高维分布,特别是那些具有低维和多模态结构的分布方面,已经展现出显著的实证成功。然而,对其统计效率的理论理解仍然有限。现有理论通常依赖于强正则性假设,例如一致有界密度或全局光滑的分数函数,这些假设无法捕捉此类内在结构。在这项工作中,我们研究了扩散模型在学习支撑在低维子空间并集上的分布时的样本复杂度。假设每个子空间内的数据分布是次高斯的,我们证明扩散模型最多需要$\widetilde{O}(\varepsilon^{-k \vee 2})$个样本即可在1-Wasserstein距离上达到$\varepsilon$误差,其中$k$是内在维度。这一近最优的收敛速率仅依赖于内在维度,并显著改进了先前遭受维度灾难的理论保证。值得注意的是,我们的分析适用于广泛的分布,无需施加光滑性、有界密度或对数凹性假设。总体而言,我们的结果表明,扩散模型能够统计适应内在低维结构,同时自然容纳多模态数据,为其在复杂高维学习任务中的成功提供了严格的理论依据。

英文摘要

Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their statistical efficiency remains limited. Existing theories typically rely on strong regularity assumptions, such as uniformly bounded densities or globally smooth score functions, which fail to capture such intrinsic structures. In this work, we study the sample complexity of diffusion models for learning distributions supported on a union of low-dimensional subspaces. Assuming that the data distribution within each subspace is subgaussian, we show that diffusion models require at most $\widetilde{O}(\varepsilon^{-k \vee 2})$ samples to achieve $\varepsilon$ error in 1-Wasserstein distance, where $k$ is the intrinsic dimension. This near-optimal convergence rate depends only on the intrinsic dimension and significantly improves upon prior theoretical guarantees that suffer from the curse of dimensionality. Notably, our analysis applies to a broad collection of distributions without imposing smoothness, bounded-density, or log-concavity assumptions. Overall, our results show that diffusion models can statistically adapt to intrinsic low-dimensional structure while naturally accommodating multi-modal data, offering a rigorous theoretical justification for their success in complex high-dimensional learning tasks.