arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.08079 2026-05-11 cond-mat.stat-mech hep-th math-ph math.MP

Multiscale Structure of Eigenstate Thermalization

Pavel Orlov, Rustem Sharipov, Enej Ilievski

AI总结 本文研究孤立量子多体系统中本征态热化假说的多尺度结构,揭示了矩阵元分布不仅依赖于宏观态参数,还与采样本征态所用系综的性质有关。通过引入可调节的尺度参数来描述电荷涨落,作者在可高效数值计算的可积场论模型中,发现了矩阵元统计特性由非解析的代数指数所控制的多尺度结构,并识别出一类可显式计算矩阵元抑制率的状态,为理解热化机制提供了新的视角。

Comments 21 pages, 14 figures

详情
英文摘要

The eigenstate thermalization hypothesis provides a framework for understanding thermalization in isolated quantum many-body systems by characterizing statistical properties of local observables in energy eigenstates. Here we demonstrate that distributions of matrix elements in macroscopic systems may depend not only on the macrostate parameters, such as the densities of local conserved charges, but generally also on the properties of ensembles used in sampling eigenstates. To this end, we depart from the conventional analysis of microcanonical windows and consider statistical ensembles with an adjustable scale parameter prescribing the magnitude of charge fluctuations. We specifically consider an integrable field theory that permits efficient numerical sampling of matrix elements and reliable extrapolation to the thermodynamic limit. Moreover, in this system, we identify a class of states that enables explicit closed-form computation of the suppression rate of matrix elements. Our findings reveal an underlying multiscale structure of matrix elements captured by a non-analytic fluctuation-scale dependence of algebraic exponents governing their statistical properties.

2605.08072 2026-05-11 stat.ML cs.DS cs.LG math.ST stat.TH

A Note on Non-Negative $L_1$-Approximating Polynomials

Jane H. Lee, Anay Mehrotra, Manolis Zampetakis

AI总结 本文研究了在高斯分布下具有非负性的 $L_1$-逼近多项式的存在性,这类多项式在逼近指示函数时不仅满足 $L_1$-范数误差要求,还保证输出非负。作者证明了对于具有有限高斯表面面积(GSA)的集合类,存在次数为 $\tilde{O}(Γ^2/\varepsilon^2)$ 的非负多项式,能够以 $\varepsilon$ 的误差逼近其指示函数。该结果在保持 $L_1$-逼近能力的同时,提供了更强的点态保证,并且与当前最优的无非负性约束的高斯 $L_1$-逼近多项式次数相差仅常数因子。

详情
英文摘要

$L_1$-Approximating polynomials, i.e., polynomials that approximate indicator functions in $L_1$-norm under certain distributions, are widely used in computational learning theory. We study the existence of \textit{non-negative} $L_1$-approximating polynomials with respect to Gaussian distributions. This is a stronger requirement than $L_1$-approximation but weaker than sandwiching polynomials (which themselves have many applications). These non-negative approximating polynomials have recently found uses in smoothed learning from positive-only examples. In this short note, we prove that every class of sets with Gaussian surface area (GSA) at most $Γ$ under the standard Gaussian admits degree-$k$ non-negative polynomials that $\eps$-approximate its indicator functions in $L_1$-norm, for $k=\tilde{O}(Γ^2/\varepsilon^2)$. Equivalently, finite GSA implies $L_1$-approximation with the stronger pointwise guarantee that the approximating polynomial has range contained in $[0,\infty)$. Up to a constant-factor, this matches the degree of the best currently known Gaussian $L_1$-approximation degree bound without the non-negativity constraint.

2605.08066 2026-05-11 quant-ph cs.IT math.IT

Covert Signaling for Communication and Sensing over the Bosonic Channels

Tianrui Tan, Evan J. D. Anderson, Michael S. Bullock, Boulat A. Bash

AI总结 本文研究了在具有热噪声的玻色子信道中实现隐蔽通信与感知的稀疏信号策略。通过分析信号可检测性,作者发现了一种非直观的最优量子态结构:仅由两个连续光子数态组成的混合态。在低亮度条件下,最优信号态为真空态与单光子态的混合。该研究揭示了隐蔽性与通信、感知性能之间的权衡关系,并确定了不同优化目标之间的功率阈值。

Comments 15 pages, 4 figures, draft, comments welcome

详情
英文摘要

Preventing signal detection in communication and active sensing requires careful control of transmission power. In fact, the square-root laws (SRL) for covert classical and quantum communication and sensing prescribe that the average output power per channel use scales as $1/\sqrt{n}$ for $n$ channel uses. Two strategies for achieving this are diffuse and sparse signaling. The former transmits signals with power decaying as $1/\sqrt{n}$ on all $n$ channel uses, which is convenient for mathematical analysis. The latter transmits constant-power signals rarely, on approximately $\sqrt{n}$ out of $n$ channel uses, while remaining silent on the others. This offers significant practical advantages in compatibility with modern digital transmitters. Here, we study sparse signaling over lossy thermal-noise bosonic channels, which describe quantumly many practical channels (including optical, microwave, and radio-frequency). We characterize the input signal state that minimizes detectability. We find an unintuitive optimal quantum state structure: a mixture of just two consecutive photon-number states. In particular, in the low-brightness regime, the optimal signal state is a mixture of vacuum and a single photon. Since these states are generally suboptimal for both communication and active sensing, we explore the resulting trade-off and identify input-power thresholds for transitions between optimizing for covertness vs. performance in communication and sensing tasks.

2605.08065 2026-05-11 math-ph hep-th math.MP nlin.SI

Hamiltonian formulation of the supersymmetric KdV equation

Ali Pazarci, Nadir Ghazanfari, Ilmar Gahramanov

AI总结 本文研究了超对称Korteweg-de Vries(KdV)方程的约束哈密顿形式,发现其与经典KdV方程类似,也是一个约束系统。通过选取特定的自由参数 $a=2$,作者构建了一个非平凡的拉格朗日描述,并利用Dirac-Bergmann算法确定了全部的初等和次级约束,构造了系统的总哈密顿量。研究还揭示了哈密顿密度中存在非局域贡献,突显了该超对称扩展的独特性质,并最终给出了哈密顿方程的分量形式及其在超空间中的紧凑表达。

Comments 7 pages

详情
英文摘要

We studied the constrained Hamiltonian formulation of a supersymmetric Korteweg-de Vries (KdV) equation, which is observed to be a constrained system similar to its classical version. We found a nontrivial Lagrangian description, where we select $a=2$ for the free parameter $a$ in the supersymmetric extension. The corresponding degenerate Lagrangian requires an exclusive consideration and the utilization of the Dirac-Bergmann algorithm. We explicitly determined the full set of primary and secondary constraints and constructed the total Hamiltonian governing the dynamics of the system. In this analysis, in addition to a nontrivial constraint involving the fermionic fields, the consistency conditions give rise to a nonlocal contribution to the Hamiltonian density. This highlights a distinctive feature of this supersymmetric extension. We showed that the resulting Hamilton equations of motion reproduce the supersymmetric KdV system in the component form. Finally, we derived a compact superspace representation of the Hamiltonian and demonstrated its consistency with the component-level formulation.

2605.08062 2026-05-11 math.AG

Finite order symplectic birational self-maps on Kummer-type manifolds

Yajnaseni Dutta, Dominique Mattei, Stevell Muller, Howard Nuer

AI总结 本文研究了Kummer型超凯勒流形上的有限阶辛双有理自映射问题。作者证明了,除了某些Picard秩为3的例外情况外,任何具有非平凡第二同调群作用的有限阶辛双有理自映射的投影Kummer型流形都是扭曲模形式的。文中还给出了这些例外情况在Néron-Severi格上的完整刻画,并进一步分析了模形式Kummer型流形上的辛双有理自映射,确定了哪些Mukai向量可以对应有限阶的辛双有理自映射。

详情
英文摘要

A projective hyperkähler manifold of Kummer-type is said to be twisted modular if it is birational to the Albanese fiber of a moduli space of twisted sheaves on an abelian surface. We prove that, with the exception of certain cases of Picard rank 3, any projective Kummer-type manifold admitting a finite-order symplectic birational self-map that acts nontrivially on its second cohomology group is twisted modular. We provide a complete characterization of these exceptions in terms of their Néron-Severi lattices. We then investigate symplectic birational self-maps of modular Kummer-type manifolds, determining exactly which Mukai vectors allow the birational transformation induced by crossing the vertical wall, which acts on cohomology as a reflection, to correspond to a finite-order symplectic birational self-map. Additionally, we prove in an appendix several results concerning moduli spaces of twisted sheaves on abelian surfaces which were not readily available in the literature.

2605.08052 2026-05-11 math.PR math-ph math.MP

Rapid phase ordering of Ising dynamics on $\mathbb Z^2$

Reza Gheissari, Allan Sly

AI总结 本文研究了二维整数晶格上低温伊辛动力学的相序问题,考虑从偏向加态的无序初始状态出发的动力学行为。作者证明,在二维伊辛模型的低温区域,若初始自旋以足够高的概率为+1,则动力学过程会快速收敛到全加态的平稳分布。该结论通过一种适用于任意维数的时空多尺度耦合方法得到,该方法将带加边界条件的伊辛动力学的混合时间的准多项式界提升为无边界条件下从偏向初始态的快速相序行为。

Comments 56 pages, 3 figures

详情
英文摘要

We consider the phase ordering problem for the low-temperature Ising dynamics initialized from a biased and disordered initialization. Work of Fontes, Schonmann, Sidoravicius (2002) showed that at zero-temperature, Ising Glauber dynamics on $\mathbb Z^d$ for $d\ge 2$ initialized from i.i.d. spins on each vertex that are $+1$ with sufficiently large probability, absorbs into the all-plus configuration quickly. We prove that analogous behavior holds throughout the low-temperature regime of the Ising model in two dimensions. Namely, there exists $p_0 <1$ such that Ising Glauber dynamics initialized from i.i.d. spins that are $+1$ with probability $p>p_0$, run at any low temperature $β>β_c$ converges rapidly to the plus phase measure $π^+$. The result is proved using a spacetime multiscale coupling valid in any $d\ge 2$, that boosts a uniform-in-$β$ quasi-polynomial bound on the mixing time of Ising dynamics with plus boundary conditions, into rapid phase ordering from biased initializations with no boundary conditions.

2605.08038 2026-05-11 math.NA cs.NA

Invariant domain preserving limiting of time explicit and time implicit discretizations for systems of conservation laws

Bartolomeo Fanizza, Florent Renac

AI总结 本文研究了一种用于非线性双曲守恒律系统高阶数值解的限制技术,旨在保持不变域的性质。该方法适用于各种空间守恒格式以及显式和隐式时间积分方案,通过将高阶解限制在已知保持不变域的低阶解附近,从而确保数值解的物理合理性。该方法推广了通量校正传输限制器,并基于凸限制框架定义限制系数,同时提供了有限体积和不连续伽辽金方法在显式和隐式时间离散中的应用实例与数值实验结果。

详情
英文摘要

This work concerns the design and analysis of a limiting technique that allows the preservation of invariant domains for high-order numerical approximations of nonlinear hyperbolic systems of conservation laws. The method can be applied to any conservative discretization method in space as well as to a wide range of explicit and implicit time integration schemes. The method limits the high-order solution around a low-order accurate solution that is known to preserve all the invariant domains. It generalizes the flux-corrected transport limiter [J. P. Boris and D. L. Book, J. Comput. Phys., 11, 1973; S. T. Zalesak, J. Comput. Phys., 31, 1979] to systems of conservation laws and relies on the limitation of antidiffusive fluxes, but defines the limiting coefficients so as to express the limited solution as a convex combination of invariant domain preserving quantities similarly to the convex limiting framework [Guermond et al., Comput. Methods Appl. Mech. Engrg., 347, 2019]. We give details on the derivation of this limiting technique and provide some illustration with finite volume or discontinuous Galerkin (DG) space discretizations associated to explicit or implicit Runge-Kutta methods as well as to time DG integrations. The limiter is applied iteratively to refine the limited solution around the high-order one, while preserving the invariant domains, and a heuristic is proposed to accelerate its convergence. Numerical experiments solving one- and two-dimensional problems involving scalar hyperbolic equations and the compressible Euler equations are presented to illustrate the properties of these schemes.

2605.08033 2026-05-11 math.CO

Weak Order on the MacNeille Completion of Bruhat Order

Colin Defant

AI总结 本文研究了Coxeter群Bruhat序的MacNeille完备上的弱序结构,引入了0-Hecke单oid的作用,从而定义了弱序和降序集统计量。当群为类型A时,该方法重现了Hamaker和Reiner基于单调三角形和交替符号矩阵的构造,并用于证明关于Knutson-Miller子字复形和Cohen-Macaulay ASM簇的猜想,同时给出了一个反例。此外,作者还引入了MacNeille pop-stack算子,并确定了其达到底状态所需的最大迭代次数。文章还探讨了使用大型语言模型辅助数学研究的案例。

Comments 14 pages, 1 figure, 2 tables

详情
英文摘要

Let $\mathrm{Mac}(W)$ be the MacNeille completion of the Bruhat order of a Coxeter group $W$. We introduce an action of the $0$-Hecke monoid of type $W$ on $\mathrm{Mac}(W)$, which allows us to define a weak order and a descent set statistic on $\mathrm{Mac}(W)$. When $W$ is of type $A$, we recover constructions of Hamaker and Reiner, which were originally formulated in terms of monotone triangles and alternating sign matrices. Using this action, we prove that certain unions of Knutson--Miller subword complexes are vertex-decomposable. By specializing to type $A$, we prove a conjecture of Escobar, Klein, and Weigandt regarding Cohen--Macaulay ASM varieties. Along the way, we also exhibit a counterexample to a conjecture of Hamaker and Reiner regarding the poset topology of intervals in the ASM weak order. Finally, when $W$ is finite and irreducible, we use our $0$-Hecke action to introduce a noninvertible dynamical system on $\mathrm{Mac}(W)$ that we call the MacNeille pop-stack operator, and we prove that the maximum number of iterations of this operator needed to reach the bottom state is $h-1$, where $h$ is the Coxeter number of $W$. This article is meant to serve as a case study in using large language models to automate the workflow of mathematical research. The proof of the conjecture of Escobar--Klein--Weigandt and the disproof of the conjecture of Hamaker--Reiner were obtained autonomously by ChatGPT 5.4 Pro. Other aspects of the paper were obtained mostly by the author, but ChatGPT expedited the process. We provide a detailed account of this interaction, and we speculate on what allowed the model to be successful.

2605.08026 2026-05-11 math.OC

Approximate directional stationarity and associated qualification conditions

Isabella Käming, Patrick Mehlitz

AI总结 本文研究了近似方向平稳性及其相关的资格条件,旨在将近似平稳性和方向平稳性相结合,提出一种新的优化必要性条件——近似方向平稳性。该条件适用于带有非光滑几何约束的优化问题,能够更精确地刻画局部最优解的性质。此外,作者还提出了一种基于近似方向平稳点的资格条件,通过验证一个特定序列和简单的Mangasarian–Fromovitz型条件,可推导出方向平稳性,从而为优化理论提供了新的分析工具。

详情
英文摘要

Approximate stationarity conditions provide necessary optimality conditions without requiring additional assumptions by demanding that a perturbed stationarity system possesses solutions as the involved perturbations tend to zero. Together with associated approximate constraint qualifications, which are typically rather mild, they raised much interest in the optimization community during the last decade. In parallel, directional stationarity conditions became quite popular as they sharpen standard stationarity conditions by incorporating data associated with underlying critical directions. The purpose of this paper is twofold. First, we melt the aforementioned concepts of approximate and directional stationarity to formulate and study so-called approximate directional stationarity. For the underlying model problem, an optimization problem with nonsmooth geometric constraints is chosen, which covers diverse practically relevant applications. The role of approximate directional stationarity as a necessary optimality condition is investigated in much detail, complementing results from the literature. Second, we formulate a qualification condition which, based on an approximately directionally stationary point, can be exploited to infer its directional stationarity. The latter condition depends on one particular sequence verifying approximate directional stationarity and merely requires to check a simple condition of Mangasarian--Fromovitz type stated in terms of the directional tools of limiting variational analysis. This contrasts standard approximate constraint qualifications that typically demand a certain stable behavior of all sequences validating approximate stationarity. Throughout, various approaches to verify directional stationarity of local minimizers are established, and illustrative examples are presented to make the theoretical results more accessible.

2605.08023 2026-05-11 math.DG math.AG math.CV

Asymptotics of small eigenvalues on degenerations of Kähler manifolds

Junyu Cao

AI总结 本文研究了紧凯勒流形在单参数退化情形下拉普拉斯算子小特征值的渐近行为,给出了其精确的渐近速率。通过结合Li的统一Skoda不等式与辅助Monge-Ampère方程方法,推广了Dai和Yoshikawa在高维空间中的结果。作为应用,文章还得到了具有可约奇纤维的凯勒流形退化情形下的估计结果。

Comments 37 pages; comments are welcome!

详情
英文摘要

We derive the exact asymptotic rates of the small eigenvalues of the Laplacian on one-parameter degenerations of compact Kähler manifolds equipped with induced background metrics. This generalizes a recent result of Dai and Yoshikawa to higher dimensions. To achieve this, we combine Li's uniform Skoda inequality with the method of auxiliary Monge-Ampère equations, introduced by Guo--Phong--Song--Sturm--Tong and adapted by Guedj--Tô. As an application, we establish estimates for degenerations of compact Kähler manifolds with reducible singular fibers.

2605.08014 2026-05-11 q-bio.NC math.DS

Dynamical mechanisms of flexible phase-locking in cortical theta oscillators

Yangyang Wang, Benjamin R. Pittman-Polletta

AI总结 本研究探讨了大脑皮层θ振荡器如何实现对不同时间尺度输入信号的灵活相位锁定机制。通过动力系统理论分析,研究发现多时间尺度的内在抑制电流相互作用,能够产生延迟Hopf分岔现象,从而显著扩展振荡器的同步频率范围。实验表明,θ频段和δ频段的抑制电流协同作用,形成了三时间尺度结构,使皮层振荡器在外部输入下具备更强的相位锁定能力,为语音分割等认知功能提供了潜在的神经机制基础。

Comments 30 pages, 15 figures

详情
英文摘要

Oscillatory activity in auditory cortex is thought to play a central role in auditory and speech processing by synchronizing neural rhythms to external acoustic features of the speech stream. To support this function, cortical oscillators must flexibly phase-lock to inputs spanning a wide range of timescales, including rhythms substantially slower than their intrinsic frequency. Here we identify a general dynamical mechanism by which intrinsic inhibitory currents operating on multiple timescales enable such flexible phase-locking. Using tools from dynamical systems theory, we show that interactions between slow and superslow inhibitory processes generate prolonged post-input recovery delays through delayed Hopf phenomena, thereby substantially expanding the frequency range over which entrainment can occur. We demonstrate this mechanisms in a biophysically grounded cortical theta oscillator model for speech segmentation. Specifically, we show that both a theta-timescale (4-8 Hz) inhibitory current $I_m$ and a slower delta-timescale (1-4 Hz) inhibitory potassium current $I_{\rm K_{SS}}$ are crucial for entrainment flexibility. Their interaction creates a three-timescale structure that gives rise to pronounced delay phenomena associated with a delayed Hopf bifurcation (DHB). Interestingly, the superslow $I_{\rm K_{SS}}$ and the associated DHB play little role in the unforced oscillatory dynamics, but are recruited to support phase locking under external forcing. Moreover, the intermediate-timescale current $I_m$, rather than being redundant, further expands the phase-locking range by prolonging delayed recovery along the superslow manifold. Together, these results suggest that coordination among intrinsic inhibitory currents operating on multiple timescales may represent a key mechanism supporting flexible phase locking to rhythmic inputs in the brain.

2605.08006 2026-05-11 math.OC cs.LG stat.ML

Penalty-Based First-Order Methods for Bilevel Optimization with Minimax and Constrained Lower-Level Problems

Yiyang Shen, Yutian He, Weiran Wang, Qihang Lin

AI总结 本文研究了一类具有上下层均为极小极大结构的双层优化问题,这类问题在许多新兴应用中具有广泛代表性。为了解决现有方法在处理下层为极小极大问题时的不足,作者提出了一种基于惩罚函数的一阶优化方法,无需假设下层问题强凸,即可高效求解。在确定性设置下,该方法能够以 $\tilde{O}(ε^{-4})$ 的计算复杂度找到 $ε$-KKT 点,并在随机设置下也给出了相应的复杂度分析,显著优于现有结果。

详情
英文摘要

We study a class of bilevel optimization problems in which both the upper- and lower-level problems have minimax structures. This setting captures a broad range of emerging applications. Despite the extensive literature on bilevel optimization and minimax optimization separately, existing methods mainly focus on bilevel optimization with lower-level minimization problems, often under strong convexity assumptions, and are not directly applicable to the minimax lower-level setting considered here. To address this gap, we develop penalty-based first-order methods for bilevel minimax optimization without requiring strong convexity of the lower-level problem. In the deterministic setting, we establish that the proposed method finds an $ε$-KKT point with $\tilde{O}(ε^{-4})$ oracle complexity. We further show that bilevel problems with convex constrained lower-level minimization can be reformulated as special cases of our framework via Lagrangian duality, leading to an $\tilde{O}(ε^{-4})$ complexity bound that improves upon the existing $\tilde{O}(ε^{-7})$ result. Finally, we extend our approach to the stochastic setting, where only stochastic gradient oracles are available, and prove that the proposed stochastic method finds a nearly $ε$-KKT point with $\tilde{O}(ε^{-9})$ oracle complexity.

2605.08004 2026-05-11 math.OA math-ph math.MP

Functoriality of the KSGNS Construction for Intertwiners of Strict Positive $C^*$-Correspondences

Lucus Brady, Ryan Grady

AI总结 本文研究了严格正 $C^*$-对应在固定 $C^*$-代数上的KSGNS构造的函子性质,证明该构造可以视为一个范畴上的内函子,其中对象为正 $C^*$-对应,态射由考虑固定代数自同构的纠缠算子给出。通过这一视角,文章给出了严格正等变 $C^*$-对应在动力系统中的函子性描述,并证明每个此类对应在KSGNS构造下可唯一单位地扩张为等变 $C^*$-对应。

Comments 57 pages, comments welcome

详情
英文摘要

We prove that the KSGNS construction can be viewed as an endofunctor on a category whose objects are positive $C^*$-correspondences from a fixed $C^*$-algebra and morphisms are given by intertwiners which account for automorphisms of the fixed $C^*$-algebra. Using this perspective, we provide a functorial perspective for strict positive equivariant $C^*$-correspondences of $C^*$-dynamical systems and show every strict positive equivariant $C^*$-correspondence of $C^*$-dynamical systems unitarily uniquely dilates under the KSGNS construction to an equivariant $C^*$-correspondence of the dynamical systems.

2605.08002 2026-05-11 stat.ME math.ST stat.TH

Cellwise and Casewise Robust Multivariate Regression with Inference

Fabio Centofanti, Mia Hubert, Peter J. Rousseeuw

AI总结 本文研究了在存在案例型和单元型异常值、缺失数据及高维特征情况下的多元线性回归问题,提出了一个鲁棒的多元回归估计方法——单元多元回归(cellMR),该方法结合了单元鲁棒协方差估计和岭正则化,能够同时处理多种数据污染问题。此外,作者还提出了一种基于自助法的推断方法cellBoot,能够在存在异常值的情况下提供渐近有效的置信区间,并通过模拟和基因组实际应用验证了方法的有效性。

详情
英文摘要

Multivariate linear regression is a fundamental statistical task, but classical estimators such as ordinary least squares are highly sensitive to outliers. These may occur as casewise outliers that affect entire observations, or as outlying cells, that are individual contaminated entries in the predictor and/or response matrix. Moreover, modern datasets frequently contain missing values and are high-dimensional. To address these challenges we propose the cellwise multivariate regression (cellMR) estimator, a robust regression method that simultaneously accommodates casewise and cellwise outliers, missing data, and high dimensionality. The approach builds on a cellwise robust covariance estimator and uses ridge regularization for numerical stability. We further introduce cellBoot, a novel bootstrap-based inference procedure tailored to the cellMR framework. Relying on indirect inference, cellBoot provides asymptotically valid confidence intervals that are robust to casewise and cellwise contamination. We derive influence functions of the regression estimator and prove the asymptotic validity of the cellBoot confidence intervals. Simulations and a real genomics application illustrate the strong finite-sample performance of the proposed methods.

2605.08001 2026-05-11 math.ST stat.ME stat.TH

Scale selection for geometric medians on product manifolds

Kisung You

AI总结 本文研究了在乘积流形上几何中位数的尺度选择问题,指出直接联合优化位置和尺度会导致尺度退化到边界,从而使问题退化为边缘中位数,丢失一个因子的信息。为此,作者提出了三种改进方法,分别从敏感性路径、鲁棒尺度校准和平衡方程等角度出发,确保尺度估计的稳定性、一致性及单位不变性,并通过仿真验证了方法在欧几里得和Bures-Wasserstein空间中的有效性。

详情
英文摘要

Geometric medians on product manifolds are sensitive to the relative scaling of factor metrics because the median objective couples the factors rather than separating them. We study this scale-selection problem and first prove that naive joint minimization over location and scale is degenerate: the scale is driven to the boundary and the problem collapses to a marginal median, effectively discarding one factor. Thus relative scale is not identifiable from the raw median loss alone. We develop three alternatives to mitigate this issue. The first treats scale as indexing a sensitivity path and establishes uniform consistency, a functional central limit theorem, and a derivative-based sensitivity measure. The second constructs a robust scale-calibrated median using marginal radial median scales, yielding unit invariance, consistency, a two-step central limit theorem, and bounded influence. The third introduces a bounded balance equation for direct scale estimation, with monotonicity, uniqueness, joint asymptotic normality, and bounded influence. Simulations illustrate boundary collapse, sensitivity, unit invariance, and balanced estimation in Euclidean and Bures-Wasserstein settings.

2605.07994 2026-05-11 cs.IT math.IT

Semantic Smoothing for Language Models via Distribution Estimation and Embeddings

Haricharan Balasundaram, Swathi Shree Narashiman, Pranay Mathur, Andrew Thangaraj

AI总结 本文提出了一种基于语义平滑的语言模型平滑方法,通过嵌入表示在语义相似的上下文中共享统计信息。该方法从对数困惑度的分解出发,将平滑问题转化为在Kullback-Leibler损失下的分布估计问题,并利用嵌入空间中上下文的接近性来推导出词分布的KL散度接近性。实验表明,该方法在多种嵌入模型和语言模型上有效降低了测试困惑度,具有较好的泛化能力。

详情
英文摘要

We propose semantic smoothing, a smoothing method for language models that uses embeddings to share statistical observations across semantically similar contexts. The starting point is a decomposition of log-perplexity that motivates smoothing as a collection of distribution-estimation problems under Kullback-Leibler (KL) loss. We then show that, under a Lipschitz-logit model for embedding-based language generation, proximity of context embeddings implies proximity of the corresponding next-word distributions in KL divergence. Combining these observations, we formulate semantic smoothing as distribution estimation in KL loss with KL-proximity side information. For $n$ samples on a $d$-symbol alphabet with a side-information distribution at KL distance $Δ$, we give an interpolation estimator with worst-case KL risk $O(\min\{Δ,d/n\})$, and prove a matching-order lower bound for uniform side information. We extend the estimator to multiple and empirically estimated synonymous distributions. Experiments on synthetic Markov data and WikiText-103 bigram models using Word2Vec, GloVe, and GPT-2 embeddings show that semantic smoothing consistently reduces test perplexity when applied to add-constant and Kneser-Ney estimates.

2605.07991 2026-05-11 math.AG

On Bands and Limit Theorems in Tropical Geometry

Arne Kuhrs, Alejandro Martínez Méndez, Pedro Souza

AI总结 本文综述了Baker-Jin-Lorscheid提出的带(bands)与带概形(band schemes)的基本理论,这是一个用于热带化、分析化及$\mathbb{F}_1$-几何的代数框架。对于非阿基米德赋值域上的仿射概形$X$,文章证明$X$可以视为其在带概形范畴中所有仿射嵌入所对应带概形的极限,从而在概形理论层面推广了Payne的极限定理。该结果从带概形的角度重新诠释了Payne关于仿射热带化的定理,并在实热带情形中得到了类似结论。

Comments 13 pages

详情
英文摘要

We review the basic theory of bands and band schemes introduced by Baker-Jin-Lorscheid, which is an algebraic framework for tropicalization, analytification, and $\mathbb{F}_1$-geometry. For an affine scheme $X$ over a non-Archimedean valued field $k$, one can associate to every affine embedding $ι$ of $X$ a naturally defined affine band scheme $Y_ι$ whose rational points over the tropical band $\mathbb{T}$ recover the tropicalization $Trop(X,ι)$. We prove that $X$ is the limit of the $Y_ι$ in the category of band schemes, thereby obtaining a scheme-theoretic enhancement of Payne's limit theorem. By taking $\mathbb{T}$-rational points, this recovers Payne's theorem for affine tropicalizations from the perspective of band scheme theory and the same method provides an analogous result in the real tropical setting.

2605.07980 2026-05-11 cs.LG cond-mat.stat-mech math.ST stat.TH

Susceptibilities and Patterning: A Primer on Linear Response in Bayesian Learning

Chris Elliott, Daniel Murfet

AI总结 本文介绍了在神经网络解释中发展的易感性理论,用于分析贝叶斯学习中的线性响应。易感性定义为可观测量对数据扰动的后验期望导数,根据涨落-耗散定理等价于后验协方差。通过不同可观测量的选择,可得到不同对象,如样本损失对应影响矩阵,局部组件可观测量对应结构易感性矩阵,该矩阵与数据模式和模型组件的映射有关,并可用于寻找实现特定结构变化的数据扰动。文章从统计力学基础出发,详细阐述了易感性及其估计方法与损失景观几何的关系。

Comments 34 pages, 3 figures, comments welcome!

详情
英文摘要

These notes introduce the theory of susceptibilities as developed in [arXiv:2504.18274, arXiv:2601.12703] for interpreting neural networks. The susceptibility of an observable $ϕ$ to a data perturbation is defined as a derivative of a posterior expectation, which by the fluctuation--dissipation theorem equals a posterior covariance. Different choices of $ϕ$ yield different objects: per-sample losses give the influence matrix (the Bayesian influence function of [arXiv:2509.26544]), while component-localized observables give the structural susceptibility matrix that pairs model components with data patterns. The susceptibility matrix is (up to a factor of $nβ$) the Jacobian of the map from data distributions to structural coordinates; its pseudo-inverse provides a linearized solution to the patterning problem of [arXiv:2601.13548]: finding data perturbations that produce a desired structural change. We motivate the theory from its statistical-mechanical foundations, then give a detailed exposition of susceptibilities, their empirical estimators, and their connection to the geometry of the loss landscape.

2605.07974 2026-05-11 math.AG math.AC

Tensor product surfaces and graded syzygies

Matthew Weaver

AI总结 本文研究由双齐次多项式基定义的有理映射所对应的张量积曲面在三维射影空间中的隐式方程问题。作者通过扩展前人关于分级系综的研究,提出了一种在理想具有单分级系综的情况下求解隐式方程的方法,为几何建模中广泛存在的此类曲面提供了更高效的计算工具。

Comments 24 pages. Comments welcome

详情
英文摘要

Let $U\subseteq H^0(\mathcal{O}_{\mathbb{P}^1\times \mathbb{P}^1}(a,b))$ be a four-dimensional vector space and consider the rational map $ϕ_U:\,\mathbb{P}^1\times \mathbb{P}^1 \dashrightarrow \mathbb{P}^3$ defined by its basis of bihomogeneous polynomials. The tensor product surface $X_U\subseteq \mathbb{P}^3$ is the closed image of $ϕ_U$, and a fundamental problem in this setting is to determine its implicit equation. As these surfaces are ubiquitous within the field of geometric modeling and design, knowledge of their implicit equations is particularly advantageous, allowing for more effective and efficient computations. In this article, we expand upon work of Duarte-Schenck and work of the present author to solve this implicitization problem when the bigraded ideal $I_U$ admits a singly graded syzygy.

2605.07970 2026-05-11 math.ST cs.LG stat.TH

Linear Response Estimators for Singular Statistical Models

Chris Elliott, Daniel Murfet

AI总结 本文研究了一类统计模型在数据扰动下可观测量的响应特性,定义了用于衡量这种响应的“易感度”指标。作者提出了一种针对这些易感度的估计方法,并证明了在数据量趋于无穷大时,这些估计量具有一致性和渐近无偏性。该研究为理解复杂统计模型对数据变化的敏感性提供了理论基础和实用工具。

Comments 24 pages, comments welcome!

详情
英文摘要

We define susceptibilities as a measure of the response of an observable quantity of a parameterized statistical model to a perturbation of the data for a general class of observables. We define estimators for these susceptibilities as statistics in a sequence of n data-points and prove that these estimators are consistent and asymptotically unbiased in the large n regime.

2605.07969 2026-05-11 cs.LG cs.IT math.IT

When Diffusion Model Can Ignore Dimension: An Entropy-Based Theory

Ahmad Aghapour, Erhan Bayraktar

AI总结 本文从信息论角度研究扩散模型在高维数据中的收敛性问题,提出了一种基于香农熵的理论分析框架。研究发现,对于高斯混合目标分布,离散化误差主要由潜在混合成分的熵控制,而非环境维度。该结果表明,当数据分布具有紧凑的潜在表示时,扩散采样在高维空间中仍能保持高效,为理解扩散模型的高效性提供了新的理论依据。

详情
英文摘要

Diffusion models perform remarkably well on high-dimensional data such as images, often using only a modest number of reverse-time steps. Despite this practical success, existing convergence theory does not fully explain why such samplers remain efficient in high dimensions. Many prior KL guarantees bound the discretization error in terms of the ambient dimension, while other improved results replace this dependence using intrinsic-dimensional or geometric structure assumptions. In this work, we develop an alternative information-theoretic perspective on diffusion sampler convergence. We prove that, for Gaussian mixture targets, the discretization error is controlled by the Shannon entropy of the latent mixture component rather than by the ambient dimension. Consequently, the leading step complexity scales linearly with latent entropy and depends only logarithmically on the second moment of the data. Our analysis also extends to discrete target distributions, where the relevant complexity is the entropy of the target rather than the dimension of the embedding space. These results suggest that diffusion sampling can remain efficient in high-dimensional spaces when the data distribution admits a compact latent representation, as is widely believed to be the case for natural images.

2605.07967 2026-05-11 math.ST stat.TH

Density Estimation Using the Sinc Kernel

Ingrid Kristine Glad, Nils Lid Hjort, Nikolai G. Ushakov

AI总结 本文研究了一种基于sinc核(或傅里叶积分核)的密度估计方法,该核函数为 $K(x)=(πx)^{-1}\sin x$。通过详细分析该估计器的渐近性质和有限样本性质,研究发现与普遍看法相反,sinc核密度估计器在多个方面优于其他估计器,包括样本量适中时的精度更高、在非光滑密度情况下的渐近性能更优,以及带宽选择更为方便等。

Comments 20 pages, no figures. Preprint, Department of Mathematical Statistics, Norwegian University of Science and Technology, Trondheim, no. 2, 2007; arXiv'd for broader visibility and for direct use in a forthcoming paper

详情
英文摘要

This paper deals with the kernel density estimator based on the so-called sinc (or Fourier integral) kernel $K(x)=(πx)^{-1}\sin x$. We study in detail both asymptotic and finite sample properties of this estimator. It is shown that, contrary to widespread opinion, the sinc estimator is superior to other estimators in many respects: it is more accurate for quite moderate values of the sample size, has better asymptotics in non-smooth case (the density to be estimated has only first derivative), is more convenient for the bandwidth selection, etc.

2605.07959 2026-05-11 cs.LG math.FA math.PR

Convergent Stochastic Training of Attention and Understanding LoRA

Zhengkai Sun, Dibyakanti Kumar, Alejandro F Frangi, Anirbit Mukherjee, Mingfei Sun

AI总结 本文研究了在注意力机制和浅层神经网络中使用低秩适配(LoRA)方法时,如何通过随机训练方法实现模型的可训练性。作者提出一个统一的理论框架,证明在轻微正则化条件下,注意力层和LoRA参数化的回归损失满足Poincaré不等式,从而保证了随机梯度下降的收敛性。该研究首次在无需假设数据分布或网络规模的情况下,严格建立了注意力模型和LoRA结构的可训练性,为大模型的高效训练提供了理论支撑。

详情
英文摘要

Transformers have revolutionized machine learning and deploying attention layers in the model is increasingly standard across a myriad of applications. Further, for large models, it is common to implement Low Rank Adaptation (LoRA), whereby a factorized parameterization of them is trained, to achieve a surprisingly beneficial accuracy-size trade-off. In this work, via a unified framework we rigorously establish trainability of such models under stochastic methods. We prove that for any mild regularization, the empirical regression loss on a attention layer and LoRA on a shallow neural net, both induce Poincaré inequality for the corresponding Gibbs' measure. Then it follows via invoking recent results that a certain SDE, which mimics the SGD, minimizes the corresponding losses. In both the cases, our first-of-its-kind results of trainability on attention and nets, do not rely on any assumptions on the data or the size of the architecture.

2605.07956 2026-05-11 math.AG math.AC

Adjoint test modules along Cohen--Macaulay morphisms

Javier Carvajal-Rojas, Axel Stäbler

AI总结 本文研究了在具有$F$-rational几何纤维的Cohen-Macaulay映射下,伴随测试模的变换规则。作者给出了一个有效的转换公式,推广了Enescu关于$F$-rational性质在局部映射下上升的定理。该成果为研究代数几何中奇异性的不变量提供了新的工具。

Comments 8 pages, comments are welcome

详情
英文摘要

We provide a transformation rule for adjoint test modules along Cohen--Macaulay maps between Cohen--Macaulay varieties that have $F$-rational geometric fibers. This is, in part, an effective version of Enescu's theorem on the ascent of $F$-rationality under local maps with $F$-rational geometric fibers.

2605.07953 2026-05-11 math.AP

Noise-Driven Free Boundaries In The Compressible Navier-Stokes Equations

Gianmarco Del Sarto, Matthias Hieber, Tarek Zöchling

AI总结 本文研究了三维可压缩正压Navier-Stokes方程的随机自由边界问题,其中自由边界由Stratonovich随机流驱动,噪声通过运动学边界条件影响区域的演化。通过引入由速度和输运向量场生成的随机拉格朗日映射,将问题转换为新的坐标系,并结合随机极大正则性、确定性L^p-L^q估计和局部收缩论证,证明了在几乎必然的正停时内解的局部路径唯一性与严格正密度的存在性。

详情
英文摘要

A stochastic free-boundary problem for the three-dimensional barotropic compressible Navier--Stokes equations is studied. The main feature of the model is that the free boundary is transported by a Stratonovich stochastic flow, so that the noise enters the kinematic boundary condition and hence the evolution of the moving domain. An additional Itô forcing in the momentum equation is also allowed. The problem is transformed by a stochastic Lagrangian map generated by the velocity and the transport vector fields. In these coordinates the density is represented through the Jacobian of the flow, and the remaining system is solved by combining stochastic maximal regularity, deterministic %\rL^p%-%\rL^q$ estimates, and a localized contraction argument. Local pathwise well-posedness is obtained up to an a.s. positive stopping time, with strictly positive density and pathwise uniqueness.

2605.07947 2026-05-11 cs.CE cs.AI cs.LG math.OC

Exploring the non-convexity in machine learning using quantum-inspired optimization

Kandula Eswara Sai Kumar, Parth Dhananjay Danve, Abhishek Chopra, Rut Lineswala

AI总结 本文研究了现代机器学习中高维非凸优化问题的求解挑战,尤其是存在严重异常值时的结构恢复问题。为此,作者提出了一种基于量子启发进化优化(QIEO)的统一框架,通过量子叠加的概率表示保持全局搜索视角,有效克服传统梯度下降和贪心算法易陷入局部最优的缺陷。实验表明,QIEO在稀疏信号恢复和鲁棒线性回归等任务中,相比现有先进算法具有更高的结构保真度、更低的均方误差和更强的鲁棒性。

详情
英文摘要

The escalating complexity of modern machine learning necessitates solving challenging non-convex optimization problems, particularly in high-dimensional regimes and scenarios contaminated by gross outliers. Traditional approaches, relying on convex relaxations or specialized local search heuristics, frequently succumb to suboptimal local minima and fail to recover the true underlying discrete structures. In this paper, we propose treating these non-convex challenges as a global search problem and introduce a unified framework based on Quantum-Inspired Evolutionary Optimization (QIEO). By leveraging a probabilistic representation inspired by quantum superposition, QIEO maintains a global view of the search space, enabling it to tunnel through local optima that trap conventional gradient-based and greedy solvers. We comprehensively evaluate QIEO across diverse non-convex applications, including sparse signal recovery (gene expression analysis and compressed sensing) and robust linear regression. Extensive benchmarking against state-of-the-art continuous solvers (ADAM, Differential Evolution), classical metaheuristics (Genetic Algorithms), and specialized non-convex algorithms (Iterative Hard Thresholding) demonstrates that QIEO consistently achieves superior structural fidelity, lower mean squared error, and enhanced robustness without support inflation. Our findings suggest that embracing a quantum-inspired global search provides a resilient, unified paradigm for overcoming the inherent intractability of discrete nonconvex machine learning landscapes.

2605.07944 2026-05-11 math.AG math.CV math.DS

Homogeneous pre-foliations of co-degree one and degree four on the projective plane

Carla Pracias, Maycol Falla Luza

AI总结 本文研究复射影平面上共次数为1、次数为4的齐性预叶的分类问题,重点分析其Legendre变换所定义的4维网为平坦的情形。通过区分底层次数为3的齐性叶的切丛次数分别为2、3和4的不同情况,给出了在射影自同构下所有此类预叶的分类结果。研究结合了Bedrouni的曲率-全纯性准则、显式正规形式和符号计算,最终得到由高斯映射分枝数据参数化的有限个显式微分1形式列表。

详情
英文摘要

We classify, up to projective automorphism, all homogeneous pre-foliations of co-degree one and degree four on the complex projective plane $\Ptwo$ whose Legendre transform defines a flat $4$-web. The classification is organized according to the type of the underlying homogeneous foliation $\Hcal$ of degree~$3$, distinguishing the cases $°(\Tcal_{\Hcal})=2$, $3$, and~$4$. The case $°(\Tcal_{\Hcal})=2$ was treated by Bedrouni, while the cases $°(\Tcal_{\Hcal})=3$ and $°(\Tcal_{\Hcal})=4$ are completed here. The proof combines Bedrouni's curvature-holomorphy criteria with explicit normal forms and symbolic computation; the result yields a finite list of explicit one-forms, parametrised by the ramification data of the Gauss map of~$\Hcal$.

2605.07939 2026-05-11 math.ST cs.NA math.NA stat.TH

Accelerating Langevin Monte Carlo via Efficient Stochastic Runge--Kutta Methods beyond Log-Concavity

Bin Yang, Xiaojie Wang

AI总结 本文研究了如何通过高效的随机Runge-Kutta方法加速高维概率分布采样中的朗之万蒙特卡洛(LMC)算法。提出了一种基于强阶为1.5的随机Runge-Kutta方法的高阶、无需Hessian矩阵的LMC算法,相比现有方法每迭代仅需两次梯度计算,计算效率更高。在非对数凹条件下的非渐近误差界分析表明,该算法具有与现有工作相同量级的收敛速率,数值实验验证了其有效性。

详情
英文摘要

Sampling from a high-dimensional probability distribution is a fundamental algorithmic task arising in wide-ranging applications across multiple disciplines, including scientific computing, computational statistics and machine learning. Langevin Monte Carlo (LMC) algorithms are among the most widely used sampling methods in high-dimensional settings. This paper introduces a novel higher-order and Hessian-free LMC sampling algorithm based on an efficient stochastic Runge--Kutta method of strong order $1.5$ for the overdamped Langevin dynamics. In contrast to the existing Runge--Kutta type LMC (Li et al., 2019) involved with three gradient evaluations, the newly proposed algorithm is computationally cheaper and requires only two gradient evaluations for one iteration. Under certain log-smooth conditions, non-asymptotic error bounds of the proposed algorithms are analyzed in $\mathcal{W}_2$-distance. In particular, a uniform-in-time convergence rate of order $O(d ^{\frac32} h^{\frac32})$ is derived in a non-log-concave setting, matching the convergence rate proved in the aforementioned work but under the log-concavity condition. Numerical experiments are finally presented to demonstrate the effectiveness of the new sampling algorithm.

2605.07932 2026-05-11 math.HO

Notes on Beltrami's Essay

Steven Rose

AI总结 本文介绍了欧金尼奥·贝尔特拉米1868年发表的关于非欧几里得几何解释的论文,重点阐述了他在负曲率曲面上测地线与欧几里得圆盘中直线之间的对应关系,并展示了圆盘上图形满足双曲几何特性。文章补充了贝尔特拉米未完全解释的公式推导,包括圆盘上的双曲距离公式、三角形内角和小于两直角的证明,以及圆、等距线和视界线的方程。

Comments 24 pages

详情
英文摘要

Eugenio Beltrami published his seminal 'Essay on the Interpretation of Non-Euclidean Geometry' in 1868, where he showed that geodesics on a surface of constant negative curvature can be mapped as straight lines on a Euclidean disc. More importantly he showed that figures on the disc would satisfy the identities of hyperbolic geometry characteristic of a surface of negative curvature. However Beltrami did not always give a full explanation of the equations which he used. These notes are an attempt to provide a derivation of some of his principal results, including his formula for hyperbolic distance on the disc, his proof that the sum of the (hyperbolic) angles of a triangle on the disc is less than two right angles and his equations for circles, equidistants and horocycles.

2605.07923 2026-05-11 math.CO math.PR

The number and structure of connected graphs with a fixed degree sequence

Sasha Bell, Serte Donderwinkel, Remco van der Hofstad

AI总结 本文研究具有固定度序列的连通图的数量与结构,特别是在边数随顶点数线性增长的稀疏情形下。通过配置模型的关联,作者确定了这类连通图的数量,精度达到指数级别。研究中引入了巨分支组件的概念,并结合交换论证确保图的度序列精确匹配,同时分析了均匀连通图的局部结构及罕见事件的概率特性。

详情
英文摘要

We study connected graphs with a fixed degree sequence, in the sparse setting where the number of edges grows linearly in the number of vertices. Using the relation to the configuration model, we identify the number of such connected graphs up to the exponential order. We do this by viewing a connected graph with a given degree distribution as the realization of the giant component in a larger configuration model, and carefully choosing the degree distribution of the larger graph so that it is likely that its giant component has the required degree distribution. To ensure that the connected graph has exactly the correct degrees, we use a switching argument. Additionally, we obtain results on rare event probabilities and describe the local structure of a uniform connected graph with a fixed degree sequence.