arXivDaily arXiv每日学术速递 周一至周五更新
重置
2604.14139 2026-04-16 math.DG math.AP

Mean curvature flows with prescribed singular sets

Raphael Tsiamis

Comments 19 pages

详情
英文摘要

For every closed set $K \subset \mathbb{R}^n$ and every $m \geq 2$, we construct a mean-convex ancient solution to mean curvature flow of hypersurfaces in $\mathbb{R}^{m+n}$, with respect to a smooth Riemannian metric arbitrarily $C^\infty$-close to the Euclidean metric, whose first-time singular set is exactly $K \times \{0\}$.

2604.14138 2026-04-16 math.PR math.CO

Sweet Trims are made of Threes: A càdlàg erasure of the Brownian tree

Alessandra Caraceni, Nicolas Curien, William Fleurat, Adrianus Twigt

Comments 8 pages, 3 figures, 1 video attached

详情
英文摘要

We present a simple trimming algorithm that generates nested uniform binary plane trees by removing leaves one-by-one using a best-of-three-match procedure. While its one-step transition specializes to the Luczak-Winkler & Caraceni-Stauffer coupling, its scaling limit provides a suprising càdlàg erasure of Brownian trees, reminiscent of SLE theory.

2604.14130 2026-04-16 eess.SY cs.SY math.DS

Joint Identification of Linear Dynamics and Noise Covariance via Distributional Estimation

Yang Hu, Na Li

Comments 25 pages, 5 figures

详情
英文摘要

In this paper, we propose a novel framework for the joint identification of system dynamics and noise covariance in linear systems, under general noise distributions beyond Gaussian. Specifically, we would like to simultaneously estimate the dynamical matrix $A$ and the noise covariance matrix $\varSigma$ using state transition data. The formulation builds upon a novel parameterization of the state-transition distribution, which enables more effective use of distributional "shape" information for improved identification accuracy. We introduce two practical estimators, namely the maximum likelihood estimator (MLE) and the score-matching estimator (SME), to solve the joint dynamics-covariance identification problem, and provide rigorous analysis of their statistical properties and sample complexity. Simulation results show that the proposed estimators outperform the ordinary least squares (OLS) baseline.

2604.14127 2026-04-16 math.AG hep-th math-ph math.DG math.GT math.MP

Lagrangian correspondences for moduli spaces of Higgs bundles and holomorphic connections

Panagiotis Dimakis, Duong Dinh, Shengjing Xu

Comments First draft, to be revised soon. Comments welcome!

详情
英文摘要

On a compact connected Riemann surface $C$ of genus at least $2$, we construct Lagrangian correspondences between moduli spaces of rank-$n$ Higgs bundles (respectively, holomorphic connections) and the Hilbert schemes of points on $T^\ast C$ (respectively, the twisted cotangent bundles of $C$). Central to these constructions are Higgs bundles (respectively, holomorphic connections) which are transversal to line subbundles of the underlying bundles: these naturally induce divisors on $C$ together with auxiliary parameters, namely lifts to divisors on spectral curves for Higgs bundles and residue parameters of apparent singularities for holomorphic connections. We discuss the evidence showing that the Dolbeault geometric Langlands correspondence is generically realized by these Lagrangian correspondences; we expect that the de Rham geometric Langlands correspondence can be realized by their quantization, following Drinfeld's construction of Hecke eigensheaves. We also discuss the relations of our constructions to various topics, including reductions of Kapustin-Witten equations, the conformal limit, separation of variables, and degenerate fields in conformal field theories.

2604.14122 2026-04-16 math.PR math-ph math.MP

The scaling limit of random walk and the intrinsic metric on planar critical percolation

Irina Đanković, Maarten Markering, Jason Miller, Yizheng Yuan

Comments 139 pages, 31 figures

详情
英文摘要

We consider critical site percolation ($p=p_c=1/2$) on the triangular lattice $\mathbf{T}$ in two dimensions. We show that the simple random walk on the clusters of open vertices converges in the scaling limit to a continuous diffusion which lives in the gasket of a conformal loop ensemble with parameter $κ= 6$ $\big(\mathrm{CLE}_6\big)$, the so-called $\mathrm{CLE}_6$ Brownian motion. We also show that the intrinsic (i.e., chemical distance) metric converges in the scaling limit to the geodesic $\mathrm{CLE}_6$ metric. As a consequence, we deduce the existence of the chemical distance exponent, the resistance exponent, and the spectral dimension of the critical percolation clusters. Moreover, we show that the exponents satisfy the Einstein relations.

2604.14118 2026-04-16 cs.LG math.SP

Complex Interpolation of Matrices with an application to Multi-Manifold Learning

Adi Arbel, Stefan Steinerberger, Ronen Talmon

详情
英文摘要

Given two symmetric positive-definite matrices $A, B \in \mathbb{R}^{n \times n}$, we study the spectral properties of the interpolation $A^{1-x} B^x$ for $0 \leq x \leq 1$. The presence of `common structures' in $A$ and $B$, eigenvectors pointing in a similar direction, can be investigated using this interpolation perspective. Generically, exact log-linearity of the operator norm $\|A^{1-x} B^x\|$ is equivalent to the existence of a shared eigenvector in the original matrices; stability bounds show that approximate log-linearity forces principal singular vectors to align with leading eigenvectors of both matrices. These results give rise to and provide theoretical justification for a multi-manifold learning framework that identifies common and distinct latent structures in multiview data.

2604.14108 2026-04-16 cs.LG math.DS math.OC stat.ML

Momentum Further Constrains Sharpness at the Edge of Stochastic Stability

Arseniy Andreyev, Advikar Ananthkumar, Marc Walden, Tomaso Poggio, Pierfrancesco Beneventano

Comments 40 pages, 38 figures

详情
英文摘要

Recent work suggests that (stochastic) gradient descent self-organizes near an instability boundary, shaping both optimization and the solutions found. Momentum and mini-batch gradients are widely used in practical deep learning optimization, but it remains unclear whether they operate in a comparable regime of instability. We demonstrate that SGD with momentum exhibits an Edge of Stochastic Stability (EoSS)-like regime with batch-size-dependent behavior that cannot be explained by a single momentum-adjusted stability threshold. Batch Sharpness (the expected directional mini-batch curvature) stabilizes in two distinct regimes: at small batch sizes it converges to a lower plateau $2(1-β)/η$, reflecting amplification of stochastic fluctuations by momentum and favoring flatter regions than vanilla SGD; at large batch sizes it converges to a higher plateau $2(1+β)/η$, where momentum recovers its classical stabilizing effect and favors sharper regions consistent with full-batch dynamics. We further show that this aligns with linear stability thresholds and discuss the implications for hyperparameter tuning and coupling.

2604.14107 2026-04-16 math.NA cs.NA

Bound-Preserving Flux-Corrected Transport Methods for Solving Richards' Equation

Arnob Barua, Christopher E. Kees, Dmitri Kuzmin

详情
英文摘要

Simulating infiltration in porous media using Richards' equation remains computationally challenging due to its parabolic structure and nonlinear coefficients. While a wide range of numerical methods for differential equations have been applied over the past several decades, basic higher-order numerical methods often fail to preserve physical bounds on water pressure and saturation, leading to spurious oscillations and poor iterative solver convergence. Instead, low-order, bound-preserving methods have been preferred. The combination of mass lumping and relative permeability upwinding preserves bounds but degrades accuracy to first order in space. Flux-corrected transport is a high-resolution numerical technique designed for combining the bound-preserving property of low-order schemes with the accuracy of high-order methods, by blending the two methods through limited anti-diffusive fluxes. In this work, we extend flux-corrected transport schemes to the nonlinear, degenerate parabolic structure of Richards' equation, verify attainment of second-order convergence on unstructured meshes, and demonstrate applications to stormwater management infrastructure.

2604.14100 2026-04-16 math.AP

The 2D Euler equations are well-posed for generic initial data in $L^2$

Lucio Galeati

详情
英文摘要

In this note we show the existence of a residual set (in the sense of Baire) of divergence free initial data $u_0\in L^2(D)$, $D=\mathbb{R}^2$ or $\mathbb{T}^2$, for which global existence and uniqueness of weak solutions to the incompressible 2D Euler equations holds. The associated solutions $u$ satisfy the energy balance and are recovered in the vanishing viscosity limit from solutions to 2D Navier-Stokes, which as a consequence cannot display anomalous dissipation of energy. Additionally, there exists a unique regular Lagrangian flow associated to such $u$, and the associated transport equation is well-posed. Finally, when $D=\mathbb{T}^2$, the solution $u$ is recovered as the limit of Galerkin approximations. The proof relies on global existence of smooth solutions and weak-strong uniqueness arguments.

2604.14087 2026-04-16 math.DG math.AP

Quantification of $C^0$ Convergence in Dimension Three

Liam Mazurowski, Xuan Yao

Comments 32 pages, comments are welcome

详情
英文摘要

We address Gromov's Quantification of $C^0$ Convergence Conjecture in dimension three. Let $B$ be the unit ball in $\mathbb R^3$. Let $g$ and $g_0$ be smooth metrics on $B$. We prove there are constants $C$ and $ε_0$ depending only on $g_0$ so that \[ \inf_{x\in B} R_g(x) \leq R_{g_0}(0) + C \|g-g_0\|_{C^0}^{1/2} \] provided $\|g-g_0\|_{C^0}\leq ε_0$. We also construct examples to show that the exponent $1/2$ is sharp. This explicitly quantifies the fact that scalar curvature lower bounds are preserved under $C^0$ convergence of metrics. When $g_0$ is merely $C^2$ we prove a related estimate with a slightly weaker rate, and when $g_0$ has rotational symmetry we prove a related estimate with a stronger linear rate. To prove these results, we use harmonic functions to define a local quantity that detects the scalar curvature. Then we use classical elliptic PDE estimates to show that this quantity is stable under $C^0$ perturbations of the metric. As a further application of this method, we give a partial answer to a question of Gromov on the preservation of scalar curvature lower bounds for metrics that are converging in measure.

2604.14077 2026-04-16 math-ph math.MP math.RT nlin.SI

Open WDVV equations and $\bigvee$-systems

Alessandro Proserpio, Ian A. B. Strachan

Comments 22 pages; comments welcome!

详情
英文摘要

The idea of a $\bigvee$-system was introduced by Veselov in the study of rational solutions of the WDVV equations of associativity. These are algebraic/geometric conditions on the set of covectors that appear in rational solutions to the WDVV equations. Here, this idea is generalized to open WDVV equations, which are an additional set of PDEs originating from open Gromow-Witten Theory. We develop -- for rank-one extensions -- algebraic/geometric conditions on the covectors that supplement the $\bigvee$-system to give rational solutions to the open WDVV equations. Examples, and the relation to superpotentials and to Dubrovin almost-duality, are given.

2604.14076 2026-04-16 math.AP math.CA

Coagulation equations with particle emission

Joseph Klobusicky, Matthew Rakauskas

详情
英文摘要

We present a model for sticky particles in which cluster sizes after a reaction have $\ell$ fewer total particles than the sum of their reactants. The finite particle system is modeled as a Markov process under a mean-field assumption for selecting reactants. The limiting kinetic equations form an infinite system of nonlinear differential equations similar to the Smoluchowski coagulation equations with multiplicative kernel. We show existence and uniqueness for systems whose cluster sizes are either bounded above or below by the emission size $\ell$. When clusters have at most $\ell$ particles, well-posedness can be extended until an exhaustion time in which certain cluster fractions vanish. For clusters with more than $\ell$ particles, we prove short-time well-posedness, along with explicit formulas for cluster sizes and moments. We also conduct numerical experiments which suggest these formulas hold until a gelation time, at which an infinite-sized cluster forms.

2604.14075 2026-04-16 math.OC cs.LG stat.ML

Multistage Conditional Compositional Optimization

Buse Şen, Yifan Hu, Daniel Kuhn

详情
英文摘要

We introduce Multistage Conditional Compositional Optimization (MCCO) as a new paradigm for decision-making under uncertainty that combines aspects of multistage stochastic programming and conditional stochastic optimization. MCCO minimizes a nest of conditional expectations and nonlinear cost functions. It has numerous applications and arises, for example, in optimal stopping, linear-quadratic regulator problems, distributionally robust contextual bandits, as well as in problems involving dynamic risk measures. The naïve nested sampling approach for MCCO suffers from the curse of dimensionality familiar from scenario tree-based multistage stochastic programming, that is, its scenario complexity grows exponentially with the number of nests. We develop new multilevel Monte Carlo techniques for MCCO whose scenario complexity grows only polynomially with the desired accuracy.

2604.14071 2026-04-16 math.ST math.DS stat.TH

Finite-Step Bounds for Iterated Correlation Matrices

Ishrak AlhajjHassan

详情
英文摘要

We establish finite-step probabilistic upper bounds on the contraction ratios $ρ_k = Δ_{k+1}/Δ_k$ for iterated Pearson correlation dynamics. Let $(P_k)_{k\ge 0}$ be the sequence generated by the Pearson update. Define $Δ_k := \|P_{k+1}-P_k\|_F$, $ρ_k := Δ_{k+1}/Δ_k$ for $Δ_k > 0$, and $δ_k := Δ_k/n$. Although $Δ_k \to 0$ along convergent trajectories, the ratios $ρ_k$ may exceed unity in finitely many steps. This behavior is invisible to local linearization. Our main contribution is a probabilistic bounding framework that captures these finite-step expansions. We initialize $P_0$ with i.i.d. $\mathcal{U}[-1,1]$ entries and let $\mathbb{P}$ be the induced measure. For $k \ge 2$, we construct state-dependent bounds $B_p : \mathbb{R}_+ \to \mathbb{R}_+$ satisfying $\mathbb{P}(ρ_k \le B_p(δ_k)) \ge p$. The functions $B^{\mathrm{q}}_p(δ)$ are empirical conditional $p$-quantiles of $\log ρ_k$ given $δ_k$ under logarithmic binning. Larger families $B^{\mathrm{TC}}_{p,τ}(δ)$ and $B^{\mathrm{tol}}_{p,τ}(δ)$ are obtained via multiplicative adjustments, yielding pointwise larger bounds that preserve the $δ$-dependence. Validation on held-out trajectories confirms the bounds hold with empirical coverage matching nominal levels for all $n \in [3,2000]$. The baseline $0.95$-quantile bound $B^{\mathrm{q}}_{0.95}(δ)$ yields two concrete results: $\mathbb{P}(ρ\le 1 \mid δ\le 0.03) \ge 0.95$ uniformly in $n$, and $\mathbb{P}(ρ\le 1.7) \ge 0.95$ for 21 of 22 dimensions. The exception $n = 69$ attains $2.35$, revealing a rare extreme upper tail discontinuity not captured by asymptotic analysis. These are the first finite-step probabilistic bounds for Pearson correlation dynamics. The framework is fully reproducible with provided code and data.

2604.14068 2026-04-16 math.FA

A study on coreflexive Banach Spaces

S. Dwivedi

Comments 7 pages

详情
英文摘要

In this paper, we study non-reflexive Banach spaces $X$ for which the quotient space $X^{**}/X$ is reflexive. Such spaces were first introduced by James R.~Clark, where they were called coreflexive spaces. We show that a space $X$ is coreflexive if and only if every separable subspace $Y\subseteq X$ is coreflexive, provided that $X$ is w$^*$-sequently dense in its bidual $X^{**}$. We show that coreflexive spaces are stable under $\ell^{p}$-sum for $1<p<\infty$. We show that if $X$ is a coreflexive space such that $X^{**}/X$ is separable, then the space of Bochner $p$-integrable functions, $L^{p}(μ,X)$ is coreflexive for $1<p<\infty$. We conclude by providing an alternative proof of the fact, in a quasi-reflexive space $X$, w-PC's of the unit ball $X_{1}$ continue to have the same property in all the higher even-order dual unit balls of $X$.

2604.14061 2026-04-16 cs.IT math.IT math.PR stat.ML

Two-Sided Bounds for Entropic Optimal Transport via a Rate-Distortion Integral

Jingbo Liu

Comments IEEE International Symposium on Information Theory (ISIT) 2026

详情
英文摘要

We show that the maximum expected inner product between a random vector and the standard normal vector over all couplings subject to a mutual information constraint or regularization is equivalent to a truncated integral involving the rate-distortion function, up to universal multiplicative constants. The proof is based on a lifting technique, which constructs a Gaussian process indexed by a random subset of the type class of the probability distribution involved in the information-theoretic inequality, and then applying a form of the majorizing measure theorem.

2604.14055 2026-04-16 quant-ph cs.IT math.FA math.IT math.OA

Two-Indexed Schatten Quasi-Norms with Applications to Quantum Information Theory

Jan Kochanowski, Omar Fawzi, Cambyse Rouzé

Comments 61pages

详情
英文摘要

We define 2-indexed $(q,p)$-Schatten quasi-norms for any $q,p > 0$ on operators on a tensor product of Hilbert spaces, naturally extending the norms defined by Pisier's theory of operator-valued Schatten spaces. We establish several desirable properties of these quasi-norms, such as relational consistency and the behavior on block diagonal operators, assuming that $|\frac{1}{q} - \frac{1}{p}| \leq 1$. In fact, we show that this condition is essentially necessary for natural properties to hold. Furthermore, for linear maps between spaces of such quasi-norms, we introduce completely bounded quasi-norms and co-quasi-norms. We prove that the $q \to p$ completely bounded co-quasi-norm is super-multiplicative for tensor products of quantum channels for $q \geq p>0$, extending an influential result of [Devetak, Junge, King, Ruskai, 2006]. Our proofs rely on elementary matrix analysis and operator convexity tools and do not require operator space theory. On the applications side, we demonstrate that these quasi-norms can be used to express relevant quantum information measures such as Rényi conditional entropies for $α\geq \frac{1}{2}$ or the Sandwiched Rényi Umlaut information for $α< 1$. Our multiplicativity results imply a tensorizing notion of reverse hypercontractivity, additivity of the completely bounded minimum output Rényi-$α$-entropy for $α\geq\frac{1}{2}$ extending another important result of [Devetak, Junge, King, Ruskai, 2006], and additivity of the maximum output Rényi-$α$ entropy for $α\geq \frac{1}{2}$.

2604.14042 2026-04-16 math.CO

On the Scalability of Quasi-Complementary Sequence Sets: Quadratic and Cubic Laws

Huaning Liu, Lirong Guo, Zilong Liu

Comments This work has been submitted to IEEE Transactions on Information Theory on 15 April 2026

详情
英文摘要

This work is concerned with the fundamental scaling laws of quasi-complementary sequence sets (QCSSs) by understanding how large the set size (denoted by $M$) can grow with the flock size ($K$) and the sequence length ($N$). We first establish a geometric framework that transforms a QCSS into a complex unit-norm codebook, through which and by exploiting the density thresholds of the codebooks, certain polynomial upper bounds of the QCSS set size are obtained. Sharp quadratic and cubic scaling laws are then introduced. Specifically, we show that asymptotically optimal QCSSs with tightness factor $ρ=1$ satisfy $M \le (1+o(1))K^2N$, while asymptotically near-optimal QCSSs satisfy $M \le (1+o(1))K^3N^2$ for $ρ< {(1+\sqrt{5})}/{2}$. To validate these upper bounds, we further propose explicit additive-character and mixed-character based constructions for QCSSs that achieve $M = K^2N + K$ and $M = K^3N^2 + 2K^2N + K$, respectively, thereby showing that the quadratic and cubic scaling laws are asymptotically tight. Our proposed constructions admit flexible parameter choices, and their maximum correlation estimates are shown to be tight through explicit extremal examples. Additionally, it is conjectured that the cubic scaling law is universal for all $1<ρ\le 2$, i.e., any asymptotically near-optimal QCSSs should satisfy $M \le (1+o(1))K^3N^2$. This identifies a fundamental cubic barrier for QCSS scalability.

2604.14040 2026-04-16 math.AG

A lower bound on the Calabi functional for a degeneration of polarized varieties

Gabriel Frey

Comments 28 pages, comments welcome!

详情
英文摘要

We prove a lower bound on the Calabi functional for degenerations of polarized varieties, involving the difference of CM degrees between generically isomorphic families. This may be viewed as a discretely valued version of Donaldson's lower bound for models, in the sense of non-Archimedean geometry. In particular, this generalizes a result of Donaldson, who considered a single polarized variety. As a main tool, we develop the theory of GIT height, introduced by Wang, and apply it to the family GIT problem of the Chow variety. Using the GIT height, we also give a numerical proof of separatedness of GIT quotients for general and special linear actions, strengthening prior work of Wang--Xu.

2604.14037 2026-04-16 cs.LG math.AG math.CO

A Complete Symmetry Classification of Shallow ReLU Networks

Pranavkrishnan Ramakrishnan

详情
英文摘要

Parameter space is not function space for neural network architectures. This fact, investigated as early as the 1990s under terms such as ``reverse engineering," or ``parameter identifiability", has led to the natural question of parameter space symmetries\textemdash the study of distinct parameters in neural architectures which realize the same function. Indeed, the quotient space obtained by identifying parameters giving rise to the same function, called the \textit{neuromanifold}, has been shown in some cases to have rich geometric properties, impacting optimization dynamics. Thus far, techniques towards complete classifications have required the analyticity of the activation function, notably excising the important case of ReLU. Here, in contrast, we exploit the non-differentiability of the ReLU activation to provide a complete classification of the symmetries in the shallow case.

2604.14036 2026-04-16 math.NT

Distribution modulo one of linear recurrent sequences

Zhangchi Chen, Zihao Ye, Weizhe Zheng

Comments 12 pages. This is an expanded version of Section 4 of arXiv:2511.21324v3

详情
英文摘要

We study the distribution modulo one of linear recurrent sequences of real numbers. We prove criteria for the finiteness of the set of limit values of the fractional parts of such a sequence and give lower bounds for the maximal distance between two limit values. Our results generalize theorems of Flatto, Lagarias, Pollington, and Dubickas.

2604.14033 2026-04-16 math.FA math.AP

On the divergence of the composition of irregular fields with BV functions

Graziano Crasta, Virginia De Cicco, Annalisa Malusa

Comments 36 pages

详情
英文摘要

We introduce a family of (nonlinear) pairing measures that ensure the validity of the divergence rule for composite functions $\boldsymbol{B}(x,u(x))$, where $\boldsymbol{B}(\cdot,t)$ is a bounded divergence-measure vector field, and $u$ is a scalar function of bounded variation. The elements of the family depend on the choice of the pointwise representative of $u$ on its jump set. Beyond the standard properties, such as the Coarea and Gauss-Green formulas on sets of finite perimeter, this flexibility allows us to characterize the pairings that ensure the lower semicontinuity of the corresponding functionals along sequences converging in $L^1$ with controlled precise values. We show that these lower semicontinuous pairings arise as the relaxation of integral functionals defined in Sobolev spaces.

2604.14031 2026-04-16 math.CT cs.LO math.LO

Topologically valued transition structures

Matthew Collinson

详情
英文摘要

We investigate several categories related to transition structures, using a mixture of algebraic and topological methods. We show how two such categories are connected by a contravariant adjunction. This is the most detailed of a family of such results depending on topological restrictions on objects and morphisms.

2604.14024 2026-04-16 math.AG hep-th math.DG

Deformations of fibered Calabi--Yau varieties

Benjamin Bakker, Kristin DeVleming, Stefano Filipazzi, Radu Laza, Jennifer Li, Roberto Svaldi, Chengxi Wang, Junyan Zhao

Comments v1: 15 pages. Comments are welcome

详情
英文摘要

Kollár showed that small deformations of elliptically fibered smooth $K$-torsion varieties with $H^2(X,\mathcal{O}_X)=0$ remain elliptically fibered. We extend this result to any fibered smooth $K$-torsion variety $X$ with $H^2(X,\mathcal{O}_X)=0$, using Hodge theoretic techniques and the $T^1$-lifting criterion of Kawamata--Ran. More generally, our strategy implies that even without the cohomological assumption, small deformations of a semiample line bundle on a smooth $K$-torsion variety remain semiample up to homological equivalence.

2604.14020 2026-04-16 math.LO math.CV math.GN

Saturation and isomorphism of abstract harmonic spaces

Haoming Wang

详情
英文摘要

This paper models the theory of abstract harmonic spaces in the syntax of the continuous first-order logic of Banach lattices. It addresses a topological question asking when a one-to-one harmonic map onto smooth manifolds $M^n$ is a diffeomorphism. We give $M^n$ ($n\le 2$) a characterization by $U$-rank and elementary saturation for large cardinals. Polar sets are characterized by several equivalent conditions from the omitting type theorem. Consequently, harmonic measures on the ideal boundary in Martin representation are bijectively mapped to Keisler measures supported on non-principal types. Further problems concerning o-minimality and non-local potentials are finally discussed.

2604.14017 2026-04-16 math.OC cs.LG

Stochastic Trust-Region Methods for Over-parameterized Models

Aike Yang, Hao Wang

Comments 26 pages, 3 figures

详情
英文摘要

Under interpolation-type assumptions such as the strong growth condition, stochastic optimization methods can attain convergence rates comparable to full-batch methods, but their performance, particularly for SGD, remains highly sensitive to step-size selection. To address this issue, we propose a unified stochastic trust-region framework that eliminates manual step-size tuning and extends naturally to equality-constrained problems. For unconstrained optimization, we develop a first-order stochastic trust-region algorithm and show that, under the strong growth condition, it achieves an iteration and stochastic first-order oracle complexity of $O(\varepsilon^{-2} \log(1/\varepsilon))$ for finding an $\varepsilon$-stationary point. For equality-constrained problems, we introduce a quadratic-penalty-based stochastic trust-region method with penalty parameter $μ$, and establish an iteration and oracle complexity of $O(\varepsilon^{-4} \log(1/\varepsilon))$ to reach an $\varepsilon$-stationary point of the penalized problem, corresponding to an $O(\varepsilon)$-approximate KKT point of the original constrained problem. Numerical experiments on deep neural network training and orthogonally constrained subspace fitting demonstrate that the proposed methods achieve performance comparable to well-tuned stochastic baselines, while exhibiting stable optimization behavior and effectively handling hard constraints without manual learning-rate scheduling.

2604.14006 2026-04-16 math.CO

Coloring powers of random graphs

Alan Frieze, Ross Kang, Aditya Raut, Michelle Sweering, Hilde Verbeek

详情
英文摘要

Given a graph $G$ and an integer $r\ge 1$, the $r$th power $G^r$ of $G$ is the graph obtained from $G$ by adding edges for all pairs of distinct vertices at distance at most $r$ from each other. We focus on two basic structural properties of the $r$th power of the binomial random graph $G_{n,p}$, namely, the maximum degree $Δ(G_{n,p}^r)$ and the chromatic number $χ(G_{n,p}^r)$, and give with high probability (w.h.p.) bounds. In the sparse case that $p=d/n$ for some fixed constant $d>0$, we prove the following. We prove that w.h.p.~$Δ(G_{n,p}^r) \sim \frac{\log n}{\log_{(r+1)}n}$ (where $\log_{(1)}n=\log n$ and $\log_{(r+1)}n=\log\log_{(r)}n$) and that w.h.p.~$Δ(G_{n,p}^{\lfloor{r/2}\rfloor})+1 \le χ(G_{n,p}^r) \le Δ(G_{n,p}^{r-1})+1$. For $r=2$, we show the upper bound holds with equality. For denser cases, for $d$ satisfying $d=ω(\log n)$ and $d\le n^{1/r-Ω(1)}$ as $n\to\infty$, we have $χ(G_{n,p}^r) = Θ(d^r/\log d)$ w.h.p.

2604.14000 2026-04-16 math.AP math.SP

The Makai inequality in higher dimensions: qualitative and quantitative aspects

Vincenzo Amato, Nunzia Gavitone, Rossano Sannipoli

详情
英文摘要

In this paper, given a convex, bounded, open set $Ω\subset \mathbb{R}^n$ we prove a sharp inequality involving the Laplacian torsional rigidity and both the perimeter and the measure of the domain. Our result generalizes to arbitrary dimensions the inequality established by Makai in the plane which, as conjectured in arXiv:2007.02549. Furthermore, we establish quantitative estimates that provide key insights into the geometric structure and the thickness of the underlying optimizing sequences.

2604.13999 2026-04-16 math.GT

Triple-cup product forms of 3-manifolds and Heegaard diagrams

Maya Kayali

Comments 19 pages

详情
英文摘要

The triple-cup product form $μ$ is a classical invariant of $3$-manifolds, determining the cohomology ring up to torsion. Given a closed, connected, oriented $3$-manifold $M$, we describe an explicit formula for computing $μ$ from a Heegaard diagram of $M$. Then, we show that the triple-cup product form $μ$ can be recovered as a reduction of Turaev's homotopy intersection form $η$ of the Heegaard surface.

2604.13989 2026-04-16 math.GR

Computing least common multiples in monoids with a finite Garside family

Emir Melliti

Comments 30 pages, 16 figures, associated code available at https://github.com/domyrmininon/compute_gf

详情
英文摘要

Right-reversing is an algorithm used to compute least common multiples in monoids that admit a right-complemented presentation. The algorithm can either terminate and find a result, fail, or run indefinitely. The correctness of the algorithm can be proved with additional assumptions coming from Garside theory. In the same framework, we prove that a non-terminating run of the algorithm is necessarily cyclic. Stopping when a cycle is detected provides a way of computing a minimal Garside family.