First-Order Trajectory Matching: Fast Ensemble Predictions of Chaotic, Turbulent, Stochastic Systems
一阶轨迹匹配:混沌、湍流、随机系统的快速集成预测
AI总结 提出一阶轨迹匹配(FTM)方法,通过学习随机系统轨迹的一阶局部概率质量输运,实现低成本的集成预测,并捕捉通量、环流等轨迹量。
一阶轨迹匹配:混沌、湍流、随机系统的快速集成预测
Shreya Jha, Timo Schorlepp, Nicholas Geissler, Jules Berman, Benjamin Peherstorfer
AI总结 提出一阶轨迹匹配(FTM)方法,通过学习随机系统轨迹的一阶局部概率质量输运,实现低成本的集成预测,并捕捉通量、环流等轨迹量。
我们引入一阶轨迹匹配(FTM),这是一种替代建模方法,从随机系统的轨迹中学习概率质量的一阶局部输运。通过匹配轨迹的对称一阶运动,FTM学习概率流速度,其流动保持时间边缘以匹配集成平均值,同时捕获类似流的轨迹量,如通量、环流和跨势垒电流。FTM直接从轨迹学习流速度,避免了漂移、扩散和分数估计。我们的稳定性分析将离散化误差与采样方差分开,并表明当时间分辨率和样本量适当平衡时,单步无模拟的FTM损失是稳定的。在随机动力系统和PDE示例中,我们经验证明FTM以低确定性展开成本提供轨迹感知的集成预测。
We introduce First-Order Trajectory Matching (FTM), a surrogate-modeling method that learns the first-order local transport of probability mass from trajectories of stochastic systems. By matching the symmetric first-order motion of trajectories, FTM learns the probability current velocity, whose flow preserves time marginals to match ensemble averages, while also capturing current-like trajectory quantities such as fluxes, circulations, and barrier-crossing currents. FTM learns the current velocity directly from trajectories, avoiding drift, diffusion, and score estimation. Our stability analysis separates discretization error from sampling variance and shows that the one-step simulation-free FTM loss is stable when temporal resolution and sample size are properly balanced. Across stochastic dynamical systems and PDE examples, we empirically demonstrate that FTM provides trajectory-aware ensemble predictions at low, deterministic-rollout cost.
通过极次数的对角幂和对称行列式下界
Karthik Sheshadri
AI总结 本文证明了对角幂和多项式∑x_i^n的对称行列式复杂度至少为(1/(2e)-o(1))n^2,通过极次数方法和对称秩一核入射分析,并给出一般超曲面的上界。
多项式f的对称行列式复杂度sdc(f)是使得f = det(M)的最小m,其中M是大小为m的仿射线性形式的对称矩阵。我们在复数域上证明了sdc(∑_{i=1}^n x_i^n) ≥ (1/(2e) - o(1)) n^2。这是作者非对称极次数预印本(arXiv:7680505)的对称版本;方法类似,但这里的证明是自包含的,并在对称设置中重新进行了承重局部入射分析。一般定理:若X = V(f)在P^{N-1}中是光滑的d次超曲面,N ≥ 3,且f = det(A_0 + ∑ x_i A_i),其中所有A_i是对称的且大小为m,则最高极次数d(d-1)^{N-2} ≤ 2^{N-2} C(m, N-1)。证明使用了对称秩一核入射M(z,x) u = 0。在真正的极点上M的秩为m-1,对称Schur补标准型在方案论上消除了唯一的核线;在得到的局部图上,提升的余法形式u^T A_i u是偏导数d_i f的公共单位倍数,因此提升的极方程在单位意义下切割了普通极切片,且每个真正的提升极点都是零维孤立解。在P^N × P^{m-1}上的多齐次Bezout定理给出界2^{N-2} C(m, N-1)。对于F_n = ∑ x_i^n,这给出常数1/(2e)。更一般地,对于F_{N,d} = ∑_{i=1}^N x_i^d,相同定理给出当N→∞时sdc(F_{N,d}) ≥ (1/(2e) - o_N(1)) N(d-1)。我们给出了F_{N,d}的大小为2N(d+1)+1的显式对称表示,因此对角界是非平凡的且紧致到常数。结果适用于特征零下的精确对称行列式复杂度;它不是边界陈述,也不是一致的正特征定理。
The symmetric determinantal complexity sdc(f) of a polynomial f is the least m such that f = det(M) for an m x m symmetric matrix M of affine-linear forms. We prove, over the complex numbers, that sdc(sum_{i=1}^n x_i^n) >= (1/(2e) - o(1)) n^2. This is a symmetric companion to the author's non-symmetric polar-degree preprint (arXiv:7680505); the method parallels that work, but the proof here is self-contained and redoes the load-bearing local incidence analysis in the symmetric setting. The general theorem: if X = V(f) in P^{N-1} is a smooth degree-d hypersurface, N >= 3, and f = det(A_0 + sum x_i A_i) with all A_i symmetric of size m, then the top polar degree d(d-1)^{N-2} is at most 2^{N-2} C(m, N-1). The proof uses the symmetric rank-one kernel incidence M(z,x) u = 0. At a genuine polar point M has rank m-1, and a symmetric Schur-complement normal form eliminates the unique kernel line scheme-theoretically; on the resulting local graph the lifted conormal forms u^T A_i u are a common unit multiple of the partials d_i f, so the lifted polar equations cut the ordinary polar slice up to units and each genuine lifted polar point is a zero-dimensional isolated solution. Multihomogeneous Bezout on P^N x P^{m-1} then yields the bound 2^{N-2} C(m, N-1). For F_n = sum x_i^n this gives the constant 1/(2e). More generally, for F_{N,d} = sum_{i=1}^N x_i^d the same theorem gives sdc(F_{N,d}) >= (1/(2e) - o_N(1)) N(d-1) as N -> infinity. We give an explicit symmetric representation of F_{N,d} of size 2N(d+1)+1, so the diagonal bounds are non-vacuous and tight up to a constant. The result is for exact symmetric determinantal complexity in characteristic zero; it is not a border statement and not a uniform positive-characteristic theorem.
加权计时正则语言的带宽:最后一步(长版)
Eugene Asarin, Aldric Degorre, Catalin Dima, Bernardo Jacobo Inclán
AI总结 针对正常计时自动机,提出将带宽计算约化为加权有限图中最佳奖励-成本比问题,实现其带宽的近似计算。
计时语言的带宽刻画了每单位时间的信息量(具有有限观测精度 $\varepsilon$)。当 $\varepsilon \to 0$ 时,带宽的渐近行为将计时正则语言分为三类:贫乏、正常和肥胖。正常计时自动机具有有界的事件频率和一些非点状转移,并且到目前为止,是唯一一类没有算法可用于计算其带宽的计时自动机。在本文中,我们计算任何此类自动机的带宽,形式为 $\approx \alpha \log{1/\varepsilon}$。我们的方法将这个问题约化为在从给定计时自动机构造的加权有限图中计算最佳奖励-成本比。
The bandwidth of a timed language characterizes the quantity of information per time unit (with a finite observation precision $\varepsilon$). The asymptotic behavior of the bandwidth as $\varepsilon \to 0$ classifies timed regular languages in three classes: meager, normal, and obese. Normal timed automata have a bounded frequency of events and some non-punctual transitions, and, up to now, were the only class of timed automata for which no algorithm was available for computing their bandwidth. In this article, we compute the bandwidth of any such automaton in the form $\approxα\log{1/\varepsilon}$. Our approach reduces this problem to computing the best reward-to-cost ratio in a weighted finite graph constructed from the given timed automaton.
学习双稀疏显式条件变换
Tudor Pistol
AI总结 提出一种将固定规范矩阵与自适应稀疏分量乘积形式的结构化显式条件变换学习方法,在保持快速稳定分析变换优势的同时引入可控自适应性,实验表明在双稀疏变换学习问题上达到最优性能。
在最近的研究中,找到自然信号假定稀疏结构成立的便利空间已成为一个理想结果,其影响体现在数据压缩、降噪和特征提取等领域。虽然广泛使用的分析变换(如DFT或DCT)已经提供了高效的算法和鲁棒的稀疏表示,但它们假设了关于数据的固定先验,无法准确捕捉更严格信号类别的特定结构。为了解决这个问题,文献中引入了数据自适应学习变换的概念,允许减少变换域中的残差项。最近的研究表明,条件数在此背景下是一个良好的度量,期望的结果在泛化倾向和实现最小近似误差之间交替。受这些考虑启发,我们引入了一种结构化显式条件变换的学习,该变换被表述为一个固定规范矩阵与一个精炼的数据自适应稀疏分量的乘积。这种方法旨在保留快速稳定分析变换的优势,同时引入对数据的可控自适应性。目前尚未发现涉及这种特定公式的参考文献,表明其新颖性。所提出的算法在不精确近端方法的框架内被推导,利用了一个新导出的闭式投影算子。实验观察表明,在双稀疏变换学习问题上取得了最先进的结果,并且与密集变体相比,在显著降低计算成本的同时,有时收敛更快且更好地避免不良局部最小值。
Finding convenient spaces in which certain hypotheses regarding an assumed sparse structure of natural signals hold true has become a desirable result in recent research, its implications being reflected in areas such as data compression, noise reduction and feature extraction. While the extensively used analytical transforms, such as DFT or DCT, already provide efficient algorithms and robust sparse representations, they assume a fixed prior about the data, failing to accurately capture the specific structure of more restrictive classes of signals. To address this, the concept of a data-adaptive, learnt transform has been introduced in the literature, allowing for the reduction of a residual term in the transform domain. More recent studies have shown that the condition number serves as a good metric in this context, where the desired outcome alternates between a generalizing tendency and one that achieves minimal approximation error. Motivated by these considerations, we introduce the learning of a structured, explicitly conditioned transform formulated as the product of a fixed canonical matrix and a refining data-adaptive sparse component. This approach seeks to preserve the advantages of fast and stable analytical transforms, while introducing controllable adaptivity to the data. No references that concern this specific formulation have been identified so far, indicating its novelty. The proposed algorithm is motivated within the framework of inexact proximal methods, leveraging a newly derived closed-form projection operator. Empirical observations demonstrate state-of-the-art results on the doubly sparse transform learning problem and comparable performance with its dense variant at significantly lower computational costs and sometimes faster convergence and better avoidance of bad local minima.
Express 语言建模
Albert Gong, Annabelle Michael Carrell, Raaz Dwivedi, Lester Mackey
AI总结 提出 Express 工具,将非因果注意力近似转换为因果近似,结合 Thinformer 实现最优因果注意力保证,并加速语言建模中的四个资源瓶颈。
我们引入了一个新工具 Express,用于将非因果注意力近似转换为具有匹配近似保证的因果近似。当与最先进的 Thinformer 近似结合时,Express 改进了已知的最佳因果注意力保证,对于长度为 $n$ 的序列,实现了 $\log^{3/2}(n)/s$ 的近似误差,仅需 $O(s)$ 内存和 $O(s^2 \log^2(n))$ 的压缩开销。我们将这些进展与高效的 I/O 感知 Triton 实现相结合,展示了相对于 FlashAttention 2 的显著加速,并使用 Express 克服了语言建模流程中的四个资源瓶颈:长上下文预填充、KV 缓存压缩、长形式内存受限解码和长形式计算受限解码。
We introduce a new tool, Express, for converting a non-causal attention approximation into a causal approximation with matching approximation guarantees. When combined with the state-of-the-art Thinformer approximation, Express improves upon the best known causal attention guarantees, delivering $\log^{3/2}(n)/s$ approximation error with only $O(s)$ memory and $O(s^2 \log^2(n))$ compression overhead for a sequence of length $n$. We pair these developments with an efficient I/O-aware Triton implementation, demonstrate substantial speedups over FlashAttention 2, and use Express to overcome four resource bottlenecks in the language modeling pipeline: long-context prefill, KV cache compression, long-form memory-constrained decoding, and long-form compute-constrained decoding.
范围惩罚:理论洞见及其在联邦学习中的应用
Yiyuan She, Zhaojun Hu, Yifan Sun
AI总结 提出范围正则化方法,通过极值聚类实现跨客户端正则化,并开发非渐近统计精度与模式恢复的新证明技术,以及利用局部强凸性的快速优化算法。
本文针对具有线性系统组件的联邦学习引入范围正则化,以提高统计精度并诱导跨客户端正则性,从而有利于量化、编码和资源效率。我们的方法识别不同客户端之间共享权重的特征,并将个性化特征的权重自适应地聚类到极值,这一过程称为极值聚类。由于正则化子的半范数性质和不可分解性,相关估计量的理论分析面临重大挑战。我们为统计精度和忠实模式恢复的非渐近分析开发了新的证明技术。此外,提出了一种利用不同程度局部强凸性的快速优化算法,以降低迭代复杂度。实验支持了所提方法的有效性和效率。
This paper introduces range regularization for federated learning with linear systematic components to enhance statistical accuracy and induce cross-client regularity conducive to quantization, coding, and resource efficiency. Our approach identifies features with shared weights across different clients and adaptively clusters the weights of personalized features at extreme values, a process we refer to as polar clustering. Theoretical analysis of the associated estimators poses significant challenges due to the seminorm nature and non-decomposability of the regularizer. We develop new proof techniques for the nonasymptotic analysis of statistical accuracy and faithful pattern recovery. Moreover, a fast optimization algorithm that leverages varying degrees of local strong convexity is proposed to reduce iteration complexity. Experiments support the efficacy and efficiency of the proposed approach.
基于梯度的双层逆最优控制:一种黎曼方法
Ahmed-Manaf Dahmani, Vincent Bonnet, David Daney, François Charpillet
AI总结 提出一种黎曼逆最优控制方法,将最优轨迹集视为流形,通过流形上的优化避免标准约束违规,计算时间减少约四倍。
逆最优控制旨在恢复解释观测轨迹作为最优控制问题解的成本函数。经典逆最优控制公式依赖于双层优化,反复求解嵌套的最优控制问题,对于实际系统很快变得计算上不可行。最近的基于投影的方法提供了一种有希望的替代方案,但由于违反标准约束条件,在使用基于梯度的方法求解时会出现数值不稳定性。在本文中,我们表明这些困难源于逆最优控制可行集的几何结构。我们证明满足最优性条件的轨迹集自然形成一个流形,并将逆最优控制重新表述为该流形上的优化问题。基于这一见解,我们提出了一种黎曼逆最优控制方法,该方法将观测轨迹投影到最优解流形上,同时通过构造保持可行性。在真实人类手臂轨迹上的实验表明,所提出的方法在重建精度上与经典双层逆最优控制相当或更好,同时计算时间减少约四倍。这些结果凸显了几何优化方法在提高逆最优控制在机器人和人体运动分析中的可扩展性和可靠性方面的潜力。
Inverse Optimal Control (IOC) aims to recover the cost function that explains observed trajectories as solutions of an optimal control problem. Classical IOC formulations rely on bilevel optimization, which repeatedly solves a nested optimal control problem and quickly becomes computationally prohibitive for realistic systems. Recent projection-based approaches offer a promising alternative but suffer from numerical instability when solved with gradient-based methods due to violations of standard constraint qualifications. In this paper, we show that these difficulties stem from the geometric structure of the IOC feasible set. We demonstrate that the set of trajectories satisfying the optimality conditions naturally forms a manifold and reformulate IOC as an optimization problem on this manifold. Based on this insight, we propose a Riemannian Inverse Optimal Control (RIOC) method that projects observed trajectories onto the manifold of optimal solutions while preserving feasibility by construction. Experiments on real human arm trajectories show that the proposed method achieves comparable or better reconstruction accuracy than classical bilevel IOC while reducing computation time by about a factor of four. These results highlight the potential of geometric optimization methods to improve the scalability and reliability of IOC for robotics and human motion analysis.
编码欧拉特征变换
Nello Blaser, Odin Hoff Gardaa, Lars M. Salbu, Elena Xinyi Wang, Bastian Rieck
AI总结 提出连续编码方法,将欧拉特征曲线转化为每个顶点的净变化序列,通过小型变换器生成特征向量,并在多个数据集上提升分类精度。
欧拉特征曲线(ECC)记录线性嵌入的胞复形在给定方向上的欧拉特征随过滤高度的变化,而欧拉特征变换(ECT)是通过收集多个方向上的ECC得到的单射形状描述符。如何为神经网络编码ECT本身是一种归纳偏置,传统上通过离散化每个ECC来固定。我们引入一种连续编码:对于每个方向和每个顶点,它记录归因于该顶点的净欧拉特征变化,产生一个每个方向的令牌序列,由一个小型变换器映射到特征向量。我们将得到的流程分为两个正交轴上的阶段:一个ECC编码器,在每个方向内作用,将其曲线映射到固定长度向量;以及一个ECT表示,跨方向作用,聚合每个方向的向量为一个。我们研究了六种ECT表示架构,涵盖从结构无关的前馈基线到保持平面旋转等变性的卷积和复值模型的一系列归纳偏置。在涵盖点云、图、立方复形和网格的六个分类基准上,连续编码在六个数据集中有五个提高了准确率,控制实验将增益归因于令牌化本身而非增加的变换器容量。表示架构的重要性小于编码,其归纳偏置的收益取决于编码:前馈网络在连续编码下表现最佳,但在离散化下不如卷积架构鲁棒。
The Euler Characteristic Curve (ECC) records the Euler characteristic of a linearly embedded cell complex as a function of filtration height in a given direction, and the Euler Characteristic Transform (ECT) is the injective shape descriptor obtained by collecting ECCs over many directions. How the ECT is encoded for a neural network is itself an inductive bias, conventionally fixed by discretizing each ECC. We introduce a continuous encoding: for each direction and each vertex it records the net Euler-characteristic change attributed to that vertex, producing a per-direction token sequence that a small transformer maps to a feature vector. We separate the resulting pipeline into two stages on orthogonal axes: an ECC encoder that acts within each direction, mapping its curve to a fixed-length vector, and an ECT representation that acts across directions, aggregating the per-direction vectors into one. We study six ECT representation architectures spanning a range of inductive biases, from a structure-agnostic feedforward baseline to convolutional and complex-valued models that preserve equivariance under planar rotations. Across six classification benchmarks covering point clouds, graphs, cubical complexes, and meshes, the continuous encoding improves accuracy on five of six datasets, and control experiments attribute the gain to the tokenization itself rather than to the added transformer capacity. The representation architecture matters less than the encoding, and the payoff from its inductive biases depends on the encoding: a feedforward network performs best under continuous encoding but is less robust under discretization than convolutional architectures.
Moonshine:一个以猜想生成为中心的自主数学研究智能体
Xiaoyang Chen, Xiang Jiang
AI总结 提出自主智能体Moonshine,通过提取经典问题结构、提炼新概念并生成数学猜想,以Jacobian猜想为例,将其转化为神经Jacobian猜想并证明部分情况。
Moonshine是一个自主智能体,其核心目标是生成数学猜想。它的核心能力是从经典问题中提取结构、提炼新概念,并制定具有数学意义的猜想。Moonshine不将解决单个命题作为终点,而是通过猜想生成、桥梁构建和障碍识别来构建可扩展的理论框架。本文以Moonshine对Jacobian猜想的探索为例,展示了局部非退化性是否强制全局单射性的核心逻辑如何转移到单隐层仿射-岭sigmoid网络上。这导致了\emph{神经Jacobian猜想}(NJC)的提出:如果这样的网络在整个空间上具有严格正的Jacobian行列式,则它必须是全局单射的。通过分别调用GPT-5.5-pro和DeepSeek-V4-pro,Moonshine获得了情况\(N=n+1\)的独立完整证明。此外,在ChatGPT通过其网页界面与GPT-5.5-pro交互使用的辅助下,开发了一个几何拓扑证明。这些结果为猜想的合理性提供了初步证据。然而,一般的高宽度情况\(N\ge n+2\)仍未解决,留待进一步研究。这项工作展示了Moonshine自主生成有意义的数学问题并对其取得严格进展的能力。
Moonshine is an autonomous agent whose central objective is to generate mathematical conjectures. Its core capability is to extract structure from classical problems, distill new concepts, and formulate conjectures of mathematical significance. Rather than treating the solution of a single proposition as its endpoint, Moonshine builds an extensible theoretical framework through conjecture generation, bridge building, and obstacle identification. This article uses Moonshine's exploration of the Jacobian conjecture as an example. It shows how the central logic of whether local nondegeneracy can force global injectivity is transferred to one-hidden-layer affine-ridge sigmoid networks. This leads to the formulation of the \emph{Neural Jacobian Conjecture} (NJC): if such a network has strictly positive Jacobian determinant on the whole space, then it must be globally injective. By invoking GPT-5.5-pro and DeepSeek-V4-pro separately, Moonshine obtained independent complete proofs for the case \(N=n+1\). In addition, with the assistance of ChatGPT through interactive use of its web interface with GPT-5.5-pro, a geometric-topological proof was developed. These results provide preliminary evidence for the plausibility of the conjecture. The general higher-width case \(N\ge n+2\), however, remains unresolved and is left for further investigation. This work illustrates Moonshine's ability to autonomously generate meaningful mathematical problems and make rigorous progress on them.
最大策略迭代,再探讨
David Monniaux, Helmut Seidl
AI总结 提出用值迭代替代数学优化实现最大策略迭代,保证终止性,并扩展到有理数域,通过最小策略迭代求解优化问题,证明有界系统的最小解收敛性。
最大策略迭代是一种通过连续尝试解析最大值算子并归约为数学优化来计算精确数值程序不变量的方法。然而,数学优化可能代价高昂。这里,我们证明,对于整数和浮点数上的方程组的最大策略迭代,数学优化可以被简单的值迭代替代——且仍保证终止。作为应用,获得了整数或浮点变量的精确界限分析,完全避免了扩张算子。我们还考虑了有理数上的最大策略迭代,其中右侧是未知数的仿射组合的最小值的最大值。我们提出最小策略迭代作为线性规划的替代方案,用于解决最大策略迭代提出的优化问题。我们证明最大-最小策略迭代保证返回有界系统的最小解。我们还展示了如何将该算法扩展到无界系统,以及如何构造计算结果的可靠性和最优性证书。
Max-policy iteration is an approach to computing precise numeric program invariants by successive attempts at resolving maximum operators and reduction to mathematical optimization. Mathematical optimization, though, may be expensive. Here, we show, for max-policy iteration on systems of equations over integers as well as over floating point numbers, that mathematical optimization can be replaced by plain value iteration -- which is still guaranteed to terminate. As an application, a precise bound analysis for integer or floating point variables is obtained, avoiding widening operators altogether. We also consider max-policy iteration over the rational numbers, where the right-hand sides are maxima of minima of affine combinations of unknowns. We propose min-policy iteration as an alternative to linear programming for solving the optimization problems posed by max-policy iteration. We prove that max-min policy iteration is guaranteed to return the least solution for bounded systems. We also show how to extend this algorithm to unbounded systems, and how to construct certificates of soundness as well as of optimality of the computed results.
NOVA: 可解释的跟驰与换道模型及驾驶员异质性的符号回归发现
Ishak Abassi, Nassim Ali Bouazzouni, Farah Ibelaiden, Nadir Farhi
AI总结 提出NOVA符号回归框架,从原始轨迹数据自动发现可解释的跟驰与换道结构,在NGSIM数据集上优于基线,并揭示主导非线性项与心理物理理论关联。
我们提出了NOVA,一个自主符号回归框架,能够从原始轨迹数据中识别出可解释的跟驰和换道结构,且仅需极少的先验行为假设。应用于来自NGSIM I-80和US-101数据集的4,765,788个活跃驾驶观测,NOVA的确定性Rust驱动搜索引擎评估了超过10,000个候选代数结构,并在前向平移滚动均值预测目标下识别出一个紧凑的两项加速度模型。在两种互补的预处理流程下评估,NOVA在意图预测基准上实现了RMSE = 1.376 m/s²(R² = 15.57%),在相同评估协议下,RMSE比最佳重新校准的符号回归基线(SR-LLM, PNAS 2025)低0.135 m/s²。在八个独立实验中,单个主导非线性项作为人类跟驰的稳健骨干出现;残差引导的扩展进一步将所选结构与已建立的碰撞避免心理物理理论联系起来。发现的特征算子在不同高速公路地点之间零样本迁移,R²损失低于3个百分点。扩展到多项logit框架内的换道建模,NOVA在502个未见驾驶员的严格车辆ID留出测试中实现了67.4%的平衡准确率,在三类问题上超过现有换道基线+29.8个百分点。
We present NOVA, an autonomous symbolic regression framework that identifies interpretable car-following and lane-change structures from raw trajectory data with minimal behavioral priors. Applied to 4,765,788 active driving observations from the NGSIM I-80 and US-101 datasets, NOVA's deterministic Rust-powered search engine evaluates over 10,000 candidate algebraic structures and identifies a compact two-term acceleration model under a forward-shifted rolling-mean prediction target. Evaluated under two complementary preprocessing pipelines, NOVA achieves $RMSE = 1.376 m/s^2$ ($R^2 = 15.57\%$) on the intent-forecasting benchmark, outperforming the best recalibrated symbolic-regression baseline (SR-LLM, PNAS~2025) by 0.135 m/s$^2$ in RMSE under an identical evaluation protocol. Across eight independent experiments, a single dominant nonlinear term emerges as a robust backbone of human car-following; a residual-guided extension further links the selected structure to an established psychophysical theory of collision avoidance. The discovered feature operators transfer zero-shot between freeway sites with under 3 pp $R^2$ loss. Extended to lane-change modelling within a multinomial logit framework, NOVA achieves 67.4\% balanced accuracy under strict vehicle-ID holdout on 502 unseen drivers, surpassing existing lane-changing baselines by +29.8 percentage points on a three-class problem.
局部驯化随机梯度朗之万动力学的确定性分母设计
Yiwei Zhou, Ziheng Chen
AI总结 针对驯化随机梯度朗之万动力学中分母设计问题,提出基于代理分数和分位数的确定性分母方法,避免随机分母的均值偏移,实验表明其性能接近理想情况。
驯化随机梯度朗之万动力学通过向更新中添加分母来稳定大漂移。如果该分母使用与更新步骤相同的随机梯度样本,它也会改变条件均值漂移。我们研究确定性分母:状态依赖的包络在抽取当前预言机样本之前固定。主要问题是如何在实践中设计这个包络。设计从预言机分数开始,在试点状态上构建低成本代理分数,通过经验分位数选择激活阈值,然后应用一个小校准层。分析跟踪三个步骤:代理和阈值误差变为包络误差;包络误差扰动一个SGLD步骤;局部残差通过条件扰动桥给出平稳误差。实验表明,代理分位数分母接近预言机分数行为,避免了随机分母均值偏移通道,并改进了简单的确定性驯化选择。
Tamed stochastic-gradient Langevin dynamics (SGLD) stabilizes large drifts by adding a denominator to the update. If this denominator uses the same stochastic-gradient sample as the update step, it can also change the conditional mean drift. We study deterministic denominators: the state-dependent envelope is fixed before the current oracle sample is drawn. The main question is how to design this envelope in practice. The design starts from an oracle score, builds a low-cost proxy score on pilot states, chooses activation thresholds by empirical quantiles, and then applies a small calibration layer. The analysis tracks three steps: proxy and threshold errors become envelope errors; envelope errors perturb one SGLD step; and the local residuals give stationary errors through a conditional perturbation bridge. Experiments show that the proxy-quantile denominators are close to oracle-score behavior, avoid the random-denominator mean-shift channel, and improve simple deterministic taming choices.
共线速度多维耦合一阶双曲型PDE的反步控制
Mohamed Camil Belhadjoudja
AI总结 针对共线速度场的多维耦合一阶双曲系统,通过特征曲线变换将系统转化为连续的一维系统族,设计反步控制器实现有限时间镇定。
本文研究了耦合多维一阶双曲系统的反步边界镇定。我们考虑传输速度场共线的系统,即每个速度场是公共基速度场的标量倍数。基于最近为标量多维一阶双曲方程建立的框架,我们引入了一个基于完全在空间域中定义的特征曲线的变量变换,将原始多维系统转化为连续的一维一阶双曲系统族。通过为连续表示中的每个系统设计反步控制器,并假设特征曲线的传输时间一致有界,我们实现了多维系统的有限时间镇定。
This paper addresses the backstepping boundary stabilization of coupled multidimensional first-order hyperbolic systems. We consider systems whose transport velocity fields are collinear, meaning that each velocity field is a scalar multiple of a common base velocity field. Building upon a recent framework developed for scalar multidimensional first-order hyperbolic equations, we introduce a change of variables, based on characteristic curves defined entirely in the spatial domain, that converts the original multidimensional system into a continuum of coupled one-dimensional first-order hyperbolic systems. By designing a backstepping controller for each system in the continuum representation, and assuming that the transit times of the characteristic curves are uniformly bounded, we achieve finite-time stabilization of the multidimensional system.
量化命题演算与窄隐式证明
Pavel Pudlák, Neil Thapen
AI总结 研究隐式命题证明系统中窄证明的概念,通过切割消去构造证明G_{i+1}等价于窄隐式G_i,并建立G_1与隐式归结的等价性。
在命题证明系统Q的隐式版本中,我们处理的Q-证明不是直接写出的,而是由电路简洁编码的。因此隐式Q-证明可能比通常的Q-证明指数级更短。我们研究窄隐式证明,即该概念的一个受限版本,其中编码证明中的行只能具有多项式大小。我们使用切割消去构造证明,对于i >= 1,G_{i+1}等价于窄隐式G_i,其中G_i是Frege系统的扩展,允许使用Sigma^q_i量化命题公式进行推理。我们证明G_1等价于隐式归结。
In the implicit version of a propositional proof system Q, we work with Q-proofs that are not written down directly, but are succinctly encoded by circuits. Thus implicit Q-proofs are potentially exponentially shorter than usual Q-proofs. We study narrow implicit proofs, a restricted version of this notion, in which lines in the encoded proof can only have polynomial size. We use a cut-elimination construction to show that G_{i+1} is equivalent to narrow implicit G_i, for i >= 1, where G_i is the extension of Frege allowing reasoning with Sigma^q_i quantified propositional formulas. We show that G_1 is equivalent to implicit resolution.
最小自由能随机化设计以改善协变量平衡
Haolin Chen, Jun Yu
AI总结 提出最小自由能随机化设计,通过平衡协变量与最大化熵的权衡,结合高效动态分配算法,提升统计效率与鲁棒性。
“分块你能分的,随机化你不能分的”是随机对照试验中处理效应估计的核心原则。尽管已经开发了丰富的分配策略,但分块实现的协变量平衡与随机化保证的鲁棒性之间的明确权衡很少被量化。受热力学第二定律的启发,本文提出一个新准则,即降低协变量不平衡的同时最大化量化对比和分配多样性的熵。由此推导出最优策略,称为最小自由能随机化设计,从而正式实现这种权衡。为了便于实际实施,我们进一步开发了一种计算高效的动态分配算法,并具有理论保证。通过有限样本方差分解,表明所提出的随机化策略能够控制协变量不平衡,同时防止未观测到的异质性主导均方误差,从而在规定的设计约束下保持极小极大效率。大量数值模拟表明,我们的方法比现有方法具有更优的统计效率和更强的鲁棒性。
``Block what you can and randomize what you cannot'' is the core principle for treatment effect estimation in randomized controlled trials. Although a wealth of allocation strategies has been developed, an explicit trade-off between the covariate balance achieved by blocking and the robustness guaranteed by randomization is seldom quantified. Motivated by the second law of thermodynamics, this work posits a new criterion that lowers the covariate imbalance while maximizing the entropy that quantifies contrast and allocation diversity. The resulting optimal strategy, termed the minimum free energy randomized design, is then derived, thereby formally achieving such a trade-off. To facilitate practical implementation, we further develop a computationally efficient dynamic allocation algorithm with theoretical guarantees. Using a finite-sample variance decomposition, the proposed randomization strategy is shown to control covariate imbalance while preventing unobserved heterogeneity from dominating the mean squared error, thus retaining minimax efficiency under the prescribed design constraints. Extensive numerical simulations demonstrate that our method achieves superior statistical efficiency and greater robustness than existing approaches.
无基线的神经组合优化策略优化
Carlos S. Sepúlveda, Gonzalo A. Ruz
AI总结 提出使用GRPO算法消除神经组合优化中的基线依赖,避免训练崩溃,在TSP和CVRP上达到接近POMO的性能。
神经组合优化(NCO)训练自回归策略以解决路由问题。标准训练算法REINFORCE使用滚动基线,需要维护并定期更新策略的冻结副本以降低方差。这种基线引入了一个结构脆弱性:在更难的问题实例上,较差的基线会产生噪声梯度估计,从而破坏训练稳定性。我们评估了来自大语言模型对齐的组相对策略优化(GRPO),该算法通过归一化组内采样轨迹的优势完全消除了基线。在RL4CO框架内对TSP和CVRP基准上的五种RL算法进行受控比较,我们发现:(i) GRPO避免了REINFORCE在TSP-100上观察到的训练崩溃,其中性能在预热阶段后立即从成本9.8下降到52.1,并且在延长训练下无法恢复;(ii) 在匹配的梯度更新次数下,GRPO达到了与POMO(一种基于AM的强多起点基线)在2%以内的解质量,同时无需外部基线;(iii) P3O,一种也来自对齐文献的成对偏好算法,在TSP上具有竞争力,但在CVRP上表现出更高的变异性。这些结果表明GRPO是一种有前途的无基线NCO替代方案,特别是在基线依赖训练变得脆弱的场景中。
Neural combinatorial optimization (NCO) trains autoregressive policies to solve routing problems. The standard training algorithm, REINFORCE with a rollout baseline, requires maintaining and periodically updating a frozen copy of the policy for variance reduction. This baseline introduces a structural vulnerability: on harder instances, a poor baseline produces noisy gradient estimates that can destabilize training. We evaluate Group Relative Policy Optimization (GRPO), an algorithm from large language model alignment that eliminates the baseline entirely by normalizing advantages within groups of sampled trajectories. In a controlled comparison of five RL algorithms on TSP and CVRP benchmarks within the RL4CO framework, we find that: (i) GRPO avoids the training collapse observed with REINFORCE on TSP-100, where performance degrades from cost 9.8 to 52.1 immediately after the warmup phase and does not recover under extended training; (ii) at matched gradient updates, GRPO achieves solution quality within 2% of POMO, a strong AM-based multi-start baseline, while requiring no external baseline; and (iii) P3O, a pairwise preference algorithm also from the alignment literature, is competitive on TSP but shows higher variability on CVRP. These results identify GRPO as a promising baseline-free alternative for NCO, particularly in settings where baseline-dependent training becomes fragile.
Gromov--Wasserstein空间中的$k$-最近邻
Kaitlyn Hohmeier, Nicolas Fraiman, Caroline Moosmueller
AI总结 本文在Gromov-Wasserstein距离框架下实现k-最近邻分类,证明了度量测度空间和图上分类器的普适一致性,并通过实验验证了其有效性。
Gromov--Wasserstein (GW) 距离为比较度量测度空间提供了一个框架,无论其底层结构或几何形状如何。对于基于网络的数据,它能够直接比较具有不同节点数量的图,无需嵌入或其他抽象。此外,通过GW的变体——融合Gromov--Wasserstein (fGW),还可以在图形结构之外结合节点特征。在这项工作中,我们使用GW和fGW距离实现了$k$-最近邻 ($k$-NN) 分类。我们证明了在具有有限支撑和均匀概率测度的度量测度空间等价类空间上,GW-$k$-NN分类器的普适一致性。通过将图视为具有成对距离度量和节点上均匀概率测度的有限支撑度量测度空间,我们获得了图空间上GW-$k$-NN的普适一致性。类似地,对于fGW-$k$-NN,我们证明了在由具有有限支撑和均匀概率测度的度量测度空间以及到欧几里得空间的特征映射组成的结构化对象的弱同构类空间上的普适一致性,从而建立了节点属性图空间上的普适一致性。我们的数值实验表明,GW-$k$-NN和fGW-$k$-NN在多个图数据集上始终表现良好,这表明诸如$k$-NN之类的度量分类器在GW框架中效果良好。
The Gromov--Wasserstein (GW) distance provides a framework for comparing metric measure spaces, regardless of their underlying structure or geometry. For network-based data, it enables direct comparisons of graphs with different numbers of nodes, without requiring an embedding or other abstraction. Furthermore, through a variant of GW known as fused Gromov--Wasserstein (fGW), it is also possible to incorporate node features in addition to graph structure. In this work, we implement $k$-nearest neighbors ($k$-NN) classification using the GW and fGW distances. We prove the universal consistency of the GW-$k$-NN classifier on the space of equivalence classes of metric measure spaces with finite support and uniform probability measure. By viewing graphs as finitely supported metric measure spaces equipped with the pairwise distance metric and a uniform probability measure on the nodes, we obtain universal consistency of GW-$k$-NN for the space of graphs. Likewise for fGW-$k$-NN, we prove universal consistency on the space of weak isomorphism classes of structured objects consisting of metric measure spaces with finite support and uniform probability measure and feature maps into Euclidean space, thus establishing universal consistency on the space of node-attributed graphs. Our numerical experiments show that GW-$k$-NN and fGW-$k$-NN consistently perform well across multiple graph datasets, suggesting that metric classifiers such as $k$-NN work well in the GW framework.
通过张量符号改进矩阵李群运算的表示
Clark Taylor
AI总结 本文引入张量和爱因斯坦求和符号来简化矩阵李群在李导数计算中的表示,提高估计框架中梯度计算的清晰度。
近期的几篇论文展示了在估计问题中使用李群的实用性,提高了准确性和一致性。本文介绍了一种描述矩阵李群运算的新工具:张量和爱因斯坦求和符号。虽然张量和爱因斯坦符号在其他研究领域广为人知,但应用这种数学符号来表示和计算矩阵李导数却是新颖的。更重要的是,这种新符号极大地澄清了在(基于梯度的)估计框架中处理矩阵李群所需的导数和运算。因此,本文的主要贡献不是一种新能力,而是一种用于处理矩阵李群的更清晰的数学符号。
Several recent papers have demonstrated the utility of using Lie groups within estimation problems, yielding improved accuracy and consistency. This paper introduces a new tool for describing operations with matrix Lie groups: tensors and the Einstein summation notation. While tensors and Einstein notation are well-known in other research fields, applying this mathematical notation to represent and compute matrix Lie derivatives is novel. More importantly, this new notation greatly clarifies the derivatives and operations necessary to work with matrix Lie Groups in (gradient-based) estimation frameworks. Therefore, the main contribution of this paper is not a new capability, but a more perspicuous mathematical notation for working with matrix Lie groups.
当前状态数据下神经网络估计的收敛速度
Yuan Wu, Tianhui Zhou
AI总结 针对当前状态数据,提出非参数神经网络筛最大似然估计器,结合ReLU网络逼近理论与经验过程论证,在Hölder光滑假设下建立显式收敛速度。
当前状态数据出现在事件时间仅通过一个指示变量(是否在检查时间之前发生)被观测到时。本文研究了事件时间条件累积分布函数的非参数神经网络筛最大似然估计器。在Hölder光滑假设下,我们通过结合整流线性单元神经网络的逼近理论与经验过程论证,建立了显式收敛速度。这一结果为当前状态观测下的神经网络估计及后续推断提供了理论支持。
Current-status data arise when an event time is observed only through an indicator of whether it occurred before an examination time. This paper studies a nonparametric neural-network sieve maximum likelihood estimator of the conditional cumulative distribution function of the event time. Under Hölder smoothness assumptions, we establish an explicit convergence rate by combining approximation theory for rectified linear unit neural networks with empirical-process arguments. This result provides theoretical support for neural-network estimation and subsequent inference under current-status observation.
流式数据的结构化自适应张量预测
Zhen Qin, Yang Chen
AI总结 针对矩阵值时间序列的流式预测,提出自适应张量回归框架,包含矩阵-矩阵和张量-矩阵两种形式,并开发在线SGD算法,张量-矩阵模型在稳态误差和去噪方面更优,同时建立了低维结构下的恢复保证。
矩阵值时间序列出现在广泛的应用中,例如来自医学成像和地球物理学的时空数据。现有方法主要针对静态环境设计,缺乏对流式和时变环境的适应性。自适应滤波技术也大多局限于标量或向量值数据,使得矩阵值时间序列的自适应预测理解不足。为弥补这些差距,我们开发了一个自适应张量回归框架,包括矩阵-矩阵(MoM)和张量-矩阵(ToM)两种形式,用于流式矩阵值预测。这两种形式的区别在于是否直接建模矩阵值输出,或通过高阶张量表示利用时间结构。针对所提出的张量回归框架,我们开发了用于在线学习的随机梯度下降(SGD)算法。我们表明,将多个响应随时间堆叠成高阶张量可以提高性能;特别是,ToM比MoM实现了更低的稳态误差和更强的去噪能力,这促使我们关注ToM模型。我们进一步刻画了SGD在时变动态下的跟踪行为。从统计角度,我们建立了ToM在一般低维结构(包括稀疏性、低秩性及其联合稀疏低秩模型)下的固定时间恢复保证。
Matrix-valued time series arise in a wide range of applications, such as spatio-temporal data from medical imaging and geophysics. Existing methods are mainly designed for static settings and lack adaptability to streaming and time-varying environments. Adaptive filtering techniques have also been largely limited to data with scalar or vector values, leaving adaptive forecasting for matrix-valued time series inadequately understood. To bridge these gaps, we develop an adaptive tensor regression framework that includes Matrix-on-Matrix (MoM) and Tensor-on-Matrix (ToM) formulations for streaming matrix-valued prediction. The two formulations differ in whether to directly model matrix-valued outputs or to exploit temporal structure via higher-order tensor representations. For the proposed tensor regression framework, we develop stochastic gradient descent (SGD) algorithms for online learning. We show that stacking multiple responses across time into higher-order tensors improves performance; in particular, the ToM achieves lower steady-state error and stronger denoising capability than MoM, motivating our focus on the ToM model. We further characterize the tracking behavior of SGD under time-varying dynamics. From a statistical perspective, we establish fixed-time recovery guarantees for ToM under general low-dimensional structures, including sparsity, low-rankness, and their joint sparselow-rank models.
一种用于Lyapunov函数发现的约束符号回归方法
Ilias Mitrai, Wentao Tang
AI总结 提出一种约束自监督符号回归方法,通过表达式树表示Lyapunov函数并施加稳定性条件,无需先验假设即可发现自治动力系统的Lyapunov函数,并设计了分支定界检查算法高效求解。
本文考虑自治动力系统Lyapunov函数的数据驱动发现。我们将Lyapunov函数表示为固定深度的表达式树,并将Lyapunov发现任务表述为约束自监督符号回归问题。约束条件对给定输入下Lyapunov函数的输出以及Lyapunov稳定性条件进行建模。这种建模方法不对Lyapunov函数的函数形式做任何先验假设,由于函数以符号形式获得,因此本质上是可解释的,并且原则上可以应用于任何连续动力系统。我们还开发了一种定制的分支定界检查求解方法,以有效解决由此产生的学习任务。对几个案例研究的应用表明了所提方法发现Lyapunov函数的能力。
In this paper, we consider the data-driven discovery of Lyapunov functions for autonomous dynamical systems. We represent the Lyapunov function as an expression tree of fixed depth and formulate the Lyapunov discovery task as a constrained self-supervised symbolic regression problem. The constraints model the output of the Lyapunov function for a given input as well as the Lyapunov stability conditions. This modeling approach makes no a priori assumptions about the functional form of the Lyapunov function, is inherently interpretable since the function is obtained in a symbolic form, and, in principle, can be applied to any continuous dynamical system. We also develop a tailored branch-and-bound-and-check solution approach to efficiently solve the resulting learning task. Applications to several case studies show the ability of the proposed approach to discover Lyapunov functions.
SynIB: 多模态学习中最大化协同的信息瓶颈
Konstantinos Kontras, Teodora Gagaleska, Thomas Strypsteen, Christos Chatzichristos, Matthew Blaschko, Maarten De Vos, Paul Pu Liang
AI总结 提出SynIB方法,通过信息瓶颈理论直接优化多模态协同,在训练中屏蔽单模态时惩罚高置信度,提升跨模态推理能力,在合成和真实任务上准确率提升达7.8%。
多模态学习的一个核心目标是捕捉协同:即仅通过联合使用多个模态才能获得的、且任何单一模态都无法提供的任务相关信息。虽然大多数方法通过更大或更复杂的融合模型在架构层面进行操作,但我们提出一个互补的轴:塑造训练目标本身。标准训练通常强调单模态或冗余信息,在需要跨模态推理的示例上表现不足。我们通过信息论形式化多模态协同,并引入协同信息瓶颈(SynIB),一个直接针对协同的可扩展目标。为了优先学习协同,SynIB激励模型从所有模态准确预测,同时当任何模态的信息被隐藏时惩罚置信度。除了标准任务损失外,模型每次运行时屏蔽一个模态进行前向传播,如果保持高置信度则受到惩罚,这表明依赖单模态线索而非跨模态交互。我们在两个场景中验证SynIB。在合成XOR任务中,真实协同已知,标准训练无法恢复协同而SynIB成功。在五个真实世界基准测试中,包括三个MultiBench情感任务、使用CLIP-ViT和DeBERTa骨干的Hateful Memes,以及我们引入的可控讽刺扩展CREMA-D,SynIB在依赖协同的示例上准确率提升高达7.8%,总体准确率提升高达3.8%。
A central objective in multimodal learning is to capture synergy: task-relevant information that arises only from the joint use of multiple modalities, and is not available from any single modality alone. While most approaches operate at the architectural level through larger or more complex fusion models, we propose a complementary axis: shaping the training objective itself. Standard training often emphasizes unimodal or redundant information, falling short on examples that require cross-modal reasoning. We formalize multimodal synergy through information theory and introduce the Synergistic Information Bottleneck (SynIB), a scalable objective that targets synergy directly. To prioritize learning synergy, SynIB motivates the model to predict accurately from all modalities while penalizing confidence when information from any modality is withheld. Alongside the standard task loss, the model runs forward passes with one modality masked at a time and is penalized for remaining confident, which would indicate reliance on unimodal cues rather than cross-modal interactions. We validate SynIB in two regimes. On synthetic XOR tasks where the ground-truth synergy is known by construction, standard training fails to recover it while SynIB does. On five real-world benchmarks, including three MultiBench affective tasks, Hateful Memes with CLIP-ViT and DeBERTa backbones, and a controllable irony extension of CREMA-D we introduce, SynIB improves accuracy on synergy-dependent examples by up to 7.8% and overall accuracy by up to 3.8%.
通过 $BV$ 函数实现边界可整流性与积分流的紧性
Giacomo Del Nin
AI总结 利用 De Giorgi 的整数值 $BV$ 函数结构定理和圆柱投影论证,证明了有限质量且边界也有限质量的整数可整流流是积分的,并给出了积分流紧性的新证明。
我们给出了一个新的证明:一个具有有限质量且其边界也具有有限质量的整数可整流流是积分的。我们从 De Giorgi 的整数值 $BV$ 函数结构定理和圆柱投影论证推导出该结果。作为推论,我们还给出了积分流紧性的一个新证明,该证明最终基于 $BV$ 理论。
We present a new proof that an integer rectifiable current with finite mass, and whose boundary has also finite mass, is integral. We deduce the result from De Giorgi's structure theorem for integer-valued $BV$ functions and a cylindrical projection argument. As a consequence, we also give a new proof of the compactness of integral currents that is ultimately based on the $BV$ theory.
非参数黎曼经验贝叶斯与流形上的测量去噪
Adam Quinn Jaffe, Leonardo V. Santoro, Bodhisattva Sen
AI总结 针对流形上潜变量与测量值的去噪问题,提出基于Tweedie-Eddington公式的切向贝叶斯去噪器,利用拉普拉斯-贝尔特拉米算子实现数据驱动近似,并证明其在低噪声下接近贝叶斯风险,但收敛速率慢于欧氏情形。
我们启动了在紧黎曼流形上潜变量及其测量值均位于流形上、似然为黎曼高斯分布的非参数经验贝叶斯去噪方法研究。起点是黎曼高斯混合模型的一个新颖的Tweedie-Eddington公式,该公式通过测量的边际分布识别出某个替代神谕去噪器;它通过一阶近似避免了显式计算后验弗雷歇均值(贝叶斯去噪器所需),因此我们称之为“切向”贝叶斯去噪器。我们证明该替代神谕在低噪声条件下几乎达到贝叶斯风险,利用拉普拉斯-贝尔特拉米算子的谱理论构建其完全数据驱动的近似,并建立替代神谕与其近似之间距离的有限样本收敛速率。与欧氏情形中近乎参数的速率相比,黎曼情形中的速率较慢,这是由于黎曼高斯密度在其弗雷歇均值的割迹处存在奇异性;在圆环的特殊情形下,我们建立了匹配的下界,表明所提出的去噪器是极小化最优的,并且去噪问题呈现出真正的非参数收敛速率。最后,我们将方法应用于两个科学问题:天文学中球面值伽马射线暴位置去噪,以及结构生物学中环面值蛋白质相邻氨基酸扭转角对(即拉马钱德兰图)去噪。
We initiate the study of nonparametric empirical Bayes denoising methods in the setting where both the latent variables and their measurements lie on a compact Riemannian manifold, and where the likelihood is a Riemannian Gaussian distribution. Our starting point is a novel Tweedie-Eddington formula for Riemannian Gaussian mixture models which identifies a certain surrogate oracle denoiser in terms of the marginal distribution of the measurements; it avoids the explicit computation of the posterior Fréchet mean (as required by the Bayes denoiser) via a first-order approximation, hence we refer to it as the "tangential" Bayes denoiser. We show that this surrogate oracle achieves nearly the Bayes risk in a low-noise regime, we construct a fully data-driven approximation of it using the spectral theory of the Laplace-Beltrami operator, and we establish finite-sample rates of convergence for the distance between the the surrogate oracle and its approximation. Contrasting the nearly-parametric rates from the Euclidean setting, the rates in the Riemannian setting are slower due to the singularities of the Riemannian Gaussian density at the cut locus of its Fréchet mean; in the special case of the circle we establish matching lower bounds which show that our proposed denoiser is minimax-optimal, and that the denoising problem exhibits a genuinely nonparametric rate of convergence. Lastly, we implement our methodology in two scientific applications: in astronomy, the sphere-valued problem of denoising the locations of gamma ray bursts; in structural biology, the torus-valued problem of denoising pairs of torsion angles of adjacent amino acids in a protein (i.e., the Ramachandran plot).
等变接触达布商与反常范畴化勒让德对应
Efe İzbudak
AI总结 本文建立了-1-移位接触导出阿廷栈的等变达布定理,并通过导出辛化与代数下降构造了ℓ-adic反常层,提取了导出枚举不变量,同时提出了接触版本的Joyce猜想并构造了范畴化勒让德2-范畴。
先前的工作表明,移位接触导出阿廷栈具有光滑的达布图册。然而,建立枚举不变量和线性化这些范畴结构需要等变局部模型。我们建立了$-1$-移位接触导出阿廷栈的等变达布定理。我们证明,在光滑拓扑下,这些栈通过由相对截面$s$的导出判别式轨迹关联的导出接触达布概形$\Delta\mathrm{loc}(s)$承认光滑图册。在存在约化稳定化子$G$时,这细化为等变几何商栈$[\Delta\mathrm{loc}(s)/G]$。通过将BBDJS最小模型应用于导出辛化并沿结构自由$\mathbb{G}_m$-作用代数下降,我们在任何有向的$-1$-移位接触栈上构造了一个$\ell$-adic反常层。我们利用Verdier的关于单子层的特殊化等价,为该反常层配备了一个驯服几何单值自同构$T$。该结构允许通过$\ell$-adic Grothendieck-Lefschetz迹提取导出枚举不变量,从而解决了通用拓扑非循环性问题。本文其他主要结果的内容依赖于先前的工作,其中我们证明了$n$-移位勒让德子的导出交产生$(n-1)$-移位接触栈,并制定了勒让德子的非线性2-范畴$\mathcal{F}_c(X)$和$Leg_n$。利用这一几何设置,我们在本文中提出了一个接触版本的Joyce猜想以线性化这些结构。然后,我们通过$\ell$-adic Fourier-Mukai拉回-推出函子构造了范畴化勒让德2-范畴$\mathfrak{L}\mathcal{F}c(X)$和$LLeg_0$,将导出接触模空间的研究与微局部层理论联系起来。
Prior work has shown that shifted contact derived Artin stacks admit smooth Darboux atlases. However, establishing enumerative invariants and linearizing these categorical structures requires equivariant local models. We establish an equivariant Darboux theorem for $-1$-shifted contact derived Artin stacks. We prove that, in the smooth topology, these stacks admit smooth atlases by the derived contact Darboux scheme $Δ\mathrm{loc}(s)$ associated to the derived discriminant locus of a relative section $s$. In the presence of reductive stabilizers $G$, this refines to the equivariant geometric quotient stack $[Δ\mathrm{loc}(s)/G]$. By applying the BBDJS minimal model to the derived symplectification and descending algebraically along the structural free $\mathbb{G}_m$-action, we construct an $\ell$-adic perverse sheaf on any oriented $-1$-shifted contact stack. We utilize Verdier's specialization equivalence for monodromic sheaves to equip this perverse sheaf with a tame geometric monodromy automorphism $T$. This structure allows for the extraction of derived enumerative invariants via the $\ell$-adic Grothendieck-Lefschetz trace, thereby resolving the issue of generic topological acyclicity. The content of the other main results in this paper relies on a prior work, in which we have shown that derived intersections of $n$-shifted Legendrians yield $(n-1)$-shifted contact stacks and formulated the non-linear 2-categories of Legendrians $\mathcal{F}_c(X)$ and $Leg_n$. Using this geometric setup, we formulate in this paper a contact analogue of Joyce's conjecture to linearize these structures. We then construct the categorified Legendrian 2-categories $\mathfrak{L}\mathcal{F}c(X)$ and $LLeg_0$ via $\ell$-adic Fourier-Mukai pull-push functors, connecting the study of derived contact moduli spaces to microlocal sheaf theory.
Nash-Williams猜想的证明
Michelle Delcourt, Luke Postle
AI总结 本文完全证明了Nash-Williams猜想:每个顶点数足够大、最小度至少0.75n的三角形可分割图都有三角形分解。证明分三部分:分数Nash-Williams猜想、分数稳定性定理和完整猜想。
极值设计理论中的一个核心开放问题是1970年提出的Nash-Williams猜想:每个顶点数为$n$($n$足够大)且最小度至少为$0.75 n$的三角形可分割图都有三角形分解。本文完整证明了该猜想。2016年,Barber、Kühn、Lo和Osthus证明,如果Nash-Williams猜想的分数松弛对于某个常数$c\ge 0.75$的最小度$cn$成立,那么Nash-Williams猜想对于任何常数$c' > c$成立。此前分数松弛的最佳界是Delcourt和Postle在2021年得到的$c= \frac{7+\sqrt{21}}{14} \approx 0.82733$。由于该分数松弛的界直接关系到极值设计理论中许多其他问题的界,其重要性逐年增长。本文由三部分组成。第一部分,我们的第一个主要结果是分数Nash-Williams猜想的证明:如果$G$是顶点数为$n$且最小度至少为$\frac{3n}{4}$的图,那么$G$有分数三角形分解。第二部分,我们的第二个主要结果是Nash-Williams猜想的分数稳定性定理:如果顶点数为$n$的图$G$的最小度接近$\frac{3n}{4}$但没有分数$K_3$-分解,那么$G$在编辑距离上接近两个$\frac{n}{4}$-正则图(每个图有$\frac{n}{2}$个顶点)的并。我们利用这一点证明,如果顶点数为$n$的三角形可分割图$G$的最小度接近$\frac{3n}{4}$但没有$K_3$-分解,那么$G$在编辑距离上接近两个$\frac{n}{4}$-正则图(每个图有$\frac{n}{2}$个顶点)的并。第三部分,我们的最终主要结果是Nash-Williams猜想的完整证明。
A central open question in extremal design theory is Nash-Williams' Conjecture from 1970 that every triangle-divisible graph on $n$ vertices (for $n$ large enough) with minimum degree at least $0.75 n$ has a triangle decomposition. In this paper, we prove this conjecture in full. In 2016, Barber, Kühn, Lo, and Osthus proved that if the fractional relaxation of Nash-Williams' Conjecture holds for minimum degree $cn$ for some constant $c\ge 0.75$, then Nash-Williams' Conjecture holds for any constant $c' > c$. The previously best-known bound on the fractional relaxation was due to Delcourt and Postle from 2021 with $c= \frac{7+\sqrt{21}}{14} \approx 0.82733$. This bound on the fractional relaxation has grown in importance over the years as it has been directly tied to bounds for a number of other problems in extremal design theory. This paper consists of three parts. In Part I, our first main result is a proof of the Fractional Nash-Williams' Conjecture: if $G$ is a graph on $n$ vertices with minimum degree at least $\frac{3n}{4}$, then $G$ has a fractional triangle decomposition. In Part II, our second main result is a Fractional Stability Theorem for Nash-Williams' Conjecture: if a graph $G$ on $n$ vertices has minimum degree close to $\frac{3n}{4}$ but no fractional $K_3$-decomposition, then $G$ is close (in edit distance) to the join of two $\frac{n}{4}$-regular graphs each on $\frac{n}{2}$ vertices. We use this to prove that if a triangle-divisible graph $G$ on $n$ vertices has minimum degree close to $\frac{3n}{4}$ but no $K_3$-decomposition, then $G$ is close (in edit distance) to the join of two $\frac{n}{4}$-regular graphs each on $\frac{n}{2}$ vertices. In Part III, our final main result is a proof of Nash-Williams' Conjecture in full.
$R(C_k,H)$ 的一个一般上界
Stijn Cambie, Andrea Freschi
AI总结 本文证明了对任意 $k$ 和任意无孤立顶点的图 $H$(有 $m$ 条边),Ramsey 数 $R(C_k,H)$ 不超过 $(k-1)m+1\le km$,解决了 Erdős 等人提出的一个图论问题。
在本文中,我们证明了对任意 $k$ 和任意无孤立顶点的图 $H$(有 $m$ 条边),Ramsey 数 $R(C_k,H)$ 至多为 $(k-1)m+1\le km$。这解决了 Erdős、Faudree、Rousseau 和 Schelp 提出的一个问题,该问题在图论文集中列为问题 34。
In this paper, we prove that for every $k$ and every graph $H$ with $m$ edges and no isolated vertices, the Ramsey number $R(C_k,H)$ is at most $(k-1)m+1\le km$. This settles a problem of Erdős, Faudree, Rousseau and Schelp, which is listed as problem 34 in the graph theory collection.
Sasakian流形与spin-c Killing旋量
Alejandro Gil-García, C. S. Shahbazi
AI总结 利用复旋量形式理论,证明奇数维黎曼流形在纯性条件下允许具有实Killing常数的纯spin-c Killing旋量当且仅当它是α-Sasakian的,推广了Moroianu的结果,无需单连通或完备性假设。
利用复旋量形式理论,我们证明了一个奇数维黎曼流形允许一个具有实Killing常数$\alpha\in\mathbb{R}^{\ast}$的纯spin-c Killing旋量当且仅当它是$\alpha$-Sasakian的,从而在纯性假设下得到了A. Moroianu的一个著名结果的推广,该推广不需要单连通性或完备性。
Using the theory of complex spinorial forms, we prove that an odd-dimensional Riemannian manifold admits a pure spin-c Killing spinor with a real Killing constant $α\in\mathbb{R}^{\ast}$ if and only if it is $α$-Sasakian, thereby obtaining an extension of a well-known result by A. Moroianu that, under the purity assumption, does not require simple connectivity or completeness.
无压欧拉对齐系统的单向熵解
Joshua O. Adeleke, Trevor M. Leslie
AI总结 研究单向速度的无压欧拉对齐系统,通过重写为标量平衡律族,在Lipschitz通信协议下证明解的存在性、唯一性和稳定性,并展示单向几何允许在通信协议沿流向消失时仍出现群集。
我们研究具有单向速度 (u,0,...,0) 的无压欧拉对齐系统。通过将系统重写为耦合标量平衡律族(每个对应 R^d 的一个水平切片),我们能够在有界 Lipschitz 通信协议的假设下,证明单向解类中的存在性、唯一性和稳定性。水平切片之间的非局部耦合为系统提供了在一维设置中缺失的丰富结构,这也是本工作的主要技术难点。我们将解构造为粘性粒子 Cucker-Smale 动力学的极限,首先横向于流离散化,然后沿流离散化。横向离散化关键依赖于我们两个互补稳定性估计中更微妙的一个,该估计仅依赖于通量差的 L^1-L^infty 控制。这个低正则性估计是必要的,因为我们的离散通量通常不能期望在(例如)Lipschitz 半范数下收敛。该估计本身受 Bouchut 和 Perthame 工作的启发,并通过仔细附加的非局部分析进行改编。为了比较不同水平切片上的动力学,两个稳定性估计最自然地用涉及两个密度剖面在 R^{d-1} 上的投影之间的最优耦合的量来表述。我们还研究了单向解的长时间行为。除了处理标准重尾区域,我们做出一个简单观察:即使通信协议在平行于流的轴的圆柱邻域内消失,单向几何也允许群集(速率与智能体数量无关)。这表明沿运动方向的直接通信对于群集的发生并非必要。
We study the pressureless Euler Alignment system with unidirectional velocity (u,0,...,0). By re-casting the system as a family of coupled scalar balance laws, one for each horizontal slice of R^d, we are able to prove existence, uniqueness, and stability within the class of unidirectional solutions, under the assumption of a bounded Lipschitz communication protocol. The nonlocal coupling between horizontal slices provides the system with a rich structure that is absent from the 1D setting and also constitutes the main technical difficulty of the present work. We construct our solutions as limits of sticky particle Cucker-Smale dynamics, discretizing first transverse to the flow and then along it. The transverse discretization depends crucially on the more subtle of our two complementary stability estimates, which relies only on L^1-L^infty control of the flux difference. This low-regularity estimate is essential since our discretized fluxes cannot in general be expected to converge in (for instance) Lipschitz seminorm. The estimate itself is inspired by work of Bouchut and Perthame and adapted here through a careful additional analysis of the nonlocality. In order to compare the dynamics along different horizontal slices, both stability estimates are most naturally formulated in terms of quantities involving the optimal coupling between the projections of two density profiles onto R^{d-1}. We also investigate the long-time behavior of unidirectional solutions. In addition to treating the standard heavy-tailed regime, we make the simple observation that the unidirectional geometry allows for flocking (with a rate independent of the number of agents) even when the communication protocol vanishes in a cylindrical neighborhood of the axis parallel to the flow. This demonstrates that direct communication along the direction of motion is not necessary for flocking to occur.
双双重叠分支循环的相关细化
Thomas Blomme, Francesca Carocci, Ajith Urundolil Kumaran
AI总结 本文通过根部的Weil配对,对双重重叠分支循环进行了细化,证明了细化类满足多重覆盖公式,并应用于环面曲面的对数Gromov-Witten不变量,得到了N. Takahashi猜想的变体。
给定一族半稳定曲线以及两个度数为0的线丛,双重重叠分支循环度量了两个线丛在纤维上都是平凡的位置。当两个线丛配备有自然根时,我们利用根部的Weil配对提供了DDR-类的一个细化。我们证明了细化类满足一个多重覆盖公式,类似于[BC25b]中证明的椭圆曲线上射影丛的相关不变量的公式。作为推论,我们证明环面曲面的对数GW不变量可以通过考虑映射到环面边界的点的位置进行细化,并且这些细化不变量也满足多重覆盖公式;后者是对于相对于光滑椭圆曲线E的P2的亏格0最大接触曲线的N. Takahashi猜想的一个变体。
Given a family of semi-stable curves together with two degree 0 line bundles, the double double ramification cycle measures the locus where both line bundles are trivial on the fibers. When the two line bundles come equipped with natural roots, we provide a refinement of the DDR-class using the Weil pairing of the roots. We prove that the refined classes satisfy a multiple cover formula analogous to the one for correlated invariants of projective bundles on elliptic curves proved in [BC25b]. As a consequence, we prove that log-GW invariants of toric surfaces can be refined taking into account the position of the points mapped to the toric boundary, and that these refined invariants also satisfy a multiple cover formula; the latter is as a variation of the N. Takahashi conjecture for genus zero maximal contact curves for P2 relative a smooth elliptic curve E.