arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13634 2026-06-12 cs.CL math.CT 新提交

Operads for compositional reasoning in LLMs

用于LLM组合推理的Operad框架

Nathaniel Bottman, Kyle Richardson

AI总结 提出operad作为问题分解的数学框架,定义问题operad Q,将QA模型解释为Q上的代数,并引入operadic一致性度量,实验表明该度量与准确性强相关。

详情
AI中文摘要

问题分解,即将复杂查询分解为更简单的子查询,并将子查询的答案组合成最终答案,是提高LLM推理能力的常用策略,但目前缺乏严格的数学基础。本文提出operad(一种模拟多输入单输出操作及其组合的数学结构)作为描述问题分解的自然框架。我们定义了问题operad $Q$,其中操作对应问题模板,组合对应子答案的替换,并展示了QA模型如何被解释为$Q$上的代数。除了重新诠释现有实践,这一operad视角还指向了新方法,特别是operadic一致性概念,它衡量QA模型的答案在问题分解树的部分折叠上是否一致。关于operadic一致性的实证评估见我们的姊妹论文(Bottman, Liu, and Richardson, 2026),该论文发现它在12个LLM和4个多跳QA数据集上与准确性强相关,且优于基于温度的标准自一致性基线。我们认为operad是问题分解的自然数学框架,而诸如operadic一致性等不变量为分析和改进多步推理的可靠性开辟了新方向。

英文摘要

Question decomposition, i.e. breaking a complex query into simpler sub-queries whose answers are composed to produce a final answer, is a widely used strategy for improving LLM reasoning, yet it currently lacks a rigorous mathematical foundation. In this paper, we propose operads, mathematical structures that model many-in, one-out operations and compositions thereof, as a natural framework for describing question decomposition. We define the questions operad $Q$, in which operations correspond to question templates and composition corresponds to substitution of sub-answers, and show how QA models can be interpreted as algebras over $Q$. Beyond reframing existing practice, this operadic perspective points toward new methods, in particular a notion of operadic consistency, which measures whether a QA model's answers agree across the partial collapses of a question decomposition tree. Empirical evaluation of operadic consistency is reported in our companion paper (Bottman, Liu, and Richardson, 2026), which finds it strongly correlated with accuracy across twelve LLMs and four multi-hop QA datasets and outperforming standard temperature-based self-consistency baselines. We argue that operads are the natural mathematical home for question decomposition, and that invariants such as operadic consistency open new directions for analyzing and improving the reliability of multi-step reasoning.

2606.13614 2026-06-12 stat.ML cs.LG math.ST 新提交

Majority-of-Three is Optimal

三中多数是最优的

Divit Rawal, Nikita Zhivotovskiy

AI总结 本文通过简短证明,在可实现PAC学习框架下,三个独立一致分类器的多数投票是最优学习器,简化了投票学习器的算法结构和概率分析。

详情
Comments
9 pages
AI中文摘要

我们给出一个简短证明,表明在可实现PAC学习框架下,三个独立一致分类器的多数投票是最优学习器。这证明了最简单投票方案的最优性,同时简化了先前投票学习器的算法结构和概率分析,包括S. Hanneke的算法和K. Green Larsen对装袋的分析。

英文摘要

We give a short proof that the majority vote of three independent consistent classifiers is an optimal learner in the realizable PAC setting. This proves optimality for the simplest voting scheme, while simplifying both the algorithmic structure and the probabilistic analysis of previous voting learners, including the algorithm of S. Hanneke and the analysis of bagging by K. Green Larsen.

2606.13374 2026-06-12 cs.DC cs.DS math.PR 新提交

Temporal Conductance and Bounds on the Voter Model for Dynamic Networks

时间电导与动态网络上投票者模型的界

Tatiana Rocha Avila, Holger Dell, John Lapinskas

AI总结 引入时间电导度量动态网络连通性,证明投票者模型在动态网络上的共识时间上界为O(m/(d_min Φ)),并证明该界紧。

详情
AI中文摘要

投票者模型是一个经典的随机过程,模拟观点如何在网络中传播:每一步,每个节点懒惰地采纳随机邻居的观点;最终所有节点共享同一观点(共识)。更强的连通性应导致更快的共识。Berenbrink, Giakkoupis, Kermarrec 和 Mallmann-Trenn (ICALP 2016) 通过网络的电导精确化了这一点:如果网络有 $m$ 条边,最小度 $d_{\min}$,且电导至少为 $\phi$,则投票者模型在期望 $O(m/(d_{\min}\phi))$ 步内达成共识。他们的结果通过考虑每个时间步网络的电导,扩展到具有固定顶点度的动态网络。我们引入了时间电导 $\Phi$,一种更通用的动态网络连通性度量。与静态电导不同,静态电导在某个快照不连通时坍缩为 $0$,而 $\Phi$ 通过在不同时间出现的边捕捉连通性。我们将 Berenbrink 等人的结果从静态电导推广到时间电导,证明了标准投票者模型的期望共识时间至多为 $O(m/(d_{\min}\Phi))$。此外,我们证明这个界在常数因子内是紧的。我们期望时间电导成为分析时间网络上其他动力学以及更一般的时间非齐次马尔可夫链的有用工具。

英文摘要

The voter model is a classical stochastic process that models how opinions might spread through a network: at each step, every node lazily adopts the opinion of a random neighbour; eventually all nodes share the same opinion (consensus). Stronger connectivity should yield faster consensus. Berenbrink, Giakkoupis, Kermarrec, and Mallmann-Trenn (ICALP 2016) make this precise via the network's conductance: if the network has $m$ edges, minimum degree $d_{\min}$, and conductance at least $\phi$, then the voter model reaches consensus in expected $O(m/(d_{\min}\phi))$ steps. Their results extend to dynamic networks with fixed vertex degrees by considering the network's conductance at each time step. We introduce temporal conductance $\Phi$, a more general connectivity measure for dynamic networks. Unlike static conductance, which collapses to $0$ whenever some snapshot is disconnected, $\Phi$ captures connectivity through edges that appear at different times. We generalise the results of Berenbrink et al. from static conductance to temporal conductance, showing that the expected consensus time of the standard voter model is at most $O(m/(d_{\min}\Phi))$. Moreover, we prove that this bound is tight up to constant factors. We expect temporal conductance to be a useful primitive for analysing other dynamics on temporal networks, and potentially time-inhomogeneous Markov chains more generally.

2606.13308 2026-06-12 cs.CE math.NA 新提交

Subdivision-based isogeometric analysis for axisymmetric electromagnetic problems

基于细分的等几何分析用于轴对称电磁问题

Devin Balian, Sebastian Schöps, Melina Merkel

AI总结 提出基于细分的等几何方法求解轴对称Maxwell特征值问题,利用Catmull-Clark构造离散化,在TESLA 9-cell腔体上获得更平滑的数值解,收敛率与文献一致。

详情
AI中文摘要

本文应用基于细分的等几何方法求解轴对称Maxwell特征值问题。通过简化为$H^1$公式,允许对几何和场离散化使用Catmull-Clark构造。该方法得到电场数值解,除奇异顶点外处处$C^1$连续。通过计算TESLA 9-cell腔体的本征模式进行演示,显示比传统方法更平滑的场和更少的数值噪声。数值分析了该方法的收敛率,与文献中观察到的速率一致。

英文摘要

This paper applies a subdivision-based isogeometric method to solve the axisymmetric Maxwell eigenvalue problem. The reduction to an $H^1$-formulation allows to use a Catmull-Clark construction for both geometry and field discretization. The approach yields a numerical solution for the electric field, which is $C^1$-continuous everywhere except at extraordinary vertices. This is demonstrated by computing the eigenmodes of a TESLA 9-cell cavity, showing smoother fields with less numerical noise than conventional methods. The convergence rate of the method is numerically analyzed and is in agreement with rates observed in the literature.

2606.13287 2026-06-12 cs.LG cs.DC math.OC 新提交

Clipping Makes Distributed and Federated Asynchronous SGD Robust to Stragglers

裁剪使分布式和联邦异步SGD对掉队者具有鲁棒性

Samuel Erickson, Mikael Johansson

发表机构 * KTH Royal Institute of Technology(瑞典皇家理工学院)

AI总结 本文理论证明梯度裁剪能消除异步SGD中最大延迟对复杂度的影响,基于次Weibull梯度噪声模型,首次实现异步优化的高概率收敛。

详情
AI中文摘要

在现代机器学习中,训练的并行化是扩大规模的重要策略。异步随机梯度下降(ASGD)通过避免等待慢速工作节点来最大化可用硬件的利用率。然而,在恒定步长下,由于更新中的大延迟,慢速工作节点仍然会对ASGD的收敛产生负面影响。同时,在深度学习模型的异步训练中,经验观察到梯度裁剪能“稳定”训练。在这项工作中,我们为这一行为提供了理论依据,证明裁剪消除了最大延迟对预言复杂度的依赖。我们采用次Weibull梯度噪声模型,该模型将次高斯和次指数分布推广到更重尾的分布,受深度学习中的经验观察启发。我们证明了期望收敛,并且首次在异步优化中证明了高概率收敛。

英文摘要

In modern machine learning, parallelization of training is an important strategy for increasing scale. Asynchronous stochastic gradient descent (ASGD), which maximizes the utilization of available hardware by avoiding waiting for slow workers. However, with constant step sizes, the convergence of ASGD is nonetheless affected negatively by slow workers due to large delays in updates. At the same time, it has been empirically observed in asynchronous training of deep learning models that gradient clipping "stabilizes" training. In this work, we provide a theoretical justification for this behavior, as we show that clipping removes the dependence of the maximum delay in the oracle complexity. We employ a sub-Weibull model of gradient noise which generalizes sub-Gaussian and sub-exponential distributions to more heavy-tailed distributions, motivated by empirical observations in deep learning. We show convergence in expectation, and the first time in asynchronous optimization, convergence with high probability.

2606.13255 2026-06-12 cs.CE math.NA 新提交

Embedding-based Methods for Linear Solver Performance Prediction

基于嵌入方法的线性求解器性能预测

Hayden Liu Weng, Hans-Joachim Bungartz, Felix Dietrich

AI总结 提出模块化低成本的嵌入框架,通过解耦性能建模与特征表示,利用观测数据学习求解器-问题关系,在SuiteSparse矩阵集上实现17%的预测准确率提升。

详情
Comments
16 pages, 4 figures. Submitted to the 26th International Conference on Computational Science. This version includes a minor correction to the submitted manuscript, which does not result from the conference's peer review, and no changes resulting from the peer review process
AI中文摘要

大规模稀疏线性系统的求解往往主导科学应用的计算成本,并成为频繁的优化目标。现代库提供了大量求解器和预处理器配置,但其性能在不同问题实例间差异显著。先前工作已涉及最优求解器的选择,但通常受限于所处理的问题集(例如仅对称正定矩阵)、昂贵的矩阵特征使用或方法的复杂性。本文提出一种模块化、低成本的基于嵌入的求解器选择框架,将性能建模与特征表示及下游预测解耦。求解器-问题关系直接从观测性能数据中学习,同时使用廉价数值特征将未见问题投影到学习到的嵌入空间。该框架聚焦于多标签预测和基于用户中心指标(如MAPE和nDCG)的评估,这些指标能更好地反映相对性能。在SuiteSparse矩阵集合的621个矩阵和101个PETSc求解器配置上的实验表明,当包含昂贵的数值特征时,与经典基于特征的模型相比,顶部预测准确率提升17%,平均绝对百分比误差(MAPE)降低37%,顶部预测误差(1-error)降低46%。当限制为简化特征集时,嵌入方法仍具有竞争力,在广泛问题范围内持续实现约24%的MAPE和1-error降低。

英文摘要

The solution of large, sparse linear systems often dominates the computational effort of scientific applications and is a frequent optimization target. Modern libraries provide numerous solver and preconditioner configurations, but their performance varies significantly across problem instances. Previous works have addressed the selection of an optimal solver, but are typically limited by the problem set addressed (e.g., only symmetric positive definite matrices), the use of expensive matrix features, or the complexity of the approach. This work proposes a modular, low-cost embedding-based framework for solver selection that decouples performance modeling from feature representation and downstream prediction. Solver-problem relationships are learned directly from observed performance data, while inexpensive numerical features are used to project unseen problems into the learned embedding space. The framework focuses on multilabel prediction and evaluation using user-centric metrics, such as MAPE and nDCG, which better reflect relative performance. Experiments on 621 matrices from the SuiteSparse matrix collection across 101 PETSc solver configurations demonstrate a 17% increase in top-prediction accuracy over classical feature-based models when expensive numerical features are included, along with reductions of 37% in mean average percentage error (MAPE) and 46% in top-prediction error (1-error). When restricted to a reduced feature set, the embedding approach remains competitive, while still consistently achieving ca. 24% lower MAPE and 1-error across a broad range of problems.

2606.13234 2026-06-12 stat.CO math.NA math.ST 新提交

Switching Hamiltonian Monte Carlo for sampling from mixture distributions

切换哈密顿蒙特卡洛方法用于混合分布采样

A. Sharma

AI总结 提出切换哈密顿蒙特卡洛方法,结合对称数值积分器和泊松跳跃,实现有限混合玻尔兹曼-吉布斯分布的采样,并证明几何遍历性和二阶偏差。

详情
AI中文摘要

我们提出了一种切换哈密顿蒙特卡洛方法,用于从有限混合玻尔兹曼-吉布斯分布中采样。我们提出了对称数值积分器来近似与泊松跳跃交织的切换哈密顿动力学,其中状态切换链使用均匀化技术或随机模拟算法进行模拟。我们证明了所得马尔可夫链的几何遍历性。我们开发了一种基于与数值方案相关的离散泊松方程的方法,用于估计计算遍历平均值的误差。使用这种方法,我们证明了所提出的数值积分器具有二阶偏差。该方法简单且可推广到其他设置,例如动力学朗之万方程。最后,我们通过数值实验验证了收敛结果。

英文摘要

We introduce a switching Hamiltonian Monte Carlo method for sampling from finite mixture Boltzmann-Gibbs distributions. We propose symmetric numerical integrators to approximate switching Hamiltonian dynamics interlaced with Poisson jumps, where the regime-switching chain is simulated using the uniformization technique or the stochastic simulation algorithm. We prove geometric ergodicity of the resulting Markov chain. We develop an approach based on the discrete Poisson equation associated with numerical schemes to estimate the error in computing ergodic averages. Using this approach we prove that the proposed numerical integrators have second-order bias. This approach is simple and can be generalized to other settings, for example, kinetic Langevin equations. Finally, we verify the convergence result via numerical experiment.

2606.13159 2026-06-12 cs.DM math.CO math.DS 新提交

The Curious Case of Reversible Elementary Second Order Cellular Automaton 115

可逆基本二阶元胞自动机115的奇特案例

Enrico Formenti (Université Côte d'Azur, France), Supreeti Kamylia (Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, Ranchi, India)

AI总结 证明了可逆基本二阶元胞自动机规则115在有限初始配置下是周期的,并研究了具有有趣周期函数的有限配置族。

详情
Comments
In Proceedings GASCom 2026, arXiv:2606.09910
AI中文摘要

我们证明了可逆基本二阶元胞自动机规则115在有限初始配置下是周期的。我们还研究了一些具有有趣周期函数的有限配置族。

英文摘要

We prove that the reversible elementary second order cellular automaton rule 115 is periodic when started on finite initial configurations. We also study some families of finite configurations that have interesting period functions.

2606.13149 2026-06-12 cs.DM math.CO 新提交

Touchard-Riordan Polynomials and Schur-positivity of Set Partitions

Touchard-Riordan多项式与集合划分的Schur正性

Eli Bagno (Jerusalem College of Technology and Michlalah College Jerusalem, Israel), David Garber (Holon Institute of Technology, Israel)

AI总结 研究集合划分的对称函数的Schur正性,通过下降集函数关联Touchard-Riordan多项式,给出Schur展开系数。

详情
Comments
In Proceedings GASCom 2026, arXiv:2606.09910
AI中文摘要

如果一个对称函数在Schur基下的展开系数非负,则称其为Schur正的。本文研究自然与集合划分相关的对称函数的Schur正性,其中下降集函数将i视为下降,如果i和i+1在划分中属于同一块。Schur展开涉及钩形Young图,相应的系数由Touchard-Riordan多项式给出,该多项式通过交叉数枚举匹配。

英文摘要

A symmetric function is called Schur-positive if it admits an expansion in the Schur basis with nonnegative coefficients. In this paper, we study the Schur-positivity of symmetric functions naturally associated with set partitions, with respect to a descent set function that considers i as descent, if i and i+1 share a block in the partition. The Schur expansion involves hook-shaped Young diagrams, and the corresponding coefficients are given by Touchard-Riordan polynomials, which enumerate matchings by their number of crossings.

2606.13092 2026-06-12 cs.LG cs.RO math.DS 新提交

Scale Buys Interpolation, Structure Buys a Horizon: Certified Predictability for Equivariant World Models

规模买插值,结构买地平线:等变世界模型的认证可预测性

Hongbo Wang

AI总结 针对等变潜在世界模型,提出可计算的多步可预测地平线认证,证明T步滚动误差在对称轨道上恒定,并由李雅普诺夫谱分层界定,且该认证为等变模型独有。

详情
Comments
23 pages (9 main + appendices). Code: this https URL
AI中文摘要

规模买插值;结构买认证的地平线。世界模型的平均误差无法说明特定预测是否可信,或可信多久。对于等变潜在世界模型,我们给出可计算的多步可预测地平线认证:$T$步滚动误差在每个对称轨道上恒定(定理A),并由预测器的李雅普诺夫谱逐通道分层,$T_j(\epsilon)\sim\log(1/\epsilon)/\lambda_j$。地平线是双向的——匹配的下界使近似等变被证明受地平线限制——且该认证为结构独有:轨道恒定误差刻画等变性,因此任何非等变模型无论规模多大都不具备。实验上,在40维Lorenz-96上,只有$\mathbb{Z}_N$等变网络恢复完整李雅普诺夫谱($R^2=0.98$);密集和循环基线失败。由于谱是忠实的,认证先验地起作用:在固定感知预算下,$c$倍膨胀的认证需要$c$倍预算,且等变认证满足其膨胀密集对应物无法满足的预算——无需校准数据。相同的读出,未经修改,可无训练审计公开预训练世界模型:TD-MPC2检查点落在认证自身的范围分类上——在强膨胀处校准(比率0.94-1.02),在弱膨胀处乐观,在收缩处正确弃权——部署的监控器逐单元复制该映射,样本外。在官方1M-317M多任务阶梯上,校准不随参数增加。在V-JEPA 2-AC(1B,真实机器人数据)上,测量的交叉检查正确覆盖了过度承诺的切空间谱——交叉验证审计,而非原始数值,是可部署的对象。规模买插值,而非校准的地平线。

英文摘要

Scale buys interpolation; structure buys a certified horizon. A world model's average error says nothing about whether a particular prediction can be trusted, or for how long. For equivariant latent world models we give a computable, multi-step certificate of the predictable horizon: $T$-step rollout error is provably constant over each symmetry orbit (Theorem A) and stratified channel-by-channel by the predictor's Lyapunov spectrum, $T_j(\epsilon)\sim\log(1/\epsilon)/\lambda_j$. The horizon is two-sided -- a matching lower bound makes approximate equivariance provably horizon-limited -- and the certificate is exclusive to structure: orbit-constant error characterizes equivariance, so no non-equivariant model has it at any scale. Empirically, on 40-D Lorenz-96 only a $\mathbb{Z}_N$-equivariant network recovers the full Lyapunov spectrum ($R^2{=}0.98$); dense and recurrent baselines fail. Because the spectrum is faithful, the certificate acts, a priori: under a fixed sensing budget a $c\times$-inflated certificate provably needs $c\times$ the budget, and the equivariant certificate meets a budget its inflated dense counterpart cannot -- with zero calibration data. The same read-out, unchanged, audits public pretrained world models training-free: TD-MPC2 checkpoints land on the certificate's own scope taxonomy -- calibrated where strongly expansive (ratio 0.94-1.02), optimistic where weakly expansive, correctly abstaining where contracting -- a map a deployed monitor replicates cell-by-cell, out-of-sample. Across the official 1M-317M multitask ladder, calibration does not improve with parameters. On V-JEPA 2-AC (1B, real robot data) the measured cross-check correctly overrides an over-promising tangent spectrum -- the cross-validated audit, not the raw number, is the deployable object. Scale buys interpolation, not a calibrated horizon.

2606.13087 2026-06-12 eess.SY math.OC 新提交

Characterization and Computation of Feedback Nash Equilibria in Scalar Discounted N-Player Linear Quadratic Games

标量折扣N人线性二次博弈中反馈纳什均衡的表征与计算

Chiara Cavalagli, Alberto Bemporad, Mario Zanon

AI总结 研究标量折扣线性二次博弈中反馈纳什均衡,区分稳定与非稳定均衡,提出参数化表征和数值计算方法,并导出对称博弈的闭式解及多重均衡条件。

详情
Comments
Accepted for publication in IEEE Control Systems Letters (LCSS), 2026
AI中文摘要

本文研究了具有$N$个参与者的标量折扣线性二次(LQ)博弈中的反馈纳什均衡(FNE)。通过显式引入折扣因子,我们表明有限成本均衡可能无法稳定原始系统,从而激励了FNE与稳定FNE之间的区分以及一个充分的稳定性条件。基于策略的参数化表征,我们提出了计算所有均衡的数值方法。特别关注对称博弈,其中推导了对称FNE的闭式表达式以及存在最多$M\leq2^N-2$个均衡的条件。数值实验说明了均衡多重性如何依赖于博弈配置,并突出了有限成本非稳定均衡的出现。

英文摘要

This paper studies feedback Nash equilibria (FNE) in scalar discounted linear quadratic (LQ) games with $N$ players. By explicitly incorporating the discount factor, we show that finite-cost equilibria may fail to stabilize the original system, motivating a distinction between FNE and stable FNE together with a sufficient stability condition. Based on a parametric characterization of the policies, we propose numerical methods for computing all equilibria. Particular attention is devoted to the symmetric game, where a closed-form expression of the symmetric FNE and conditions for the existence of up to $M\leq2^N-2$ equilibria are derived. Numerical experiments illustrate how equilibrium multiplicity depends on the game configuration and highlight the emergence of finite-cost non-stabilizing equilibria.

2606.12892 2026-06-12 stat.ML cs.LG econ.EM math.ST stat.ME 新提交

Prediction-Powered Causal Inference by Automatic Debiased Machine Learning and Semi-Supervised Riesz Regression

预测驱动的因果推断:自动去偏机器学习与半监督Riesz回归

Masahiro Kato

AI总结 研究半监督设置下因果参数的半参数有效估计,通过结合去偏机器学习和半监督Riesz回归,提出DML-PPCI和TMLE-PPCI方法,实现比仅用标注数据更小的渐近方差。

详情
AI中文摘要

本研究探讨了在半监督设置下因果和结构参数的半参数有效估计。在我们的设置中,除了由结果和回归变量组成的标注观测数据外,还有未标记的辅助回归变量可用。我们的目标是构建因果和结构参数的估计量,其渐近方差小于仅使用标注数据构建的估计量。我们将此框架称为预测驱动的因果推断(PPCI)。我们首先推导了有效影响函数和效率界,这表明使用辅助回归变量可以获得比仅从标注观测数据可达到的效率界更小的渐近方差。然后,通过将有效影响函数与去偏机器学习(DML)框架相结合,我们提出了称为DML-PPCI的方法。如果我们构建一个估计方程估计量,我们称之为EE-DML-PPCI;如果我们构建一个目标学习估计量,我们称之为TMLE-DML-PPCI。两种估计量的渐近方差都与我们推导的效率界相匹配。在构建估计量时,有效影响函数的估计起着重要作用。在我们的研究中,有效影响函数也是一个Neyman正交分数,它依赖于Riesz表示子和回归函数。对于Riesz表示子估计,我们开发了具有收敛速度保证的半监督广义Riesz回归。

英文摘要

This study investigates semiparametric efficient estimation of causal and structural parameters in a semi-supervised setting. In our setting, unlabeled auxiliary regressors are available in addition to labeled observations consisting of outcomes and regressors. Our goal is to construct estimators of causal and structural parameters whose asymptotic variances are smaller than those of estimators constructed using only labeled data. We refer to this framework as prediction-powered causal inference (PPCI). We first derive the efficient influence function and the efficiency bound, which imply that the use of auxiliary regressors can attain a smaller asymptotic variance than the efficiency bound attainable from labeled observations alone. Then, by combining the efficient influence function with the debiased machine learning (DML) framework, we propose methods that we call DML-PPCI. If we construct an estimating-equation estimator, we refer to the method as EE-DML-PPCI; if we construct a targeted-learning estimator, we refer to the method as TMLE-DML-PPCI. The asymptotic variances of both estimators match our derived efficiency bound. In the construction of the estimators, estimation of the efficient influence function plays an important role. In our study, the efficient influence function is also a Neyman orthogonal score, which depends on the Riesz representer and the regression function. For Riesz representer estimation, we develop semi-supervised generalized Riesz regression with convergence rate guarantees.

2606.12879 2026-06-12 cs.DS math.ST stat.ML 新提交

Diffusion-Network Alignment: An Efficient Algorithm and Explicit Probability Bounds

扩散-网络对齐:一种高效算法与显式概率界

Ziao Wang, Lei Ying

AI总结 提出扩散-网络对齐问题,基于树相关性测试设计高效算法,在稀疏图下证明高概率正确匹配,并给出顶点正确匹配的显式下界。

详情
AI中文摘要

本文研究经典网络对齐问题的一个变体,称为扩散-网络对齐。目标是将有根扩散树的顶点与网络的顶点对齐,其中扩散树可能来自通信追踪或接触追踪,而网络可能是在线或离线社交网络。与两个网络都被完全观测的经典网络对齐不同,该模型捕捉了两个网络的信息不对称性。为了解决这个问题,本文提出了一种基于树相关性测试的高效算法,从局部邻域中提取对齐信息。我们分析了该算法在稀疏图情况下的性能,并表明以高概率,所有匹配对都是正确的。此外,对于扩散树上的每个顶点,本文建立了该顶点被正确匹配的概率的显式下界。这些下界是深度依赖的,并且随着顶点接近根而增加。

英文摘要

This paper studies a variation of the classic network alignment problem, named diffusion-network alignment. The goal is to align the vertices of a rooted diffusion tree to the vertices of a network, where the diffusion tree could be from a communication trace or contact tracing, and the network could be an online or offline social network. Different from the classic network alignment where both networks are fully observed, this model captures the information asymmetry of two networks. To solve this problem, this paper presents an efficient algorithm based on tree correlation tests to extract alignment information from local neighborhoods. We analyze the performance of the algorithm in the sparse graph regime and show that with high probability, all matched pairs are correct. Furthermore, for each vertex on the diffusion tree, this paper establishes an explicit lower bound on the probability that the vertex is correctly matched. These lower bounds are depth-dependent and increase as vertices get closer to the root.

2606.12710 2026-06-12 cs.LG math.OC 新提交

A Stabilized Path-Space Approach to Diffusion-Based Posterior Sampling

一种稳定的路径空间方法用于基于扩散的后验采样

Evan Scope Crafts, Umberto Villa, Saviz Mowlavi, Yanting Ma, Hassan Mansour, Wael H. Ali

发表机构 * Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin(德克萨斯大学奥斯汀分校奥登计算工程与科学研究所) Mitsubishi Electric Research Laboratories (MERL)(三菱电机研究实验室) Department of Biomedical Engineering, The University of Texas at Austin(德克萨斯大学奥斯汀分校生物医学工程系) Mitsubishi Electric Research Laboratories(三菱电机研究实验室)

AI总结 提出一种稳定的路径空间框架,通过随机最优控制与信任域优化,实现非线性逆问题中准确且鲁棒的后验采样。

详情
AI中文摘要

扩散模型为贝叶斯逆问题提供了表达性数据驱动先验,但许多扩散后验采样器依赖启发式引导近似,可能对非线性算子和多模态后验失效。本文开发了一种稳定的路径空间框架用于基于扩散的后验采样。从终端边际代表先验的基础扩散过程出发,我们定义了轨迹上的似然加权目标测度,并将后验采样转化为学习一个路径测度匹配该目标的受控随机过程。该公式将扩散后验采样与随机最优控制联系起来,同时保留了不确定性量化所需的贝叶斯结构。我们引入了一种时间重参数化,通过消除未知初始值函数引起的偏差,使路径空间控制问题适定,无需辅助训练。然后通过具有对数方差目标的信任域路径空间优化方法学习控制。路径空间视角还统一了我们的学习控制方法与现有的基于引导的采样器,量化了近似控制引起的采样误差,并产生了用于渐近精确后验期望的重要性采样校正。我们在具有解析表征或高质量参考后验的基准逆问题套件上评估了所提出的框架,从而实现了对采样精度和不确定性量化的原则性评估。这些实验深入揭示了基于扩散的后验采样器的行为,并证明了相比领先方法更高的准确性和鲁棒性。

英文摘要

Diffusion models provide expressive data-driven priors for Bayesian inverse problems, but many diffusion posterior samplers rely on heuristic guidance approximations that can fail for nonlinear operators and multimodal posteriors. In this work, we develop a stabilized path-space framework for diffusion-based posterior sampling. Starting from a base diffusion process whose terminal marginal represents the prior, we define a likelihood-weighted target measure on trajectories and cast posterior sampling as learning a controlled stochastic process whose path measure matches this target. This formulation connects diffusion posterior sampling to stochastic optimal control while preserving the Bayesian structure needed for uncertainty quantification. We introduce a time reparameterization that makes the path-space control problem well posed by removing the bias induced by the unknown initial value function, without auxiliary training. We then learn the control via a trust-region path-space optimization method with log-variance objectives. The path-space perspective also unifies our learned control approach with existing guidance-based samplers, quantifies the sampling error induced by approximate controls, and yields importance sampling corrections for asymptotically exact posterior expectations. We evaluate the proposed framework on a suite of benchmark inverse problems with analytically characterized or high-quality reference posteriors, enabling principled assessment of sampling accuracy and uncertainty quantification. These experiments provide insight into the behavior of diffusion-based posterior samplers and demonstrate improved accuracy and robustness over leading approaches.

2606.12707 2026-06-12 eess.SY math.PR 新提交

Storage and Transport Capacity Design for a Self-Reliable Two-Node Stochastic Resource System

自可靠双节点随机资源系统的存储与运输容量设计

Arnab Deya, Vivek Khatana, Ankur Mani, Murti V. Salapaka

AI总结 针对有限时域内双节点随机资源系统,提出机会约束容量设计问题,刻画最小存储需求、最优运输策略及存储-运输容量权衡,发现存在临界运输容量阈值实现完全风险汇聚。

详情
Comments
9 pages, 4 figures
AI中文摘要

我们研究了一个在有限时域上运行的双节点随机资源系统。每个节点经历不确定的供应和需求,并配备有限存储。目标是确保资源水平以高概率保持在规定限制内。为此,我们制定了一个机会约束容量设计问题,其中资源可以通过容量有限的运输链路进行交换。我们刻画了每个节点所需的最小存储,推导了最优运输策略,并量化了存储与运输容量之间的权衡。我们的结果表明存在一个临界运输容量阈值,使得节点之间能够实现完全风险汇聚。此外,该阈值随运行时间增加而减小,意味着在更长的时间范围内,可以通过逐渐更小的运输容量实现完全汇聚性能。

英文摘要

We study a two-node stochastic resource system operating over a finite horizon. Each node experiences uncertain supply and demand and is equipped with finite storage. The objective is to ensure that resource levels remain within prescribed limits with high probability. To this end, we formulate a chance-constrained capacity-design problem in which resources can be exchanged through a capacity-limited transport link. We characterize the minimum storage required at each node, derive the optimal transport policy, and quantify the trade-off between storage and transport capacities. Our results show the existence of a critical transport-capacity threshold that enables full risk pooling between the nodes. Moreover, this threshold decreases with the operating horizon, implying that full-pooling performance can be achieved with progressively smaller transport capacity over longer horizons.

2606.12694 2026-06-12 cs.DS cs.LG math.PR stat.ML 新提交

A unified complexity bound for logconcave sampling

对数凹采样的统一复杂度界

Yunbum Kook, Santosh S. Vempala

AI总结 本文通过In-and-Out算法与指数提升,给出了从热启动采样任意对数凹分布的简单、统一且近乎紧的界,主要创新是提升了提升分布的Poincaré常数界。

详情
Comments
5 pages
AI中文摘要

我们给出了一个简单、统一且近乎紧的界,用于从热启动使用In-and-Out算法结合指数提升采样任意对数凹分布。分析中的主要新成分是提升了提升分布的Poincaré常数界。因此,得到的收敛率对于约束设置(例如,限制在凸体上的高斯分布)和良条件设置(例如,强对数凹且光滑的密度)都是近乎紧的。

英文摘要

We give a simple, unified, and nearly tight bound for sampling arbitrary logconcave distributions from a warm start using the In-and-Out algorithm along with exponential lifting. The main new ingredient in the analysis is an improved bound on the Poincaré constant of a lifted distribution. As a consequence, the resulting convergence rate is nearly tight for both constrained settings (e.g., Gaussian restricted to a convex body) and well-conditioned settings (e.g., strongly logconcave and smooth densities).

2606.12691 2026-06-12 cs.LG cs.AI eess.SY math.OC stat.ML 新提交

Two-Layer Linear Auto-Regressive Models Estimate Latent States

两层线性自回归模型估计潜在状态

Yahya Sattar, Sunmook Choi, Leo Maynard-Zhang, Yassir Jedra, Maryam Fazel, Sarah Dean

AI总结 本文证明两层线性自回归模型通过经验风险最小化训练时,能近似卡尔曼滤波,恢复潜在状态估计,并提供有限样本保证。

详情
Comments
ICML 2026
AI中文摘要

自回归模型已成为处理序列数据(从语言到视频)的强大工具。理解这些模型如何以及为何学习潜在表示仍然是一个开放的理论问题。在这项工作中,我们证明,当在部分观测的线性动力系统的数据上通过经验风险最小化训练时,两层线性自回归模型自然学会近似卡尔曼滤波。特别地,我们表明,学习到的隐藏表示与最优(卡尔曼)滤波器产生的状态估计一致,仅相差一个相似变换,尽管模型没有关于底层动力学或状态的显式知识。该结果基于三个主要见解。首先,我们建立卡尔曼滤波器可以被具有有界截断误差的自回归模型很好地近似。其次,我们表明,尽管非凸性,两层优化景观是良性的,即所有驻点要么是严格鞍点,要么是全局最小值。最后,作为我们的主要贡献,我们提供了关于预测误差、参数估计误差和潜在状态恢复的有限样本保证。数值模拟支持理论结果,并表明自回归模型的潜在表示恢复了状态估计。

英文摘要

Auto-regressive models have emerged as powerful tools for sequential data, from language to video. Understanding how and why these models learn latent representations remains an open theoretical question. In this work, we demonstrate that when trained by empirical risk minimization on data from partially observed linear dynamical systems, two-layer linear auto-regressive models naturally learn to approximate Kalman filtering. In particular, we show that the learned hidden representation coincides, up to a similarity transformation, with the state estimates produced by the optimal (Kalman) filter, even though the model has no explicit knowledge of the underlying dynamics or state. The result follows from three main insights. First, we establish that the Kalman filter is well approximated by an auto-regressive model with bounded truncation error. Second, we show that despite non-convexity, the two-layer optimization landscape is benign, i.e., all stationary points are either strict saddles or global minima. Finally, as our main contributions, we provide finite-sample guarantees on prediction error, parameter estimation error, and latent state recovery. Numerical simulations support the theoretical results and demonstrate that the latent representations of auto-regressive models recover state estimates.

2606.12684 2026-06-12 q-bio.NC math.DS 新提交

Phase model analysis of the effect of M-current on neural synchrony in hippocampal networks

M电流对海马网络神经同步性影响的相位模型分析

Megha Manoj, Sue Ann Campbell

AI总结 通过相位模型约化,研究乙酰胆碱通过调节M电流对海马神经元集群同步性的双向作用,发现低ACh促进完全同步,高ACh导致多稳定对称集群解。

详情
Comments
39 pages, 14 figures
AI中文摘要

在海马中观察到的神经集群,即短暂协调的神经元群体,被认为是情景记忆形成的基础。乙酰胆碱(ACh)是一种神经调节剂,由海马接收,在记忆和学习中起关键作用。一个得到充分支持的假说认为,在主动探索和快速眼动(REM)睡眠期间高水平的ACh促进记忆编码,而在安静清醒和慢波睡眠(SWS)期间低水平的ACh支持记忆巩固。我们通过ACh对神经元间同步性的影响来研究其在神经集群形成中的双向作用。我们考虑一个锥体神经元网络模型,每个神经元配备一个缓慢的、电压依赖性的、非失活的钾电流(M电流),该电流在ACh存在时下调。神经集群被表示为该系统的集群解。利用一维相位模型约化,对在不同M电流水平下弱耦合的一对锥体神经元,我们预测了在具有全连接全局均匀、对称距离依赖和最近邻耦合架构的更大网络中可能出现的对称集群解。我们发现,在低ACh条件下,网络可以完全同步,而高ACh水平则可以使网络去同步,形成多个稳定的对称集群解,代表不同的神经集群。

英文摘要

Neural assemblies, transiently coordinated groups of neurons, observed in the hippocampus are thought to underlie the formation of episodic memories. Acetylcholine (ACh), a neuromodulator, that is received by the hippocampus, plays a critical role in memory and learning. A well supported hypothesis suggests that high levels of ACh during active exploration and rapid eye movement (REM) sleep promote memory encoding, while low levels during quiet waking and slow-wave sleep (SWS) support memory consolidation. We study this bidirectional role of ACh in neural assembly formation through its effect on the synchrony among neurons. We consider a network model of pyramidal neurons, each equipped with a slow, voltage-dependent, non-inactivating potassium current (M-current), which is downregulated in the presence of ACh. Neural assemblies are represented as cluster solutions to this system. Using a one-dimensional phase model reduction of a pair of weakly coupled pyramidal neurons under different levels of the M-current, we predict the symmetric cluster solutions that may emerge in larger networks equipped with all-to-all globally homogeneous, symmetric distance-dependent and nearest-neighbours coupling architectures. We find that under low ACh conditions, the network can fully synchronize, whereas high levels can desynchronize the network into multiple stable symmetric cluster solutions representing distinct neural assemblies.

2606.12573 2026-06-12 q-bio.MN math.DS 新提交

Implementation of Linear Regression and Linear Interpolation using Reaction Networks

利用反应网络实现线性回归和线性插值

Aryan Kumar, Amey Choudhary, Jiaxin Jin, Chittaranjan Hens, Abhishek Deshpande

AI总结 提出基于反应网络的方法实现单变量/多变量线性回归和线性插值,通过编码稳态浓度作为输出,并引入处理负数除法的广义除法模块,在合成数据集上验证了有效性。

详情
Comments
30 pages, 7 figures
AI中文摘要

执行统计推断是数据科学的重要组成部分。本文关注两种推断技术,即回归和插值。我们提出了一种基于反应网络的方法,可以实现线性回归(包括单变量和多变量)和线性插值。我们通过将物种的稳态浓度编码为这些推断技术的输出来实现这一点。为此,我们使用了一种新颖的广义除法模块,可以处理负数的除法。我们通过在标准合成数据集上的计算机模拟结果进行比较,验证了我们的结果。

英文摘要

Performing statistical inference is an essential component of data science. Our focus in this work is on two inference techniques, viz. regression and interpolation. We propose a reaction network based approach that can implement linear regression (both univariate and multivariate) and linear interpolation. We do this by encoding the steady state concentration of species as the output of these inference techniques. Towards this, we use a novel generalized division module that can handle division of negative numbers. We verify our results by comparing them with in-silico implementation on standard synthetic datasets.

2606.12450 2026-06-12 q-fin.CP math.NA 新提交

Forward-Time Black-Scholes Reconstruction via Regularized Legendre Reduction

通过正则化勒让德约化实现前向时间Black-Scholes重构

Phuong M. Nguyen, Matt Nguyen, Loc H. Nguyen

AI总结 针对状态依赖波动率的Black-Scholes方程前向时间公式的不适定性,提出基于移位勒让德多项式的谱截断与勒让德-吉洪诺夫方法,证明存在唯一性、数据稳定性和收敛性,数值实验验证了从含噪初始数据恢复终端期权价格剖面的有效性。

详情
AI中文摘要

我们研究了具有状态依赖波动率的Black-Scholes方程的前向时间公式。与经典的终端值定价问题(其中期权收益在到期日给定,价格向后计算)不同,本问题给定当前期权价格剖面,并试图恢复到期日T的期权价格剖面。该公式是不适定的,因为方程沿抛物算子的不稳定方向演化,初始数据中的高频扰动可能被强烈放大。为解决这一困难,我们引入基于移位勒让德多项式的价格维度约化。原始Black-Scholes方程在资产价格变量上投影到有限维勒让德基上,得到展开系数的时间常微分方程组。这种约化起到谱截断的作用,并缓解了零价格边界上由因子S^2引起的退化。主要重构方法是维度约化的勒让德-吉洪诺夫方法。我们证明了每个固定截断水平下的存在唯一性、数据稳定性和收敛性。我们还在勒让德约化后包含一个约化PINN求解器作为辅助计算比较。使用平滑、蝶式价差和欧式看跌期权收益的数值实验表明,勒让德-吉洪诺夫方法能从含噪初始数据恢复终端期权价格剖面,而约化PINN求解器提供了有用的额外基准。与传统物理空间拟可逆方法的比较证明了勒让德约化的稳定效果。

英文摘要

We study a forward-time formulation of the Black-Scholes equation with state-dependent volatility. In contrast to the classical terminal-value pricing problem, where the option payoff is prescribed at maturity and the price is computed backward in time, the present problem prescribes the current option-price profile and seeks to recover the option-price profile at the expiration date T. This formulation is ill-posed, since the equation evolves in the unstable direction of the parabolic operator and high-frequency perturbations in the initial data may be strongly amplified. To address this difficulty, we introduce a price-dimensional reduction based on shifted Legendre polynomials. The original Black-Scholes equation is projected onto a finite-dimensional Legendre basis in the asset-price variable, leading to a system of ordinary differential equations in time for the expansion coefficients. This reduction acts as a spectral cutoff and also relaxes the degeneracy caused by the factor S^2 at the zero-price boundary. The main reconstruction method is a dimension-reduced Legendre--Tikhonov method. We prove existence, uniqueness, data stability, and convergence for each fixed truncation level. We also include a reduced PINN solver as a secondary computational comparison after the Legendre reduction. Numerical experiments with smooth, butterfly-spread, and European put payoffs show that the Legendre--Tikhonov method recovers the terminal option-price profile from noisy initial data, while the reduced PINN solver provides a useful additional benchmark. Comparisons with the conventional physical-space quasi-reversibility method demonstrate the stabilizing effect of the Legendre reduction.

2606.13656 2026-06-12 math.LO math.CO 新提交

On the sunflower property and the galah property

关于向日葵性质与加拉性质

Cheng Liao

AI总结 本文研究无限向日葵性质与加拉性质的关系,证明高维无限向日葵性质不存在,并完全刻画了Henson有向图、齐次度量空间和齐次超度量空间的加拉性质。

详情
AI中文摘要

向日葵性质(无限向日葵性质)由Ackerman、Karker和Mirabi引入并研究,作为著名的集合\Delta-系统引理的结构推广。结果表明,对于具有强融合的关系Fraïssé极限,该性质等价于所谓的加拉性质,该性质由Sullivan和Winkel作为不可分性的非对称变体引入。本文研究这两个性质,分为三部分。第一部分,我们通过证明对于任意n,k \geq 2,没有无限结构具有k维无限n-向日葵性质,表明Ackerman、Karker和Mirabi关于高维无限向日葵性质的猜想远非正确。第二部分,我们给出了Henson有向图、齐次度量空间和齐次超度量空间的加拉性质的完整刻画,从而回答了Sullivan和Winkel提出的第二个问题。第三部分包含关于有限向日葵性质的若干额外结果,包括对Guingona等人获得的某些无向图类不可分性近期结果的加强。

英文摘要

Sunflowerability, or the infinite sunflower property, was introduced and studied by Ackerman, Karker and Mirabi as a structural generalization of the well-known \Delta-system lemma for sets. It turns out that for relational Fra\''issé limits with strong amalgamation, this property is equivalent to the so-called galah property, which was introduced by Sullivan and Winkel as an asymmetric variation of indivisibility. This paper is about these two properties and is divided into three parts. In the first part, we show that the conjecture proposed by Ackerman, Karker and Mirabi about the infinite sunflower property in higher dimensions is far from being true by proving that no infinite structure has the infinite n-sunflower property in dimension k for any n, k \geq 2. In the second part, we give a complete characterization of the galah property for Henson directed graphs, homogeneous metric spaces and homogeneous ultrametric spaces, thereby answering the second question asked by Sullivan and Winkel. The third part contains several additional results about the finite sunflower property, including a strengthening of recent results about indivisibility for some classes of undirected graphs obtained by Guingona et al..

2606.13648 2026-06-12 math.PR 新提交

Stochastic dominations for FK percolation and sharp thinning thresholds for the Ising energy field

FK渗流和伊辛能量场的尖锐稀释阈值的随机占优

Paul Cahen, Avelio Sepúlveda

AI总结 本文引入p-弱和p-弱†占优概念,证明伊辛模型能量场关于耦合常数的随机单调性,并建立不同参数FK渗流之间的新随机占优关系。

详情
Comments
26 pages, no figure
AI中文摘要

乍一看,人们会认为伊辛模型的能量场(即端点共享相同自旋的边的集合)作为耦合常数的函数是随机单调的。然而,情况通常并非如此。在本文中,我们引入了两个较弱的随机占优概念,使得这一结果成立:$p$--弱和$p$--弱$^\dagger$占优。这两个概念都依赖于参数$p$,我们找到了最优值$p$和$p^\dagger$,使得这些占优成立。获得部分结果的关键要素之一是建立了关于不同参数$q,\tilde{q}\geq 1$的FK渗流之间的新随机占优关系,这本身也具有独立意义。

英文摘要

At first glance, one would imagine that the energy field of the Ising model, the set of edges whose endpoints share the same spin, is stochastically monotone as a function of the coupling constants. However, this is not generally the case. In this paper, we introduce two weaker notions of stochastic domination that make this result true: $p$--weak and $p$--weak$^\dagger$ domination. Both of these notions depend on a parameter $p$ and we find the optimal values $p$ and $p^\dagger$ so that these dominations hold. One of the key ingredient to obtain some of the results is a new stochastic domination relating FK percolations with different parameters $q,\tilde{q}\geq 1$ that is of independent interest.

2606.13632 2026-06-12 math.GR math.CO 新提交

Growth of Approximate Groups in Hyperbolic Groups

双曲群中近似群的增长

Michael Saks, Gal Yehuda

AI总结 本文证明双曲群中无限近似群(及更一般的近似半群)的增长二分法:要么生成子群是虚拟循环群,要么集合在词度量中具有正指数增长;并引入近似半群增长率的存在性判据,给出自由群中的最优常数。

详情
AI中文摘要

我们证明了双曲群中无限近似群(以及更一般的近似半群)的增长二分法。如果 \(G\) 是有限生成的双曲群,且 \(A\subseteq G\) 是无限集,满足对某个有限集 \(X\subseteq G\) 有 \(A^2\subseteq AX\),那么要么 \(\langle A\rangle\) 是虚拟循环群,要么 \(A\) 在环境词度量中具有正指数增长。我们还引入了近似半群增长率存在性的乘积增长判据。该判据适用于双曲群:如果 \(G\) 是带有有限生成集 \(S\) 的双曲群,则存在常数 \(c_{G,S}>0\) 使得 \[ |UV| \geq c_{G,S}\,\frac{|U||V|}{n+k+1}, \qquad U\subseteq B_n,\; V\subseteq B_k. \] 当 \(G\) 包含无限阶元素时,线性损失在阶上是最优的。在自由群及其标准生成集下,可取 \(c_{G,S}=1/4\)。我们还证明,在自由群中,若 \(U\subseteq S_n\) 且 \(V\subseteq S_k\),则 \[ |UV|\geq \left(\frac{2}{3}+\frac{1}{3\cdot 4^{\min\{n,k\}}}\right)|U||V|, \] 且该常数对所有 \(n,k\) 都是最优的。

英文摘要

We prove a growth dichotomy for infinite approximate groups, and more generally approximate semigroups, in hyperbolic groups. If \(G\) is a finitely generated hyperbolic group and \(A\subseteq G\) is infinite with \[ A^2\subseteq AX \] for some finite \(X\subseteq G\), then either \(\langle A\rangle\) is virtually cyclic, or \(A\) has positive exponential growth in the ambient word metric. We also introduce a product-growth criterion for the existence of growth rates of approximate semigroups. The criterion applies to hyperbolic groups: if \(G\) is hyperbolic with finite generating set \(S\), then there is a constant \(c_{G,S}>0\) such that \[ |UV| \geq c_{G,S}\,\frac{|U||V|}{n+k+1}, \qquad U\subseteq B_n,\; V\subseteq B_k. \] The linear loss is optimal in order whenever \(G\) contains an element of infinite order. In the free group with its standard generating set one may take \(c_{G,S}=1/4\). We also prove that, in a free group, if \(U\subseteq S_n\) and \(V\subseteq S_k\), then \[ |UV|\geq \left(\frac{2}{3}+\frac{1}{3\cdot 4^{\min\{n,k\}}}\right)|U||V|, \] and this constant is sharp for all \(n,k\).

2606.13619 2026-06-12 math.NT math.CO 新提交

Split primes and the Elekes-Rónyai problem

分裂素数与Elekes-Rónyai问题

Cosmin Pohoata

AI总结 构造反例表明存在绝对常数c>0和任意大的有限实数集A,使得多项式x+y+(x-y)^2的值集大小不超过|A|^{2-c},从而否定Elekes-Rónyai猜想。

详情
AI中文摘要

存在一个绝对常数$c>0$和任意大的有限集合$A\subset \mathbb{R}$,使得$$\left| \left\{x+y+(x-y)^2:\ x, y \in A\right\}\right| \le|A|^{2-c}.$$由于$x+y+(x-y)^2 \in \mathbb{R}[x,y]$是一个既非加法也非乘法的多项式,这为Elekes-Rónyai问题提供了一个反例。

英文摘要

There exist an absolute constant $c>0$ and arbitrarily large finite sets $A\subset \mathbb{R}$ with $$\left| \left\{x+y+(x-y)^2:\ x, y \in A\right\}\right| \le|A|^{2-c}.$$ Since $x+y+(x-y)^2 \in \mathbb{R}[x,y]$ is a polynomial which is neither additive nor multiplicative, this provides a counterexample for the Elekes-Rónyai problem.

2606.13616 2026-06-12 math.OA 新提交

Inclusions of Fell bundles $\mathrm{C}^*$-algebras and coaction crossed products

Fell 丛 $\mathrm{C}^*$-代数的包含与余作用交叉积

Md Amir Hossain

AI总结 研究Fell丛C*-代数在离散群余循环下的嵌入与拓扑分次结构,证明限制丛C*-代数的等距嵌入,并构造典范余作用,进而证明余作用交叉积同构于新Fell丛的C*-代数。

详情
Comments
19 pages
AI中文摘要

设 $p \colon \mathcal{A} \to G$ 是装备Haar系的局部紧Hausdorff第二可数群胚 $G$ 上的Fell丛,$\Gamma$ 是离散群。给定连续 $1$-余循环 $c \colon G \to \Gamma$,我们证明限制Fell丛 $\mathcal{A}|_{G_e}$ 的 $\mathrm{C}^*$-代数等距嵌入 $\mathrm{C}^*(G;\mathcal{A})$,其中 $G_e = c^{-1}(e)$ 是对应于单位元的开闭子群胚。我们利用此嵌入证明 $\mathrm{C}^*(G;\mathcal{A})$ 具有Exel意义下的自然拓扑分次 $\mathrm{C}^*$-代数结构。作为推论,我们得到 $\Gamma$ 在 $\mathrm{C}^*(G; \mathcal{A})$ 上的典范余作用 $\delta$。我们进一步证明相关的余作用交叉积 $\mathrm{C}^*(G; \mathcal{A})\rtimes_\delta \Gamma$ 自然同构于由余循环数据构造的Fell丛的 $\mathrm{C}^*$-代数。

英文摘要

Let $p \colon \mathcal{A} \to G$ be a Fell bundle over a locally compact Hausdorff second countable groupoid $G$ equipped with a Haar system, and let $\Gamma$ be a discrete group. Given a continuous $1$-cocycle $c \colon G \to \Gamma$, we show that the $\mathrm{C}^*$-algebra of the restricted Fell bundle $\mathcal{A}|_{G_e}$ embeds isometrically into $\mathrm{C}^*(G;\mathcal{A})$, where $G_e = c^{-1}(e)$ is the clopen subgroupoid corresponding to the identity element. We exploit this embedding to show that $\mathrm{C}^*(G;\mathcal{A})$ admits a natural structure of a topologically graded $\mathrm{C}^*$-algebra in the sense of Exel. As a consequence, we obtain a canonical coaction $\delta$ of $\Gamma$ on $\mathrm{C}^*(G; \mathcal{A})$. We further show that the associated coaction crossed product $\mathrm{C}^*(G; \mathcal{A})\rtimes_\delta \Gamma$ is naturally isomorphic to the $\mathrm{C}^*$-algebra of a Fell bundle constructed from the cocycle data.

2606.13615 2026-06-12 math.PR stat.ME 新提交

Data-driven subsampling rates for diffusion parameter estimation of SDEs

数据驱动的扩散参数估计子采样率选择

Felix Lindner, Andre Schmeißer, Felipe Trolldenier, Raimund Wegener

AI总结 提出基于单调游程统计的自动子采样率选择方法,确保子采样数据与SDE模型在无穷小尺度上一致,无需多尺度扩散渐近框架。

详情
Comments
30 pages, 11 figures
AI中文摘要

我们研究随机微分方程(SDE)模型中扩散参数估计的问题,其中数据和模型仅在尚未确定的特定尺度上兼容。我们引入一种简单有效的方法,用于选择合适的速率对给定的时间序列数据进行子采样,以确保子采样数据的统计结构与SDE模型在无穷小尺度上的行为一致。我们的方法基于分析子采样数据序列中单调递增或递减段(称为单调游程)的长度统计。作为分析基础,我们证明对于一大类具有加性噪声的SDE,在无穷小尺度上单调游程的长度近似服从成功概率为$1/2$的几何分布。利用这一通用特征,我们推导出一种自动化方法,用于为给定的时间序列数据选择合适的子采样率,该方法可直接应用于实际场景,且不依赖于多尺度扩散的渐近框架。通过一个工业数学应用——非织造纺织品生产过程中纤维铺放曲线的替代模型——展示了该方法。

英文摘要

We study the problem of diffusion parameter estimation for stochastic differential equation (SDE) models in scenarios where data and model are compatible only on specific scales that have yet to be determined. We introduce a simple and efficient method for selecting suitable rates at which given time series data should be subsampled in order to ensure that the statistical structure of the subsampled data is consistent with the behavior of the SDE model on an infinitesimal scale. Our approach is based on analyzing the statistics of the lengths of monotonically increasing or decreasing segments in the subsampled data sequence, which we refer to as monotone runs. As an analytical foundation, we prove for a large class of SDEs with additive noise that the lengths of monotone runs at an infinitesimal scale are approximately geometrically distributed with success probability $1/2$. This universal characterization is employed to derive an automated method for selecting appropriate subsampling rates for given time series data that is directly applicable in real-world scenarios and does not rely on an asymptotic framework of multiscale diffusions. The approach is demonstrated using an application from industrial mathematics concerning surrogate models for fiber lay-down curves in production processes of nonwoven textiles.

2606.13605 2026-06-12 math.OC cs.LG eess.SY 新提交

Distribution-Agnostic Robust Trajectory Optimization via Chance-Constrained Reinforcement Learning

基于机会约束强化学习的分布无关鲁棒轨迹优化

Yashdeep Chaudhary, Roberto Armellin, Harry Holt, Marco Sagliano

AI总结 提出一种分布无关的鲁棒轨迹优化框架,通过机会约束强化学习处理初始条件和过程噪声的不确定性,采用离线标称轨迹与在线仿射闭环校正,在两种不同轨迹设计问题上验证了概率可行性与燃料效率。

详情
Comments
Preprint. 39 pages, 16 figures
AI中文摘要

本文提出了一种基于机会约束强化学习的分布无关鲁棒轨迹优化框架。不确定性通过初始条件和过程噪声表示,唯一要求是能够对其进行采样。首先离线计算确定性标称轨迹,然后仅使用强化学习通过结构化仿射闭环校正律(包括前馈控制调整和时变反馈增益)来鲁棒化该基线。通过基于rollout的上尾分位数经验性地强制执行概率可行性,同时通过协方差可行性惩罚来调节终端分散性。该框架在两个性质不同的轨迹设计问题上进行了评估。主要案例研究是一个三维多脉冲地球-火星转移任务,其中学习策略在高斯不确定性下与最近的鲁棒轨迹优化参考进行基准比较,然后在有界均匀不确定性和训练期间未见的过程扰动下进行评估。第二个案例研究是一个随机大气精确火箭着陆问题,用于评估在具有阻力、质量消耗和下滑角约束的短时连续推力设置中的可移植性。结果表明,所提出的框架在保持概率可行性的同时,能够在上尾燃料成本方面保持竞争力,并且相同的鲁棒化框架可以跨异构航天器轨迹规划问题移植,而无需重新设计其核心随机控制结构。

英文摘要

This paper presents a distribution-agnostic robust trajectory-optimization framework based on chance-constrained reinforcement learning. The uncertainty is represented here through initial conditions and process noise, with the only requirement being that it can be sampled. A deterministic nominal trajectory is first computed offline, and reinforcement learning is then used only to robustify that baseline through a structured affine closed-loop correction law comprising a feedforward control adjustment and time-varying feedback gains. Probabilistic feasibility is enforced empirically through rollout-based upper-tail quantiles, while terminal dispersion is regulated through covariance-feasibility penalties. The framework is assessed on two materially different trajectory design problems. The flagship case study is a three-dimensional multi-impulse Earth-Mars transfer, where the learned policy is benchmarked against a recent robust trajectory-optimization reference under Gaussian uncertainty and then evaluated under bounded uniform uncertainty and under process disturbances not seen during training. The second case study is a stochastic atmospheric pinpoint rocket landing problem, used to assess portability to a short-horizon continuous-thrust setting with drag, mass depletion, and glide-slope constraints. The results show that the proposed framework can remain competitive in upper-tail fuel cost while preserving probabilistic feasibility, and that the same robustification scaffold can be carried across heterogeneous spacecraft trajectory planning problems without redesign of its core stochastic-control structure.

2606.13590 2026-06-12 math.NT math.CO 新提交

Some new modular Nahm sums of ranks 3 and 4

一些新的秩3和秩4的模Nahm和

Zhineng Cao, Liuquan Wang

AI总结 通过修改Zagier的秩3例子、对Milas和Wang的秩3蝌蚪Nahm和施以提升对偶运算以及常数项方法,发现了六个新的秩3和秩4的模Nahm和族,并建立了Rogers-Ramanujan型恒等式证明其模性。

详情
Comments
22 pages
AI中文摘要

我们发现了六个新的秩3和秩4的模Nahm和族。其中两个是通过修改Zagier的两个秩3例子得到的秩3和。三个秩4族是通过对Milas和Wang研究的秩3蝌蚪Nahm和施以提升对偶运算推导出来的,而另一个秩4族是通过常数项方法发现的。为了证明模性,我们建立了Rogers-Ramanujan型恒等式,将这些Nahm和表示为无穷乘积,这些乘积是模的。

英文摘要

We discover six new families of modular Nahm sums in ranks 3 and 4. Two of them are rank three sums obtained by modifying two of Zagier's rank three examples. Three rank four families are derived by applying the lift-dual operation to the rank three tadpole Nahm sums studied by Milas and Wang, while the other rank four family is found by the constant term method. To prove modularity, we establish Rogers-Ramanujan type identities that express these Nahm sums as infinite products which are modular.

2606.13585 2026-06-12 math.GT math.GR 新提交

Cellular waists of hyperbolic spaces

双曲空间的细胞腰

Grigori Avramidi, Thomas Delzant

AI总结 利用双曲群环中理想的自由定理,证明大单射半径闭双曲流形到欧氏空间的PL或光滑映射的纤维必须具有大量k维胞腔。

详情
Comments
14 pages
AI中文摘要

我们找到了PL和一般光滑映射 $p:M^d\rightarrow\mathbb R^m$ 的纤维的拓扑复杂性的下界,其中 $M^d$ 是具有大单射半径的闭双曲流形。更精确地说,我们证明如果 $M$ 的单射半径大于 $50\log((n+1)!)$,那么对于每个维度 $0<k<d-m$,存在一点 $z\in\mathbb R^m$ 使得纤维 $p^{-1}(z)$ 上的任何胞腔结构都有多于 $n$ 个 $k$ 维胞腔。证明基于 arXiv:2309.16791 中证明的双曲群环中理想的自由定理。

英文摘要

We find lower bounds on the topological complexity of fibers of PL and generic smooth maps $p:M^d\rightarrow\mathbb R^m$, where $M^d$ is a closed hyperbolic manifold of large injectivity radius. More precisely, we show that if the injectivity radius of $M$ is greater than $50\log((n+1)!)$, then for each dimension $0<k<d-m$ there is a point $z\in\mathbb R^m$ such that any cell structure on the fiber $p^{-1}(z)$ has more than $n$ cells of dimension $k$. The proof is based on a freedom theorem for ideals in group rings of hyperbolic groups proved in arXiv:2309.16791.

2606.13579 2026-06-12 math.OC math.CO 新提交

Optimal Proximity Bound and Product Function Estimates in Integer Linear Programming

整数线性规划中的最优邻近界与乘积函数估计

Iskander Aliev, Gennadiy Averkov, William Jones, Timm Oertel

AI总结 针对标准型整数线性规划,证明了LP松弛最优顶点解到最近整数最优解的欧氏距离以√det(AA^t)-1为上界且渐近紧,并给出了涉及乘积函数∏(x_i+1)的整数最优解界及其在背包问题中的应用。

详情
Comments
18 pages
AI中文摘要

我们获得了标准型整数线性规划 max{cx: Ax=b, x 非负整数} 的最优邻近界,其中 A 是秩为 m<n 的整数 m×n 矩阵,b 是整数向量。具体地,我们证明了从 LP 松弛的任意最优顶点解到最近整数最优解的欧氏距离以 $\sqrt{\det(AA^t)}-1$ 为界,并且该估计是渐近紧的。我们还推导了涉及乘积函数 $\prod_{i=1}^{n}(x_i+1)$ 的整数最优解的界,并讨论了它们在背包问题中的应用。

英文摘要

We obtain an optimal proximity bound for integer linear programs in standard form max{cx: Ax=b, x nonnegative integer}, where A is an integer mxn matrix of rank m<n and b is an integer vector. Specifically, we show that the Euclidean distance from any optimal vertex solution of the LP relaxation to a nearest optimal integer solution is bounded by $\sqrt{\det(AA^t)}-1$ and that this estimate is asymptotically tight. We also derive bounds for the optimal integer solutions involving the product function $\prod_{i=1}^{n}(x_i+1)$ and discuss their applications in the knapsack setting.