First server's effect on the expected number of games in tennis
首发球员对网球比赛期望局数的影响
AI总结 研究在假设每位球员发球得分概率恒定下,首发球员信息如何影响网球比赛期望总局数和胜局差,并确定该影响不可忽略的概率区域。
首发球员对网球比赛期望局数的影响
Ali Mohammadi
AI总结 研究在假设每位球员发球得分概率恒定下,首发球员信息如何影响网球比赛期望总局数和胜局差,并确定该影响不可忽略的概率区域。
我们证明,在每位球员发球得分概率恒定的标准假设下,首发球员的信息会影响网球比赛的期望总局数和胜局差,并识别出这些概率下该影响不可忽略的具体区域。我们通过数值验证,在盘和比赛层面,该影响最多不超过$0.4$局。例如,当球员发球得分概率相差$10\%$时,比赛超过$19.5$局的概率大约会变化$9\%$。我们通过专业比赛数据的实证比较补充了分析,说明了恒定概率假设在建模总局数方面的充分性。
We show that information on the first server influences the expected total number of games and margin in a tennis match under the standard assumption that each player's serve point win probability remains constant, and identify the exact regions, in terms of these probabilities, in which this effect is non-negligible. We confirm numerically that this effect is bounded by at most $0.4$ games at both the set and match level. This translates, for example, to roughly a $9$ percent shift in the probability that a match exceeds $19.5$ games when the players' serve point win probabilities differ by $10$ percent. We complement the analysis with an empirical comparison on professional match data, illustrating the adequacy of the constant-probability assumption for modelling the total number of games.
教室环境中的冲突感知座位分配
Bruna Cristina Braga Charytitsch, Mariá Cristina Vasconcelos Nascimento
AI总结 针对教室座位分配问题,提出数学模型和迭代局部搜索启发式算法,以最小化学生间的人际冲突。
课堂动态受多种因素影响,这些因素影响教学表现和学习活动。一个关键挑战是确定最有效的座位安排,即学生在特定教室环境中就座以实现最佳学习环境。本文介绍了学生座位分配问题(SSAP),用于在传统教室中战略性地组织学生座位,以最小化人际冲突。我们提出了一个数学模型和迭代局部搜索(ILS)启发式算法来解决SSAP。计算实验表明,与商业求解器在引入的数学模型上获得的结果相比,ILS在更复杂的场景中表现更优。ILS在处理具有更高冲突数量的真实和人工实例时特别有效。
Classroom dynamics depend on various elements that influence teaching performance and learning activities. A key challenge is to determine the most effective seating plan, where students will seat in a specific classroom setting to achieve the best learning environment. This paper introduces the Student Seat Allocation Problem (SSAP) for strategically organizing student seating in traditional classrooms to minimize interpersonal conflicts. We propose a mathematical model and an Iterated Local Search (ILS) heuristic to solve the SSAP. Computational experiments demonstrated that ILS outperformed in more complex scenarios when compared to the results obtained by a commercial solver on the introduced mathematical model. ILS was particularly efficient in real and artificial instances that exhibited a higher number of conflicts.
广义拉梅曲线理论
You-Cheng Chou, Chin-Lung Wang, Po-Sheng Wu
AI总结 通过分析椭圆曲线上具有多个正则奇点的广义拉梅方程,构造了广义拉梅曲线和对数自由曲线,并利用加法映射和扭曲等单值形变建立了边界退化与BGG范畴O中sl2(C)模张量代数的对应,解决了Treibich猜想。
我们研究椭圆曲线$E$上具有多个正则奇点$\mathbf{p} = (p_i)_{i = 1}^r$和权重$\mathbf{n} = (n_i)_{i = 1}^r$的广义拉梅方程(GLE)。通过分析允许拟周期解的轨迹,我们构造了两条基本代数曲线:(i) 广义拉梅曲线(GLC) $\mathcal{Y}_{\mathbf{n}, \mathbf{p}}$,它位于$\operatorname{Sym}^n E$上的仿射丛中(总权重$n:=\sum n_i \in \mathbb{Z}_{\geq 0}$),并参数化广义Hermite–Halphen ansatz解。(ii) 对数自由曲线$V_{\mathbf{n}, \mathbf{p}}$,当所有$n_i \in rac{1}{2}\mathbb{N}$时出现的非完全交簇,我们证明它是约化曲线,证实了Wang的一个猜想。我们将GLC作为极点配置空间上的代数族进行分析。通过研究加法映射$$\sigma\colon \operatorname{Sym}^n E \longrightarrow E,$$我们建立了一个一般有限、万有的次数公式,表明极点碰撞下边界退化的几何完美地镜像了BGG范畴$\mathcal{O}$中$\mathfrak{sl}_2(\mathbb{C})$-模的张量代数。这提供了建立GLC整体平坦性所需的局部结构极限。此外,我们发展了扭曲等单值形变的框架,并构造了由扭曲单值数据$(t,s)$参数化的$(\mathbf{n}, \mathbf{p})$-变形预模形式。它们的消失解决了底层的单值问题,并沿边界层分解,允许任意配置连续形变到经典拉梅方程。最后,利用渐近缩放技术,我们完全解决了$r=2$对称对的Treibich猜想,将其扩展到$r \leq 4$,并提出了一个枚举所有$r$的对称有限间隙KdV势的一般公式。
We study the generalized Lam'e equation (GLE) on an elliptic curve $E$ with multiple regular singularities $\mathbf{p} = (p_i)_{i = 1}^r$ of weights $\mathbf{n} = (n_i)_{i = 1}^r$. By analyzing the locus admitting quasi-periodic solutions, we construct two fundamental algebraic curves: (i) The generalized Lam'e curve (GLC), $\mathcal{Y}_{\mathbf{n}, \mathbf{p}}$, which lies in an affine bundle over $\operatorname{Sym}^n E$ for total weight $n:=\sum n_i \in \mathbb{Z}_{\geq 0}$ and parametrizes generalized Hermite--Halphen ansatz solutions. (ii) The log-free curve, $V_{\mathbf{n}, \mathbf{p}}$, a non-complete intersection variety arising when all $n_i \in \frac{1}{2}\mathbb{N}$, which we prove is a reduced curve, confirming a conjecture of Wang. We analyze the GLC as an algebraic family over the pole configuration space. By studying the addition map$$σ\colon \operatorname{Sym}^n E \longrightarrow E,$$where we establish a generically finite, universal degree formula, we show that the geometry of boundary degenerations under pole collisions perfectly mirrors the tensor algebra of $\mathfrak{sl}_2(\mathbb{C})$-modules within the BGG category $\mathcal{O}$. This provides the local structural limits needed to establish the global flatness of the GLC. Furthermore, we develop a framework of twisted isomonodromic deformations and construct $(\mathbf{n}, \mathbf{p})$-deformed pre-modular forms parameterized by twisted monodromy data $(t,s)$. Their vanishing solves the underlying monodromy problem and factorizes along boundary strata, allowing an arbitrary configuration to be continuously deformed down to the classical Lam'e equation. Finally, using an asymptotic scaling technique, we completely solve the Treibich conjecture for $r=2$ symmetric pairs, extend it to $r \leq 4$, and propose a general formula enumerating symmetric finite-gap KdV potentials for all $r$.
随机着色图中浓度的一个普遍二分法
Nicola Apollonio
AI总结 本文通过图度序列的欧几里得范数ζ,证明了在顶点随机着色且各类颜色比例有界远离零时,ζ=o(1)导致子图大小集中,而ζ=Θ(1)时浓度依赖于颜色平衡性。
设ζ为图的度序列的欧几里得范数除以图的大小。我们证明,当图的顶点被随机着色为s种颜色,且每种颜色类中顶点的比例有界远离零时,仅出现两种渐近情形。如果ζ=o(1),则颜色类诱导的子图的大小集中在其期望值附近。如果ζ=Θ(1),则浓度取决于颜色平衡:对于具有持续不平衡的着色,单色边的总数M以正概率保持远离其均值;否则,对于消失的不平衡,M仍然集中。同样的二分法适用于一大类随机着色的随机图。
Let $ζ$ be Euclidean norm of the degree sequence of a graph normalized by the graph size. We prove that when the vertices of a graph are randomly colored with $s$ colors such that the fraction of vertices in each color class is bounded away from zero, only two asymptotic regimes emerge. If $ζ=o(1)$, then the sizes of the subgraphs induced by the color classes concentrate around their expected values. If $ζ=Θ(1)$, then concentration depends on the color balance: for colorings with persisting imbalance, the total number $M$ of monochromatic edges stays bounded away from its mean with positive probability; otherwise, for vanishing imbalance, $M$ still concentrates. The same dichotomy holds for a broad class of randomly colored random graphs.
多元数据中方向不对称性与尾部比率偏离的投影诊断
Sayantan Banerjee, Soudeep Deb
AI总结 提出基于投影的诊断方法,通过方向偏度与分位数尾部比率将数据分类为四种模式,避免高阶矩的不稳定性,并建立理论性质。
我们研究基于投影的诊断方法,用于区分多元数据中的方向不对称性与尾部比率偏离。该方法将问题简化为单维投影,并计算两个基于分位数的汇总统计量:在多个分位数水平上评估的方向偏度度量,以及相对于选定基准评估的分位数间尾部比率。这两个汇总统计量导致四类分类:对称基准尾部、对称尾部偏离、偏斜基准尾部和偏斜尾部偏离。分位数公式避免依赖三阶和四阶矩,这些矩在重尾设置中可能不稳定。我们在中心对称性和椭圆性下建立总体性质,在搜索方向上建立均匀有限样本界,以及在分离模式下的阈值分类器一致性。还使用稀疏秩一计算说明为什么坐标方向在高维中可以补充随机方向。所得诊断旨在指导后续建模选择,例如对称、偏斜、尾部偏离或组合多元模型是否合适。
We study projection-based diagnostics for distinguishing directional asymmetry from tail-ratio departure in multivariate data. The procedure reduces the problem to one-dimensional projections and computes two quantile-based summaries: a directional skewness measure evaluated over several quantile levels, and an interquantile tail-ratio evaluated relative to a chosen benchmark. The two summaries lead to a four-regime classification: symmetric benchmark-tail, symmetric tail-departed, skewed benchmark-tail, and skewed tail-departed. The quantile formulation avoids relying on third and fourth moments, which can be unstable in heavy-tailed settings. We establish population properties under central symmetry and ellipticity, uniform finite-sample bounds over the searched directions, and consistency of the threshold classifier under separated regimes. A sparse rank-one calculation is also used to show why coordinate directions can complement random directions in high dimensions. The resulting diagnostic is meant to guide subsequent modelling choices, for example whether a symmetric, skewed, tail-departed, or combined multivariate model is appropriate.
稀疏主成分分析的鲁棒优化方法
David Vävinggren, Francis Bach, André M. H. Teixeira, Dave Zachariah, Antônio H. Ribeiro
AI总结 提出AdvPCA方法,通过鲁棒优化在重建目标中引入最坏情况潜在空间扰动实现稀疏性,并给出闭式解和迭代算法。
虽然主成分分析(PCA)是降维的基本工具,但其稠密表示使其不适用于高维数据。现有方法通过显式的$\ell_1$惩罚来促进稀疏性,但由于任务的无监督性质,这些惩罚不易调整。相比之下,我们提出了对抗性PCA(AdvPCA),它利用鲁棒优化,通过优化针对有界、最坏情况潜在空间扰动的重建目标来实现稀疏性。我们表明,该公式允许闭式约简,从而产生一种实用的迭代算法,该算法交替进行稀疏编码器的对抗性线性回归式更新和解码器的正交更新。通过对解进行理论刻画,我们推导出一种数据自适应参数化,使算法能够开箱即用地有效执行。我们通过在合成和真实世界基因组学数据上的数值实验验证了这些主张。
While principal component analysis (PCA) is a fundamental tool for dimensionality reduction, its dense representations make it ill-suited for high-dimensional data. Existing methods address this by promoting sparsity through explicit $\ell_1$-penalties, but these are not obvious to tune due to the unsupervised nature of the task. In contrast, we propose Adversarial PCA (AdvPCA), which leverages robust optimization to achieve sparsity by optimizing the reconstruction objective against bounded, worst-case latent space perturbations. We show that this formulation admits a closed-form reduction, leading to a practical iterative algorithm that alternates between adversarial linear regression-style updates for the sparse encoder and orthogonal updates for the decoder. By theoretically characterizing the solution, we derive a data-adaptive parameterization that allows the algorithm to perform effectively out of the box. We validate these claims through numerical experiments on synthetic and real-world genomics data.
高阶矢量Potts模型对离散数据的建模
Aaron De Clercq, Merijn Moody, Clélia de Mulatier
AI总结 本文通过引入q态自旋模型,将最大熵框架从二元数据推广到离散数据,提出高阶矢量Potts模型,并利用配分函数的圈展开和规范变换揭示其统计性质,最后聚焦于最小复杂模型实现快速模型选择。
对高维数据进行建模具有挑战性,但对于理解许多复杂系统至关重要。最大熵模型(如Ising模型和Potts模型)已被广泛用于从数据中的相关模式捕获成对相互作用,从而能够从观测(例如,从蛋白质序列或神经群体活动)中推断复杂系统的图形表示。最近,人们对涉及三个或更多变量的高阶相关模式建模的兴趣日益增长。虽然在高阶Ising模型的二元数据方面取得了进展,但我们将此框架扩展到更一般的离散数据情况。我们引入了q态自旋模型,这是一个完整的最大熵模型族,将矢量Potts模型推广到包含长程和任意高阶相互作用。在成对情况下,与标准矢量Potts模型相比,我们的模型允许更多样化的相互作用类型。我们通过示例讨论了它们的统计解释,并将其与离散傅里叶分析联系起来。利用配分函数的圈展开,我们证明了自旋模型的统计性质完全由其相互作用的代数结构所捕获。我们定义了规范变换,在此变换下该结构(以及配分函数)保持不变。规范变换下等价的模型可以被视为同一抽象统计模型的不同表示,尽管通常具有不同阶数的相互作用,这扩展了二元情况的结果。对于数据分析的实际应用,我们专注于二元情况下称为最小复杂模型的一个子集,并将其推广到离散数据。我们获得了这些模型边际似然的闭式表达式,从而能够快速进行模型选择。我们通过简单的真实世界示例说明了它们的用途。
Modeling high-dimensional data is challenging, yet essential to understanding many complex systems. Maximum entropy models such as Ising and Potts models have been used extensively to capture pairwise interactions from correlation patterns in data, allowing to infer graphical representations of complex systems from observations (e.g., from protein sequences or neural population activity). Recently, there has been growing interest in modeling higher-order correlation patterns involving simultaneously three or more variables. While progress has been made in binary data with high-order Ising models, we extend this framework to the more general case of discrete data. We introduce q-state spin models, a complete family of maximum entropy models that generalize the vector Potts model to include long-range and arbitrary high-order interactions. In the pairwise case, our models allow for more diverse interaction types compared to the standard vector Potts model. We discuss their statistical interpretation with examples and relate them to discrete Fourier analysis. Using a loop expansion of the partition function, we show that the statistical properties of spin models are fully captured by the algebraic structure of their interactions. We define gauge transformations under which this structure, and thus the partition function, remains invariant. Models equivalent under gauge transformations can be seen as different representations of the same abstract statistical model, despite generally having interactions of different orders, extending results from the binary case. For practical application to data analysis, we focus on a subset of models known in the binary case as Minimally Complex Models, generalizing them to discrete data. We obtain a closed-form expression for the marginal likelihood of these models, enabling fast model selection. We illustrate their use with simple real-world examples.
高维结果与高维预测变量的快速筛选方法
Hongju Park, Zhenyao Ye, Shuo Chen
AI总结 提出图独立双筛选(GIDS)框架,同时降低响应变量和预测变量的维度,以解决高维交叉模态分析中的计算负担和可解释性问题。
由于超高维度和复杂依赖结构伴随高水平噪声,对多模态高维数据间的交互建模本质上具有挑战性。筛选方法能有效降低维度,但大多数现有方法仅缩减预测变量空间而保留所有结果变量。在交叉模态分析中,不同结果变量通常选择不同的预测变量子集,因此并集仍然很大且响应维度不变,限制了筛选的实际效益。这导致沉重的计算负担和较差的可解释性。为解决这些局限,我们提出一个新的筛选框架——图独立双筛选(GIDS),它同时降低响应变量和预测变量的维度。我们设计了计算高效的算法,促进后续选择过程,提高准确性和可扩展性,并建立了支持性的理论结果。广泛的模拟研究表明,GIDS优于仅筛选预测变量的现有方法。为展示其实用性,我们将GIDS应用于阿尔茨海默病神经影像学倡议(ADNI)数据集,分析全基因组865,353个DNA甲基化与49,386个转录组变量之间的交互。GIDS将特征空间缩减至约9,000个CpG位点和2,000个转录本,揭示了块状交互结构:具有强关联的CpG位点簇和基因转录本簇。这些发现不仅提高了计算可处理性,还产生了可解释的生物学见解,突显了阿尔茨海默病背后的协调调控机制。
Modeling interactions among multimodal, high-dimensional data is intrinsically challenging due to ultra-high dimensionality and complex dependence structure with high level noise. Screening methods are effective for reducing dimensionality, but most existing approaches shrink only the predictor space while retaining all outcomes. In cross-modal analyses, different outcomes often select different predictor subsets, so the union remains large and the response dimension is unchanged, limiting the practical benefit of screening. This gives rise to heavy computational burdens and poor interpretability. To address these limitations, we propose a new screening framework, Graph Independence Dual Screening (GIDS), which simultaneously reduces the dimensionality of response variables and predictors. We design computationally efficient algorithms that facilitate downstream selection procedures, improving accuracy and scalability, and establish supporting theoretical results. Extensive simulation studies demonstrate that GIDS outperforms existing methods that screen only predictors. To illustrate its utility, we applied GIDS to the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, analyzing interactions between genome-wide 865,353 DNA methylation and 49,386 transcriptomic variables. GIDS reduced the feature space to approximately 9,000 CpGs and 2,000 transcripts, uncovering blockwise interaction structures: clusters of CpG sites and gene transcripts with strong associations. These findings not only improve computational tractability but also yield interpretable biological insights, highlighting coordinated regulatory mechanisms underlying Alzheimer's disease.
无限期界最优消费:Epstein-Zin偏好下的跨期对冲
Erhan Bayraktar, Emmet Lawless
AI总结 针对Epstein-Zin随机微分效用下的无限期界消费-投资问题,通过变分刻画价值函数并证明其存在性与正则性,结合测度变换与BSDE唯一性给出最优策略的反馈表示。
我们研究了一个具有Epstein-Zin随机微分效用的投资者在不完全市场中面临随机投资机会的无限期界最优消费-投资问题。风险厌恶和跨期替代被分离,我们工作在$ heta\in(0,1)$的框架下,其中对于任意非负渐进可测消费流存在唯一的广义效用过程。我们的主要贡献是价值函数的变分刻画。我们证明价值函数是一个泛函的唯一极小元,其欧拉-拉格朗日方程与汉密尔顿-雅可比-贝尔曼方程一致。尽管该泛函可能非凸,直接方法仍给出存在性,并且我们证明每个极小元都是严格正的、有界的和经典的。一个验证定理将任意极小元识别为价值函数,并给出最优消费和投资策略的反馈表示。证明结合了向近视概率的测度变换、Epstein-Zin BSDE的唯一性结果以及最优性的扰动论证。具有随机波动率、高斯超额收益和厚尾超额收益的例子说明了该框架的范围及其对跨期对冲的含义。
We study an infinite-horizon optimal consumption-investment problem for an investor with Epstein-Zin stochastic differential utility with stochastic investment opportunities in an incomplete market. Risk aversion and intertemporal substitution are separated, and we work in the regime $θ\in(0,1)$, where there exists a unique generalised utility process for arbitrary non-negative progressively measurable consumption streams. Our main contribution is a variational characterisation of the value function. We show that the value function is the unique minimiser of a functional whose Euler-Lagrange equation coincides with the Hamilton-Jacobi-Bellman equation. Although the functional may be non-convex, the direct method yields existence, and we prove every minimiser is strictly positive, bounded, and classical. A verification theorem identifies any minimiser with the value function and gives feedback representations for optimal consumption and investment policies. The proof combines a change of measure to the myopic probability with uniqueness results for Epstein-Zin BSDEs and a perturbation argument for optimality. Examples with stochastic volatility, Gaussian excess returns, and fat-tailed excess returns illustrate the scope of the framework and its implications for intertemporal hedging.
演化作为因果推断的过程
Jacopo Iacovacci
AI总结 本文提出自然选择应被理解为因果推断过程,利用Neyman-Rubin潜在结果框架形式化突变作为自然实验,并证明平均适应度的代际变化可分解为选择项和突变项。
最近,复制子方程到贝叶斯定理的映射已被认识,导致了演化动力学与贝叶斯学习之间的类比。然而,这种类比仅适用于无限种群中的纯选择,当引入突变——演化的核心机制——时则失效。这里我提出,自然选择下的演化,至少在静态环境中的单倍体复制子种群中,最好不被理解为学习过程,而是因果推断过程。每个突变事件构成一个自然实验,其中亲本作为对照,突变后代作为处理单元。自然选择筛选突变对适应度的因果效应,保留非负效应的突变。我在Neyman-Rubin潜在结果框架内形式化了这一观点。我首先使用通用适应度结果发展了一般理论,并展示了因果推断中的核心识别假设(稳定单位处理值假设、一致性、无混杂性、积极性)如何映射到演化生物学。利用非归一化的准种方程,我证明了平均适应度的代际变化精确分解为一个选择项——恢复了费舍尔基本定理——加上一个突变项,该突变项对应于所有亲本基因型上所有突变的累积效应的适应度加权平均值。我展示了在适当假设下,这种分解扩展到广义复制子-突变子方程,并且匹配的亲本-后代群体的频率根据突变对适应度的平均因果效应成比例更新。
Recently, the mapping of the replicator equation onto Bayes' theorem has been recognised, leading to an analogy between evolutionary dynamics and Bayesian learning. However, this analogy holds only for pure selection in infinite populations and breaks down when mutations -- a central mechanism of evolution -- are introduced. Here I propose that evolution by natural selection, at least for populations of haploid replicators in static environments, is best understood not as a learning process but as a process of causal inference. Each mutation event constitutes a natural experiment in which the parent serves as the control and the mutant offspring as the treated unit. Natural selection screens the causal effect of the mutation on fitness, retaining mutations with non-negative effects. I formalise this view within the Neyman-Rubin potential-outcomes framework. I first develop the general theory using a generic fitness outcome and show how the core identification assumptions in causal inference (Stable Unit Treatment Value Assumption, Consistency, Unconfoundedness, Positivity) map onto evolutionary biology. Using the unnormalised quasispecies equation, I prove that the intergenerational change in mean fitness decomposes exactly into a selection term -- recovering Fisher's Fundamental Theorem -- plus a mutation term that corresponds to a fitness-weighted average of the cumulated effect of all mutations over all parental genotypes. I show that this decomposition extends, under suitable assumptions, to the generalised replicator-mutator equation and that the frequencies of populations of matched parents-offspring update in proportion to the average causal effect of mutations on fitness.
计算多参数二次规划Lipschitz常数的APX难度
Xingchen Li, Kunpeng Liu, Keyou You
AI总结 本文证明了计算多参数二次规划解映射的Lipschitz常数不仅是NP难的,而且是APX难的,并揭示了当约束或决策变量数量固定时问题可多项式求解,且即使在标量参数情形下NP难和APX难仍然存在。
计算多参数二次规划解映射的Lipschitz常数对于基于优化的控制分析至关重要。该问题受三个因素影响:参数维度、决策变量数量和约束数量。虽然经验证据长期表明其指数复杂度,但缺乏严格的复杂性理论证明。本文填补了这一空白,证明该问题不仅是NP难的,而且是APX难的。此外,我们揭示:(a) 当约束或决策变量数量固定时,问题可在多项式时间内求解;(b) 即使在标量参数情形下,NP难和APX难仍然存在。这些结果证实了复杂性源于约束和变量的数量,而非参数维度。数值实验进一步验证了这些理论发现。
Computing the Lipschitz constant of the solution map of a multi-parametric quadratic program is important for the analysis of optimization-based control. This problem is governed by three factors: the parameter dimension, the number of decision variables, and the number of constraints. While empirical evidence has long suggested exponential complexity, a rigorous complexity-theoretic proof has been lacking. In this paper, we fill this gap by proving that this problem is not only NP-hard but also APX-hard. Furthermore, we reveal that: (a) the problem becomes polynomial-time solvable when the number of constraints or decision variables is fixed; and (b) both NP-hardness and APX-hardness persist even in the scalar parameter case. These results confirm that the complexity stems from the number of constraints and variables, rather than the parameter dimension. Numerical experiments further validate these theoretical findings.
形式化所有索引数学作为通用推理的基准,以范畴膨胀的实现为例
A. Mayeux
AI总结 本文提出将全部已发表数学形式化为机器可验证知识库的基准问题,并以范畴代数中的范畴膨胀为例展示实现过程。
形式严谨性将数学与其他学科区分开来,因为数学陈述是通过逻辑可验证的步骤从显式公理推导出来的。交互式定理证明器通过用完全形式化的语言表达定义、定理和证明并机械地验证它们来支持这一点。我们考虑了将所有已发表数学形式化为机器可验证且持续更新的数学知识语料库的基准问题。这种观点将数学视为一个相互依赖结果的结构化数据库,并引发了关于大规模形式库的可扩展性和组织的问题。作为案例研究,我们展示了范畴代数中正在进行的形式化工作,即范畴的膨胀,它扩展了经典局部化,并说明了这种实现在实践中是什么样的。
Formal rigor distinguishes mathematics from other disciplines, in the sense that mathematical statements are derived from explicit axioms by logically verifiable steps. Interactive theorem provers support this by expressing definitions, theorems, and proofs in a fully formal language and verifying them mechanically. We consider the benchmark problem of formalizing all published mathematics as a machine verifiable and continuously updated corpus of mathematical knowledge. This viewpoint treats mathematics as a structured database of interdependent results and raises questions about scalability and organization of large formal libraries. As a case study, we present an ongoing formalization in categorical algebra, namely dilatations of categories, extending classical localizations and illustrating what such an implementation looks like in practice.
熵门:LLM流水线中用于近无损令牌压缩的熵淬火
Justice Owusu Agyemang, Jerry John Kponyo, Kwame Opuni-Boachie Obour Agyekum, Francisca Adoma Acheampong, Kwame Agyeman-Prempeh Agyekum, James Dzisi Gadze
AI总结 提出Entropy Gate框架,通过熵淬火(一种热力学过程)逐步冻结低信息令牌,实现40-60%压缩率且语义保真度高于0.80,并证明其接近信息论极限。
LLM流水线在低信息内容上浪费大量令牌预算:重复上下文、冗长响应和冗余模板。我们引入Entropy Gate,一种令牌压缩框架,应用熵淬火——一种热力学过程,逐步冻结低能量令牌同时保持语义保真度。每个令牌获得一个多因素信息能量$E(t)$,结合统计、结构和位置分量。自适应淬火计划$T(\tau) = T_0 / (1 + \alpha \tau)$移除玻尔兹曼生存概率$p_i = \exp(-E_i / kT)$低于阈值的令牌,并有一个保真度门在能量加权相似度低于$\theta$时停止压缩。我们证明按$E(t)$降序选择令牌最大化预期语义保留,淬火产生嵌套生存集,且可达压缩率接近信息论极限$\text{CR} \to 1 - I(P; T)/H(P)$。第一阶段启发式方法在五种提示类别上实现40-60%压缩,同时保持$S_E > 0.80$,能量平方放大$E \to E^2$额外增加10-25个百分点。上下文去重在重复块上节省50-70%。输出端淬火受简洁性提高准确性的发现启发,进一步减少响应开销。结合外部记忆,压缩在代理工作负载上复合乘数达到88-96%。该框架无状态、模型无关,并作为兼容OpenAI的HTTP代理部署。
LLM pipelines waste substantial token budgets on low-information content: repeated context, verbose responses, and redundant boilerplate. We introduce Entropy Gate, a token compression framework applying entropy quenching $-$ a thermodynamic process that progressively freezes out low-energy tokens while preserving semantic fidelity. Each token receives a multi-factor information energy $E(t)$ combining statistical, structural, and positional components. An adaptive quenching schedule $T(τ) = T_0 / (1 + ατ)$ removes tokens whose Boltzmann survival probability $p_i = \exp(-E_i / kT)$ falls below threshold, with a fidelity gate halting compression when energy-weighted similarity drops below $θ$. We prove token selection by descending $E(t)$ maximizes expected semantic preservation, that quenching produces nested survival sets, and that achievable compression approaches the information-theoretic limit $\text{CR} \to 1 - I(P; T)/H(P)$. A Phase 1 heuristic achieves 40-60% compression across five prompt categories while maintaining $S_E > 0.80$, with energy-squared amplification $E \to E^2$ adding 10-25 percentage points. Context deduplication adds 50-70% savings on repeated blocks. Output-side quenching, motivated by findings that brevity improves accuracy, further reduces response overhead. Combined with external memory, reduction composes multiplicatively to 88-96% for agentic workloads. The framework is stateless, model-agnostic, and deploys as an OpenAI-compatible HTTP proxy.
图正则化非负简化四元数矩阵分解用于彩色图像识别
Hailang Wu, Yonghe Liu, Bingxuan Yu, Chaoqian Li
AI总结 针对非负简化四元数矩阵分解忽略局部几何结构的问题,提出图正则化模型,通过引入图拉普拉斯正则化项保持局部结构,并设计分量交替投影梯度算法,在彩色图像识别中取得竞争性结果。
非负简化四元数矩阵分解(NRBMF)利用简化四元数(RB)矩阵的乘积,将彩色图像像素的非负约束纳入分解过程。然而,NRBMF主要关注重构精度,未利用图像数据的局部几何结构,这可能限制所学低维特征的判别能力。为解决此问题,我们提出了一种图正则化非负简化四元数矩阵分解(GNRBMF)模型用于彩色图像识别。该模型将图拉普拉斯正则化项引入简化四元数系数矩阵,鼓励原始空间中的邻近样本在学习的特征空间中具有相似表示。同时,GNRBMF在简化四元数域中保留了NRBMF的非负保持特性。为求解优化问题,推导了一种分量交替投影梯度算法,并分析了其收敛性。实验结果表明,所提出的GNRBMF模型在某些测试设置下取得了具有竞争力或更优的识别性能。
Non-negative reduced biquaternion matrix factorization (NRBMF) uses the product of reduced biquaternion (RB) matrices to incorporate the non-negativity constraints of color image pixels into the factorization process. However, NRBMF mainly focuses on reconstruction accuracy and does not exploit the local geometric structure of image data, which may limit the discriminative ability of the learned low-dimensional features. To address this issue, we propose a graph regularized non-negative reduced biquaternion matrix factorization (GNRBMF) model for color image recognition. The proposed model incorporates a graph Laplacian regularizer into the reduced biquaternion coefficient matrix, encouraging nearby samples in the original space to have similar representations in the learned feature space. Meanwhile, GNRBMF retains the non-negativity-preserving property of NRBMF in the reduced biquaternion domain. To solve the optimization problem, a component-wise alternating projected gradient algorithm is derived, and its convergence properties are analyzed. Experimental results demonstrate that the proposed GNRBMF model achieves competitive or superior recognition performance in some tested settings.
基于地理不可区分性的信道图位置隐私
Atsu Kokuvi Angélo Passah, Rodrigo C. de Lamare, Arsenia Chorti
AI总结 针对信道制图中的位置隐私问题,提出基于马氏范数平面拉普拉斯机制的图表位置不可区分性框架,在保护隐私的同时保持信道图拓扑结构。
信道制图通过使用信道图中的伪位置,无需显式位置信息即可实现基于位置的服务(LBS)。虽然这一特性意味着固有的隐私优势,但它并不提供正式的隐私保证。在这项工作中,我们解决了信道制图中的位置隐私问题,称为图表位置不可区分性(CLI),它将地理不可区分性(GI)扩展到信道制图表示。为了实现CLI,研究了标准的平面拉普拉斯机制,并设计了一种几何感知的马氏范数平面拉普拉斯(MNPL)机制。所提出的MNPL机制通过注入与图表局部结构对齐的噪声来扰动信道图。在采用MNPL的CLI框架中,隐私使用从图表邻域导出的局部自适应协方差在潜在的信道图流形上定义,同时在隐私约束下保持流形拓扑。此外,差分隐私被考虑作为隐私基线。所提出的方法在多种信道制图方案上进行了评估。性能通过效用度量如质量损失(QL)和范围查询误差(RQE),以及几何感知度量包括可信度(TW)和连续性(CT)进行评估。数值结果表明,所提出的隐私机制在保持用于LBS任务的信道图的同时,提供了强大的隐私保证。
Channel charting enables location-based services (LBSs) without requiring explicit position information by using pseudo-locations from the channel chart. While this property implies inherent privacy advantages, it does not provide formal privacy guarantees. In this work, we address location privacy in channel charting referred to as chart location indistinguishability (CLI), which extends geo-indistinguishability (GI) to channel charting representations. In order to achieve CLI, a standard planar Laplace mechanism is investigated and a geometry-aware Mahalanobis norm planar Laplace (MNPL) mechanism is devised. The proposed MNPL mechanism perturbs the channel chart by injecting noise aligned with the local structure of the chart. In the CLI framework with MNPL, privacy is defined in latent channel chart manifolds using locally adaptive covariance derived from chart neighborhoods, while preserving manifold topology under privacy constraints. In addition, differential privacy is considered as a privacy baseline. The proposed approach is evaluated across multiple channel charting schemes. The performance is assessed using utility metrics such as quality loss (QL) and range query error (RQE), as well as geometry-aware metrics including trustworthiness (TW) and continuity (CT). Numerical results demonstrate that the proposed privacy mechanism provides strong privacy guarantees while preserving the channel chart for LBSs tasks.
随机森林中需要多少棵树?一种结合平台搜索与Optuna集成的重新审视方法
Vadim Porvatov, Andrey Dukhovny, Andrey Lange
AI总结 提出一种基于三元组平台搜索的算法,通过监控袋外分数的相对变化自动确定随机森林的树数量,避免预设搜索范围,并提供了理论分析和实验验证。
随机森林的超参数优化在调整树数量时面临一个特定困难:预测分数通常随集成规模单调提升,因此诸如树结构Parzen估计器(TPE)和Hyperband等标准方法需要预定义搜索范围,且往往将估计推向其右边界。早停策略避免了固定这样的范围,但对分数噪声敏感且容易过早停止。为解决此问题,我们提出一种集成的基于三元组的平台搜索算法,该算法将树数量从直接TPE搜索空间中移除,同时仍利用跨HPO试验积累的信息。该方法通过监控三个森林规模上的袋外(OOB)分数相对变化,自适应地跟踪接近最小的充分集成规模,并相应移动该三元组。这产生了一个基于容差参数的自动化且用户可解释的过程。我们还提供了理论分析:我们将所提出的相对OOB分数准则与当前分数和极限分数之间的差距联系起来,并推导了相应的基于OOB的绝对相对差异的渐近方差估计。实验表明,所选树数量可能与常见启发式方法有显著差异:对于大多数经典基准数据集,它更小;而对于一些高维生物信息学数据集(如Arcene和Dorothea),则更大。源代码和可重复实验可在以下网址获取:https://github.com/your-repo。
Hyperparameter optimization (HPO) for Random Forest faces a specific difficulty in tuning the number of trees: the predictive score typically improves monotonically with ensemble size, so standard methods such as Tree-structured Parzen Estimator (TPE) and Hyperband require a predefined search range and often drive the estimate toward its right boundary. Early-stopping strategies avoid fixing such a range, but can be sensitive to score noise and prone to premature stopping. To address this, we propose an integrated triplet-based plateau-search algorithm that removes the number of trees from the direct TPE search space and still exploits information accumulated across HPO trials. The method adaptively tracks a near-minimal sufficient ensemble size by monitoring relative changes in the out-of-bag (OOB) score across a triplet of forest sizes and shifting this triplet accordingly. This yields an automated and user-interpretable procedure based on a tolerance parameter. We also provide a theoretical analysis: we relate the proposed relative OOB-score criterion to the gap between the current and limiting scores, and derive an asymptotic variance estimate for the corresponding OOB-based absolute relative difference. Experiments show that the selected number of trees can differ substantially from the common heuristic: for most classical benchmark datasets it is smaller, whereas for some high-dimensional bioinformatics datasets, such as Arcene and Dorothea, it is larger. The source code and reproducible experiments are available at https://github.com/lange-am/rf_plateau_hpo.
基于体素的量子计算方法(VBQC)用于固体力学问题
Feng Wu, Yuxiang Yang, Li Zhu, Chen Li, Yansong Guo, Xu Guo
AI总结 提出一种基于体素的量子计算方法(VBQC),通过体素网格离散化使系统矩阵具有三对角分形性质,并利用KCQ分解结合量子傅里叶变换和量子多路复用器实现固体力学中哈密顿量的高效量子模拟。
量子计算为克服大规模力学问题中的效率和内存限制提供了一种有前景的方法,在流体力学中已展示了许多成功应用。然而,由于拉格朗日公式和复杂边界,固体力学问题通常需要不规则网格进行空间离散化,这使得系统矩阵(例如质量矩阵或刚度矩阵,在量子计算中常被称为哈密顿量)的量子模拟难以有效进行。本研究提出了一种基于体素的量子计算方法(VBQC),用于固体力学中哈密顿量的量子模拟。VBQC应用体素网格对空间域进行离散化,从而使系统矩阵具有三对角分形性质。基于这一性质,系统矩阵可以分解为三组基本矩阵:$\mathbf{k}_{n}$、$\mathbf{c}_{n}$和$\mathbf{q}_{n}$。这一分解过程称为KCQ分解。通过将KCQ分解与量子傅里叶变换和量子多路复用器相结合,VBQC能够高效地实现固体力学中哈密顿量的量子模拟。应用了三个不同维度和变量数量的具体固体问题,初步验证了所提出的VBQC在固体力学问题中的正确性。
Quantum computing presents a promising method to overcome the efficiency and memory constraints in large-scale mechanical problems, with numerous successful applications demonstrated in fluid mechanics. However, solid mechanics problems usually require irregular grids for spatial discretization, due to the Lagrange formulations and complex boundaries, which makes the quantum simulation of the system matrix, e.g., the mass or stiffness matrix which is often referred to as the Hamiltonian in quantum computing, difficult to be effectively conducted. This study proposes a voxel-based quantum computing method (VBQC) for the quantum simulation of Hamiltonians in solid mechanics. VBQC applies voxel grids to discretize the spatial domain, thereby enabling the system matrix to exhibit the tridiagonal fractal property. Based on this property, the system matrix can be decomposed into three groups of fundamental matrices, $\mathbf{k}_{n}$, $\mathbf{c}_{n}$, and $\mathbf{q}_{n}$. This decomposition process is referred to as the KCQ decomposition. By integrating the KCQ decomposition with the quantum Fourier transform and the quantum multiplexer, VBQC enables efficient quantum simulation of Hamiltonians in solid mechanics. Three specific solid problems with different dimensions and numbers of variables are applied to preliminarily verify the correctness of the proposed VBQC for solid mechanics problems.
通过线性化优化 Gödel-Löb 逻辑的树-超矢列证明搜索
Tim S. Lyon, Omar Taher
AI总结 针对 GL 逻辑在树-超矢列系统 CSGL 中的可判定性证明和 PSPACE 证明搜索算法问题,提出线性化方法,仅构建单分支推导和树矢列,实现 PSPACE 最优复杂度,并提取有限反模型验证算法正确性。
我们回答了 Poggiolesi 提出的关于树-超矢列系统 CSGL 中 GL 的语法可判定性证明的问题,并解决了 Maggesi 和 Perini Brogi 识别的挑战,他们寻求在表达性矢列形式主义中为 GL 寻找 PSPACE 证明搜索算法。我们使用以(标记)树矢列形式表述的 CSGL 的符号变体。我们的答案是复杂度最优的:我们提出了一种证明搜索算法,该算法判定公式的(不)有效性并在 PSPACE 中运行,与已知的 GL 的 PSPACE 完全性相匹配。为实现这一点,我们引入了一种“线性化方法”,该方法每次只构建推导和树矢列的单个分支,避免了在矢列形式主义中朴素证明搜索典型的指数爆炸。我们展示了如何系统地组合证明搜索期间生成的树矢列片段以提取有限反模型,这作为理论工具用于在证明搜索失败时建立算法的正确性。最后,我们证明每个有效公式都有一个仅由线矢列(对应于线性嵌套矢列)组成的证明。这建立了深度优先证明搜索与线性嵌套矢列演算之间的联系。我们的结果不仅回答了上述问题,还为模态逻辑中树矢列系统的证明搜索和正确性论证提供了新见解。
We answer a question posed by Poggiolesi concerning a syntactic decidability proof for GL in the tree-hypersequent system CSGL, and resolve a challenge identified by Maggesi and Perini Brogi, who sought a PSPACE proof-search algorithm for GL in expressive sequent-based formalisms. We work with a notational variant of CSGL formulated in terms of (labeled) tree sequents. Our answer is complexity-optimal: we present a proof-search algorithm that decides the (in)validity of formulae and runs in PSPACE, matching the known PSPACE-completeness of GL. To achieve this, we introduce a "linearization method," which constructs only a single branch of a derivation and of a tree sequent at a time, avoiding the exponential blowup typical of naive proof-search in sequent formalisms. We show how to systematically combine fragments of tree sequents generated during proof-search to extract finite counter-models, which serves as a theoretical device for establishing the correctness of the algorithm when proof-search fails. Finally, we show that every valid formula admits a proof consisting solely of line sequents, which correspond to linear nested sequents. This establishes a connection between depth-first proof-search and linear nested sequent calculi. Our results not only answer the aforementioned questions, but also provide new insights into proof-search and correctness arguments in tree sequent systems for modal logics.
LTL的非良基与循环证明:与线性嵌套相继式的句法对应
Tim S. Lyon, Lukas Zenger
AI总结 本文针对线性时序逻辑(LTL),引入并研究非良基和循环线性嵌套相继式演算,通过饱和递归和规则前移技术,解决了从非良基证明中提取循环证明(循环识别)以及反向转换(展开)两个核心问题。
我们引入并研究了非良基和循环线性嵌套相继式演算,并以线性时序逻辑(LTL)为案例开发了此类系统。本文解决了两个核心问题,即“循环识别”和“展开”。循环识别涉及识别非良基证明中的循环,以提取相应的循环证明;而展开研究反向转换,即从循环证明到非良基证明。尽管这些过程在Gentzen相继式中已被充分理解,但对于更具表达力的相继式形式化方法却鲜有关注,并且在线性嵌套相继式设置中变得更加具有挑战性。为了解决循环识别,我们证明了非良基证明相对于一种特定范式的完备性,该范式具有我们称为“饱和递归”的性质,从而能够系统地提取循环证明。为了解决展开,我们引入了一种专门的过程,将规则应用沿线性嵌套相继式向前移动,从而允许从循环证明重构非良基证明。总体而言,我们的工作为表达性多相继式形式化方法中的循环识别和展开提供了新的证明论技术。
We introduce and investigate non-wellfounded and cyclic linear nested sequent calculi, and, as a case study, develop such systems for linear temporal logic (LTL). The paper addresses two central problems, which we call 'cycle recognition' and 'unraveling.' Cycle recognition concerns identifying cycles in non-wellfounded proofs in order to extract corresponding cyclic proofs, while unraveling studies the converse transformation, from cyclic proofs to non-wellfounded ones. Although these processes are well understood for Gentzen sequents, they have received little attention for more expressive sequent formalisms and become more challenging in the linear nested sequent setting. To address cycle recognition, we show the completeness of non-wellfounded proofs relative to a particular normal form exhibiting a property we call 'saturation recurrence,' which enables the systematic extraction of cyclic proofs. To address unraveling, we introduce a specialized procedure that shifts rule applications forward along linear nested sequents, allowing non-wellfounded proofs to be reconstructed from cyclic ones. Overall, our work provides new proof-theoretic techniques for cycle recognition and unraveling in expressive multisequent formalisms.
Isbell 核中的类型演算
Juan Luis Gastaldi, Samantha Jarvis, Thomas Seiller, John Terilla
AI总结 本文证明线性逻辑中的双正交闭包与富化Isbell对偶中的核构造在给定最小数据(执行乘积和实值度量)下产生相同对象,并由此导出非交换Lambek演算。
我们识别了来自不同数学传统的两种构造。在线性逻辑和实现性中,逻辑类型是生成的而非预先固定的:从一个配备执行(execution)的实现者宇宙开始,使用正交性测试它们的交互,并将类型取为双正交闭子集。在富化Isbell对偶中,一个定量关系诱导一个伴随,其不动点构成一个范畴,即它的核。这些构造通过不同方式进行;我们证明,在当前设定下,它们产生相同的对象。共享的数据是最小的:一个称为执行的结合性乘积,以及一个实值度量,两者之间不假设任何兼容性。度量非加性这一点同时是定义正交性的关系以及我们形成其Isbell核的定量关系,而由正交性切割出的类型恰好是相关伴随的不动点。这一识别在双向都有收益。最自然的类型乘积不满足结合性;修复这一缺陷迫使采用不同的类型概念,对复合的两侧都敏感,使得诱导的乘积是结合的,并且当执行有单位时,携带两个余项。由此产生的是一个非交换的Lambek演算,直接从执行和正交性导出而非强加。在反向方向上,每个这样的类型,从范畴侧解读,生成它自己的定量关系,并随之产生一个导出的伴随和进一步的类型生成;这些导出的类型再次是原始情况的类型,由Lambek演算的余项计算。我们还证明了这一构造的三重排列的相干定理,并在有限维情形下给出了乘积的显式公式。
We identify two constructions from different mathematical traditions. In linear logic and realisability, logical types are generated rather than fixed in advance: one begins with a universe of realisers equipped with execution, uses orthogonality to test their interactions, and takes types to be the biorthogonally closed subsets. In enriched Isbell duality, a quantitative relation induces an adjunction whose fixed points form a category, its nucleus. These constructions proceed by different means; we show that, in the present setting, they produce the same objects. The shared datum is minimal: an associative product, called execution, and a real-valued measurement, with no compatibility assumed between them. The failure of the measurement to be additive is at once the relation defining orthogonality and the quantitative relation whose Isbell nucleus we form, and the types cut out by orthogonality are exactly the fixed points of the associated adjunction. The identification pays off in both directions. The most natural product of types fails to be associative; repairing this failure forces a different notion of type, sensitive to both sides of a composite, on which the induced product is associative and, when execution has units, carries two residuals. What emerges is a noncommutative Lambek calculus, derived directly from execution and orthogonality rather than imposed. In the reverse direction, each such type, read on the categorical side, generates a quantitative relation of its own, and with it a derived adjunction and a further generation of types; these derived types are again types of the original situation, computed by the residuals of the Lambek calculus. We also prove a coherence theorem for the threefold arrangements of this construction and, in the finite-dimensional case, give explicit formulas for the product.
反射计数系统 I:全局视角
Benoît Rittaud
AI总结 提出一个框架,通过引入Z-Gray积和理论工具,将标准b进制格雷码推广到k-bonacci格雷码及其他类型,并保持幂结合性和翻转数字性质。
我们提出了一个框架,将标准的b进制格雷码推广到[5]中得到的k-bonacci格雷码以及许多其他类型,通过使用允许在列表上进行计算的理论工具。我们引入了Z-Gray积的概念,由此推导出避免预定义因子列表Z的有限单词列表序列,这些序列满足幂结合性以及经典翻转数字性质的推广。
We present a framework to generalize the standard b-ary Gray code to get the k-bonacci ones obtained in [5] as well as many others by using theoretical tools that allow to make calculations on lists. We introduce the notion of Z-Gray product, from which we deduce sequences of lists of finite words avoiding a predefinite list Z of factors and which satisfy a power-associativity property as well a generalizations of the classical flipping digit property.
Let There Be Light: 面向神经算子的反射、折射与散射
Keke Wu, Yixuan Zhang, Jingrun Chen
AI总结 提出一种受光传输启发的神经算子LiNO,通过反射、折射和散射三种机制分解潜在演化,实现局部特征调制与全局空间通信的结构化分离,并开发高效散射变体将空间复杂度从二次降至线性。
神经算子学习无限维函数空间之间的映射,为参数化偏微分方程(PDE)提供数据驱动的代理建模范式。现有架构通常通过在指定变换域中参数化积分核,或对离散空间点应用类似注意力的交互来获得表达能力。尽管这些方法取得了显著进展,但它们常常面临物理可解释性、非局部空间通信、网格可扩展性和计算成本之间的持续权衡。我们提出了一种光启发的神经算子(LiNO),其潜在演化被分解为由基本光传输启发的三种机制:反射、折射和散射。反射和折射在潜在特征空间中充当自适应逐点变换,实现局部特征重定向和各向异性调制,而散射则在物理域上执行输入依赖的非局部传播。我们首先将散射公式化为具有相对位置偏置的归一化成对核,然后开发了一种高效的散射变体,用正特征全局传播和局部扩散分支替代显式的成对交互,将主导空间复杂度从二次降至线性。这产生了一个结构化的神经算子,将局部特征调制与全局空间通信分离,同时保留了模块化和可解释的潜在演化。
Neural operators learn mappings between infinite-dimensional function spaces and provide a data-driven surrogate modeling paradigm for parametric partial differential equations (PDEs). Existing architectures typically obtain expressivity by parameterizing integral kernels in prescribed transform domains or by applying attention-like interactions over discretized spatial points. While these approaches have achieved substantial progress, they often face a persistent trade-off among physical interpretability, nonlocal spatial communication, mesh scalability, and computational cost. We propose a Light-inspired neural operator(LiNO), an operator-learning architecture whose latent evolution is decomposed into three mechanisms motivated by elementary light transport: reflection, refraction, and scattering. Reflection and refraction act as adaptive pointwise transformations in latent feature space, enabling local feature reorientation and anisotropic modulation, whereas scattering performs input-dependent nonlocal propagation over the physical domain. We first formulate scattering as a normalized pairwise kernel with relative positional bias, and then develop an efficient scattering variant that replaces explicit pairwise interactions with positive-feature global propagation and a local diffusion branch, reducing the dominant spatial complexity from quadratic to linear. This yields a structured neural operator that separates local feature modulation from global spatial communication while retaining a modular and interpretable latent evolution.
PINN 在 FWD 反分析中的批判性评估及可微有限元方法作为替代方案
Yongjin Choi, Hyeonbin Moon, Seunghwa Ryu
AI总结 本文批判性评估了物理信息神经网络(PINN)在多层路面系统落锤式弯沉仪(FWD)反分析中的表现,并提出可微有限元方法(DiffFEM)作为更准确、稳定和高效的替代方案。
基于自动微分的反分析方法,包括物理信息神经网络(PINN)和可微编程,最近因其计算精确梯度和收敛效率的能力而显示出巨大潜力。然而,它们对落锤式弯沉仪(FWD)反计算的适用性尚未被探索。本研究基于合成基准,批判性评估了基于PINN的多层路面系统反分析,并研究了可微有限元方法(DiffFEM)作为替代方案。标准PINN由于层状路面系统固有的尖锐域不连续性而无法恢复层模量。尽管我们使用了具有域分解的扩展PINN(XPINN),它在不连续域上表现更好,但其性能仍然对损失权重和网络架构高度敏感,并且在测量噪声下会退化。相比之下,DiffFEM始终获得更准确、稳定且计算高效的反演结果。这些结果表明,将控制物理作为硬约束强加的DiffFEM比基于PINN的方法(其中控制物理通过损失函数作为软约束施加)具有更好的准确性、鲁棒性和计算效率。更广泛地说,研究结果表明,在基于PINN和DiffFEM的反分析之间进行选择需要仔细考虑,当存在高效且稳健的可微正演求解器时,DiffFEM提供了实际优势。
Automatic-differentiation-based inverse analysis methods, including physics-informed neural networks (PINNs) and differentiable programming, have recently shown great promise due to their ability to compute accurate gradients and convergence efficiency. However, their applicability to falling weight deflectometer (FWD) backcalculation remains unexplored. This study critically evaluates PINN-based inverse analysis for a multilayer pavement system and investigates differentiable finite element method (DiffFEM) as an alternative based on a synthetic benchmark. The standard PINN does not recover layer moduli because of the sharp domain discontinuities inherent to layered pavement systems. Although we use an extended PINN with domain decomposition (XPINN), which shows better performance on discontinuous domains, its performance remains highly sensitive to loss weighting and network architecture, and degrades under measurement noise. By contrast, DiffFEM consistently achieves more accurate, stable, and computationally efficient inversion results. These results indicate that DiffFEM, which enforces the governing physics as a hard constraint, yields better accuracy, robustness, and computational efficiency than PINN-based approaches, in which the governing physics is imposed as a soft constraint through the loss function. More broadly, the findings suggest that the choice between PINN- and DiffFEM-based inverse analysis needs careful consideration, with DiffFEM offering practical advantages when an efficient and robust differentiable forward solver is available.
Lean 4 机器验证的 P = NP 证明:基于谱系多面体成员问题
T. S. Arthanari
AI总结 本文通过递归构造分层网络和多重商品流问题,证明谱系多面体成员问题(M3P)可在强多项式时间内求解,进而由对称旅行商问题(STSP)归约得到 P = NP,并在 Lean 4 中完成机器验证。
谱系多面体成员问题(M3P)询问:给定 $X\in\mathbb{Q}^{\binom{n}{3}}$,是否 $X\in\mathrm{conv}(P_n)$,其中 $P_n$ 是所有谱系的集合。谱系是 $K_n$ 中哈密顿圈构造的结构化编码。我们通过递归构造的分层网络 $(N_k, R_k, \mu)$ 和多重商品流问题 MCF$(k)$ 证明 M3P 可在强多项式时间内求解。建立的成员必要充分条件是 MCF$(n-1)$ 中的最优总流量等于最大可能流量 $z_{\max}$。基于 Tardos(1986)的组合线性规划强多项式算法,复杂度分析表明该条件可在所涉及矩阵维度的强多项式时间内检验。由充分性,这意味着 M3P $\in$ P。由于对称旅行商问题(STSP)可通过多阶段插入(MI)公式(Arthanari 1983)归约到 M3P,STSP 可在多项式时间内求解,P 与 NP 问题由此解决。导致该结果的所有证明已在 Lean 4/Mathlib4 中完全机器验证,主证明链中无未解决的 \texttt{sorry}。主要贡献是对主链中所有证明的 Lean 4 机器验证,得到 \texttt{theorem p\_equals\_np}: P = NP。Lean 4 形式化验证覆盖了 MCF(n-1) 对 $\mathrm{conv}(P_n)$ 成员性的充分性,以及通过 Maurras (2002)、Grötschel–Lovász–Schrijver (1988)、Cook (1971) 和 Karp (1972) 得到的 P = NP 链。完整的 Lean 项目(36 个 Lean 4 文件,2968/2968 构建目标干净)可在该 https URL 获取。
The Membership Problem for Pedigree Polytope (M3P) asks, given $X\in\mathbb{Q}^{\binom{n}{3}}$, whether $X\in\mathrm{conv}(P_n)$, where $P_n$ is the set of all pedigrees. A pedigree is a structured encoding of a Hamiltonian cycle construction in $K_n$. We establish that M3P is solvable in strongly polynomial time via a recursively constructed layered network $(N_k, R_k, μ)$ and a multicommodity flow problem MCF$(k)$. The necessary and sufficient condition for membership established is that the optimal total flow in MCF$(n-1)$ equals the maximum possible flow $z_{\max}$. The complexity analysis, grounded in Tardos's strongly polynomial algorithm for combinatorial linear programs (1986), shows that this condition can be checked in strongly polynomial time in the dimension of the matrix involved. By sufficiency, this implies M3P~$\in$~P. Since the Symmetric Travelling Salesman Problem (STSP) reduces to M3P via the Multistage Insertion (MI) formulation (Arthanari 1983), STSP is solvable in polynomial time, and the P vs.NP question is resolved. The proofs leading to this result are fully machine-verified in Lean~4/Mathlib4, with zero unresolved \texttt{sorry}s in the main proof chain. The main contribution is the Lean~4 machine verification of all proofs in the main chain, resulting in \texttt{theorem p\_equals\_np}: P = NP. The Lean~4 formal verification covers the sufficiency of MCF(n-1) for membership in $\mathrm{conv}(P_n)$, and the P = NP chain via Maurras (2002), Grötschel--Lovász--Schrijver (1988), Cook (1971), and Karp (1972). The complete lean project (36 Lean~4 files, 2968/2968 build targets clean) is available at https://github.com/TiruArt/Pedigree-Polytopes-Lean4.
神经网络可证明地学习群组合的谱表示
Jianliang He, Leda Wang, Fengzhuo Zhang, Siyu Chen, Zhuoran Yang
AI总结 通过将投影梯度流提升到傅里叶域,证明两层神经网络在群组合任务中几乎必然收敛到单个不可约表示,并揭示了表示论视角下的特征学习和低秩压缩现象。
理解神经网络训练过程中结构化内部结构如何涌现是深度学习研究的核心。我们通过群组合任务研究这一现象,其中训练一个两层神经网络来预测有限群 $G$ 中元素的 $g_1 \star g_2$。通过将投影梯度流提升到傅里叶域,我们证明训练动力学由一个表示论能量泛函上的黎曼梯度上升控制。我们证明,在随机初始化下,该流驱动每个神经元几乎必然收敛到单个不可约表示,而跨层傅里叶系数实现旋转秩一对齐。该框架提供了特征学习的表示论解释,并刻画了矩阵值群表示的一种新颖的低秩压缩现象。此外,对于阿贝尔群,我们提供了完整的总体水平描述:随机初始化促进非平凡表示上的均匀多样化,并诱导 Haar 均匀相位,通过多数投票机制联合逼近指示函数。我们进一步证明相位对齐和表示竞争都以指数收敛速率出现。
Understanding how structured internal structure emerges during neural network training is central to the study of deep learning. We investigate this phenomenon through the group composition task, where a two-layer neural network is trained to predict $g_1 \star g_2$ for elements of a finite group $G$. By lifting the projected gradient flow to the Fourier domain, we demonstrate that the training dynamics are governed by a Riemannian gradient ascent on a representation-theoretic energy functional. We prove that, under random initialization, this flow drives each neuron to converge almost surely toward a single irreducible representation, while the cross-layer Fourier coefficients achieve a rotational rank-one alignment. This framework provides a representation-theoretic account of feature learning and characterizes a novel low-rank compression phenomenon for matrix-valued group representations. Moreover, for Abelian groups, we provide a complete population-level description: random initialization promotes uniform diversification across nontrivial representations and induces Haar-uniform phases, jointly approximating the indicator via a majority-vote mechanism. We further prove that both phase alignment and representation competition emerge with exponential convergence rates.
柔性2自由度机械臂的混合动力学建模
Maciek Popik, Daniel Yang, Mahdis Bisheban
AI总结 针对刚性模型无法捕获的未建模动力学,本文结合刚体动力学与高斯混合模型或纯数据驱动回归,对柔性2自由度机械臂进行混合建模,并比较了不同方法的扭矩预测精度。
本文研究了三种对柔性连杆2自由度机械臂动力学进行建模的方法,以解决刚体模型无法捕获的未建模动力学。两种物理信息模型将刚体动力学(RBD)公式与高斯混合模型(GMM)相结合,以捕获残差模型误差和连杆柔性。一个基于运动学的回归模型作为纯数据驱动的基线。使用开源数据集,首先通过运动学特征的岭回归估计扭矩预测,而基于物理的基线则根据公布的规格构建,随后使用普通最小二乘回归直接从数据估计相同的参数集。结果表明,基于物理的参数精度最差,而正则化和最小二乘估计器与实测扭矩更吻合。残差分析和误差指标凸显了纯参数模型在柔性连杆系统中的局限性,并强调了正则化和数据驱动辨识的价值,支持了半参数残差学习方法的发展。
This paper examines three approaches for modeling the dynamics of a flexible-link 2-DoF robotic arm to address unmodeled dynamics not captured by rigid-body models. Two physics informed models combine rigid-body dynamics (RBD) formulations with a Gaussian Mixture Model (GMM) to capture residual model errors and linkage flexibility. A kinematics-based regression model serves as a purely data-driven baseline. Using an open-source dataset, torque predictions are first estimated using Ridge regression on kinematic features, while the physicsbased baseline is constructed from published specifications, and ordinary least-squares regression is subsequently used to estimate the same parameter set directly from data. Results show that the physics-based parameters yield the poorest accuracy, while regularized and least-squares estimators align more closely with measured torques. Residual analysis and error metrics highlight the limitations of purely parametric models for flexible-link systems and underscore the value of regularization and data-driven identification, supporting developments of semi-parametric residual learning methods.
量化公共计量发布中的侧信道泄漏
Faruk Alpay, Taylan Alpay
AI总结 本文形式化并量化了公共科学和计量发布中侧信道泄漏的风险,提出了一种基于统计侧信道审计的框架,并导出了有限带宽传输泄漏定律。
公共科学和计量发布可能泄漏产生它们的隐藏设置。我们形式化并量化了这种风险,将其视为一种轮廓统计侧信道审计:发布映射暴露了功率谱密度(PSD)的有限带宽统计量,轮廓观察者在明确预算下训练标记模板谱,挑战发布从两个效用等价的配方中抽取,这些配方由一个受保护坐标分隔。平均PSD bin遵循伽马信道,当bin相关时,由协方差加权对数谱信道替代;这产生了精确的Kullback-Leibler散度、Chernoff指数、受保护比特优势界限以及有限训练、有限库、有限计算和模型失配校正。我们的主要结果是一个有限带宽传输泄漏定律:在消除幅度和模糊后,受保护的酸传输信息满足$I_{\lambda|\alpha,eta}(K) = (64/1225)\, w \lambda^{6} K^{9} + O(w \lambda^{8} K^{11})$,其中$K\lambda \ll 1$,这是一个具有闭式安全带的九阶指数。一个逐步协议将测量发布转化为这些数字,一个固定种子的可重复性包重新生成每个表格和图形。我们将该审计应用于筛选的极紫外(EUV)粗糙度谱作为模型条件案例研究,下一步是部署在测量发布上。
Public scientific and metrology releases can leak the hidden settings that produced them. We formalize and quantify this risk as a profiled statistical side-channel audit: a release map exposes finite-band statistics of a power spectral density (PSD), a profiled observer trains labeled template spectra under an explicit budget, and a challenge release is drawn from one of two utility-equivalent recipes separated by a protected coordinate. Averaged PSD bins follow a gamma channel, replaced by a covariance-weighted log-spectrum channel when the bins are correlated; this yields exact Kullback-Leibler divergences, Chernoff exponents, protected-bit advantage bounds, and finite-training, finite-library, finite-compute, and model-mismatch corrections. Our headline result is a finite-band transport-leakage law: after amplitude and blur are eliminated, the protected acid-transport information obeys $I_{λ|α,β}(K) = (64/1225)\, w λ^{6} K^{9} + O(w λ^{8} K^{11})$ for $Kλ\ll 1$, a ninth-order exponent with a closed-form safe band. A step-by-step protocol turns a measured release into these numbers, and a fixed-seed reproducibility package regenerates every table and figure. We instantiate the audit on screened extreme-ultraviolet (EUV) roughness spectra as a model-conditioned case study, with deployment on measured releases the next step.
一种用于对称非负矩阵分解和图聚类的非单调梯度算法
Ryan Swart, Johannes Brust
AI总结 提出SNMPBB算法,首次将非单调投影Barzilai-Borwein方法应用于对称非负矩阵分解,并扩展至图聚类和大规模问题,证明全局收敛性,实验显示显著加速和精度提升。
对称非负矩阵分解(Symmetric NMF)将矩阵近似为 $WW^T$,其中 $W$ 是非负矩形因子。它在图聚类和机器学习中有广泛应用。与NMF相比,投影梯度方法在对称问题上收敛缓慢。为了解决这个问题,我们引入了SNMPBB,这是非单调投影Barzilai-Borwein方法在对称NMF上的首次应用,表明梯度算法比以前认为的有效得多。我们进一步将SNMPBB扩展到使用图拉普拉斯正则化的图聚类(Graph-SNMPBB)以及使用低秩近似的大规模问题(LAI-SNMPBB)。对于所有变体,我们证明了全局收敛到一阶稳定点,并且Barzilai-Borwein曲率信息在随机近似下得以保留。在合成数据上,SNMPBB在相似残差下比替代的SymANLS快6倍,且优势随秩增加而扩大。在六个真实世界聚类基准测试中,Graph-SNMPBB匹配或超过了SymANLS的精度。最后,LAI-SNMPBB在34个SuiteSparse矩阵上,在运行时间和残差质量方面均优于最先进的LAI-SymPGNCG。
Symmetric nonnegative matrix factorization (Symmetric NMF) approximates a matrix as $WW^T$ with nonnegative rectangular factor $W$. It has broad applications in graph clustering and machine learning. In contrast to the NMF, projected gradient methods for the symmetric problem had been associated with slow convergence. To address this, we introduce SNMPBB, the first adaptation of nonmonotone projected Barzilai-Borwein methods to Symmetric NMF, demonstrating that gradient algorithms are significantly more effective than previously understood. We further extend SNMPBB to graph clustering using the graph Laplacian regularization (Graph-SNMPBB) and to large problems with low-rank approximations (LAI-SNMPBB). For all variants we prove global convergence to first-order stationary points and also that Barzilai-Borwein curvature information is preserved with randomized approximations. On synthetic data, SNMPBB achieves 6 times speedup over the alternative SymANLS for similar residuals, with advantages growing at higher ranks. Across six real-world clustering benchmarks, Graph-SNMPBB matches or exceeds SymANLS accuracy. Lastly, LAI-SNMPBB outperforms state-of-the-art LAI-SymPGNCG on 34 SuiteSparse matrices in both runtime and residual quality.
Clef 编程语言中的不动点脚手架
Houston Haynes
AI总结 本文提出一种基于不动点组合子的编译器中间表示方法,通过范畴论构造实现类型结构保持和正确性验证,并利用 MLIR 工具链支撑实际编译。
对于 Gabriel 的“更糟即更好”的粉丝来说,讽刺的是,C++ 通过 MLIR 充当了编译 ML 家族语言的脚手架,而该语言的正确性属性是结构性的。在我们的 Composer 编译器中,一个关键的交叉点以不动点组合子开始其降级,该组合子保留了程序语义图中的维度、等级、转义和数值表示结构。而来自 PSG 的 MLIR 并非被动宿主。它使用静态单赋值、属性系统和方言来实质性地承载这些结构。我们展示了编译器中间端使用范畴论构造来降级代码,并伴随该层级的验证:一个从编译偏序集到目标范畴的函子,满足组合性方程。我们方法的依据来自三个来源,每个都基于自身的代数对象:Ohori 的机器码证明理论为编译轴提供基础,参数性为底层内容提供基础,伴随模态逻辑为验证层级之间的遍历提供基础。为了扩展论点,我们引入了紧闭负类型和分数类型,并展示了类型机制可以在保持结构的情况下实现,并通过 MLIR 提供的工具实现。更广泛地说,通过编译保持类型的相同不动点原语也提供了证明项,这些证明项可以在 MLIR 中继续使用,以验证其完整性,因为降级过程通过流水线进行。我们认为,这个基础是我们框架预期的一个独特附加点,包括维度类型、Tarau 的群胚和细胞层。在整个过程中,形式化被实现为内部脚手架:抽象支持编译器的机制,开发者永远不需要接触范畴论就能依赖编译器提供的保证。
For fans of Gabriel's "Worse is Better" it may be ironic that C++, by way of MLIR, serves as the scaffold for compiling an ML-family language whose correctness properties are structural. A crucial intersection in our Composer compiler initiates its lowering with a fixed-point combinator that preserves the dimensional, grade, escape, and numeric-representation structure from the Program Semantic Graph. And the MLIR that's witnessed from the PSG is no passive host. Its use of static single assignment, attribute system and dialects carry that structure materially. We show that our compiler middle end uses categorical construction for lowering code with companion verification to that strata: a functor from the compilation poset to a target category, subject to the compositionality equation. The grounding of our approach comes from three sources, each on its own algebraic object: Ohori's machine-code proof theory grounds the compilation axis, parametricity grounds the content at the base, and adjoint mode logic grounds the traversal between our verification tiers. To extend the thesis we introduce compact-closed negative and fractional types, and show the type machinery can be carried with preserved structure and realized through tooling MLIR provides. More broadly, the same fixed-point primitive that preserves types through compilation also supplies proof terms that can continue to be exercised in MLIR to verify its integrity as lowering proceeds through the pipeline. We argue that this foundation is a unique additional point anticipated by our framework that includes dimensional types, Tarau's groupoid, and cellular sheaves. Throughout, the formalism is instrumented as an internal scaffold: the abstractions support the compiler's mechanics, where a developer is never required to reach for category theory in order to rely on the guarantees the compiler provides.
学习一致表示:一种拓扑可解释性方法
Sigurd Gaukstad, Melvin Vaupel, Valdemar Kargård Olsen, Erik Hermansen, Benjamin Dunn
AI总结 提出基于脑神经编码启发的“一致性”几何约束,通过Fréchet方差目标函数Coh训练模型,使特征在样本空间中形成连续区域,从而提升表示的可解释性。
深度神经网络学习的表示中,单个特征往往缺乏可解释意义;一个神经元可能对分散、不相关的输入激活。我们引入一致性,这是一种受大脑神经编码启发的几何性质,其中像网格细胞和头部方向细胞这样的神经元对状态空间的连续区域做出响应。一个非负矩阵是一致的,如果每个行(样本)关注几何上聚类的列(特征),反之亦然,并且每个样本都由某个特征很好地描述,每个特征都被某个样本需要。我们证明一致矩阵在样本和特征的Vietoris-Rips过滤之间诱导有界交错,保证两个空间共享兼容的拓扑结构。这种几何约束促进了可解释性。例如,如果数据位于圆上,一致特征必须将该圆分割成连续的弧段。我们引入Coh,一种基于Fréchet方差的可微目标函数,在训练过程中强制执行一致性。与稀疏性(限制一个特征激活多少个样本)不同,一致性限制哪些样本,要求几何连通性而不仅仅是稀有性。这不仅产生可解释的特征,还产生可解释的特征空间。我们使用合成数据和旋转MNIST数据集在自编码器中验证Coh,并使用语言数据在BERT的词嵌入中验证Coh。
Deep neural networks learn representations where individual features often lack interpretable meaning; a single neuron may activate for scattered, unrelated inputs. We introduce coherence, a geometric property inspired by neural coding in the brain, where neurons like grid cells and head direction cells respond to contiguous regions of state space. A non-negative matrix is coherent if each row (sample) attends to geometrically clustered columns (features) and vice versa, and in addition every sample is well described by some feature and every feature is needed by some sample. We prove that coherent matrices induce a bounded interleaving between the Vietoris-Rips filtrations of samples and features, guaranteeing that both spaces share compatible topological structure. This geometric constraint facilitates interpretability. For example, if data lies on a circle, coherent features must tile that circle into contiguous arcs. We introduce Coh, a differentiable objective function based on Fréchet variance that enforces coherence during training. Unlike sparsity, which bounds how many samples a feature activates on, coherence bounds which samples, requiring geometric connectivity rather than only rarity. This yields not just interpretable features but an interpretable feature space. We validate Coh in an auto-encoder using synthetic and rotated MNIST datasets and in a token embedding of BERT using language data.