arXivDaily arXiv每日学术速递 周一至周五更新
重置
2604.18577 2026-04-21 math.CO

On Chromatic Asymptotic Approximate Groups

Arindam Biswas

详情
英文摘要

We study a chromatic theory of asymptotic approximate groups for tuples of subsets of abelian groups, combining Nathanson's chromatic sumset formalism with asymptotic covering ideas from approximate group theory. This framework encodes simultaneous additive growth across several color classes. We show some general lifting and invariance principles, establish chromatic covering theorems for finite tuples and for tuples whose color classes are finite unions of unbounded linear sets, and obtain exact structure theorems for translated submonoids and finite-set-plus-submonoid sets. We also obtain sharper binomial bounds in the finite and unbounded-linear cases than the previous lattice-covering estimates. In the integer setting, we show that for each fixed threshold $t$, the threshold-$t$ chromatic layers form an asymptotic approximate family, using Nathanson's eventual interval-plus-edges description to obtain a uniform bound of size $r+2$ and prove an inhomogeneous extension for certain families.

2604.18558 2026-04-21 math.PR math-ph math.MP

Uniform analyticity of local observables in FK-percolation and analyticity of the Ising spontaneous magnetisation

Lucas D'Alimonte, Loïc Gassmann

Comments 27 pages. Comments welcome

详情
英文摘要

We prove that, in the FK-percolation model, the probabilities of local events are uniformly analytic in the percolation parameter $p$ under suitable mixing assumptions on the measure, and satisfy a uniform exponential growth bound. This result allows us to prove that the magnetisation of the Potts model is analytic in a suitable range of parameters, including the Ising case in all dimensions $d \geq 3$ in the whole supercritical regime. We also provide a proof of the analyticity of the susceptibility of the Potts model with $q$ colours, for any $q \geq 2$ in the whole subcritical interval. Finally, we prove the analyticity of various quantities in the FK-percolation measure, including the multi-point and truncated multi-point connectivity probabilities.

2604.18554 2026-04-21 math.DG

The hypersymplectic flow descended from the $G_2$-Laplacian coflow

Amanda Maria Petcu

Comments 20 pages

详情
英文摘要

A conjecture of Simon Donaldson is that on a compact $4$-manifold $X^4$ one can flow from a hypersymplectic structure to a hyperkähler structure while remaining in the same cohomology class. To this end the hypersymplectic flow was introduced by Fine-Yao. In this paper the notion of a positive triple on $X^4$ is used to describe a hypersymplectic and hyperkähler structure. Given a closed positive triple one can define either a closed $G_2$ structure or a coclosed $G_2$ structure on $\mathbb{T}^3 \times X^4$. The coclosed $G_2$ structure is evolved under the modified $G_2$-Laplacian coflow. The coflow descends to a flow of the positive triple on $X^4$, which is again the Fine-Yao hypersymplectic flow.

2604.18551 2026-04-21 math.QA

Non-linear Lie Conformal Algebras and One-Loop Corrections of self-dual Yang-Mills amplitudes

Charles Igel, Jeremy Mancinas, Juan Villarreal

Comments 13 Pages, 0 figures, submitted to Letters in Mathematical Physics

详情
英文摘要

This work is motivated by recent developments in celestial holography. In \cite{CP}, the authors interpreted QCD collinear singularities in terms of operator product expansions in a two-dimensional CFT. We reformulate the algebraic structures arising in their work using the formalism of non-linear Lie conformal algebras developed in \cite{SK}.

2604.18546 2026-04-21 cs.LG eess.SP math.OC

Wasserstein Distributionally Robust Risk-Sensitive Estimation via Conditional Value-at-Risk

Feras Al Taha, Eilyan Bitar

Comments 6 pages, 2 figures

详情
英文摘要

We propose a distributionally robust approach to risk-sensitive estimation of an unknown signal x from an observed signal y. The unknown signal and observation are modeled as random vectors whose joint probability distribution is unknown, but assumed to belong to a given type-2 Wasserstein ball of distributions, termed the ambiguity set. The performance of an estimator is measured according to the conditional value-at-risk (CVaR) of the squared estimation error. Within this framework, we study the problem of computing affine estimators that minimize the worst-case CVaR over all distributions in the given ambiguity set. As our main result, we show that, when the nominal distribution at the center of the Wasserstein ball is finitely supported, such estimators can be exactly computed by solving a tractable semidefinite program. We evaluate the proposed estimators on a wholesale electricity price forecasting task using real market data and show that they deliver lower out-of-sample CVaR of squared error compared to existing methods.

2604.18545 2026-04-21 math.MG math.CO math.GT

Soft tilings

Gergely Ambrus, Dorottya Dancsó

详情
英文摘要

By means of constructing a new edge-bending algorithm, we prove that every locally polyhedral tiling of $\mathbb{R}^3$ can be completely softened. A weaker form of this statement, for polyhedral space tilings, was conjectured by Domokos, Goriely, G. Horváth and Regős in 2024. We also provide a short proof for a result of Domokos, G. Horváth, and Regős, stating that in a balanced polygonic tiling of the plane, the average number of spikes is at least 2 per cell.

2604.18544 2026-04-21 math.CA math.CO math.NT

Near-optimal density theorems for large dilates of large point configurations

Vjekoslav Kovač, Adian Anibal Santos Sepčić

Comments 16 pages, 3 figures

详情
英文摘要

We study density thresholds that force a measurable set $E\subseteq\mathbb{R}^d$ to contain all sufficiently large similar copies of every $n$-point configuration. We prove a lower bound of the form $1-O((\log n)/n)$, which matches the known upper bound up to the logarithmic factor, thus essentially resolving a problem posed by Falconer, Yavicoli, and the first author of the present paper. We also study the same problem for embeddings of $n$-point configurations into $\mathbb{R}^d$ equipped with the $\ell^p$ norm, obtaining an asymptotically sharp bound $1-1/n+o(1/n)$, as soon as $p\in(1,\infty)\setminus\{2\}$. In the proof of the former estimate we use equidistribution of polynomial sequences modulo $1$ combined with probabilistic thinning. The proof of the latter estimate relies on the geometry of the $\ell^p$ spaces for $p\neq2$.

2604.18540 2026-04-21 math.AP cs.LG math.FA math.OC

Duality for the Adversarial Total Variation

Leon Bungert, Lucas Schmitt

Comments 39 pages

详情
英文摘要

Adversarial training of binary classifiers can be reformulated as regularized risk minimization involving a nonlocal total variation. Building on this perspective, we establish a characterization of the subdifferential of this total variation using duality techniques. To achieve this, we derive a dual representation of the nonlocal total variation and a related integration of parts formula, involving a nonlocal gradient and divergence. We provide such duality statements both in the space of continuous functions vanishing at infinity on proper metric spaces and for the space of essentially bounded functions on Euclidean domains. Furthermore, under some additional conditions we provide characterizations of the subdifferential in these settings.

2604.18536 2026-04-21 math.NA cs.NA physics.flu-dyn

A differentiable software suite for accelerated simulation of turbulent flows

Syver Døving Agdestein, Benjamin Sanderse

Comments 22 pages, 19 figures

详情
英文摘要

We present IncompressibleNavierStokes.jl, an open-source Julia package for solving the incompressible Navier--Stokes equations on staggered Cartesian grids. The package features matrix-free, hardware-agnostic kernels that are compiled from a single source for multi-threaded CPU or GPU execution, and hand-written adjoint kernels for all discrete operators, enabling efficient reverse-mode automatic differentiation through the entire solver. This differentiability allows neural network closure models to be trained a-posteriori while embedded in a large-eddy simulation. Memory optimizations permit double-precision direct numerical simulations at resolutions up to $840^3$ on a single GPU. The software design, numerical methods, hardware performance, and integration of neural network closure models are described, and results for turbulent channel flow are validated against reference data.

2604.18526 2026-04-21 math.LO

Positive, Negative, and Reliable Information in a First-Order Logic of Evidence and Truth

Abilio Rodrigues, Marcelo E. Coniglio

详情
英文摘要

In this paper we present the first-order logic QLETF+, a quantified version of the logic LETF+, introduced in Coniglio and Rodrigues (Studia Logica 112:561-606, 2024). QLETF+ exhibits several properties that are not always enjoyed by logics equipped with classicality operators. We show that it satisfies the replacement property and admits conjunctive, disjunctive, and prenex normal forms. Alongside extensions and anti-extensions, as in the previously studied first-order semantics for LETs, we make use here of what we call o-extensions: given an n-ary predicate symbol P, the o-extension of P is the set of n-tuples of individuals that satisfy the predicate oP. We prove the soundness and completeness of the deductive system of QLETF+ with respect to the six-valued first-order semantics.

2604.18523 2026-04-21 cond-mat.dis-nn cs.IT math.IT math.ST stat.TH

BBP transition and the leading eigenvector of the spiked Wigner model with inhomogeneous noise

Leonardo S. Ferreira, Fernando L. Metz

Comments 21 pages, 7 figures

详情
英文摘要

The spiked Wigner ensemble is a prototypical model for high-dimensional inference. We study the spectral properties of an inhomogeneous rank-one spiked Wigner model in which the variance of each entry of the noise matrix is itself a random variable. In the high-dimensional limit, we derive exact equations for the spectral edges, the outlier eigenvalue, and the distribution of the components of the outlier eigenvector. These equations determine the BBP transition line that separates the gapped phase, where the signal is detectable, from the gapless phase. In the gapped regime, the distribution of the outlier eigenvector provides a natural estimator of the spike. We solve the equations for a noise matrix whose variances are generated from a truncated power-law distribution. In this case, the BBP transition line is non-monotonic, showing that an inhomogeneous noise can enhance signal detectability.

2604.18441 2026-04-21 math.ST cs.LG stat.ML stat.TH

Conformal Robust Set Estimation

Alejandro Cholaquidis, Emilien Joly, Leonardo Moreno

详情
英文摘要

Conformal prediction provides finite-sample, distribution-free coverage under exchangeability, but standard constructions may lack robustness in the presence of outliers or heavy tails. We propose a robust conformal method based on a non-conformity score defined as the half-mass radius around a point, equivalently the distance to its $(\lfloor n/2\rfloor+1)$-nearest neighbour. We show that the resulting conformal regions are marginally valid for any sample size and converge in probability to a robust population central set defined through a distance-to-a-measure functional. Under mild regularity conditions, we establish exponential concentration and tail bounds that quantify the deviation between the empirical conformal region and its population counterpart. These results provide a probabilistic justification for using robust geometric scores in conformal prediction, even for heavy-tailed or multi-modal distributions.

2604.08473 2026-04-21 math.DG

A dimension descent scheme for the positive mass theorem in arbitrary dimension

S. Brendle, Y. Wang

Comments minor improvements in exposition

详情
英文摘要

We describe how the Schoen-Yau proof of the positive mass theorem can be extended to arbitrary dimensions. To overcome the problem of singularities, we propose a new inductive scheme. To carry out the inductive step, we use a combination of several techniques, including the shielding principle of Lesourd-Unger-Yau, as well as a conformal blow-up argument in the spirit of Bi-Hao-He-Shi-Zhu. Our arguments also rely on the Cheeger-Naber bound for the Minkowski dimension of the singular set.

2603.08675 2026-04-21 math.CO math.GR

The Lovász conjecture holds for moderately dense Cayley graphs

Benjamin Bedert, Nemanja Draganić, Alp Müyesser, Matías Pavez-Signé

Comments 16 pages, updated version features some corrections and simplifications

详情
英文摘要

We show that there is an absolute constant $c>0$ such that every large connected $n$-vertex Cayley graph with degree $d\geq n^{1-c}$ has a Hamilton cycle. This makes progress towards the Lovász conjecture and improves upon the previous best result of this form due to Christofides, Hladký, and Máthé from 2014 concerning graphs with $d\geq \varepsilon n$. Our proof avoids the use of Szemerédi's regularity lemma and relies instead on an efficient arithmetic regularity lemma specialised to Cayley graphs.

2512.06856 2026-04-21 math.RT math.GR

On stable equivalences of Morita type with twisted diagonal vertices

Xin Huang

详情
英文摘要

We give a new proof, by using simplified terminology and notation, to a result of Puig stating that if a bimodule of two block algebras of finite groups over an algebraically closed field induces a stable equivalence of Morita type and has a twisted diagonal vertex, then it has an endopermutation module as a source. We also extend this result to arbitrary fields under a mild assumption.

2511.11308 2026-04-21 eess.SY cs.SY math.OC

Policy Optimization for Unknown Systems using Differentiable Model Predictive Control

Riccardo Zuliani, Efe C. Balta, John Lygeros

详情
英文摘要

Model-based policy optimization often struggles with inaccurate system dynamics models, leading to suboptimal closed-loop performance. This challenge is especially evident in Model Predictive Control (MPC) policies, which rely on the model for real-time trajectory planning and optimization. We introduce a novel policy optimization framework for MPC-based policies combining differentiable optimization with zeroth-order optimization. Our method combines model-based and model-free gradient estimation approaches, achieving faster transient performance compared to fully data-driven approaches while maintaining convergence guarantees, even under model uncertainty. We demonstrate the effectiveness of the proposed approach on a nonlinear control task involving a 12-dimensional quadcopter model.

2511.02757 2026-04-21 cs.LG math.OC stat.ML

ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models

Lejs Deen Behric, Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil

详情
英文摘要

Zeroth-order or derivative-free optimization (MeZO) is an attractive strategy for finetuning large language models (LLMs) because it eliminates the memory overhead of backpropagation. However, it converges slowly due to the inherent curse of dimensionality when searching for descent directions in the high-dimensional parameter space of billion-scale LLMs. We propose ConMeZO, a novel zeroth-order optimizer that accelerates convergence by adaptive directional sampling. Instead of drawing the direction uniformly at random, ConMeZO restricts the sampling to a cone centered around a momentum estimate. This concentrates the search in directions where the true gradient is more likely to lie and thus reduces the effect of high dimensions. We prove that ConMeZO achieves the same worst-case convergence rate as MeZO. Empirically, when finetuning LLMs on natural language tasks, ConMeZO is up to 2X faster than MeZO while retaining the low-memory footprint of zeroth-order methods.

2510.16750 2026-04-21 math.ST cs.IT math.IT stat.ML stat.TH

On Robust Hypothesis Testing with respect to the Hellinger Distance

Eeshan Modak, Sivaraman Balakrishnan, Ananda Theertha Suresh

Comments 15 pages, 2 figures. Updated authors list. Some changes in notations and exposition. Shorter version to appear in the proceedings of ISIT 2026

详情
英文摘要

We study a variant of the simple hypothesis testing problem where observed samples do not necessarily come from either of the specified distributions, but rather from a close variant of them. In this setting, we require a test that is robust to misspecification and identifies which distribution is closer in Hellinger distance. If the underlying distribution is nearly equidistant from both hypotheses, the problem becomes intractable. Our main result is a lower bound on the slack factor, which quantifies how much closer the underlying distribution must be to one hypothesis relative to the other for any test to remain robust. We also demonstrate the implications of this result for testing with respect to symmetric chi-squared distance. Finally, we study an alternative way to specify robustness, where each hypothesis is a Hellinger ball around a fixed distribution. We provide and analyze a test for this composite hypothesis testing problem.

2510.15581 2026-04-21 math.OA

Characterizations of amenability for noncommutative dynamical systems and Fell bundles

Alcides Buss, Damián Ferraro

Comments 27 pages

详情
英文摘要

We resolve key open questions regarding approximation properties and their permanence for Fell bundles over locally compact groups. Specifically, we establish the equivalence between the Bédos--Conti approximation property (BCAP) and the Exel--Ng positive approximation property (AP), completely removing the necessity of assuming nuclearity on the unit fiber. To overcome the obstructions present in general Fell bundles (such as the lack of spatial arguments and exactness), we introduce a diagonal maximal tensor product $\otimes^d_{\max}$. We prove that a Fell bundle $\mathcal{A}$ has the AP if and only if $\mathcal{A} \otimes^d_{\max} \mathcal{B}$ has the weak containment property (wcp) for every Fell bundle $\mathcal{B}$. For $C^*$-dynamical systems, this yields a characterization of amenability that was known to hold under exactness assumptions. Furthermore, this tensorial machinery allows us to establish highly non-trivial permanence properties for the AP, including passage to restrictions over closed subgroups and partial quotients by normal subgroups. We also provide applications concerning the nuclearity of full and reduced cross-sectional $C^*$-algebras.

2510.13764 2026-04-21 math.GT math.QA

The minimal Rickard complexes of braids on two strands

Joshua Wang

Comments 47 pages. An explicit formula for the grading shift H is provided, simplifying and shortening sections 4.3 and 5.2

详情
英文摘要

The Rickard complex of a braid with strands colored by positive integers is a chain complex of singular Soergel bimodules. The complex determines the colored triply-graded homology and colored sl(N) homology of the braid closure, when closure is color-compatible. For each braid on two strands with any colors, we construct a minimal complex that is homotopy equivalent to its Rickard complex. It is not obtained by laborious simplification; instead, it is defined directly by explicit formulas obtained by educated guesswork and reverse engineering.

2507.12182 2026-04-21 math-ph cs.LG math.MP math.PR

Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices

Ievgenii Afanasiev, Leonid Berlyand, Mariia Kiyashko

Comments v1: 14 pages, 3 figures; v2: a part of the proof of Lemma 4.2 was revised, 15 pages, 3 figures; v3: the proof was generalized, 20 pages, 3 figures; v4: minor changes in the proof, typos correced, 21 pages, 3 figures

详情
英文摘要

The paper is concerned with deformed Wigner random matrices. These matrices are closely related to Deep Neural Networks (DNNs): weight matrices of trained DNNs could be represented in the form $R + S$, where $R$ is random and $S$ is highly correlated. The spectrum of such matrices plays a key role in rigorous underpinning of the novel pruning technique based on Random Matrix Theory. In practice, the spectrum of the matrix $S$ can be rather complicated. In this paper, we develop an asymptotic analysis for the case of full rank $S$ with increasing number of outlier eigenvalues.

2507.06898 2026-04-21 math.OC math.CO

Strength of the Upper Bounds for the Edge-Weighted Maximum Clique Problem

Fabio Ciccarelli, Valerio Dose, Fabio Furini, Marta Monaci

详情
英文摘要

We theoretically and computationally compare the strength of the three main upper bounds from the literature on the optimal value of the Edge-Weighted Maximum Clique Problem (EWMCP). We provide a set of instances for which the ratio between any of the three upper bounds and the optimal value of the EWMCP is unbounded, showing that none of them can give a performance guarantee. We further analyze the relative strength among the three upper bounds by determining, for every choice of a ratio between any two of them, the largest values it can attain and providing families of instances for which such values can be reached. Our results show that, for each pair of upper bounds, there exist appropriately chosen instances on which either bound is tighter than the other. Our theoretical analysis is complemented by extensive computational experiments on two benchmark datasets: the standard DIMACS instances and randomly generated instances, providing practical insights into the empirical strength of the upper bounds.

2505.13929 2026-04-21 math.NA cs.NA

Error estimates for numerical approximations of a nonlinear gradient flow model

Jerome Droniou, Kim-Ngan Le, Huateng Zhu

详情
英文摘要

We perform numerical analysis of a nonlinear gradient flow, which can be regarded as a parabolic minimal surface problem or a regularised total variation flow, using the gradient discretisation method (GDM). GDM is a unified convergence analysis framework that covers conforming and nonconforming numerical methods, for instance, conforming and nonconforming finite element, two-point flux approximation, etc.. In this paper, a fully discretised implicit scheme of the model is proposed, the existence and uniqueness of the solution to the scheme is proved, the stability and consistency of the scheme are analysed, and error estimates are established. Numerical results based on the conforming and nonconforming $\mathbb{P}^1$ finite elements are also provided.

2405.04649 2026-04-21 math.AT math-ph math.MP

The Smith Fiber Sequence and Invertible Field Theories

Arun Debray, Sanath K. Devalapurkar, Cameron Krulewski, Yu Leon Liu, Natalia Pacheco-Tallaj, Ryan Thorngren

Comments 77 pages, 3 figures. This is a companion paper to arXiv:2309.16749.

详情
Journal ref
Comm. Math. Phys. 407 (2026), no. 2, Paper No. 25
英文摘要

Smith homomorphisms are maps between bordism groups that change both the dimension and the tangential structure. We give a completely general account of Smith homomorphisms, unifying the many examples in the literature. We provide three definitions of Smith homomorphisms, including as maps of Thom spectra, and show they are equivalent. Using this, we identify the cofiber of the spectrum-level Smith map and extend the Smith homomorphism to a long exact sequence of bordism groups, which is a powerful computation tool. We discuss several examples of this long exact sequence, relating them to known constructions such as Wood's and Wall's sequences. Furthermore, taking Anderson duals yields a long exact sequence of invertible field theories, which has a rich physical interpretation. We developed the theory in this paper with applications in mind to symmetry breaking in quantum field theory, which we study in a companion paper.

2104.05008 2026-04-21 math.OA math.FA

Operator spaces with the WEP, the OLLP and the Gurarii property

Gilles Pisier

Comments Minor changes mainly to clarify or add more details to certain proofs. Added reference to Pedersen in 2023. minor improvements in january 2026. Paper to appear in Annales Fennici Mathematici

详情
英文摘要

We construct non-exact operator spaces satisfying the Weak Expectation Property (WEP) and the Operator space version of the Local Lifting Property (OLLP). These examples should be compared with the example we recently gave of a $C^*$-algebra with WEP and LLP. The construction produces several new analogues among operator spaces of the Gurarii space, extending Oikhberg's previous work. Each of our "Gurarii operator spaces" is associated to a class of finite dimensional operator spaces (with suitable properties). In each case we show the space exists and is unique up to completely isometric isomorphism.

2604.18517 2026-04-21 math-ph math.MP

Low-noise Pauli-consistent ensemble Monte Carlo for graphene with electron-electron scattering

Tigran Zalinyan, Giovanni Nastasi

Comments 15 pages, 12 figures, 7 tables, submitted manuscript (status: with referee(s))

详情
英文摘要

We investigate Pauli-consistent ensemble Monte Carlo simulations of graphene with explicit intraband electron-electron scattering. To reduce the cost of electron-electron proposal-rate evaluation, we introduce a sampled-partner approximation that replaces the full partner-cell sum by uniform sampling from the instantaneous ensemble, while leaving the event-level collision step unchanged. Comparison with the full-sum reference shows close agreement together with a substantial reduction in computational cost, enabling large-ensemble low-noise simulations. In this regime, systematic oscillatory components become clearly resolved in ensemble-averaged time traces. We show that these oscillations are numerical and originate from deterministic drift on the discretized momentum-space grid. We also discuss a procedure for reducing their impact in recorded observables without modifying the underlying Monte Carlo dynamics.

2604.18515 2026-04-21 math.GN

JAI functional contractions in relational metric spaces

Mihai Turinici

详情
英文摘要

The 2015 fixed point result on rs-relational metric spaces due to Alam and Imdad [J. Fixed Point Th. Appl., 17 (2015), 693-702] is equivalent with the classical Banach Contraction Principle [Fund. Math., 3 (1922), 133-181]. This is also valid for the 1961 statement in metric spaces due to Edelstein [Proc. Amer. Math. Soc., 12 (1961), 7-10], or the 2005 fixed point result in quasi-ordered metric spaces obtained by Nieto and Rodriguez-Lopez [Order, 22 (2005), 223-239].

2604.18513 2026-04-21 hep-th math-ph math.MP quant-ph

Bosonization, vertex operators and maximal violation of the Bell-CHSH inequality in wedge regions

J. G. A. Caribé, M. S. Guimaraes, I. Roditi, S. P. Sorella

Comments 5 pages

详情
英文摘要

It is pointed out that the vertex operators of a chiral boson in 1+1 dimensions provide an explicit realization of dichotomic, bounded, Hermitian operators that saturate the Tsirelson bound of the Bell-CHSH inequality in the vacuum state.

2604.18511 2026-04-21 math.OC

An adaptive discretization algorithm for locally optimal experimental design with constraints

Jochen Schmid, Philipp Seufert, Jan Schwientek, Tobias Seidel, Karl-Heinz Küfer

Comments 44 pages,

详情
英文摘要

We develop a novel iterative algorithm for locally optimal experimental design under constraints, like budget or performance constraints. It is an adaptive discretization algorithm. In every iteration, a discretized version of the constrained-design problem is solved and then the discretization is adaptively refined by adding an approximate violator of a suitable sufficient $\eps$-optimality condition for the current design. We prove that with $\eps = 0$, our algorithm converges to an optimal design and that with $\eps > 0$, our algorithm finitely terminates at an $\eps$-optimal design. Compared to the existing algorithms on constrained experimental design, our algorithm comes with considerably less computational effort because the nonlinear subproblems in our algorithm have a smaller dimension and have to be solved only approximately and only in selected iterations (typically the last few). Additionally, our algorithm covers a considerably larger class of constraints. We demonstrate the good convergence properties of the algorithm on experimental design problems from chemical engineering that feature time and yield constraints.

2604.18505 2026-04-21 cs.IT math.IT stat.ML

Bayesian experimental design: grouped geometric pooled posterior via ensemble Kalman methods

Huchen Yang, Xinghao Dong, Jinlong Wu

详情
英文摘要

Bayesian experimental design (BED) for complex physical systems is often limited by the nested inference required to estimate the expected information gain (EIG) or its gradients. Each outer sample induces a different posterior, creating a large and heterogeneous set of inference targets. Existing methods have to sacrifice either accuracy or efficiency: they either perform per-outer-sample posterior inference, which yields higher fidelity but at prohibitive computational cost, or amortize the inner inference across all outer samples for computational reuse, at the risk of degraded accuracy under posterior heterogeneity. To improve accuracy and maintain cost at the amortized level, we propose a grouped geometric pooled posterior framework that partitions outer samples into groups and constructs a pooled proposal for each group. While such grouping strategy would normally require generating separate proposal samples for different groups, our tailored ensemble Kalman inversion (EKI) formulation generates these samples without extra forward-model evaluation cost. We also introduce a conservative diagnostic to assess importance-sampling quality to guide grouping. This grouping strategy improves within-group proposal-target alignment, yielding more accurate and stable estimators while keeping the cost comparable to amortized approaches. We evaluate the performance of our method on both Gaussian-linear and high-dimensional network-based model discrepancy calibration problems.