arXivDaily arXiv每日学术速递 周一至周五更新
2601.11512 2026-01-19 math.RT math.CT math.RA

Krull-Gabriel dimension of Skew group algebras

Shantanu Sardar

详情
英文摘要

For an algebraically closed field K, let G be a finite abelian group of K-linear automorphisms of a finite-dimensional algebra A and AG is the associated skew group algebra. The author with S. Trepode and A. G. Chaio introduced the notion of a Galois semi-covering functor to study the irreducible morphisms over skew group algebras. In this paper, we establish a Galois semi-covering functor between the morphism categories as well as the functor categories over the algebras A and AG and prove that their Krull-Gabriel dimension are equal. This computation confirms Prests conjecture on the finiteness of Krull-Gabriel dimension and Schroers conjecture on its connection with the stable rank (the least stabilized radical power) over skew gentle algebras. Moreover, we determine all posible stable ranks for (skew) Brauer graph algebras.

2601.11501 2026-01-19 cs.IT math.IT

Coding Schemes for the Noisy Torn Paper Channel

Frederik Walter, Maria Abu-Sini, Nils Weinhardt, Antonia Wachter-Zeh

详情
英文摘要

To make DNA a suitable medium for archival data storage, it is essential to consider the decay process of the strands observed in DNA storage systems. This paper studies the decay process as a probabilistic noisy torn paper channel (TPC), which first corrupts the bits of the transmitted sequence in a probabilistic manner by substitutions, then breaks the sequence into a set of noisy unordered substrings. The present work devises coding schemes for the noisy TPC by embedding markers in the transmitted sequence. We investigate the use of static markers and markers connected to the data in the form of hash functions. These two tools have also been recently exploited to tackle the noiseless TPC. Simulations show that static markers excel at higher substitution probabilities, while data-dependent markers are superior at lower noise levels. Both approaches achieve reconstruction rates exceeding $99\%$ with no false decodings observed, primarily limited by computational resources.

2601.11498 2026-01-19 cs.IT cs.NI eess.SP math.IT quant-ph

Convergence Properties of Good Quantum Codes for Classical Communication

Alptug Aytekin, Mohamed Nomeir, Lei Hu, Sennur Ulukus

详情
英文摘要

An important part of the information theory folklore had been about the output statistics of codes that achieve the capacity and how the empirical distributions compare to the output distributions induced by the optimal input in the channel capacity problem. Results for a variety of such empirical output distributions of good codes have been known in the literature, such as the comparison of the output distribution of the code to the optimal output distribution in vanishing and non-vanishing error probability cases. Motivated by these, we aim to achieve similar results for the quantum codes that are used for classical communication, that is the setting in which the classical messages are communicated through quantum codewords that pass through a noisy quantum channel. We first show the uniqueness of the optimal output distribution, to be able to talk more concretely about the optimal output distribution. Then, we extend the vanishing error probability results to the quantum case, by using techniques that are close in spirit to the classical case. We also extend non-vanishing error probability results to the quantum case on block codes, by using the second-order converses for such codes based on hypercontractivity results for the quantum generalized depolarizing semi-groups.

2601.11493 2026-01-19 math.NA cs.NA

Efficient error estimators for Generalized Nyström

Lorenzo Lazzarino, Katherine J. Pearce, Nathaniel Pritchard

详情
英文摘要

Randomized algorithms in numerical linear algebra have proven to be effective in ameliorating issues of scalability when working with large matrices, efficiently producing accurate low-rank approximations. A key remaining challenge, however, is to efficiently assess the approximation accuracy of randomized methods without additional expensive matrix accesses. Recent work has addressed this issue by deriving fast leave-one-out error estimators for the randomized SVD and Nyström decomposition, enabling accurate error estimation with no additional matrix accesses. In this work, we extend the leave-one-out framework to the generalized Nyström decomposition, an approach that can be applied to general rectangular matrices. We do this by deriving three new leave-one-out error estimators and validating their effectiveness through numerical experiments.

2601.11490 2026-01-19 math.NT

Sumset size races for measurable sets

Melvyn B. Nathanson

Comments 12 pages

详情
英文摘要

Let $G$ be a locally compact abelian group with Haar measure $μ$. For integers $n \geq 2$ and $H \geq 2$ and for any $n$-tuples $\mathbf{u}_1,\ldots, \mathbf{u}_H \in \mathbf{N}^n$, there exist measurable subsets $A_1,\ldots, A_n$ of $G$ such that the $n$-tuple $\left( μ(hA_1),\ldots, μ(hA_n) \right)$ has the same relative order as the $n$-tuple $\mathbf{u}_h$ for all $h = 1,\ldots, H$. For integers $m_{i,h}$ for $i =1,\ldots, n-1$ and $h = 1,\ldots, H$, there are Lebesgue measurable sets $A_1,\ldots, A_n$ in $\mathbf{R}$ such that $μ(hA_{i+1}) - μ(hA_i) = m_{i,h}$ for all $i$ and $h$.

2601.11489 2026-01-19 math.AT

Quasi-unitial Inner Kan Spaces

Trygve Poppe Oldervoll

详情
英文摘要

We show that semi-simplicial spaces that i) admit inner horn fillers up to homotopy and ii) possess units in a weak sense provide a viable model for $\infty$-categories. The existence of units can be expressed through various quasi-unitality conditions, and we compare the natural generalization of three such conditions found in the literature. This work is motivated by applications in Floer homotopy theory.

2601.11483 2026-01-19 math.NA cs.NA

Tensor field tomography with attenuation and refraction: adjoint operators for the dynamic case and numerical experiments

Lukas Vierus, Thomas Schuster, Bernadette Hahn

Comments 18 pages, 7 figures

详情
英文摘要

This article is concerned with tensor field tomography in a fairly general setting, that takes refraction, attenuation and time-dependence of tensor fields into account. The mathematical model is given by attenuated ray transforms of the fields along geodesic curves corresponding to a Riemannian metric that is defined by the index of refraction. The data are given at the boundary tangent bundle of the domain and it is well-known that they can be characterized as boundary data of a transport equation turning tensor field tomography into an inverse source problem. This way the adjoint of the forward mapping can be computed using the integral representation or, equivalently, associated to a dual transport equation. The article offers and proves two different representations for the adjoint mappings both in the dynamic and static case. The numerical implementation is demonstrated and evaluated for static fields using the damped Landweber method with Nesterov acceleration applied to both, the integral and PDE-based formulations. The transport equations are solved using a viscosity approximation. The error analysis reveals that the integral representation significantly outperforms PDE-based methods in terms of computational efficiency while achieving comparable reconstruction accuracy. The impact of noise and deviations from straight-line trajectories are investigated confirming improved accuracy if refraction is taken into account. We conclude that the inclusion of refraction to the forward model pays in spite of increased numerical cost.

2601.11482 2026-01-19 math.DS

A Genetic Algorithm for Generating Extreme Examples in Arithmetic Dynamics

Benjamin Hutz

Comments article 15 pages. Appendix with data tables 43 pages. github for code: https://github.com/bhutz/GA_for_ADS_examples

详情
英文摘要

We describe a genetic algorithm to find extreme examples in the arithmetic of dynamical systems. The algorithm is applied to four problems: small (non-zero) canonical heights, many rational preperiodic points, long rational cycles, and long rational tails. Data is provided for extreme examples generated for polynomials up to degree 13 and rational functions up to degree 5. This work significantly expands the known examples of extreme behavior for several of the conjectured behaviors in arithmetic dynamics and provides a foundation from which to begin a more advanced application of machine learning techniques in the creation of extreme examples for arithmetic dynamics.

2601.11474 2026-01-19 math.AG

The resultant divisor is negative

Olivier Benoist

Comments This article replaces and supersedes arXiv:1212.4862. 26 pages

详情
英文摘要

Fix two integers $1\leq d<e$. We study the birational geometry of a parameter space for pairs of homogeneous polynomials of degrees $d$ and $e$ in two variables (in which the higher degree polynomial is well defined only up to a multiple of the lower degree polynomial). We show that one can run the MMP on this space, and that it eventually contracts the resultant divisor.

2601.11473 2026-01-19 math.OC cs.LG

A Probabilistic Approach to Trajectory-Based Optimal Experimental Design

Ahmed Attia

Comments 42 Figures, this version includes supplementary material as appendices

详情
英文摘要

We present a novel probabilistic approach for optimal path experimental design. In this approach a discrete path optimization problem is defined on a static navigation mesh, and trajectories are modeled as random variables governed by a parametric Markov policy. The discrete path optimization problem is then replaced with an equivalent stochastic optimization problem over the policy parameters, resulting in an optimal probability model that samples estimates of the optimal discrete path. This approach enables exploration of the utility function's distribution tail and treats the utility function of the design as a black box, making it applicable to linear and nonlinear inverse problems and beyond experimental design. Numerical verification and analysis are carried out by using a parameter identification problem widely used in model-based optimal experimental design.

2601.11472 2026-01-19 math.CT

Comonadic approach to pretorsion theories

Elena Caviglia, Zurab Janelidze, Luca Mesiti

详情
英文摘要

We present a comonadic approach to pretorsion theories on semiexact categories, i.e. categories equipped with a closed ideal of null morphisms that admits all kernels and all cokernels. We first prove that bihereditary pretorsion theories are comonadic in a 2-dimensional sense over the 2-category of semiexact categories with naturally chosen 1-cells. We then extend the built pseudo-comonad to guarantee that all pretorsion theories are pseudo-coalgebras. But interestingly, not all pseudo-coalgebras are pretorsion theories. Rather, pseudo-coalgebras give a generalized notion of pretorsion theory.

2601.11470 2026-01-19 math.PR

Dvoretzky covering problem for general measures

Roope Anttila, Markus Myllyoja

Comments 36 pages + 6 page appendix. Comments are welcome!

详情
英文摘要

We study the Dvoretzky covering problem for random covering sets driven by general Borel probability measures. As our main result, we solve the problem of covering analytic sets by random covering sets generated by arbitrary Borel probability measures on the real line. Prior to this work, a complete solution was not known for any singular measure. Our solution is potential theoretic and involves a generalisation of a notion of capacity in the work of Kahane, who solved the problem of covering compact sets in the classical setting where the random covering process is driven by the Lebesgue measure on the unit circle. One of our key innovations is a simple but powerful application of the Jankov-von Neumann uniformisation theorem, which we believe to have interest outside of this work. In addition, we determine the critical exponent for the covering problem for polynomially decreasing sequences $(cn^{-t})_n$ for random covering sets driven by Borel probability measures on $\mathbb{R}^d$. At exactly the critical exponent, the covering property generally depends on the constant $c>0$, and as an application of our main result, we determine the critical constant for random covering sets driven by natural measures on strongly separated self-conformal sets on the line. The critical constant depends on the multifractal structure of the average densities of the measure, and the result is new even for the simplest case of the Hausdorff measure on the Cantor set.

2601.11468 2026-01-19 cs.AI cs.IT math.IT

Exploring LLM Features in Predictive Process Monitoring for Small-Scale Event-Logs

Alessandro Padella, Massimiliano de Leoni, Marlon Dumas

Comments 19 pages, 4 figure, TMIS journal submission

详情
英文摘要

Predictive Process Monitoring is a branch of process mining that aims to predict the outcome of an ongoing process. Recently, it leveraged machine-and-deep learning architectures. In this paper, we extend our prior LLM-based Predictive Process Monitoring framework, which was initially focused on total time prediction via prompting. The extension consists of comprehensively evaluating its generality, semantic leverage, and reasoning mechanisms, also across multiple Key Performance Indicators. Empirical evaluations conducted on three distinct event logs and across the Key Performance Indicators of Total Time and Activity Occurrence prediction indicate that, in data-scarce settings with only 100 traces, the LLM surpasses the benchmark methods. Furthermore, the experiments also show that the LLM exploits both its embodied prior knowledge and the internal correlations among training traces. Finally, we examine the reasoning strategies employed by the model, demonstrating that the LLM does not merely replicate existing predictive methods but performs higher-order reasoning to generate the predictions.

2601.11467 2026-01-19 math.OC

The XL Instances for the Capacitated Vehicle Routing Problem

Eduardo Queiroga, Rafael Martinelli, Anand Subramanian, Eduardo Uchoa, Thibaut Vidal

Comments 18 pages, 4 figures

详情
英文摘要

This paper introduces a new set of large-scale benchmark instances for the Capacitated Vehicle Routing Problem (CVRP). The proposed XL set extends existing benchmarks by covering instances with 1,000 to 10,000 customers and a wide range of structural characteristics, following established generation principles from prior CVRP studies. A computational study involving several state-of-the-art algorithms is conducted to provide initial best known solutions (BKSs) for the XL instances, which serve as a baseline for a community-driven BKS challenge launched on the CVRPLib website. The instances are made publicly available to support experimental evaluation and comparison of solution methods. Furthermore, additional computational analyses are reported to compare algorithmic performance on other existing CVRP benchmark instances.

2601.11465 2026-01-19 math.ST stat.TH

Optimal transport based theory for latent structured models

XuanLong Nguyen, Yun Wei

Comments 27 pages

详情
英文摘要

This article is an exposition on some recent theoretical advances in learning latent structured models, with a primary focus on the fundamental roles that optimal transport distances play in the statistical theory. We aim at what may be the most critical and novel ingredient in this theory: the motivation, formulation, derivation and ramification of inverse bounds, a rich collection of structural inequalities for latent structured models which connect the space of distributions of unobserved structures of interest to the space of distributions for observed data. This theory is illustrated on classical mixture models, as well as the more modern hierarchical models that have been developed in Bayesian statistics, machine learning and related fields.

2601.11463 2026-01-19 math.FA

The classification of $C(K)$ spaces for countable compacta by positive isomorphisms

Marek Cúth, Jonáš Havelka, Jakub Rondoš, Bünyamin Sarı

详情
英文摘要

We study the classification of spaces of continuous functions $C(K)$ under positive linear maps. For infinite countable compacta, we show that whenever $C(K)$ and $C(L)$ are isomorphic, there exists an isomorphism $T:C(K)\to C(L)$ satisfying either $T\geq 0$ or $T^{-1}\geq 0$. We also prove that for any compact spaces $K$ and $L$, the existence of a positive embedding $T: C(K) \to C(L)$ implies that the Cantor-Bendixson height of $K$ does not exceed the height of $L$. Furthermore, we introduce a one-sided positive Banach-Mazur distance and obtain new estimates for both the classical and positive distances. Notably, we prove the exact formula $d_{BM}(C(ω^{ω^α}), C(ω^{ω^αn})) = n+\sqrt{(n-1)(n+3)}$.

2601.11462 2026-01-19 math.OC

Stochastic Recursive Inclusions under Biased Perturbations: An Input-to-State Stability Perspective

Anik Kumar Paul, Karthik Shenoy, Arun D. Mahindrakar

详情
英文摘要

This paper investigates the asymptotic behavior of stochastic recursive inclusions in the presence of non-zero, non-diminishing bias, a setting that frequently arises in zeroth-order optimization, stochastic approximation with iterate-dependent noise, and distributed learning with adversarial agents. The analysis is conducted through the lens of input-to-state stability of an associated differential inclusion, which serves as the continuous-time limit of the discrete recursion. We first establish that if the limiting differential inclusion is input-to-state stable and the iterates remain almost surely bounded, then the iterates converge almost surely to the neighborhood of desired equilibrium. We then provide a verifiable sufficient condition for almost sure boundedness by assuming that the underlying operator is single-valued and globally Lipschitz. Finally, we show that several zeroth-order variants of stochastic gradient naturally fit within this framework, and we demonstrate their input-to-state stability under standard conditions. Overall, the results provide a unified theoretical foundation for studying almost sure convergence of biased stochastic approximation schemes through the Input to State stability theory of differential inclusions.

2601.11461 2026-01-19 stat.CO cs.CE math.ST stat.TH

Smooth SCAD: A Raised Cosine SCAD Type Thresholding Rule for Wavelet Denoising

Radhika Kulkarni, Aluisio Pinheiro, Brani Vidakovic, Abdourrahmane M. Atto

Comments 25 pages, 2 figures

详情
英文摘要

We introduce a smooth variant of the SCAD thresholding rule for wavelet denoising by replacing its piecewise linear transition with a raised cosine. The resulting shrinkage function is odd, continuous on R, and continuously differentiable away from the main threshold, yet retains the hallmark SCAD properties of sparsity for small coefficients and near unbiasedness for large ones. This smoothness places the rule within the continuous thresholding class for which Stein's unbiased risk estimate is valid. As a result, unbiased risk computation, stable data-driven threshold selection, and the asymptotic theory of Kudryavtsev and Shestakov apply. A corresponding nonconvex prior is obtained whose posterior mode coincides with the estimator, yielding a transparent Bayesian interpretation. We give an explicit SURE risk expression, discuss the oracle scale of the optimal threshold, and describe both global and level-dependent adaptive versions. The smooth SCAD rule therefore offers a tractable refinement of SCAD, combining low bias, exact sparsity, and analytical convenience in a single wavelet shrinkage procedure.

2601.11439 2026-01-19 math.OC cs.SY eess.SY

Projection-based discrete-time consensus on the unit sphere

Johan Thunberg, Galina Sidorenko

Comments 14 pages including appendix, 0 figures

详情
英文摘要

We address discrete-time consensus on the Euclidean unit sphere. For this purpose we consider a distributed algorithm comprising the iterative projection of a conical combination of neighboring states. Neighborhoods are represented by a strongly connected directed graph, and the conical combinations are represented by a (non-negative) weight matrix with a zero structure corresponding to the graph. A first result mirrors earlier results for gradient flows. Under the assumptions that each diagonal element of the weight matrix is more than $\sqrt{2}$ larger than the sum of the other elements in the corresponding row, the sphere dimension is greater or equal to 2, and the graph, as well as the weight matrix, is symmetric, we show that the algorithm comprises gradient ascent, stable fixed points are consensus points, and the set of initial points for which the algorithm converges to a non-consensus fixed point has measure zero. The second result is that for the unit circle and a strongly connected graph or for any unit sphere with dimension greater than or equal to $1$ and the complete graph, only for a measure zero set of weight matrices there are fixed points for the algorithm which do not have consensus or antipodal configurations.

2601.11438 2026-01-19 eess.SP cs.IT math.IT

Channel Estimation in MIMO Systems Aided by Microwave Linear Analog Computers (MiLACs)

Qiaosen Zhang, Matteo Nerini, Bruno Clerckx

Comments Submitted to IEEE for publication

详情
英文摘要

Microwave linear analog computers (MiLACs) have recently emerged as a promising solution for future gigantic multiple-input multiple-output (MIMO) systems, enabling beamforming with greatly reduced hardware and computational cost. However, channel estimation for MiLAC-aided systems remains an open problem. Conventional least squares (LS) and minimum mean square error (MMSE) estimation rely on intensive digital computation, which undermines the benefits offered by MiLACs. In this letter, we propose efficient LS and MMSE channel estimation schemes for MiLAC-aided MIMO systems. By designing training precoders and combiners implemented by MiLACs, both LS and MMSE estimation are performed fully in the analog domain, achieving identical performance to their digital counterparts while significantly reducing computational complexity, transmit RF chains, analog-to-digital/digital-to-analog converters (ADCs/DACs) resolution requirements, and peak-to-average power ratio (PAPR). Numerical results verify the effectiveness and advantages of the proposed schemes.

2601.11435 2026-01-19 math.OC cs.LG

Near-Optimal Decentralized Stochastic Nonconvex Optimization with Heavy-Tailed Noise

Menglian Wang, Zhuanghua Liu, Luo Luo

详情
英文摘要

This paper studies decentralized stochastic nonconvex optimization problem over row-stochastic networks. We consider the heavy-tailed gradient noise which is empirically observed in many popular real-world applications. Specifically, we propose a decentralized normalized stochastic gradient descent with Pull-Diag gradient tracking, which achieves approximate stationary points with the optimal sample complexity and the near-optimal communication complexity. We further follow our framework to study the setting of undirected networks, also achieving the nearly tight upper complexity bounds. Moreover, we conduct empirical studies to show the practical superiority of the proposed methods.

2601.11431 2026-01-19 math.QA

Classification of 1-super-transitive quantum subgroups in type A

Cain Edie-Michell, Jacques Katumba

Comments 34 pages with 8 page appendix

详情
英文摘要

We define a notion of super-transitivity for ètale algebra objects $A \in \mathcal{C}(\mathfrak{sl}_N, k)$. This definition is a direct analogue of the notion of super-transitivity for subfactors, and measures at what depth the first ``new stuff'' appears in the category of $A$-modules internal to $\mathcal{C}(\mathfrak{sl}_N, k)$. Our main theorem gives a classification of all 1-super-transitive ètale algebra objects in $\mathcal{C}(\mathfrak{sl}_N, k)$ running over all $N,k \in \mathbb{N}$. Our classification captures all known infinite families of non-pointed ètale algebras in $\mathcal{C}(\mathfrak{sl}_N, k)$, and includes all but 16 of the known non-pointed ètale algebra objects in these categories. These remaining 16 known examples have super-transitivities between 2 and 4.

2601.11426 2026-01-19 eess.SY cs.RO cs.SY math.OC

Learning-Based Shrinking Disturbance-Invariant Tubes for State- and Input-Dependent Uncertainty

Abdelrahman Ramadan, Sidney Givigi

Journal ref IEEE Control Systems Letters, vol. 9, pp. 2699-2704, Dec. 2025

详情
英文摘要

We develop a learning-based framework for constructing shrinking disturbance-invariant tubes under state- and input-dependent uncertainty, intended as a building block for tube Model Predictive Control (MPC), and certify safety via a lifted, isotone (order-preserving) fixed-point map. Gaussian Process (GP) posteriors become $(1-α)$ credible ellipsoids, then polytopic outer sets for deterministic set operations. A two-time-scale scheme separates learning epochs, where these polytopes are frozen, from an inner, outside-in iteration that converges to a compact fixed point $Z^\star\!\subseteq\!\mathcal G$; its state projection is RPI for the plant. As data accumulate, disturbance polytopes tighten, and the associated tubes nest monotonically, resolving the circular dependence between the set to be verified and the disturbance model while preserving hard constraints. A double-integrator study illustrates shrinking tube cross-sections in data-rich regions while maintaining invariance.

2601.11420 2026-01-19 math.OC cs.LG stat.ML

Statistical Robustness of Interval CVaR Based Regression Models under Perturbation and Contamination

Yulei You, Junyi Liu

详情
英文摘要

Robustness under perturbation and contamination is a prominent issue in statistical learning. We address the robust nonlinear regression based on the so-called interval conditional value-at-risk (In-CVaR), which is introduced to enhance robustness by trimming extreme losses. While recent literature shows that the In-CVaR based statistical learning exhibits superior robustness performance than classical robust regression models, its theoretical robustness analysis for nonlinear regression remains largely unexplored. We rigorously quantify robustness under contamination, with a unified study of distributional breakdown point for a broad class of regression models, including linear, piecewise affine and neural network models with $\ell_1$, $\ell_2$ and Huber losses. Moreover, we analyze the qualitative robustness of the In-CVaR based estimator under perturbation. We show that under several minor assumptions, the In-CVaR based estimator is qualitatively robust in terms of the Prokhorov metric if and only if the largest portion of losses is trimmed. Overall, this study analyzes robustness properties of In-CVaR based nonlinear regression models under both perturbation and contamination, which illustrates the advantages of In-CVaR risk measure over conditional value-at-risk and expectation for robust regression in both theory and numerical experiments.

2601.11419 2026-01-19 cs.DM math.OC

On the Virtual Network Embedding polytope

Amal Benhamiche, Pierre Fouilhoux, Lucas Létocart, Nancy Perrot, Alexis Schneider

详情
英文摘要

We initiate the polyhedral study of the Virtual Network Embedding (VNE) problem, which arises in modern telecommunication networks. We propose new valid inequalities for the so-called flow formulation. We then prove, through a dedicated flow decomposition algorithm, that these inequalities characterize the VNE polytope in the case of an embedding of a virtual edge on a substrate path. Preliminary experiments show that the new inequalities propose promising speedups for MIP solvers.

2601.11416 2026-01-19 physics.optics math-ph math.MP

Wigner picture of partially coherent accelerating beams

Sergey A. Ponomarenko, Morteza Hajati

详情
英文摘要

We advance a phase-space theory of partially coherent accelerating, non-diffracting beams employing the Wigner distribution function (WDF). We derive a general expression for the WDF of any accelerating, diffraction-free beam of arbitrary degree of spatial coherence and find an elegant closed-form expression for the WDF of such beam with a Gaussian energy spectrum of noise. We also show how partially coherent accelerating beams of finite power can be constructed within the Wigner picture.

2601.11406 2026-01-19 math.NA cs.NA

Solving the Fisher nonlinear differential equations via Physics-Informed Neural Networks: A Comprehensive Retraining Study and Comparative Analysis with the Finite Difference Method

Ahmed Aberqi, Ahmed Miloudi

Comments 14 pages, 9 Figures

详情
英文摘要

Physics-Informed Neural Networks (PINNs) represent a groundbreaking paradigm in scientific computing, seamlessly integrating the robust framework of deep learning with fundamental physical laws. This paper meticulously applies the standard PINN framework to solve the challenging one-dimensional nonlinear Fisher-KPP equation, a critical model in reaction-diffusion dynamics describing phenomena such as population spread and flame propagation. We detail a comprehensive methodology, encompassing the neural network architecture, the physics-informed loss function, and an in-depth investigation into retraining strategies aimed at optimizing model performance. Our approach is rigorously validated through a direct comparison of the PINN solution against both the known analytical solution and a numerical solution derived from the Finite Difference Method (FDM). Through this work, we elucidate the intricate balance between model complexity, training efficiency, and accuracy. Results highlight the PINN's remarkable capability in accurately approximating the solution to this complex PDE, while also shedding light on the critical aspects and challenges of model retraining, particularly concerning the optimizer's state. This study provides a thorough quantitative error analysis, demonstrating the efficacy of PINNs as a viable and competitive alternative to traditional numerical methods for solving nonlinear differential equations, and discusses their broader applications across various scientific domains.

2601.11397 2026-01-19 cs.LG cs.NA math.NA

Latent Space Inference via Paired Autoencoders

Emma Hart, Bas Peters, Julianne Chung, Matthias Chung

Comments 21 pages, 7 figures

详情
英文摘要

This work describes a novel data-driven latent space inference framework built on paired autoencoders to handle observational inconsistencies when solving inverse problems. Our approach uses two autoencoders, one for the parameter space and one for the observation space, connected by learned mappings between the autoencoders' latent spaces. These mappings enable a surrogate for regularized inversion and optimization in low-dimensional, informative latent spaces. Our flexible framework can work with partial, noisy, or out-of-distribution data, all while maintaining consistency with the underlying physical models. The paired autoencoders enable reconstruction of corrupted data, and then use the reconstructed data for parameter estimation, which produces more accurate reconstructions compared to paired autoencoders alone and end-to-end encoder-decoders of the same architecture, especially in scenarios with data inconsistencies. We demonstrate our approaches on two imaging examples in medical tomography and geophysical seismic-waveform inversion, but the described approaches are broadly applicable to a variety of inverse problems in scientific and engineering applications.

2601.11395 2026-01-19 math.OC

The maximum principle for discrete-time control systems and applications to dynamic games

Alberto Domínguez Corella, Onésimo Hernández-Lerma

Journal ref J. Math. Anal. Appl. 475 (2019), no. 1, 253-277

详情
英文摘要

We study deterministic nonstationary discrete-time optimal control problems in both finite and infinite horizon. With the aid of Gateaux differentials, we prove a discrete-time maximum principle in analogy with the well-known continuous-time maximum principle. We show that this maximum principle, together with a transversality condition, is a necessary condition for optimality; we also show that it is sufficient under additional hypotheses. We use Gateaux differentials as a natural setting to derive first-order conditions. Additionally, we use the discrete-time maximum principle to derive the discrete-time Euler equation and to characterize Nash equilibria for discrete-time dynamic games.

2601.11384 2026-01-19 math.AP

Homogenized moderately wrinkled shell theory from 3D Koiter's linear elasticity

Pedro Hernández-Llanos, Rajesh Mahadevan, Ravi Prakash

详情
英文摘要

In this paper we derive, by two$-$scale convergence, periodically wrinked shell models starting from three dimensional linear elasticity, depending of the behaviour of the small parameter $\varepsilon>0$ and $p>1$, differents theories appear. We assume that the mid-surface of the shell is given by $\displaystyle ψ(x_1,x_2)+\varepsilon^pθ\left(\frac{x_1}{\varepsilon},\frac{x_2}{\varepsilon}\right)\vect{a}_{3}(x_1,x_2)$, where $θ$ is $[0,1)^2$-periodic function and $p=2$. We also assume that the strain energy of the shell has the Koiter's model.