ND-TNN: Tensor-Neural-Network Approximation for High-Dimensional Nonlocal Diffusion Models
ND-TNN:高维非局部扩散模型的张量神经网络逼近
Ziyue Cai, Zuoqiang Shi
AI总结 提出基于张量神经网络(TNN)的数值方法求解高维非局部扩散模型,利用TNN的张量积结构和高斯核的可分离性将高维积分降维,并给出Dirichlet和Neumann边界条件下的误差估计。
详情
我们研究了一种基于张量神经网络(TNN)架构的数值方法(该架构由\cite{wang2022tensor}引入),用于求解高维空间中的非局部扩散模型。TNN ansatz的张量积结构结合高斯核的可分离性,将非局部能量中的高维积分简化为低维积分的乘积,这些低维积分通过Gauss--Legendre求积公式计算;不可分离的源项和边界数据通过基于TNN的预处理步骤处理。对于Dirichlet边界条件,我们建立了渐近相容的$L^2$误差估计:\\[ \\|u_{\mathrm{loc}}-u_{\delta,p}\\|_{L^2(\Omega)} \le C\\!\left(\frac{\varepsilon_f}{\sqrt\delta} +\frac{\varepsilon_g}{\delta} +\frac{\varepsilon_u}{\sqrt\delta} +\eta_{\mathrm{opt}}\right) +C\sqrt\delta, \\] 其中$\varepsilon_f$、$\varepsilon_g$和$\varepsilon_u$是数据和试验类的逼近误差,$\eta_{\mathrm{opt}}$是优化残差。对于Neumann边界条件,$L^2$估计改进为$O(\varepsilon_f+\varepsilon_g/\sqrt\delta+\varepsilon_u +\eta_{\mathrm{opt}}+\delta)$,并通过平滑后处理步骤进一步获得$H^1$梯度估计。在高达$d=20$的张量积域上的数值实验支持了理论结果,在二维和三维L形域上的额外测试证明了该方法在分析覆盖的光滑域设置之外的实用鲁棒性。
We study a numerical method, built on the tensor neural network (TNN) architecture introduced in \cite{wang2022tensor}, for solving nonlocal diffusion models in high-dimensional spaces. The tensor-product structure of the TNN ansatz, combined with the separability of the Gaussian kernel, reduces the high-dimensional integrals in the nonlocal energy to products of low-dimensional integrals, which are evaluated by Gauss--Legendre quadrature; nonseparable source and boundary data are handled by a TNN-based preconditioning step. For the Dirichlet boundary condition, we establish the asymptotically compatible $L^2$ error estimate \[ \|u_{\mathrm{loc}}-u_{δ,p}\|_{L^2(Ω)} \le C\!\left(\frac{\varepsilon_f}{\sqrtδ} +\frac{\varepsilon_g}δ +\frac{\varepsilon_u}{\sqrtδ} +η_{\mathrm{opt}}\right) +C\sqrtδ, \] where $\varepsilon_f$, $\varepsilon_g$ and $\varepsilon_u$ are the data and trial-class approximation errors and $η_{\mathrm{opt}}$ is the optimization residual. For the Neumann boundary condition, the $L^2$ estimate is improved to $O(\varepsilon_f+\varepsilon_g/\sqrtδ+\varepsilon_u +η_{\mathrm{opt}}+δ)$, and an $H^1$ gradient estimate is further obtained through a smoothing post-processing step. Numerical experiments on tensor-product domains up to $d=20$ support the theoretical results, and additional tests on two- and three-dimensional $L$-shaped domains demonstrate the practical robustness of the method beyond the smooth-domain setting covered by the analysis.