arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.20504 2026-06-19 quant-ph cs.LG 新提交

Entropy Estimation in Multi-Qutrit Systems via Variational and Classical Neural Networks

多qutrit系统中基于变分和经典神经网络的熵估计

Sai Sakunthala Guddanti, Anil Prabhakar, Ria Rushin Joseph

AI总结 本文系统研究了多qutrit量子系统中von Neumann熵的估计,采用变分量子算法和经典卷积神经网络两种方法,发现VQA适用于小系统,而CNN在大系统中更具可扩展性和鲁棒性。

详情
AI中文摘要

我们使用两种互补方法——变分量子算法(VQAs)和经典卷积神经网络(CNNs),在理想(无噪声)量子模拟器上对多qutrit量子系统中的von Neumann熵估计进行了系统研究。对于最多三个qutrit的系统,我们构建并评估了11种硬件高效的SU(3)启发ansatzes。参数扫描表明,在存在足够纠缠的情况下,估计精度主要由可训练参数的数量决定。基于此研究,我们将后续实验的参数数量固定为约120,观察到纠缠门数量超过阈值后仅带来边际改进。对于更大的系统(二至五个qutrit),我们使用在张量积互无偏基测量结果上训练的CNN。该模型实现了准确且稳定的预测,并表现出随系统大小系统性改善的性能,其中二qutrit系统的误差最高,五qutrit系统的误差最低。值得注意的是,仅使用全状态层析所需测量的12.5%就足以使四和五qutrit系统的90百分位绝对误差达到约0.13-0.16 nat。CNN模型还对散粒噪声具有鲁棒性,并能很好地泛化到分布外状态。总体而言,在我们研究的模拟设置中,结果表明了实用方法的转变:VQAs对小系统有效,而基于CNN的估计器为更大的qutrit系统提供了更好的可扩展性和鲁棒性。

英文摘要

We present a systematic study of von Neumann entropy estimation in multi-qutrit quantum systems using two complementary approaches: variational quantum algorithms (VQAs) and classical convolutional neural networks (CNNs), evaluated using an ideal (noise-free) quantum simulator. For systems up to three qutrits, we construct and evaluate 11 hardware-efficient SU(3)-inspired ansatzes. A parameter sweep shows that estimation accuracy is primarily determined by the number of trainable parameters, provided sufficient entanglement is present. Based on this study, we fix the parameter count to approximately 120 for subsequent experiments, observing that increasing entangling-gate counts beyond a threshold yields only marginal improvements. For larger systems (two to five qutrits), we use a CNN trained on measurement outcomes from tensor-product mutually unbiased bases. The model achieves accurate and stable predictions and exhibits a systematic improvement in performance with system size, with the highest errors for two-qutrit systems and the lowest for five-qutrit systems. Notably, using only 12.5% of the measurements required for full state tomography is sufficient to reach 90th-percentile absolute errors of approximately 0.13-0.16 nats for both four- and five-qutrit systems. The CNN model is also robust to shot noise and generalizes well to out-of-distribution states. Overall, within the simulated settings studied here, our results indicate a transition in practical methods: VQAs are effective for small systems, while CNN-based estimators offer improved scalability and robustness for larger qutrit systems.

2606.20385 2026-06-19 quant-ph cs.NA math.NA 新提交

Sparse Configuration Interaction for the Electronic Schrödinger Equation Revisited: Complete Basis Set Limit Complexity and Quantum-Encoding Impact

电子薛定谔方程的稀疏组态相互作用再探:完备基组极限复杂度与量子编码影响

Michael Griebel, Jan Hamaekers

AI总结 本文重新审视电子薛定谔方程离散谱中本征函数的正则性结果,并研究其对逼近复杂度的影响,发现稀疏网格构造下收敛速率的主项与电子数无关,为经典和量子计算提供新编码优势。

详情
AI中文摘要

在本文中,我们重新审视了电子薛定谔方程离散谱中本征函数的正则性结果,并研究了它们对逼近复杂度的影响。特别地,对于完备基组极限的收敛性,可以证明主导代数指数中的维度灾难可以得到缓解。也就是说,对于一般的稀疏网格构造,关于自由度数目的收敛速率的主项与电子数无关。这些见解表明,对于电子薛定谔方程的经典数值求解器以及通过新的量子比特高效波函数编码的量子计算方法,都具有潜在的好处。

英文摘要

In this article we revisit regularity results for eigenfunctions in the discrete spectrum of the electronic Schrödinger equation and study their consequences for approximation complexity. In particular, for the convergence to the complete basis set limit, it can be shown that the curse of dimensionality in the leading algebraic exponent can be mitigated. That is, for general sparse grid constructions, the main term of the convergence rate with respect to the number of degrees of freedom is independent of the number of electrons. These insights indicate potential benefits for classical numerical solvers of the electronic Schrödinger equation and also for quantum-computing approaches through new qubit-efficient wavefunction encodings.

2606.20344 2026-06-19 quant-ph cs.DC cs.LG 新提交

Quantum ring all-reduce: communication and privacy advantages for distributed learning

量子环全归约:分布式学习的通信与隐私优势

María Gragera Garcés, Lirandë Pira

AI总结 提出量子环全归约协议,利用预共享纠缠和超密编码将每链路在线通信量减半,并通过验证纠缠实现信息论安全的可组合ε-安全聚合,同时获得通信与隐私优势。

Comments 23 pages, 1 figure

详情
AI中文摘要

机器学习模型已扩展到前所未有的规模,使得跨分布式设备的训练成为该领域的事实标准。在这项工作中,我们探讨量子通信如何使分布式训练在通信效率和信息论隐私方面都更具优势,适用于经典和量子学习模型。环全归约是大规模分布式训练的基础通信原语。我们提出一种量子版本,通过预共享纠缠和超密编码,将每链路在线通信量减少一个可证明最优的因子二,且无需改变学习模型或梯度计算。除了带宽优势,该原语还能实现任何经典协议在信息论上不可能实现的隐私保证,通过验证纠缠以GHZ副本的2倍开销实现可组合的ε-安全聚合。我们的混合量子-经典通信架构为大规模分布式训练同时带来通信和安全优势,无论学习本身是量子还是经典。最后,我们描述了在带宽约束下服务器到客户端通信中梯度冲突检测的量子优势,该设置出现在环全归约完成后,当完整梯度广播到外部客户端不可行时。该问题的两个变体呈现出不同的分离。对于基于间隔的对齐测试(\textsc{GapIP}_{\tau}),量子优势在间隔参数上是二次的:\widetilde{O}({\tau}^{-1}\log P) 量子比特对比 \widetilde{O}(\min(\{\tau}^{-2},P)) 比特。对于针对私有参数匹配的符号一致性审计(\textsc{TieAudit}_{\epsilon}),优势表现为通信复杂度的指数级分离:\Omega(\sqrt{P}) 比特,而 O({\epsilon}^{-2}\log P) 量子比特就足够了。

英文摘要

Machine learning models have scaled to unprecedented sizes, making training across distributed devices the de facto standard in the field. In this work, we explore how quantum communications can make distributed training both more communication-efficient and information-theoretically private, for both classical and quantum learning models. Ring all-reduce is the foundational communication primitive for large-scale distributed training. We present a quantum version that reduces per-link online communication by a provably optimal factor of two using pre-shared entanglement and superdense coding, without requiring the learning model or gradient computation to change. Beyond bandwidth, the primitive enables privacy guarantees that are information-theoretically impossible for any classical protocol, achieving composable ε-secure aggregation, via verified entanglement, at a 2x overhead in GHZ copies. Our hybrid quantum-classical communication architecture yields simultaneous communication and security advantages for large scale distributed training, regardless of whether the learning itself is quantum or classical. Finally, we characterise quantum advantages in gradient conflict detection for server-to-client communication under bandwidth constraints, a setting that arises after ring all-reduce is completed, when full gradient broadcast to external clients is infeasible. Two variants of the problem admit different separations. For margin-based alignment testing (\textsc{GapIP}_τ), the quantum advantage is quadratic in the margin parameter: \widetilde{O}(τ^{-1}\log P) qubits versus \widetilde{O}(\min(\τ^{-2},P)) bits. For sign-consistency auditing against a private parameter matching (\textsc{TieAudit}_ε), the advantage represents an exponential separation in communication complexity: Ω(\sqrt{P}) bits whereas O(ε^{-2}\log P) qubits suffice.

2606.19947 2026-06-19 quant-ph cs.LG 新提交

QMaxCal: Path-Space Regularization for Open Quantum Control via Girsanov's Theorem

QMaxCal: 基于 Girsanov 定理的开环量子控制路径空间正则化

Merijn Moody, Zier Mensch, Miranda C. N. Cheng, Peter G. Bolhuis, Max Welling

AI总结 针对开放量子系统退相干问题,利用 Girsanov 定理推导 KL 散度的可微估计器,提出两种正则化项以最小化退相干影响,在多种量子系统中优于未正则化的梯度方法和强化学习基线。

Comments 26 pages, 6 figures. ICML 2026 AI4Physics Workshop

详情
AI中文摘要

在存在退相干的条件下,可靠的量子控制需要能够对抗环境噪声对受控动力学影响的策略。连续监测下的开放量子系统产生经典测量记录,其漂移依赖于系统所经历的噪声;共享相同退相干通道的两个演化的记录仅在此漂移上有所不同,因此 Girsanov 定理给出了它们轨迹分布之间 KL 散度的闭式、可微估计器。我们用两个物理动机的参考度量实例化该估计器,得到两个正则化项,它们都将系统驱动到退相干效应最小的状态:Wiener KL (KL_W),在噪声模型的某些条件下经验上更有效;以及漂移方差正则化项 (R_DV),适用于所有噪声模型。两者在性质上不同于现有的控制通量或平滑性惩罚:它们惩罚控制对退相干通道的可观测后果,而非控制幅度本身。这些正则化项在一系列开放量子系统中优于未正则化的基于梯度和强化学习的基线——包括单量子比特和多量子比特基准测试,以及一个校准到已发表的 IBM Kingston 处理器快照的多量子比特链——在多个评估维度上:最终态保真度、对假设噪声模型失配的鲁棒性(在训练噪声下增益从 +17 个百分点增长到 2.5 倍噪声失配下的 +27 个百分点),以及禁止态的占据。正则化项将不保真度降低高达 50%,在校准的 IBM Kingston 链上获得约 16% 的增益。

英文摘要

Reliable quantum control in the presence of decoherence requires policies that combat the effect of environmental noise on the controlled dynamics. Open quantum systems under continuous monitoring generate classical measurement records whose drift depends on the noise experienced by the system; the records of two evolutions sharing the same decoherence channels differ only in this drift, so Girsanov's theorem yields a closed-form, differentiable estimator of the KL divergence between their trajectory distributions. We instantiate this estimator with two physically motivated reference measures, yielding two regularizers that both drive the system toward states where the effects of decoherence are minimal: the Wiener KL (KL_W), which is empirically more effective under certain conditions on the noise model, and the drift-variance regularizer (R_DV), which works for all noise models. Both are qualitatively distinct from existing penalties on control fluence or smoothness: they penalize the observable consequences of control on the decoherence channels rather than the control amplitude itself. The regularizers outperform unregularized gradient-based and reinforcement-learning baselines across a range of open quantum systems -- including single- and multi-qubit benchmarks and a multi-qubit chain calibrated to a published snapshot of the IBM Kingston processor -- along several axes of evaluation: final-state fidelity, robustness to mismatch in the assumed noise model (gains grow from +17 pp at training noise to +27 pp under 2.5x noise mismatch), and occupation of forbidden states. The regularizers reduce infidelity by up to 50%, with ~16% gains on the calibrated IBM Kingston chain.

2606.19551 2026-06-19 quant-ph cs.CR 新提交

Passive-User Bell-State Loop-Back Key Establishment without Quantum Detectors at the User Nodes

无量子探测器用户节点的贝尔态环回密钥建立

Luis Adrián Lizama-Pérez

AI总结 提出一种贝尔态扩展的环回量子密钥分发架构,使两个无量子探测器的被动用户通过单个主动站实现密钥建立,利用贝尔态测量和有效泡利操作组合实现密钥协商。

详情
AI中文摘要

我们提出并分析了一种贝尔态扩展的环回量子密钥分发架构,用于在两个不需要量子发射器或量子探测器的被动用户之间建立密钥。在所提出的设置中,单个主动站Alice提供纠缠态基础设施,保留初始制备的贝尔对中的一个量子比特,并将旅行子系统发送给两个被动用户(记为$B_1$和$B_2$)。每个被动用户对同一个旅行子系统施加一个局部泡利操作,使得Alice观察到的操作仅为有效组合$U_{\mathrm{eff}}=U_2U_1$。子系统返回后,Alice执行贝尔态测量,并利用她对初始贝尔态的私有知识,确定性地识别出有效泡利操作。然而,当局部选择均匀且独立时,单个因子$U_1$和$U_2$对Alice在代数上保持隐藏。公开的有效操作充当类似奇偶校验的约束:每个被动用户可以根据自己的私有选择推断另一个用户施加的操作,而主动站只知道全局组合。这种构造将被动用户环回QKD的基本分布式变换机制转移到纠缠态领域。与单量子比特被动用户方案(其有用事件本质上是后选择的)不同,贝尔态版本主要受限于贝尔态测量的成功概率。我们讨论了协议的代数结构、其作为基础设施辅助的介导密钥建立机制的解释,以及保护被动泡利调制器免受主动注入或木马型攻击所需的物理假设。

英文摘要

We propose and analyze a Bell-state extension of the Loop-Back quantum key distribution architecture for secret-key establishment between two passive users that do not require quantum transmitters or quantum detectors. In the proposed setting, a single active station, Alice, provides the entangled-state infrastructure, retains one qubit of an initially prepared Bell pair, and sends the traveling subsystem through two passive users, denoted by $B_1$ and $B_2$. Each passive user applies a local Pauli operation to the same traveling subsystem, so that the operation observed by Alice is only the effective composition $U_{\mathrm{eff}}=U_2U_1$. After the subsystem returns, Alice performs a Bell-state measurement and, using her private knowledge of the initial Bell state, deterministically identifies the effective Pauli operation. However, the individual factors $U_1$ and $U_2$ remain algebraically hidden from Alice whenever the local choices are uniformly and independently selected. The public effective operation acts as a parity-like constraint: each passive user can infer the operation applied by the other from its own private choice, while the active station learns only the global composition. This construction transfers the essential distributed-transformation mechanism of passive-user Loop-Back QKD to the entangled-state regime. Unlike single-qubit passive-user schemes, whose useful events are intrinsically post-selected, the Bell-state version is limited primarily by the success probability of the Bell-state measurement. We discuss the algebraic structure of the protocol, its interpretation as an infrastructure-assisted mediated key-establishment mechanism, and the physical assumptions required to protect passive Pauli modulators against active injection or Trojan-horse-type attacks.

2606.19486 2026-06-19 quant-ph cs.IT cs.LG math.IT 新提交

Optimal Ansatz-free Hamiltonian Learning In Situ

无假设哈密顿量的最优原位学习

Taiqi Zhou, Weiyuan Gong

AI总结 提出一种无需控制、无需辅助比特的算法,仅用泡利乘积态制备和测量,以最优总演化时间学习无假设哈密顿量,适用于近中期量子实验。

Comments 51 pages, 2 figures

详情
AI中文摘要

描述控制量子系统的哈密顿量特征,是量子设备校准、信号传感和纠错的基本子程序。近期工作提出了协议,通过实时演化实现无假设哈密顿量的最优海森堡极限学习,无需完全指定相互作用结构。然而,这些协议依赖于带有交错探测和控制的深电路以及极短的时间分辨率,使其难以在近中期原位量子实验中实现。本文提出一种计算高效、无需控制、无需辅助比特的算法,仅使用泡利乘积态制备和测量,在总演化时间 $\Theta(\frac{\Lambda}{\epsilon^2}\log(\frac{\Lambda}{\epsilon}))$ 内学习无假设哈密顿量 $H$(满足 $||H||\leq\Lambda$)。该算法的演化时间成本对于任何无控制协议是最优的,因为我们进一步证明了 $\Omega(\frac{\Lambda}{\epsilon^2}\log(\frac{\Lambda}{\epsilon}))$ 的下界。技术上,我们的方法引入了一个随机采样框架,结合了带限核时间采样和用于哈密顿量结构学习的位移筛。特征探测时间分辨率仅依赖于 $\Lambda$ 而非 $\varepsilon$,这使得我们的协议在传感和校准的高精度场景中特别有吸引力。我们还表明,当哈密顿量在校准后是局域的时,该算法在存在状态制备和测量(SPAM)噪声的情况下保持相同的渐近总演化时间。我们的结果展示了实验友好型哈密顿量学习的基本成本,并为近中期量子平台的严格原位表征提供了实用途径。

英文摘要

Characterizing the features of a Hamiltonian that governs a quantum system serves as a fundamental subroutine of quantum device calibration, signal sensing, and error correction. Recent works proposed protocols have achieved the optimal Heisenberg-limited scaling learning ansatz-free Hamiltonians from their real-time evolutions without fully specifying interaction structures. However, these protocols rely on both deep circuits with interleaving probes and control, and extremely short time resolution, making them difficult to implement on near- and intermediate-term in situ quantum experiments. In this work, we propose a computationally efficient, control-free, and ancilla-free algorithm that uses only Pauli product state preparation and measurement, and learns an ansatz-free Hamiltonian $H$ with $||H||\leqΛ$ in total evolution time of $Θ(\fracΛ{ε^2}\log(\fracΛε))$. The evolution time cost of our algorithm is optimal for any control-free protocols as we further prove a lower bound of $Ω(\fracΛ{ε^2}\log(\fracΛε))$. Technically, our method introduces a randomized-sampling framework that combines band-limited kernel-based time sampling with a displacement sieve for Hamiltonian structure learning. The characteristic probe time resolution depends only on $Λ$ instead of $\varepsilon$, which makes our protocol especially appealing in the high-precision regime for sensing and calibration applications. We also show that the algorithm maintains the same asymptotic total evolution time in the presence of state-preparation-and-measurement (SPAM) noise when the Hamiltonian is local after calibration. Our results demonstrate the fundamental cost of experimentally friendly Hamiltonian learning and provide a practical route to rigorous in situ characterization of near-term quantum platforms.

2606.20467 2026-06-19 cs.LG cs.NA math.NA physics.comp-ph 新提交

Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks

智能符号搜索:超越手工表达式、网格和神经网络的PDE特征化

Zongmin Yu, Liu Yang

发表机构 * National University of Singapore(新加坡国立大学)

AI总结 提出ASYS框架,通过智能体将PDE理论转化为可微分符号程序,结合进化搜索和梯度优化自动发现解析形式或近似,在多个问题中生成可解释表示。

详情
AI中文摘要

数学家通过数学结构而非计算值表来理解PDE解。历史上,这需要针对每个问题单独进行数学分析。数值模拟和神经网络都不能直接产生这些结构。我们提出智能符号搜索(ASYS),一种先验引导框架,其中智能体将PDE理论、公共问题约束和累积搜索经验转化为可测试的可微分符号程序。数学形式在进化搜索下被精炼,而其连续参数通过基于梯度的优化拟合。这使得搜索成为归纳偏置注入的自动化形式,而非盲目的符号回归。对于已知解析形式的问题,ASYS自然恢复这些形式;对于其他问题,ASYS构建解析近似,可引导数学家进行进一步分析。在我们的实验中,跨越五个问题,包括有界动力学、有限时间爆破和自由边界聚焦,ASYS产生了可解释表示,包括Allen-Cahn 2D动力学的几何界面公式和Keller-Segel趋化爆破的九参数收缩律,这些场景中先前没有闭式描述。ASYS展示了表征PDE解的新范式的可能性,超越了手工解析解、基于网格的数值解和神经网络近似。

英文摘要

Mathematicians understand a PDE solution through mathematical structures rather than tables of computed values. Historically, this has been the product of mathematical analysis, carried out by hand for each problem individually. Neither numerical simulation nor neural networks produce those structures directly. We propose Agentic Symbolic Search (ASYS), a prior-guided framework in which an agent translates PDE theory, public problem constraints, and accumulated search experience into testable differentiable symbolic programs. The mathematical forms are refined under evolutionary search, while their continuous parameters are fit by gradient-based optimization. This makes the search an automated form of inductive-bias injection rather than blind symbolic regression. For problems with known analytical forms, ASYS recovers these forms naturally; for other problems, ASYS constructs analytical approximations which can guide mathematicians toward further analysis. In our experiments, across five problems spanning bounded dynamics, finite-time blow-up, and free-boundary focusing, ASYS produces interpretable representations, including a geometric interface formula for Allen-Cahn 2D dynamics and a nine-parameter contraction law for Keller-Segel chemotactic blow-up, in settings where no closed-form description was previously available. ASYS shows the possibility of a new paradigm for characterizing PDE solutions, beyond handcrafted analytical solutions, mesh-based numerical solutions, and neural network approximations.

2606.20329 2026-06-19 cs.LG physics.geo-ph 新提交

Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems

约束混合建模预测土壤系统中微生物动态与有机质周转

Paul Collart, Juergen Gall, Andrea Schnepf, Holger Pagel, Lars Doorenbos

发表机构 * Agrosphere (IBG-3), Forschungszentrum Jülich GmbH(农业圈(IBG-3),于利希研究中心) Institute of Crop Science and Resource Conservation, University of Bonn(波恩大学作物科学与资源保护研究所) Institute of Computer Science, University of Bonn(波恩大学计算机科学研究所) Lamarr Institute for Machine Learning and Artificial Intelligence(拉马尔机器学习和人工智能研究所)

AI总结 提出首个混合建模框架,利用神经网络从宏基因组推断功能性状预测过程模型参数,并整合生态理论约束,有效预测微生物动态和有机质周转。

Comments Accepted at ICML '26

详情
AI中文摘要

土壤微生物控制有机质循环,并在很大程度上决定土壤系统如何应对和缓解气候变化及环境威胁。因此,在基于过程的土壤模型中表示微生物动态对于预测土壤碳循环至关重要,尽管从数据中获取信息极具挑战性。改进参数化的一个有前景的方法是整合基因组数据,然而建模基因组与微生物驱动过程之间复杂且未知的关系是一个未解决的问题。在这项工作中,我们提出了第一个混合建模框架,用于从基于DNA测序数据的宏基因组推断功能性状中推导基于过程的土壤有机质周转模型的生物动力学参数值。我们的模型通过神经网络从基因组性状数据预测过程模型的生物动力学参数,并整合来自生态理论和文献的约束,以确保即使是非观测状态变量也能实现逼真的行为。我们在不同复杂度的合成基因组性状数据集和真实数据上评估了我们的方法,结果表明,我们的方法在多个基线上提高了性能,并有效学习了过程模型中不可测量组分的动态,即使是在小训练数据集上也是如此。

英文摘要

Soil microorganisms control organic matter cycling and largely determine how soil systems can cope with and mitigate climate change and environmental threats. Representing microbial dynamics in process-based soil models is therefore critical to predict carbon cycling in soils, albeit highly challenging to inform from data. One promising approach to improve their parametrisation is the integration of genomic data, yet modelling the complex and unknown relationship between genomes and the processes the microbes are driving is an unsolved problem. In this work, we present the first hybrid modeling framework for deriving biokinetic parameter values of a process-based soil organic matter turnover model from metagenome-inferred functional traits based on DNA sequencing data. Our model predicts biokinetic parameters of the process-based model from genomic trait data with a neural network and integrates constraints from ecological theory and literature to ensure realistic behavior, even of non-observed state variables. We evaluate our method on synthetic genomic trait datasets of varying complexity and on real data, showing that our approach improves performance over multiple baselines and learns the dynamics of unmeasurable components of the process-based model effectively, even for small training datasets.

2606.20326 2026-06-19 cs.LG physics.comp-ph 新提交

Quantum-classical physics-informed Kolmogorov-Arnold networks for PDEs

量子-经典物理信息Kolmogorov-Arnold网络求解偏微分方程

Xiang Rao, Yuxuan Shen

AI总结 提出QCPIKAN,首个量子-经典物理信息Kolmogorov-Arnold网络,结合Chebyshev多项式KAN层和参数化量子电路,通过嵌入物理约束加速高频误差指数收敛并抑制数值色散,在多孔介质渗流场景中优于现有量子-经典PINN。

详情
AI中文摘要

我们开发了QCPIKAN,这是首个旨在求解偏微分方程(PDE)的量子-经典物理信息Kolmogorov-Arnold网络。该混合框架基于Chebyshev多项式KAN层和参数化量子电路构建,将物理约束嵌入训练损失中以强制执行物理一致性。我们的基于逼近论的理论研究证明,该设计将高频误差收敛加速至指数速率,并有效抑制数值色散。我们在多孔介质中的三个典型渗流场景(包括单相流、组分运移和两相流)上验证了该框架。与现有的量子-经典物理信息神经网络相比,QCPIKAN在全局预测精度、局部误差控制、动态演化跟踪和驱替前沿定位方面均实现了优越性能。这项工作为求解复杂PDE提供了一种鲁棒且高效的替代方案。

英文摘要

We develop QCPIKAN, the first quantum-classical physics-informed Kolmogorov-Arnold network designed to solve partial differential equations (PDEs). Built upon Chebyshev-polynomial KAN layers and parameterized quantum circuits, this hybrid framework embeds physical constraints into the training loss to enforce physical consistency. Our theoretical investigations grounded in approximation theory prove that this design accelerates high-frequency error convergence to an exponential rate and effectively mitigates numerical dispersion. We validate the framework across three typical seepage scenarios in porous media, including single-phase flow, component transport and two-phase flow. Compared with existing quantum-classical physics-informed neural networks, QCPIKAN achieves superior performance in global prediction accuracy, local error control, dynamic evolution tracking and displacement front localization. This work provides a robust and efficient alternative for solving complex PDEs.

2606.19912 2026-06-19 math.NA cs.LG cs.NA physics.comp-ph 新提交

Structure-Oriented Randomized Neural Networks for Poisson-Nernst-Planck and Poisson-Nernst-Planck-Navier-Stokes Systems

面向结构的随机神经网络用于泊松-能斯特-普朗克和泊松-能斯特-普朗克-纳维-斯托克斯系统

Yunlong Li, Fei Wang

AI总结 提出结构导向随机神经网络(SO-RaNN)框架,通过解耦线性化子问题、逐点截断保持浓度正性、离散质量缩放因子和SAV后处理修正,实现PNP和PNP-NS系统的高效求解,并理论推导残差估计和收敛性。

详情
AI中文摘要

我们开发了一种面向结构的随机神经网络框架,称为SO-RaNN,用于泊松-能斯特-普朗克(PNP)系统和泊松-能斯特-普朗克-纳维-斯托克斯(PNP-NS)系统。解耦的线性化子问题通过随机神经网络在时空框架中迭代求解。对于浓度变量,使用逐点截断在数值层面强制正性,并在选定的修正时刻计算离散质量缩放因子并在时间上插值,以确保在这些时刻精确匹配质量并促进近似质量守恒。为了引入辅助离散耗散机制,我们进一步采用SAV型后处理修正,该修正使得SAV辅助变量在理想SAV更新下具有单调性。对于PNP-NS系统,使用保结构随机神经网络(SP-RaNN)处理速度场,使得速度近似通过构造满足逐点不可压缩约束。在理论方面,我们推导了线性化子问题的原始未修正RaNN求解器的残差估计,为PNP系统的原始外Picard迭代制定了条件性局部时间收敛结果,并分析了数值层面的正性修正以及质量修正和SAV后处理步骤。对于PNP-NS系统,我们建立了SP-RaNN空间的逼近结果,并给出了相应线性化Oseen型问题的条件性误差陈述。数值实验展示了源驱动制造测试中的逼近精度,并说明了预期中的数值层面正性修正、选定时刻质量匹配、基于最终规范固定势的计算自由能曲线以及基准测试中的无散度逼近。

英文摘要

We develop a structure-oriented randomized neural network framework, termed SO-RaNN, for the Poisson-Nernst-Planck (PNP) system and the Poisson-Nernst-Planck-Navier-Stokes (PNP-NS) system. The decoupled linearized subproblems are solved iteratively by randomized neural networks in a space-time framework. For the concentration variables, a pointwise cut-off is used to enforce positivity at the value level, and discrete mass-scaling factors are computed at selected correction instants and interpolated in time, so as to ensure exact mass matching at those instants and to promote approximate mass preservation between them. To introduce an auxiliary discrete dissipation mechanism, we further employ an SAV-type post-processing correction, which yields monotonicity of the SAV auxiliary variable under the ideal SAV update. For the PNP-NS system, a structure-preserving randomized neural network (SP-RaNN) is used for the velocity field, so that the velocity approximation satisfies the incompressibility constraint pointwise by construction. On the theoretical side, we derive residual-based estimates for the raw, uncorrected RaNN solvers of the linearized subproblems, formulate a conditional local-in-time convergence result for the raw outer Picard iteration of the PNP system, and analyze the value-level positivity correction together with the mass-correction and SAV post-processing steps. For the PNP-NS system, we establish an approximation result for the SP-RaNN space and provide a conditional error statement for the corresponding linearized Oseen-type problem. Numerical experiments demonstrate approximation accuracy in the source-driven manufactured tests and illustrate the intended value-level positivity correction, selected-time mass matching, computed free-energy curves based on the final gauge-fixed potential, and divergence-free approximation in benchmark tests.

2606.19853 2026-06-19 cs.LG physics.comp-ph 新提交

Physics-Informed Neural Network with Squeeze-Excitation-like Attention

带有挤压-激励式注意力的物理信息神经网络

Yun-Fei Song, Long-Gang Pang, Fu-Peng Li, Jun-Jie Zhang

发表机构 * Key Laboratory of Quark and Lepton Physics (MOE) & Institute of Particle Physics, Central China Normal University(华中师范大学夸克与轻子物理教育部重点实验室及粒子物理研究所) Artificial Intelligence and Computational Physics Research Center, Central China Normal University(华中师范大学人工智能与计算物理研究中心) Key Laboratory of Nuclear Physics and Ion-beam Application (MOE) & Institute of Modern Physics, Fudan University(复旦大学核物理与离子束应用教育部重点实验室及现代物理研究所) Shanghai Research Center for Theoretical Nuclear Physics, NSFC and Fudan University(国家自然科学基金委员会-复旦大学上海理论核物理研究中心) Northwest Institute of Nuclear Technology(西北核技术研究所)

AI总结 提出SEA-PINN架构,通过挤压-激励式注意力机制动态调整神经元重要性,实现稳定初始化,在20个基准问题中17个方差极小,无需傅里叶嵌入或周期激活即可达到与TSA-PINN相当的精度,并可作为轻量插件提升其他PINN性能。

Comments 15 pages, 6 figures

详情
AI中文摘要

我们引入了SEA-PINN,一种新颖的架构,它将类似挤压-激励的注意力机制融入物理信息神经网络,以动态重新校准各层神经元的重要性。SEA-PINN的一个关键特性是其高度稳定的初始化。在20个基准问题中的17个上,SEA-PINN表现出几乎可忽略的方差和显著降低的初始损失,为优化建立了一个准确定且有利的起点。值得注意的是,在没有采用傅里叶特征嵌入或周期激活函数的情况下,SEA-PINN与TSA-PINN(一种通过正弦激活中的可学习频率专门为高频问题设计的模型)相比,达到了具有竞争力的精度(在高频案例7上,相对于FNN-PINN的改进分别为83%和90%)。此外,将SEA-PINN集成到TSA-PINN中使性能提升了42.49%。这些结果强调了SEA-PINN作为一种轻量级插件模块,能够增强非线性表示能力,促进更稳健和高效的收敛,并提高物理信息学习的整体可靠性。

英文摘要

We introduce SEA-PINN, a novel architecture that incorporates a Squeeze-Excitation-like attention mechanism into physics-informed neural networks to dynamically recalibrate the importance of neurons across layers. A key feature of SEA-PINN is its highly stable initialization. On 17 out of 20 benchmark problems, SEA-PINN exhibit nearly negligible variance and significantly reduced initial loss, establishing a quasi-deterministic and favorable starting point for optimization. Notably, without employing Fourier feature embeddings or periodic activation functions, SEA-PINN attained competitive accuracy (83\% vs. 90\% improvement relative to FNN-PINN on the high-frequency case 7) as compared with TSA-PINN-a model specifically engineered for high-frequency problems via learnable frequencies in sinusoidal activations. Furthermore, integrating SEA-PINN into TSA-PINN boosted performance by 42.49\%. These results underscore SEA-PINN as a lightweight plug-in module that enhances nonlinear representation power, promotes more robust and efficient convergence, and strengthens the overall reliability of physics-informed learning.

2606.19767 2026-06-19 eess.IV cs.CV physics.med-ph 新提交

Contour-Constrained Deformable Registration with Parameter Characterization for Head and Neck Surgical Guidance

面向头颈外科引导的带参数表征的轮廓约束可变形配准

Qingyun Yang, Jon S. Heiselman, Ayberk Acar, Morgan J. Ringel, Michael I. Miga, Matthieu Chabanas, Michael C. Topf, Jie Ying Wu

AI总结 提出一种基于正则化Kelvinlet基函数的可变形配准框架,通过表面点云、基准标记和轮廓约束校正术后组织变形,在9例头颈标本上将配准误差从刚性配准的11.11mm降至5.62mm,降幅达49.41%。

详情
AI中文摘要

全球每年新增89万例头颈部鳞状细胞癌,其复发率在实体恶性肿瘤中最高。尽管冰冻切片分析是术中切缘评估的标准方法,但由于切除标本与切除床之间的对准不精确,加上切除后黏膜组织收缩,准确地将检测到的阳性切缘重新定位到切除床上仍然具有挑战性。我们提出了一种生物力学驱动的可变形配准框架,用于校正术后组织变形以提供术中引导。该方法基于正则化Kelvinlet基函数的可变形配准方法,将3D标本网格配准到术中切除床点云。配准匹配表面点云、基准标记和边界轮廓约束,直接惩罚标本与切除床边界之间的垂直距离一致性。在来自皮肤、颊粘膜和舌部位的9个标本上,使用刚性配准的整体平均目标配准误差为$11.11 \pm 4.07$ mm,使用无轮廓约束的可变形配准则降至$8.20 \pm 2.68$ mm(降低26.19%)。所提出的轮廓约束可变形配准进一步将误差降至$5.62 \pm 2.28$ mm,相对于刚性配准降低了49.41%。我们在临床最具挑战性的舌标本中观察到最大降幅。我们还进行了系统的两阶段参数搜索,以表征表面配准、基准对应、轮廓约束和应变能正则化的相对重要性。该搜索表明,对于具有大侧向变形的组织类型,轮廓权重主导配准精度,而算法在广泛的参数组合范围内均可运行。

英文摘要

With 890,000 annual new cases globally, head and neck squamous cell carcinoma has one of the highest recurrence rates among solid malignancies. Although frozen section analysis is the standard of care for intraoperative margin assessment, accurately relocating detected positive margins on the resection bed remains challenging due to imprecise alignment between resected specimens and their resection bed, compounded by post-resection mucosal tissue shrinkage. We present a biomechanics-driven deformable registration framework that corrects post-resection tissue deformation to provide intraoperative guidance. Our approach registers 3D specimen meshes to intraoperative resection bed point clouds using a deformable registration approach based on regularized Kelvinlet basis functions. The registration matches surface point clouds, fiducial landmarks, and boundary contour constraints that directly penalize perpendicular distance-to-agreement between specimen and resection bed boundaries. Across nine specimens from skin, buccal mucosa, and tongue sites, the overall mean target registration error was $11.11 \pm 4.07$ mm using rigid registration, which decreased to $8.20 \pm 2.68$ mm (26.19\% reduction) using deformable registration without contour constraint. The proposed contour-constrained deformable registration further reduced the error to $5.62 \pm 2.28$ mm, a 49.41\% reduction relative to rigid registration. We observed the largest reduction in the most clinically challenging tongue specimens. We also performed a systematic two-stage parameter search to characterize the relative importance of surface alignment, fiducial correspondences, contour constraint, and strain energy regularization. This search revealed that contour weighting dominates registration accuracy for tissue types with large lateral deformation, while the algorithm operates over a broad range of parameter combinations.

2606.19674 2026-06-19 cs.ET physics.optics 新提交

Design Considerations for Phase Modulation in Testable Photonic Systems and Co-packaged Optics

可测试光子系统和共封装光学中相位调制的设计考虑

Pratishtha Agnihotri, Priyank Kalla, Steve Blair

AI总结 本文比较了硅光子集成电路中热致相位调制和载流子电调制在Mach-Zehnder和微环调制器中的性能,分析了消光比、调谐效率、功耗和调制带宽等关键权衡,为可测试光子系统的相位调制策略选择提供设计指导。

Comments This article is a part of the PhD thesis dissertation published in 2025 (https://www.proquest.com/openview/5b04e74f2008099c8c2ee9975f26482f/1?pq-origsite=gscholar&cbl=18750&diss=y)

详情
AI中文摘要

随着硅光子集成电路(PIC)复杂度的增加,测试和校准越来越依赖于有效的相位调制机制。本文比较了Mach-Zehnder和微环调制器中的热致相位调制和基于载流子的电调制。这些器件在消光比、调谐效率、功耗和调制带宽方面进行了设计和评估。研究确定了调制速度、能量消耗和调谐可控性之间的关键权衡,这些权衡直接影响这些方法在测试信号生成和校准任务中的适用性。结果突出了热调制和电调制在不同工作区域中的相对优势和局限性。这些发现为在具有集成测试和校准需求的可扩展硅光子系统中选择相位调制策略提供了实用的设计指导。

英文摘要

As silicon photonic integrated circuits (PICs) scale in complexity, testing and calibration increasingly depend on effective phase modulation mechanisms. This work compares thermally induced phase modulation and carrier-based electrical modulation in Mach-Zehnder and microring modulators. The devices are designed and evaluated for extinction ratio, tuning efficiency, power consumption, and modulation bandwidth. The study identifies key trade-offs among modulation speed, energy consumption, and tuning controllability that directly influence the suitability of these methods for test signal generation and calibration tasks. The results highlight the relative advantages and limitations of thermal and electrical approaches across different operating regimes. These findings provide practical design guidance for selecting phase modulation strategies in scalable silicon photonic systems with integrated test and calibration requirements.

2606.19562 2026-06-19 cs.LG physics.flu-dyn 新提交

Advances in Scientific Machine Learning for Coupled Fluid Flow and Transport

耦合流体流动与输运的科学机器学习进展

Gabriel F. Barros, Rômulo M. Silva, Alvaro L. G. A. Coutinho

发表机构 * COPPE - Federal University of Rio de Janeiro - UFRJ(里约热内卢联邦大学COPPE学院)

AI总结 综述科学机器学习在耦合流体流动与输运问题中的进展,包括基于SVD的线性降阶和PINNs、β-VAE等神经网络方法,并展示其在浊流和热对流中的应用。

详情
AI中文摘要

本章回顾了科学机器学习(SciML)在模拟由不可压缩Navier-Stokes方程和标量输运方程控制的耦合流体流动与输运现象方面的最新进展。这类系统出现在浊流和热对流等应用中,具有强非线性耦合和多尺度行为,使得高保真模拟计算成本高昂。为此,本章调查了构建高效代理模型的最新SciML方法,包括基于奇异值分解的线性降阶技术(如动态模态分解)和非线性神经网络方法(如物理信息神经网络(PINNs)和β-变分自编码器(β-VAEs))。首先介绍了作者将这些模型与高性能计算策略相结合的工作,包括自适应网格细化/粗化(AMR/C)和科学浮点数据压缩。然后提出了两个新贡献:通过PINNs对浊流进行代理建模,以及使用β-VAEs从热流中提取解缠的非线性模态。控制方程和代表性基准(包括锁交换流和Rayleigh-Bénard对流)说明了这些方法。本章篇幅较长,涵盖了耦合流体流动的数学和物理基础以及最先进建模的计算方面。总体而言,它展示了SciML如何在特定数据范围和建模假设下,实现复杂耦合系统的快速、精确近似,同时相对于全阶模拟大幅降低计算成本。实时预测和不确定性量化等更广泛的能力仍然是活跃的研究方向,其可行性在很大程度上取决于具体问题。

英文摘要

This chapter reviews recent advances in Scientific Machine Learning (SciML) for modeling coupled fluid flow and transport phenomena governed by the incompressible Navier-Stokes and scalar transport equations. Such systems, found in applications like turbidity currents and thermal convection, feature strong nonlinear coupling and multiscale behavior that make high-fidelity simulations computationally expensive. To address this, the chapter surveys state-of-the-art SciML methods for building efficient surrogate models, including linear reduced-order techniques based on Singular Value Decomposition (such as Dynamic Mode Decomposition) and nonlinear neural network approaches like Physics-Informed Neural Networks (PINNs) and $β$-Variational Autoencoders ($β$-VAEs). It first covers the authors' work combining these models with High Performance Computing strategies, including Adaptive Mesh Refinement/Coarsening (AMR/C) and scientific floating-point data compression. It then presents two new contributions: surrogate modeling of turbidity currents via PINNs, and the extraction of disentangled nonlinear modes from thermal flows using $β$-VAEs. Governing equations and representative benchmarks, including lock-exchange flows and Rayleigh-Bénard convection, illustrate these methodologies. The chapter is intentionally long, covering both the mathematical and physical foundations of coupled fluid flow and the computational aspects of state-of-the-art modeling. Overall, it demonstrates how SciML enables fast, accurate approximations of complex coupled systems within the specific data regimes and modeling assumptions considered, while substantially reducing computational cost relative to full-order simulations. Broader capabilities such as real-time prediction and uncertainty quantification remain active research directions whose feasibility depends strongly on the problem at hand.

2606.20025 2026-06-19 physics.geo-ph cs.NA math.NA 新提交

Acceleration methods for the planar 3D ILSA hydraulic fracturing model

平面3D ILSA水力压裂模型的加速方法

V. I. Shukalo, A. V. Valov, A. N. Baykin

AI总结 针对平面3D ILSA水力压裂模型计算成本高的问题,提出统一迭代方案、矩阵分裂、Anderson加速和预测-校正等加速策略,在保持精度下实现平均4倍加速,最高11倍。

Comments 56 pages, 35 figures. Submitted for publication

详情
AI中文摘要

水力压裂的平面3D模型在具有限制性几何假设的模型和全3D模拟器之间提供了实用的平衡,能够以适中的计算成本捕捉具有任意平面足迹的裂缝。然而,诸如处理设计优化和微型压裂测试解释等应用需要大量的模拟集合,平面3D模型的成本仍然是显著瓶颈。本文提出了平面3D隐式水平集算法(ILSA)的加速策略,以减少模拟运行时间同时保持数值精度。引入了一个统一的平面3D ILSA方案,将弹性流体动力学求解器和前沿追踪算法的嵌套循环合并为单个迭代过程。对线性化的弹性流体动力学系统应用矩阵分裂方法,将弹性算子的稠密部分移到右侧,产生一个可以更高效求解的稀疏系统矩阵。将Anderson加速纳入弹性流体动力学系统的求解中,以改善在不同裂缝几何形状下的收敛性。此外,结合所提出的方法检查了预测-校正方案,以评估它们的组合效果。在五个基准案例上,分别和组合评估了每种技术在参考和统一平面3D ILSA方案上的表现。数值实验表明,仅统一方案就实现了平均2.5倍的加速,对于沙漏几何形状达到5.7倍。所有技术的组合应用实现了平均4倍的加速,对于沙漏案例高达11倍,与参考方案相比,裂缝开度的相对差异低于5%。

英文摘要

Planar 3D models of hydraulic fracturing provide a practical balance between models with restrictive geometric assumptions and fully 3D simulators, capturing fractures with arbitrary planar footprints at moderate computational cost. Nevertheless, applications such as treatment design optimization and mini-frac test interpretation require large ensembles of simulations, for which the cost of planar 3D models remains a significant bottleneck. This work presents acceleration strategies for the planar 3D Implicit Level Set Algorithm (ILSA) to reduce simulation runtime while preserving numerical accuracy. A unified planar 3D ILSA scheme that consolidates the nested loops of the elastohydrodynamic solver and the front tracking algorithm into a single iterative process is introduced. A matrix splitting approach is applied to the linearized elastohydrodynamic system, moving the dense part of the elasticity operator to the right-hand side and yielding a sparse system matrix that can be solved more efficiently. Anderson acceleration is incorporated into the solution of the elastohydrodynamic system to improve convergence under varying fracture geometry. Additionally, a predictor--corrector scheme is examined with the proposed methods to assess their combined effect. Each technique is evaluated individually and in combination on both the reference and unified planar 3D ILSA schemes across five benchmark cases. Numerical experiments demonstrate that the unified scheme alone delivers an average 2.5x speedup, reaching 5.7x for the sandglass geometry. The combined application of all techniques achieves an average 4x speedup and up to 11x for the sandglass case, with the relative discrepancy in fracture aperture below 5% compared with the reference scheme.

2606.20485 2026-06-19 q-fin.RM cs.AI nlin.AO physics.soc-ph 新提交

Optimal Order of Multi-Agent and General Many-Body Systems

多智能体与一般多体系统的最优序

Jake J. Xia

AI总结 提出一个分析多智能体系统的通用框架,基于智能体的权力和响应函数,推导出宏观性质,并引入风险偏好系数研究增长与韧性之间的权衡,得出最优有序度。

Comments Key Words: Many body systems, multi agent crowd interactions, feedback loops, agent power, response function, utility function, risk appetite, order, optimal order, fragility, mobility, synchronization, useful energy, entropy, concentration, correlation, task dependency, receiver dependency, collective intelligence, AI model scaling law

详情
AI中文摘要

本文开发了一个通用框架,用于分析具有智能体行动与集体观测之间反馈回路的多智能体系统。该框架建立在两个基本的智能体层面变量上:权力,衡量智能体对集体结果的影响;以及响应函数,决定智能体如何对观测做出反应。我们推导了宏观性质(包括总权力、有用权力、熵、有序度、脆弱性和流动性)如何从异质智能体的这两个变量中涌现。为了研究增长与韧性之间的权衡,我们引入了一个由风险偏好系数参数化的系统层面效用函数,并推导出一个平衡生产力、稳定性和适应性的最优有序度。分析表明,更强的同步可以增加集体产出,但也可能增加系统脆弱性并降低流动性。我们进一步论证,有序度、熵、信息和有用能量是任务依赖和系统相对的概念,其含义取决于系统的目标。通过测量和设计智能体的权力分布和响应函数,可能更好地理解、预测和优化集体行为,并识别集体智慧和最优序出现的条件。

英文摘要

This paper develops a general framework for analyzing multi-agent systems with feedback loops between agents actions and collective observations. The framework is built on two fundamental agent-level variables: power, which measures agent influence on collective outcomes, and response functions, which determine how agents react to observations. We derive how macroscopic properties, including total power, useful power, entropy, order, fragility, and mobility, emerge from these two variables of heterogeneous agents. To study the trade off between growth and resilience, we introduce a system-level utility function parameterized by a risk-appetite coefficient and derive an optimal degree of order that balances productivity, stability, and adaptability. The analysis suggests that stronger synchronization can increase collective output but may also increase systemic fragility and reduce mobility. We further argue that order, entropy, information, and useful energy are task-dependent and system-relative concepts whose meanings depend on the objectives of the system. By measuring and designing agent power distributions and response functions, it may be possible to better understand, predict, and optimize collective behavior and identify the conditions under which collective intelligence and optimal order emerge.

2606.19488 2026-06-19 physics.soc-ph cs.SI nlin.AO 新提交

Networks of agglomeration: how population density rewires social networks and reshapes contagion dynamics

集聚网络:人口密度如何重塑社交网络并改变传染动态

Christopher K. Tokita

AI总结 通过最小主体模型,发现人口密度单独变化即可重构社交网络结构,稀疏人口形成局部集群,密集人口形成全局集成网络,并影响简单与复杂传染的传播速度与范围。

Comments Main text: 12 pages with 5 figures. Attached Supplemental Text: 3 pages with 5 figures

详情
AI中文摘要

从古代美索不达米亚到现代城市,密集的人类定居点伴随着经济生产力、文化创新和社会变革的爆发。但是,将人们更紧密地聚集在一起如何改变社会组织,从而重塑集体结果?在这里,我使用一个最小主体模型来隔离人口密度的影响,保持人口规模和个人行为不变,仅改变个体在空间中的接近程度。在模型中,个体逐渐形成社会联系,偏好附近的人以及已经联系良好的人。在这些简单规则下,仅改变人口密度就足以重组社交网络结构:稀疏人口形成局部集群社区,而密集人口形成全局集成网络,具有更短的社会距离和紧密互联的流行个体核心。这种结构转变在狭窄的密度范围内急剧发生,并由物理接近性还是社会流行性主导联系形成决定。在这些网络上模拟传染揭示,这种转变的后果取决于传播的内容。简单传染(例如,信息或疾病)在密集人口中更快地到达大多数个体。复杂传染(例如,社会规范或集体行为)不会传播得更快,但随着密度增加,实现更广泛和更可靠的采纳。总之,这些结果表明,人口密度可以作为一种结构性力量,独立于通常用来解释城市为何是变革引擎的经济和行为机制。

英文摘要

From ancient Mesopotamia to modern cities, dense human settlements coincide with bursts of economic productivity, cultural innovation, and social change. But how does packing people more tightly together alter social organization in ways that reshape collective outcomes? Here, I use a minimal agent-based model to isolate the effect of population density, holding population size and individual behavior fixed while varying only how closely individuals are placed in space. In the model, individuals form social ties gradually, favoring those nearby and those already well-connected. Under these simple rules, varying population density alone is sufficient to reorganize social network structure: sparse populations develop locally clustered communities, while denser ones form globally integrated networks with shorter social distances and a tightly interconnected core of popular individuals. This structural transition occurs sharply over a narrow range of densities and is governed by whether physical proximity or social popularity dominates tie formation. Simulating contagions on these networks reveals that the consequences of this shift depend on what is spreading. Simple contagions (e.g., information or disease) reach a majority of individuals more quickly in denser populations. Complex contagions (e.g., social norms or collective behaviors) do not spread faster, but instead achieve broader and more reliable adoption as density increases. Together, these results show that population density can act as a structural force independent of the economic and behavioral mechanisms typically invoked to explain why cities are engines of change.

2606.20060 2026-06-19 nlin.AO cs.SY eess.SY 新提交

Nodal Braess's Paradox and Inertia Destabilization with Dynamic Node and Line Failures in Power Grids

电网中动态节点与线路故障的节点Braess悖论与惯性失稳

Nubius Brandner, Frank Hellmann, Hans Würfel, Jürgen Kurths, Anton Plietzsch, Anna Büttner

AI总结 提出集成节点/线路故障与同步振荡器动力学的新模型,发现高惯性和节点鲁棒性增强可能反常地扩大级联规模,揭示新型Braess悖论。

详情
AI中文摘要

大规模停电通常由级联故障引起。这些故障通过网络动力学与单个组件故障之间的复杂相互作用动态展开。相比之下,物理学中对级联故障的研究集中在准静态状态下分析线路过载。我们引入了一个新模型,将节点和线路故障的动力学与电网同步的典型振荡器模型相结合。这使我们能够首次研究耦合故障的集体级联行为。我们研究了节点鲁棒性(节点承受瞬态扰动的能力)和惯性(节点抵抗频率偏差的能力)对级联规模的影响。我们发现了驱动系统脆弱性的两种新机制:i) 虽然低惯性被广泛认为是电网的主要挑战,但我们发现高惯性会放大级联规模,除非伴随其他动力学特性的适当调整。ii) 此外,我们发现单个节点鲁棒性的增强可能反常地导致更大的级联。后一种现象构成了一种新型的Braess悖论。理解这种反直觉的集体效应对于实现有弹性的未来电网可能至关重要。

英文摘要

Large-scale power outages are typically caused by cascading failures. These unfold dynamically through complex interactions between network dynamics and individual component failures. In contrast, the study of cascading failures in physics has focused on analyzing line overloads in the quasi-static regime. We introduce a new model that integrates the dynamics of node and line failures with a paradigmatic oscillator model for power grid synchronization. This enables us to investigate the collective cascading behavior of coupled failures for the first time. We study the impact of nodal robustness, the ability of nodes to tolerate transient disturbances, and inertia, the ability of nodes to resist frequency deviations, on cascade sizes. We discover two novel mechanisms driving system fragility: i) While low inertia is widely considered a major challenge for power grids, we find that high inertia can amplify cascade sizes unless accompanied by appropriate adjustments of other dynamical properties. ii) Further, we find that an increase in the robustness of individual nodes can paradoxically lead to larger cascades. This latter phenomenon constitutes a novel type of Braess's paradox. Understanding such counterintuitive collective effects may become central for achieving resilient future power grids.

2606.19811 2026-06-19 math.NA cs.NA math-ph math.MP 新提交

Second order explicit splitting scheme for fluid-poroelastic structure interaction problems

流体-多孔弹性结构相互作用问题的二阶显式分裂格式

Yifan Wang, Jeonghun Lee, Suncica Canic

AI总结 针对固定域上时变Stokes-Biot问题,提出结合BDF2时间步进与二阶Adams-Bashforth界面外推的显式分裂格式,在抛物线CFL条件下证明稳定性,并通过投影框架导出先验误差估计,数值实验验证二阶时间收敛和最优空间收敛。

详情
AI中文摘要

高效的且可证明精确的流体-多孔弹性结构相互作用的分区方法仍然具有挑战性,因为Stokes-Biot界面耦合条件的显式处理可能损害稳定性。本文针对固定域上的时变Stokes-Biot问题,开发并分析了一个全离散、二阶、显式分裂格式。该方法将BDF2时间步进与通过Robin重构的界面数据的二阶Adams-Bashforth外推相结合,得到一个分区算法,其中Stokes和Biot子问题在每个时间步独立并行求解。主要分析贡献在于对该二阶显式耦合策略进行了严格的稳定性和误差分析。利用BDF2能量恒等式、外推界面项的尖锐分解以及离散迹估计,我们在抛物线CFL条件下证明了封闭的稳定性界。然后通过基于投影的框架,使用流体变量的Fortin投影和多孔弹性变量的Ritz型投影,导出了先验误差估计。分析识别了来自BDF2时间离散、Adams-Bashforth界面外推以及投影运动学关系的一致性缺陷。结果表明,在整体能量范数下,流体速度、结构速度、孔隙压力和弹性位移的总误差由C乘以网格尺寸的k次幂(k从1到3)与时间步长的平方之和界定。使用制造解的数值实验证实了二阶时间收敛和最优阶空间收敛。我们还包含了一个具有Navier-Stokes流体流动的移动域示例,展示了超出所分析的固定域Stokes-Biot设置的适用性。

英文摘要

Efficient and provably accurate partitioned methods for fluid-poroelastic structure interaction remain challenging because explicit treatment of the Stokes-Biot interface coupling condition can compromise stability. In this work, we develop and analyze a fully discrete, second-order, explicit splitting scheme for the time-dependent Stokes-Biot problem on fixed domains. The method combines BDF2 time stepping with second-order Adams-Bashforth extrapolation of interface data through a Robin reformulation, yielding a partitioned algorithm in which the Stokes and Biot subproblems are solved independently and in parallel at each time step. The main analytical contribution is a rigorous stability and error analysis for this second-order explicit coupling strategy. Using BDF2 energy identities, a sharp decomposition of the extrapolated interface terms, and discrete trace estimates, we prove a closed stability bound under a parabolic CFL condition. We then derive an a priori error estimate through a projection-based framework using a Fortin projection for the fluid variables and Ritz-type projections for the poroelastic variables. The analysis identifies consistency defects from BDF2 time discretization, Adams-Bashforth interface extrapolation, and the projected kinematic relation. It shows that the total errors in fluid velocity, structure velocity, pore pressure, and elastic displacement are bounded by C times the sum of the kth power of the mesh size and the square of the time step, for k from 1 to 3, in bulk energy norms. Numerical experiments with manufactured solutions confirm second-order temporal convergence and optimal-order spatial convergence. We also include a moving-domain example with Navier-Stokes fluid flow, demonstrating applicability beyond the fixed-domain Stokes-Biot setting analyzed.

2606.19493 2026-06-19 cs.IT math-ph math.IT math.MP quant-ph 新提交

Ricci flow for the Bures--Helstrom qubit metric

Bures-Helstrom 量子比特度量的 Ricci 流

Andrew Lesniewski

AI总结 本文显式描述了量子比特态空间上Bures-Helstrom度量的Ricci流,发现该度量是爱因斯坦度量,几何流为同伦收缩,并给出了归一化流的线性化谱。

Comments 14 pages

详情
AI中文摘要

Bures-Helstrom度量是量子比特态空间上最小的单调黎曼度量。采用本文的量子Fisher归一化后,它将Bloch球与单位圆三-球面的测地半球等同起来。我们显式地描述了其Ricci流。在一般旋转对称规范下,该流是径向间隔和扭曲因子的耦合系统;只有在Hamilton-DeTurck规范选择后才出现单个标量方程。在相应的移动DeTurck标架中,平方扭曲函数$\Psi=\Phi^2$满足线性受迫热方程\begin{equation*} D_t\Psi=\Psi_{ss}-2, \end{equation*}而固定间隔坐标形式包含相关的输运项。由于Bures-Helstrom度量是爱因斯坦度量,几何流本身是同伦收缩\begin{equation*} g(t)=(1-4t)g_{\mathrm{BH}}, \end{equation*}标量曲率为$6/(1-4t)$,灭绝时间$T=1/4$。因此,该度量对所有$t<T$保持在单调锥内,并仅在塌缩极限下离开非退化黎曼度量锥。我们还记录了体积归一化流,其中Bures-Helstrom度量是一个不动点。其线性化是平移后的圆三-球面拉普拉斯算子$\Delta_{\mathbb S^3}+3$,谱为\begin{equation*} \sigma_\ell=-(\ell-1)(\ell+3), \end{equation*}去除缩放模式后的谱隙为$5$。

英文摘要

The Bures--Helstrom metric is the minimal monotone Riemannian metric on the state space of a qubit. With the quantum Fisher normalization used here, it identifies the Bloch ball with a geodesic hemisphere of the unit round three--sphere. We describe its Ricci flow explicitly. In a general rotationally symmetric gauge the flow is a coupled system for the radial lapse and warping factor; a single scalar equation appears only after a Hamilton--DeTurck gauge choice. In the corresponding moving DeTurck frame the squared warping function $Ψ=Φ^2$ satisfies the linear forced heat equation \begin{equation*} D_tΨ=Ψ_{ss}-2, \end{equation*} while the fixed-lapse coordinate form contains the associated transport term. Since the Bures--Helstrom metric is Einstein, the geometric flow itself is the homothetic shrinker \begin{equation*} g(t)=(1-4t)g_{\mathrm{BH}}, \end{equation*} with scalar curvature $6/(1-4t)$ and extinction time $T=1/4$. Thus the metric remains inside the monotone cone for all $t<T$ and leaves the cone of nondegenerate Riemannian metrics only through the collapsed limit. We also record the volume--normalized flow, for which the Bures--Helstrom metric is a fixed point. Its linearization is the shifted round--sphere Laplacian $Δ_{\mathbb S^3}+3$, with spectrum \begin{equation*} σ_\ell=-(\ell-1)(\ell+3), \end{equation*} and spectral gap $5$ after removal of the scaling mode.

2606.20299 2026-06-19 stat.ML cs.LG hep-ph physics.data-an 新提交

Statistical Properties of Training & Generalization

训练与泛化的统计特性

Itay Lavie, Noam Levi, Yonatan Kahn

AI总结 从物理学角度研究深度学习的关键特征和意外现象,回顾神经缩放定律及其与物理问题中约束和归纳偏置的相互作用。

Comments 32 pages, 3 figures. Part of the VERaiPHY initiative

详情
AI中文摘要

深度学习成功规避了经典统计学的众多直觉,在多个现实任务中取得了前所未有的性能。本文从物理学角度研究深度学习的关键特征和意外现象,注意指出并尽可能证明构建深度学习模型时固有的多种选择。特别地,我们回顾了神经缩放定律的现象,并讨论了它们与在物理问题中应用机器学习时可能存在的约束和归纳偏置之间的相互作用。

英文摘要

Deep learning has managed to evade numerous intuitions from classical statistics to achieve unprecedented performance on a number of real-world tasks. In this article, we investigate the key features and surprises of deep learning from a physics-informed perspective, taking care to point out and justify where possible the many choices inherent in constructing a deep learning model. In particular, we review the phenomenon of neural scaling laws and discuss their interplay with the constraints and inductive biases which may be present when applying machine learning to problems in physics.

2606.20437 2026-06-19 hep-ex cs.LG 新提交

HEPTv2: End-to-End Efficient Point Transformer for Charged Particle Reconstruction

HEPTv2:用于带电粒子重建的端到端高效点变换器

Siqi Miao, Shitij Govil, Jack P. Rodgers, Mia Liu, Javier Duarte, Shih-Chieh Hsu, Yuan-Tang Chou, Pan Li

AI总结 提出HEPTv2,一种端到端点变换器架构,通过局部敏感哈希编码和扇区化解码,无需图构建即可从探测器击中点直接重建粒子轨迹,在TrackML上以0.8%假率实现98.6%追踪效率,延迟仅15ms。

详情
AI中文摘要

带电粒子追踪——从稀疏探测器测量中重建轨迹——是一个基础的高能物理推理问题,也是在极端组合歧义下学习的典型例子。在高亮度大型强子对撞机(HL-LHC)上,尽管碰撞密度前所未有,追踪必须保持准确和高效。图神经网络表现强劲,但图构建和处理带来了大量成本,而基于变换器的方法依赖辅助阶段,阻碍了端到端优化。为解决这一问题,我们提出了HEPTv2,一种端到端点变换器架构,在一个可训练管道中从探测器击中点重建轨迹。HEPTv2结合了局部感知点编码器和轨迹解码器,无需图构建、聚类或过滤即可预测完整轨迹。编码器在探测器坐标空间中使用局部敏感哈希,以保留追踪相关几何结构,同时实现高效的局部注意力。解码器通过扇区化解码和联合编码器-解码器监督下的直接击中到轨迹预测来消除歧义,使整个管道能够端到端优化。在TrackML上,HEPTv2以0.8%的假率实现了98.6%的双多数追踪效率,同时在NVIDIA A100 GPU上每个事件仅需约15毫秒推理时间和0.4 GB峰值内存。对于最多包含$5\ imes10^5$个击中点的事件,延迟和内存大致线性扩展。HEPTv2在精度-延迟权衡中建立了新的最先进水平,相比之前最强的变换器效率提升4.5%,相比优化的基于图管道提升1.1-2.2%,同时延迟分别降低7倍和38-52倍。这些结果表明,端到端变换器能够提供HL-LHC实时粒子重建所需的精度和效率。

英文摘要

Charged-particle tracking -- reconstructing trajectories from sparse detector measurements -- is a fundamental high-energy-physics inference problem and a canonical example of learning under extreme combinatorial ambiguity. At the High-Luminosity Large Hadron Collider (HL-LHC), tracking must remain accurate and efficient despite unprecedented collision densities. Graph neural networks perform strongly, but incur substantial costs from graph construction and processing, while transformer-based approaches rely on auxiliary stages that prevent end-to-end optimization. To address this, we present HEPTv2, an end-to-end point-transformer architecture that reconstructs tracks from detector hits in one trainable pipeline. HEPTv2 combines a locality-aware point encoder with a track decoder that predicts complete trajectories without graph-building, clustering, or filtering. The encoder uses locality-sensitive hashing in detector coordinate space to preserve tracking-relevant geometry while enabling efficient local attention. The decoder resolves ambiguities through sectorized decoding and direct hit-to-track prediction under joint encoder-decoder supervision, allowing the full pipeline to be optimized end-to-end. On TrackML, HEPTv2 achieves 98.6% double-majority tracking efficiency at a 0.8% fake rate, while requiring only $\sim$15~ms inference time and 0.4~GB peak memory per event on a NVIDIA A100 GPU. Latency and memory scale approximately linearly for events with up to $5\times10^5$ hits. HEPTv2 establishes a new state of the art in the accuracy-latency trade-off, improving efficiency by 4.5% over the strongest prior transformer and by 1.1--2.2% over optimized graph-based pipelines, while reducing latency by factors of 7 and 38--52, respectively. These results show end-to-end transformers can deliver the accuracy and efficiency required for real-time particle reconstruction at the HL-LHC.

2606.19781 2026-06-19 hep-ex cs.AI 新提交

Towards Engineering Scaling Laws with Pretraining Data Composition

迈向基于预训练数据组成的工程化缩放定律

Jan-Lucas Uslu, Kevin Greif, Daniel Whiteson, Benjamin Nachman

AI总结 研究通过工程化预训练数据组成(增加多样性和与下游任务的对齐)来改变粒子物理中神经网络的缩放行为,使其更偏向数据扩展而非模型扩展。

详情
AI中文摘要

神经缩放定律描述了模型性能如何随计算量、模型大小和数据集大小呈幂律提升。虽然这些关系在大型语言模型中已得到充分验证,但在粒子物理学的大型模型中正在出现。与语言类似,实证研究表明性能呈幂律缩放。然而,与自然语言或图像领域不同,基础物理学拥有高保真模拟器,可以廉价地生成合成数据。这有利于缩放机制中额外数据比额外参数更便宜,并允许预训练数据集本身被工程化以影响缩放。对于高能粒子束碰撞中产生的强子喷注分类任务,我们表明,通过包含更多样化且与下游分类任务更对齐的预训练数据,可以工程化缩放行为,使其需要更多数据而非更大模型。

英文摘要

Neural scaling laws describe how model performance improves as a power law in compute, model size, and dataset size. While well-established for large language models, these relationships are emerging for large models in particle physics. As with language, empirical studies show that the performance scales as a power law. However, unlike natural language or image domains, fundamental physics has high-fidelity simulators that produce synthetic data cheaply. This favors scaling regimes where additional data is cheaper than additional parameters, and allows the pretraining dataset itself to be engineered to influence the scaling. For the task of classifying hadronic jets produced in collisions of high-energy particle beams, we show that the scaling behavior can be engineered towards requiring more data rather than larger models by inclusion of pretraining data which is more diverse and better aligned with the downstream classification task.

2606.20497 2026-06-19 cs.CE cond-mat.mtrl-sci 新提交

Interpretable Meta-Learning for Multi-Objective Chemical Search

可解释的元学习用于多目标化学搜索

Antonio Varagnolo, Yulia Pimonova, Michael G. Taylor, Raphaël Pestourie, Nicholas E. Lubbers

AI总结 提出结合可解释线性元学习与自适应置信度不确定性的模块化流水线,在多目标分子发现中首次应用线性元学习,在自旋交叉金属有机配合物搜索中Pareto性能提升78%。

Comments LA-UR-26-24964

详情
AI中文摘要

导航合成可访问分子的广阔空间需要能够同时处理多个竞争目标的、可解释的代理模型。在量子级化学的计算约束下,深度学习方法难以满足这些要求。这里,我们引入了一个模块化流水线,将可解释的线性元学习模型和自适应置信度不确定性量化结合到高效全局优化(EGO)框架中,用于多目标分子发现。首次在多目标化学搜索环境中部署线性元学习:通过跨化学目标和廉价辅助属性进行训练,元学习代理获得了可迁移的化学知识,能够从有限数据中快速适应新目标。在真实的大规模自旋交叉金属有机配合物搜索中进行的实证评估显示,基线在Pareto意义上比元学习替代方案差78%。我们还解决了主动搜索固有的校准挑战。由于最优候选通常位于分布尾部,标准不确定性估计失效。我们引入了一种自适应置信度调优算法,该算法随着分子搜索的进行动态重新校准探索-利用权衡。实证表明,动态置信度调优进一步主导了超过50%的静态校准前沿。

英文摘要

Navigating the vast space of synthetically accessible molecules demands surrogate models that are interpretable and capable of handling multiple competing objectives at the same time. Deep learning approaches struggle to satisfy them under the computational constraints of quantum-level chemistry. Here, we introduce a modular pipeline that combines interpretable linear meta-learning models and adaptive-confidence uncertainty quantification into an Efficient Global Optimization (EGO) framework for multi-objective molecular discovery. For the first time, linear meta-learning is deployed in a multi-objective chemical search setting: by training across chemical objectives and cheap auxiliary properties, the meta-learned surrogates acquire transferable chemical knowledge that adapts rapidly to new objectives from limited data. Evaluated empirically on a real large scale search for spin-crossover metal-organic complexes, the baseline performs 78% worse in Pareto sense than the meta-learning alternative. We also address the calibration challenges inherent to active search. Since optimal candidates typically lie precisely in the distributional tails, standard uncertainty estimates fail. We introduce an adaptive confidence-tuning algorithm that dynamically recalibrates the exploration-exploitation trade-off as the molecular search evolves. Empirically, dynamic confidence tuning further dominates over 50% of the statically calibrated front.

2606.20347 2026-06-19 cs.LG cond-mat.dis-nn 新提交

Critical Percolation as a Synthetic Data Model for Interpretability

临界渗流作为可解释性的合成数据模型

Aryeh Brill, Tom Ingebretsen Carlson

AI总结 提出基于临界平均场渗流簇的层次函数合成数据集,具有稀疏、分形和幂律分布特性,支持几乎线性时间算法生成任意规模数据,可用于评估可解释性方法。

Comments 21 pages, 10 figures, accepted to the Mechanistic Interpretability Workshop at ICML 2026

详情
AI中文摘要

神经网络学习反映自然数据层次化、多尺度结构的特征。用于评估可解释性方法的合成数据集通常缺乏这种结构,限制了其作为现实玩具模型的价值。为弥补这一差距,我们引入了一系列合成数据集,由定义在高维数据空间中嵌入的临界平均场渗流簇上的层次函数组成。渗流数据由稀疏、低维的分形簇组成,具有幂律大小分布。模拟分类层次结构的潜变量生成每个数据点的目标值。该数据模型在分析上易于处理,具有已知的临界指数,无需超参数调整即可固定其属性。我们利用渗流簇、随机树和加法凝聚之间的映射,提出了一种几乎线性时间的算法,用于联合采样随机树及其层次潜变量分解,从而能够生成任意规模的数据。通过探测实验,我们发现模型的地面真值潜变量可以从神经网络激活中线性解码。稀疏性、自相似性、幂律统计和分析可处理性共同使临界渗流成为可解释性研究的原理性测试平台。

英文摘要

Neural networks learn features that reflect the hierarchical, multi-scale structure of natural data. Synthetic datasets used to evaluate interpretability methods typically lack this structure, limiting their value as realistic toy models. To close this gap, we introduce a family of synthetic datasets consisting of hierarchical functions defined on critical mean-field percolation clusters embedded in a high-dimensional data space. The percolation data consists of sparse, low-dimensional fractal clusters with a power-law size distribution. Latent variables modeling a taxonomic hierarchy generate each data point's target value. The data model is analytically tractable with known critical exponents that fix its properties without requiring hyperparameter tuning. We leverage a mapping between percolation clusters, random trees, and additive coalescence to propose an almost linear-time algorithm to jointly sample a random tree and its hierarchical latent decomposition, enabling data generation at arbitrary scale. Using probing experiments, we find that the model's ground-truth latent variables can be linearly decoded from neural network activations. Together, sparsity, self-similarity, power-law statistics, and analytical tractability make critical percolation a principled testbed for interpretability research.

2606.20231 2026-06-19 cs.AI cond-mat.stat-mech cs.IT math-ph math.IT math.MP nlin.AO 新提交

Thermodynamic Measure of Intelligence

智能的热力学度量

Ishanu Chattopadhyay

发表机构 * Institute for Biomedical Informatics, University of Kentucky(肯塔基大学生物医学信息学研究所) Department of Computer Science, University of Kentucky(肯塔基大学计算机科学系)

AI总结 提出智能是稀有但有效未来的合法放大,通过递归自模拟实现,并给出热力学度量,证明该结构对高智能必要且近乎充分。

详情
AI中文摘要

智能可以被度量吗?我们提出智能可以定义为稀有但有效未来的合法放大:一个系统增加那些在被动动力学下不太可能但在领域约束下仍然可允许的结果的概率。我们从智能系统必须建模世界及其自身在其中的位置这一前提开始。由于系统是其建模世界的一部分,这自然导致递归自模拟:系统表示其自身动作是轨迹一部分的未来。我们的核心结果给出了一个必要性陈述和一个条件性近乎充分性陈述,将该架构与稀有-有效未来的合法放大的精确热力学度量联系起来:高稀有-有效提升是不可能的,除非内部模拟以高保真度识别稀有-有效未来;反之,当稀有-有效保真度高且模拟包含有效策略时,可实现的提升接近受驱动限制的最优值。因此,递归自模拟不仅是智能的一个合理特征,而且在所述假设下,对于高热力学智能是必要且近乎充分的。由此产生的框架使智能在通用尺度上可度量,从被动物质和反馈控制器、大型语言模型、作为文本生成器的人类到麦克斯韦妖式信息引擎。

英文摘要

Can intelligence be measured? We propose that intelligence can be defined as the lawful amplification of rare but valid futures: a system increases the probability of outcomes that would be unlikely under passive dynamics but remain admissible under the constraints of the domain. We start with the premise that an intelligent system must model the world and its own place within it. Because the system is part of the world it models, this leads naturally to recursive self-simulation: the system represents futures in which its own actions are part of the trajectory. Our central results give a necessity statement and a conditional near-sufficiency statement connecting this architecture to a precise thermodynamic measure of lawful amplification of rare-valid futures: high rare-valid lift is impossible unless the internal simulation identifies rare-valid futures with high fidelity; conversely, when rare-valid fidelity is high and the simulation contains an effective policy, the achievable lift approaches the actuation-limited optimum. Thus recursive self-simulation is not merely a plausible feature of intelligence but, under the stated assumptions, is necessary and nearly sufficient for high thermodynamic intelligence. The resulting framework makes intelligence measurable on a universal scale, from passive matter and feedback controllers, large language models, and humans as text generators to Maxwell-demon-like information engines.

2606.19471 2026-06-19 math.NA cond-mat.mtrl-sci cs.NA math.FA physics.chem-ph 新提交

Moreau-Yosida-based Kohn-Sham Inversion for Periodic Systems

基于Moreau-Yosida的周期系统Kohn-Sham反演

Vebjørn H. Bakkestuen, Michael F. Herbst, Vegard Falmår, Markus Penz, Andre Laestadius

AI总结 本文在Moreau-Yosida正则化密度泛函理论框架下,理论并数值研究了周期系统的密度-势反演,通过极限过程恢复Kohn-Sham交换关联势,并证明了非相互作用动能泛函的下半连续性。

详情
AI中文摘要

在Moreau-Yosida正则化密度泛函理论框架下,从理论和数值上研究了周期系统的密度-势反演。我们在周期齐次Sobolev空间中建立该框架,并通过极限过程恢复Kohn-Sham理论的交换关联势。一个关键的分析要素是证明非相互作用动能泛函在所选拓扑中的下半连续性。近端映射及其算法评估在所得反演方案中起核心作用。数值实验展示了该方法对Kohn-Sham方程和Gross-Pitaevskii方程的性能和特性。

英文摘要

Density-potential inversion for periodic systems within Moreau-Yosida-regularised density-functional theory is investigated, both theoretically and numerically. We develop the framework in a periodic homogeneous Sobolev space and use it to recover the exchange-correlation potential of Kohn-Sham theory through a limiting procedure. A key analytical ingredient is the proof of lower semicontinuity of the non-interacting kinetic-energy functional in the chosen topology. The proximal mapping, together with its algorithmic evaluation, plays a central role in the resulting inversion scheme. Numerical experiments illustrate the performance and properties of the method for both the Kohn-Sham and Gross-Pitaevskii equations.

2606.19378 2026-06-19 cs.LG cond-mat.mtrl-sci 新提交

A Hybrid GNN-FEM Framework for Phase-Field Fracture Simulation. Physics-Preserving Hybridization for Generalizable Surrogate Modeling

一种用于相场断裂模拟的混合GNN-FEM框架:面向通用代理模型的物理保持混合方法

Hyeonbin Moon, Yongjin Choi, Seunghwa Ryu

发表机构 * KAIST(韩国科学技术院)

AI总结 提出混合GNN-FEM框架,用图神经网络替代相场更新步骤,保留FEM位移求解器,通过无量纲特征设计和物理信息损失实现跨几何、载荷、材料和离散化的通用断裂模拟,降低计算成本并保持精度。

Comments 46 pages

详情
AI中文摘要

科学机器学习(SciML)已成为加速复杂物理系统模拟的一种有前景的方法,但对于非线性、历史依赖问题实现物理一致且可泛化的预测仍然是一个核心挑战。在本研究中,我们提出了一种混合GNN-FEM框架,用于高效且可泛化的相场断裂建模。虽然相场方法为模拟复杂裂纹演化提供了稳健的变分框架,但其高计算成本限制了实际应用,因为需要在增量有限元过程中求解耦合、非线性和历史依赖的系统。为应对这一挑战,我们将图神经网络代理集成到传统的交错方案中,在每个载荷增量下替代相场更新,同时保留基于FEM的位移求解器以强制执行力学平衡和边界条件。通过保留增量求解结构,该框架与历史依赖的断裂演化保持一致,而无需代理近似整个解轨迹。这种选择性代理策略强调识别物理上有意义且增量结构化的学习目标,而非依赖暴力数据生成来学习整个断裂过程。所提出的框架通过无量纲特征设计、基于网格域的图公式以及源自控制相场方程的物理信息损失,实现了跨不同几何、载荷条件、材料属性和离散化的强泛化能力。数值实验表明,与传统FEM相比,该混合方法在保持精度的同时降低了计算成本,并在多种问题设置下展现出稳健的预测性能。

英文摘要

Scientific machine learning (SciML) has emerged as a promising approach for accelerating simulations of complex physical systems, yet achieving physically consistent and generalizable predictions for nonlinear, history-dependent problems remains a central challenge. In this study, we propose a hybrid GNN--FEM framework for efficient and generalizable phase-field fracture modeling. While phase-field approaches provide a robust variational framework for simulating complex crack evolution, their high computational cost limits practical applications because they require solving coupled, nonlinear, and history-dependent systems within an incremental finite element procedure. To address this challenge, a graph neural network surrogate is integrated into the conventional staggered scheme, replacing the phase-field update at each load increment while retaining the FEM-based displacement solver to enforce mechanical equilibrium and boundary conditions. By preserving the incremental solution structure, the framework remains consistent with history-dependent fracture evolution without requiring the surrogate to approximate the full solution trajectory. This selective surrogate strategy emphasizes the identification of a physically meaningful and incrementally structured learning target, rather than relying on brute-force data generation to learn the full fracture process. The proposed framework achieves strong generalization across varying geometries, loading conditions, material properties, and discretizations through dimensionless feature design, a graph-based formulation on mesh-based domains, and a physics-informed loss derived from the governing phase-field equation. Numerical experiments demonstrate that the hybrid approach reduces computational cost while maintaining accuracy compared with conventional FEM, and exhibits robust predictive performance across diverse problem settings.

2606.19375 2026-06-19 cs.LG cond-mat.mtrl-sci 新提交

Physics-Informed Discovery of Yield Functions in Plasticity via Convex Neural Representations

基于凸神经表示的塑性屈服函数物理信息发现

Hyeonbin Moon, Donghyuk Cho, Jecheon Yu, Jeong Whan Yoon, Seunghwa Ryu

发表机构 * KAIST(韩国科学技术院)

AI总结 提出一种物理信息框架,从全场位移和反力数据中自动发现各向异性屈服函数,无需应力观测或预设参数形式,采用凸神经网络表示并嵌入弹塑性应力积分中训练。

Comments 39 pages

详情
AI中文摘要

识别各向异性屈服函数仍然具有挑战性,因为屈服在全场力学测量中无法直接观测,方向标定可能需要多个加载方向,且选择合适的解析形式并非易事。本研究提出一种物理信息框架,用于从全场位移数据和反力数据中发现屈服函数,无需应力观测、塑性应变测量、直接屈服面数据或预设的参数化屈服函数。该框架将屈服函数识别为弹塑性应力积分中受力学约束的本构组成部分,而非通过直接的应力空间监督。屈服函数由凸神经网络表示,该网络强制执行凸性和一次正齐次性,同时施加假定的拉压对称性,并通过可微应力更新和跨多个加载工况的物理信息力平衡损失来训练该神经屈服函数。使用von Mises、Hill 1948和Yld2000-2d屈服函数的有限元基准研究验证了所提框架,评估了屈服轮廓一致性、位移噪声敏感性、通过塑性活跃应力状态的可识别性、认知不确定性和多项式代理部署。本研究提供了一条受力学约束的路径,用于从位移和力数据中发现各向异性屈服函数,同时将识别出的组件保留在弹塑性应力积分的结构内。

英文摘要

Identifying anisotropic yield functions remains challenging since yielding is not directly observed in full-field mechanical measurements, directional calibration can require many loading directions, and selecting an appropriate analytical form is nontrivial. This study proposes a physics-informed framework for discovering yield functions from full-field displacement data and reaction force data, without stress observations, plastic strain measurements, direct yield surface data, or a prescribed parametric yield function. The framework identifies the yield function as a mechanically constrained constitutive component inside elastoplastic stress integration, rather than through direct stress-space supervision. The yield function is represented by a convex neural network that enforces convexity and positive homogeneity of degree one while imposing the assumed tension-compression symmetry, and this neural yield function is trained with a differentiable stress update and a physics-informed force equilibrium loss across multiple loading cases. The proposed framework is validated using finite element (FE) benchmark studies with von Mises, Hill 1948, and Yld2000-2d yield functions, assessing yield contour agreement, displacement-noise sensitivity, identifiability through plastically active stress states, epistemic uncertainty, and polynomial-surrogate deployment. This study provides a mechanics-constrained pathway for discovering anisotropic yield functions from displacement and force data while keeping the identified component within the structure of elastoplastic stress integration.

2606.20253 2026-06-19 cond-mat.mtrl-sci cs.CE 新提交

On representation of macroscopic crack in periodic fine-scale discrete mechanical models

关于周期性细观离散力学模型中宏观裂纹的表征

Jan Raisinger, Jan Eliáš

AI总结 针对异质软化材料多尺度模拟中边界条件影响应变局部化的问题,评估了新型边界条件(如镶嵌、渗流路径对齐及带位移跳跃的球形周期边界)在细观离散粒子模型中的适用性,发现镶嵌边界条件能稳定产生由模型几何唯一确定的局部化带。

Comments 28 pages, 20 figures

详情
AI中文摘要

在异质软化材料的多尺度建模中,细观模型的边界条件强烈影响应变局部化模式和宏观响应。对于直线型模型(如正方形或立方体),当局部化带倾角与周期方向不匹配时,标准周期边界条件会产生人为的延性行为并导致过度能量耗散。最近提出的镶嵌和渗流路径对齐边界条件通过调整周期框架以与演化的局部化带对齐,有望解决这一问题。另外,球形/圆形模型通过设计提供与方向无关的响应。不幸的是,标准周期边界条件不允许在球形模型边界上形成适当的局部化带交叉。最近的一项修改通过在球形周期边界条件中添加位移跳跃来解决这一问题。本研究评估了这些新型边界条件在混凝土细观离散粒子模型中的适用性。分析了不同加载方向下受单轴拉伸的二维正方形和圆形模型,并将所选方法扩展到三维立方体模型。结果表明,渗流路径对齐边界条件存在主要缺陷:由于两个边界部分的应变不均匀,可能导致多条局部化带,且其弱约束部分容易产生虚假应变局部化。相比之下,镶嵌边界条件始终产生定义明确的局部化带,其长度仅由模型几何决定,使得在后处理中易于考虑。对圆形模型应用带位移跳跃的周期边界条件有时会错误地产生与标准周期边界条件相似的裂纹模式。

英文摘要

In multiscale modeling of heterogeneous softening materials, boundary conditions (BC) in the fine-scale model strongly influence the strain localization pattern and the macroscopic response. For rectilinear models (e.g., squares or cubes), standard Periodic BCs produce artificially ductile behavior with excessive energy dissipation when the localization band inclination does not match the periodicity directions. Recently proposed Tessellation and Percolation-path-aligned BCs promise to address this by adapting the periodicity frame to align with the evolving localization bands. Alternatively, spherical/circular models provide an orientation independent response by design. Unfortunately, the standard Periodic BCs do not allow development of proper localization band crossing spherical model's boundaries. A recently proposed modification addresses this by adding a displacement jump to the spherical periodic BCs. This study evaluates the applicability of these novel BCs to a mesoscale discrete particle model of concrete. Two-dimensional square and circular models under uniaxial tension with different loading directions are analyzed, with the selected approaches extended to three-dimensional cube models. Results show that Percolation-path-aligned BCs exhibit major shortcomings: they can lead to multiple localization bands due to uneven straining of the two boundary sections and their weakly constrained section can be prone to spurious strain localization. In contrast, Tessellation BCs consistently yield a well-defined localization band, whose length is determined solely by the model geometry, making it straightforward to account for in post-processing. Periodic boundary conditions augmented with a displacement jump applied to a circular model sometimes incorrect produce crack patterns similar to those under the standard Periodic BCs.