arXivDaily arXiv每日学术速递 周一至周五更新
重置
physics.chem-ph化学物理15
2606.12326 2026-06-11 physics.chem-ph 新提交

Transferable Machine Learning of Electronic Hamiltonians with Superposition-of-Atomic-Potentials Features

基于原子势叠加特征的可迁移电子哈密顿量机器学习

Chaoqun Zhang, Christian Venturella, Enzhi Chen, Tianyu Zhu

AI总结 提出基于原子势叠加(SAP)近似的哈密顿量学习框架,结合对称性适配的原子轨道学习基和轨道图神经网络预测Kohn-Sham Fock矩阵,并通过降维方案扩展到大基组,在QM9和有机电荷传输材料中实现高精度可迁移预测。

详情
AI中文摘要

电子哈密顿量的机器学习为电子波函数和物理可观测量提供了一条统一途径。我们引入了一个哈密顿量学习框架,该框架基于从原子势叠加(SAP)近似导出的电子特征,这是一种有效的自洽场初始猜测,能够捕获基本的电子-电子屏蔽效应。SAP量定义了一个对称性适配的本征原子轨道学习基,并为基于轨道的图神经网络提供物理信息输入,该网络预测收敛的Kohn-Sham Fock矩阵。为了将方法扩展到更大的基组,我们进一步开发了一个降维方案,从最小基组特征预测大基组电子结构。在QM9数据集上,该模型准确再现了前沿和核心轨道能量、偶极矩以及完整态密度。对于有机电荷传输材料,它产生了苯、四氰基对苯二醌二甲烷(TCNQ)和四硫富瓦烯(TTF)二聚体的精确分子间转移积分,并迁移到未见过的取代苯异质二聚体,平均绝对误差为4.8 meV。这些结果确立了基于SAP的电子哈密顿量机器学习作为高通量电子结构预测的可迁移且可扩展的工具。

英文摘要

Machine learning (ML) of electronic Hamiltonians offers a unified route to electronic wave functions and physical observables. We introduce a Hamiltonian learning framework built on electronic features derived from the superposition-of-atomic-potentials (SAP) approximation, an efficient self-consistent-field initial guess that captures essential electron-electron screening. SAP quantities define a symmetry-adapted intrinsic atomic orbital learning basis and provide physics-informed inputs to an orbital-based graph neural network that predicts converged Kohn-Sham Fock matrices. To extend the approach to larger basis sets, we further develop a downfolding scheme that predicts large-basis electronic structure from minimal-basis features. On the QM9 dataset, the model accurately reproduces frontier and core orbital energies, dipole moments, and the full density of states. For organic charge-transport materials, it yields accurate intermolecular transfer integrals for benzene, tetracyanoquinodimethane (TCNQ), and tetrathiafulvalene (TTF) dimers, and transfers to unseen substituted-benzene heterodimers with a mean absolute error of 4.8 meV. These results establish SAP-based ML of electronic Hamiltonians as a transferable and scalable tool for high-throughput electronic-structure prediction.

2606.12292 2026-06-11 physics.chem-ph 新提交

Coupling of diffusion and reaction in a thin cylindrical tube: Methodological drawbacks of the Fick--Jacobs approach

细圆柱管内扩散与反应的耦合:Fick-Jacobs方法的方法论缺陷

Sergey D. Traytak, Timofey V. Fedoseev

AI总结 研究细圆柱管内扩散与反应的耦合问题,通过边界函数法推导渐近解,并与精确解对比揭示Fick-Jacobs方法的严重缺陷。

详情
Comments
20 pages, 2 figures
AI中文摘要

我们研究了一个描述细圆柱管内扩散与反应耦合的问题。通过边界函数法推导了所提问题的渐近解。我们将该渐近解与相应的精确解进行比较,揭示了已知的Fick-Jacobs约化方法的严重方法论缺陷。所得结果可用于研究Fick-Jacobs方法无法应用的一系列反应-扩散问题。

英文摘要

We investigate a problem, that describes coupling between diffusion and reaction inside a thin circular cylindrical tube. The asymptotic solution of the posed problem is derived by means of the boundary functions method. We perform comparison of this asymptotic solution against corresponding exact solution, which revealed serious methodological drawbacks of known Fick-Jacobs reduction approach. The results obtained may be used to study a wide range of reaction-diffusion problems, when the Fick-Jacobs method cannot be applied.

2606.12272 2026-06-11 physics.chem-ph 新提交

Excited-state Properties Beyond the Excitation Energy from Orbital-Optimized Density Functional Calculations I: Dipole Moments of Rydberg States

基于轨道优化密度泛函计算的激发态性质:超越激发能的里德堡态偶极矩

Lorenzo Restaino, Jukka John, Diego Llorena Prieto, Yorick L. A. Schmerwitz, Elvar Örn Jónsson, Gianluca Levi

AI总结 采用平面波基组的轨道优化密度泛函计算,研究分子里德堡激发态的偶极矩,揭示原子基组局限性,并评估不同泛函的性能。

详情
AI中文摘要

里德堡激发态因其高度弥散特性而难以描述。轨道优化密度泛函计算比含时密度泛函理论能更好地描述里德堡态。然而,迄今为止的基准测试主要关注激发能,而对偶极矩的评估仅限于最低激发态。本文采用平面波基组的轨道优化密度泛函计算,计算了一组分子中多个里德堡态的偶极矩。平面波为弥散的里德堡轨道提供了灵活的表示,揭示了原子轨道基组的局限性。常用的单增广原子基组即使在激发能对基组表示不敏感时也会产生不准确的偶极矩,并且即使添加额外的增广弥散函数,与平面波计算的差异在最弥散态中仍然存在。广义梯度近似泛函PBE与更高水平的计算结果(若有)吻合良好。杂化泛函PBE0进一步改善了结果,而采用全局标度显式Perdew-Zunger自相互作用校正的PBE虽然恢复了有效Kohn-Sham势的正确渐近-1/r行为,却导致更大的误差和偶极矩的高估。

英文摘要

Rydberg excited states are challenging to describe due to their highly diffuse character. Orbital-optimized density functional calculations provide a better description of Rydberg states than time-dependent density functional theory. However, benchmarks have so far focused on the excitation energy, while assessments of dipole moments remain limited to the lowest excited state. Here, orbital-optimized density functional calculations with a plane waves basis set are used to compute the dipole moments of several Rydberg states of a set of molecules. Plane waves provide a flexible representation of the diffuse Rydberg orbitals, revealing limitations of atomic orbitals basis sets. A commonly used single-augmented atomic basis set yields inaccurate dipole moments even when the excitation energy is insensitive to the basis representation, and discrepancies with plane waves calculations persist for the most diffuse states even when extra augmented diffuse functions are added. The generalized gradient approximation functional PBE gives good agreement with higher-level calculations where available. The hybrid functional PBE0 further improves the results, while PBE with globally scaled explicit Perdew-Zunger self-interaction correction leads to larger errors and an overestimation of the dipole moment, despite restoring the correct asymptotic $-1/r$ behavior of the effective Kohn--Sham potential.

2606.11809 2026-06-11 physics.chem-ph 新提交

Symplectic and Thermodynamically Consistent Molecular Dynamics in the Frequency Domain

频域中的辛且热力学一致的分子动力学

Kyunghoon Han, Alexandre Tkatchenko, Joshua T. Berryman

AI总结 提出傅里叶积分分子动力学(FIMD),在频域中稳定可逆地传播哈密顿系统的选定振动运动,同时实现带选择和振动分析,无需后处理。

详情
Comments
Under review: Physical Review Letters
AI中文摘要

我们引入了傅里叶积分分子动力学(FIMD),这是一种在时间上稳定且可逆地传播哈密顿系统选定振动运动的方法,同时在频域中分析和控制动力学。这使得带选择和振动分析成为积分器的特性,而非后处理步骤。我们使用经典力场、基于量子数据训练的机器学习力场以及半经验量子化学方法,对CO$_2$和封端的Ace--Phe--Tyr--NMe肽进行了演示。该方法在所选频带内重现光谱,抑制带外响应,揭示模式耦合,并展示了光谱特征对力场的依赖性,特别是对于热力学重要的低频区域。FIMD提供了一种高效且透明的方式来探索光谱和量热观测背后的振动物理机制。

英文摘要

We introduce Fourier integrator molecular dynamics (FIMD), a method for propagating selected vibrational motion of Hamiltonian systems stably and reversibly in time while analyzing and controlling dynamics in the frequency domain. This makes band selection and vibrational analysis features of the integrator rather than post-processing steps. We demonstrate the method with classical force fields, a machine-learned force field trained on quantum data, and semi-empirical quantum chemistry for CO$_2$ and the capped Ace--Phe--Tyr--NMe peptide. The method reproduces spectra within the chosen band, suppresses out-of-band response, reveals mode coupling, and demonstrates force-field dependence of spectral features, especially for the thermodynamically important low frequencies. FIMD offers an efficient and transparent way to probe the vibrational physics underlying spectroscopic and calorimetric observables.

2606.11730 2026-06-11 physics.optics physics.app-ph physics.chem-ph 新提交

Tailoring soft cavities for robust molecular strong coupling

定制软腔以实现稳健的分子强耦合

Siddharaj M. Gadge, Adarsh B. Vasista

AI总结 通过实验和理论分析,发现当腔线宽与分子线宽匹配时,软腔中分子强耦合的鲁棒性最优,为设计形态依赖的腔提供了新框架。

详情
Comments
17 pages
AI中文摘要

如何设计高效的化学开放光学腔以实现分子强耦合?解决这个问题对于开发动态可调光-物质相互作用的软腔平台至关重要,其中直接访问受限电磁模式是必不可少的。传统的腔品质因数如$Q/\sqrt{V}$和协同性成功描述了光谱限制和耗散,但未能完全捕捉腔与分子自由度之间线宽不对称性的作用。在这里,我们通过在大范围内改变聚苯乙烯微球半径,系统地研究了TDBC染料分子与微球回音壁模式之间的强耦合。为了量化强耦合的鲁棒性,我们定义了参数$\chi = \frac{g}{\max(\kappa,\gamma)}$,其中$g$是耦合强度,$\kappa$和$\gamma$分别表示腔和分子线宽。尽管由于模式体积缩放,耦合强度随腔尺寸增加而单调下降,但我们发现$\chi$在$\kappa \approx \gamma$条件附近表现出明显的最大值。这一观察表明,线宽匹配不仅是改善光谱可见性的标准,而且反映了一种耗散匹配条件,该条件优化了软腔中相干光-物质交换的鲁棒性。我们的结果为设计用于分子强耦合的形态依赖腔提供了替代框架。

英文摘要

How should one design efficient chemically open optical cavities for molecular strong coupling? Addressing this question is important for the development of soft-cavity platforms for dynamically tunable light--matter interactions, where direct access to confined electromagnetic modes is essential. Conventional cavity figures of merit such as $Q/\sqrt{V}$ and cooperativity successfully describe spectral confinement and dissipation but do not fully capture the role of linewidth asymmetry between cavity and molecular degrees of freedom. Here, we systematically investigate strong coupling between TDBC dye molecules and whispering gallery modes of polystyrene microspheres by varying the microsphere radius over a broad range. To quantify the robustness of strong coupling, we define the parameter $\chi = \frac{g}{\max(\kappa,\gamma)}$, where $g$ is the coupling strength, while $\kappa$ and $\gamma$ denote the cavity and molecular linewidths, respectively. Although the coupling strength decreases monotonically with increasing cavity size due to mode-volume scaling, we find that $\chi$ exhibits a pronounced maximum near the condition $\kappa \approx \gamma$. This observation suggests that linewidth matching is not merely a criterion for improved spectral visibility, but reflects a dissipation-matching condition that optimizes the robustness of coherent light--matter exchange in soft-cavities. Our results provide an alternative framework for designing morphology-dependent cavities for molecular strong coupling.

2606.11579 2026-06-11 quant-ph cs.DC physics.atm-clus physics.atom-ph physics.chem-ph 新提交

Tensor-Network-Based Distributed Quantum Dynamics on Independent Quantum Computers

基于张量网络的独立量子计算机分布式量子动力学

Anurag Dwivedi, Melissa C. Revelle, Daniel S. Lobser, Brian K. McFarland, Edward C. Tortorici, Christopher G. Yale, Susan M. Clark, Philip Richerme, Srinivasan S. Iyengar

AI总结 提出基于张量网络的分布式量子计算方法,将多维时间演化算子分解为独立低维传播,在异构量子-经典架构上异步执行,并在离子阱量子计算机上实验验证,计算质子化水团簇振动光谱精度达4 cm⁻¹。

详情
AI中文摘要

我们提出了一种基于张量网络的方法,用于连续变量表示中化学波包动力学的分布式量子计算模拟。核心思想是:多维时间演化算子的张量网络表示自然诱导出一个提升的希尔伯特空间,其中动力学分解为一组独立的低维传播。这种变换将纠缠的量子演化转化为一组并行的计算任务,可以在异构量子与经典计算架构上异步执行。由此产生的形式体系建立了张量网络分解、均匀受控量子电路和异步分布式量子计算之间的直接联系。该方法旨在实现混合量子/经典实现,适用于通用异构量子硬件系统。由张量网络分解产生的异步分布式量子过程的实验实现是在桑迪亚国家实验室的离子阱量子计算机上进行的,其中电路使用原生部分纠缠$XX(\ heta)$门进行编译,与传统的完全纠缠分解相比,预期的两量子比特门保真度降低了30%以上。我们通过量子计算一个小型质子化水团簇的振动光谱来演示该方法,该团簇显示出关键的量子核行为。此类水团簇系统已被发现对实验作用光谱学和理论具有挑战性,而在这里,我们首次提供了与相应经典结果一致(误差在4 cm⁻¹以内)的振动光谱结果,从而展示了量子计算实现光谱精度的潜力。

英文摘要

We present an approach based on tensor networks for distributed quantum computing simulation of chemical wavepacket dynamics in a continuous variable representation. The central idea is that the tensor-network representation of the multidimensional time-evolution operator naturally induces an elevated Hilbert space where the dynamics decomposes into a set of independent lower-dimensional propagations. This transformation converts an entangled quantum evolution into a set of parallel computational tasks that can be executed asynchronously across heterogeneous quantum and classical computing architectures. The resulting formalism establishes a direct connection between tensor-network decompositions, uniformly controlled quantum circuits, and asynchronous distributed quantum computing. The approach is developed with a goal towards hybrid quantum/classical implementation, and is appropriate for a general heterogeneous mixture of quantum hardware systems. The experimental realization of the asynchronously distributed quantum processes that arise from the tensor-network decomposition are carried out on the Sandia National Laboratories' trapped-ion quantum computer, where the circuits are compiled using native partial-entangling $XX(\theta)$ gates, reducing the expected two-qubit gate infidelity by more than 30\% relative to conventional fully entangling decompositions. We demonstrate the methodology by quantum computing the vibrational spectra of a small protonated water cluster that shows critical quantum nuclear behavior. Such water cluster systems have been found to be challenging for experimental action spectroscopy and for theory, and here, for the first time, we provide results for vibrational spectroscopy that are in agreement with the respective classical results to within 4cm$^{-1}$, thus allowing for the potential for spectroscopic accuracy from quantum computations.

2606.11574 2026-06-11 cs.LG cond-mat.mtrl-sci physics.chem-ph stat.ML 新提交

Range-Aware Bayesian Optimization for Discovering Diverse Designs within Target Property Windows

范围感知贝叶斯优化用于在目标属性窗口内发现多样化设计

Shengli Jiang, Jason Wu, Charles M. Schroeder, Michael A. Webb

发表机构 * Department of Chemical and Biological Engineering, Princeton University(普林斯顿大学化学与生物工程系)

AI总结 提出范围感知贝叶斯优化框架,通过采集函数直接评分候选解满足目标范围的后验概率,在基准任务和实际案例中比标准方法发现更多样化的有效设计。

详情
Comments
64 pages, 6 main text figures, 17 supporting figures, 6 supporting tables
AI中文摘要

在许多材料和产品设计问题中,理想的候选物表现出可接受范围内的属性,而非达到单一最优值。恢复满足此类规格的多个不同解也具有实际价值,因为某些候选物可能因成本、可加工性或鲁棒性等原因而更受青睐,而这些因素难以直接编码到目标函数中。在此,我们开发了一个范围感知贝叶斯优化(BO)框架,其中采集函数直接评分候选解满足目标范围的后验概率。该框架自然扩展到在共享候选空间上并行追求多个不同规格。在基准任务中,范围感知采集一致地比标准BO基线和最近的目标寻求方法恢复更大且更多样化的有效设计集。其效用进一步在两个实际动机的设计案例研究中得到证明,涉及优化聚合物合成的反应条件和发现指定光学吸收带的序列定义低聚物,并得到量子化学计算的支持。这些结果表明,范围感知BO可以为规格驱动设计提供实用且样本高效的基础,特别是当设计灵活性和解多样性是重要考虑因素时。

英文摘要

In many materials and product design problems, desirable candidates exhibit properties that fall within an acceptable range rather than achieve a single optimum. Recovering multiple, distinct solutions that satisfy such specifications is also practically valuable, as some candidates may be preferred for reasons of cost, processability, or robustness that are difficult to encode directly in an objective function. Here, we develop a range-aware Bayesian optimization (BO) framework in which the acquisition function directly scores the posterior probability that a candidate satisfies a target range. The framework naturally extends to parallel pursuit of multiple distinct specifications over a shared candidate space. Across benchmark tasks, range-aware acquisition consistently recovers larger and more diverse sets of valid designs than standard BO baselines and recent goal-seeking methods. Its utility is further demonstrated in two practically motivated design case studies involving optimizing reaction conditions for polymer synthesis and sequence-defined oligomer discovery for prescribed optical absorption bands, supported by quantum chemical calculations. These results suggest that range-aware BO can provide a practical and sample-efficient foundation for specification-driven design, particularly when design flexibility and solution diversity are important considerations.

2606.11486 2026-06-11 physics.chem-ph q-bio.MN 新提交

Elucidating the Size of Chemical Space with Assembly Theory

通过组装理论阐明化学空间的大小

Juan Carlos Morales Parra, Keith Y Patarroyo, Abhishek Sharma, David Obeh Alobo, Leroy Cronin

AI总结 利用组装理论,通过组装指数量化分子复杂度,首次从第一性原理估计化学空间大小,发现其随复杂度至少超指数增长,最多双指数增长,在药物相似约束下约为10^117个分子。

详情
Comments
26 pages, 10 figures, 31 references
AI中文摘要

化学空间极其广阔,常见启发式估计表明,在分子质量低于500 Da时,可能存在约10^60个“类药”分子。然而,这些估计很大程度上忽略了所枚举分子的结构和合成复杂性。这里,我们利用组装理论从第一性原理估计化学空间的大小,该理论量化了形成分子所需的因果量,由组装指数捕获。这是一个可测量的分子复杂度度量,源于构建分子图所需的最小递归键合操作次数。组装理论将化学空间划分为由组装指数定义的层次,从而可以对其随分子复杂度增加的增长设定界限。我们表明,化学空间(累积的组装指数水平集)至少以超指数方式增长,至多以双指数方式增长相对于组装指数。使用GDB-13数据库作为增长率估计的参考,我们模拟了化学空间如何在复杂度增加下扩张以及在结构约束(包括原子和键类型、环数、环大小和化学基序)下收缩。在类似于标准类药估计的约束下,包括分子质量低于500 Da,我们的分析得出在组装指数25时化学空间约为10^117个分子。最后,我们通过生物相关基序约束化学空间,并识别出这些组装定义空间的可访问边界附近的结构相关分子。

英文摘要

Chemical space is unimaginably vast with common heuristic estimates suggesting that there are ca. 10^60 'drug-like' molecules possible below a molecular mass of 500 Da. However, these estimates largely ignore the structural and synthetic complexity of the molecules enumerated. Here we present a first-principles estimate of the size of chemical space using the Assembly Theory, which quantifies the amount of causation required to form a molecule, captured in the assembly Index. This is a measurable molecular complexity measure derived from the minimum number of recursive bond-joining operations required to construct a molecular graph. Assembly Theory partitions chemical space into levels defined by Assembly Index, allowing bounds to be placed on its growth as molecular complexity increases. We show that chemical space (the accumulated Assembly Index level sets) grows at least super-exponentially, and at most, double-exponentially with respect to the Assembly Index. Using the GDB-13 database as a reference for growth-rate estimation, we model how chemical space expands under increasing complexity and contracts under structural constraints, including atom and bond types, number of rings, ring size, and chemical motifs. Under constraints comparable to standard drug-like estimates, including molecular mass below 500 Da, our analysis yields a chemical space of approximately 10117 molecules at Assembly Index 25. Finally, we constrain chemical space by biologically relevant motifs and identify structurally relevant molecules near the accessible boundaries of these assembly-defined spaces.

2606.11256 2026-06-11 physics.chem-ph cs.LG cs.NE 新提交

My Chemical Harness: Evolutionary Molecular Design over Synthetic Pathways with Large Language Model Agents

我的化学缰绳:基于合成路径的大语言模型智能体进化分子设计

César Ojeda, Darius A. Faroughy, Maryam Karimi, Payam Zarrintaj, Mir Mehdi Seyedebrahimi, Martín Carballo-Pacheco

AI总结 提出一种以可执行合成路径为种群、大语言模型仅作策略控制器的进化框架,在可溶性环氧化物水解酶代理任务上达到最优性能。

详情
Comments
27 pages | 10 figures
AI中文摘要

当候选结构伴随可行的合成路线时,设计具有目标性质的分子最为有用。我们介绍了My Chemical Harness,一种面向目标分子设计的路线原生进化框架,其中搜索种群由可执行的合成路径而非孤立的分子图组成。每条路径由可购买的构建块和反应模板构建,通过确定性化学工具执行,并通过任务特定的分子预言机评分。大语言模型仅用作策略控制器,选择关于路径长度、移动类型、反应家族、基序和探索压力的高级偏好,而本地代码执行路径构建、验证、去重、评分、选择和记忆更新。这种分离使得大语言模型能够引导探索,同时防止其引入幻觉产物或不受支持的反应步骤。在一个可溶性环氧化物水解酶代理任务上,我们的LLM智能体优于单次LLM和确定性控制器,在sEH分数、合成可及性分数和AiZynthFinder成功率指标上达到最先进性能。这些结果表明,受约束的大语言模型智能体可以在无需训练、微调或专用生成模型的情况下,在分子发现中发挥重要作用。

英文摘要

Designing molecules with target properties is most useful when candidate structures are accompanied by feasible synthetic routes. We introduce My Chemical Harness, a route-native evolutionary framework for goal-directed molecular design in which the search population consists of executable synthetic pathways rather than isolated molecular graphs. Each route is built from purchasable building blocks and reaction templates, executed by deterministic chemistry tools, and scored through task-specific molecular oracles. Large language models (LLMs) are used only as strategy controllers that select high-level preferences over route length, move type, reaction families, motifs, and exploration pressure, while local code performs route construction, validation, deduplication, scoring, selection, and memory updates. This separation lets the LLM guide exploration without allowing it to introduce hallucinated products or unsupported reaction steps. On a soluble epoxide hydrolase proxy task, our LLM agent improves over single pass LLM and deterministic controllers, reaching state-of-the-art performance across the sEH score, synthetic accessibility score, and AiZynthFinder success rate metrics. These results suggest that constrained LLM agents can play a significant role in molecular discovery without requiring training, fine-tuning, or dedicated generative models.

2606.11227 2026-06-11 physics.bio-ph cond-mat.mes-hall physics.chem-ph quant-ph 新提交

Collective Emission in LH2 Assembly Beyond the Point-Dipole Approximation

超越点偶极近似的LH2组装体集体发射

Javed Akhtar, Himangshu Prabal Goswami

AI总结 本文利用量子电动力学并矢格林张量构建非厄米哈密顿量,研究紫色细菌LH2组装体的集体发射,发现P42$_1$2对称性反转了单环的明暗态顺序,使整个晶体成为能量收集实体,并揭示了倾斜驱动的开关机制。

详情
AI中文摘要

光捕获组装体中的集体发射受局部跃迁偶极和发射单元的有限几何形状控制,而点偶极近似掩盖了这一事实。为了超越这一图像,我们利用紫色细菌的量子电动力学并矢格林张量构建了一个非厄米哈密顿量。我们为孤立的24个细菌叶绿素圆锥截头体及其P42$_1$2晶体学组装体构建了该哈密顿量。发现P42$_1$2单胞对称性反转了单环的明暗态顺序,将亚辐射态置于低能端,并揭示整个晶体是能量收集实体。倾斜驱动的开关仅在有限偶极载体(LH2)垂直于生长平面的晶体几何结构中被激活。空位和取向无序仅通过合作将开关阈值从较高的极角重新归一化到较低的值。

英文摘要

Collective emission in light-harvesting assemblies is governed by the local transition dipole and finite geometry of emitting units, a fact that point-dipole approximation obscures. To go beyond this picture, we develop a non-Hermitian Hamiltonian using the quantum electrodynamic dyadic Green's tensor for a purple bacteria. We construct it for the isolated 24-bacteriochlorophyll conical frustum and its P42$_1$2 crystallographic assembly. The P42$_1$2 unit-cell symmetry is found to invert the bright-dark ordering of the single ring, placing subradiant states at the low-energy end and revealing the entire crystal to be the energy-harvesting entity. Tilt-driven switching is activated only in crystal geometries where the finite dipole-carrier (LH2) lies perpendicular to the growth plane. Vacancy and orientational disorder work only in cooperation to renormalize the switching threshold from higher polar angles to lower values.

2606.02419 2026-06-11 physics.chem-ph cond-mat.mtrl-sci physics.comp-ph 版本更新

DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution

DPA4: 利用EMFA SO(2)卷积推动原子间势的精度-成本前沿

Tiancheng Li, Wentao Li, Anyang Peng, Jianming Xue, Linfeng Zhang, Duo Zhang, Han Wang

AI总结 本文提出DPA4架构,通过EMFA SO(2)等变卷积和编译器友好的训练路径,在降低参数和训练成本的同时,在Matbench Discovery等基准上达到最优精度-成本平衡。

详情
AI中文摘要

机器学习原子间势现在在标准基准上接近量子力学精度,但最具表现力的等变架构的训练成本已成为严重瓶颈。我们引入了DPA4,一种SE(3)-等变原子间势架构,具有EMFA(边缘条件、多焦点、注意力)SO(2)-等变卷积,该卷积结合了低秩边缘-节点SO(2)-等变乘积、用于消息非线性的多焦点设计以及用于消息聚合的包络门控注意力。Lebedev网格投影进一步将非线性中的SO(3)-等变性保持到机器精度。编译器友好的保守能量梯度训练路径在torch.compile下提供了高达约3倍的挂钟加速。在合规的Matbench Discovery基准上,DPA4-Pro在排行榜上获得了最佳综合性能得分(CPS),而276万参数的DPA4-Air以10.9倍更少的参数和42.9倍更少的训练计算量,超过了3010万参数的eSEN-30M-MP基线的精度。在SPICE-MACE-OFF上,540万参数的DPA4-Plus将650万参数的eSEN基线的总分子能量和力误差分别降低了29%和30%,而270万参数的DPA4-Air仍然以约2.4倍更少的参数超越了该基线。这些结果共同将DPA4置于Matbench Discovery上新的精度-成本帕累托前沿,并使其成为未来多任务大型原子模型(LAM)预训练的有力候选骨干。

英文摘要

Machine-learning interatomic potentials now approach quantum-mechanical accuracy on standard benchmarks, but the training cost of the most expressive equivariant architectures has become a serious bottleneck. We introduce DPA4, an SE(3)-equivariant interatomic-potential architecture with an EMFA (Edge-conditioned, Multi-Focus, Attention) SO(2)-equivariant convolution that combines a low-rank edge-node SO(2)-equivariant product, a multi-focus design for message nonlinearity, and envelope-gated attention for message aggregation. A Lebedev-grid projection further preserves SO(3)-equivariance in the nonlinearity to machine precision. A compiler-friendly conservative energy-gradient training path provides up to $\sim$3 times wall-clock speedup under torch compile. On the compliant Matbench Discovery benchmark, DPA4-Pro attains the best Combined Performance Score (CPS) on the leaderboard, while the 2.76M-parameter DPA4-Air exceeds the accuracy of the 30.1M-parameter eSEN-30M-MP baseline with 10.9$\times$ fewer parameters and 42.9$\times$ less training compute. On SPICE-MACE-OFF, the 5.4M-parameter DPA4-Plus lowers the aggregate molecular energy and force errors of the 6.5M-parameter eSEN baseline by 29% and 30%, while the 2.7M-parameter DPA4-Air still surpasses that baseline with $\sim$2.4$\times$ fewer parameters. Together these results place DPA4 on a new accuracy-cost Pareto frontier on Matbench Discovery and position it as a strong candidate backbone for future multi-task large atomistic model (LAM) pretraining.

2605.20819 2026-06-11 physics.chem-ph 版本更新

DynaMate2: runtime registration of expert-defined tools for agentic scientific workflow automation

DynaMate2:使代理AI民主化,用于专家设计的定制工作流

Orlando A. Mendible-Barreto, Ajay Vallabh, Ubaldo M. Córdova-Figueroa, Yamil J. Colón

AI总结 本文提出DynaMate2,一种分层代理框架和开源模板,旨在降低研究人员将现有专家定义的Python函数转换为AI可调用工具的门槛,通过让LLM负责任务路由和工具选择,而非生成科学代码,从而实现自动化科学工作流的民主化。

详情
AI中文摘要

计算化学和材料科学中的科学工作流通常涉及多个相互依赖的步骤,如模型准备、系统构建、模拟执行和数据分析,这些步骤经过多年研究已发展成高度专业化的验证代码库。尽管大型语言模型(LLM)代理框架已显示出自动化此类工作流的潜力,但现有系统是为特定、预定义的任务序列构建的。适应新领域或集成定制专家开发工具需要大量编程专业知识,限制了其在更广泛科学社区中的应用。本文介绍了DynaMate2,一种分层代理框架和开源模板,其核心设计目标是降低任何研究人员将现有专家定义的Python函数转换为AI可调用工具的门槛。关键设计原则是LLM从不被要求生成科学代码,因为所有领域逻辑都位于专家定义的工具中。LLM的唯一职责是路由任务、选择适当的工具,并使用输出指导后续行动。工具和代理可以在运行时从内联代码、现有源文件或纯语言描述中注册,且所有扩展会自动保留跨会话。我们通过端到端的分子动力学工作流演示了该框架。我们提供了一种工具注册协议,指导研究人员逐步将他们的验证代码集成到该框架中。DynaMate2作为开源参考实现发布,配有基于网页的界面,并设计为在任意科学领域中供社区驱动扩展的可重用模板。

英文摘要

Agentic large-language-model systems can coordinate scientific tools, but many implementations remain difficult for domain scientists to extend without modifying the source orchestration code or relying on unconstrained code generation. DynaMate2 is a LangGraph-based multi-agent framework for converting expert-defined Python functions into persistent AI-callable tools. The architecture separates domain execution from LLM supervision: registered tools perform scientific operations, while a supervisor LLM decomposes goals, selects specialist agents, routes inputs, and propagates outputs across steps. DynaMate2 supports: runtime tool registration from inline code, source files, and explicitly requested natural-language specifications; persistent storage of tools, agents, and conversation state; and a web interface for interactive workflow assembly. We demonstrate the framework on a molecular simulation workflow in which a single instruction retrieves a MACE foundation model, builds a NaCl-water configuration, runs an ASE molecular dynamics trajectory, and generates energy and temperature diagnostics. The demonstration illustrates how validated workflow components can be composed into a supervised agentic pipeline without rewriting the framework. DynaMate2 therefore provides a reusable template for extending LLM-based automation to research groups with existing Python workflows, while preserving the need for explicit tool validation, reproducibility logs, and deployment-specific safeguards.

2604.06841 2026-06-11 physics.chem-ph cond-mat.str-el 版本更新

Spin-adapted neural network backflow for symmetry-preserving simulations of strongly correlated electrons

自旋自适应神经网络反流用于强关联电子的对称性保持模拟

Yunzhi Li, Zibo Wu, Bohan Zhang, Wei-Hai Fang, Zhendong Li

AI总结 提出自旋自适应神经网络反流(SA-NNBF)变分波函数,通过结合构型依赖空间轨道和压缩自旋本征函数,消除自旋污染,在氢链和铁硫簇中实现更低变分能,并在FeMoco活性空间模型中达到与自旋自适应DMRG竞争的能量。

详情
Comments
10 pages, 5 figures
AI中文摘要

强关联分子通常包含密集的低能自旋态流形,使得总自旋对称性对于预测性电子结构理论至关重要。神经网络量子态提供了灵活的变分波函数,但常用的费米子架构不强制这种对称性,因此可能收敛到具有误导性能量和性质的自旋污染态。这里我们引入第二量子化中的自旋自适应神经网络反流(SA-NNBF)拟设,它结合了构型依赖的空间轨道和压缩的自旋本征函数。自旋本征函数的投影张量压缩方案和粒子-空穴表示使得使用SA-NNBF的变分蒙特卡罗计算对于包含超过一百个电子的活性空间变得实用。在氢链和铁硫簇中,SA-NNBF消除了自旋污染,并且一致地实现了比具有可比参数数量的标准NNBF更低的变分能量。对于FeMoco的CAS(113e,76o)活性空间模型,SA-NNBF产生了一个高度紧凑的自旋自适应变分态,在键维数$D=10000$时实现了与最近自旋自适应DMRG计算竞争的能量,同时使用的参数数量少几个数量级。我们的工作为开发用于化学现实强关联电子的自旋对称性保持神经网络量子态建立了一个通用框架。

英文摘要

Strongly correlated molecules often contain dense manifolds of low-lying spin states, making total-spin symmetry essential for predictive electronic-structure theory. Neural-network quantum states provide flexible variational wavefunctions, but commonly used fermionic architectures do not enforce this symmetry and can therefore converge to spin-contaminated states with misleading energies and properties. Here we introduce a spin-adapted neural-network backflow (SA-NNBF) ansatz in second quantization, which combines configuration-dependent spatial orbitals with a compressed spin eigenfunction. A projected tensor compression scheme for spin eigenfunctions and a particle-hole representation make variational Monte Carlo calculations with SA-NNBF practical for active spaces containing more than one hundred electrons. Across hydrogen chains and iron-sulfur clusters, SA-NNBF eliminates spin contamination and consistently achieves lower variational energies than standard NNBF with a comparable number of parameters. For the CAS(113e,76o) active-space model of FeMoco, SA-NNBF yields a highly compact spin-adapted variational state, achieving an energy competitive with recent spin-adapted DMRG calculations at bond dimension $D=10000$ while using orders of magnitude fewer parameters. Our work establishes a general framework for developing spin-symmetry-preserving neural-network quantum states for chemically realistic strongly correlated electrons.

2509.25070 2026-06-11 astro-ph.GA astro-ph.IM astro-ph.SR physics.chem-ph 版本更新

Interstellar Dust-Catalyzed Molecular Hydrogen Formation Enabled by Nuclear Quantum Effects

星际尘埃催化的分子氢形成:核量子效应的作用

Xiaolong Yang, Lile Wang, Di Li, Shenzhen Xu

AI总结 通过多尺度模拟,发现核量子效应在低温下克服经典玻尔兹曼抑制,使化学吸附氢原子高效形成H₂,为星际H₂形成提供第一性原理量子基础。

详情
Comments
29 pages, 20 figures
AI中文摘要

分子氢(H$_2$)是控制和塑造从星系演化到行星形成等广泛天体物理过程的关键化学物种之一。尽管尘埃颗粒表面的催化是星际介质中H$_2$的主要形成通道,但其在$20-200~\rm K$范围内的效率尚未完全理解。这里,我们使用结合了从头算级机器学习力场、约束路径积分蒙特卡罗和动力学蒙特卡罗的多尺度模拟,对完整的H$_2$形成序列(包括氢吸附、扩散、结合和解吸)进行了系统的量子力学研究。我们明确考虑了气体和尘埃温度的解耦,使我们的结果适用于光子主导区(PDRs)和致密冷云。我们的结果表明,在本文研究的裸露晶体表面(石墨和硅酸盐颗粒)上,物理吸附氢可忽略不计,而化学吸附氢原子中的核量子效应(NQEs)对于低温下高效形成至关重要,克服了经典的玻尔兹曼抑制。本工作对硅酸盐表面(以顽火辉石为例)和石墨颗粒进行了包含NQEs的定量研究,揭示了表面特异性吸附行为。这些发现为星际H$_2$形成提供了第一性原理量子基础,补充了经验乘数,并为尘埃组成和分子云演化提供了新的观测约束。该框架还可推广到完全NQEs下尘埃颗粒上的其他天体化学反应。

英文摘要

Molecular hydrogen (H$_2$) is one of the key chemical species that controls and shapes a wide spectrum of astrophysical processes from galaxy evolution to planet formation. Although catalyzation on dust grain surfaces is the dominant formation channel of H$_2$ in the interstellar medium, its efficiency across $20-200~\rm K$ has remained not fully understood. Here, using multiscale simulations combining ab-initio-level machine learning force fields, constrained path-integral Monte Carlo, and kinetic Monte Carlo, we perform a systematic, quantum-mechanical study of the full H$_2$ formation sequence, including hydrogen adsorption, diffusion, association and desorption. We explicitly consider the decoupling of gas and dust temperatures, making our results applicable to photon-dominated regions (PDRs) and dense cold clouds. Our results show that on the bare, crystalline surfaces studied here (graphitic and silicate grains), physisorbed hydrogen is negligible, and nuclear quantum effects (NQEs) in chemisorbed hydrogen atoms are essential for efficient formation at low temperatures, overcoming the classical Boltzmann suppression. This work presents a quantitative NQEs-inclusive study on silicate surfaces (exemplified by enstatite) and graphitic grains, revealing surface-specific adsorption behavior. These findings provide a first-principles quantum foundation for interstellar H$_2$ formation, complementing empirical multipliers, and enable new observational constraints on dust composition and molecular cloud evolution. The framework also extends to other astrochemical reactions on dust grains under full NQEs.

2501.09896 2026-06-11 physics.chem-ph

High-Accuracy Physical Property Prediction for Organics via Molecular Representation Learning: Bridging Data to Discovery

Qi Ou, Hongshuai Wang, Minyang Zhuang, Shangqian Chen, Lele Liu, Ning Wang, Zhifeng Gao

详情
Journal ref
npj Computational Materials volume 11, Article number: 224 (2025)
英文摘要

The ongoing energy crisis has underscored the urgent need for energy-efficient materials with high energy utilization efficiency, prompting a surge in research into organic compounds due to their environmental compatibility, cost-effective processing, and versatile modifiability. To address the high experimental costs and time-consuming nature of traditional trial-and-error methods in the discovery of highly functional organic compounds, we apply the 3D transformer-based molecular representation learning algorithm to construct a pre-trained model using 60 million semi-empirically optimized structures of small organic molecules, namely, Org-Mol, which is then fine-tuned with public experimental data to obtain prediction models for various physical properties. Despite the pre-training process relying solely on single molecular coordinates, the fine-tuned models achieves high accuracy (with $R^2$ values for the test set exceeding 0.95). These fine-tuned models are applied in a high-throughput screening process to identify novel immersion coolants among millions of automatically constructed ester molecules, resulting in the experimental validation of two promising candidates. This work not only demonstrates the potential of Org-Mol in predicting bulk properties for organic compounds but also paves the way for the rational and efficient development of ideal candidates for energy-saving materials.