arXivDaily arXiv每日学术速递 周一至周五更新
重置
2507.08794 2026-06-12 cs.LG cs.CL 版本更新

One Token to Fool LLM-as-a-Judge

一个令牌就能欺骗LLM裁判

Yulai Zhao, Haolin Liu, Dian Yu, Sunyuan Kung, Meijia Chen, Haitao Mi, Dong Yu

AI总结 发现基于参考的生成式奖励模型易受奖励黑客攻击,表面输入(如非词符号或通用推理开头)能持续引发假阳性奖励,提出使用截断模型输出作为对抗性负例的数据增强策略,构建鲁棒的Master奖励模型。

详情
AI中文摘要

大型语言模型(LLM)越来越被信任作为自动裁判,协助评估并为训练其他模型提供奖励信号,特别是在基于参考的设置中,如带可验证奖励的强化学习(RLVR)。然而,我们揭示了即使在这种基于参考的范式中也存在一个关键漏洞:生成式奖励模型系统性地容易受到奖励黑客攻击。我们发现,表面输入——我们称之为“万能钥匙”,例如非词符号(如“:”或“.”)或通用推理开头(如“思考过程:”或“让我们逐步解决这个问题。”)——可以在没有任何实质性推理的情况下持续引发假阳性奖励。我们的系统评估表明,这是一个广泛存在的失败,影响多种模型,包括领先的专有系统如GPT-o1和Claude-4。这些结果挑战了LLM裁判假定的鲁棒性,并对其可靠性构成重大威胁。为了解决这个问题,我们提出了一种简单而有效的数据增强策略,使用截断的模型输出作为对抗性负例。由此产生的Master奖励模型(Master-RMs)在对这些“万能钥匙”攻击方面表现出最先进的鲁棒性,同时在标准评估设置中保持高性能。我们通过跨模型规模、提示变化和常见推理时策略的漏洞全面分析来补充这些发现,为未来关于鲁棒LLM评估的研究提供见解。我们在https://this.url 和 https://this.url 发布我们的鲁棒通用领域奖励模型和合成训练数据。

英文摘要

Large language models (LLMs) are increasingly trusted as automated judges, assisting evaluation and providing reward signals for training other models, particularly in reference-based settings like Reinforcement Learning with Verifiable Rewards (RLVR). However, we uncover a critical vulnerability even in this reference-based paradigm: generative reward models are systematically susceptible to reward hacking. We find that superficial inputs, which we term ''master keys'' such as non-word symbols (e.g., '':'' or ''.'') or generic reasoning openers (e.g., ''Thought process:'' or ''Let's solve this problem step by step.''), can consistently elicit false positive rewards without any substantive reasoning. Our systematic evaluation demonstrates this is a widespread failure affecting a diverse range of models, including leading proprietary systems such as GPT-o1 and Claude-4. These results challenge the assumed robustness of LLM judges and pose a significant threat to their reliability. To address this, we propose a simple yet effective data augmentation strategy using truncated model outputs as adversarial negative examples. The resulting Master Reward Models (Master-RMs) demonstrate state-of-the-art robustness against these ''master key'' attacks while maintaining high performance in standard evaluation settings. We supplement these findings with a comprehensive analysis of the vulnerability across model scales, prompt variations, and common inference-time strategies, offering insights to guide future research on robust LLM evaluation. We release our robust, general-domain reward models and the synthetic training data at https://huggingface.co/sarosavo/Master-RM and https://huggingface.co/datasets/sarosavo/Master-RM.

2507.07879 2026-06-12 cs.SD eess.AS 版本更新

LISTEN: Lightweight Industrial Sound-representable Transformer for Edge Notification

LISTEN:面向边缘通知的轻量级工业声音可表示Transformer

Changheon Han, Yun Seok Kang, Yuseop Sim, Hyung Wook Park, Martin Byung-Guk Jun

AI总结 提出轻量级工业声音基础模型LISTEN,通过知识蒸馏从大模型IMPACT压缩,仅用少量数据微调即可在边缘设备上实现实时机器监控,性能接近大模型。

详情
Journal ref
Advanced Engineering Informatics, Volume 76, Part A, 2026, 104944
AI中文摘要

基于深度学习的机器听觉正在拓宽工业声学分析的范围,但其在实时车间中的广泛实施受到对每个新任务依赖大型、任务特定标注数据集的阻碍。虽然新兴的通用声音基础模型旨在减轻数据依赖性,但它们在实践中暴露出关键困境。通用声音基础模型计算成本高,并且在以音调谐波、宽带噪声和瞬态故障事件为特征的工业场景中失败,使得即时、现场部署不切实际。这些挑战共同意味着,在实时车间部署声音基础模型的实用端到端系统仍然难以实现。为了解决这一挑战,本研究引入了LISTEN(面向边缘通知的轻量级工业声音可表示Transformer),这是第一个专门针对工业声音的轻量级基础模型。通过从大规模教师模型IMPACT(基于声学认知Transformer的工业机器感知)进行知识蒸馏,我们构建了针对资源受限边缘环境优化的LISTEN。通过冻结骨干网络并仅对最小目标过程数据训练浅层头部,而不是进行完全微调或重新训练,LISTEN在多种制造过程中实现了与IMPACT几乎相同的性能。本研究进一步展示了一个完整的实时机器监控系统,包括使用工业物联网(IIoT)设备进行数据采集、使用最小标注数据进行快速模型适应,以及在低成本边缘设备上进行实时监控。通过在实时CNC机器上验证整个系统,这项工作建立了在活跃工业环境中部署轻量级工业声音基础模型的第一个可行的端到端系统。

英文摘要

Deep learning-based machine listening is broadening the scope of industrial acoustic analysis, yet its widespread implementation on live shop floors is hindered by the reliance on large, task-specific annotated datasets for every new task. While emerging general-purpose sound foundation models aim to alleviate data dependency, they reveal critical dilemmas in practice. General-purpose sound foundation models are computationally expensive and fail in industrial scenarios characterized by tonal harmonics, broadband noise, and transient fault events, making instant, on-site deployment impractical. These challenges combined mean that a practical, end-to-end system for deploying a sound foundation model on a live shop floor has remained elusive. To address this challenge, this study introduces LISTEN (Lightweight Industrial Sound-representable Transformer for Edge Notification), the first lightweight foundation model specialized for industrial sound. Through Knowledge Distillation (KD) from the large-scale teacher model IMPACT (Industrial Machine Perception via Acoustic Cognitive Transformer), we construct LISTEN optimized for resource-constrained edge environments. By freezing the backbone and training only a shallow head on minimal target-process data, rather than performing full fine-tuning or retraining, LISTEN achieves nearly identical performance to IMPACT across diverse manufacturing processes. This study further demonstrates a complete system for real-time machine monitoring, encompassing data acquisition with Industrial Internet of Things (IIoT) devices, rapid model adaptation using minimal annotated data, and real-time monitoring on a low-cost edge device. By validating the entire system on a live CNC machine, this work establishes the first feasible end-to-end system for deploying a lightweight industrial sound foundation model in an active industrial environment.

2501.08425 2026-06-12 cs.LG math.AP math.PR 版本更新

Is Stochastic Gradient Descent Effective? A PDE Perspective on Machine Learning processes

随机梯度下降有效吗?机器学习过程的PDE视角

Davide Barbieri, Matteo Bonforte, Peio Ibarrondo

AI总结 通过Fokker-Planck型抛物PDE分析SGD行为,区分漂移和扩散两个阶段,量化浓度现象并证明平均退出时间界限,为非凸损失和退化扩散矩阵下的渐近收敛提供新结果。

详情
AI中文摘要

本文分析了随机梯度下降(SGD)的行为,这是一种在监督学习中广泛使用的方法,通过最小化非凸损失函数来优化神经网络权重。自E、Li和Tai(2017)的开创性工作以来,此类过程的基本结构可以通过Fokker-Planck型抛物PDE来理解,这是我们分析的核心。尽管Fokker-Planck方程历史悠久且文献丰富,但当势函数非凸或扩散矩阵退化时,几乎一无所知,这是我们分析中面临的主要困难。我们识别出两种不同的阶段:在SGD的初始阶段,损失函数驱动权重集中在最近的局部最小值附近。我们将此阶段称为漂移阶段,并提供了关于这种集中现象的定量估计。接下来,我们引入扩散阶段,其中随机波动帮助学习过程逃离次优局部最小值。我们分析了平均退出时间(MET),并证明了MET的上下界。最后,我们针对非凸代价函数和退化扩散矩阵(不允许使用标准方法并需要新技术)研究了SGD的渐近收敛性。为此,我们利用了两种不同的方法:对偶方法和熵方法。我们提供了关于SGD动力学和有效性的新结果,建立了随机优化与PDE理论之间的深层联系,并为机器学习过程中的基本问题提供了一些答案和见解:SGD需要多长时间才能逃离一个坏的最小值?使用SGD时神经网络参数是否收敛?在SGD训练的第一阶段,参数如何演化?

英文摘要

In this paper we analyze the behaviour of the stochastic gradient descent (SGD), a widely used method in supervised learning for optimizing neural network weights via a minimization of non-convex loss functions. Since the pioneering work of E, Li and Tai (2017), the underlying structure of such processes can be understood via parabolic PDEs of Fokker-Planck type, which are at the core of our analysis. Even if Fokker-Planck equations have a long history and a extensive literature, almost nothing is known when the potential is non-convex or when the diffusion matrix is degenerate, and this is the main difficulty that we face in our analysis. We identify two different regimes: in the initial phase of SGD, the loss function drives the weights to concentrate around the nearest local minimum. We refer to this phase as the drift regime and we provide quantitative estimates on this concentration phenomenon. Next, we introduce the diffusion regime, where stochastic fluctuations help the learning process to escape suboptimal local minima. We analyze the Mean Exit Time (MET) and prove upper and lower bounds of the MET. Finally, we address the asymptotic convergence of SGD, for a non-convex cost function and a degenerate diffusion matrix, that do not allow to use the standard approaches, and require new techniques. For this purpose, we exploit two different methods: duality and entropy methods. We provide new results about the dynamics and effectiveness of SGD, offering a deep connection between stochastic optimization and PDE theory, and some answers and insights to basic questions in the Machine Learning processes: How long does SGD take to escape from a bad minimum? Do neural network parameters converge using SGD? How do parameters evolve in the first stage of training with SGD?

2605.29151 2026-06-12 math.AG cs.AI cs.NE 版本更新

Real-rootedness of the Poincaré polynomials of $\overline{\mathcal M}_{0,n}$: an AI-assisted proof

Poincaré多项式的实根性:一个AI辅助的证明

Gergely Bérczi, Young-Hoon Kiem

AI总结 通过引入双变量变形揭示隐藏的交错结构,证明了稳定有理曲线模空间Poincaré多项式的实根性,并进一步推广到Fulton-MacPherson空间。

详情
Comments
16 pages
AI中文摘要

我们证明了Deligne-Mumford模空间$\overline{\mathcal M}_{0,n}$(稳定$n$点有理曲线)的Poincaré多项式\[ P_n(t)=\sum_{i=0}^{n-3} \dim H^{2i}(\overline{\mathcal M}_{0,n};\mathbb{Q})t^i \]的实根性,证实了Aluffi-Chen-Marcolli的猜想。证明从Keel-Manin-Getzler递推开始,但其主要新思想是Poincaré多项式的双变量变形$F_m(y,t)$。这种变形揭示了单变量递推中不可见的隐藏交错结构。对于固定的$t<0$,$F_m$在$y$方向上的零点集由区间$0<y<1-t$上的Sturm-Rolle论证控制。原始多项式在切片$y=1$上恢复,移动根通过该切片的有序交叉同时给出了实根性和严格交错。因此,$\overline{\mathcal M}_{0,n}$的Betti数构成一个超对数凹序列。 我们进一步证明了Fulton-MacPherson空间$\mathbb{P}^1[n]$(复射影线退化中$n$个有序点)的Poincaré多项式的实根性和超对数凹性。 $\overline{\mathcal M}_{0,n}$的证明是通过与Co-Mathematician(Google DeepMind开发的智能体前沿模型系统)的迭代AI辅助工作流程获得的。人类的角色是提出问题、评估连续尝试、请求修复漏洞、将逐步发展的论证与文献进行比较,并组装最终可人工验证的证明。我们额外的人类贡献是观察到类似的残差变形策略适用于Fulton-MacPherson空间$\mathbb P^1[n]$,从而得到相应的实根性定理。

英文摘要

We prove real-rootedness for the Poincaré polynomial \[ P_n(t)=\sum_{i=0}^{n-3} \dim H^{2i}(\overline{\mathcal M}_{0,n};\mathbb{Q})t^i \] of the Deligne--Mumford moduli space $\overline{\mathcal M}_{0,n}$ of stable $n$-pointed rational curves, proving a conjecture of Aluffi--Chen--Marcolli. The proof starts from the Keel--Manin--Getzler recurrence, but its main new idea is a bivariate deformation $F_m(y,t)$ of the Poincaré polynomial. This deformation reveals a hidden interlacing structure not visible in the one-variable recurrence. For fixed $t<0$, the zero set of $F_m$ in the $y$-direction is controlled by a Sturm--Rolle argument on the interval $0<y<1-t$. The original polynomial is recovered on the slice $y=1$, and the ordered crossings of the moving roots through this slice give both real-rootedness and strict interlacing. Consequently, the Betti numbers of $\overline{\mathcal M}_{0,n}$ form an ultra-log-concave sequence. We further prove real-rootedness and ultra-log-concavity for the Poincaré polynomial of the Fulton--MacPherson space $\mathbb{P}^1[n]$ of $n$ ordered points in degenerations of the complex projective line. The proof for $\overline{\mathcal M}_{0,n}$ was obtained through an iterative AI-assisted workflow with Co-Mathematician, an agentic frontier-model system developed by Google DeepMind. Our role was to formulate the problem, evaluate the proposed proof attempts, identify gaps and request corrections, compare the developing argument with the literature, and refine the presentation of the final proof. Our additional human contribution was to observe that a similar residual deformation strategy applies to the Fulton--MacPherson spaces $\mathbb P^1[n]$, yielding the corresponding real-rootedness theorem.

2605.17062 2026-06-12 cs.CR cs.LG cs.SE 版本更新

The Range Shrinks, the Threat Remains: Re-evaluating LLM Package Hallucinations on the 2026 Frontier-Model Cohort

范围缩小,威胁依旧:重新评估2026前沿模型队列上的LLM包幻觉

Aleksandr Churilov

AI总结 本文重新评估了2026前沿模型队列上大型语言模型(LLM)的包幻觉现象,发现尽管幻觉率有所降低,但仍然存在威胁,识别出一组127个包名(109个在PyPI,18个在npm)被所有评估模型一致生成,构成一个跨模型的供应链攻击面,同时发现Python与JavaScript幻觉的不对称性以及DeepSeek V3.2和GPT-5.4-mini之间的高相似性。

详情
Comments
13 pages, 3 figures, 4 tables. v2: incorporates coordinated-disclosure feedback from PyPI Security and Socket.dev; registrable attack surface refined to 53 names (41 PyPI, 12 npm). Headline rates unchanged. Replication of Spracklen et al. (USENIX Security 2025). Data and code: https://github.com/churik5/slopsquatting-replication-2026 and https://doi.org/10.5281/zenodo.19859120
AI中文摘要

Spracklen等人(USENIX Security '25)表明,生成代码的大型语言模型会以5.2%至21.7%的比率生成不存在于PyPI或npm上的包名,从而为slopsquatting攻击(恶意包的注册)提供了攻击面。我们在这五款2025年10月至2026年3月期间发布的前沿代码能力LLM上重复了他们的方法:Claude Sonnet 4.6、Claude Haiku 4.5、GPT-5.4-mini、Gemini 2.5 Pro和DeepSeek V3.2。在199,845个经过PyPI和npm主列表验证的Python和JavaScript提示对中,我们测量到幻觉率在4.62%(Claude Haiku 4.5)到6.10%(GPT-5.4-mini)之间——比Spracklen观察到的模型间差异缩小了一个数量级,但威胁并未消失。除了重复研究外,我们识别出一组127个包名(109个在PyPI,18个在npm)被所有评估模型一致生成,构成一个跨模型的供应链攻击面,无法由单一模型研究揭示。我们进一步记录了Python与JavaScript幻觉的不对称性,推翻了Spracklen 2024年的发现,识别出Anthropic家族中的Haiku低于Sonnet的倒置现象,并观察到DeepSeek V3.2和GPT-5.4-mini之间的Jaccard相似性峰值(J=0.343),暗示共享的训练数据起源。

英文摘要

Spracklen et al. (USENIX Security '25) showed that code-generating large language models hallucinate package names that do not exist on PyPI or npm at rates ranging from 5.2% on commercial models to 21.7% on open-source models, creating an attack surface for slopsquatting -- the registration of malicious packages under hallucinated names. We replicate their methodology on five frontier code-capable LLMs released between October 2025 and March 2026: Claude Sonnet 4.6, Claude Haiku 4.5, GPT-5.4-mini, Gemini 2.5 Pro, and DeepSeek V3.2. Across 199,845 paired Python and JavaScript prompts validated against PyPI and npm master lists, we measure overall hallucination rates between 4.62% (Claude Haiku 4.5) and 6.10% (GPT-5.4-mini) -- an order-of-magnitude compression of the inter-model spread observed by Spracklen, but not a retirement of the threat. Beyond replication, we identify a set of 127 package names (109 on PyPI, 18 on npm) that all five evaluated models invent identically; following coordinated disclosure with PyPI Security and Socket.dev, 53 of these (41 on PyPI, 12 on npm) remain registrable by an attacker after each registry's existing defenses, constituting a model-agnostic supply-chain attack surface that no single-model study can reveal. We further document a Python-over-JavaScript hallucination asymmetry that inverts Spracklen's 2024 finding, identify a Haiku-below-Sonnet inversion within the Anthropic family, and observe a Jaccard-similarity peak between DeepSeek V3.2 and GPT-5.4-mini (J = 0.343) suggestive of shared training-data origins.

2603.02274 2026-06-12 q-bio.QM cs.AI 版本更新

Contextual Invertible World Models: A Neuro-Symbolic Agentic Framework for Colorectal Cancer Drug Response

上下文可逆世界模型:用于结直肠癌药物反应的神经符号智能框架

Christopher Baker, Tianyu Ren, Karen Rafferty, Hui Wang

AI总结 提出上下文可逆世界模型(CIWM),结合机器学习模拟器与大语言模型推理层,通过逆推理进行CRISPR扰动,揭示KRAS突变在5-氟尿嘧啶耐药中的主导作用及PIK3CA修复的意外效应。

详情
AI中文摘要

精准肿瘤学目前受到小N大P悖论的限制,即高维基因组数据丰富但药理学反应样本稀疏。虽然深度学习实现了预测准确性,但它常常无法提供临床采用所需的机制清晰度。我们提出了上下文可逆世界模型(CIWM),这是一个神经符号智能框架,通过将定量机器学习模拟器与大语言模型推理层集成来弥合这一差距。利用在Sanger GDSC数据集(\\( N=83 \\))上严格筛选的高保真数据工程流程,我们从体外伪影中分离出真正的生物信号,为复杂转录组学建立了严格的基线预测相关性(\\( r=0.268 \\))。通过逆推理,我们在结直肠癌景观中进行了计算机CRISPR扰动。该框架自主推翻了经典机制假设,识别出突变KRAS在驱动5-氟尿嘧啶耐药(\\( \Delta=-0.0469 \\))中相对于APC/Wnt轴具有层级优势,并通过映射到MAPK/PI3K网络的“KRAS盾牌”实现。此外,智能层识别出“PIK3CA悖论”,揭示修复PIK3CA通过触发补偿性反馈环过度激活主导的MAPK生存通路,无意中增加了化疗耐药性(\\( \Delta=+0.0085 \\))。

英文摘要

Precision oncology is currently limited by the small-N, large-P paradox, where high-dimensional genomic data is abundant but pharmacological response samples are sparse. While deep learning achieves predictive accuracy, it frequently fails to provide the mechanistic clarity required for clinical adoption. We present the Contextual Invertible World Model (CIWM), a Neuro-Symbolic Agentic Framework that bridges this gap by integrating a quantitative machine learning emulator with a Large Language Model reasoning layer. Utilising a stringently curated, high-fidelity data engineering pipeline on the Sanger GDSC dataset (\( N=83 \)), we isolate true biological signals from in vitro artifacts to establish a rigorous baseline predictive correlation for complex transcriptomics (\( r=0.268 \)). Through Inverse Reasoning, we perform in silico CRISPR perturbations across the colorectal landscape. The framework autonomously overturns classical mechanistic assumptions, identifying a hierarchical dominance of mutant KRAS over the APC/Wnt-axis in driving 5-fluorouracil resistance (\( Δ=-0.0469 \)) via a "KRAS Shield" mapped to MAPK/PI3K networks. Furthermore, the agentic layer identified a "PIK3CA Paradox", revealing that repairing PIK3CA inadvertently increases chemoresistance (\( Δ=+0.0085 \)) by triggering a compensatory feedback loop that hyperactivates the dominant MAPK survival pathway.

2603.24603 2026-06-12 q-bio.NC cs.AI 版本更新

Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders

融合动态功能连接:结合fMRI信号的幅度和相位识别脑疾病

Jinlong Hu, Jiatong Huang, Zijian Cai

AI总结 提出多尺度融合学习框架MSFL,结合滑动窗口相关和相位同步两种互补的动态功能连接特征,在自闭症和抑郁症数据集上显著优于现有模型。

详情
AI中文摘要

基于静息态功能磁共振成像(fMRI)的动态功能连接(dFC)已广泛应用于脑科学研究。滑动窗口相关(SWC)方法通过计算脑区对信号幅度时间序列之间的相关系数,是构建dFC的常用方法。在本研究中,我们提出了一种集成方法,结合fMRI信号的幅度和相位信息,以提高脑疾病的检测能力。具体而言,我们引入了一个多尺度融合学习框架MSFL,该框架利用来自SWC和相位同步(PS)的两种互补dFC特征。其中,SWC捕获幅度相关性,而PS测量dFC内的相位相干性。我们使用两个公开数据集(ABIDE I和REST-meta-MDD)评估了MSFL在分类自闭症谱系障碍和重度抑郁症方面的有效性。结果表明,MSFL显著优于现有比较模型。此外,我们使用SHAP框架进行了模型解释分析,表明来自SWC和PS的两种dFC特征均有助于检测脑疾病。

英文摘要

Dynamic functional connectivity (dFC) derived from resting-state functional magnetic resonance imaging (fMRI) has been extensively utilized in brain science research. The sliding window correlation (SWC) method is a widely used approach for constructing dFC by computing correlation coefficients between amplitude time series of signals from pairs of brain regions. In this study, we propose an integrated approach that incorporates both amplitude and phase information of fMRI signals to improve the detection of brain disorders. Specifically, we introduce a multi-scale fusion learning framework, namely MSFL, which leverages two complementary dFC features derived from SWC and phase synchronization (PS). Here, SWC captures amplitude correlations, while PS measures phase coherence within dFC. We evaluated the efficacy of MSFL in classifying autism spectrum disorder and major depressive disorder using two publicly available datasets: ABIDE I and REST-meta-MDD, respectively. The results indicate that MSFL significantly outperforms existing comparative models. Moreover, we performed model explanation analysis using the SHAP framework, which showed that both types of dFC features from SWC and PS contribute to detecting brain disorders.

2602.04075 2026-06-12 cond-mat.mtrl-sci cs.LG 版本更新

Thermodynamic assessment of machine learning models for solid-state synthesis prediction

固态合成预测机器学习模型的热力学评估

Jane Schlesinger, Simon Hjaltason, Nathan J. Szymanski, Christopher J. Bartel

AI总结 评估了机器学习模型预测固态材料合成可行性的热力学一致性,发现模型普遍高估合成可能性,但部分分数与热力学启发式趋势一致。

详情
AI中文摘要

机器学习模型最近被用于预测假设的固态材料是否可合成。这些模型旨在绕过固态相变的直接第一性原理建模,而是从成功合成材料的大型数据库中学习。在这里,我们评估了几个最近引入的合成预测模型与材料和反应热力学的对齐程度,通过相对于凸包的能量和考虑枚举合成反应的热力学选择性的度量来量化。使用成功合成配方的数据集确定了这两个量的可能界限,超出该界限的材料被认为不太可能被合成。以这些界限为背景,使用CHGNet基础势计算了通过Chemeleon生成模型生成的数千种新假设材料的热力学量。将四个最近发表的用于可合成性预测的机器学习模型应用于同一数据集,并将所得预测与计算的热力学进行比较。我们发现这些模型普遍高估了合成的可能性,但一些模型分数确实与热力学启发式趋势一致,对稳定性较差或没有计算为热力学选择性的可用合成配方的材料分配较低的分数。总的来说,这项工作识别了机器学习模型在材料合成中存在的差距,并引入了一种在缺乏大量负例(失败合成)的情况下评估其质量的新方法。

英文摘要

Machine learning models have recently emerged to predict whether hypothetical solid-state materials can be synthesized. These models aim to circumvent direct first-principles modeling of solid-state phase transformations, instead learning from large databases of successfully synthesized materials. Here, we assess the alignment of several recently introduced synthesis prediction models with material and reaction thermodynamics, quantified by the energy with respect to the convex hull and a metric accounting for thermodynamic selectivity of enumerated synthesis reactions. A dataset of successful synthesis recipes was used to determine the likely bounds on both quantities beyond which materials can be deemed unlikely to be synthesized. With these bounds as context, thermodynamic quantities were computed using the CHGNet foundation potential for thousands of new hypothetical materials generated using the Chemeleon generative model. Four recently published machine learning models for synthesizability prediction were applied to this same dataset, and the resultant predictions were considered against computed thermodynamics. We find these models generally overpredict the likelihood of synthesis, but some model scores do trend with thermodynamic heuristics, assigning lower scores to materials that are less stable or do not have an available synthesis recipe that is calculated to be thermodynamically selective. In total, this work identifies existing gaps in machine learning models for materials synthesis and introduces a new approach to assess their quality in the absence of extensive negative examples (failed syntheses).

2410.00903 2026-06-12 stat.AP cs.CL cs.LG 版本更新

Causal Inference with Generative Artificial Intelligence: Application to Texts as Treatments

基于生成式人工智能的因果推断:以文本作为处理变量

Kosuke Imai, Kentaro Nakamura

AI总结 提出利用生成式AI(如大语言模型)生成处理变量并利用其内部表示进行因果效应估计,避免从数据中学习因果表示,提高估计准确性和效率。

详情
AI中文摘要

在本文中,我们展示了如何利用生成式人工智能(GenAI)的力量,增强以文本等高维非结构化数据作为处理变量时的因果推断有效性。具体而言,我们提出使用深度生成模型(如大语言模型,LLMs)高效地生成处理变量,并利用其内部表示进行后续的因果效应估计。我们表明,了解这种真实内部表示有助于将感兴趣的处理特征(如特定情感和某些主题)与其他可能未知的混淆特征分离开来。与现有方法不同,所提出的GenAI驱动推断(GPI)方法无需从数据中学习因果表示,因此能产生更准确和高效的估计。我们正式建立了非参数识别平均处理效应所需的条件,提出了一种避免重叠假设违反的估计策略,并通过应用双重机器学习推导了所提出估计量的渐近性质。最后,利用工具变量方法,我们将所提出的GPI方法扩展到处理特征基于人类感知的场景。GPI也适用于文本复用,即使用LLM重新生成现有文本。我们进行了模拟和实证研究,使用开源LLM Llama 3生成的文本数据,展示了我们的估计器相对于最先进的因果表示学习算法的优势。

英文摘要

In this paper, we demonstrate how to enhance the validity of causal inference with unstructured high-dimensional treatments like texts, by leveraging the power of generative Artificial Intelligence (GenAI). Specifically, we propose to use a deep generative model such as large language models (LLMs) to efficiently generate treatments and use their internal representation for subsequent causal effect estimation. We show that the knowledge of this true internal representation helps disentangle the treatment features of interest, such as specific sentiments and certain topics, from other possibly unknown confounding features. Unlike existing methods, the proposed GenAI-Powered Inference (GPI) methodology eliminates the need to learn causal representation from the data, and hence produces more accurate and efficient estimates. We formally establish the conditions required for the nonparametric identification of the average treatment effect, propose an estimation strategy that avoids the violation of the overlap assumption, and derive the asymptotic properties of the proposed estimator through the application of double machine learning. Finally, using an instrumental variables approach, we extend the proposed GPI methodology to the settings in which the treatment feature is based on human perception. The GPI is also applicable to text reuse where an LLM is used to regenerate existing texts. We conduct simulation and empirical studies, using the generated text data from an open-source LLM, Llama 3, to illustrate the advantages of our estimator over state-of-the-art causal representation learning algorithms.

2508.02548 2026-06-12 cs.DB cs.AI 版本更新

The KG-ER Conceptual Schema Language

KG-ER概念模式语言

Enrico Franconi, Benoît Groz, Jan Hidders, Nina Pardal, Sławek Staworko, Jan Van den Bussche, Piotr Wieczorek

AI总结 提出KG-ER概念模式语言,独立于知识图谱的表示方式描述其结构,并帮助捕获语义。

详情
Comments
Published in Proceedings of IRIS-AI (https://iris-ai.org)
AI中文摘要

我们提出KG-ER,一种用于知识图谱的概念模式语言,它独立于知识图谱的表示方式(关系数据库、属性图、RDF)描述其结构,同时有助于捕获知识图谱中存储信息的语义。

英文摘要

We propose KG-ER, a conceptual schema language for knowledge graphs that describes the structure of knowledge graphs independently of their representation (relational databases, property graphs, RDF) while helping to capture the semantics of the information stored in a knowledge graph.

2508.21531 2026-06-12 stat.ML cs.LG stat.CO 版本更新

Adaptive generative moment matching networks for improved learning of dependence structures

自适应生成矩匹配网络用于改进依赖结构学习

Marius Hofert, Gan Yao

AI总结 提出自适应带宽选择的最大均值差异混合核用于生成矩匹配网络,通过增加核数量和早停策略提升训练性能,在copula随机数生成、高维收敛率及金融数据依赖建模中优于传统方法。

详情
AI中文摘要

引入了一种用于最大均值差异(MMD)中混合核的自适应带宽选择程序,以拟合生成矩匹配网络(GMMNs),并展示了copula随机数生成器的改进学习。基于训练损失的相对误差,在训练过程中增加核的数量;此外,验证损失的相对误差被用作早停标准。虽然训练时间保持相似,但自适应训练GMMNs(AGMMNs)显著提高了训练性能,这通过验证MMD轨迹、样本和验证MMD值得以展示。在三个应用中,AGMMNs相比GMMNs和参数copula模型也表现出优越性。首先,首次在高达100维的维度中研究了基于copula的准随机与伪随机样本的估计量收敛速度。其次,重复的验证MMD以及蒙特卡洛和准蒙特卡洛应用证明了AGMMNs在去GARCH化后的标普500指数50个成分所隐含的copula模型上的改进训练。最后,后一个数据集和富时100指数的50个成分被用于证明AGMMNs的改进训练确实转化为改进的模型预测。

英文摘要

An adaptive bandwidth selection procedure for the mixture kernel in the maximum mean discrepancy (MMD) for fitting generative moment matching networks (GMMNs) is introduced, and improved learning of copula random number generators is demonstrated. Based on the relative error of the training loss, the number of kernels is increased during training; additionally, the relative error of the validation loss is used as an early stopping criterion. While training time remains similar, adaptively training GMMNs (AGMMNs) significantly increases training performance, which is shown based on validation MMD trajectories, samples and validation MMD values. Superiority of AGMMNs over GMMNs and parametric copula models is also demonstrated in terms of three applications. First, convergence rates of estimators based on quasi-random versus pseudo-random samples from copulas are investigated in dimensions as large as 100 for the first time. Second, replicated validation MMDs, as well as Monte Carlo and quasi-Monte Carlo applications demonstrate the improved training of AGMMNs for a copula model implied by the 50 constituents of the S&P 500 index after deGARCHing. Last, both the latter dataset and 50 constituents of the FTSE 100 are used to demonstrate that the improved training of AGMMNs indeed translates to an improved model prediction.

2605.04319 2026-06-12 math.CO math.AC 版本更新

Non-external Proofs of Lagrange Inversion Formula

拉格朗日反演公式的非外部证明

Dominik Beck, Piotr Maćkowiak

AI总结 本文给出形式幂级数拉格朗日反演公式的两个简单证明,仅使用形式幂级数分析的基本工具,不依赖外部概念。

详情
Comments
Introduced small corrections in abstract, introduction and proof of Theorem 1.1; a reference added
AI中文摘要

本文的目标是给出形式幂级数拉格朗日反演公式的两个简单证明。这两个证明都是非外部的,即它们使用的概念不超出形式幂级数分析基本工具的范围。

英文摘要

The goal of the paper is to present two simple proofs of the Lagrange Inversion Formula for formal power series. Both proofs are non-external in the sense that they use concepts that do not go beyond the scope of basic tools of formal power series analysis.

2605.03843 2026-06-12 physics.flu-dyn 版本更新

Evolution of passive scalar mixing layers in stratified and unstratified homogeneous turbulence

分层与未分层均匀湍流中被动标量混合层的演化

Stephen M. de Bruyn Kops, Peter N. Blossey, James J. Riley

AI总结 通过高分辨率大涡模拟研究稳定分层湍流中被动标量的混合,发现横向混合在分层与未分层情况下相似但分层时稍快,而垂向混合因分层抑制大尺度搅拌而几乎停止。

详情
Comments
22 pages, 15 figures
AI中文摘要

利用高分辨率大涡模拟衰减的分层和未分层均匀湍流,研究稳定分层流中被动标量的混合。两个被动标量混合层,一个在垂直方向,另一个在横向方向,模拟了相对于速度长度尺度非常大的羽流。在横向方向,被动标量的演化在分层和未分层情况下大致相似,尽管分层时扩散稍快。此外,分层情况下标量涨落强度更高,湍流/非湍流界面更间歇。但在垂直方向,分层情况几乎没有混合,因为分层阻止了大尺度搅拌。初始时,分层被动层增长,直到其宽度与水平速度的垂直积分长度成正比,而该积分长度本身受约束以保持垂直弗劳德数阶为一。在此早期增长之后,被动标量几乎没有额外扩散。如果平均剖面已知,则使用单常数模型可有效模拟横向方向的分层标量通量;如果必须假设剖面形状,则使用双常数模型。在后一种情况下,仅当标量与速度场处于准平衡状态,使得标量的长度尺度可从动能缩放时,模型才有效。本研究中,主动和被动标量的普朗特数为0.7。预计由更高普朗特数产生的反向浮力通量将影响被动标量混合。

英文摘要

High-resolution large-eddy simulations of decaying stratified and unstratified homogeneous turbulence are used to understand the mixing of passive scalars in stably stratified flows. Two passive scalar mixing layers, one in the vertical direction and the other in the transverse direction, are a model for a plume that is very large relative to the length scale of the velocity. In the transverse direction, the evolution of the passive scalar is broadly similar in the stratified and unstratified cases, although it does spread slightly faster when stratified. Also, the intensity of the scalar fluctuations is higher in the stratified case, and the turbulent/non-turbulent interface is more intermittent. In the vertical direction, though, the stratified case has almost no mixing because the stratification prevents large-scale stirring. Initially, the stratified passive layer grows until its width is proportional to the vertical integral length of the horizontal velocity, which is itself constrained to maintain the vertical Froude number order one. After this early growth, there is little additional spreading of the passive scalar. Modelling of the stratified scalar flux in the transverse direction is done effectively with a one-constant model if the mean profile is known, and a two-constant model if the profile shape must be assumed. In the latter case, the model is good only if the scalar is in quasi-equilibrium with the velocity field such that the length scale of the scalar can be scaled from the kinetic energy. In this study, the Prandtl number of the active and passive scalars is 0.7. It is anticipated that the reverse buoyancy flux resulting from higher Prandtl numbers will affect the passive scalar mixing.

2605.03747 2026-06-12 physics.app-ph 版本更新

Generalized Virtual-Wave Theory for Photothermal Coherence Tomography under Arbitrary Excitation Toward Non-Contact Industrial Inspection of Composite Materials

面向复合材料非接触工业检测的光热相干层析成像广义虚拟波理论

Pengfei Zhu, Julien Lecompagnon, Philipp Daniel Hirsch, Mathias Ziegler

AI总结 提出广义虚拟波光热层析框架,将扩散-波变换推广到任意边界激励,通过ADMM或截断SVD求解逆问题,实现热扩散场到波场的转换,提升复合材料缺陷检测的深度定位和层析重建质量。

详情
AI中文摘要

光热成像是一种用于复合材料亚表面检测的强大非接触、无损技术,但其性能从根本上受限于热扩散的扩散性和不可逆性,导致严重的图像模糊和模糊的深度解释。虚拟波的概念通过将扩散场与传播波场联系起来,提供了一条克服这一限制的途径,但现有方法主要局限于理想的脉冲激励。本文提出了一种广义虚拟波光热层析成像框架,将扩散-波变换扩展到任意边界激励,包括脉冲、谐波和啁啾波形。从带有一般源项的热方程出发,我们推导了测量扩散场与由波动方程控制的虚拟波场之间的Fredholm积分映射,明确地施加了因果性和热力学不可逆性。由此产生的病态逆问题根据激励特性使用ADMM或截断SVD求解。数值和实验结果表明,所提出的方法将模糊的热响应转换为具有清晰波前和反射的波状场,从而改善了深度定位和层析重建。在带有嵌入缺陷的碳纤维增强聚合物样品上的实验显示,与常规热成像技术相比,具有增强的对比度、更清晰的边界和更可靠的深度解释。这项工作为实际激励条件下基于波的光热层析成像建立了一个统一且物理基础的框架。

英文摘要

Photothermal imaging is a powerful noncontact and nondestructive technique for subsurface inspection of composite materials, yet its performance is fundamentally limited by the diffusive and irreversible nature of heat transport, leading to severe image blurring and ambiguous depth interpretation. The concept of virtual waves provides a route to overcome this limitation by linking diffusion fields to propagating wave fields, but existing approaches are largely restricted to idealized impulsive excitation. Here, we propose a generalized virtual-wave photothermal tomography framework that extends the diffusion-to-wave transformation to arbitrary boundary excitations, including pulsed, harmonic, and chirped waveforms. Starting from the heat equation with a general source term, we derive a Fredholm integral mapping between the measured diffusion field and a virtual wave field governed by a wave equation, explicitly enforcing causality and thermodynamic irreversibility. The resulting ill-posed inverse problem is solved using ADMM or truncated SVD, depending on the excitation characteristics. Numerical and experimental results demonstrate that the proposed method converts blurred thermal responses into wave-like fields with clear wavefronts and reflections, enabling improved depth localization and tomographic reconstruction. Experiments on carbon fiber reinforced polymer samples with embedded defects show enhanced contrast, sharper boundaries, and more reliable depth interpretation compared with conventional thermographic techniques. This work establishes a unified and physically grounded framework for wave-based photothermal tomography under realistic excitation conditions.

2605.02926 2026-06-12 physics.soc-ph physics.space-ph quant-ph 版本更新

Towards Geostrategic Critical Minerals and Materials Resilience: Secure Supply-Chain and Criticality Analyses for Quantum Technologies in Arctic and Space Environments

面向地缘战略关键矿产与材料韧性:北极与太空环境中量子技术的安全供应链与关键性分析

Min-Ha Lee, Alan J. Hurd, Jolante Wieke Van Wijk, Mauritz Kop

AI总结 本文针对极端环境中量子技术的供应链安全与关键性风险,提出可复现的“关键级别I”筛选方法,以铌和超导纳米线单光子探测器为例,分析上游关键矿产与材料对下游系统性能的影响,并建议建立量子关键性与关键矿产仪表盘作为决策支持工具。

详情
AI中文摘要

本文绘制了在极端环境中部署的量子技术的安全供应和关键性风险,将上游关键矿产与材料(CMMs)与下游系统性能、安全连续性和任务保障联系起来。它采用可复现的“关键级别I”筛选方法,识别那些供应集中度、必要性和有限可缓解性可能为量子部署造成瓶颈的材料。分析围绕两个用例展开:(i)铌作为超导量子计算及相关制造和工具链依赖的关键输入;(ii)太空级超导纳米线单光子探测器(SNSPDs),以及相邻的单光子探测器平台如SPADs,其中辐射、热循环、振动和电磁干扰可能降低器件指标,并在通信环境中威胁安全连续性。本文进一步将这些依赖关系置于美中在关键材料、精炼能力、出口管制和海外矿产收购方面的战略竞争中,同时将其与标准优先治理、后量子密码迁移以及量子网络的新兴安全逻辑联系起来。它认为静态的国家关键矿产清单不足以满足任务相关的量子技术需求,并提出一个专用的量子关键性与关键矿产(QCCM)仪表盘,作为跟踪量子平台间集中度、可替代性、资格认证瓶颈、库存缺口和地缘政治压力信号的动态决策支持工具。本文最后讨论了替代、多样化、储备、屏蔽、资格认证设计和标准对齐治理的含义,以支持安全、持续和任务相关的量子部署。

英文摘要

This manuscript maps secure-supply and criticality risks for quantum technologies deployed in extreme environments, linking upstream critical minerals and materials (CMMs) to downstream system performance, continuity of security, and mission assurance. It adopts a reproducible "Critical Level I" screening method to identify materials whose supply concentration, essentiality, and limited mitigatability can create bottlenecks for quantum deployment. The analysis is structured around two use cases: (i) niobium as a key input for superconducting quantum computing and related manufacturing and toolchain dependencies; and (ii) space-qualified superconducting nanowire single-photon detectors (SNSPDs), alongside adjacent single-photon detector platforms such as SPADs, where radiation, thermal cycling, vibration, and electromagnetic interference can degrade device metrics and, in communications settings, threaten continuity of security. The manuscript further situates these dependencies within U.S.-China strategic competition over critical materials, refining capacity, export controls, and overseas mineral acquisitions, while also connecting them to standards-first governance, post-quantum cryptography migration, and the emerging security logic of quantum networking. It argues that static national critical-minerals lists are insufficient for mission-relevant quantum technology and proposes a dedicated Quantum Criticality and Critical Minerals (QCCM) dashboard as a living decision-support tool for tracking concentration, substitutability, qualification bottlenecks, stockpiling gaps, and geopolitical stress signals across quantum platforms. The paper concludes with implications for substitution, diversification, stockpiling, shielding, qualification-by-design, and standards-aligned governance to support secure, sustained, and mission-relevant quantum deployment.

2605.02813 2026-06-12 gr-qc hep-th 版本更新

Derivation of the Smarr formula from the Komar charge in Einstein-nonlinear electrodynamics theories and applications to regular black holes

从爱因斯坦-非线性电动力学中的Komar荷推导Smarr公式及其在规则黑洞中的应用

Gabriele Barbagallo, Tomás Ortín

AI总结 通过将耦合常数提升为动力学场,构造广义Komar荷,推导出包含耦合常数贡献的Smarr公式,并应用于规则Bardeen黑洞的热力学分析。

详情
Comments
Latex file, 51 pages, 4 figures. Some references added and typos corrected
AI中文摘要

我们构造了四维中与爱因斯坦引力耦合的通用非线性电动力学(NLED)理论的广义Komar荷。通过将所有这些理论中存在的有量纲耦合常数提升为一个动力学场(由拉格朗日乘子强制其在壳上为常数),得到了该耦合常数的贡献。利用该荷,我们推导了这些理论中渐近平坦黑洞和孤立子解的Smarr公式,其中包含了耦合常数的贡献。此前,这一贡献是通过齐次性论证得到的。我们在广泛的爱因斯坦-NLED理论类别上检验了结果,并利用广义Komar荷的守恒详细分析了规则Bardeen黑洞的热力学,以理解事件视界内部规则黑洞的规则性。

英文摘要

We construct the generalized Komar charge of generic, non-linear theories of electrodynamics (NLED) in 4 dimensions coupled to Einstein gravity. The contribution of the dimensionful coupling constant present in all these theories is obtained by promoting it to a dynamical field which is forced to be constant on-shell by a Lagrange multiplier. We use this charge to derive a Smarr formula for asymptotically-flat black-hole and soliton solutions of these theories that includes the contribution of the coupling constant. Previously, this contribution had been found using homogeneity arguments. We test our results on a broad class of Einstein--NLED theories and analyze in detail the thermodynamics of the regular Bardeen black hole using the conservation of the generalized Komar charge to understand the regularity of regular black holes inside the event horizon.

2605.02891 2026-06-12 cond-mat.str-el cond-mat.supr-con 版本更新

Interlayer Five-Spin Polaron in Superconducting Bilayer Nickelates

超导双层镍酸盐中的层间五自旋极化子

Jiarui Li, Christopher T. Parzyck, Eder G. Lomeli, Yidi Liu, Taehun Kim, Heemin Lee, Zengqing Zhuo, Eun Kyo Ko, Yaoju Tarn, Cheng-Tai Kuo, Ronny Sutarto, Chunjing Jia, Vivek Thampy, Jonathan Pelliciari, Wanli Yang, Brian Moritz, Yijun Yu, Jun-Sik Lee, Valentina Bisogni, Thomas P. Devereaux, Harold Y. Hwang, Wei-Sheng Lee

AI总结 通过共振X射线散射研究超导双层镍酸盐薄膜,发现超导发生在无自旋密度波(SDW)的氧化学计量区域,氧空位促进SDW,表明两者相分离;结合光谱与理论,提出配体空穴主要位于层间顶角氧,形成稳定的层间五自旋极化子态作为超导基态。

详情
AI中文摘要

Ruddlesden-Popper镍酸盐中高$T_c$超导电性的发现激发了大量研究,旨在理解超越传统铜酸盐$d^9$构型基态的非传统电子态。理解磁性基态与多轨道物理之间的相互作用是建立超导微观机制的关键。在双层镍酸盐中,自旋密度波(SDW)序是非超导区域的一个显著特征,但其与超导配对的关联仍是一个未解问题。这里,我们使用共振X射线散射研究超导双层镍酸盐薄膜La$_2$PrNi$_2$O$_7$(LPNO)中SDW序的存在。通过比较超导和缺氧LPNO薄膜,我们发现超导发生在无SDW的氧化学计量区域,而氧空位促进SDW序,表明SDW与超导发生相分离。此外,Ni-$L_3$和O-$K$边光谱揭示了两个区域之间不同的电子结构——特别是沿$c$轴方向。我们的结果确定了氧化学计量是控制层间耦合从而控制双层镍酸盐电子结构的关键参数。结合理论,我们提出配体空穴主要位于双层间顶角氧上,形成稳定的层间五自旋极化子态,该态作为超导双层镍酸盐的基态。

英文摘要

The discovery of high-$T_c$ superconductivity in Ruddlesden-Popper nickelates has sparked substantial effort towards understanding unconventional electronic states beyond a traditional cuprate-like $d^9$ configurational ground state. An understanding of the interplay between magnetic ground states and multi-orbital physics is key for establishing a microscopic mechanism for superconductivity. In the bilayer nickelates, spin density wave (SDW) order is a prominent feature in the non-superconducting regime, yet its relation to superconducting pairing remains an open question. Here, we use resonant x-ray scattering to examine the existence of SDW order in superconducting bilayer nickelate thin films La$_2$PrNi$_2$O$_7$ (LPNO). Comparing superconducting and oxygen-deficient LPNO thin films, we find that superconductivity occurs in SDW-free, oxygen-stoichiometric regions, whereas oxygen-deficiency promotes SDW order, indicating phase segregation of SDW and superconductivity. Furthermore, Ni-$L_3$ and O-$K$ edge spectroscopy reveals distinct electronic structures - particularly along the $c$-axis - between the two regions. Our results identify oxygen stoichiometry as a key parameter controlling interlayer coupling and thus the electronic structure of bilayer nickelates. In concert with theory, we propose that a ligand hole primarily resides at the inter-bilayer apical oxygen, forming a robust interlayer five-spin polaron state, which serves as the ground state for superconducting bilayer nickelates.

2605.03128 2026-06-12 astro-ph.SR astro-ph.GA astro-ph.HE 版本更新

Double Neutron Star Delay Times Across Cosmic Metallicities: The Role of Helium Star Progenitors

双中子星在不同宇宙金属丰度下的延迟时间:氦星前身星的作用

Abhishek Chattaraj, Jeff J. Andrews, Max Briel, Tassos Fragos, Seth Gossage, Vicky Kalogera, Philipp M. Srivastava, Elizabeth Teng

AI总结 本文通过研究金属丰度对氦星-中子星前身系统演化的影响,利用POSYDON代码进行星族合成,揭示了双中子星延迟时间分布随金属丰度的变化,并解释了短伽马射线暴和r过程元素富集。

详情
Comments
Accepted in ApJ
AI中文摘要

金属丰度通过影响恒星内部不透明度和星风质量损失,在大质量双星演化中起着重要作用。本文研究了双中子星(DNS)延迟时间分布(DTD)如何受氦星-中子星前身系统的金属丰度依赖演化影响。基于单星和双星物理的见解,我们论证了在给定金属丰度下,氦主序期间的恒星半径设定了DNS诞生时轨道大小的下限。然后,我们使用详细的双星演化代码POSYDON进行星族合成,以说明不同金属丰度下的DTD。我们的结果表明,无论公共包层效率和合理的诞生速度如何,不同金属丰度下大多数DNS并合通常发生在恒星形成后不早于约40 Myr,并在80-250 Myr之间强烈峰值。约15%的DNS在80 Myr内并合,这可能解释了具有短暂恒星形成历史环境中的r过程富集,而≥20%的DNS在延迟时间>1 Gyr时并合,为年老、贫金属星系中的短伽马射线暴提供了解释。DTD的形状可能复杂,主导形成通道的金属丰度依赖分裂印刻出特征的双峰结构。尽管理想定向的诞生速度可以产生非常短的并合DNS,但我们发现所需速度大小与观测不一致。我们的工作对评估DNS并合在宇宙时间尺度上对r过程富集和伽马射线暴/千新星瞬变事件的贡献具有重要意义。

英文摘要

Metallicity can play a significant role in massive binary evolution through its impact on the opacity within stellar interiors and wind-driven mass loss. In this work, we investigate how the double neutron star (DNS) delay time distribution (DTD) is shaped by the metallicity-dependent evolution of the helium star$-$NS progenitor system. Drawing from insights rooted in single and binary star physics, we argue that at a given metallicity, the stellar radius during the helium main-sequence sets a lower limit on the size of the DNS orbit at birth. We then perform population synthesis with the detailed binary evolution code POSYDON to illustrate the resulting DTD across a range of metallicities. Our results indicate that, independent of the common envelope efficiency and reasonable natal kicks, the majority of DNS mergers across metallicities occur typically no earlier than $\simeq 40\,\rm{Myr}$ after star formation and peaks strongly between $80-250\,\rm{Myr}$. Roughly $15\%$ of DNSs merge within 80 Myr, which may explain $r$-process enrichment in environments with brief star formation histories, while $\gtrsim 20\%$ merge on delay times $>1$Gyr, providing an explanation for short gamma-ray bursts in old, metal-poor galaxies. The shape of the DTD can be complex, with a metallicity-dependent split in the dominant formation channel imprinting a characteristic double-peaked structure. Although ideally oriented natal kicks can produce very short merging DNS, we find that the required kick magnitudes are inconsistent with observations. Our work has implications for assessing the contribution of DNS mergers to $r$-process enrichment and gamma-ray bursts/kilonovae transients across cosmic time.

2606.10609 2026-06-12 math.RT 版本更新

Spin characters of the alternating group which are proportional to linear characters in characteristic 2

交错群在特征2中与线性特征成比例的旋量特征

Eoghan McDowell

AI总结 分类了交错群的旋量与非旋量不可约特征在2模约化下成比例的情况,等价于在奇阶元上成比例的情况。

详情
Comments
4 pages (v3: added explanation of notation for 4-bar-core)
AI中文摘要

我们分类了交错群的旋量与非旋量不可约特征何时具有成比例的2模约化。等价地,我们分类了这样一对特征何时在奇阶元上成比例。

英文摘要

We classify when a spin and a non-spin irreducible character of the alternating group have proportional 2-modular reductions. Equivalently, we classify when such a pair of characters are proportional on elements of odd order.

2606.06162 2026-06-12 cs.MA cs.GT 版本更新

Learning to Contest: Decentralized Robust Fairness in Cooperative MARL via Cross-Attention

学习竞争:通过交叉注意力实现合作多智能体强化学习中的去中心化鲁棒公平性

Can Savcı

AI总结 针对合作多智能体强化学习中公平性易被自私智能体利用的问题,提出基于交叉注意力的去中心化策略CAN,通过推断搭便车者数量并动态调整竞争力度,在分级竞争下实现接近集中式分配的鲁棒公平与效率。

详情
Comments
11 pages, 10 figures
AI中文摘要

最大化平等主义福利的公平合作多智能体强化学习(MARL)团队是可被利用的:一个自私的智能体通过搭便车占用公平智能体为提高最差者而放弃的剩余资源。集中式的基于需求的分配器可以消除这种利用,但只能通过将分配权从智能体手中夺走;去中心化策略是否能够鲁棒尚不清楚。我们证明这种无效性是完全竞争(all-or-nothing contention)的产物。在分级竞争(graded contention)下(竞争资源提供$1-c$,浪费$c$),我们证明对于任何$c<1$,最差的合作者如果与搭便车者竞争,严格优于让步,因此存在去中心化的杠杆(命题1)。实现这一杠杆是一个不确定性下的协调问题:搭便车者的数量未知且可变,因此任何固定规则都是次优的。我们引入CAN,一种基于智能体观察行为的置换等变交叉注意力策略,该策略推断搭便车者的数量并做出相应反应:当没有搭便车者时轮流分享,当存在时进行适度竞争。在与对抗性联盟(PSRO)训练后,CAN在整个竞争范围内保持最佳响应可剥削性低($\rho\approx1.2$-$1.5$,而未经保护时为$\rho=N$),在$D=0$时几乎无浪费(效率$\approx1.0$),在$D\geq1$时保留大部分效率(效率0.83-0.96),在两个指标上接近集中式预言机,无需中央分配器。公平MARL学习者在互补指标上失败(GGF/FEN产生可剥削性,SOTO过度竞争且浪费),而CAN两者兼得。在另外两个游戏中,我们发现其适用范围明确而非普遍:CAN保持高效且帕累托优于公平学习者,但其鲁棒性仅与竞争杠杆成比例:在多服务器游戏中强,在杠杆减弱时部分,在赢家通吃下消失(命题1失败)。我们还报告了其脆弱性:弱杠杆和零样本迁移到更大团队时在高竞争下性能下降。

英文摘要

Fair cooperative multi-agent reinforcement learning (MARL) teams that maximize an egalitarian welfare are exploitable: a single self-interested agent free-rides on the surplus that fair agents forgo to raise the worst-off, and the known remedy is a centralized need-based allocator. We show that a decentralized defense becomes possible once contention is graded: when a contested resource still delivers a fraction $1-c$, a worst-off cooperator that contests a free-rider strictly improves on yielding, so leverage exists for every $c < 1$. We introduce CAN, a permutation-equivariant cross-attention policy over agents' observed behaviour that infers how many free-riders are present and responds proportionally -- turn-taking when none, contesting just enough when some. Trained against an adversarial league, CAN keeps best-response exploitability near the centralized oracle ($ρ\approx 1.2\text{--}1.5$ vs. $ρ= N$ unprotected) at essentially no efficiency cost, whereas the fair-MARL learners (GGF, FEN, SOTO) each collapse to an exploitable or wasteful extreme. Giving those objectives CAN's identical adversarial training does not rescue them, so the objective -- not adversarial training alone -- is what makes hardening possible. Against a committed (non-adaptive) defector, every learned defense including ours provides deterrence rather than immunity, weakening as the leverage $(1-c)/2$ vanishes. Across further environments and team sizes the same principle sets the scope: robustness holds exactly as far as the game's contest leverage reaches, and we map that boundary rather than claim to remove it.

2606.05850 2026-06-12 physics.comp-ph 版本更新

Towards stable and accurate electron dynamics via neural network based time-dependent variational Monte Carlo

基于神经网络的时间相关变分蒙特卡洛实现稳定精确的电子动力学

Weizhong Fu, Zhe Li, Yubing Qian, Ruichen Li, Weiluo Ren, Ji Chen

AI总结 提出神经基组时间相关变分蒙特卡洛框架,通过将时间演化约束在神经基组张成的紧致流形上,实现了电子动力学的稳定高精度模拟,并准确计算了激光驱动偶极响应和动态极化率。

详情
AI中文摘要

相互作用电子的实时动力学处于量子力学与非平衡物理的交叉点,支配着分子和纳米材料中超快现象的微观起源。尽管神经网络变分蒙特卡洛在稳态计算中取得了前所未有的精度,但其扩展到实时演化仍然具有挑战性。在这项工作中,我们引入了神经基组时间相关变分蒙特卡洛框架,实现了电子动力学的稳定且高精度模拟。通过将时间演化约束在由神经基组张成的紧致定制流形上,我们有效绕过了不稳定性问题,并实现了长期稳定演化。此外,我们证明该框架在模拟氢原子和拉伸氢分子的激光驱动偶极响应时达到了基准质量精度,并准确提取了氦和铍原子的动态极化率。我们的工作揭示了神经网络波函数在精确描述实时电子动力学方面的巨大潜力,并为复杂时间相关电子现象的第一性原理模拟开辟了一条有前景的新途径。

英文摘要

Real-time dynamics of interacting electrons lies at the interface between quantum mechanics and non-equilibrium physics, governing the microscopic origin of ultrafast phenomena of molecules and nano-materials. Though neural network variational Monte Carlo has achieved unprecedented accuracy for stationary state calculations, its extension to real-time evolution remains challenging. In this work, we introduce the neural basis time-dependent variational Monte Carlo framework, which achieves stable and highly accurate simulations of electron dynamics. By constraining the time evolution to a compact, customized manifold spanned by the neural basis, we effectively bypass instability issues and achieve long-term stable evolution. Moreover, we demonstrate that this framework yields benchmark-quality accuracy in simulating the laser-driven dipole responses of the hydrogen atom and a stretched hydrogen molecule, and accurately extracts the dynamic polarizabilities of helium and beryllium atoms. Our work reveals the vast potential of neural network wavefunctions for accurately describing real-time electron dynamics and establishes a promising new route for first-principles simulations of complex, time-dependent electronic phenomena.

2606.05166 2026-06-12 astro-ph.CO 版本更新

Compressed Gaussian likelihood for the Planck low-$\ell$ data

压缩的\emph{高斯}似然用于\textit{Planck}低$\ell$数据

Nanoom Lee

AI总结 针对Planck CMB低$\ell$ E模式极化数据,基于Sroll2似然构建了一个压缩的高斯似然,解决了非高斯似然与Fisher矩阵分析不兼容的问题,并通过MCMC验证了其在$\Lambda$CDM和扩展模型中的准确性。

详情
Comments
7+2 pages, 4 figures. Comments are welcome
AI中文摘要

我们提出了一个压缩的\emph{高斯}似然,用于\textit{Planck} CMB低$\ell$ E模式极化数据,该似然基于\texttt{Sroll2}似然构建,后者提供了迄今为止对再电离光学深度$τ$的最严格约束。CMB低$\ell$ TT和EE似然的非高斯形式使其与需要解析高斯$χ^2$的Fisher矩阵分析(如Fisher偏差形式和Fisher预测)不兼容。我们证明,偏移对数正态似然的$χ^2$在对数变换的功率谱幅度中呈高斯形式,因此可以在Fisher矩阵分析中作为该变量真实高斯似然的代理,无需显式变量变换。在此基础上,我们将\texttt{Sroll2}似然压缩为少量分段偏移对数正态函数,并通过MCMC结合\textit{Planck}和ACT DR6数据对其与完整\texttt{Sroll2}似然进行验证,发现所有$Λ$CDM参数和扩展宇宙学模型均高度一致。我们进一步证明,从压缩似然得到的Fisher矩阵不确定性估计与完整MCMC后验吻合良好。我们发布了压缩似然\texttt{planck-gaussian-lowl},这是一个轻量级Python包,包含了先前工作中的压缩低$\ell$ TT似然,允许将Planck CMB低$\ell$数据轻松纳入任何基于高斯似然的分析中。该包公开在\href{https://github.com/nanoomlee/planck-gaussian-lowl}{github.com/nanoomlee/planck-gaussian-lowl}。

英文摘要

We present a compressed Gaussian likelihood for the Planck CMB low-$\ell$ E-mode polarization data, constructed from the SRoll2 likelihood which provides the tightest constraint on the reionization optical depth $τ$ to date. The non-Gaussian form of CMB low-$\ell$ TT and EE likelihoods makes them incompatible with Fisher matrix analyses that require an analytic Gaussian $χ^2$, such as the Fisher-bias formalism and Fisher forecasts. We show that the $χ^2$ of an offset log-normal likelihood takes a Gaussian form in the log-transformed power spectrum amplitudes, and can therefore serve as a proxy for the true Gaussian likelihood of this variable in Fisher matrix analyses, without any explicit change of variables. Building on this, we compress the SRoll2 likelihood into a small number of piecewise offset log-normal functions and validate it against the full SRoll2 likelihood via MCMC combined with Planck and ACT DR6 data, finding excellent agreement across all $Λ$CDM parameters and in extended cosmological models. We further demonstrate that Fisher matrix uncertainty estimates from our compressed likelihood agree well with the full MCMC posteriors. We release our compressed likelihood planck-gaussian-lowl, a lightweight Python package incorporating the compressed low-$\ell$ TT likelihood from previous work, allowing a straightforward incorporation of the Planck CMB low-$\ell$ data into any Gaussian-likelihood-based analysis. The package is publicly available at \href{https://github.com/nanoomlee/planck-gaussian-lowl}{github.com/nanoomlee/planck-gaussian-lowl}.

2606.03377 2026-06-12 cs.HC cs.CY 版本更新

Intellectual Humility as a Cognitive Filter for AI-Generated Health Misinformation. An Evolutionary Perspective on Epistemic Vigilance

智力谦逊作为AI生成健康错误信息的认知过滤器:基于进化视角的认知警觉研究

Marcin Rządeczka, Maciej Wodziński, Kacper Zacharski, Marcin Moskalewicz

AI总结 通过实验(N=99)发现,智力谦逊(对认知局限的元认知意识)能选择性过滤AI生成的健康对话中的伪科学内容,但对准确内容的可信度评估无显著影响,且谦逊与识别AI来源的能力无关,表明认知警觉作用于内容质量而非来源归因。

详情
Comments
9 pages, 2 figures
AI中文摘要

我们展示了一项研究(N=99)的实验结果,考察智力谦逊(IH),即对认知局限的元认知意识,如何影响对科学严谨性不同的AI生成健康对话的评估。参与者被随机分配评估三段关于运动与心理健康的对话:科学准确的、中度伪科学的或强烈伪科学的。结果显示,IH作为一种选择性认知过滤器。谦逊得分较高的个体认为伪科学内容的可信度显著较低,而对准确内容的可信度评估则无相关性。关键的是,谦逊并不能预测识别对话为AI来源的能力,表明认知警觉作用于内容质量而非来源归因。我们从进化视角解释这些发现,提出IH代表一种祖先适应,用于在信息不确定环境中导航。尽管人类缺乏检测AI来源的进化机制,但IH在检测AI生成内容中的利用企图方面仍然有效。该研究有助于理解基础模型如何改善或削弱人类的认知防御,尤其是在健康传播语境中。

英文摘要

We present experimental findings from a study (N=99) examining how intellectual humility (IH), i.e., the metacognitive awareness of epistemic limitations, affects the evaluation of AI-generated health dialogues varying in scientific rigor. Participants were randomly assigned to evaluate one of three dialogues about exercise and mental health: scientifically accurate, moderately pseudoscientific, or strongly pseudoscientific. Results reveal that IH functions as a selective cognitive filter. Individuals with higher humility scores rated pseudoscientific content as significantly less credible, while showing no correlation with credibility assessments of accurate content. Crucially, humility did not predict the ability to identify AI as the source of dialogues, suggesting that epistemic vigilance operates on content quality rather than source attribution. We interpret these findings through an evolutionary lens, proposing that IH represents an ancestral adaptation for navigating informationally uncertain environments. It remains effective at detecting exploitation attempts in AI-generated content, despite humans lacking evolved mechanisms for detecting AI sources. The study contributes to understanding how foundation models might improve or undermine human epistemic defenses, especially in health communication contexts.

2606.03317 2026-06-12 cs.SI 版本更新

Ollivier-Ricci curvature in cycle overlap mode

Ollivier-Ricci曲率在循环重叠模式中

Zexian Zhou, Bo Jiao

AI总结 针对大规模无标度图中Ollivier-Ricci曲率计算复杂度高或误差大的问题,提出一种基于3、4、5-循环的最优传输原理的曲率计算方法CCOM,通过贪心剪枝算法近似最优传输,显著提高曲率精度并降低时间消耗。

详情
Comments
26 pages, 9 figures
AI中文摘要

边(x,y)的Ollivier-Ricci曲率通过比较从x的邻居到y的邻居的传输距离来定义。它是一种结构度量,已在社区检测和深度神经网络等多个领域得到研究。然而,高计算复杂度或误差限制了其在大型无标度图中的应用。本文提出了一种最优传输原理,通过包含边(x,y)的3、4、5-循环来最小化距离,并设计了一种名为循环重叠模式曲率(CCOM)的曲率计算方法。在该方法中,提出了一种贪心和剪枝算法来近似最优传输原理。我们从理论和实验上验证了我们的CCOM方法能够在低时间消耗下显著提高真实世界网络上曲率的准确性。此外,我们在使用相同曲率框架的社区检测任务中将CCOM与基线近似方法进行了比较,并实验证实了CCOM在大型无标度图上的有效性。

英文摘要

Ollivier-Ricci curvature of an edge (x,y) is defined by comparing the distance taken to transport from neighbors of x to neighbors of y. It is a structural measure that has been studied in many fields such as community detection and deep neural networks. However, high computational complexity or error limits its application in large scale-free graphs. This paper proposes an optimal transport principle to minimize the distance by 3,4,5-cycles that include the edge (x,y), and designs a curvature calculation approach named Curvature in Cycle Overlap Mode (CCOM). In this approach, a greedy and pruning algorithm is proposed to approximate the optimal transport principle. We theoretically and experimentally verified that our approach CCOM can significantly improve the accuracy of the curvature on real-world networks with low time consumption. In addition, we compared CCOM with baseline approximation approaches in community detection tasks using the same curvature-based framework, and experimentally confirmed the effectiveness of CCOM on large scale-free graphs.

2606.03001 2026-06-12 cs.DC 版本更新

FOLD: Fuzzy Online Deduplication for Very Large Evolving Datasets via Approximate Nearest Neighbor Search

FOLD: 面向超大规模演化数据集的模糊在线去重方法(基于近似最近邻搜索)

Nelson Bore, Pritish Mishra, Constantin Adam, Eyal de Lara, Oana Balmau

AI总结 提出FOLD系统,利用增量更新的HNSW索引和位图表示改进Jaccard相似度计算,实现高召回率和高吞吐量的在线模糊去重。

详情
AI中文摘要

模糊去重是构建大型语言模型训练语料库的关键。然而,经典的局部敏感哈希管道随着语料库的增长扩展性差,且不适合持续摄入。我们提出了FOLD(模糊在线去重),一个在线模糊去重系统,为演化数据集提供高召回率和高吞吐量。FOLD对已接纳文档维护一个增量更新的HNSW索引,为每个传入文档检索一个小而高质量的候选邻域,而不是重复重建全局桶或重新扫描累积的语料库。据我们所知,FOLD是第一个使用HNSW的在线模糊去重系统。然而,直接应用Jaccard相似度会导致分数拥挤,使得在少量步骤内的图遍历不可靠。FOLD通过一种位图表示解决了这个问题,该表示在HNSW搜索期间提供了更具区分性、与Jaccard对齐的信号。在四个LLM规模的数据集(LM1B、C4、RealNews和Common Crawl)上,随着语料库的增长,FOLD保持快速和准确:在最大评估规模下,它保持93-97%的召回率,吞吐量比竞争替代方案高出2.09倍,而竞争方案的最佳召回率仅达到76%。

英文摘要

Fuzzy deduplication is key to constructing large language model training corpora. However, classic Locality-Sensitive Hashing (LSH) pipelines scale poorly as corpora grow and are ill-suited to continuous ingestion. The main issue is that each new document batch must be checked against the admitted corpus before insertion. As the corpus grows, the LSH buckets grow: each query can hit several large buckets and must scan the returned candidates. To solve this problem, we present RAD (Retrieval-Augmented Deduplication), an online fuzzy deduplication system that delivers both high recall and throughput for evolving datasets. RAD maintains an incrementally updated HNSW index over admitted documents, retrieving a small, high-quality candidate neighborhood for each incoming document instead of repeatedly re-scanning the accumulated corpus. RAD is the first online fuzzy deduplication system to use HNSW, leading to stable throughput as datasets grow. However, it is not easy to maintain high recall when using HNSW-style indexes. The core issue is the distance metric between graph nodes. Jaccard similarity, the metric used for fuzzy deduplication, yields low recall when applied out-of-the-box with an HNSW index. It leads to distance score crowding, making graph traversal unreliable within a bounded number of steps. RAD addresses this with a bitmap representation that provides a more discriminative, Jaccard-aligned signal during HNSW search. Across four LLM-scale datasets (LM1B, C4, RealNews, and Common Crawl), RAD preserves the scaling trajectory needed for online fuzzy deduplication: at 30M documents, it maintains 0.94-0.97 recall relative to state-of-the-art LSH solutions, and delivers up to an 8x throughput increase.

2606.02868 2026-06-12 eess.SY cs.SY 版本更新

Closed-Form PI and PID Tuning of All-Pole Plants up to Third Order for Monotonic Minimum-Settling Step Responses

针对单调最小稳定时间解的三阶以下被控对象的PI和PID整定

Senol Gulgonul

AI总结 提出一种统一的闭环解析PI/PID整定方法,针对三阶以下全极点被控对象,实现严格单调(零超调)且最小稳定时间的阶跃响应。

详情
Comments
v2: extended with monotonicity windows, third-order boundary theorem in final form, and comparisons; subsumes arXiv:2604.21294
AI中文摘要

提出一种统一的闭环解析PI/PID整定方法,针对三阶以下全极点被控对象,能够产生严格单调(零超调)且最小稳定时间的阶跃响应。设计目标是二项式闭环传递函数 p^n/(s+p)^n,该函数单调且鲁棒性仅依赖于阶数 n。由于在固定极点模式中添加左半平面零点只会减慢响应,最小稳定时间解要求控制器零点被抵消,这迫使控制器分子整除被控对象分母。贯彻这一原理表明,对于任何稳定的被控对象,精确的实增益解存在于:二阶以下被控对象使用PI控制器,三阶被控对象使用PID控制器;超出此范围时,残差二项式因子会出现一个复极点对,而一般被控对象不包含该复极点对。推导了一阶被控对象(PI)、具有实极点和复极点的二阶被控对象(PI和PID)、以及具有三个实极点或一个实极点加一个复极点对的三阶被控对象(PID)的显式增益。二阶PI情况作为最低阶实例被完整处理。单调性保证了 Mt=1,因此 Ms<2,相位裕度大于60度,增益裕度大于6 dB,对于二项式族这些值收紧为通用常数。数值验证确认了结果。

英文摘要

A unified, closed-form analytical PI/PID tuning method is presented for all-pole plants up to third order that yields a strictly monotonic (zero-overshoot) step response with minimum settling time. The design target is the binomial closed loop p^n/(s+p)^n, which is monotonic with robustness depending only on the order n. Because a fixed PI/PID cannot assign the closed-loop poles and the controller zeros independently, realizing this target exactly requires the controller zeros to be cancelled, which forces the controller numerator to divide the plant denominator. It follows that an exact, real-gained solution exists for any stable plant precisely up to second order with a PI controller and third order with a PID controller; beyond that the residual binomial factor acquires a complex pair of damping sqrt(3)/2, which a generic plant does not contain. Explicit gains are derived for first-order plants (PI), second-order plants with real and complex poles (PI and PID), and third-order plants with three real poles or one real pole plus a complex pair (PID). The freedom of the coincident designs is shown to be bounded: a quadratic nonnegativity condition gives the exact window of the design pole for strict monotonicity, which collapses at the pole-ratio-2 changeover for real poles and is nonempty for damping ratios above approximately 0.443 for complex poles. Monotonicity guarantees Mt = 1, hence Ms <= 2, phase margin >= 60 degrees, and gain margin >= 6 dB, tightening to universal constants for the binomial family. Load-disturbance attenuation obeys IAEd = 1/Ki, making the cost of cancellation explicit, and comparisons with SIMC, the CHR zero-overshoot rule, and deadbeat-fitted explicit formulas quantify the trade: at matched maximum sensitivity the proposed design settles faster than SIMC on the third-order example, with markedly lower controller gains and peak control effort.

2606.03633 2026-06-12 nucl-th 版本更新

Mechanical properties of the nucleon in the chiral confining model. II -- in-medium evolution of the nucleon properties

手征禁闭模型中核子的力学性质. II -- 核子性质的在介质演化

Guy Chanfray, Hubert Hansen, Bikram Keshari Pradhan

AI总结 在手征禁闭模型框架下,通过冯·劳厄稳定性条件确定介质中核子试验态,研究标量场对复合核子的响应以及禁闭和手征对称性破缺对核子质量演化的作用,并分析核子内部能量密度和压力分布的变化。

详情
Comments
22 pages, 8 figures
AI中文摘要

本文致力于在手征禁闭模型框架内研究束缚于核物质中的核子性质的演化。根据前期姊妹论文(标记为I)中建立的正式结果,通过施加冯·劳厄稳定性条件确定介质中核子试验态(局域化因子化波函数或动量投影态)。主要结果涉及复合核子对标量场的响应,以及禁闭和手征对称性破缺在介质中核子质量演化中的各自作用。该演化支配着核饱和机制所需的三体排斥力。我们还分析了介质中核子内部能量密度分布和压力分布的变化。此外,我们就束缚核子性质与中子星内部致密物质状态方程之间的映射提出了一些展望。

英文摘要

This article is devoted to the study of the evolution of the properties of nucleons bound in nuclear matter within the framework of the chiral confining model. The in-medium nucleon trial states (either localized factorized wave functions or momentum-projected states) are determined by imposing the von Laue stability condition, according to the formal results established in a preliminary companion paper (labeled as I). The main results concern the response of the composite nucleon to the scalar field, as well as the respective roles of confinement and chiral symmetry breaking in the evolution of the in-medium nucleon mass. This evolution governs the repulsive three-body forces required for the nuclear saturation mechanism. We also analyze the modification of the energy density distribution and the pressure distribution inside the in-medium nucleon. We also draw some perspectives concerning the mapping between bound nucleon properties and the equation of state of dense matter as realized in the deep interior of neutron stars.

2606.03588 2026-06-12 nucl-th 版本更新

Mechanical properties of the nucleon in the chiral confining model. I -- formal developments

手征禁闭模型中核子的力学性质 I——形式发展

Guy Chanfray, Hubert Hansen, Bikram Keshari Pradhan

AI总结 本文在手征禁闭模型框架下,通过von Laue稳定性条件确定核子试探态,形式化推导了核子内部总能量、平均压强、能量密度和压强分布的详细表达式。

详情
Comments
25 pages, 5 figures
AI中文摘要

我们在一类模型中讨论了核子的力学稳定性问题,其中大质量组分夸克受到禁闭势的作用,并与包围夸克核心的π云耦合。核子试探态(局域化因子化波函数或动量投影态)通过施加von Laue稳定性条件确定。本文主要致力于与核子内部总能量(质量)、平均压强、能量密度和压强分布详细表达式相关的形式方面。它将由一篇补充文章伴随,该文章讨论核子性质随密度的演化,与手征对称性恢复相关。

英文摘要

We discuss the issue of the mechanical stability of the nucleon within a class of models in which massive constituent quarks are subject to a confining potential and are coupled to a surrounding pion cloud enveloping the quark core. The nucleon trial states (either localized factorized wave functions or momentum-projected states) are determined by imposing the von Laue stability condition. This article is primarily devoted to the formal aspects related to the detailed expressions for the total energy (mass), the average pressure, the energy density and the pressure distribution inside the nucleon. It will be accompanied by a complementary article addressing the evolution of nucleon properties with density, associated with the restoration of chiral symmetry.

2606.02929 2026-06-12 gr-qc hep-th 版本更新

Kerr--Schild--AdS geometries in quadratic f(R) gravity: A no-go theorem

二次f(R)引力中的Kerr--Schild--AdS几何:一个不可行定理

Alikram N. Aliev

AI总结 研究二次f(R)引力中的Kerr--Schild--AdS几何,证明场方程动态强制恒定标量曲率并唯一选择Kerr--AdS解族,建立该几何类别的不可行定理。

详情
Comments
Minor revisions; 8 pages
AI中文摘要

我们在二次$f(R)$引力中研究Kerr--Schild--AdS几何,而不先验地施加恒定曲率条件$R=R_0$。对于几何上自然的Kerr--Schild--AdS子类,我们证明场方程动态强制恒定标量曲率,并唯一选择Kerr--AdS解族。因此,二次$f(R)$引力在爱因斯坦分支之外不存在Kerr--AdS黑洞解,从而为这类几何建立了一个不可行定理。

英文摘要

We investigate Kerr-Schild-AdS geometries in quadratic f(R) gravity without imposing the constant-curvature condition R=R_0 a priori. We show that the field equations dynamically enforce constant scalar curvature and uniquely select the Kerr--AdS family of solutions. Thus, quadratic f(R) gravity admits no Kerr--AdS solutions beyond the Einstein branch, establishing a no-go theorem for this class of geometries.

2606.03473 2026-06-12 cond-mat.dis-nn 版本更新

Hierarchical crack patterns: Identification of crack generations

层次裂纹模式:裂纹世代的识别

Yuri Yu. Tarasevich, Andrei S. Burmistrov, Andrei V. Eserkepov

AI总结 通过将有向无环图的拓扑排序应用于裂纹图像,提出一种鲁棒的层次裂纹世代分类方法。

详情
Comments
8 pages, 8 figures, 22 references
AI中文摘要

各种起源的层次裂纹模式在我们周围的世界中无处不在。我们将整个层次裂纹模式部分图像中的裂纹世代分类问题简化为经典的有向无环图拓扑排序。该分类方法对模式图像边界的合理偏移具有鲁棒性。

英文摘要

Identifying crack generations from microscopic images of hierarchical crack patterns is challenging due to the lack of temporal information and sensitivity to image boundaries. Existing algorithms often fragment individual cracks or lose stability when the observed fragment is shifted. We propose a method that reduces the classification problem to topological sorting of a directed acyclic graph (descendant$\to$parent), built from T-junctions and nearly collinear edges. Sequential removal of leaf vertices assigns generation numbers starting from the youngest. On 100 computer-generated networks, our method correctly classifies $\approx 70$\% of cracks at a window size of only three mean edge lengths, whereas a conventional approach that starts from primary cracks drops nearly to zero. The classification is highly stable against reasonable shifts of image boundaries but remains limited to strictly hierarchical networks.