arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.20324 2026-06-05 cs.CR cs.AI

RAG Security and Privacy: Formalizing the Threat Model and Attack Surface

RAG安全与隐私：形式化威胁模型和攻击面

Atousa Arzanipour, Rouzbeh Behnia, Reza Ebrahimi, Kaushik Dutta

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结本文研究了RAG系统中的安全与隐私问题，提出首个形式化的威胁模型，定义了攻击向量如文档级成员推断和数据中毒，以提升对RAG系统隐私和安全性的理解。

Comments Published at the 5th ICDM Workshop in November 2025

详情

DOI: 10.1109/ICDMW69685.2025.00165
Journal ref: 2025 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 1387-1394, 2025

AI中文摘要

检索增强生成（RAG）是一种新兴的自然语言处理方法，结合大型语言模型（LLMs）与外部文档检索以生成更准确和基于事实的响应。尽管RAG在减少幻觉和提高事实一致性方面表现出色，但其也引入了与传统LLMs不同的隐私和安全挑战。现有研究表明，LLMs可通过训练数据记忆或对抗性提示泄露敏感信息，而RAG系统继承了许多这些漏洞。同时，RAG依赖外部知识库打开了新的攻击面，包括可能泄露检索文档的存在或内容信息，或注入恶意内容以操控模型行为。尽管存在这些风险，目前尚无正式框架定义RAG系统的威胁景观。本文通过提出首个形式化的RAG威胁模型，填补了文献中的关键空白。我们引入了基于对模型组件和数据访问的对手类型的结构化分类，并正式定义了关键威胁向量，如文档级成员推断和数据中毒，这些向量在实际部署中对隐私和完整性构成严重风险。通过建立正式定义和攻击模型，本文为更严谨和原则性的理解RAG系统的隐私和安全奠定了基础。

英文摘要

Retrieval-Augmented Generation (RAG) is an emerging approach in natural language processing that combines large language models (LLMs) with external document retrieval to produce more accurate and grounded responses. While RAG has shown strong potential in reducing hallucinations and improving factual consistency, it also introduces new privacy and security challenges that differ from those faced by traditional LLMs. Existing research has demonstrated that LLMs can leak sensitive information through training data memorization or adversarial prompts, and RAG systems inherit many of these vulnerabilities. At the same time, reliance of RAG on an external knowledge base opens new attack surfaces, including the potential for leaking information about the presence or content of retrieved documents, or for injecting malicious content to manipulate model behavior. Despite these risks, there is currently no formal framework that defines the threat landscape for RAG systems. In this paper, we address a critical gap in the literature by proposing, to the best of our knowledge, the first formal threat model for retrieval-RAG systems. We introduce a structured taxonomy of adversary types based on their access to model components and data, and we formally define key threat vectors such as document-level membership inference and data poisoning, which pose serious privacy and integrity risks in real-world deployments. By establishing formal definitions and attack models, our work lays the foundation for a more rigorous and principled understanding of privacy and security in RAG systems.

URL PDF HTML ☆

赞 0 踩 0

2509.02971 2026-06-05 stat.ML cs.LG cs.NA math.NA math.PR

Scale-Adaptive Generative Flows for Multiscale Scientific Data

多尺度科学数据的自适应生成流

Yifan Chen, Eric Vanden-Eijnden

发表机构 * Department of Mathematics, University of California, Los Angeles（加州大学洛杉矶分校数学系）； Machine Learning Lab, Capital Fund Management（资本基金管理有限公司机器学习实验室）； Courant Institute, New York University（纽约大学柯朗研究所）

AI总结本文提出了一种多尺度科学数据生成模型，通过设计噪声分布和插值计划，解决多尺度傅里叶谱数据中的数值挑战，提高了生成样本的质量和效率。

详情

AI中文摘要

基于流的生成模型在处理具有多尺度傅里叶谱的科学数据时常常面临数值挑战，通常在细尺度上产生较大的误差。我们通过在流匹配和随机插值框架内，通过噪声分布和插值计划的原理性设计来解决这个问题。在函数空间中工作可以确保生成模型在分辨率细化时仍然定义良好；漂移的Lipschitz正则性对这种函数空间的良定义性和固定分辨率下的积分成本都很重要。核心观察是噪声应至少与目标分布一样粗糙——通过傅里叶谱衰减来衡量——以保持Lipschitz常数有限。对于已知细尺度结构的高斯和近高斯目标，匹配谱噪声比标准白噪声选择更有效。对于更复杂的非高斯目标，匹配谱噪声可能不足以应对噪声比数据粗糙时出现的终端时间刚性问题，我们提出自适应插值计划来缓解这种情况。在合成高斯随机场和随机Allen-Cahn和Navier-Stokes方程不变测度上的数值实验展示了该方法，并证明了其在传统方法基础上以更低计算成本生成高质量样本的能力。

英文摘要

Flow-based generative models can face numerical challenges on scientific data with multiscale Fourier spectra, often producing large errors at fine scales. We approach this problem within the flow matching and stochastic interpolants framework, through the principled design of noise distributions and interpolation schedules. Working in function space ensures that the generative model remains well defined as the resolution is refined; the Lipschitz regularity of the drift is important to both this function-space well-posedness and the integration cost at fixed resolution. The central observation is that the noise should be at least as rough as the target distribution -- measured by Fourier-spectrum decay -- in order to keep the Lipschitz constant finite. For Gaussian and near-Gaussian targets whose fine-scale structure is known, matched-spectrum noise improves numerical efficiency over standard white-noise choices. For more complex non-Gaussian targets, matched-spectrum noise may not be sufficient, and we propose scale-adaptive interpolation schedules to mitigate the terminal-time stiffness that arises when the noise is rougher than the data. Numerical experiments on synthetic Gaussian random fields and on invariant measures of the stochastic Allen--Cahn and Navier--Stokes equations illustrate the approach and demonstrate its ability to generate high-fidelity samples at lower computational cost than traditional approaches.

URL PDF HTML ☆

赞 0 踩 0

2508.20693 2026-06-05 cs.DL cs.CL

Leveraging Large Language Models for Generating Research Topic Ontologies: A Multi-Disciplinary Study

利用大型语言模型生成研究主题本体：多学科研究

Tanay Aggarwal, Angelo Salatino, Francesco Osborne, Enrico Motta

发表机构 * Knowledge Media Institute, The Open University（开放大学知识媒体学院）； The Open University（开放大学）； University of Milano Bicocca（米兰比克卡大学）； Department of Business and Law, University of Milano Bicocca（米兰比克卡大学商学院与法学院）

AI总结本文研究了大型语言模型在生物医学、物理和工程学三个学科中识别研究主题语义关系的能力，通过零样本提示、链式思维提示和在现有本体上微调三种条件评估模型性能，并引入PEM-Rel-8K数据集验证跨学科迁移能力。

详情

AI中文摘要

研究领域本体和分类法对于管理和组织科学知识至关重要，因为它们有助于信息的高效分类、传播和检索。然而，创建和维护此类本体是昂贵且耗时的任务，通常需要多个领域专家的协同工作。因此，此类本体在不同学科中的覆盖程度不均，学科间连接有限，更新周期也较短。在本研究中，我们探讨了几种大型语言模型在生物医学、物理和工程学三个学科中识别研究主题间语义关系的能力。模型在三种不同的条件下进行评估：零样本提示、链式思维提示和在现有本体上微调。此外，我们通过测量模型在某一学科训练后应用到不同学科的表现，评估了微调模型的跨学科迁移能力。为了支持这项分析，我们引入了PEM-Rel-8K数据集，该数据集包含从生物医学、物理和工程学三个学科中最广泛采用的分类法中提取的超过8000个关系。我们的实验表明，将大型语言模型微调到PEM-Rel-8K上在所有学科中都表现出色。

英文摘要

Ontologies and taxonomies of research fields are critical for managing and organising scientific knowledge, as they facilitate efficient classification, dissemination and retrieval of information. However, the creation and maintenance of such ontologies are expensive and time-consuming tasks, usually requiring the coordinated effort of multiple domain experts. Consequently, ontologies in this space often exhibit uneven coverage across different disciplines, limited inter-discipline connectivity, and infrequent updating cycles. In this study, we investigate the capability of several large language models to identify semantic relationships among research topics within three academic disciplines: biomedicine, physics, and engineering. The models were evaluated under three distinct conditions: zero-shot prompting, chain-of-thought prompting, and fine-tuning on existing ontologies. Additionally, we assessed the cross-discipline transferability of fine-tuned models by measuring their performance when trained in one discipline and subsequently applied to a different one. To support this analysis, we introduce PEM-Rel-8K, a novel dataset consisting of over 8,000 relationships extracted from the most widely adopted taxonomies in the three disciplines considered in this study: MeSH, PhySH, and IEEE. Our experiments demonstrate that fine-tuning LLMs on PEM-Rel-8K yields excellent performance across all disciplines.

URL PDF HTML ☆

赞 0 踩 0

2508.19006 2026-06-05 q-fin.PR cs.LG econ.EM q-fin.CP

Is attention truly all we need? An empirical study of asset pricing in pretrained RNN sparse and global attention models

注意力真的全部我们需要吗？对预训练RNN稀疏和全局注意力模型在资产定价中的实证研究

Shanyan Lai

发表机构 * Department of Economics and Related Studies, Univiersity of York（经济与相关研究系，约克大学）

AI总结本文研究了预训练RNN注意力模型在资产定价中的应用，探讨了注意力机制在捕捉时间依赖性和长期记忆方面的改进，以及在不同市场条件下的稳定性。

Comments 72 pages including appendix

详情

AI中文摘要

本研究探讨了主流注意力机制，如加权注意力、Luong的三种注意力、全局自注意力和滑动窗口稀疏注意力，在顶级420只大型美国股票上的实证资产定价研究。这是首次将大规模最先进的（SOTA）注意力机制应用于资产定价领域。这些模型克服了传统机器学习资产定价方法的局限性，如误捕时间依赖性和短期记忆。此外，注意力机制中的强制因果掩码解决了未来数据泄漏问题，而这一问题被更先进的注意力模型如经典Transformer所忽视。所提出的注意力模型还考虑了资产定价数据的时间稀疏性，并通过部署简化模型结构来缓解潜在的过拟合问题。本文为未来实证经济研究提供了某些见解。所有模型均在三个时期内进行测试，涵盖新冠前、新冠期间和新冠后一年，以测试这些模型在极端市场条件下的稳定性。研究发现，在价值加权投资组合回测中，全局自注意力模型和滑动窗口稀疏注意力模型在获得绝对收益和对冲下行风险方面表现出色，在新冠期间静态交易成本情景下，它们分别实现了2.0和1.80的年化Sortino比率。此外，从绝对投资组合收益的角度来看，滑动窗口稀疏注意力模型在股票市值大小方面比全局自注意力模型表现更加稳定。

英文摘要

This study investigates the pre-trained RNN attention models with the mainstream attention mechanisms, such as additive attention, Luong's three attentions, global self-attention and sliding window sparse attention, for the empirical asset pricing research on the top 420 large-cap US stocks. This is the first paper on the large-scale state-of-the-art (SOTA) attention mechanisms applied in the asset pricing context. They overcome the limitations of the traditional machine learning-based asset pricing, such as mis-capturing the temporal dependency and short memory. Moreover, the enforced causal masks in the attention mechanisms address the future data leaking issue ignored by the more advanced attention-based models, such as the classic Transformer. The proposed attention models also consider the temporal sparsity characteristic of asset pricing data and mitigate potential overfitting issues by deploying the simplified model structures. This provides some insights for future empirical economic research. All models are examined in three periods, which cover pre-COVID-19, COVID-19 and one year post-COVID-19, for testing the stability of these models under extreme market conditions. The study finds that in value-weighted portfolio back testing, the global self-attention model and the sliding window sparse attention model exhibit excellent capabilities in deriving the absolute returns and hedging downside risks, while they achieve an annualized Sortino ratio of 2.0 and 1.80 respectively in the period with COVID-19 in the static transaction cost scenario. Moreover, the sliding window sparse attention model performs more stably than the global self-attention model from the perspective of absolute portfolio returns with respect to the size of stocks' market capitalization.

URL PDF HTML ☆

赞 0 踩 0

2508.10555 2026-06-05 physics.comp-ph cs.CE cs.LG

A Differentiable Framework for Full and Phaseless Data Inversion Using Neural Implicit Contrast-Source Representation

一种基于神经隐式对比源表示的全数据和相位less数据反演可微框架

Haoran Sun, Daoqi Liu, Hongyu Zhou, Maokun Li, Shenheng Xu, Fan Yang

发表机构 * Department of Electronic Engineering, Beijing National Research Center for Information Science and Technology (BNRist), and State Key laboratory of Space Network and Communications（电子工程系，北京信息科学与技术国家研究中心（BNRist），空间网络与通信国家重点实验室）

AI总结本文提出了一种基于神经隐式对比源表示的可微框架，用于全数据和相位less数据反演，通过引入轻量级残差多层感知机作为连续神经场，提升了反演精度和鲁棒性，同时通过总变分正则化将状态方程和数据方程结合，形成可微目标函数，实现了端到端的可微优化。

详情

AI中文摘要

在本研究中，我们扩展了对比源反演，将其扩展为一个完全可微、无监督的框架，基于神经隐式表示的对比源。具体来说，而不是使用像素级离散表示，对比源由一个轻量级残差多层感知机（ResMLP）参数化，作为连续神经场，该神经场基于空间坐标和发射器设置进行条件化。这种连续参数化提供了更灵活的对比源表示，并在有噪声测量的情况下提高了重建精度和鲁棒性。基于此表示，状态方程和数据方程与总变分正则化相结合，形成一个可微的目标函数。通过将VIE约束反演重新公式化为一个端到端的可微优化问题，网络参数和介质对比率通过自动微分联合优化。在相同框架内，通过仅修改数据失配函数，同时支持全数据和相位less数据反演。数值实验表明，该方案在各种噪声水平和测量设置下，比传统CSI具有更高的重建精度和鲁棒性。连续神经场进一步使超分辨率推理成为可能，在训练网格更细的分辨率下实现，将反演成本与重建保真度解耦。消融研究和与替代神经架构的比较进一步确认，对比源参数化和基于VIE的公式化对于观察到的改进都是必不可少的。

英文摘要

In this study, we extend the contrast source inversion to a fully differentiable, unsupervised framework based on a neural implicit representation of the contrast source. Specifically, instead of a pixel-wise discrete representation, the contrast source is parameterized by a lightweight residual multilayer perceptron (ResMLP) as a continuous neural field conditioned on spatial coordinates and transmitter settings. This continuous parameterization provides a more flexible representation of the contrast source and improves reconstruction accuracy and robustness under noisy measurements. Building on this representation, the state equation and data equation are combined with total-variation regularization to form a differentiable objective function. By reformulating the VIE-constrained inversion as an end-to-end differentiable optimization problem, the network parameters and the medium contrast are jointly optimized via automatic differentiation. Within the same framework, both full and phaseless data inversion are accommodated by only modifying the data misfit function. Numerical experiments demonstrate that this scheme yields higher reconstruction accuracy and robustness than conventional CSI across a range of noise levels and measurement settings. The continuous neural field further enables super-resolution inference at resolutions finer than the training grid, decoupling inversion cost from reconstruction fidelity. Ablation studies and comparisons with alternative neural architectures further confirm that the contrast source parameterization and VIE-based formulation are both essential to the observed improvements.

URL PDF HTML ☆

赞 0 踩 0

2508.00775 2026-06-05 eess.SY cs.LG cs.SY math.OC

生成器介导的老虎机：面向生成式人工智能的自适应干预的汤普森采样

Marc Brooks, Gabriel Durham, Kihyuk Hong, Ambuj Tewari

发表机构 * Department of Statistics, University of Michigan, Ann Arbor, MI (USA)（密歇根大学统计学系，安阿伯，MI (美国)）

AI总结本文提出了一种生成器介导的老虎机算法（GAMBITTS），用于解决生成式人工智能（GenAI）驱动的自适应干预问题。该算法通过建模治疗和奖励生成过程，利用观察到的治疗信息加速策略学习，并在模拟研究中优于传统算法。

Comments 39 pages, 12 figures

详情

Journal ref: Advances in Neural Information Processing Systems 38 (NeurIPS 2025)

AI中文摘要

近期生成式人工智能（GenAI）模型的进步使生成个性化内容成为可能，该内容能够适应最新的用户情境。尽管个性化决策系统通常采用老虎机建模，但GenAI的引入为经典序列学习问题带来了新的结构。在GenAI驱动的干预中，智能体选择查询，但环境会经历由生成模型产生的随机响应。标准老虎机方法并未显式考虑这种结构，其中动作仅通过随机、观察到的治疗影响奖励。我们引入生成器介导的老虎机-汤普森采样（GAMBITTS），一种针对这种动作/治疗分割设计的老虎机方法，以移动健康干预中的大型语言模型生成文本作为动机案例。GAMBITTS显式建模治疗和奖励生成过程，利用所交付的治疗信息，相对于标准方法加速策略学习。我们通过分解治疗和奖励中的不确定性来源，建立了GAMBITTS的遗憾界，并识别了其在某些条件下优于标准老虎机方法的保证条件。在模拟研究中，GAMBITTS通过利用观察到的治疗更准确地估计预期奖励，始终优于传统算法。

英文摘要

Recent advances in generative artificial intelligence (GenAI) models have enabled the generation of personalized content that adapts to up-to-date user context. While personalized decision systems are often modeled using bandit formulations, the integration of GenAI introduces new structure into otherwise classical sequential learning problems. In GenAI-powered interventions, the agent selects a query, but the environment experiences a stochastic response drawn from the generative model. Standard bandit methods do not explicitly account for this structure, where actions influence rewards only through stochastic, observed treatments. We introduce generator-mediated bandit-Thompson sampling (GAMBITTS), a bandit approach designed for this action/treatment split, using mobile health interventions with large language model-generated text as a motivating case study. GAMBITTS explicitly models both the treatment and reward generation processes, using information in the delivered treatment to accelerate policy learning relative to standard methods. We establish regret bounds for GAMBITTS by decomposing sources of uncertainty in treatment and reward, identifying conditions where it achieves stronger guarantees than standard bandit approaches. In simulation studies, GAMBITTS consistently outperforms conventional algorithms by leveraging observed treatments to more accurately estimate expected rewards.

URL PDF HTML ☆

赞 0 踩 0

2410.04309 2026-06-05 cs.CY cs.LG

Comprehensive Monitoring of Air Pollution Hotspots Using Sparse Sensor Networks

利用稀疏传感器网络全面监测空气污染热点

Ankit Bhardwaj, Ananth Balashankar, Shiva Iyer, Nita Soans, Anant Sudarshan, Rohini Pande, Lakshminarayanan Subramanian

发表机构 * New York University（纽约大学）； Google Research（谷歌研究）； Toyota InfoTechnology Center（丰田信息技术中心）； Kaiterra Inc（Kaiterra公司）； University of Warwick（沃里克大学）； Yale University（耶鲁大学）

AI总结本文通过结合预测建模和机理方法，利用新增的低成本传感器，发现新德里现有传感器网络之外的189个隐藏热点，并利用空间时间克里金法进行预测，同时开发了高斯烟雾扩散模型以解释热点形成机理，为资源受限环境下的空气污染管理提供了数据驱动和机理结合的解决方案。

详情

DOI: 10.1145/3748821

AI中文摘要

城市空气污染热点对健康构成重大威胁，但其检测和分析仍然受到公共传感器网络稀疏性的限制。本文通过结合预测建模和机理方法，全面监测污染热点。我们通过在新德里现有传感器网络中增加28个低成本传感器，收集了2018年5月1日至2020年11月1日期间30个月的PM2.5数据。应用已建立的热点定义，我们发现了除公共网络检测的660个热点外，还有189个隐藏热点。利用预测技术如空间时间克里金法，我们在50%的传感器故障率下实现了95%的精度和88%的召回率，在50%的缺失传感器情况下实现了98%的精度和95%的召回率。我们的预测模型的预期结果进一步被编译成政策建议，供公共当局参考。此外，我们开发了高斯烟雾扩散模型以理解热点形成的机理，结合了从本地来源衍生的排放清单。我们的机理模型能够解释65%的观测到的瞬时热点。我们的发现强调了在资源受限环境中，整合数据驱动的预测模型与基于物理的机理模型对于可扩展和稳健的空气污染管理的重要性。

英文摘要

Urban air pollution hotspots pose significant health risks, yet their detection and analysis remain limited by the sparsity of public sensor networks. This paper addresses this challenge by combining predictive modeling and mechanistic approaches to comprehensively monitor pollution hotspots. We enhanced New Delhi's existing sensor network with 28 low-cost sensors, collecting PM2.5 data over 30 months from May 1, 2018, to Nov 1, 2020. Applying established definitions of hotspots to this data, we found the existence of additional 189 hidden hotspots apart from confirming 660 hotspots detected by the public network. Using predictive techniques like Space-Time Kriging, we identified hidden hotspots with 95% precision and 88% recall with 50% sensor failure rate, and with 98% precision and 95% recall with 50% missing sensors. The projected results of our predictive models were further compiled into policy recommendations for public authorities. Additionally, we developed a Gaussian Plume Dispersion Model to understand the mechanistic underpinnings of hotspot formation, incorporating an emissions inventory derived from local sources. Our mechanistic model is able to explain 65% of observed transient hotspots. Our findings underscore the importance of integrating data-driven predictive models with physics-based mechanistic models for scalable and robust air pollution management in resource-constrained settings.

URL PDF HTML ☆

赞 0 踩 0

2412.06259 2026-06-05 eess.AS cs.SD

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

利用提示学习和暂停编码进行阿尔茨海默病检测

Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

发表机构 * National Social Science Foundation of China（中华人民共和国国家社会科学基金）； Supercomputing Center of the USTC（中国科学技术大学超算中心）

AI总结本文提出通过提示学习和暂停信息编码改进基于转录文本的阿尔茨海默病检测，利用提示模板将分类任务转化为掩码语言建模任务，并通过比较不同自动语音识别模型和集成技术，达到95.8%的检测准确率。

Comments Accepted by ISCSLP 2024

详情

DOI: 10.1109/ISCSLP63861.2024.10799971
Journal ref: Proc. IEEE ISCSLP 2024, pp. 486-490, 2024

AI中文摘要

与其它临床筛查技术相比，基于语音和语言的自动化阿尔茨海默病（AD）检测方法具有非侵入性、成本效益和便利性。先前研究已证明微调预训练语言模型（PLMs）在AD检测中的有效性。然而，传统微调方法仅输入转录文本，其目标与PLMs预训练阶段使用的掩码语言建模（MLM）任务不一致。本文研究了基于提示的PLMs微调方法，通过在转录输入中插入提示模板将分类任务转化为MLM任务。同时探索了将强制对齐中的暂停信息纳入手动转录的影响。此外，我们比较了各种自动语音识别（ASR）模型的性能，并选择Whisper模型生成基于ASR的转录文本与手动转录进行比较。此外，跨不同PLMs（BERT和RoBERTa）使用不同随机种子应用多数投票和集成技术。最终，使用手动转录文本获得最大检测准确率为95.8%（均值87.9%，标准差3.3%），在ADReSS测试集上实现了仅使用转录文本进行AD检测的最先进性能。

通过AI赋能的钙组学增强心血管风险预测

Ammar Hoori, Sadeer Al-Kindi, Tao Hu, Yingnan Song, Hao Wu, Juhwan Lee, Nour Tashtish, Pingfu Fu, Robert Gilkeson, Sanjay Rajagopalan, David L. Wilson

发表机构 * Department of Biomedical Engineering, Case Western Reserve University（生物医学工程系，凯斯西储大学）； Harrington Heart and Vascular Institute, University Hospitals Cleveland Medical Center（哈灵顿心脏和血管研究所，克利夫兰医学中心）； School of Medicine, Case Western Reserve University（医学院，凯斯西储大学）； Department of Population and Quantitative Health Sciences, Case Western Reserve University（人口与定量健康科学系，凯斯西储大学）； Department of Radiology, University Hospitals Cleveland Medical Center（放射科，克利夫兰医学中心）； Department of Radiology, Case Western Reserve University（放射科，凯斯西储大学）

AI总结本文通过利用详细的钙沉积特征（即钙组学）结合AI方法，提高了主要不良心血管事件（MACE）预测的准确性，展示了钙组学在心血管风险预测中的应用价值。

Comments 12 pages, 8 figures, 2 tables, 4 pages supplemental, journal paper format (under review)

详情

DOI: 10.1038/s41598-024-60584-8

AI中文摘要

背景. 冠状动脉钙化（CAC）是预测主要不良心血管事件（MACE）的强大预测因子。传统的Agatston评分只是简单地将钙含量相加，尽管是非线性方式，但仍有改进钙沉积评估的空间，以更全面地捕捉疾病程度。目标. 确定是否可以通过使用详细的钙沉积特征（即钙组学）的AI方法来提高MACE预测。方法. 我们研究了钙沉积的其他特征，包括质量、体积、密度、空间分布、区域等的评估。我们使用带有弹性网络正则化的Cox模型，在2457例CT钙化评分（CTCS）中，该评分富集了MACE事件，来源于一个大型无成本CLARIFY计划（ClinicalTrials.gov标识符：NCT04075162）。我们采用了采样技术来增强模型训练。我们还研究了使用选定特征的Cox模型，以识别可解释的高风险特征。结果. 我们提出的钙组学模型，通过修改的合成下采样和上采样，给出了C指数（80.5%/71.6%）和两年AUC（82.4%/74.8%）（80:20，训练/测试），分别（采样仅应用于训练集）。结果优于Agatston，后者给出了C指数（71.3%/70.3%）和AUC（71.8%/68.8%）。在钙组学特征中，钙化数量、左前降支质量及扩散率（空间分布的度量）是增加风险的重要决定因素，而致密钙化（>1000HU）与较低风险相关。钙组学模型在保留测试中将63%的MACE患者重新分类到高风险组。分类净再分类指数为NRI=0.153。结论. AI分析冠状动脉钙化可比Agatston评分产生更好的结果。我们的发现表明，钙组学在改进风险预测中的应用价值。

英文摘要

Background. Coronary artery calcium (CAC) is a powerful predictor of major adverse cardiovascular events (MACE). Traditional Agatston score simply sums the calcium, albeit in a non-linear way, leaving room for improved calcification assessments that will more fully capture the extent of disease. Objective. To determine if AI methods using detailed calcification features (i.e., calcium-omics) can improve MACE prediction. Methods. We investigated additional features of calcification including assessment of mass, volume, density, spatial distribution, territory, etc. We used a Cox model with elastic-net regularization on 2457 CT calcium score (CTCS) enriched for MACE events obtained from a large no-cost CLARIFY program (ClinicalTri-als.gov Identifier: NCT04075162). We employed sampling techniques to enhance model training. We also investigated Cox models with selected features to identify explainable high-risk characteristics. Results. Our proposed calcium-omics model with modified synthetic down sampling and up sampling gave C-index (80.5%/71.6%) and two-year AUC (82.4%/74.8%) for (80:20, training/testing), respectively (sampling was applied to the training set only). Results compared favorably to Agatston which gave C-index (71.3%/70.3%) and AUC (71.8%/68.8%), respectively. Among calcium-omics features, numbers of calcifications, LAD mass, and diffusivity (a measure of spatial distribution) were important determinants of increased risk, with dense calcification (>1000HU) associated with lower risk. The calcium-omics model reclassified 63% of MACE patients to the high risk group in a held-out test. The categorical net-reclassification index was NRI=0.153. Conclusions. AI analysis of coronary calcification can lead to improved results as compared to Agatston scoring. Our findings suggest the utility of calcium-omics in improved prediction of risk.

URL PDF HTML ☆

赞 0 踩 0

0911.2381 2026-06-05 physics.data-an cond-mat.stat-mech cs.LG nlin.CD stat.ME

Analytical Determination of Fractal Structure in Stochastic Time Series

随机时间序列中分形结构的解析确定

Fermín Moscoso del Prado Martín

发表机构 * Laboratoire de Psychologie Cognitive ( UMR --6146) CNRS \& Aix--Marseille Universit\'e I, Marseille, France

AI总结本文提出了一种基于贝叶斯评估的分析框架，用于客观准确地推断时间序列的分形结构，同时推导出一种优于现有方法的Hurst指数最大似然估计器。

Comments 9 pages, 4 figures

2606.06201 2026-06-05 cs.AI

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

学习补货：面向医药供应链动态库存管理的混合深度强化学习

Amandeep Kaur, Gyan Prakash

AI总结针对医药供应链中需求不确定和前置时间变化导致的库存管理难题，提出一种混合异步优势演员评论家分布式近端策略优化（A3C DPPO）算法，实现连续动作空间下的最优补货策略，降低库存成本并提高服务水平。

Comments Nil

详情

AI中文摘要

医药供应链（PSCs）因不可预测的需求模式和与补货相关的可变前置时间，在库存管理（IM）方面面临挑战。药品的有限保质期进一步加剧了这种复杂性，需要在充足库存和最小浪费之间取得微妙的平衡。这些相互交织的因素构成了一个复杂的优化问题，需要复杂的库存策略来确保产品可用性和PSC效率。本研究旨在为医药产品开发一种最优库存补货策略，能够处理由不确定需求和可变PSC条件产生的随机性。目标是最大化PSC的盈利能力，同时保持较高的患者服务水平。我们将问题建模为马尔可夫决策过程，并提出一种深度强化学习（DRL）方法，具体为混合异步优势演员评论家分布式近端策略优化（A3C DPPO）算法。该A3C DPPO算法针对IM中固有的连续动作空间进行了定制。数值结果表明，所提算法在动态场景下自适应更新库存补货策略，与各种基准相比，实现了更低的库存成本。我们还使用真实药品库存数据进行了数值验证，以确认所提算法的实际可行性。

英文摘要

Pharmaceutical supply chains (PSCs) struggle with inventory management (IM) due to unpredictable demand patterns and variable lead times associated with restocking. This complexity is further compounded by the finite shelf lives of pharmaceutical products, which necessitate a delicate balance between adequate stock and minimal waste. These intertwined factors create a complex optimization problem that requires sophisticated inventory strategies to ensure both product availability and PSC efficiency. This study aims to develop an optimal inventory replenishment policy for pharmaceutical products that can handle the stochasticity arising from uncertain demand and variable PSC conditions. The objective is to maximize the profitability of the PSC while maintaining a high patient service level. We formulate the problem as a Markov decision process and propose a deep reinforcement learning (DRL) approach, specifically, a hybrid asynchronous advantage actor critic distributed proximal policy optimization (A3C DPPO)algorithm. The A3C DPPO algorithm is tailored to handle the continuous action space inherent in IM. The numerical results demonstrate that the proposed algorithm adaptively updates the inventory replenishment strategy under dynamic scenarios, resulting in lower inventory costs compared to various benchmarks. We also conduct numerical validation using real-world pharmaceutical inventory data to confirm the practical feasibility of the proposed algorithm.

URL PDF HTML ☆

赞 0 踩 0

2606.05611 2026-06-05 cs.CV

用稀疏逼近方法学习PDE的解算子

Sebastian Neumayer, Daniel Potts, Fabian Taubert

AI总结本文提出一种结合乘积基展开与正交匹配追踪的稀疏高维方法，用于逼近偏微分方程的解算子，显著减少所需样本量，并在数值实验中与立方体稀疏逼近和傅里叶神经算子对比，展示了在稀疏表示下的准确性和可解释性。

详情

AI中文摘要

我们研究了使用稀疏高维技术逼近偏微分方程（PDE）解算子的问题。基于维度增量框架，我们将乘积基展开与稀疏恢复方法（特别是正交匹配追踪（OMP））相结合，与先前考虑的基于立方体的方法相比，大幅减少了所需样本量。我们在多个数值示例上评估了所得方法，在准确性、运行时间和样本量方面与基于立方体的稀疏逼近和傅里叶神经算子进行了比较。实验表明，我们的方法相对于其前身显著减少了所需的PDE求解次数，同时保持了有竞争力的准确性，特别是当解在所选基中具有稀疏表示时。此外，恢复的稀疏索引集为相关变量和参数交互提供了可解释的见解。

英文摘要

We investigate the approximation of solution operators for partial differential equations (PDEs) using sparse high-dimensional techniques. Building on a dimension-incremental framework, we combine product basis expansions with sparse recovery methods, specifically orthogonal matching pursuit (OMP), to substantially reduce the required sample size compared with a previously considered cubature-based approach. We evaluate the resulting method numerically on several examples, comparing it against both cubature-based sparse approximation and Fourier neural operators in terms of accuracy, runtime, and sample size. The experiments show that our approach considerably reduces the number of required PDE solves relative to its predecessor while maintaining competitive accuracy, particularly when the solution admits a sparse representation in the chosen basis. Furthermore, the recovered sparse index sets yield interpretable insights into the relevant variables and parameter interactions.

URL PDF HTML ☆

赞 0 踩 0

2602.23665 2026-06-05 cs.IR cs.LG cs.SI

Geodesic Semantic Search: Cartographic Navigation of Citation Graphs with Learned Local Riemannian Maps

测地语义搜索：基于学习局部黎曼度量的引文图导航

Brandon Yee, Lucas Wang, Kundana Kommini

AI总结本文提出Geodesic Semantic Search (GSS)，通过在引文图上学习节点特定的黎曼度量，实现几何感知的语义检索。不同于传统基于嵌入的检索依赖固定欧几里得距离，GSS在每个节点学习低秩度量张量，诱导局部正定度量，从而在保持模型可计算性的同时保证有效度量。检索过程通过多源Dijkstra算法在学习的测地距离上进行，随后通过最大边际相关性重排序和路径一致性过滤。在包含169,000篇arXiv论文的引文预测基准上，GSS在Recall@20上比SPECTER+FAISS基线提升了23%。我们提供了Bridge Recovery Guarantee，描述了测地检索在定性上优于直接相似性的情况，以及训练损失与检索质量的边际分离结果，并刻画了低秩度量参数化的表达能力。我们的分层粗到细检索方法结合k-means池化，将计算成本降低4倍，同时保持97%的检索质量。

Comments Substantial Revision Required

详情

AI中文摘要

我们提出了Geodesic Semantic Search (GSS)，一种检索系统，通过在引文图上学习节点特定的黎曼度量，以实现几何感知的语义检索。不同于标准基于嵌入的检索依赖固定欧几里得距离，\gss{}在每个节点学习一个低秩度量张量$\mL_i \in \R^{d imes r}$，诱导一个局部正定度量$\mG_i = \mL_i \mL_i^ op + \eps \mI$。这种参数化保证了有效的度量，同时保持模型的可计算性。检索过程通过在学习的测地距离上进行多源Dijkstra算法，随后通过最大边际相关性重排序和路径一致性过滤。在包含169,000篇arXiv论文的引文预测基准上，GSS在Recall@20上比SPECTER+FAISS基线提高了23%。我们提供了Bridge Recovery Guarantee，描述了测地检索在定性上优于直接相似性的情况，以及训练损失与检索质量的边际分离结果，并刻画了低秩度量参数化的表达能力。我们的分层粗到细检索方法结合k-means池化，将计算成本降低4倍，同时保持97%的检索质量。

英文摘要

We present Geodesic Semantic Search (GSS), a retrieval system that learns node-specific Riemannian metrics on citation graphs to enable geometry-aware semantic search. Unlike standard embedding-based retrieval that relies on fixed Euclidean distances, \gss{} learns a low-rank metric tensor $\mL_i \in \R^{d \times r}$ at each node, inducing a local positive semi-definite metric $\mG_i = \mL_i \mL_i^\top + \eps \mI$. This parameterization guarantees valid metrics while keeping the model tractable. Retrieval proceeds via multi-source Dijkstra on the learned geodesic distances, followed by Maximal Marginal Relevance reranking and path coherence filtering. On citation prediction benchmarks with 169K arXiv papers, GSS achieves 23\% relative improvement in Recall@20 over SPECTER+FAISS baselines. We provide a Bridge Recovery Guarantee characterizing when geodesic retrieval qualitatively outperforms direct similarity, a margin separation result connecting training loss to retrieval quality, and characterize the expressiveness of low-rank metric parameterization. Our hierarchical coarse-to-fine search with k-means pooling reduces computational cost by $4\times$ while maintaining 97\% retrieval quality.

URL PDF HTML ☆

赞 0 踩 0

2603.10457 2026-06-05 physics.plasm-ph cond-mat.stat-mech cs.LG physics.acc-ph

Beam-Plasma Collective Oscillations in Intense Charged-Particle Beams: Dielectric Response Theory, Langmuir Wave Dispersion, and Unsupervised Detection via Prometheus

强流带电粒子束中束-等离子体集体振荡：介电响应理论、朗缪尔波色散以及通过Prometheus的无监督检测

Brandon Yee, Wilson Collins, Michael Iofin, Jiayi Fu

AI总结本文研究了强流带电粒子束中束-等离子体集体振荡的理论和计算框架，通过介电响应理论、朗缪尔波色散关系以及Prometheus算法验证了束-等离子体过渡的特性，展示了其在中间能区的应用前景。

Comments Substantial Revision Required

详情

AI中文摘要

我们开发了一个理论和计算框架，用于研究强流带电粒子束在中间能量（10-100 MeV）下的束-等离子体集体振荡。在第一部分，我们建立了由Vlasov-Poisson系统支配的动能场理论，推导出三种束分布函数的Lindhard介电函数和随机相位近似（RPA）极化张量。我们通过介电函数epsilon(omega,q)=0证明了临界束密度n_c以上的未阻尼朗缪尔波模式的存在，获得了显式的束-等离子体色散关系，并表明Landau阻尼在粒子-空穴连续谱之上消失。等离子体频率Omega_p^2 = ne^2/(m*epsilon_0)通过f求和规则固定，与分布形状无关；更高的色散系数取决于速度矩。空间电荷效应驱动异常束展宽，具有sqrt(n-n_c)起始和q=2k_F处的Friedel振荡。束-等离子体过渡通过重整化群分析属于三维Ising普遍性类。在第二部分，我们利用Prometheus验证这些预测，Prometheus是基于静态结构因子数据S(q)训练的beta-VAE。Prometheus检测到高斯和均匀分布中的集体等离子体振荡起始，确认在退相干费米气体（n_c->0）中不存在，且在q=2k_F处解析了Kohn异常。通过PIC模拟得到的S(q,omega)色散分析验证了由f求和规则预测的分布无关的Omega_p。所有六个验证检查均通过。预测的特征——密度可调的等离子体共振在omega_p与sqrt(n)成正比、异常束展宽具有sqrt(n-n_c)起始以及Friedel振荡——在现有的中间能区束设施中是可访问的。

英文摘要

We develop a theoretical and computational framework for beam-plasma collective oscillations in intense charged-particle beams at intermediate energies (10-100 MeV). In Part I, we formulate a kinetic field theory governed by the Vlasov-Poisson system, deriving the Lindhard dielectric function and random phase approximation (RPA) polarization tensor for three beam distribution functions. We prove via the dielectric function epsilon(omega,q)=0 the existence of undamped Langmuir wave modes above a critical beam density n_c, obtain explicit beam-plasma dispersion relations, and show that Landau damping vanishes above the particle-hole continuum. The plasma frequency Omega_p^2 = ne^2/(m*epsilon_0) is fixed by the f-sum rule independently of distribution shape; higher dispersion coefficients depend on velocity moments. Space charge effects drive anomalous beam broadening with sqrt(n-n_c) onset and Friedel oscillations at q=2k_F. The beam-plasma transition belongs to the 3D Ising universality class via renormalization group analysis. In Part II, we validate these predictions using Prometheus, a beta-VAE trained on static structure factor data S(q) from particle-in-cell (PIC) beam simulations. Prometheus detects collective plasma oscillation onset in Gaussian and uniform distributions, confirms their absence in the degenerate Fermi gas (n_c -> 0), and resolves the Kohn anomaly at q=2k_F. Dispersion analysis of S(q,omega) from PIC simulations verifies the distribution-independent Omega_p predicted by the f-sum rule. All six validation checks pass. Predicted signatures -- density-tunable plasma resonances at omega_p proportional to sqrt(n), anomalous beam broadening with sqrt(n-n_c) onset, and Friedel oscillations -- are accessible at existing intermediate-energy beam facilities.

URL PDF HTML ☆

赞 0 踩 0

2606.06495 2026-06-05 astro-ph.CO

What it takes to solve the Hubble tension through Modifications of Cosmological Recombination II: in light of ACT DR6 and DESI DR2

通过修改宇宙学重组解决哈勃张力需要什么 II：基于 ACT DR6 和 DESI DR2

Nanoom Lee, Tianji Zhou

AI总结基于 ACT DR6 和 DESI DR2 数据，通过时变电子质量 $m_e(z)$ 寻找最小修改以解决哈勃张力，发现仅用 CMB 数据可完全解决，但加入 DESI BAO 后无法完全解决。

Comments 7+3 pages, 5 figures. Comments are welcome

详情

AI中文摘要

我们基于来自阿塔卡马宇宙学望远镜（ACT DR6）和暗能量光谱仪（DESI DR2）的最新数据，构建了数据驱动的哈勃张力解决方案。我们通过时变电子质量 $m_e(z)$ 寻找对重组历史的最小修改，该修改使从 CMB 数据推断的最佳拟合 $H_0$ 向 SH0ES 值增加，同时不恶化数据拟合。使用包括透镜效应的 Planck 和 ACT 数据，我们发现对 $m_e(z)$ 的微扰修改完全解决了哈勃张力，该解与之前仅使用 Planck 数据的工作具有相同的定性振荡结构，表明其对包含更精确和独立的 CMB 数据的鲁棒性。作为副产品，该解也缓解了 $S_8$ 张力。然而，一旦加入 DESI DR2 BAO 数据，对 $m_e(z)$ 的微扰修改无法完全解决哈勃张力。这反映了相同的基本限制：通过修改重组提高 $H_0$ 通常会降低 $\Omega_m$，与晚期宇宙学观测不一致。

英文摘要

We construct data-driven solutions to the Hubble tension, in light of recent data from the Atacama Cosmology Telescope (ACT DR6) and the Dark Energy Spectroscopic Instrument (DESI DR2). We search for the minimal modification to the recombination history through a time-varying electron mass $m_e(z)$ that increases the best-fit $H_0$ inferred from CMB data toward the SH0ES value, without worsening the fit to the data. Using Planck and ACT data including lensing, we find a perturbative modification to $m_e(z)$ that fully resolves the Hubble tension, with the solution sharing the same qualitative oscillatory structure as in previous work using Planck data alone, demonstrating its robustness to the inclusion of more precise and independent CMB data. As a byproduct, the solution also eases the $S_8$ tension. Once DESI DR2 BAO data are added, however, perturbative modifications to $m_e(z)$ cannot fully resolve the Hubble tension. This reflects the same fundamental limitation: raising $H_0$ by modifying recombination generically lowers $Ω_m$, being inconsistent with late-time cosmological observations.

URL PDF HTML ☆

赞 0 踩 0