arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2042
2509.20324 2026-06-05 cs.CR cs.AI

RAG Security and Privacy: Formalizing the Threat Model and Attack Surface

RAG安全与隐私:形式化威胁模型和攻击面

Atousa Arzanipour, Rouzbeh Behnia, Reza Ebrahimi, Kaushik Dutta

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文研究了RAG系统中的安全与隐私问题,提出首个形式化的威胁模型,定义了攻击向量如文档级成员推断和数据中毒,以提升对RAG系统隐私和安全性的理解。

Comments Published at the 5th ICDM Workshop in November 2025

详情
Journal ref
2025 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 1387-1394, 2025
AI中文摘要

检索增强生成(RAG)是一种新兴的自然语言处理方法,结合大型语言模型(LLMs)与外部文档检索以生成更准确和基于事实的响应。尽管RAG在减少幻觉和提高事实一致性方面表现出色,但其也引入了与传统LLMs不同的隐私和安全挑战。现有研究表明,LLMs可通过训练数据记忆或对抗性提示泄露敏感信息,而RAG系统继承了许多这些漏洞。同时,RAG依赖外部知识库打开了新的攻击面,包括可能泄露检索文档的存在或内容信息,或注入恶意内容以操控模型行为。尽管存在这些风险,目前尚无正式框架定义RAG系统的威胁景观。本文通过提出首个形式化的RAG威胁模型,填补了文献中的关键空白。我们引入了基于对模型组件和数据访问的对手类型的结构化分类,并正式定义了关键威胁向量,如文档级成员推断和数据中毒,这些向量在实际部署中对隐私和完整性构成严重风险。通过建立正式定义和攻击模型,本文为更严谨和原则性的理解RAG系统的隐私和安全奠定了基础。

英文摘要

Retrieval-Augmented Generation (RAG) is an emerging approach in natural language processing that combines large language models (LLMs) with external document retrieval to produce more accurate and grounded responses. While RAG has shown strong potential in reducing hallucinations and improving factual consistency, it also introduces new privacy and security challenges that differ from those faced by traditional LLMs. Existing research has demonstrated that LLMs can leak sensitive information through training data memorization or adversarial prompts, and RAG systems inherit many of these vulnerabilities. At the same time, reliance of RAG on an external knowledge base opens new attack surfaces, including the potential for leaking information about the presence or content of retrieved documents, or for injecting malicious content to manipulate model behavior. Despite these risks, there is currently no formal framework that defines the threat landscape for RAG systems. In this paper, we address a critical gap in the literature by proposing, to the best of our knowledge, the first formal threat model for retrieval-RAG systems. We introduce a structured taxonomy of adversary types based on their access to model components and data, and we formally define key threat vectors such as document-level membership inference and data poisoning, which pose serious privacy and integrity risks in real-world deployments. By establishing formal definitions and attack models, our work lays the foundation for a more rigorous and principled understanding of privacy and security in RAG systems.

2509.02971 2026-06-05 stat.ML cs.LG cs.NA math.NA math.PR

Scale-Adaptive Generative Flows for Multiscale Scientific Data

多尺度科学数据的自适应生成流

Yifan Chen, Eric Vanden-Eijnden

发表机构 * Department of Mathematics, University of California, Los Angeles(加州大学洛杉矶分校数学系) Machine Learning Lab, Capital Fund Management(资本基金管理有限公司机器学习实验室) Courant Institute, New York University(纽约大学柯朗研究所)

AI总结 本文提出了一种多尺度科学数据生成模型,通过设计噪声分布和插值计划,解决多尺度傅里叶谱数据中的数值挑战,提高了生成样本的质量和效率。

详情
AI中文摘要

基于流的生成模型在处理具有多尺度傅里叶谱的科学数据时常常面临数值挑战,通常在细尺度上产生较大的误差。我们通过在流匹配和随机插值框架内,通过噪声分布和插值计划的原理性设计来解决这个问题。在函数空间中工作可以确保生成模型在分辨率细化时仍然定义良好;漂移的Lipschitz正则性对这种函数空间的良定义性和固定分辨率下的积分成本都很重要。核心观察是噪声应至少与目标分布一样粗糙——通过傅里叶谱衰减来衡量——以保持Lipschitz常数有限。对于已知细尺度结构的高斯和近高斯目标,匹配谱噪声比标准白噪声选择更有效。对于更复杂的非高斯目标,匹配谱噪声可能不足以应对噪声比数据粗糙时出现的终端时间刚性问题,我们提出自适应插值计划来缓解这种情况。在合成高斯随机场和随机Allen-Cahn和Navier-Stokes方程不变测度上的数值实验展示了该方法,并证明了其在传统方法基础上以更低计算成本生成高质量样本的能力。

英文摘要

Flow-based generative models can face numerical challenges on scientific data with multiscale Fourier spectra, often producing large errors at fine scales. We approach this problem within the flow matching and stochastic interpolants framework, through the principled design of noise distributions and interpolation schedules. Working in function space ensures that the generative model remains well defined as the resolution is refined; the Lipschitz regularity of the drift is important to both this function-space well-posedness and the integration cost at fixed resolution. The central observation is that the noise should be at least as rough as the target distribution -- measured by Fourier-spectrum decay -- in order to keep the Lipschitz constant finite. For Gaussian and near-Gaussian targets whose fine-scale structure is known, matched-spectrum noise improves numerical efficiency over standard white-noise choices. For more complex non-Gaussian targets, matched-spectrum noise may not be sufficient, and we propose scale-adaptive interpolation schedules to mitigate the terminal-time stiffness that arises when the noise is rougher than the data. Numerical experiments on synthetic Gaussian random fields and on invariant measures of the stochastic Allen--Cahn and Navier--Stokes equations illustrate the approach and demonstrate its ability to generate high-fidelity samples at lower computational cost than traditional approaches.

2508.20693 2026-06-05 cs.DL cs.CL

Leveraging Large Language Models for Generating Research Topic Ontologies: A Multi-Disciplinary Study

利用大型语言模型生成研究主题本体:多学科研究

Tanay Aggarwal, Angelo Salatino, Francesco Osborne, Enrico Motta

发表机构 * Knowledge Media Institute, The Open University(开放大学知识媒体学院) The Open University(开放大学) University of Milano Bicocca(米兰比克卡大学) Department of Business and Law, University of Milano Bicocca(米兰比克卡大学商学院与法学院)

AI总结 本文研究了大型语言模型在生物医学、物理和工程学三个学科中识别研究主题语义关系的能力,通过零样本提示、链式思维提示和在现有本体上微调三种条件评估模型性能,并引入PEM-Rel-8K数据集验证跨学科迁移能力。

详情
AI中文摘要

研究领域本体和分类法对于管理和组织科学知识至关重要,因为它们有助于信息的高效分类、传播和检索。然而,创建和维护此类本体是昂贵且耗时的任务,通常需要多个领域专家的协同工作。因此,此类本体在不同学科中的覆盖程度不均,学科间连接有限,更新周期也较短。在本研究中,我们探讨了几种大型语言模型在生物医学、物理和工程学三个学科中识别研究主题间语义关系的能力。模型在三种不同的条件下进行评估:零样本提示、链式思维提示和在现有本体上微调。此外,我们通过测量模型在某一学科训练后应用到不同学科的表现,评估了微调模型的跨学科迁移能力。为了支持这项分析,我们引入了PEM-Rel-8K数据集,该数据集包含从生物医学、物理和工程学三个学科中最广泛采用的分类法中提取的超过8000个关系。我们的实验表明,将大型语言模型微调到PEM-Rel-8K上在所有学科中都表现出色。

英文摘要

Ontologies and taxonomies of research fields are critical for managing and organising scientific knowledge, as they facilitate efficient classification, dissemination and retrieval of information. However, the creation and maintenance of such ontologies are expensive and time-consuming tasks, usually requiring the coordinated effort of multiple domain experts. Consequently, ontologies in this space often exhibit uneven coverage across different disciplines, limited inter-discipline connectivity, and infrequent updating cycles. In this study, we investigate the capability of several large language models to identify semantic relationships among research topics within three academic disciplines: biomedicine, physics, and engineering. The models were evaluated under three distinct conditions: zero-shot prompting, chain-of-thought prompting, and fine-tuning on existing ontologies. Additionally, we assessed the cross-discipline transferability of fine-tuned models by measuring their performance when trained in one discipline and subsequently applied to a different one. To support this analysis, we introduce PEM-Rel-8K, a novel dataset consisting of over 8,000 relationships extracted from the most widely adopted taxonomies in the three disciplines considered in this study: MeSH, PhySH, and IEEE. Our experiments demonstrate that fine-tuning LLMs on PEM-Rel-8K yields excellent performance across all disciplines.

2508.19006 2026-06-05 q-fin.PR cs.LG econ.EM q-fin.CP

Is attention truly all we need? An empirical study of asset pricing in pretrained RNN sparse and global attention models

注意力真的全部我们需要吗?对预训练RNN稀疏和全局注意力模型在资产定价中的实证研究

Shanyan Lai

发表机构 * Department of Economics and Related Studies, Univiersity of York(经济与相关研究系,约克大学)

AI总结 本文研究了预训练RNN注意力模型在资产定价中的应用,探讨了注意力机制在捕捉时间依赖性和长期记忆方面的改进,以及在不同市场条件下的稳定性。

Comments 72 pages including appendix

详情
AI中文摘要

本研究探讨了主流注意力机制,如加权注意力、Luong的三种注意力、全局自注意力和滑动窗口稀疏注意力,在顶级420只大型美国股票上的实证资产定价研究。这是首次将大规模最先进的(SOTA)注意力机制应用于资产定价领域。这些模型克服了传统机器学习资产定价方法的局限性,如误捕时间依赖性和短期记忆。此外,注意力机制中的强制因果掩码解决了未来数据泄漏问题,而这一问题被更先进的注意力模型如经典Transformer所忽视。所提出的注意力模型还考虑了资产定价数据的时间稀疏性,并通过部署简化模型结构来缓解潜在的过拟合问题。本文为未来实证经济研究提供了某些见解。所有模型均在三个时期内进行测试,涵盖新冠前、新冠期间和新冠后一年,以测试这些模型在极端市场条件下的稳定性。研究发现,在价值加权投资组合回测中,全局自注意力模型和滑动窗口稀疏注意力模型在获得绝对收益和对冲下行风险方面表现出色,在新冠期间静态交易成本情景下,它们分别实现了2.0和1.80的年化Sortino比率。此外,从绝对投资组合收益的角度来看,滑动窗口稀疏注意力模型在股票市值大小方面比全局自注意力模型表现更加稳定。

英文摘要

This study investigates the pre-trained RNN attention models with the mainstream attention mechanisms, such as additive attention, Luong's three attentions, global self-attention and sliding window sparse attention, for the empirical asset pricing research on the top 420 large-cap US stocks. This is the first paper on the large-scale state-of-the-art (SOTA) attention mechanisms applied in the asset pricing context. They overcome the limitations of the traditional machine learning-based asset pricing, such as mis-capturing the temporal dependency and short memory. Moreover, the enforced causal masks in the attention mechanisms address the future data leaking issue ignored by the more advanced attention-based models, such as the classic Transformer. The proposed attention models also consider the temporal sparsity characteristic of asset pricing data and mitigate potential overfitting issues by deploying the simplified model structures. This provides some insights for future empirical economic research. All models are examined in three periods, which cover pre-COVID-19, COVID-19 and one year post-COVID-19, for testing the stability of these models under extreme market conditions. The study finds that in value-weighted portfolio back testing, the global self-attention model and the sliding window sparse attention model exhibit excellent capabilities in deriving the absolute returns and hedging downside risks, while they achieve an annualized Sortino ratio of 2.0 and 1.80 respectively in the period with COVID-19 in the static transaction cost scenario. Moreover, the sliding window sparse attention model performs more stably than the global self-attention model from the perspective of absolute portfolio returns with respect to the size of stocks' market capitalization.

2508.10555 2026-06-05 physics.comp-ph cs.CE cs.LG

A Differentiable Framework for Full and Phaseless Data Inversion Using Neural Implicit Contrast-Source Representation

一种基于神经隐式对比源表示的全数据和相位less数据反演可微框架

Haoran Sun, Daoqi Liu, Hongyu Zhou, Maokun Li, Shenheng Xu, Fan Yang

发表机构 * Department of Electronic Engineering, Beijing National Research Center for Information Science and Technology (BNRist), and State Key laboratory of Space Network and Communications(电子工程系,北京信息科学与技术国家研究中心(BNRist),空间网络与通信国家重点实验室)

AI总结 本文提出了一种基于神经隐式对比源表示的可微框架,用于全数据和相位less数据反演,通过引入轻量级残差多层感知机作为连续神经场,提升了反演精度和鲁棒性,同时通过总变分正则化将状态方程和数据方程结合,形成可微目标函数,实现了端到端的可微优化。

详情
AI中文摘要

在本研究中,我们扩展了对比源反演,将其扩展为一个完全可微、无监督的框架,基于神经隐式表示的对比源。具体来说,而不是使用像素级离散表示,对比源由一个轻量级残差多层感知机(ResMLP)参数化,作为连续神经场,该神经场基于空间坐标和发射器设置进行条件化。这种连续参数化提供了更灵活的对比源表示,并在有噪声测量的情况下提高了重建精度和鲁棒性。基于此表示,状态方程和数据方程与总变分正则化相结合,形成一个可微的目标函数。通过将VIE约束反演重新公式化为一个端到端的可微优化问题,网络参数和介质对比率通过自动微分联合优化。在相同框架内,通过仅修改数据失配函数,同时支持全数据和相位less数据反演。数值实验表明,该方案在各种噪声水平和测量设置下,比传统CSI具有更高的重建精度和鲁棒性。连续神经场进一步使超分辨率推理成为可能,在训练网格更细的分辨率下实现,将反演成本与重建保真度解耦。消融研究和与替代神经架构的比较进一步确认,对比源参数化和基于VIE的公式化对于观察到的改进都是必不可少的。

英文摘要

In this study, we extend the contrast source inversion to a fully differentiable, unsupervised framework based on a neural implicit representation of the contrast source. Specifically, instead of a pixel-wise discrete representation, the contrast source is parameterized by a lightweight residual multilayer perceptron (ResMLP) as a continuous neural field conditioned on spatial coordinates and transmitter settings. This continuous parameterization provides a more flexible representation of the contrast source and improves reconstruction accuracy and robustness under noisy measurements. Building on this representation, the state equation and data equation are combined with total-variation regularization to form a differentiable objective function. By reformulating the VIE-constrained inversion as an end-to-end differentiable optimization problem, the network parameters and the medium contrast are jointly optimized via automatic differentiation. Within the same framework, both full and phaseless data inversion are accommodated by only modifying the data misfit function. Numerical experiments demonstrate that this scheme yields higher reconstruction accuracy and robustness than conventional CSI across a range of noise levels and measurement settings. The continuous neural field further enables super-resolution inference at resolutions finer than the training grid, decoupling inversion cost from reconstruction fidelity. Ablation studies and comparisons with alternative neural architectures further confirm that the contrast source parameterization and VIE-based formulation are both essential to the observed improvements.

2508.00775 2026-06-05 eess.SY cs.LG cs.SY math.OC

Learning to optimize with guarantees: a complete characterization of linearly convergent algorithms

学习优化并保证收敛性:线性收敛算法的完整表征

Andrea Martin, Ian R. Manchester, Luca Furieri

发表机构 * School of Electrical Engineering and Computer Science, and Digital Futures, KTH Royal Institute of Technology, Sweden(电气工程与计算机科学学院及数字未来学院,瑞典皇家理工学院) Australian Centre for Robotics and School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Australia(澳大利亚机器人中心及航空航天、机械与机电工程学院,澳大利亚悉尼大学) Department of Engineering Sciences, University of Oxford, United Kingdom(工程科学系,英国牛津大学)

AI总结 本文研究了如何通过改进算法在特定问题分布下的平均性能,提出了一种线性收敛算法的完整表征方法,展示了如何通过基线算法和可训练的指数衰减修改来实现线性收敛,并在非凸、梯度主导函数、强凸函数和多面体可行集优化中验证了其有效性。

详情
AI中文摘要

许多经典优化算法的设计受到线性收敛速率在问题类中的认证驱动。本文考虑了如何改进算法在特定问题实例分布下的平均性能。虽然可以通过将可训练组件嵌入算法更新中来解决这一任务,但关键挑战是保持整个问题类中的最坏保证。对于复合优化问题的类别,我们证明所有线性收敛算法都可以参数化为一个基线线性收敛算法和一组可训练的指数衰减修改其更新规则的参数;关键在于这种参数化排除了且仅排除了那些不收敛的算法。我们的结果适用于改进经典算法(如梯度下降用于非凸、梯度主导函数;Nesterov加速方法用于平滑强凸函数;投影梯度方法用于多面体可行集优化)的平均性能。我们展示了如何利用我们的表征来学习优化并保证线性收敛和可行性。数值结果展示了在求解病态线性方程组和在线性动力学系统上运行模型预测控制方案时,相较于经典优化器的优势。

英文摘要

The design of many classical optimization algorithms is driven by the certification of linear convergence rates over classes of optimization problems. In this paper, we consider the problem of improving the average-case performance of an algorithm over a specific distribution of problem instances. While this task can be tackled by embedding trainable components into the algorithm updates, a key challenge is to preserve worst-case guarantees across the entire problem class. For classes of composite optimization problems, we show that all linearly convergent algorithms can be parametrized in terms of a baseline linearly convergent algorithm, and a set of trainable, exponentially-decaying modifications to its update rule; crucially, this parametrization excludes all-and only-the algorithms that do not converge linearly. Our results apply to improving the average-case performance of classical algorithms such as gradient descent for nonconvex, gradient-dominated functions; Nesterov's accelerated method for smooth, strongly convex functions; and projected gradient methods for optimization over polyhedral feasible sets. We illustrate how our characterization can be used for learning to optimize with linear convergence and feasibility guarantees. Numerical results showcase benefits over classical optimizers when solving ill-conditioned systems of linear equations and running a model predictive control scheme on a linear dynamical system.

2503.04712 2026-06-05 math.OC cs.LG

Efficiently Escaping Saddle Points under Generalized Smoothness via Self-Bounding Regularity

通过广义平滑性实现高效逃离鞍点的自界正则性

Daniel Yiming Cao, August Y. Chen, Karthik Sridharan, Benjamin Tang

发表机构 * Cornell University(康奈尔大学)

AI总结 本文研究了非凸函数(不一定光滑)的一阶优化方法,提出了一种新的框架,系统分析了在广义平滑性下一阶优化算法的收敛性,首次建立了在广义平滑性下一阶方法达到二阶 stationary 点的收敛保证。

Comments Camera ready version of NeurIPS 2025 paper. 97 pages

详情
AI中文摘要

我们研究了非凸函数(不一定光滑)的一阶优化方法,其中梯度和/或Hessian是Lipschitz连续的。平滑性在机器学习的理论和实践中是一个限制性假设,推动了大量关于使用一阶方法寻找满足广义平滑性函数的一阶 stationary 点的研究。我们开发了一种新的框架,使我们能够系统地研究在广义平滑性下一大类一阶优化算法(我们称之为减少过程)的收敛性。我们将该框架应用于分析在广义平滑性下一阶优化算法收敛到一阶和二阶 stationary 点的收敛性。作为结果,我们建立了在广义平滑性下一阶方法达到二阶 stationary 点的首次收敛保证。我们证明了几个经典例子落在该框架内,并突显了实际意义。

英文摘要

We study the optimization of non-convex functions that are not necessarily smooth (gradient and/or Hessian are Lipschitz) using first order methods. Smoothness is a restrictive assumption in machine learning in both theory and practice, motivating significant recent work on finding first order stationary points of functions satisfying generalizations of smoothness with first order methods. We develop a novel framework that lets us systematically study the convergence of a large class of first-order optimization algorithms (which we call decrease procedures) under generalizations of smoothness. We instantiate our framework to analyze the convergence of first order optimization algorithms to first and \textit{second} order stationary points under generalizations of smoothness. As a consequence, we establish the first convergence guarantees for first order methods to second order stationary points under generalizations of smoothness. We demonstrate that several canonical examples fall under our framework, and highlight practical implications.

2506.19260 2026-06-05 cs.CR cs.DC cs.LG

Topology-Aware Differential Privacy in Federated Learning

基于拓扑的联邦学习差分隐私

Murtaza Rangwala, Richard O. Sinnott, Rajkumar Buyya

发表机构 * Quantum Cloud Computing and Distributed Systems (qCLOUDS) Lab, School of Computing and Information Systems, The University of Melbourne(量子云计算与分布式系统实验室,计算与信息系统学院,墨尔本大学)

AI总结 本文研究了联邦学习中通信拓扑对差分隐私的影响,提出了一种拓扑感知的分布推理方法TADI,通过四个通道消解来隔离客户端泄露,并推导出一个加性互信息界,从而得到Fulcrum算法,该算法在非对称拓扑下优于均匀DP-SGD,在多个数据集上实现了隐私保护的提升。

Comments 16 pages, 6 figures, 2 tables. Data from the experiments and source code can be found here: https://doi.org/10.5281/zenodo.20507155

详情
AI中文摘要

联邦学习通过传输模型更新来保护客户端数据,而差分隐私随机梯度下降(DP-SGD)通过这些更新来限制内容层面的泄露。然而,现有的机制并未考虑联邦本身通信拓扑所揭示的信息。在跨硅部署中,一个了解拓扑和组织结构的被动攻击者能够访问DP-SGD未考虑的信息通道。本文正式化了这一威胁并推导出一种原则性的防御方法。我们引入了TADI(拓扑感知分布推理),一种通过四个通道消解的影子训练通道分解方法,通过将客户端泄露分解为参数、结构和组织组件,并证明了一个加性客户端互信息界,将可控机制项与不可控先验耦合地板分离。从该界中,我们推导出Fulcrum,一种闭合形式的平衡最小-最大最优噪声分配,当联邦的杠杆配置不对称时,严格优于均匀DP-SGD,当不对称时退化为均匀DP-SGD,使其无条件安全采用。在Fed-ISIC2019、Fed-Heart-Disease和合成CIFAR-10六个拓扑家族上评估,Fulcrum在无可测量的实用性成本下实现了隐私保护的提升。TADI通道分解确认参数通道在所有设置下受DP-SGD限制,先验耦合通道在匹配先验条件下经验达到,且在现实跨硅威胁模型下部署有利方向上保守。

英文摘要

Federated learning transmits only model updates to protect client data, and differentially private SGD (DP-SGD) bounds content-level leakage through those updates. Neither mechanism accounts for what the communication topology of the federation itself reveals. In cross-silo deployments, a passive adversary with knowledge of the topology and organisational structure has access to information channels that DP-SGD leaves entirely unaddressed. We formalise this threat and derive a principled defense. We introduce TADI (Topology-Aware Distributional Inference), a shadow-trained channel decomposition that isolates per-client leakage into parameter, structural, and organisational components via four channel ablations, and prove an additive per-client mutual-information bound separating a controllable mechanism term from an uncontrollable prior-coupling floor. From this bound we derive Fulcrum, a closed-form balanced min-max optimal noise allocation that strictly dominates uniform DP-SGD whenever the federation's leverage profile is asymmetric, and degenerates exactly to uniform DP-SGD when it is not, making it safe to adopt unconditionally. Evaluated on Fed-ISIC2019, Fed-Heart-Disease, and synthetic CIFAR-10 across six topology families, Fulcrum delivers privacy gains of up to 1.967 nats at no measurable utility cost. The TADI channel decomposition confirms that the parameter channel is bounded by DP-SGD across all settings, the prior-coupling channel is empirically attained under matched-prior conditions, and the bound is conservative in a deployment-favourable direction under realistic cross-silo threat models.

2505.16311 2026-06-05 stat.ML cs.LG stat.ME

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

生成器介导的老虎机:面向生成式人工智能的自适应干预的汤普森采样

Marc Brooks, Gabriel Durham, Kihyuk Hong, Ambuj Tewari

发表机构 * Department of Statistics, University of Michigan, Ann Arbor, MI (USA)(密歇根大学统计学系,安阿伯,MI (美国))

AI总结 本文提出了一种生成器介导的老虎机算法(GAMBITTS),用于解决生成式人工智能(GenAI)驱动的自适应干预问题。该算法通过建模治疗和奖励生成过程,利用观察到的治疗信息加速策略学习,并在模拟研究中优于传统算法。

Comments 39 pages, 12 figures

详情
Journal ref
Advances in Neural Information Processing Systems 38 (NeurIPS 2025)
AI中文摘要

近期生成式人工智能(GenAI)模型的进步使生成个性化内容成为可能,该内容能够适应最新的用户情境。尽管个性化决策系统通常采用老虎机建模,但GenAI的引入为经典序列学习问题带来了新的结构。在GenAI驱动的干预中,智能体选择查询,但环境会经历由生成模型产生的随机响应。标准老虎机方法并未显式考虑这种结构,其中动作仅通过随机、观察到的治疗影响奖励。我们引入生成器介导的老虎机-汤普森采样(GAMBITTS),一种针对这种动作/治疗分割设计的老虎机方法,以移动健康干预中的大型语言模型生成文本作为动机案例。GAMBITTS显式建模治疗和奖励生成过程,利用所交付的治疗信息,相对于标准方法加速策略学习。我们通过分解治疗和奖励中的不确定性来源,建立了GAMBITTS的遗憾界,并识别了其在某些条件下优于标准老虎机方法的保证条件。在模拟研究中,GAMBITTS通过利用观察到的治疗更准确地估计预期奖励,始终优于传统算法。

英文摘要

Recent advances in generative artificial intelligence (GenAI) models have enabled the generation of personalized content that adapts to up-to-date user context. While personalized decision systems are often modeled using bandit formulations, the integration of GenAI introduces new structure into otherwise classical sequential learning problems. In GenAI-powered interventions, the agent selects a query, but the environment experiences a stochastic response drawn from the generative model. Standard bandit methods do not explicitly account for this structure, where actions influence rewards only through stochastic, observed treatments. We introduce generator-mediated bandit-Thompson sampling (GAMBITTS), a bandit approach designed for this action/treatment split, using mobile health interventions with large language model-generated text as a motivating case study. GAMBITTS explicitly models both the treatment and reward generation processes, using information in the delivered treatment to accelerate policy learning relative to standard methods. We establish regret bounds for GAMBITTS by decomposing sources of uncertainty in treatment and reward, identifying conditions where it achieves stronger guarantees than standard bandit approaches. In simulation studies, GAMBITTS consistently outperforms conventional algorithms by leveraging observed treatments to more accurately estimate expected rewards.

2410.04309 2026-06-05 cs.CY cs.LG

Comprehensive Monitoring of Air Pollution Hotspots Using Sparse Sensor Networks

利用稀疏传感器网络全面监测空气污染热点

Ankit Bhardwaj, Ananth Balashankar, Shiva Iyer, Nita Soans, Anant Sudarshan, Rohini Pande, Lakshminarayanan Subramanian

发表机构 * New York University(纽约大学) Google Research(谷歌研究) Toyota InfoTechnology Center(丰田信息技术中心) Kaiterra Inc(Kaiterra公司) University of Warwick(沃里克大学) Yale University(耶鲁大学)

AI总结 本文通过结合预测建模和机理方法,利用新增的低成本传感器,发现新德里现有传感器网络之外的189个隐藏热点,并利用空间时间克里金法进行预测,同时开发了高斯烟雾扩散模型以解释热点形成机理,为资源受限环境下的空气污染管理提供了数据驱动和机理结合的解决方案。

详情
AI中文摘要

城市空气污染热点对健康构成重大威胁,但其检测和分析仍然受到公共传感器网络稀疏性的限制。本文通过结合预测建模和机理方法,全面监测污染热点。我们通过在新德里现有传感器网络中增加28个低成本传感器,收集了2018年5月1日至2020年11月1日期间30个月的PM2.5数据。应用已建立的热点定义,我们发现了除公共网络检测的660个热点外,还有189个隐藏热点。利用预测技术如空间时间克里金法,我们在50%的传感器故障率下实现了95%的精度和88%的召回率,在50%的缺失传感器情况下实现了98%的精度和95%的召回率。我们的预测模型的预期结果进一步被编译成政策建议,供公共当局参考。此外,我们开发了高斯烟雾扩散模型以理解热点形成的机理,结合了从本地来源衍生的排放清单。我们的机理模型能够解释65%的观测到的瞬时热点。我们的发现强调了在资源受限环境中,整合数据驱动的预测模型与基于物理的机理模型对于可扩展和稳健的空气污染管理的重要性。

英文摘要

Urban air pollution hotspots pose significant health risks, yet their detection and analysis remain limited by the sparsity of public sensor networks. This paper addresses this challenge by combining predictive modeling and mechanistic approaches to comprehensively monitor pollution hotspots. We enhanced New Delhi's existing sensor network with 28 low-cost sensors, collecting PM2.5 data over 30 months from May 1, 2018, to Nov 1, 2020. Applying established definitions of hotspots to this data, we found the existence of additional 189 hidden hotspots apart from confirming 660 hotspots detected by the public network. Using predictive techniques like Space-Time Kriging, we identified hidden hotspots with 95% precision and 88% recall with 50% sensor failure rate, and with 98% precision and 95% recall with 50% missing sensors. The projected results of our predictive models were further compiled into policy recommendations for public authorities. Additionally, we developed a Gaussian Plume Dispersion Model to understand the mechanistic underpinnings of hotspot formation, incorporating an emissions inventory derived from local sources. Our mechanistic model is able to explain 65% of observed transient hotspots. Our findings underscore the importance of integrating data-driven predictive models with physics-based mechanistic models for scalable and robust air pollution management in resource-constrained settings.

2412.06259 2026-06-05 eess.AS cs.SD

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

利用提示学习和暂停编码进行阿尔茨海默病检测

Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

发表机构 * National Social Science Foundation of China(中华人民共和国国家社会科学基金) Supercomputing Center of the USTC(中国科学技术大学超算中心)

AI总结 本文提出通过提示学习和暂停信息编码改进基于转录文本的阿尔茨海默病检测,利用提示模板将分类任务转化为掩码语言建模任务,并通过比较不同自动语音识别模型和集成技术,达到95.8%的检测准确率。

Comments Accepted by ISCSLP 2024

详情
Journal ref
Proc. IEEE ISCSLP 2024, pp. 486-490, 2024
AI中文摘要

与其它临床筛查技术相比,基于语音和语言的自动化阿尔茨海默病(AD)检测方法具有非侵入性、成本效益和便利性。先前研究已证明微调预训练语言模型(PLMs)在AD检测中的有效性。然而,传统微调方法仅输入转录文本,其目标与PLMs预训练阶段使用的掩码语言建模(MLM)任务不一致。本文研究了基于提示的PLMs微调方法,通过在转录输入中插入提示模板将分类任务转化为MLM任务。同时探索了将强制对齐中的暂停信息纳入手动转录的影响。此外,我们比较了各种自动语音识别(ASR)模型的性能,并选择Whisper模型生成基于ASR的转录文本与手动转录进行比较。此外,跨不同PLMs(BERT和RoBERTa)使用不同随机种子应用多数投票和集成技术。最终,使用手动转录文本获得最大检测准确率为95.8%(均值87.9%,标准差3.3%),在ADReSS测试集上实现了仅使用转录文本进行AD检测的最先进性能。

英文摘要

Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previous studies have demonstrated the efficacy of fine-tuning pre-trained language models (PLMs) for AD detection. However, the objective of this traditional fine-tuning method, which involves inputting only transcripts, is inconsistent with the masked language modeling (MLM) task used during the pre-training phase of PLMs. In this paper, we investigate prompt-based fine-tuning of PLMs, converting the classification task into a MLM task by inserting prompt templates into the transcript inputs. We also explore the impact of incorporating pause information from forced alignment into manual transcripts. Additionally, we compare the performance of various automatic speech recognition (ASR) models and select the Whisper model to generate ASR-based transcripts for comparison with manual transcripts. Furthermore, majority voting and ensemble techniques are applied across different PLMs (BERT and RoBERTa) using different random seeds. Ultimately, we obtain maximum detection accuracy of 95.8% (with mean 87.9%, std 3.3%) using manual transcripts, achieving state-of-the-art performance for AD detection using only transcripts on the ADReSS test set.

2205.11518 2026-06-05 cs.CR cs.AI cs.LG

LIA: Privacy-Preserving Data Quality Evaluation in Federated Learning Using a Lazy Influence Approximation

LIA: 在联邦学习中使用懒惰影响近似进行隐私保护的数据质量评估

Ljubomir Rokvic, Panayiotis Danassis, Sai Praneeth Karimireddy, Boi Faltings

发表机构 * École Polytechnique Fédérale de Lausanne (EPFL)(瑞士联邦理工学院洛桑校区) Telenor Research(Telenor研究) University of Southern California(南加州大学)

AI总结 本文提出了一种新的隐私保护数据质量评估方法LIA,通过懒惰影响近似技术过滤和评分数据,在保持隐私的前提下有效识别低质量、损坏或恶意数据。

Comments Proceedings of the 2024 IEEE International Conference on Big Data (IEEE BigData 2024). A preliminary version of this work received the Best Paper Award at the International Workshop on Trustworthy Federated Learning at IJCAI (FL-IJCAI) 2023

详情
AI中文摘要

在联邦学习中,处理低质量、损坏或恶意数据至关重要。然而,传统数据估值方法由于隐私问题并不适用。为此,我们提出了一种简单而有效的方法,利用一种称为“懒惰影响”的新影响近似方法来过滤和评分数据,同时保持隐私。为此,每个参与者使用自己的数据来估计另一个参与者批次的影响,并将差分隐私混淆的评分发送给中央协调器。我们的方法已在各种模拟和现实世界设置中成功过滤出偏见和损坏的数据,实现了超过90%的召回率(有时高达100%),同时在ε ≤ 1的强差分隐私保证下保持性能。

英文摘要

In Federated Learning, it is crucial to handle low-quality, corrupted, or malicious data. However, traditional data valuation methods are not suitable due to privacy concerns. To address this, we propose a simple yet effective approach that utilizes a new influence approximation called "lazy influence" to filter and score data while preserving privacy. To do this, each participant uses their own data to estimate the influence of another participant's batch and sends a differentially private obfuscated score to the central coordinator. Our method has been shown to successfully filter out biased and corrupted data in various simulated and real-world settings, achieving a recall rate of over $>90\%$ (sometimes up to $100\%$) while maintaining strong differential privacy guarantees with $\varepsilon \leq 1$.

2410.04907 2026-06-05 math.CO cs.DM cs.LG cs.NE math.OC

Decomposition Polyhedra of Piecewise Linear Functions

分段线性函数的分解多面体

Marie-Charlotte Brandenburg, Moritz Grillo, Christoph Hertrich

发表机构 * Ruhr-Universität Bochum(博德姆鲁尔大学) Max Planck Institute MiS(马克斯·普朗克研究所MiS) University of Technology Nuremberg(纽伦堡技术大学)

AI总结 本文研究如何将连续分段线性函数分解为两个凸分段线性函数的差,通过固定多面体复形来确定非线性区域的可能位置,并证明分解集合形成一个多面体,从而为优化和神经网络理论提供新的见解。

详情
AI中文摘要

在本文中,我们致力于研究如何将连续分段线性(CPWL)函数分解为两个凸CPWL函数的差这一经常被研究的问题。每种CPWL函数都有无限多种这样的分解方式,但为了在优化和神经网络理论中应用,找到具有尽可能少线性片段的分解至关重要。我们通过反驳Tran和Wang最近提出的方法来展示这一问题的挑战性。为了使问题更具可处理性,我们提出固定一个底层多面体复形来确定可能的非线性区域位置。在此假设下,我们证明分解集合形成一个多面体,该多面体是两个平移锥的交集。我们证明不可约分解对应于该多面体的有界面,并且最小解必须是顶点。然后我们识别出具有唯一最小分解的情况,并展示我们的见解对亚模函数理论的影响。最后,我们改进了给定凸CPWL函数的神经网络构造,并应用我们的框架在非凸情况下获得结果。

英文摘要

In this paper we contribute to the frequently studied question of how to decompose a continuous piecewise linear (CPWL) function into a difference of two convex CPWL functions. Every CPWL function has infinitely many such decompositions, but for applications in optimization and neural network theory, it is crucial to find decompositions with as few linear pieces as possible. This is a highly challenging problem, as we further demonstrate by disproving a recently proposed approach by Tran and Wang [Minimal representations of tropical rational functions. Algebraic Statistics, 15(1):27-59, 2024]. To make the problem more tractable, we propose to fix an underlying polyhedral complex determining the possible locus of nonlinearity. Under this assumption, we prove that the set of decompositions forms a polyhedron that arises as intersection of two translated cones. We prove that irreducible decompositions correspond to the bounded faces of this polyhedron and minimal solutions must be vertices. We then identify cases with a unique minimal decomposition, and illustrate how our insights have consequences in the theory of submodular functions. Finally, we improve upon previous constructions of neural networks for a given convex CPWL function and apply our framework to obtain results in the nonconvex case.

2406.07049 2026-06-05 cs.NE cs.LG

GridPE: A Grid Cell-Inspired Unified Position Embedding for Arbitrary-Dimensional Spaces

GridPE: 一种基于网格细胞的统一位置嵌入方法用于任意维度空间

Boyang Li, Yulin Wu, Nuoxian Huang, Wenjia Zhang

发表机构 * New York University(纽约大学) Peking University(北京大学) Imperial College London(伦敦帝国学院) Tongji University(同济大学)

AI总结 本文提出GridPE,一种受哺乳动物空间认知中六边形周期编码启发的新型位置嵌入框架,旨在解决高维时空任务中位置嵌入的理论保障问题,通过结合计算神经科学原理和调和分析,为任意维度空间提供统一的位置嵌入解决方案。

详情
AI中文摘要

理解所有维度上的空间关系对于智能系统至关重要。然而,现有的位置嵌入方法,如旋转位置嵌入(RoPE),在高维时空任务如视频理解和机器人导航中缺乏理论保障。受哺乳动物空间认知中网格细胞的六边形周期编码启发,我们提出GridPE——一种结合计算神经科学原理与调和分析的新位置嵌入框架。我们的方法基于随机傅里叶特征,并利用神经科学原理构建高效的嵌入。理论上,我们证明任何平移不变的空间函数都可以通过有限个傅里叶基的求和来近似,这自然在一维情况下还原为RoPE。然后,我们从生物可用性角度推导出每个尺度下的频率向量方向和数量,以及不同尺度之间的最佳比例。这些推导等同于该维度中正则单纯形的中心与顶点之间的关系。我们验证了GridPE在多种空间建模任务中的有效性,包括2D图像分类(ImageNet100)和3D点云识别(ModelNet40)。我们的理论分析确立了GridPE作为任意维度空间位置嵌入的统一框架,而实验结果证明其优于现有方法。

英文摘要

Understanding spatial relationships across all dimensions is fundamental for intelligent systems. However, existing positional embeddings, such as Rotary Positional Embedding (RoPE), lack theoretical guarantees for high-dimensional spatiotemporal tasks like video understanding and robotic navigation. Inspired by the hexagonal periodic coding of grid cells in mammalian spatial cognition, we propose GridPE -- a novel positional embedding framework that integrates computational neuroscience principles with harmonic analysis. Our approach builds upon Random Fourier Features and leverages principles from neuroscience to construct efficient embeddings. Theoretically, we prove that any translation-invariant spatial function can be approximated by a finite sum of Fourier bases, which naturally reduces to RoPE in the one-dimensional case. We then derive the directions and quantities of frequency vectors at each scale in any Euclidean dimension, along with the optimal ratio between different scales, from a bioavailability perspective. These derivations are equivalent to the relationship between the centroid and the vertices of a regular simplex in that dimension. We validate GridPE across a range of spatial modeling tasks, including 2D image classification (ImageNet100) and 3D point cloud recognition (ModelNet40). Our theoretical analysis establishes GridPE as a unified framework for positional embedding in arbitrary-dimensional spaces, while empirical results demonstrate its superiority over existing methods.

2403.00965 2026-06-05 stat.AP cs.AI cs.LG

Binary Gaussian Copula Synthesis: an LLM-powered data augmentation framework for early dialysis prediction in chronic kidney disease

二元高斯卷积合成:一种基于LLM的数据增强框架,用于慢性肾病早期透析预测

Hamed Khosravi, Milad Khanchi, Mobina Noori, Srinjoy Das, Abdullah Al-Mamun, Imtiaz Ahmed

发表机构 * Department of Industrial & Management Systems Engineering, West Virginia University(威斯康星大学工业与管理系统工程系) Department of Electrical and Computer Engineering, Concordia University(康科迪亚大学电气与计算机工程系) Department of Computer Science, University of California, Davis(加州大学戴维斯分校计算机科学系) School of Mathematical & Data Sciences, West Virginia University(威斯康星大学数学与数据科学学院) School of Systems Science and Industrial Engineering, The State University of New York at Binghamton(纽约州立大学布法罗分校系统科学与工业工程学院) H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology(佐治亚理工学院H.米尔顿·斯图尔特工业与系统工程学院)

AI总结 本文提出Binary Gaussian Copula Synthesis (BGCS),一种专为二元临床数据设计的两阶段数据增强方法,通过生成合成少数类样本并过滤不合理的样本,提高了早期透析预测的性能。

详情
AI中文摘要

只有极少数慢性肾病(CKD)患者会进展到透析,这导致了严重的类别不平衡,限制了机器学习模型在早期透析预测中的性能。这一挑战进一步加剧了电子健康记录(EHR)数据的二元结构,而现有的大多数增强方法并未为此设计。我们提出了Binary Gaussian Copula Synthesis (BGCS),一种专为二元临床数据设计的两阶段数据增强方法。BGCS首先使用高斯卷积框架生成合成少数类样本,该框架明确建模二元特征之间的成对依赖关系,然后应用微调的GPT-2分类器过滤出临床上不合理的样本后再进行训练。我们在一个包含15,169名CKD患者的真实世界EHR数据集中评估了BGCS,该数据集来自西弗吉尼亚州,收集时间从2008年到2022年。我们将其与SMOTE、CTGAN和标准高斯卷积在四个机器学习分类器上进行了基准测试,共进行了25次独立运行。BGCS在所有比较方法中表现一致,实现了90天透析预测的最高少数类召回率,不同分类器的中位数值范围从0.78到0.87,且在真实数据上的分布忠实度最强,特征的均值p值为0.68。表现最好的BGCS增强模型被集成到一个可解释的决策树基于的临床决策支持系统中,用于透析风险分层,其中电解质失衡、心血管合并症和肾脏监测指标成为最显著的预测特征。这些发现表明,为二元EHR数据的结构特性设计的增强方法可以显著提高早期透析风险预测,并支持开发可解释的临床决策支持工具用于CKD护理。

英文摘要

Only a small fraction of patients with chronic kidney disease (CKD) progress to dialysis, creating severe class imbalance that limits the performance of machine learning models for early dialysis prediction. This challenge is compounded by the binary structure of electronic health record (EHR) data, for which most existing augmentation methods were not designed. We propose Binary Gaussian Copula Synthesis (BGCS), a two-stage data augmentation method tailored to binary clinical data. BGCS first generates synthetic minority-class samples using a Gaussian copula framework that explicitly models pairwise dependencies among binary features, then applies a fine-tuned GPT-2 classifier to filter out clinically implausible samples before training. We evaluated BGCS on a real-world EHR dataset of 15,169 patients with CKD from West Virginia collected between 2008 and 2022, benchmarking it against SMOTE, CTGAN, and standard Gaussian Copula across four machine learning classifiers over 25 independent runs. BGCS consistently outperformed all comparison methods, achieving the highest minority-class recall for 90-day dialysis prediction, with median values ranging from 0.78 to 0.87 across classifiers, and the strongest distributional fidelity to real data, with a mean p-value of 0.68 across features. The best-performing BGCS-augmented model was integrated into an interpretable decision tree-based clinical decision support system for dialysis risk stratification, with electrolyte imbalances, cardiovascular comorbidities, and renal monitoring indicators emerging as the most influential predictive features. These findings suggest that augmentation methods designed for the structural properties of binary EHR data can meaningfully improve early dialysis risk prediction and support the development of interpretable clinical decision-support tools for CKD care.

2308.12224 2026-06-05 q-bio.QM cs.AI

Enhancing cardiovascular risk prediction through AI-enabled calcium-omics

通过AI赋能的钙组学增强心血管风险预测

Ammar Hoori, Sadeer Al-Kindi, Tao Hu, Yingnan Song, Hao Wu, Juhwan Lee, Nour Tashtish, Pingfu Fu, Robert Gilkeson, Sanjay Rajagopalan, David L. Wilson

发表机构 * Department of Biomedical Engineering, Case Western Reserve University(生物医学工程系,凯斯西储大学) Harrington Heart and Vascular Institute, University Hospitals Cleveland Medical Center(哈灵顿心脏和血管研究所,克利夫兰医学中心) School of Medicine, Case Western Reserve University(医学院,凯斯西储大学) Department of Population and Quantitative Health Sciences, Case Western Reserve University(人口与定量健康科学系,凯斯西储大学) Department of Radiology, University Hospitals Cleveland Medical Center(放射科,克利夫兰医学中心) Department of Radiology, Case Western Reserve University(放射科,凯斯西储大学)

AI总结 本文通过利用详细的钙沉积特征(即钙组学)结合AI方法,提高了主要不良心血管事件(MACE)预测的准确性,展示了钙组学在心血管风险预测中的应用价值。

Comments 12 pages, 8 figures, 2 tables, 4 pages supplemental, journal paper format (under review)

详情
AI中文摘要

背景. 冠状动脉钙化(CAC)是预测主要不良心血管事件(MACE)的强大预测因子。传统的Agatston评分只是简单地将钙含量相加,尽管是非线性方式,但仍有改进钙沉积评估的空间,以更全面地捕捉疾病程度。目标. 确定是否可以通过使用详细的钙沉积特征(即钙组学)的AI方法来提高MACE预测。方法. 我们研究了钙沉积的其他特征,包括质量、体积、密度、空间分布、区域等的评估。我们使用带有弹性网络正则化的Cox模型,在2457例CT钙化评分(CTCS)中,该评分富集了MACE事件,来源于一个大型无成本CLARIFY计划(ClinicalTrials.gov标识符:NCT04075162)。我们采用了采样技术来增强模型训练。我们还研究了使用选定特征的Cox模型,以识别可解释的高风险特征。结果. 我们提出的钙组学模型,通过修改的合成下采样和上采样,给出了C指数(80.5%/71.6%)和两年AUC(82.4%/74.8%)(80:20,训练/测试),分别(采样仅应用于训练集)。结果优于Agatston,后者给出了C指数(71.3%/70.3%)和AUC(71.8%/68.8%)。在钙组学特征中,钙化数量、左前降支质量及扩散率(空间分布的度量)是增加风险的重要决定因素,而致密钙化(>1000HU)与较低风险相关。钙组学模型在保留测试中将63%的MACE患者重新分类到高风险组。分类净再分类指数为NRI=0.153。结论. AI分析冠状动脉钙化可比Agatston评分产生更好的结果。我们的发现表明,钙组学在改进风险预测中的应用价值。

英文摘要

Background. Coronary artery calcium (CAC) is a powerful predictor of major adverse cardiovascular events (MACE). Traditional Agatston score simply sums the calcium, albeit in a non-linear way, leaving room for improved calcification assessments that will more fully capture the extent of disease. Objective. To determine if AI methods using detailed calcification features (i.e., calcium-omics) can improve MACE prediction. Methods. We investigated additional features of calcification including assessment of mass, volume, density, spatial distribution, territory, etc. We used a Cox model with elastic-net regularization on 2457 CT calcium score (CTCS) enriched for MACE events obtained from a large no-cost CLARIFY program (ClinicalTri-als.gov Identifier: NCT04075162). We employed sampling techniques to enhance model training. We also investigated Cox models with selected features to identify explainable high-risk characteristics. Results. Our proposed calcium-omics model with modified synthetic down sampling and up sampling gave C-index (80.5%/71.6%) and two-year AUC (82.4%/74.8%) for (80:20, training/testing), respectively (sampling was applied to the training set only). Results compared favorably to Agatston which gave C-index (71.3%/70.3%) and AUC (71.8%/68.8%), respectively. Among calcium-omics features, numbers of calcifications, LAD mass, and diffusivity (a measure of spatial distribution) were important determinants of increased risk, with dense calcification (>1000HU) associated with lower risk. The calcium-omics model reclassified 63% of MACE patients to the high risk group in a held-out test. The categorical net-reclassification index was NRI=0.153. Conclusions. AI analysis of coronary calcification can lead to improved results as compared to Agatston scoring. Our findings suggest the utility of calcium-omics in improved prediction of risk.

0911.2381 2026-06-05 physics.data-an cond-mat.stat-mech cs.LG nlin.CD stat.ME

Analytical Determination of Fractal Structure in Stochastic Time Series

随机时间序列中分形结构的解析确定

Fermín Moscoso del Prado Martín

发表机构 * Laboratoire de Psychologie Cognitive ( UMR --6146) CNRS \& Aix--Marseille Universit\'e I, Marseille, France

AI总结 本文提出了一种基于贝叶斯评估的分析框架,用于客观准确地推断时间序列的分形结构,同时推导出一种优于现有方法的Hurst指数最大似然估计器。

Comments 9 pages, 4 figures

详情
Journal ref
Psychological Methods (2013) 18(4):514-34
AI中文摘要

当前确定时间序列是否具有分形结构(FS)的方法依赖于对Hurst指数估计值的主观评估。本文引入了贝叶斯评估的标度性,一种用于对时间序列的FS进行客观和准确推断的分析框架。该技术利用了时间序列相关扩散的标度性质。所得标准易于计算,并代表了支持时间序列标度域不同假设的证据的准确表征。此外,从该标准导出了H的闭式最大似然估计器,该估计器优于目前最好的估计器。

英文摘要

Current methods for determining whether a time series exhibits fractal structure (FS) rely on subjective assessments on estimators of the Hurst exponent (H). Here, I introduce the Bayesian Assessment of Scaling, an analytical framework for drawing objective and accurate inferences on the FS of time series. The technique exploits the scaling property of the diffusion associated to a time series. The resulting criterion is simple to compute and represents an accurate characterization of the evidence supporting different hypotheses on the scaling regime of a time series. Additionally, a closed-form Maximum Likelihood estimator of H is derived from the criterion, and this estimator outperforms the best available estimators.

2606.06201 2026-06-05 cs.AI

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

学习补货:面向医药供应链动态库存管理的混合深度强化学习

Amandeep Kaur, Gyan Prakash

AI总结 针对医药供应链中需求不确定和前置时间变化导致的库存管理难题,提出一种混合异步优势演员评论家分布式近端策略优化(A3C DPPO)算法,实现连续动作空间下的最优补货策略,降低库存成本并提高服务水平。

Comments Nil

详情
AI中文摘要

医药供应链(PSCs)因不可预测的需求模式和与补货相关的可变前置时间,在库存管理(IM)方面面临挑战。药品的有限保质期进一步加剧了这种复杂性,需要在充足库存和最小浪费之间取得微妙的平衡。这些相互交织的因素构成了一个复杂的优化问题,需要复杂的库存策略来确保产品可用性和PSC效率。本研究旨在为医药产品开发一种最优库存补货策略,能够处理由不确定需求和可变PSC条件产生的随机性。目标是最大化PSC的盈利能力,同时保持较高的患者服务水平。我们将问题建模为马尔可夫决策过程,并提出一种深度强化学习(DRL)方法,具体为混合异步优势演员评论家分布式近端策略优化(A3C DPPO)算法。该A3C DPPO算法针对IM中固有的连续动作空间进行了定制。数值结果表明,所提算法在动态场景下自适应更新库存补货策略,与各种基准相比,实现了更低的库存成本。我们还使用真实药品库存数据进行了数值验证,以确认所提算法的实际可行性。

英文摘要

Pharmaceutical supply chains (PSCs) struggle with inventory management (IM) due to unpredictable demand patterns and variable lead times associated with restocking. This complexity is further compounded by the finite shelf lives of pharmaceutical products, which necessitate a delicate balance between adequate stock and minimal waste. These intertwined factors create a complex optimization problem that requires sophisticated inventory strategies to ensure both product availability and PSC efficiency. This study aims to develop an optimal inventory replenishment policy for pharmaceutical products that can handle the stochasticity arising from uncertain demand and variable PSC conditions. The objective is to maximize the profitability of the PSC while maintaining a high patient service level. We formulate the problem as a Markov decision process and propose a deep reinforcement learning (DRL) approach, specifically, a hybrid asynchronous advantage actor critic distributed proximal policy optimization (A3C DPPO)algorithm. The A3C DPPO algorithm is tailored to handle the continuous action space inherent in IM. The numerical results demonstrate that the proposed algorithm adaptively updates the inventory replenishment strategy under dynamic scenarios, resulting in lower inventory costs compared to various benchmarks. We also conduct numerical validation using real-world pharmaceutical inventory data to confirm the practical feasibility of the proposed algorithm.

2606.05611 2026-06-05 cs.CV

What's Under the Skin? Estimating Swine Body Condition

皮肤之下是什么?估算猪体况

Mk Bashar, Kuljit Bhatti, Gary Rohrer, Madonna Benjamin, Tami Brown-Brandl, Daniel Morris

AI总结 提出PigFormer系统,利用RGB-D深度图像通过两阶段流程(几何前端和切片注意力编码器)预测猪的皮下背膘厚度、腰肌深度和总组织厚度,实现非接触式体况监测。

详情
AI中文摘要

母猪体况是养殖者的重要指标,因为它对泌乳性能和仔猪存活率有很大影响。然而,生产中使用的体况测量方法(如视觉评分和卡尺)与底层组织成分的相关性较差。超声波扫描可以直接测量皮下背膘厚度和腰肌深度,但操作劳动密集且无法规模化生产。我们提出了PigFormer,一个端到端的两阶段系统,它从天花板安装的RGB-D相机获取原始深度帧,并预测最后肋骨处的皮下背膘厚度、腰肌深度和总组织厚度。第一阶段是几何前端,通过SAM3-to-MaskDINO分割蒸馏、地平面去除和方向归一化将原始深度转换为标准化高度图。第二阶段是切片注意力编码器,将每个高度图视为一系列横截面切片,并捕捉沿整个背侧表面的空间关系。在两个设施的多站点数据集(319头母猪和小母猪实例)上,PigFormer实现了2.43毫米的背膘平均绝对误差和3.87毫米的整体平均绝对误差。它优于强大的单阶段ResNet-18和ViT-small基线。PigFormer为商业养猪生产中实现连续、自动化、非接触式体况监测提供了一条实用途径。代码可在https://github.com/iambashar/Pigformer获取。

英文摘要

Sow body condition is an important indicator for growers as it has a large impact on lactation performance and piglet survival. However, body condition measures used during production, such as visual scoring and calipers, correlate poorly with underlying tissue composition. Ultrasound scans can provide direct measurements of subcutaneous backfat thickness and loin muscle depth, but their operation is labor intensive and not scalable for production. We present PigFormer, an end-to-end two-stage system that takes raw depth frames from a ceiling-mounted RGB-D camera and predicts subcutaneous backfat thickness, loin muscle depth, and total tissue thickness at the last rib. Stage 1 is a geometric front-end that converts raw depth into a standardized height map via SAM3-to-MaskDINO segmentation distillation, ground-plane removal, and orientation normalization. Stage 2 is a Slice Attention Encoder that treats each height map as a sequence of cross-sectional slices and captures spatial relationships along the full dorsal surface. On a multi-site dataset of 319 sow and gilt instances from two facilities, PigFormer achieves 2.43 mm backfat MAE and 3.87 mm overall MAE. It outperforms strong single-stage ResNet-18 and ViT-small baselines. PigFormer offers a practical path toward continuous, automated, non-contact body condition monitoring in commercial swine production. Code is available at https://github.com/iambashar/Pigformer.

2605.23415 2026-06-05 cs.LG cs.AI

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

Reflex: 基于状态连续控制中利用反射对称性的强化学习

Shuai Zhen, Yifan Zhang, Yuling Wang, Yanhua Yu

AI总结 提出Reflex框架,通过反射对称性正则化机制将反射对称性融入策略学习,提升基于状态的连续控制任务的样本效率。

Comments Some of the data in the paper contain errors and need to be confirmed for modification

详情
AI中文摘要

强化学习长期面临样本效率低下的问题。缓解该问题的一种有前景的方法是利用群不变马尔可夫决策过程($G$-不变MDP)。现有工作主要关注基于图像的强化学习和旋转对称性(如$\mathrm{SO(2)}$),而基于状态的强化学习和反射对称性尚未得到充分探索。本文聚焦于基于状态的连续控制任务,通过引入Reflex范式来利用反射对称性,该范式可无缝集成到同策略和异策略强化学习算法中。我们形式化了两种反射类型——轴向反射和双侧反射,并刻画了它们对应的变换。基于对保持对称性的最优值函数和策略的理论分析,Reflex通过原则性的对称性正则化机制将反射对称性融入策略学习。我们将Reflex与PPO和SAC集成,并在OpenAI Gym和DeepMind Control基准测试套件上进行评估,结果表明相比标准基线,Reflex在提升样本效率的同时实现了更优的性能。我们的代码开源在https://github.com/TonyStark042/Reflex。

英文摘要

Reinforcement learning has long struggled with poor sample efficiency. One promising approach to mitigate this problem is leveraging group-invariant Markov Decision Processes ($G$-invariant MDPs). Existing works in this direction have primarily focused on image-based RL and rotational symmetry such as $\mathrm{SO(2)}$, leaving state-based RL and reflection symmetry largely underexplored. In this work, we focus on state-based continuous control tasks and exploit reflection symmetry by introducing Reflex, a paradigm that seamlessly integrates with both on-policy and off-policy RL algorithms. We formalize two types of reflection-axial reflection and bilateral reflection, and characterize their corresponding transformations. Building on a theoretical analysis of symmetry-preserving optimal value functions and policies, Reflex integrates reflection symmetry into policy learning through principled symmetry regularization mechanisms. We integrate Reflex with PPO and SAC, and evaluate it on a suite of OpenAI Gym and DeepMind Control benchmarks, demonstrating superior performance over standard baselines while improving sample efficiency. Our code is available at https://github.com/TonyStark042/Reflex.

2605.11632 2026-06-05 cs.CL cs.AI

Macro: Enhancing Multilingual Counterfactual Explanations through Alignment-as-Preference Optimization

Macro: 通过偏好对齐优化提升多语言反事实解释

Yilong Wang, Qianli Wang, Bohao Chu, Yihong Liu, Jing Yang, Simon Ostermann

AI总结 本文提出Macro框架,通过直接偏好优化改进多语言反事实解释,提升有效性的同时保持最小性,避免翻译基线的严重最小性问题,并在多个指标上优于监督微调方法。

Comments In submission

详情
AI中文摘要

自我生成的反事实解释(SCEs)是大型语言模型(LLMs)生成的最小修改输入(minimality),通过翻转自身预测(validity)来揭示黑箱LLM行为,提供因果基础的解释方法。然而,将其扩展到非主导语言仍具挑战性:现有方法难以生成有效SCEs,且有效性与最小性之间的权衡影响解释质量。我们引入Macro,一种偏好对齐框架,通过直接偏好优化(DPO)进行多语言SCE生成,使用复合评分函数构建偏好对,将权衡转化为可测量的偏好信号。在四个LLM和七个语言类型多样的语言上进行实验,结果显示,Macro在平均情况下比链式思维基线提高了12.55%的有效性,同时不降低最小性,避免了翻译基线的严重最小性问题。与监督微调相比,Macro在两个指标上表现更优,证实了显式偏好优化对于平衡此权衡的重要性。进一步分析显示,Macro增强了跨语言扰动对齐并缓解了常见生成错误。我们的结果突显了偏好优化作为提升多语言模型解释的有前途方向。

英文摘要

Self-generated counterfactual explanations (SCEs) are minimally modified inputs (minimality) generated by large language models (LLMs) that flip their own predictions (validity), offering a causally grounded approach to unraveling black-box LLM behavior. Yet extending them beyond English remains challenging: existing methods struggle to produce valid SCEs in non-dominant languages, and a persistent trade-off between validity and minimality undermines explanation quality. We introduce Macro, a preference alignment framework that applies Direct Preference Optimization (DPO) to multilingual SCE generation, using a composite scoring function to construct preference pairs that effectively translate the trade-off into measurable preference signals. Experiments across four LLMs and seven typologically diverse languages show that Macro improves validity by 12.55\% on average over the chain-of-thought baseline without degrading minimality, while avoiding the severe minimality violations of the translation-based baseline. Compared to supervised fine-tuning, Macro achieves superior performance on both metrics, confirming that explicit preference optimization is essential for balancing this trade-off. Further analyses reveal that Macro increases cross-lingual perturbation alignment and mitigates common generation errors. Our results highlight preference optimization as a promising direction for enhancing multilingual model explanations.

2604.16502 2026-06-05 cs.CV

Topology-Aware Layer Pruning for Large Vision-Language Models

面向拓扑的层剪枝用于大型视觉-语言模型

Pengcheng Zheng, Chaoning Zhang, Ya Wen, Wang Liu, Qigan Sun, Jiarong Mo, Jiaquan Zhang, Jewon Lee, Tae-Ho Kim, Kuien Liu, Tianyu Li, Caiyan Qin, Yang Yang

AI总结 本文提出了一种面向拓扑的层剪枝框架,用于大型视觉-语言模型,通过利用拓扑持续同调量化层间拓扑一致性,实现自适应剪枝以保留关键表示转换。

Comments This manuscript has been withdrawn by the authors. It reproduced the methodology of Gardinazzi et al., arXiv:2410.11042, without citation, and utilized code and data from the associated repository (github.com/RitAreaSciencePark/ZigZagLLMs) without disclosure or violate the MIT License. A revised future version with full attribution may be prepared. For any feedback, please contact Pengcheng Zheng

详情
AI中文摘要

大型语言模型(LLMs)在自然语言理解和推理方面展示了强大的能力,而最近的扩展将视觉输入纳入其中,使它们能够处理多模态信息。尽管有这些进展,大型视觉-语言模型(LVLMs)仍然带来了显著的计算和内存成本,阻碍了在资源受限场景中的部署。现有的层剪枝方法通常依赖于局部相似性度量或静态代理信号,无法捕捉模型深度中表示的全局和动态演变,这往往导致关键转换层被移除。为了解决这一限制,我们提出了一种面向拓扑的层剪枝框架用于LVLMs。具体而言,我们将层的隐藏状态表示为点云,并利用 extit{simplicial complexes}来建模其演变。通过利用 extit{zigzag persistent homology},我们量化了层间拓扑一致性,并实现了能够保留关键表示转换的自适应剪枝。在多样化的多模态基准上的广泛实验表明,所提出的框架在各种稀疏率范围内均优于现有剪枝方法。我们的代码可在https://github.com/zpc456/TopoVLM上获得。

英文摘要

Large Language Models (LLMs) have demonstrated strong capabilities in natural language understanding and reasoning, while recent extensions that incorporate visual inputs enable them to process multimodal information. Despite these advances, Large Vision-Language Models (LVLMs) incur substantial computational and memory costs, hindering deployment in resource-constrained scenarios. Existing layer pruning methods typically rely on local similarity metrics or static proxy signals, failing to capture the global and dynamic evolution of representations across model depth, which often leads to the removal of transition-critical layers. To address this limitation, we propose a topology-aware layer pruning framework for LVLMs. Specifically, we represent layer wise hidden states as point clouds and models their evolution using \textit{simplicial complexes}. By leveraging \textit{zigzag persistent homology}, we quantify inter-layer topological consistency and enable adaptive pruning that preserves critical representational transitions. Extensive experiments on diverse multimodal benchmarks demonstrate that the proposed framework consistently outperforms existing pruning methods across a wide range of sparsity ratios. Our code is available at https://github.com/zpc456/TopoVLM.

2604.01349 2026-06-05 cs.LG cs.CE physics.comp-ph

PI-JEPA: Label-Free Surrogate Pretraining for Coupled Multiphysics Simulation via Operator-Split Latent Prediction

PI-JEPA: 一种无需标签的替代预训练方法,用于通过操作符分裂潜在预测的耦合多物理场模拟

Brandon Yee, Pairie Koh

AI总结 该研究提出了一种无需完整PDE求解的替代预训练框架PI-JEPA,通过掩码潜在预测和每子操作符PDE残差正则化,在未标记的参数场上训练,从而减少多物理场替代部署所需的模拟预算。

Comments Substantial Revision Required

详情
AI中文摘要

流体模拟工作流程面临根本的数据不对称性:输入参数场(地质统计渗透率实现、孔隙度分布)可以自由生成任意数量,但现有神经操作符替代模型需要大量昂贵的标记模拟轨迹数据,无法利用这种未标记结构。我们引入PI-JEPA(物理信息联合嵌入预测架构),一种替代预训练框架,无需任何完整的PDE求解,通过在未标记参数场上的掩码潜在预测和每子操作符PDE残差正则化进行训练。预测器银行在结构上与 governing equations 的李-特罗特操作符分裂分解对齐,为每个子过程(压力、饱和度传输、反应)分配一个物理约束的潜在模块,使微调仅需100次标记模拟运行。在单相达西流中,PI-JEPA在N_ℓ=100时比FNO低1.9倍,比DeepONet低2.4倍,在N_ℓ=500时比监督-only训练提高24%,证明了无标签替代预训练显著减少了多物理场替代部署所需的模拟预算。

英文摘要

Reservoir simulation workflows face a fundamental data asymmetry: input parameter fields (geostatistical permeability realizations, porosity distributions) are free to generate in arbitrary quantities, yet existing neural operator surrogates require large corpora of expensive labeled simulation trajectories and cannot exploit this unlabeled structure. We introduce \textbf{PI-JEPA} (Physics-Informed Joint Embedding Predictive Architecture), a surrogate pretraining framework that trains \emph{without any completed PDE solves}, using masked latent prediction on unlabeled parameter fields under per-sub-operator PDE residual regularization. The predictor bank is structurally aligned with the Lie--Trotter operator-splitting decomposition of the governing equations, dedicating a separate physics-constrained latent module to each sub-process (pressure, saturation transport, reaction), enabling fine-tuning with as few as 100 labeled simulation runs. On single-phase Darcy flow, PI-JEPA achieves $1.9\times$ lower error than FNO and $2.4\times$ lower error than DeepONet at $N_\ell{=}100$, with 24\% improvement over supervised-only training at $N_\ell{=}500$, demonstrating that label-free surrogate pretraining substantially reduces the simulation budget required for multiphysics surrogate deployment.

2512.15783 2026-06-05 cs.AI cs.LG

Towards AI epidemiology: a measurement standardisation framework for prospective risk detection

迈向人工智能流行病学:一种用于前瞻性风险检测的测量标准化框架

Kit Tempest-Walters

AI总结 本文提出了一种测量标准化框架,用于在没有访问模型内部信息的情况下,将专家-人工智能交互压缩为结构化、可比较的领域,以进行前瞻性风险检测。该框架旨在定义其范围,包括语义和统计层面,并指定未来工作的实证测试协议。

Comments 29 pages, 3 figures

详情
AI中文摘要

本文提出了一种测量标准化框架,该框架将专家-人工智能交互压缩为结构化、可比较的领域,用于在部署的人工智能系统中进行前瞻性风险检测,而无需访问模型内部信息。本文的概念性论文的主要目的是定义该框架的范围,包括语义和统计层面,并指定未来工作的实证测试协议。该框架旨在支持的群体层面声明因此是阶段性的研究计划,而非本文中声称的结果。测量标准化支撑着接下来的三个声明。第一个是可靠性声明:在有限条件下,大型语言模型可以产生可靠的、标准化的评估,用于评估专家-人工智能交互的证据和对齐情况。第二个是治理声明:对齐分数在部署期间为专家提供即时信号,并为机构提供监控不同任务类型、模型和领域的对齐模式的基础。第三个是流行病学声明:一旦建立了测量标准化,聚合对齐分数可以用于研究与下游结果相关的关联,这在受监管的专业环境中是可能的。这引入了基于相关变量而非机理分析的“人工智能流行病学”的可能性。本文解决了第一个声明,并指定了调查第二个和第三个声明的协议。为了在未来研究中实现实证评估,本文阐述了定义的语法,以及基于成对Bootstrap推断的统计协议,DeLong测试用于成对AUCs作为灵敏度检查,预设的一侧非劣性边界为0.05,以及Holm-Bonferroni校正。

英文摘要

This paper proposes a measurement standardisation framework that compresses expert-AI interactions into structured, comparable fields for prospective risk detection in deployed AI systems, without access to model internals. The main aim of this concept paper is to define the scope of the framework, both semantically and statistically, and to specify a protocol for its empirical testing in future work. The population-level claims the framework is designed to support are therefore the subject of a staged research programme rather than results claimed in this paper. Measurement standardisation underpins all three claims that follow. The first is a reliability claim: under bounded conditions, large language models can produce reliable, standardised assessments of the evidential and policy alignment of expert-AI interactions. The second is a governance claim: alignment scores give experts an immediate signal during deployment and give institutions a basis for monitoring alignment patterns across mission types, models, and domains. The third is an epidemiological claim: once measurement standardisation is established, aggregate alignment scores could be used to study associations with downstream outcomes in regulated professional settings. This introduces the possibility of an "AI epidemiology" that detects risk based on correlated variables instead of mechanistic analysis. This paper addresses the first claim and specifies protocols for investigating the second and third. To enable empirical evaluation in future studies, this paper sets out a defined grammar, together with a statistical protocol based on paired bootstrap inference, DeLong's test for paired AUCs as a sensitivity check, a pre-specified one-sided non-inferiority margin of 0.05, and Holm-Bonferroni correction.

2507.00460 2026-06-05 cs.CL

Pitfalls of Evaluating Language Models with Open Benchmarks

使用开放基准评估语言模型的陷阱

Md. Najib Hasan, Md Mahadi Hassan Sibat, Mohammad Fakhruddin Babar, Souvika Sarkar, Monowar Hasan, Santu Karmaker

AI总结 本文探讨了使用开放基准评估语言模型时存在的数据泄露风险,并通过构建作弊模型验证了这种风险,指出开放基准可能无法反映实际应用效果,需补充私有或动态生成的基准以维持评估的完整性。

Comments After further review, we found that the core contribution and methodology substantially overlap with previously published work. As a result, the manuscript does not provide a sufficiently distinct or original contribution in its current form. To avoid repetition in the literature and prevent possible confusion for readers, we believe withdrawal is the most appropriate action

详情
AI中文摘要

开放大型语言模型(LLM)基准,如HELM和BIG-Bench,提供了标准化和透明的评估协议,支持语言模型(LM)研究中的比较分析、可重复性和系统性进展跟踪。然而,这种开放性也带来了在LM测试中数据泄露的显著风险——无论是故意还是无意的,从而削弱了排行榜的公平性和可靠性,并使其容易受到不法分子的操控。我们通过故意构建作弊模型来展示这一问题的严重性:构建BART、T5和GPT-2的较小变体,并直接在公开可用的测试集上进行微调。正如预期的那样,这些模型在目标基准上表现优异,但在可比的未见测试集上却表现糟糕。我们随后检查了任务特定的简单改写-based防护策略,以减轻数据泄露的影响,并评估了它们的有效性和局限性。我们的发现强调了三个关键点:(i)在有限的开放、静态基准上的高排行榜表现可能无法反映实际应用效果;(ii)私有或动态生成的基准应补充开放基准以维持评估的完整性;(iii)对当前基准评估实践的重新审视对于可靠和可信的LM评估至关重要。

英文摘要

Open Large Language Model (LLM) benchmarks, such as HELM and BIG-Bench, provide standardized and transparent evaluation protocols that support comparative analysis, reproducibility, and systematic progress tracking in Language Model (LM) research. Yet, this openness also creates substantial risks of data leakage during LM testing--deliberate or inadvertent, thereby undermining the fairness and reliability of leaderboard rankings and leaving them vulnerable to manipulation by unscrupulous actors. We illustrate the severity of this issue by intentionally constructing cheating models: smaller variants of BART, T5, and GPT-2, fine-tuned directly on publicly available test-sets. As expected, these models excel on the target benchmarks but fail terribly to generalize to comparable unseen testing sets. We then examine task specific simple paraphrase-based safeguarding strategies to mitigate the impact of data leakage and evaluate their effectiveness and limitations. Our findings underscore three key points: (i) high leaderboard performance on limited open, static benchmarks may not reflect real-world utility; (ii) private or dynamically generated benchmarks should complement open benchmarks to maintain evaluation integrity; and (iii) a reexamination of current benchmarking practices is essential for reliable and trustworthy LM assessment.

2511.10254 2026-06-05 cs.CV

Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis

Facial-R1: 通过推理与识别对齐实现面部情绪分析

Jiulong Wu, Yucheng Shen, Lingyong Yan, Haixin Sun, Deguo Xia, Jizhou Huang, Min Cao

AI总结 本文提出Facial-R1框架,通过三阶段对齐方法解决面部情绪分析中推理与识别不一致及推理幻觉的问题,并引入FEA-20K基准数据集,验证了其在多个标准基准上的最佳性能。

Comments Withdrawn by the authors due to pending intellectual property considerations. The authors have determined that the current version contains material that should not have been publicly disseminated at this stage

详情
AI中文摘要

Facial Emotion Analysis (FEA) extends traditional facial emotion recognition by incorporating explainable, fine-grained reasoning. The task integrates three subtasks: emotion recognition, facial Action Unit (AU) recognition, and AU-based emotion reasoning to model affective states jointly. While recent approaches leverage Vision-Language Models (VLMs) and achieve promising results, they face two critical limitations: (1) hallucinated reasoning, where VLMs generate plausible but inaccurate explanations due to insufficient emotion-specific knowledge; and (2) misalignment between emotion reasoning and recognition, caused by fragmented connections between observed facial features and final labels. We propose Facial-R1, a three-stage alignment framework that effectively addresses both challenges with minimal supervision. First, we employ instruction fine-tuning to establish basic emotional reasoning capability. Second, we introduce reinforcement training guided by emotion and AU labels as reward signals, which explicitly aligns the generated reasoning process with the predicted emotion. Third, we design a data synthesis pipeline that iteratively leverages the prior stages to expand the training dataset, enabling scalable self-improvement of the model. Built upon this framework, we introduce FEA-20K, a benchmark dataset comprising 17,737 training and 1,688 test samples with fine-grained emotion analysis annotations. Extensive experiments across eight standard benchmarks demonstrate that Facial-R1 achieves state-of-the-art performance in FEA, with strong generalization and robust interpretability.

英文摘要

Facial Emotion Analysis (FEA) extends traditional facial emotion recognition by incorporating explainable, fine-grained reasoning. The task integrates three subtasks: emotion recognition, facial Action Unit (AU) recognition, and AU-based emotion reasoning to model affective states jointly. While recent approaches leverage Vision-Language Models (VLMs) and achieve promising results, they face two critical limitations: (1) hallucinated reasoning, where VLMs generate plausible but inaccurate explanations due to insufficient emotion-specific knowledge; and (2) misalignment between emotion reasoning and recognition, caused by fragmented connections between observed facial features and final labels. We propose Facial-R1, a three-stage alignment framework that effectively addresses both challenges with minimal supervision. First, we employ instruction fine-tuning to establish basic emotional reasoning capability. Second, we introduce reinforcement training guided by emotion and AU labels as reward signals, which explicitly aligns the generated reasoning process with the predicted emotion. Third, we design a data synthesis pipeline that iteratively leverages the prior stages to expand the training dataset, enabling scalable self-improvement of the model. Built upon this framework, we introduce FEA-20K, a benchmark dataset comprising 17,737 training and 1,688 test samples with fine-grained emotion analysis annotations. Extensive experiments across eight standard benchmarks demonstrate that Facial-R1 achieves state-of-the-art performance in FEA, with strong generalization and robust interpretability.

2606.06046 2026-06-05 math.NA cs.LG cs.NA

Learning solution operators of PDEs with sparse approximation methods

用稀疏逼近方法学习PDE的解算子

Sebastian Neumayer, Daniel Potts, Fabian Taubert

AI总结 本文提出一种结合乘积基展开与正交匹配追踪的稀疏高维方法,用于逼近偏微分方程的解算子,显著减少所需样本量,并在数值实验中与立方体稀疏逼近和傅里叶神经算子对比,展示了在稀疏表示下的准确性和可解释性。

详情
AI中文摘要

我们研究了使用稀疏高维技术逼近偏微分方程(PDE)解算子的问题。基于维度增量框架,我们将乘积基展开与稀疏恢复方法(特别是正交匹配追踪(OMP))相结合,与先前考虑的基于立方体的方法相比,大幅减少了所需样本量。我们在多个数值示例上评估了所得方法,在准确性、运行时间和样本量方面与基于立方体的稀疏逼近和傅里叶神经算子进行了比较。实验表明,我们的方法相对于其前身显著减少了所需的PDE求解次数,同时保持了有竞争力的准确性,特别是当解在所选基中具有稀疏表示时。此外,恢复的稀疏索引集为相关变量和参数交互提供了可解释的见解。

英文摘要

We investigate the approximation of solution operators for partial differential equations (PDEs) using sparse high-dimensional techniques. Building on a dimension-incremental framework, we combine product basis expansions with sparse recovery methods, specifically orthogonal matching pursuit (OMP), to substantially reduce the required sample size compared with a previously considered cubature-based approach. We evaluate the resulting method numerically on several examples, comparing it against both cubature-based sparse approximation and Fourier neural operators in terms of accuracy, runtime, and sample size. The experiments show that our approach considerably reduces the number of required PDE solves relative to its predecessor while maintaining competitive accuracy, particularly when the solution admits a sparse representation in the chosen basis. Furthermore, the recovered sparse index sets yield interpretable insights into the relevant variables and parameter interactions.

2602.23665 2026-06-05 cs.IR cs.LG cs.SI

Geodesic Semantic Search: Cartographic Navigation of Citation Graphs with Learned Local Riemannian Maps

测地语义搜索:基于学习局部黎曼度量的引文图导航

Brandon Yee, Lucas Wang, Kundana Kommini

AI总结 本文提出Geodesic Semantic Search (GSS),通过在引文图上学习节点特定的黎曼度量,实现几何感知的语义检索。不同于传统基于嵌入的检索依赖固定欧几里得距离,GSS在每个节点学习低秩度量张量,诱导局部正定度量,从而在保持模型可计算性的同时保证有效度量。检索过程通过多源Dijkstra算法在学习的测地距离上进行,随后通过最大边际相关性重排序和路径一致性过滤。在包含169,000篇arXiv论文的引文预测基准上,GSS在Recall@20上比SPECTER+FAISS基线提升了23%。我们提供了Bridge Recovery Guarantee,描述了测地检索在定性上优于直接相似性的情况,以及训练损失与检索质量的边际分离结果,并刻画了低秩度量参数化的表达能力。我们的分层粗到细检索方法结合k-means池化,将计算成本降低4倍,同时保持97%的检索质量。

Comments Substantial Revision Required

详情
AI中文摘要

我们提出了Geodesic Semantic Search (GSS),一种检索系统,通过在引文图上学习节点特定的黎曼度量,以实现几何感知的语义检索。不同于标准基于嵌入的检索依赖固定欧几里得距离,\gss{}在每个节点学习一个低秩度量张量$\mL_i \in \R^{d imes r}$,诱导一个局部正定度量$\mG_i = \mL_i \mL_i^ op + \eps \mI$。这种参数化保证了有效的度量,同时保持模型的可计算性。检索过程通过在学习的测地距离上进行多源Dijkstra算法,随后通过最大边际相关性重排序和路径一致性过滤。在包含169,000篇arXiv论文的引文预测基准上,GSS在Recall@20上比SPECTER+FAISS基线提高了23%。我们提供了Bridge Recovery Guarantee,描述了测地检索在定性上优于直接相似性的情况,以及训练损失与检索质量的边际分离结果,并刻画了低秩度量参数化的表达能力。我们的分层粗到细检索方法结合k-means池化,将计算成本降低4倍,同时保持97%的检索质量。

英文摘要

We present Geodesic Semantic Search (GSS), a retrieval system that learns node-specific Riemannian metrics on citation graphs to enable geometry-aware semantic search. Unlike standard embedding-based retrieval that relies on fixed Euclidean distances, \gss{} learns a low-rank metric tensor $\mL_i \in \R^{d \times r}$ at each node, inducing a local positive semi-definite metric $\mG_i = \mL_i \mL_i^\top + \eps \mI$. This parameterization guarantees valid metrics while keeping the model tractable. Retrieval proceeds via multi-source Dijkstra on the learned geodesic distances, followed by Maximal Marginal Relevance reranking and path coherence filtering. On citation prediction benchmarks with 169K arXiv papers, GSS achieves 23\% relative improvement in Recall@20 over SPECTER+FAISS baselines. We provide a Bridge Recovery Guarantee characterizing when geodesic retrieval qualitatively outperforms direct similarity, a margin separation result connecting training loss to retrieval quality, and characterize the expressiveness of low-rank metric parameterization. Our hierarchical coarse-to-fine search with k-means pooling reduces computational cost by $4\times$ while maintaining 97\% retrieval quality.

2603.10457 2026-06-05 physics.plasm-ph cond-mat.stat-mech cs.LG physics.acc-ph

Beam-Plasma Collective Oscillations in Intense Charged-Particle Beams: Dielectric Response Theory, Langmuir Wave Dispersion, and Unsupervised Detection via Prometheus

强流带电粒子束中束-等离子体集体振荡:介电响应理论、朗缪尔波色散以及通过Prometheus的无监督检测

Brandon Yee, Wilson Collins, Michael Iofin, Jiayi Fu

AI总结 本文研究了强流带电粒子束中束-等离子体集体振荡的理论和计算框架,通过介电响应理论、朗缪尔波色散关系以及Prometheus算法验证了束-等离子体过渡的特性,展示了其在中间能区的应用前景。

Comments Substantial Revision Required

详情
AI中文摘要

我们开发了一个理论和计算框架,用于研究强流带电粒子束在中间能量(10-100 MeV)下的束-等离子体集体振荡。在第一部分,我们建立了由Vlasov-Poisson系统支配的动能场理论,推导出三种束分布函数的Lindhard介电函数和随机相位近似(RPA)极化张量。我们通过介电函数epsilon(omega,q)=0证明了临界束密度n_c以上的未阻尼朗缪尔波模式的存在,获得了显式的束-等离子体色散关系,并表明Landau阻尼在粒子-空穴连续谱之上消失。等离子体频率Omega_p^2 = ne^2/(m*epsilon_0)通过f求和规则固定,与分布形状无关;更高的色散系数取决于速度矩。空间电荷效应驱动异常束展宽,具有sqrt(n-n_c)起始和q=2k_F处的Friedel振荡。束-等离子体过渡通过重整化群分析属于三维Ising普遍性类。在第二部分,我们利用Prometheus验证这些预测,Prometheus是基于静态结构因子数据S(q)训练的beta-VAE。Prometheus检测到高斯和均匀分布中的集体等离子体振荡起始,确认在退相干费米气体(n_c->0)中不存在,且在q=2k_F处解析了Kohn异常。通过PIC模拟得到的S(q,omega)色散分析验证了由f求和规则预测的分布无关的Omega_p。所有六个验证检查均通过。预测的特征——密度可调的等离子体共振在omega_p与sqrt(n)成正比、异常束展宽具有sqrt(n-n_c)起始以及Friedel振荡——在现有的中间能区束设施中是可访问的。

英文摘要

We develop a theoretical and computational framework for beam-plasma collective oscillations in intense charged-particle beams at intermediate energies (10-100 MeV). In Part I, we formulate a kinetic field theory governed by the Vlasov-Poisson system, deriving the Lindhard dielectric function and random phase approximation (RPA) polarization tensor for three beam distribution functions. We prove via the dielectric function epsilon(omega,q)=0 the existence of undamped Langmuir wave modes above a critical beam density n_c, obtain explicit beam-plasma dispersion relations, and show that Landau damping vanishes above the particle-hole continuum. The plasma frequency Omega_p^2 = ne^2/(m*epsilon_0) is fixed by the f-sum rule independently of distribution shape; higher dispersion coefficients depend on velocity moments. Space charge effects drive anomalous beam broadening with sqrt(n-n_c) onset and Friedel oscillations at q=2k_F. The beam-plasma transition belongs to the 3D Ising universality class via renormalization group analysis. In Part II, we validate these predictions using Prometheus, a beta-VAE trained on static structure factor data S(q) from particle-in-cell (PIC) beam simulations. Prometheus detects collective plasma oscillation onset in Gaussian and uniform distributions, confirms their absence in the degenerate Fermi gas (n_c -> 0), and resolves the Kohn anomaly at q=2k_F. Dispersion analysis of S(q,omega) from PIC simulations verifies the distribution-independent Omega_p predicted by the f-sum rule. All six validation checks pass. Predicted signatures -- density-tunable plasma resonances at omega_p proportional to sqrt(n), anomalous beam broadening with sqrt(n-n_c) onset, and Friedel oscillations -- are accessible at existing intermediate-energy beam facilities.

2606.06495 2026-06-05 astro-ph.CO

What it takes to solve the Hubble tension through Modifications of Cosmological Recombination II: in light of ACT DR6 and DESI DR2

通过修改宇宙学重组解决哈勃张力需要什么 II:基于 ACT DR6 和 DESI DR2

Nanoom Lee, Tianji Zhou

AI总结 基于 ACT DR6 和 DESI DR2 数据,通过时变电子质量 $m_e(z)$ 寻找最小修改以解决哈勃张力,发现仅用 CMB 数据可完全解决,但加入 DESI BAO 后无法完全解决。

Comments 7+3 pages, 5 figures. Comments are welcome

详情
AI中文摘要

我们基于来自阿塔卡马宇宙学望远镜(ACT DR6)和暗能量光谱仪(DESI DR2)的最新数据,构建了数据驱动的哈勃张力解决方案。我们通过时变电子质量 $m_e(z)$ 寻找对重组历史的最小修改,该修改使从 CMB 数据推断的最佳拟合 $H_0$ 向 SH0ES 值增加,同时不恶化数据拟合。使用包括透镜效应的 Planck 和 ACT 数据,我们发现对 $m_e(z)$ 的微扰修改完全解决了哈勃张力,该解与之前仅使用 Planck 数据的工作具有相同的定性振荡结构,表明其对包含更精确和独立的 CMB 数据的鲁棒性。作为副产品,该解也缓解了 $S_8$ 张力。然而,一旦加入 DESI DR2 BAO 数据,对 $m_e(z)$ 的微扰修改无法完全解决哈勃张力。这反映了相同的基本限制:通过修改重组提高 $H_0$ 通常会降低 $\Omega_m$,与晚期宇宙学观测不一致。

英文摘要

We construct data-driven solutions to the Hubble tension, in light of recent data from the Atacama Cosmology Telescope (ACT DR6) and the Dark Energy Spectroscopic Instrument (DESI DR2). We search for the minimal modification to the recombination history through a time-varying electron mass $m_e(z)$ that increases the best-fit $H_0$ inferred from CMB data toward the SH0ES value, without worsening the fit to the data. Using Planck and ACT data including lensing, we find a perturbative modification to $m_e(z)$ that fully resolves the Hubble tension, with the solution sharing the same qualitative oscillatory structure as in previous work using Planck data alone, demonstrating its robustness to the inclusion of more precise and independent CMB data. As a byproduct, the solution also eases the $S_8$ tension. Once DESI DR2 BAO data are added, however, perturbative modifications to $m_e(z)$ cannot fully resolve the Hubble tension. This reflects the same fundamental limitation: raising $H_0$ by modifying recombination generically lowers $Ω_m$, being inconsistent with late-time cosmological observations.