arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.29635 2026-05-29 math.OC cs.LG

MoSSP: A Momentum-Based Single-Loop Stochastic Penalty Method for Nonconvex Constrained DC-Regularized Optimization

MoSSP: 基于动量的单环随机惩罚方法用于非凸约束DC正则化优化

Luxuan Li, Chunfeng Cui, Xiao Wang

AI总结提出MoSSP算法，一种基于动量的单环随机惩罚方法，用于解决具有非凸约束和DC正则化的随机优化问题，实现了O(ε^{-4})和O(ε^{-3})的oracle复杂度。

Comments 35 pages, 3 figures

详情

AI中文摘要

本文研究了一类具有差凸（DC）正则化的非凸约束随机问题，其中可行集可能是非凸的，且DC正则化子的凹部分允许非光滑。基本挑战在于在保持非凸约束可行性的同时实现良好的oracle复杂度。尽管单环算法能有效解决无约束DC优化问题，但它们在具有DC结构的约束优化中的潜力尚未被充分探索。为填补这一空白，我们开发了MoSSP，一种基于动量的单环随机惩罚方法，用于此类问题，并具有可证明的复杂度保证。关键思想是将单个随机近端梯度步骤应用于惩罚的Moreau包络加上凸DC部分，同时并行计算凹部分的近端映射。我们推导了两种算法变体：一种具有O(ε^{-4}) oracle复杂度的Polyak动量版本，用于寻找随机ε-KKT点，以及一种改进的O(ε^{-3})版本，结合了递归动量。实验结果证明了所提算法的有效性。

英文摘要

In this paper, we study a structured class of nonconvex constrained stochastic problems with difference-of-convex (DC) regularization, where the feasible set is possibly nonconvex and the concave part of the DC regularizer is allowed to be nonsmooth. The fundamental challenge lies in maintaining feasibility for nonconvex constraints while achieving favorable oracle complexity. Although single-loop algorithms efficiently solve unconstrained DC optimization problems, their potential for constrained optimization with DC structure remains largely unexplored. To address this gap, we develop MoSSP, a Momentum-based Single-loop Stochastic Penalty method for such problems with provable complexity guarantees. The key idea is to apply a single stochastic proximal-gradient step to the Moreau envelope of the penalty plus the convex DC part, with the concave part's proximal mapping computed in parallel. We derive two algorithm variants: a Polyak-momentum version with $O(\varepsilon^{-4})$ oracle complexity for finding stochastic $\varepsilon$-KKT points, and an improved $O(\varepsilon^{-3})$ version incorporating recursive momentum. Experimental results demonstrate the effectiveness of the proposed algorithms.

URL PDF HTML ☆

赞 0 踩 0

2605.29613 2026-05-29 eess.AS cs.SD

Decoding Strategies for Diffusion-Based ASR: A Systematic Evaluation of Confidence-Based Thresholding

基于扩散的ASR解码策略：基于置信度阈值的系统评估

Jeong Hun Yeo, Minsu Kim, Hyeongseop Rha, Yong Man Ro

AI总结本文系统评估了基于扩散语言模型的ASR中三种解码策略，提出使用基于负对数似然的不确定性度量来监控解码进度，发现基于阈值的策略在准确率和速度上均优于固定步数策略，其中静态阈值策略在匹配自回归解码准确率的同时具有更高效率。

详情

AI中文摘要

虽然基于LLM的自动语音识别（ASR）实现了高准确率，但其速度受限于顺序自回归解码。扩散语言模型（DLM）提供了一种并行替代方案，然而其解码策略在ASR场景中尚未得到充分探索。本文分析了三种用于DLM-based ASR的解码方案：固定步数、静态置信度阈值和动态置信度阈值。我们提出使用基于负对数似然的不确定性度量作为解码进度的代理来测量逐轮准确率。结果表明，基于阈值的策略在准确率和速度上均显著优于固定步数方案。我们将此归因于ASR独有的特性：大多数token在早期就达到高置信度，从而可以积极收集可靠token，仅将困难token留到后续轮次。值得注意的是，静态阈值策略在匹配自回归解码准确率的同时提供了更高的效率。

英文摘要

While LLM-based Automatic Speech Recognition (ASR) achieves high accuracy, its speed is limited by sequential autoregressive decoding. Diffusion Language Models (DLMs) offer a parallel alternative, yet their decoding strategies remain under-explored in ASR contexts. This paper analyzes three decoding schemes for DLM-based ASR: fixed-number, static confidence threshold, and dynamic confidence threshold. We propose measuring round-wise accuracy using Negative Log-Likelihood-based uncertainty as a proxy for decoding progress. Our results show that both threshold-based strategies significantly outperform fixed-number schemes in accuracy and speed. We attribute this to a property unique to ASR: most tokens reach high confidence early, allowing reliable ones to be harvested aggressively while leaving only difficult tokens for later rounds. Notably, the static-threshold strategy matches the accuracy of autoregressive decoding while offering superior efficiency.

URL PDF HTML ☆

赞 0 踩 0

2605.29612 2026-05-29 cs.MA cs.CL

CONCAT: Consensus- and Confidence-Driven Ad Hoc Teaming for Efficient LLM-Based Multi-Agent Systems

CONCAT: 基于共识与置信驱动的即席团队协作以实现高效的基于LLM的多智能体系统

Ziyang Ma, Dingyi Zhang, Sichu Liang, Jiajia Chu, Pengfei Xia, Hui Zang, Deyu Zhou

AI总结提出一种无需训练的共识与置信驱动即席团队协作框架CONCAT，通过聚类初始答案、选择高置信领导者并基于心智理论预测协作收益来动态组织多智能体交互，显著提升效率并降低延迟。

详情

AI中文摘要

尽管基于大型语言模型的多智能体系统在解决复杂任务和实现比单智能体系统更高的性能方面显示出能力，但由于智能体之间的密集通信，它们导致了巨大的计算开销。先前的研究致力于训练稀疏多智能体图或微调规划器以更好地编排工作流程。然而，这些额外的训练过程引入了计算成本，并将多智能体系统限制在特定领域，从而损害了其泛化能力。在本文中，我们提出了CONCAT，一种基于共识和置信驱动的即席团队协作的无训练多智能体协作框架，以高效组织智能体交互。具体来说，智能体根据其初始答案进行聚类，并根据智能体的置信度选择每个聚类的领导者。然后，基于心智理论设计启发式函数，根据领导者的答案和置信度预测每两个领导者之间的协作收益。最后，在根据预测收益驱逐一定比例的通信后，组织一个即席多智能体网络。在三个LLM和三个基准上的实验表明，CONCAT比LLM-Debate实现了高达2.02倍的效率（准确率/延迟比），并优于诸如AgentDropout等训练感知方法，同时在Qwen2.5-14B-Instruct上将平均延迟降低了50.1%，且无需任何任务特定训练。

英文摘要

Although large language model (LLM) based multi-agent systems (MAS) show their capability to solve complex tasks and achieve higher performance over single agent systems, they lead to huge computational overheads because of heavy communication between agents. Previous research has made efforts to train a sparse multi-agent graph or fine-tune a planner to orchestrate the workflow better. However, such extra training processes introduce computational costs and limit MAS to specific domains, therefore compromising their generalizability. In this paper, we propose CONCAT, a training-free multi-agent collaboration framework based on CONsensus and Confidence-driven Ad hoc Teaming to efficiently organize agent interactions. Specifically, agents are clustered based on their initial answers, and leaders of each cluster are selected based on the agents' confidence. Then, a heuristic function based on the Theory of Mind is designed to predict the collaboration benefits between every two leaders according to their answers and confidence. Finally, an ad hoc multi-agent network is organized after evicting a percentage of communications based on the predicted benefits. Experiments across three LLMs and three benchmarks show that CONCAT achieves up to 2.02x higher efficiency (accuracy/latency ratio) than LLM-Debate and outperforms training-aware methods such as AgentDropout, while reducing average latency by 50.1% on Qwen2.5-14B-Instruct, without any task-specific training.

URL PDF HTML ☆

赞 0 踩 0

2605.29587 2026-05-29 q-bio.QM cs.LG

FPLIER: Federated Pathway-Level Information Extractor

FPLIER：联邦通路级信息提取器

Daniele Malpetti, Christian Berchtold, Francesco Gualdi, Marco Scutari, Laura Azzimonti, Francesca Mangili

AI总结提出联邦学习框架FPLIER，通过安全聚合实现分布式基因表达数据上的通路级因子分解，并证明隐私风险由训练表达矩阵的秩决定。

Comments Accepted for publication at the ACM BCB '26 conference

详情

DOI: 10.1145/3807503.3819364

AI中文摘要

在转录组学中，通路级信息提取器（PLIER）等基因集感知因子分解方法在大型异质性表达数据集上训练时效果最佳。然而，由于隐私和治理限制，许多临床相关队列无法合并为单个数据集。我们提出FPLIER，这是PLIER的联邦扩展，能够在多个数据持有者之间进行分布式训练，同时整合公开可用数据集。通过安全聚合，FPLIER产生的训练更新在代数上等价于集中式池化数据方法，同时保持表达数据的本地性。我们在两个模拟联盟（来自K-CLIER和MultiPLIER研究）的多个场景中评估FPLIER，并展示其稳定收敛。我们进一步对针对中间训练统计量和发布模型的成员推断攻击进行了系统分析。结果表明，隐私风险由训练表达矩阵的秩决定。整合公开数据或降低数据维度会增加该秩，使系统趋向满秩状态，在此状态下训练样本与非训练样本对攻击者而言难以区分，成员推断性能接近随机猜测。

英文摘要

In transcriptomics, gene-set-aware factorization methods such as the Pathway Level Information Extractor (PLIER) are most effective when trained on large, heterogeneous expression compendia. Yet, many clinically relevant cohorts cannot be pooled into a single dataset due to privacy and governance constraints. We present FPLIER, a federated extension of PLIER that enables distributed training across multiple data holders while incorporating publicly available datasets. Through secure aggregation, FPLIER produces training updates algebraically equivalent to those of a centralized pooled-data approach while keeping expression data local. We evaluate FPLIER across multiple scenarios in two simulated consortia (from the K-CLIER and MultiPLIER studies) and demonstrate stable convergence. We further conduct a systematic analysis of membership inference attacks targeting both intermediate training statistics and the released model. Our results show that privacy risk is governed by the rank of the training expression matrix. Incorporating public data or reducing data dimensionality increases this rank, moving the system toward a full-rank regime in which training and non-training samples become indistinguishable to the attacker, and membership-inference performance approaches random guessing.

URL PDF HTML ☆

赞 0 踩 0

2605.28327 2026-05-29 stat.ML cs.LG q-fin.RM stat.AP

Insurance Pricing Optimization via Off-Policy Evaluation

通过离线策略评估进行保险定价优化

Sascha Günther, Dimitri Semenovich, Mario V. Wüthrich

AI总结本文提出基于离线策略评估和随机控制的保险定价方法，利用核化逆倾向得分估计器降低方差，并通过数据共享Lasso和神经网络两种策略优化方法实现最优定价。

详情

AI中文摘要

传统保险定价依赖于基于风险的原则，确保精算公平和偿付能力，但未明确考虑投保人的价格敏感性。我们将保险定价表述为一个决策问题，并使用离线策略评估和随机控制的工具进行研究。我们提出了一种核化逆倾向得分估计器，该估计器利用动作空间中的局部结构，与经典逆倾向得分估计器相比实现了方差减少。基于这些价值估计，我们研究了策略优化，并提出了两种计算最优定价规则的实用方法：一种可解释的数据共享Lasso公式和一种基于神经网络的灵活策略参数化。通过使用受控的合成旅行保险环境，我们实证验证了理论结果，并表明神经网络在策略优化方面优于现有技术。

英文摘要

Traditional insurance pricing relies on risk-based principles that ensure actuarial fairness and solvency but do not explicitly account for policyholders' price sensitivity. We formulate insurance pricing as a decision-making problem and study it using tools from off-policy evaluation and stochastic control. We propose a kernelized inverse propensity score estimator that exploits local structure in the action space and yields variance reduction compared to the classical inverse propensity score estimator. Building on these value estimates, we investigate policy optimization and present two practical approaches for computing optimal pricing rules: an interpretable data-shared Lasso formulation and a flexible policy parameterization based on neural networks. Using a controlled synthetic travel insurance environment, we empirically confirm the theoretical results and show that neural networks outperform existing techniques for policy optimization.

URL PDF HTML ☆

赞 0 踩 0

2605.26156 2026-05-29 cs.CR cs.AI cs.LG

Turning Bias into Bugs: Bandit-Guided Style Manipulation Attacks on LLM Judges

将偏见转化为漏洞：基于Bandit引导的LLM裁判风格操纵攻击

Xianglin Yang, Bryan Hooi, Gelei Deng, Tianwei Zhang, Jin Song Dong

AI总结提出BITE黑盒对抗框架，将风格编辑选择建模为上下文Bandit问题，通过LinUCB策略自适应选择编辑以误导LLM裁判并人为提高评分，攻击成功率超65%。

Comments Accepted to the Forty-Third International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

已知LLM裁判中的风格偏见，例如对冗长或特定句子结构的偏好，构成了一个未被充分探索的安全漏洞。在这项工作中，我们引入了BITE（偏见探索与利用），一个黑盒对抗框架，学习保持语义的编辑以误导LLM裁判并人为提高其分配的分数。我们将风格编辑的选择建模为上下文Bandit问题，并使用LinUCB策略自适应地选择编辑，以最大化裁判的分数，而无需访问模型参数或梯度。实验上，我们在多种LLM裁判和任务上测试了BITE，包括聊天机器人排行榜和AI审稿人基准上的逐点和成对比较。BITE实现了超过65%的攻击成功率，并在9分制上将分数提高了1-2分，同时保持了语义等价性。我们进一步评估了攻击的隐蔽性，表明BITE规避了标准的风格控制方法和几种检测基线。我们的发现暴露了LLM作为裁判范式的一个根本弱点，并激励了鲁棒的、对抗感知的评估。我们的代码可在https://github.com/xianglinyang/llm-as-a-judge-attack获取。

英文摘要

The known stylistic biases in LLM judges, such as a preference for verbosity or specific sentence structures, present an underexplored security vulnerability. In this work, we introduce BITE (BIas exploraTion and Exploitation), a black-box adversarial framework that learns semantics-preserving edits to mislead an LLM judge and artificially inflate the scores it assigns. We cast the selection of stylistic edits as a contextual bandit problem and use a LinUCB policy to adaptively choose edits that maximize the judge's score without access to model parameters or gradients. Empirically, we test BITE across a diverse range of LLM judges and tasks, including both pointwise and pairwise comparisons on chatbot leaderboards and AI-reviewer benchmarks. BITE achieves an attack success rate exceeding 65% and raises scores by 1-2 points on a 9-point scale, all while preserving semantic equivalence. We further assess the attack's stealthiness, showing that BITE evades standard style-control methods and several detection baselines. Our findings expose a fundamental weakness in the LLM-as-a-judge paradigm and motivate robust, attack-aware evaluation. Our code is available at https://github.com/xianglinyang/llm-as-a-judge-attack.

URL PDF HTML ☆

赞 0 踩 0

2605.25975 2026-05-29 cs.GR cs.CV

HyperBones: 基于超网络调节的实时骨骼驱动神经服装模拟

Astitva Srivastava, Hsiao-Yu Chen, Ryan Goldade, Philipp Herholz, Zhongshi Jiang, Gene Wei-Chin Lin, Lingchen Yang, Nikolaos Sarafianos, Tuur Stuyck, Doug Roble, Avinash Sharma, Egor Larionov

AI总结提出一种结合虚拟骨骼驱动粗粒度模拟和卷积神经映射恢复细粒度褶皱的实时神经服装模拟方法，通过超网络调节实现高效物理监督，无需外部模拟器。

详情

AI中文摘要

服装模拟的最新进展使高质量结果更接近实时性能。基于物理的模拟器可以产生精确的运动，但对于交互式应用而言计算成本仍然过高。相比之下，线性混合蒙皮效率高，但无法捕捉宽松服装的复杂动态，常常导致不真实的运动和视觉伪影。神经方法提供了一种有前景的替代方案，但在严格的运行时约束下仍难以合理动画化宽松衣物。我们提出了一种快速且物理上合理的动态服装模拟方法。我们的方法训练了一个由独立的粗粒度和细粒度组件组成的降维神经动力学模拟器。在粗粒度层面，服装由一组与轻量级神经网络集成的虚拟骨骼驱动。然后使用训练好的卷积神经映射恢复细粒度的褶皱细节。通过将身份特定计算与实时神经集成解耦，我们的架构在支持多样化的体型和运动的同时保持了高性能。我们进一步引入了一种有效的物理监督方案，无需依赖外部模拟器即可获得准确结果。实验表明，我们的方法产生了物理上合理的服装动态，能够泛化到各种运动和体型，并支持固定服装集。我们的模拟器在商用GPU上以300+ FPS运行，使其适用于实时应用。

英文摘要

Recent advances in garment simulation have brought high-quality results closer to real-time performance. Physics-based simulators can produce accurate motion, but remain too computationally expensive for interactive applications. In contrast, linear blend skinning is efficient, but cannot capture the complex dynamics of loose-fitting garments, often leading to unrealistic motion and visual artifacts. Neural methods offer a promising alternative, yet they still struggle to animate loose clothing plausibly under strict runtime constraints. We present a fast and physically plausible approach for dynamic garment simulation. Our method trains a reduced-space neural dynamics simulator composed of independent coarse- and fine-level components. At the coarse level, the garment is driven by a set of virtual bones integrated with a lightweight neural network. Fine-scale wrinkle details are then recovered using a trained convolutional neural map. By decoupling identity-specific computation from real-time neural integration, our architecture maintains high performance while supporting diverse body shapes and motions. We further introduce an effective physics-supervision scheme that enables accurate results without relying on an external simulator. Experiments show that our method produces physically plausible garment dynamics, generalizes across a range of motions and body shapes, and supports a fixed set of garments. Our simulator runs at 300+ FPS on a commodity GPU, making it suitable for real-time applications.

URL PDF HTML ☆

赞 0 踩 0

2605.16825 2026-05-29 cs.IR cs.AI

Echoes in Filter Bubble: Diagnosing and Curing Popularity Bias in Generative Recommenders

过滤气泡中的回声：诊断与治愈生成式推荐系统中的流行度偏差

Jun Yin, Bangguo Zhu, Peng Huo, Ruochen Liu, Hao Chen, Senzhang Wang, Shirui Pan, Chengqi Zhang

AI总结本文通过理论分析发现生成式推荐系统中的流行度偏差源于令牌级优化缺陷和物品分词的无差别性，并设计了非对称不相似度优化和基于骨架的分词方法（Ghost系统）来缓解偏差。

详情

AI中文摘要

最近，以统一端到端框架为特征的生成式推荐系统（GRs）在转变推荐范式方面展现出惊人的潜力。尽管有效，但我们认识到GRs仍然容易受到长期存在的流行度偏差问题的影响，该问题一直困扰着推荐社区。虽然少数研究尝试将传统的去偏方法扩展到GRs，但其效果有限，且GRs遭受流行度偏差的根本原因仍未得到充分探索。为弥补这一空白，本研究聚焦于GRs中的两个核心方面：生成框架的优化和基于语义索引的物品分词。基于理论分析，我们识别出严重的流行度偏差源于令牌级优化缺陷和物品分词的无差别性共同作用。据此，本研究通过设计非对称不相似度优化和基于骨架的分词，开发了一种名为Ghost的新型生成式推荐系统。在三个数据集上进行的广泛实证评估，与多个SOTA基线相比，表明Ghost显著缓解了流行度偏差并促进了更公平的推荐，同时仅对整体推荐效用造成轻微下降。

英文摘要

Recently, Generative Recommenders (GRs), characterized by a unified end-to-end framework, have exhibited astonishing potential in transforming the recommendation paradigm. Despite their effectiveness, we recognize that GRs are still susceptible to the long-standing issue of popularity bias that has pervaded the recommendation community. Although a few studies have attempted to extend traditional debiasing methods to GRs, their effectiveness is marginal, and the fundamental reason why GRs suffer from popularity bias remains under-explored. To bridge this gap, this study focuses on two core aspects in GRs: the optimization of generative framework and the item tokenization based on semantic index. Based on theoretical analyses, we identify that the severe popularity bias emerges from the confluence of a token-level optimization flaw and the undifferentiated property of item tokenization. Accordingly, this study develops a novel generative recommender system, called Ghost, by designing the asymmetric unlikelihood optimization and the skeleton-founded tokenization. Extensive empirical evaluations across three datasets, alongside multiple SOTA baselines, reveal that Ghost substantially alleviates popularity bias and promotes fairer recommendations, while incurring slight degradation to the overall recommendation utility.

URL PDF HTML ☆

赞 0 踩 0

2605.07596 2026-05-29 stat.ML cs.LG

A Refined Generalization Analysis for Extreme Multi-class Supervised Contrastive Representation Learning

极端多类监督对比表示学习的精细泛化分析

Nong Minh Hieu, Antoine Ledent

AI总结针对对比表示学习在有限标注数据中构造元组导致依赖性的问题，提出改进的U-统计量分析，得到与类别数R同阶的样本复杂度，并设计新估计器在长尾分布下实现O(k)的样本复杂度。

Comments Accepted at ICML 2026

详情

AI中文摘要

Bridge-RAG：一种基于抽象桥树的检索增强生成算法

Zihang Li, Wenjun Liu, Yikun Zong, Jiawen Tao, Siying Dai, Songcheng Ren, Zirui Liu, Yuhang Wang, Yanbing Jiang, Tong Yang

AI总结针对检索增强生成中准确性和效率的挑战，提出Bridge-RAG框架，通过抽象桥树结构实现多级检索，并集成布谷鸟过滤器实现O(1)实体查找，在保持高准确率的同时将检索速度提升至1.9倍。

详情

AI中文摘要

作为增强大型语言模型（LLMs）生成质量的重要范式，检索增强生成（RAG）面临着检索准确性和计算效率两方面的挑战。本文提出了一种名为Bridge-RAG的新型RAG框架。为了克服准确性挑战，我们引入了抽象概念来桥接查询实体和文档块，提供了稳健的语义理解。我们将抽象组织成树结构，并设计了多级检索策略以确保包含足够的上下文信息。虽然这种层次化组织显著提高了答案质量，但遍历树以定位包含查询实体的抽象不可避免地引入了额外的检索开销。为了恢复检索效率，我们进一步在CFT-RAG中集成了布谷鸟过滤器，该过滤器提供O(1)实体查找，并且自然适配了我们框架中实体到抽象的路径。大量实验表明，与结构化RAG基线相比，Bridge-RAG在所有指标上均实现了持续的准确性提升，并且检索速度最高提升了1.9倍。

英文摘要

As an important paradigm for enhancing the generation quality of Large Language Models (LLMs), retrieval-augmented generation (RAG) faces the two challenges regarding retrieval accuracy and computational efficiency. This paper presents a novel RAG framework called Bridge-RAG. To overcome the accuracy challenge, we introduce the concept of abstract to bridge query entities and document chunks, providing robust semantic understanding. We organize the abstracts into a tree structure and design a multi-level retrieval strategy to ensure the inclusion of sufficient contextual information. While this hierarchical organization substantially improves answer quality, traversing the tree to locate the abstracts that contain a query entity inevitably introduces additional retrieval overhead. To restore retrieval efficiency, we further integrate the Cuckoo Filter in CFT-RAG, which provides O(1) entity lookup and naturally fits the entity-to-abstract pathway of our framework. Extensive experiments show that Bridge-RAG achieves consistent accuracy improvements across all metrics and up to $1.9\times$ faster retrieval compared to structured RAG baselines.

URL PDF HTML ☆

赞 0 踩 0

2603.20329 2026-05-29 stat.ML cs.LG math.PR

Measure flow path recovery in Bayes Hilbert spaces

贝叶斯希尔伯特空间中的测度流路径恢复

S. David Mis, Maarten V. de Hoop

AI总结针对有限移动局部传感器恢复概率测度流的不适定问题，提出基于贝叶斯希尔伯特框架的变分理论，通过构造最小能量传输实现和线性化观测算子，分析可恢复性条件，并发展有限维约化方法实现稳定重建。

详情

AI中文摘要

我们研究使用贝叶斯希尔伯特框架从有限个移动局部传感器恢复概率测度流的不适定问题。相对于固定的参考概率测度，概率律由其中心化对数比坐标表示，因此演化律成为希尔伯特函数空间中的一条路径。对于足够正则的贝叶斯希尔伯特路径，我们通过在每个时间点求解加权纽曼问题，构造路径的规范最小能量传输实现，得到切方向上的内在传输形式。然后，我们直接在贝叶斯希尔伯特路径空间上制定逆问题。观测算子的线性化产生可观测性形式，可恢复性由其与传输几何通过联合传输-可观测性形式的相互作用决定。在无穷维环境中，我们发展了正则化变分理论，并识别了局部传感器的局限性：移动传感器可以使联合形式单射，但通常不能在整个状态空间上产生强制稳定性估计。这一障碍自然导致有限维贝叶斯希尔伯特约化。在那里，传输形式成为动能张量，线性化观测成为约化感知矩阵，因此可恢复性可以通过显式的格拉姆条件表达。我们证明局部凸起传感器检测每个固定的约化方向，有限个适当放置的静态传感器产生均匀的约化可观测性，并且存在依赖于路径的传感器轨迹，使得即使单个移动传感器也能恢复约化路径。最后，我们证明这些约化恢复结果可以提升到对由所选有限维子空间良好近似的路径的近似环境恢复，从而实现稳定重建至投影误差。

英文摘要

We study the ill-posed problem of recovering a probability measure flow from finitely many moving localized sensors using a Bayes Hilbert framework. Relative to a fixed reference probability measure, a probability law is represented by its centered log-ratio coordinates, so that an evolving law becomes a path in a Hilbert space of functions. For sufficiently regular Bayes Hilbert paths, we construct a canonical minimum-energy transport realization of the path by solving a weighted Neumann problem at each time, yielding an intrinsic transport form on tangent directions. We then formulate an inverse problem directly on Bayes Hilbert path space. Linearization of an observation operator yields an observability form, and recoverability is governed by its interaction with the transport geometry through a joint transport--observability form. In the ambient infinite-dimensional setting, we develop a regularized variational theory and identify limitations of localized sensing: mobile sensors can make the joint form injective, but they do not in general yield a coercive stability estimate on the full state space. This obstruction leads naturally to finite-dimensional Bayes Hilbert reductions. There the transport form becomes a kinetic tensor and the linearized observations become reduced sensing matrices, so recoverability can be expressed through explicit Gramian conditions. We show that localized bump sensors detect every fixed reduced direction, that finitely many suitably placed static sensors yield uniform reduced observability, and there exist path-dependent sensor trajectories such that even a single moving sensor can recover the reduced path. Finally, we show that these reduced recovery results lift to approximate ambient recovery for paths that are well approximated by the chosen finite-dimensional subspaces, yielding stable reconstruction up to projection error.

URL PDF HTML ☆

赞 0 踩 0

2602.20316 2026-05-29 astro-ph.SR cs.CV

Inspectorch: Efficient rare event exploration in solar observations

Inspectorch: 太阳观测中稀有事件的高效探索

C. J. Díaz Baso, I. J. Soler Poquet, C. Kuckein, M. van Noort, N. Poirier

AI总结提出基于流的密度估计模型Inspectorch，用于从高维太阳观测数据中高效识别稀有事件，并聚焦计算资源于极端现象。

Comments Comments: 12+1 pages, 11+2 figures, submitted to A&A

详情

AI中文摘要

扩散可微重采样

Jennifer Rosina Andersson, Zheng Zhao

AI总结针对序贯蒙特卡洛中的可微重采样问题，提出一种基于无训练扩散模型代理的信息性且即时可微的重采样方法，理论证明其一致性，并在多个滤波和参数估计基准上优于现有方法。

Comments In ICML 2026

2510.27663 2026-05-29 eess.IV cs.LG stat.ME stat.ML

Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements

仅从噪声和部分测量中进行成像逆问题的贝叶斯模型选择与误设定检验

Tom Sprunck, Marcelo Pereyra, Tobias Liaudat

AI总结提出一种结合贝叶斯交叉验证与数据分裂的通用方法，用于在无真实数据情况下对成像逆模型进行选择与误设定检测，兼容扩散采样器等贝叶斯成像采样器，计算成本低且准确率高。

详情

AI中文摘要

在线公平分配中的近似比例性

Davin Choo, Winston Fu, Derek Khu, Tzeh Yuan Neoh, Tze-Yang Poon, Nicholas Teh

AI总结研究在线公平分配问题中比例性（PROP1）的可近似性，通过非自适应对手和最大物品价值预测两种松弛方法，设计了具有鲁棒保证的在线算法。

Comments Appears in the 43rd International Conference on Machine Learning (ICML), 2026

详情

AI中文摘要

我们研究在线公平分配问题，其中不可分割的商品按顺序到达，必须立即且不可撤销地分配。先前的工作为近似经典概念（如至多一个商品的嫉妒无妒（EF1）和最大最小份额（MMS））建立了强不可能性结果，但至多一个商品的比例性（PROP1）的可近似性仍未解决。我们分两步解决这一差距。首先，我们展示了三种自然的贪婪分配规则（公平分配中的标准基线）无法保证对自适应对手的任何乘法近似到PROP1。这些局限性激发了两种松弛：（i）将注意力限制在非自适应对手上，以及（ii）在学习增强算法的精神下纳入粗略预测。在非自适应对手下，我们展示了均匀随机分配以高概率实现了有意义的PROP1近似，并且这一保证对于这种方法本质上是紧的；此外，当物品值足够小时，分配以高概率接近PROP1。最后，给定最大物品值（MIV）预测，我们设计了一种在线算法，该算法实现了PROP1的鲁棒近似保证，并在单边预测误差下优雅地退化。相比之下，我们展示了即使有完美的MIV预测，EF1、MMS和PROPX仍然不可近似。

英文摘要

We study the online fair division problem, where indivisible goods arrive sequentially and must be allocated immediately and irrevocably. Prior work establishes strong impossibility results for approximating classic notions such as envy-freeness up to one good (EF1) and maximin share (MMS) in this setting, but the approximability of proportionality up to one good (PROP1) has remained unresolved. We resolve this gap in two steps. First, we show that three natural greedy allocation rules (standard baselines in fair division) fail to guarantee any multiplicative approximation to PROP1 against an adaptive adversary. These limitations motivate two relaxations: (i) restricting attention to a non-adaptive adversary, and (ii) incorporating coarse predictions in the spirit of learning-augmented algorithms. Under a non-adaptive adversary, we show that the uniform random allocation achieves a meaningful PROP1 approximation with high probability, and this guarantee is essentially tight for this approach; moreover, when item values are sufficiently small, the allocation is near-PROP1 with high probability. Finally, given maximum item value (MIV) predictions, we design an online algorithm that achieves robust approximation guarantees for PROP1, and degrades gracefully under one-sided prediction error. In contrast, we show that EF1, MMS, and PROPX remain inapproximable even with perfect MIV predictions.

URL PDF HTML ☆

赞 0 踩 0