arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4035
2603.04474 2026-05-12 cs.MA cs.AI

From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

Yizhe Xie, Congcong Zhu, Xinyue Zhang, Tianqing Zhu, Dayong Ye, Minfeng Qi, Huajie Chen, Wanlei Zhou

AI总结 本文研究了基于大语言模型的多智能体系统(LLM-MAS)中错误逐步扩散并导致系统性共识偏差的问题,提出了一种基于传播动力学的模型,用于分析和识别错误扩大的风险。通过实验,作者发现了三种主要的系统脆弱性,并设计了一种基于基因图谱的治理层,作为消息层插件,有效抑制内外部错误的传播,实验表明该方法在多种运行模式下能显著减少错误的级联扩散。

详情
英文摘要

Large Language Model-based Multi-Agent Systems (LLM-MAS) are increasingly applied to complex collaborative scenarios. However, their collaborative mechanisms may cause minor inaccuracies to gradually solidify into system-level false consensus through iteration. Such risks are difficult to trace since errors can propagate and amplify through message dependencies. Existing protections often rely on single-agent validation or require modifications to the collaboration architecture, which can weaken effective information flow and may not align with natural collaboration processes in real tasks. To address this, we propose a propagation dynamics model tailored for LLM-MAS that abstracts collaboration as a directed dependency graph and provides an early-stage risk criterion to characterize amplification risk. Through experiments on six mainstream frameworks, we identify three vulnerability classes: cascade amplification, topological sensitivity, and consensus inertia. We further instantiate an attack where injecting just a single atomic error seed leads to widespread failure. In response, we introduce a genealogy-graph-based governance layer, implemented as a message-layer plugin, that suppresses both endogenous and exogenous error amplification without altering the collaboration architecture. Experiments show that this approach prevents final infection in at least 89% of runs across operating modes and significantly mitigates the cascading spread of minor errors.

2602.10666 2026-05-12 eess.AS cs.LG cs.SD

From Diet to Free Lunch: Estimating Auxiliary Signal Properties using Dynamic Pruning Masks in Speech Enhancement Networks

Riccardo Miccini, Clément Laroche, Tobias Piechowiak, Xenofon Fafoutis, Luca Pezzarossa

AI总结 本文研究了如何在语音增强网络中利用动态通道剪枝(DynCP)生成的内部剪枝掩码来估计辅助信号属性,如语音活动检测(VAD)、噪声分类和基频(F0)估计,从而避免部署额外模型的需求。通过简单的可解释预测器,该方法在多个任务上取得了较高的准确率,且计算开销极小。研究不仅揭示了DynCP模型在下游任务中的学习行为,还提出了将其作为高效语音增强与信号属性联合估计的统一解决方案。

Comments Accepted for publication at the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

详情
英文摘要

Speech Enhancement (SE) in audio devices is often supported by auxiliary modules for Voice Activity Detection (VAD), SNR estimation, or Acoustic Scene Classification to ensure robust context-aware behavior and seamless user experience. Just like SE, these tasks often employ deep learning; however, deploying additional models on-device is computationally impractical, whereas cloud-based inference would introduce additional latency and compromise privacy. Prior work on SE employed Dynamic Channel Pruning (DynCP) to reduce computation by adaptively disabling specific channels based on the current input. In this work, we investigate whether useful signal properties can be estimated from these internal pruning masks, thus removing the need for separate models. We show that simple, interpretable predictors achieve up to 93% accuracy on VAD, 84% on noise classification, and an R2 of 0.86 on F0 estimation. With binary masks, predictions reduce to weighted sums, inducing negligible overhead. Our contribution is twofold: on one hand, we examine the emergent behavior of DynCP models through the lens of downstream prediction tasks, to reveal what they are learning; on the other, we repurpose and re-propose DynCP as a holistic solution for efficient SE and simultaneous estimation of signal properties.

2601.22143 2026-05-12 cs.GR cs.CV

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Anthony Chen, Naomi Ken Korem, Gal Zeevi, Tavi Halperin, Matan Ben Yosef, Urska Jelercic, Ofir Bibi, Or Patashnik, Daniel Cohen-Or

AI总结 本文提出了一种基于音频-视觉扩散模型的视频配音方法JUST-DUB-IT,通过轻量级的LoRA适配器实现从输入视频生成对应语言的配音和同步面部动作。该方法利用生成模型自身生成多语言配对视频作为训练数据,通过在单个视频片段中切换语言并进行面部和音频修复,实现了高质量的配音效果,保持了说话人身份和唇形同步,同时在复杂运动和真实场景中表现出更强的鲁棒性。

Comments Project webpage available at https://justdubit.github.io

详情
英文摘要

Audio-Visual Foundation Models, which are pretrained to jointly generate sound and visual content, have recently shown an unprecedented ability to model multi-modal generation and editing, opening new opportunities for downstream tasks. Among these tasks, video dubbing could greatly benefit from such priors, yet most existing solutions still rely on complex, task-specific pipelines that struggle in real-world settings. In this work, we introduce a single-model approach that adapts a foundational audio-video diffusion model for video-to-video dubbing via a lightweight LoRA. The LoRA enables the model to condition on an input audio-video while jointly generating translated audio and synchronized facial motion. To train this LoRA, we leverage the generative model itself to synthesize paired multilingual videos of the same speaker. Specifically, we generate multilingual videos with language switches within a single clip, and then inpaint the face and audio in each half to match the language of the other half. By leveraging the rich generative prior of the audio-visual model, our approach preserves speaker identity and lip synchronization while remaining robust to complex motion and real-world dynamics. We demonstrate that our approach produces high-quality dubbed videos with improved visual fidelity, lip synchronization, and robustness compared to existing dubbing pipelines.

2601.20898 2026-05-12 eess.AS cs.CL cs.LG

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection

Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioglu, Petr Motlicek, Andreas Stolcke

AI总结 本文研究了基于大语言模型(LLM)的语音识别系统中提示(prompt)设计对性能的影响,指出固定手动提示在不同场景下表现不稳定。为此,作者提出了一种可学习的提示投影模块,无需修改原有模型结构,即可将提示嵌入映射到更有效的LLM输入空间区域。实验表明,该方法在多个数据集上有效提升了语音识别性能并减少了结果的波动性。

Comments Paper accepted at ICASSP 2026

详情
英文摘要

LLM-based automatic speech recognition (ASR), a well-established approach, connects speech foundation models to large language models (LLMs) through a speech-to-LLM projector, yielding promising results. A common design choice in these architectures is the use of a fixed, manually defined prompt during both training and inference. This setup not only enables applicability across a range of practical scenarios, but also helps maximize model performance. However, the impact of prompt design remains underexplored. This paper presents a comprehensive analysis of commonly used prompts across diverse datasets, showing that prompt choice significantly affects ASR performance and introduces instability, with no single prompt performing best across all cases. Inspired by the speech-to-LLM projector, we propose a prompt projector module, a simple, model-agnostic extension that learns to project prompt embeddings to more effective regions of the LLM input space, without modifying the underlying LLM-based ASR model. Experiments on four datasets show that the addition of a prompt projector consistently improves performance, reduces variability, and outperforms the best manually selected prompts.

2512.16875 2026-05-12 cs.DS cs.LG math.ST stat.ML stat.TH

Learning Confidence Ellipsoids and Applications to Robust Subspace Recovery

Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

AI总结 本文研究了在高维空间中为任意分布寻找置信椭球的问题,目标是在给定置信参数α的情况下,找到包含至少1−α概率质量的最小体积椭球。为了解决高维下传统方法难以高效近似的问题,作者提出了一种多项式时间算法,能够在体积近似因子与椭球条件数β的多项式关系下,保证覆盖足够概率质量,并给出了相应的计算复杂性下界。该方法基于最小体积外接椭球的对偶结构和几何Brascamp-Lieb不等式,为鲁棒子空间恢复问题提供了首个具有最坏情况近似保证的多项式时间算法。

详情
英文摘要

We study the problem of finding confidence ellipsoids for an arbitrary distribution in high dimensions. Given samples from a distribution $D$ and a confidence parameter $α$, the goal is to find the smallest volume ellipsoid $E$ which has probability mass $\mathbb{P}_{D}[E] \ge 1-α$. Ellipsoids are a highly expressive class of confidence sets as they can capture correlations in the distribution, and can approximate any convex set. In statistics, this is the classic minimum volume estimator introduced by Rousseeuw as a robust non-parametric estimator of location and scatter. However in high dimensions, it becomes NP-hard to obtain any non-trivial approximation factor in volume when the condition number $β$ of the ellipsoid (ratio of the largest to the smallest axis length) goes to $\infty$. This motivates the focus of our paper: can we efficiently find confidence ellipsoids with volume approximation guarantees when compared to ellipsoids of bounded condition number $β$? Our main result is a polynomial time algorithm that finds an ellipsoid $E$ whose volume is within a $O(β)^{γd}$ multiplicative factor of the volume of best $β$-conditioned ellipsoid while covering at least $1-O(α/γ)$ probability mass for any $γ\in (0,1)$. In particular, setting $γ= o(1)$, this gives a $O(β)^{o(d)}$ volume approximation, with a multiplicative loss in miscoverage. We complement this with a computational hardness result that shows that such a dependence on $β$ seems necessary, even with some slack in coverage. The algorithm and analysis uses the rich primal-dual structure of the minimum volume enclosing ellipsoid and the geometric Brascamp-Lieb inequality. As a consequence, we obtain the first polynomial time algorithm with approximation guarantees on worst-case instances of the robust subspace recovery problem.

2511.05476 2026-05-12 cs.SE cs.LG

A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?

Md. Abdul Awal, Mrigank Rochan, Chanchal K. Roy

AI总结 该研究从元形态测试的角度探讨了代码语言模型的知识蒸馏问题,指出尽管学生模型在传统准确率指标上表现良好,但其在行为一致性方面可能与教师模型存在显著差异。为此,作者提出了MetaCompress框架,通过行为保持的元形态关系系统评估学生模型的行为保真度,实验表明该方法能有效揭示学生模型中高达62%的行为偏差,强调了在知识蒸馏过程中评估行为一致性的重要性。

Comments This paper has been accepted for publication in the Journal of Systems and Software (JSS)

详情
英文摘要

Transformer-based language models of code have achieved state-of-the-art performance across a wide range of software analytics tasks, but their practical deployment remains limited due to high computational costs, slow inference speeds, and significant environmental impact. To address these challenges, recent research has increasingly explored knowledge distillation as a method for compressing a large language model of code (the teacher) into a smaller model (the student) while maintaining performance. However, the degree to which a student model deeply mimics the predictive behavior and internal representations of its teacher remains largely unexplored, as current accuracy-based evaluation provides only a surface-level view of model quality and often fails to capture more profound discrepancies in behavioral fidelity between the teacher and student models. To address this gap, we empirically show that the student model often fails to deeply mimic the teacher model, resulting in up to 285% greater performance drop under adversarial attacks, which is not captured by traditional accuracy-based evaluation. Therefore, we propose MetaCompress, a metamorphic testing framework that systematically evaluates behavioral fidelity by comparing the outputs of teacher and student models under a set of behavior-preserving metamorphic relations. We evaluate MetaCompress on two widely studied tasks, using compressed versions of popular language models of code, obtained via three different knowledge distillation techniques: Compressor, AVATAR, and MORPH. The results show that MetaCompress identifies up to 62% behavioral discrepancies in student models, underscoring the need for behavioral fidelity evaluation within the knowledge distillation pipeline and establishing MetaCompress as a practical framework for testing compressed language models of code derived through knowledge distillation.

2511.01292 2026-05-12 stat.ML cs.LG

Optimal Attention Temperature Improves the Robustness of In-Context Learning under Distribution Shift in High Dimensions

Samet Demir, Zafer Dogan

AI总结 该研究探讨了如何通过调整注意力温度来提升预训练Transformer模型在分布偏移情况下的上下文学习(ICL)鲁棒性。在高维线性回归框架下,作者分析了一种具有近似softmax注意力机制的Transformer,并推导出分布偏移下ICL泛化误差的闭式表达式,发现存在一个最优注意力温度可最小化该误差。实验表明,调整注意力温度不仅能提升理论性能,还能在实际预训练大语言模型中有效增强对噪声上下文示例的鲁棒性。

Comments ICML 2026, 24 pages, 7 figures

详情
英文摘要

Pretrained Transformers can perform in-context learning (ICL) from a few demonstrations, but this ability can fail sharply when the test distribution differs from pretraining, a common deployment setting. We study attention temperature as a simple inference-time control for improving ICL robustness under such shifts. In a high-dimensional linear-regression framework, we analyze a Transformer with "approximate softmax" attention, which preserves softmax's normalization and temperature-dependent selectivity while remaining tractable. We derive a closed-form expression for the ICL generalization error under distribution shift, and show that it is minimized by an explicit optimal attention temperature. This characterization yields interpretable guidance by linking the best temperature to moments of the pre-softmax attention scores, and predicts when temperature adjustment can recover near Bayes-optimal performance. We validate the theory with extensive simulations, and further demonstrate gains on pretrained LLMs (GPT-2 and Llama2-7B) on question-answering benchmarks under distribution shift induced by noisy in-context demonstrations. Overall, attention temperature emerges as a principled, lightweight knob for improving the robustness of ICL in pretrained Transformers.

2510.15995 2026-05-12 q-fin.TR cs.GT cs.LG

The Invisible Handshake: Persistent Overpricing by Adaptive Market Agents

Luigi Foscari, Emanuele Guidotti, Nicolò Cesa-Bianchi, Tatjana Chavdarova, Alfio Ferrara

AI总结 本文研究了市场做市商与交易者之间的重复博弈中出现的持续高价现象。通过分析交易对价格的内生影响和外生冲击,作者定义了相对于无价格影响的反事实价格路径的高价,并刻画了能够产生持续高价的策略组合。研究发现,基于投影随机梯度上升等方法的去中心化学习机制可以在有限时间内达到高价区域,揭示了市场参与者自适应学习行为如何导致金融市场的持续高价问题。

详情
英文摘要

We study overpricing in a repeated game between two representative agents: a market maker, who controls market liquidity, and a market taker, who chooses trade quantities. Market prices evolve through the endogenous price impact of trades and exogenous shocks. We define overpricing relative to a counterfactual price path that holds fixed the same sequence of shocks while shutting down price impact, and characterize the set of feasible strategy profiles that generate persistent overpricing while respecting cash and inventory constraints. We provide a sufficient condition for decentralized learning to reach the overpricing region in finite time, and we show that this condition is satisfied, in particular, by projected stochastic gradient ascent. A key step in the analysis is a decomposition of the game into a competitive component, which favors zero price impact, and a collaborative component, which makes overpricing jointly profitable when aggregate inventory is positive. We further show that the same structural incentives govern both myopic and farsighted objectives. Together, these results show how decentralized learning by adaptive market agents can lead to persistent overpricing in financial markets.

2510.03761 2026-05-12 cs.CR cs.AI

You Have Been LaTeXpOsEd: A Systematic Analysis of Information Leakage in Preprint Archives Using Large Language Models

Richard A. Dubniczky, Bertalan Borsos, Tamas Bisztray, Norbert Tihanyi

AI总结 该研究系统分析了预印本平台(如arXiv)中可能存在的信息泄露问题,指出在缺乏清理的情况下,提交的原始LaTeX源文件、代码、图片和注释可能泄露敏感信息。研究提出了LaTeXpOsEd框架,结合模式匹配、逻辑过滤和大语言模型等技术,从超过1.2TB的10万份arXiv提交中发现了大量个人信息、云存储链接、会议提交凭证等敏感内容,揭示了预印本平台中存在的严重安全隐患,并呼吁学术界和平台运营方采取行动加以改进。

详情
英文摘要

The widespread use of preprint repositories such as arXiv has accelerated the communication of scientific results but also introduced overlooked security risks. Beyond PDFs, these platforms provide unrestricted access to original source materials, including LaTeX sources, auxiliary code, figures, and embedded comments. In the absence of sanitization, submissions may disclose sensitive information that adversaries can harvest using open-source intelligence. In this work, we present the first large-scale security audit of preprint archives, analyzing more than 1.2 TB of source data from 100,000 arXiv submissions. We introduce LaTeXpOsEd, a four-stage framework that integrates pattern matching, logical filtering, traditional harvesting techniques, and large language models (LLMs) to uncover hidden disclosures within non-referenced files and LaTeX comments. To evaluate LLMs' secret-detection capabilities, we introduce LLMSec-DB, a benchmark on which we tested 25 state-of-the-art models. Our analysis uncovered thousands of PII leaks, GPS-tagged EXIF files, publicly available Google Drive and Dropbox folders, editable private SharePoint links, exposed GitHub and Google credentials, and cloud API keys. We also uncovered confidential author communications, internal disagreements, and conference submission credentials, exposing information that poses serious reputational risks to both researchers and institutions. We urge the research community and repository operators to take immediate action to close these hidden security gaps. To support open science, we release all scripts and methods from this study but withhold sensitive findings that could be misused, in line with ethical principles. The source code and related material are available at the project website https://github.com/LaTeXpOsEd

2509.06172 2026-05-12 stat.AP cs.LG

Robust Analysis for Resilient AI System

Yu Wang, Ran Jin, Lulu Kang

AI总结 本文针对制造工业互联网(MII)系统中操作风险导致的数据异常问题,提出了一种新的鲁棒回归方法DPD-Lasso,结合密度幂散度与Lasso正则化,以处理AI韧性实验中的污染数据。该方法通过高效的迭代算法克服了计算瓶颈,并在气溶胶喷射打印的MII测试平台中验证了其在干净数据和含异常值数据下的可靠性和稳定性,为构建和验证韧性工业AI系统提供了重要工具。

Comments 10 pages, 3 figures

详情
Journal ref
2025 IEEE International Conference on Data Mining Workshops (ICDMW), Washington, DC, USA, 2025, pp. 1631-1641
英文摘要

Operational hazards in Manufacturing Industrial Internet (MII) systems generate severe data outliers that cripple traditional statistical analysis. This paper proposes a novel robust regression method, DPD-Lasso, which integrates Density Power Divergence with Lasso regularization to analyze contaminated data from AI resilience experiments. We develop an efficient iterative algorithm to overcome previous computational bottlenecks. Applied to an MII testbed for Aerosol Jet Printing, DPD-Lasso provides reliable, stable performance on both clean and outlier-contaminated data, accurately quantifying hazard impacts. This work establishes robust regression as an essential tool for developing and validating resilient industrial AI systems.

2507.23511 2026-05-12 eess.AS cs.AI cs.CL cs.SD

MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks

Yadong Niu, Tianzi Wang, Heinrich Dinkel, Xingwei Sun, Jiahao Zhou, Gang Li, Jizhong Liu, Xunying Liu, Junbo Zhang, Jian Luan

AI总结 本文提出MECAT,一个多专家构建的细粒度音频理解基准,旨在解决当前音频语言模型在细微理解层面的不足。该基准通过整合专业模型分析与链式推理大语言模型生成多视角、细粒度的描述和开放问答对,并引入新的评估指标DATE,以提升对模型输出细节程度的区分能力。实验表明,MECAT能够更准确地评估现有音频模型在细粒度理解任务中的表现与局限。

Comments Accepted to ICML 2026

详情
英文摘要

While large audio-language models have advanced open-ended audio understanding, they still fall short of nuanced human-level comprehension. This gap persists largely because current benchmarks, limited by data annotations and evaluation metrics, fail to reliably distinguish between generic and highly detailed model outputs. To this end, this work introduces MECAT, a Multi-Expert Constructed Benchmark for Fine-Grained Audio Understanding Tasks. Generated via a pipeline that integrates analysis from specialized expert models with Chain-of-Thought large language model reasoning, MECAT provides multi-perspective, fine-grained captions and open-set question-answering pairs. The benchmark is complemented by a novel metric: DATE (Discriminative-Enhanced Audio Text Evaluation). This metric penalizes generic terms and rewards detailed descriptions by combining single-sample semantic similarity with cross-sample discriminability. A comprehensive evaluation of state-of-the-art audio models is also presented, providing new insights into their current capabilities and limitations. The data and code are available at https://github.com/xiaomi-research/mecat

2507.07871 2026-05-12 cs.CR cs.AI cs.LG

Mitigating Watermark Forgery in Generative Models via Randomized Key Selection

Toluwani Aremu, Noor Hussein, Munachiso Nwadike, Samuele Poppi, Jie Zhang, Karthik Nandakumar, Neil Gong, Nils Lukas

AI总结 该论文研究了如何通过随机密钥选择来防止生成模型中的水印伪造攻击。现有方法通过在内容中嵌入多个水印密钥来抵御伪造,但可能影响模型性能,且在攻击者收集足够多水印样本时仍存在风险。本文提出了一种新的防御机制,通过为每次查询随机选择水印密钥,并仅在恰好一个密钥检测到水印时才接受内容为真实,从而在不降低模型性能的前提下,有效限制了攻击者的成功概率,实验表明攻击成功率可从接近100%降至2%。

详情
英文摘要

Watermarking enables GenAI providers to verify whether content was generated by their models. A watermark is a hidden signal in the content, whose presence can be detected using a secret watermark key. A core security threat are forgery attacks, where adversaries insert the provider's watermark into content \emph{not} produced by the provider, potentially damaging their reputation and undermining trust. Existing defenses resist forgery by embedding many watermarks with multiple keys into the same content, which can degrade model utility. However, forgery remains a threat when attackers can collect sufficiently many watermarked samples. We propose a defense that is provably forgery-resistant \emph{independent} of the number of watermarked content collected by the attacker, provided they cannot easily distinguish watermarks from different keys. Our scheme does not further degrade model utility. We randomize the watermark key selection for each query and accept content as genuine only if a watermark is detected by \emph{exactly} one key. We focus on the image and text modalities, but our defense is modality-agnostic, since it treats the underlying watermarking method as a black-box. Our method provably bounds the attacker's success rate and we empirically observe a reduction from near-perfect success rates to only $2\%$ at negligible computational overhead.

2506.20928 2026-05-12 stat.ML cs.LG

Active Learning for Manifold Gaussian Process Regression

Yuanxing Cheng, Lulu Kang, Yiwei Wang, Chun Liu

AI总结 本文提出了一种用于流形高斯过程回归的主动学习框架,将流形学习与策略性数据选择相结合,以提升高维空间中的预测精度。该方法联合优化一个用于降维的神经网络和潜空间中的高斯过程回归器,并通过主动学习准则最小化全局预测误差。实验表明,该框架在合成数据上的表现优于随机顺序学习,能够高效处理复杂且不连续的函数,同时保持计算可行性,具有重要的科学与工程应用价值。

Comments 13 pages, 6 figures

详情
Journal ref
2025 Winter Simulation Conference (WSC), Seattle, WA, USA, 2025, pp. 1-12
英文摘要

This paper introduces an active learning framework for manifold Gaussian Process (GP) regression, combining manifold learning with strategic data selection to improve accuracy in high-dimensional spaces. Our method jointly optimizes a neural network for dimensionality reduction and a Gaussian process regressor in the latent space, supervised by an active learning criterion that minimizes global prediction error. Experiments on synthetic data demonstrate superior performance over randomly sequential learning. The framework efficiently handles complex, discontinuous functions while preserving computational tractability, offering practical value for scientific and engineering applications. Future work will focus on scalability and uncertainty-aware manifold learning.

2504.02373 2026-05-12 eess.IV cs.CV

HPGN: Hybrid Priors-Guided Network for Compressed Low-Light Image Enhancement

Hantang Li, Qiang Zhu, Xiandong Meng, Lei Xiong, Shuyuan Zhu, Xiaopeng Fan

AI总结 在实际应用中,低光照图像通常为了高效存储和传输而被压缩,但现有方法大多忽视了压缩伪影的去除或难以建立统一的增强框架。为此,本文提出了一种结合压缩先验和光照先验的混合引导网络(HPGN),通过引入JPEG质量因子和DCT量化矩阵指导模块设计,实现了对不同压缩质量低光照图像的联合增强。实验结果表明,该方法在提升图像质量方面具有显著优势。

Comments 5 pages, 3 figures

详情
英文摘要

In practical applications, low-light images are often compressed for efficient storage and transmission. Most existing methods disregard compression artifacts removal or hardly establish a unified framework for joint task enhancement of low-light images with varying compression qualities. To address this problem, we propose an efficient hybrid priors-guided network (HPGN) that enhances compressed low-light images by integrating both compression and illumination priors. Our approach fully utilizes the JPEG quality factor (QF) and DCT quantization matrix (QM) to guide the design of efficient plug-and-play modules for joint tasks. Additionally, we employ a random QF generation strategy to guide model training, enabling a single model to enhance low-light images with different compression levels. Experimental results demonstrate the superiority of our proposed method.

2410.14927 2026-05-12 q-fin.TR cs.CE cs.LG

Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution

Zijie Zhao, Roy E. Welsch

AI总结 本文提出了一种基于双层强化学习框架的自动化股票交易系统——分层强化交易者(HRT),用于在多资产股票市场中进行文本感知的组合管理。HRT 将交易决策分为两个层级:高层控制器从市场和文本信号中提取稀疏的方向信号(买入、卖出或持有),而底层控制器则在考虑交易成本、回撤和文本风险等因素下,将这些方向转化为可行的组合权重调整。实验表明,HRT 在多个基准对比中表现出最优的风险收益比,提升了夏普比率并降低了交易周转率,验证了其在结合市场预测与文本风险信号方面的有效性。

详情
英文摘要

Automated equity trading requires converting noisy market and news signals into executable portfolio decisions under risk, turnover, and transaction costs. We propose Hierarchical Reinforced Trader (HRT), a bi-level reinforcement learning framework for text-aware portfolio management in multi-asset equity markets. HRT separates trading into two coordinated decisions: a factorized sparse High-Level Controller (HLC) selects asset-level increase, reduce, or hold directions from compact market and text-derived signals, while a risk-aware Low-Level Controller (LLC) converts these directions into feasible portfolio weight adjustments under turnover, drawdown, and text-risk penalties. This decomposition avoids enumerating the full joint action space and makes selection and execution easier to inspect. We evaluate HRT on an open stock-news benchmark with a fixed 89-stock Nasdaq universe, using 2013--2018 for training, 2019 for validation, and 2020--2023 for final out-of-sample testing; the test horizon is restricted to 2020--2023 due to public benchmark data availability under the same timestamp-clean text-aware protocol. Across market-proxy, same-universe portfolio, alpha-only, flat-RL, and hierarchical ablation baselines, HRT delivers the strongest learning-based return--risk--cost trade-off. The full model improves Sharpe from 1.06 for HRT-Base to 1.24, reduces daily turnover from 0.112 to 0.090, and remains robust under transaction-cost stress. These results suggest that separating sparse directional selection from risk-aware execution is an effective way to incorporate market forecasts and text-derived risk signals into portfolio management.

2409.19379 2026-05-12 math.CO cs.AI

Automated conjecturing with \emph{TxGraffiti}

Randy Davila

AI总结 本文介绍了名为 *TxGraffiti* 的自动化数学猜想生成程序,该程序基于数据驱动和启发式方法,旨在跨数学领域自动生成猜想。*TxGraffiti* 源自早期的 *Graffiti* 系统,已在图论等领域产生多项研究成果,并通过新开发的网络界面提升了用户交互体验。文章详细阐述了其数据收集、猜想生成与过滤机制,展示了其在数学研究中的实际贡献与应用潜力。

Comments Annals of Mathematics and Artificial Intelligence (2026)

详情
英文摘要

\emph{TxGraffiti} is a data-driven, heuristic-based computer program developed to automate the process of generating conjectures across various mathematical domains. Since its creation in 2017, \emph{TxGraffiti} has contributed to numerous mathematical publications, particularly in graph theory. In this paper, we present the design and core principles of \emph{TxGraffiti}, including its roots in the original \emph{Graffiti} program, which pioneered the automation of mathematical conjecturing. We describe the data collection process, the generation of plausible conjectures, and methods such as the \emph{Dalmatian} heuristic for filtering out redundant or transitive conjectures. Additionally, we highlight its contributions to the mathematical literature and introduce a new web-based interface that allows users to explore conjectures interactively. While we focus on graph theory, the techniques demonstrated extend to other areas of mathematics.

2006.02666 2026-05-12 eess.IV cs.CV

Deep Sequential Feature Learning in Clinical Image Classification of Infectious Keratitis

Yesheng Xu, Ming Kong, Wenjia Xie, Runping Duan, Zhengqing Fang, Yuxiao Lin, Qiang Zhu, Siliang Tang, Fei Wu, Yu-Feng Yao

AI总结 本文针对感染性角膜炎的临床图像分类问题,提出了一种基于序列级深度学习的模型,旨在准确区分感染性角膜病变的细微差异。该方法通过设计有效的机制保留临床图像的空间结构并提取关键特征,显著提升了分类性能。实验表明,该模型在120张测试图像上的诊断准确率达到80.00%,远超421位眼科医生49.27%的平均水平,展示了其在辅助诊断中的巨大潜力。

Comments Accepted by Engineering

详情
英文摘要

Infectious keratitis is the most common entities of corneal diseases, in which pathogen grows in the cornea leading to inflammation and destruction of the corneal tissues. Infectious keratitis is a medical emergency, for which a rapid and accurate diagnosis is needed for speedy initiation of prompt and precise treatment to halt the disease progress and to limit the extent of corneal damage; otherwise it may develop sight-threatening and even eye-globe-threatening condition. In this paper, we propose a sequential-level deep learning model to effectively discriminate the distinction and subtlety of infectious corneal disease via the classification of clinical images. In this approach, we devise an appropriate mechanism to preserve the spatial structures of clinical images and disentangle the informative features for clinical image classification of infectious keratitis. In competition with 421 ophthalmologists, the performance of the proposed sequential-level deep model achieved 80.00% diagnostic accuracy, far better than the 49.27% diagnostic accuracy achieved by ophthalmologists over 120 test images.

2004.06443 2026-05-12 stat.ML cs.LG

Particle-based Energetic Variational Inference

Yiwei Wang, Jiuhai Chen, Chun Liu, Lulu Kang

AI总结 本文提出了一种基于能量耗散律的变分推断新框架——能量变分推断(EVI),能够统一并推导出多种现有的粒子型变分推断方法,如Stein变分梯度下降(SVGD)。在此框架下,作者还提出了一种新的粒子型EVI方法,采用“先近似后变分”的策略,在每一步迭代中显著降低KL散度,数值实验表明该方法在保持目标分布忠实度方面优于现有方法。

Comments 17 pages, 7 figures

详情
Journal ref
2021, Statistics and Computing, Vol 31, 34
英文摘要

We introduce a new variational inference (VI) framework, called energetic variational inference (EVI). It minimizes the VI objective function based on a prescribed energy-dissipation law. Using the EVI framework, we can derive many existing Particle-based Variational Inference (ParVI) methods, including the popular Stein Variational Gradient Descent (SVGD) approach. More importantly, many new ParVI schemes can be created under this framework. For illustration, we propose a new particle-based EVI scheme, which performs the particle-based approximation of the density first and then uses the approximated density in the variational procedure, or "Approximation-then-Variation" for short. Thanks to this order of approximation and variation, the new scheme can maintain the variational structure at the particle level, and can significantly decrease the KL-divergence in each iteration. Numerical experiments show the proposed method outperforms some existing ParVI methods in terms of fidelity to the target distribution.

2605.10084 2026-05-12 eess.AS cs.AI cs.LG cs.SD

PoDAR: Power-Disentangled Audio Representation for Generative Modeling

Alejandro Luebs, Mithilesh Vaidya, Ishaan Kumar, Sumukh Badam, Stephen W. Bailey, Matthew Bendel, Jose Sotelo, Xingzhe He

AI总结 本文提出了一种名为PoDAR的音频表示方法,通过显式地将信号功率与语义内容解耦,显著提升了音频潜在空间的可建模性。该方法利用随机功率增强和潜在一致性目标,使生成模型的收敛速度加快并提升生成质量。实验表明,PoDAR在多个指标上优于基线方法,同时扩展了条件生成的适用范围。

Comments 9 pages, 3 figures

详情
英文摘要

The performance of audio latent diffusion models is primarily governed by generator expressivity and the modelability of the underlying latent space. While recent research has focused primarily on the former, as well as improving the reconstruction fidelity of audio codecs, we demonstrate that latent modelability can be significantly improved through explicit factor disentanglement. We present PoDAR (Power-Disentangled Audio Representation), a framework that utilizes a randomized power augmentation and latent consistency objective to decouple signal power from invariant semantic content. This factorization makes the latent space easier to model, which both accelerates the convergence of downstream generative models and improves final overall performance. When applied to a Stable Audio 1.0 VAE with an F5-TTS generator, PoDAR achieves about a $2\times$ acceleration in convergence to match baseline performance, while increasing final speaker similarity by 0.055 and UTMOS by 0.22 on the LibriSpeech-PC dataset. Furthermore, isolating power into dedicated channels enables the application of CFG exclusively to power-invariant content, effectively extending the stable guidance regime to higher scales.

2605.10076 2026-05-12 eess.IV cs.LG

A Stability Benchmark of Generative Regularizers for Inverse Problems

Alexander Denker, Johannes Hertrich, Sebastian Neumayer

AI总结 该论文研究了生成式正则化方法在逆问题中的稳定性表现,重点评估了其在不完美条件下的收敛性、对分布外数据的鲁棒性以及对前向算子和噪声模型误差的敏感性。作者通过数值实验对比了生成模型与基于变分优化的现代方法,揭示了生成先验在不同应用场景下的优势与局限,为选择合适的重建方法提供了参考依据。

详情
英文摘要

Generative (diffusion) priors demonstrate remarkable performance in addressing inverse problems in imaging. Yet, for scientific and medical imaging, it is crucial that reconstruction techniques remain stable and reliable under imperfect settings. Typical definitions of stability encompass the notion of ''convergent regularization'', robustness to out-of-distribution data, and to inaccuracies in the forward operator or noise model. We evaluate these properties numerically. Furthermore, we benchmark generative approaches against modern optimization-based methods inspired by the widely used variational techniques. Our results give insights for which settings and applications generative priors can deliver state-of-the-art reconstructions, and on those in which they fall short or may even be problematic.

2605.10036 2026-05-12 cs.NI cs.AI

Bridging the Cognitive Gap: A Unified Memory Paradigm for 6G Agentic AI-RAN

Xijun Wang, Zhaoyang Liu, Chenyuan Feng, Xiang Chen, Howard H. Yang, Tony Q. S. Quek

AI总结 随着6G的发展,无线接入网络需要超越传统自动化,引入具备感知、推理和演进能力的智能体AI。当前解耦架构中存在认知鸿沟,物理层被迫将高维状态压缩为低维指标,限制了智能体的语义理解能力。本文提出一种统一的内存范式,通过映射生物记忆层次到异构计算架构,打破感知与推理的界限,利用新型相干互连技术实现跨时间尺度的状态共享,从而在实时响应与长期上下文之间建立真正的自主6G网络。

Comments This work has been submitted to the IEEE for possible publication

详情
英文摘要

As 6G evolves, the radio access network must transcend traditional automation to embrace agentic AI capable of perception, reasoning, and evolution. A fundamental cognitive gap persists in current disaggregated architectures, where interfaces force the physical layer to compress high-dimensional states into low-dimensional metrics, trapping reasoning agents behind a semantic bottleneck. This article envisions a shift from interface-bound to memory-centric architectures. We propose a unified memory paradigm that dissolves the boundaries between sensing and reasoning by mapping biological memory hierarchies onto heterogeneous computing fabrics. Enabled by emerging coherent interconnects, this approach creates a cognitive continuum where microsecond-level reflexes, millisecond-level reasoning, and long-term evolution share state across time scales. By replacing message passing with zero-copy observability, we empower AI agents to bridge the gap between real-time responsiveness and long-horizon context for truly autonomous 6G networks.

2605.10015 2026-05-12 stat.ML cs.CR cs.LG

Differentially Private Sampling from Distributions via Wasserstein Projection

Shokichi Takakura, Seng Pei Liew, Satoshi Hasegawa

AI总结 本文研究了在差分隐私约束下从分布中采样的问题。与以往基于密度比的效用度量方法不同,本文提出以Wasserstein距离作为效用指标,克服了传统方法在捕捉分布支持几何结构和处理不同支持分布方面的不足。作者提出了基于Wasserstein投影的最小最大最优机制(WPM),并设计了相应的高效近似算法,提供了收敛性保证,为差分隐私采样提供了新的理论框架和实用方法。

详情
英文摘要

In this paper, we study the problem of sampling from a distribution under the constraint of differential privacy (DP). Prior works measure the utility of DP sampling with density ratio-based measures such as KL divergence. However, such formulations suffer from two key limitations: 1) they fail to capture the geometric structure of the support, and 2) they are not applicable when the supports of the distributions differ. To deal with these issues, we develop a novel framework for DP sampling with Wasserstein distance as the utility measure. In this formulation, we propose Wasserstein Projection Mechanism (WPM), a minimax optimal mechanism based on Wasserstein projection. Furthermore, we develop efficient algorithms for computing the proposed mechanisms approximately and provide convergence guarantees.

2605.10008 2026-05-12 physics.optics cs.CV cs.ET

Measurement-Adapted Eigentask Representations for Photon-Limited Optical Readout

Tianyang Chen, Mandar M. Sohoni, Saeed A. Khan, Jérémie Laydevant, Shi-Yuan Ma, Tianyu Wang, Peter L. McMahon, Hakan E. Türeci

AI总结 在低光条件下,光学读取面临光子噪声、探测器噪声和量化误差等限制,影响后续分类与决策的准确性。本文提出一种基于特征可分辨性的本征任务(eigentask)表示方法,用于对光学传感器输出进行噪声自适应的特征表示。实验表明,该方法在光子预算有限、样本稀缺和任务复杂度高的场景下显著优于主成分分析等传统方法,有效提升了分类性能与学习效率。

Comments 15+14 pages, 4+9 figures, 55 references

详情
英文摘要

Optical readout in low-light imaging is fundamentally limited by measurement noise, including photon shot noise, detector noise, and quantization error. In this regime, downstream inference depends not only on the optical front end, but also on how noisy high-dimensional sensor measurements are represented before classification or decision-making. Here we show that eigentasks provide a measurement-adapted representation for optical sensor outputs by ordering readout features according to their resolvability under noise. Using experimental data from a lens-based optical imaging system and a reanalysis of published data from a single-photon-detection neural network, we find that eigentask representations frequently outperform standard baselines including principal component analysis and filtering-based compression. The advantage is most pronounced in photon-limited, few-shot, and higher-difficulty classification regimes. In few-shot MPEG-7 classification, for example, the advantage over other methods reaches about 10 percentage points as the number of classes increases. In these settings, eigentasks yield more informative low-dimensional features and improve sample-efficient downstream learning. These results identify measurement-adapted representation as a promising strategy for optical inference when photon budget, acquisition time, and task complexity are constrained.

2605.09981 2026-05-12 q-bio.BM cs.AI

Yeti: A compact protein structure tokenizer for reconstruction and multi-modal generation

Nabin Giri, Steven Farrell, Kristofer E. Bouchard

AI总结 该研究提出了一种名为Yeti的紧凑型蛋白质结构分词器,旨在解决多模态模型中蛋白质结构、序列和功能注释联合建模的问题。Yeti基于无查找量化方法,通过端到端的流匹配目标进行训练,能够在保持高重建精度的同时实现优异的生成能力。实验表明,Yeti在参数数量大幅减少的情况下,仍能实现与现有模型相当甚至更优的结构重建和多模态生成性能,为高效训练多模态蛋白质生成模型提供了有力工具。

详情
英文摘要

Multimodal models that jointly reason over protein sequences, structures, and function annotations within a unified representation hold immense potential for integrating multimodal data and generating new proteins with designed functional properties. To utilize transformer architectures, such models require a tokenizer that converts protein structure from continuous atomic coordinates into discrete representations suitable for scalable multimodal training. The quality of such models are fundamentally upper bounded by the fidelity and expressiveness of the underlying tokenized structure. However, existing tokenizers prioritize reconstruction over generative abilities. To address these gaps, we introduce Yeti, a simple and compact protein structure tokenizer based on lookup free quantization and trained end to end with a flow matching objective for multimodal learning. Compared to existing models, Yeti generally achieves the best codebook utilization and token diversity, and second best reconstruction accuracy (with 10x fewer parameters than ESM3) on diverse datasets. To validate Yeti's generative capability, we trained a compact multimodal model jointly over its structure tokens and amino acid sequence entirely from scratch, with no pretrained initialization. The resulting multimodal model generates plausible structures under unconditional cogeneration of protein sequence and structures, achieving comparable results to 10x larger models. Together, these results demonstrate that Yeti is a compact and expressive protein structure tokenizer suitable for training multimodal models that cogenerates highly plausible sequences and structures.

2605.09971 2026-05-12 cs.HC cs.AI

HapticLDM: A Diffusion Model for Text-to-Vibrotactile Generation

Jiahao Xiong, Fei Wang, Anran Xu, Pinzhi Huang, Tao Wen, Lijia Pan, Cai Chen

AI总结 本文提出HapticLDM,一种基于潜在扩散模型的文本到触觉振动生成方法,旨在解决从自然语言生成准确、一致且完整的振动信号这一核心挑战。该方法通过引入强调动态特性的文本处理策略和全局去噪机制,提升了振动信号在时间包络上的连贯性与稳定性。实验结果表明,HapticLDM在真实感和语义对齐方面优于现有方法,并能生成多样化、细腻且物理精确的触觉反馈,有效简化了触觉设计流程。

详情
英文摘要

Text-to-vibration generation converts natural language into haptic feedback, enabling vibration-effect designers to get scenarios-fitted vibrations more efficiently, which shows great potentials in application fields such as metaverse, games, and film to enrich the user experience in interactive scenarios. The core challenge in this field is how to generate accurate, consistent, and complete vibrations according to textual semantics. Very recent autoregressive (AR) approaches (e.g., HapticGen) exhibit limited capacity in fully capturing global dependencies, owing to the inherent sequential nature of their modeling and prevailing data constraints. In this paper, we proposed HapticLDM, the first text-to-vibration generative model built upon Latent Diffusion Models (LDMs). Firstly, with respect to the data, we introduced a text-processing strategy that emphasizes dynamic characteristics to curate high-quality data pairs for fine-grained dynamic modeling. Secondly, HapticLDM incorporates a global denoising mechanism that regulates coherent and stable variations in the temporal envelope. Furthermore, we conduct extensive evaluations, including A/B testing against the state-of-the-art baseline and a user study involving 30 participants. The results demonstrate that our model enhances realism and semantic alignment. Qualitative feedback further indicates that HapticLDM simplifies the haptic design workflow while generating diverse, subtle, and physically precise vibrations.

2605.09960 2026-05-12 physics.geo-ph cs.LG cs.NA math.NA

Total Generalized Variation regularization closes the gap between neural-eld and classical methods in seismic travel-time tomography

Isao Kurosawa

AI总结 本文提出了一种基于全广义变分(TGV²)正则化的可微框架MIMIR,用于地震走时层析成像,通过傅里叶特征神经网络表示二维速度场,替代传统网格化的慢度向量,从而实现连续且无限可微的速度场建模。该方法消除了传统TGV计算中的内层Chambolle-Pock迭代,提升了计算效率,并在多个合成数据集上显著优于经典方法,验证了TGV²在恢复分段仿射结构上的优越性。研究指出,在物理信息神经网络反演中,正则化选择比网络结构更为关键。

Comments 15 pages, 6 figures. Manuscript submitted to Geophysical Journal International

详情
英文摘要

Travel-time tomography forces a trade-off between mesh resolution and stability in which the regularizer choice dominates what can be recovered. We introduce MIMIR, a differentiable framework that represents the 2D velocity field as a Fourier-feature neural network, replacing the grid-based slowness vector with a continuous, infinitely differentiable function. Prior neural-field tomography has staircased smooth fields under total-variation (TV) priors or oscillated near interfaces under $L^2$ Laplacian smoothing. We adopt second-order total generalized variation (TGV$^2$) and parametrize its auxiliary vector field as a second neural network jointly optimized with the velocity field, eliminating the inner Chambolle-Pock primal-dual loop that classically dominates TGV computation. On three synthetic benchmarks (Gaussian, horizontally layered, curved-fault inspired by OpenFWI) using cross-well acquisition, 5% travel-time noise, and five seeds, MIMIR-TGV$^2$ ties a classical FMM-LSMR baseline with auto-tuned hyperparameters on the Gaussian ($p=0.134$, paired $t$-test) and significantly outperforms it on layered ($p<0.0001$, 44% RMSE reduction) and curved-fault ($p=0.0002$, 33% reduction). Replacing TGV$^2$ with TV degrades performance on Gaussian ($p=0.004$) and layered ($p=0.003$); curriculum-annealed TV improves Gaussian RMSE by only 5.4%, confirming that TV's staircase bias is intrinsic to the regularizer rather than a scheduling artifact. The results empirically validate the Bredies-Kunisch-Pock prediction that piecewise-affine priors are better suited to subsurface velocity recovery than piecewise-constant TV priors. We argue that the central design choice in physics-informed neural-field inversion is not the network architecture but the regularizer. The full pipeline reproduces in under one hour on consumer hardware.

2605.09916 2026-05-12 math.MG cs.LG

The Observable Wasserstein Distance

Edivaldo Lopes dos Santos, Leandro Vicente Mauri, Washington Mio, Tom Needham

AI总结 本文提出了一种可观测的Wasserstein距离,用于在波兰度量空间上推导概率测度之间的Wasserstein距离下界,以克服大规模非欧几里得数据集中精确最优传输计算的困难。该方法通过1-利普希茨可观测函数将测度投影到实数轴上,并计算投影后分布之间的Wasserstein距离,定义了一个由子空间限制构成的伪度量层次结构。理论上的核心贡献是建立了一个与测度支撑集的度量覆盖维数相关的唯一性恢复结果,为欧几里得分布中的Cramér-Wold定理提供了度量空间的类比。实验表明,该层次结构在Wasserstein距离下界精度与计算效率之间提供了可调节的权衡。

详情
英文摘要

We introduce the observable Wasserstein distance, a framework for deriving lower bounds on the Wasserstein distance between probability measures on Polish metric spaces, designed to bypass the computational intractability of exact optimal transport in large-scale, non-Euclidean datasets. Analogous to the sliced Wasserstein distance in $\mathbb{R}^d$, our approach projects measures onto the real line via 1-Lipschitz observables and computes the Wasserstein distances between the resulting pushforward distributions. We define a hierarchy of pseudo-metrics by restricting observables to a nested chain of subspaces. A central theoretical contribution is an injectivity result linking the metric covering dimension of the support of a measure to the specific order in the hierarchy that guarantees unique recovery. This serves as a metric-space analogue to the Cramér-Wold Device for Euclidean distributions. We demonstrate that this hierarchy offers a tunable trade-off between sharpness as a lower bound on the Wasserstein distance and computational efficiency. We also present a discrete computational model for finite grids and numerical experiments validating the efficacy and utility of these approximations.

2605.09890 2026-05-12 cs.CR cs.LG

Deep Learning under Fractional-Order Differential Privacy

Mohammad Partohaghighi, Roummel Marcia

AI总结 本文提出了一种基于分数阶微分隐私的随机梯度下降方法(FO-DP-SGD),通过引入有限窗口的幂律加权历史输出,增强了隐私保护机制的记忆能力,同时保持了标准的“先求和后加噪”的结构。该方法在保证隐私的前提下,有效降低了每步的敏感度,从而提升了模型的精度与隐私-效用平衡。实验表明,FO-DP-SGD在多个数据集上优于现有的隐私保护优化方法。

详情
英文摘要

Differentially private stochastic gradient descent (DP-SGD) is a standard approach to privacy-preserving learning based on per-example clipping, subsampling, Gaussian perturbation, and privacy accounting. Classical DP-SGD releases a noisy version of the current clipped subsampled gradient sum. We propose Fractional-Order Differentially Private Stochastic Gradient Descent (\textbf{FO-DP-SGD}), a mechanism-level extension that replaces this current-only query, before Gaussian noise is added, with a fractional recursive query combining the current clipped sum with a finite-window, power-law-weighted aggregation of previously released private sum-level outputs. This injects fractional memory into the release mechanism while preserving the standard \emph{sum-then-noise-then-divide} structure. Under add/remove adjacency with Poisson subsampling, the current-step sensitivity analysis shows that the only newly data-dependent term is the scaled current clipped sum. Hence, conditioned on the private history, the effective \(\ell_2\)-sensitivity is at most \(βC\), where \(C\) is the clipping threshold and \(β\in(0,1]\) controls the current-step contribution. Thus, FO-DP-SGD admits standard per-step Rényi differential privacy accounting via a Poisson-subsampled Gaussian mechanism with effective noise-to-sensitivity ratio \(σ/β\), and composes to yield overall \((\varepsilon,δ)\)-differential privacy guarantees. FO-DP-SGD provides a framework for studying long-memory effects in private optimization. The fractional order, memory window, and mixing coefficient govern the trade-off among current-step sensitivity, signal retention, and private-history influence. Experiments on SVHN, CIFAR-10, and CIFAR-100 show improved test accuracy and privacy--utility performance over DP-SGD and private baselines including DP-Adam, DP-IS, SA-DP-SGD, ADP-AdamW, DP-SAT, and DP-Adam-AC.

2605.09881 2026-05-12 hep-ph cs.LG hep-ex

Dissecting Jet-Tagger Through Mechanistic Interpretability

Saurabh Rai, Sanmay Ganguly

AI总结 本文通过机制可解释性方法分析了用于顶夸克标签任务的粒子变换器模型,旨在揭示其内部用于喷注分类的计算回路及其物理表征内容。研究发现,一个由六个注意力头组成的稀疏回路能够恢复模型大部分性能,并具有清晰的源-中继-读出结构,其中早期层头作为因果源,中间层头选择性关注喷注子结构,晚期层头负责信号读出。结果表明,自然语言模型的可解释性方法可应用于喷注分类任务,且梯度下降过程可能在无监督条件下重现物理上有意义的喷注标签特征。

Comments 40 pages, 14 figures, 12 tables. Comments are welcome

详情
英文摘要

Mechanistic interpretability seeks to reverse engineer a trained neural network by identifying the minimal subset of internal components. We perform a mechanistic interpretability analysis of the Particle Transformer architecture, trained on the Top Quark Tagging reference dataset, with the goal of identifying the computational circuit responsible for jet classification and characterizing the physical content of its internal representations. Combining zero ablation, path patching with two complementary on-manifold corruption strategies and linear probing of the residual stream, we identify a sparse six-head circuit that recovers the great majority of the full model performance while admitting a clean source-relay-readout interpretation. In this circuit, a single early layer head serves as the primary causal source, a cluster of middle-layer heads acts as relays selectively attending to hard pairwise substructure and a single late-layer head reads out the aggregated signal. Linear probes show that the residual stream is preferentially aligned with the energy correlator basis over the $N$-subjettiness basis. Within the energy correlator basis, the model preferentially encodes 2-prong substructure observables over the 3-prong observables. A per-layer trained probe further reveals that the apparent single step commitment of the model to a classification decision in the first class attention block is in fact a basis rotation, with the discriminating signal already saturating in the particle attention stack. These results demonstrate that mechanistic interpretability methods developed for natural language models can be used for jet physics classifiers and indicate that gradient descent may rediscover physically meaningful aspects of jet tagging without supervision.

2605.09863 2026-05-12 cs.CR cs.AI cs.CL cs.IR cs.LG

Nautilus Compass: Black-box Persona Drift Detection for Production LLM Agents

Chunxiao Wang

AI总结 本文提出了一种名为 Nautilus Compass 的黑盒方法,用于检测生产环境中大型语言模型代理的“人格漂移”问题,即代理在长时间交互中偏离用户设定的约束和先前约定。该方法无需访问模型权重,仅通过用户提示与行为锚文本之间的余弦相似度进行检测,使用 BGE-m3 嵌入进行聚合计算,适用于如 Claude 和 GPT-4 等封闭 API 接口。实验表明,该方法在真实会话数据集上实现了较高的漂移检测性能,并且系统部署成本较低,具有实际应用价值。

Comments 19 pages, 6 figures. MIT-licensed code + reproduction scripts at github.com/chunxiaoxx/nautilus-compass

详情
英文摘要

Production LLM coding agents drift over long sessions: they forget user-specified constraints, slip into mistakes the user already flagged, and confabulate prior agreements. White-box approaches such as persona vectors require model weights and so cannot be applied to closed APIs (Claude, GPT-4) that most users actually interact with. We present Nautilus Compass, a black-box persona drift detector and agent memory layer for production coding agents. The method operates entirely at the prompt-text layer: cosine similarity between user prompts and behavioral anchor texts, aggregated by a weighted top-k mean using BGE-m3 embeddings. Compass is, to our knowledge, the only public agent memory layer (among Mem0, Letta, Cognee, Zep, MemOS, smrti verified May 2026) that does not call an LLM at index time to extract facts or build a graph; raw conversation text is embedded directly. The system ships as a Claude Code plugin, an MCP 2024-11-05 A2A server (Cursor, Cline, Hermes), a CLI, and a REST API on one daemon, with a Merkle-chained audit log for tamper-evident anchor updates. On a held-out test set built from real Claude Code session traces and labeled by an independent LLM judge, Compass reaches ROC AUC 0.83 for drift detection. The embedded retrieval pipeline scores 56.6% on LongMemEval-S v0.8 and 44.4% on EverMemBench-Dynamic (n=500), topping the four published EverMemBench Table 4 baselines. LongMemEval-S 56.6% is ~30 points below recent white-box leaders (90+%); we treat that as the architectural ceiling of the no-extraction design. End-to-end reproduction cost is $3.50 (~14x cheaper than GPT-4o-judged stacks). A paired cross-vendor behavior A/B accompanies these numbers as preliminary system-level evidence. Code, anchors, frozen test data, and audit-log tooling are MIT-licensed at github.com/chunxiaoxx/nautilus-compass.