arXivDaily arXiv每日学术速递 周一至周五更新
重置
CS计算机1059
2606.12287 2026-06-11 cs.NE cs.AI 新提交

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

SpikeDecoder: 用脉冲神经网络实现GPT架构

Claas Beger, Florian Walter, Alois Knoll

AI总结 提出SpikeDecoder,一种基于脉冲神经网络(SNN)的Transformer解码器,用于自然语言处理,通过替换ANN模块和优化嵌入方法,在保持性能的同时降低理论能耗87%-93%。

详情
AI中文摘要

Transformer架构被广泛认为是自然语言处理最强大的工具,但由于大量复杂操作,其本质上存在高能耗问题。为解决这一问题,我们考虑脉冲神经网络(SNN),它通过天然的事件驱动方式处理信息,是传统人工神经网络(ANN)的节能替代方案。然而,这本质上使得SNN难以训练。通常,许多基于SNN的模型通过转换预训练的ANN来规避这一问题。最近,有研究尝试设计可直接训练的基于SNN的Transformer模型结构改编。尽管结果显示出巨大潜力,但应用领域是计算机视觉,且所提模型仅包含编码器模块。在本文中,我们提出SpikeDecoder,一种完全基于SNN的Transformer解码器模块实现,用于自然语言处理。通过一系列实验,我们分析了用脉冲替代方案交换ANN模型不同模块的影响,以识别权衡和性能损失的主要来源。我们进一步研究了残差连接的作用以及SNN兼容归一化技术的选择。除了模型架构的工作,我们还制定并比较了将文本数据投影为脉冲的不同嵌入方法。最后,我们证明,与ANN基线相比,所提出的基于SNN的解码器模块将理论能耗降低了87%至93%。

英文摘要

The Transformer architecture is widely regarded as the most powerful tool for natural language processing, but due to a high number of complex operations, it inherently faces the issue of high energy consumption. To address this issue, we consider Spiking Neural Networks (SNNs), which are an energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their naturally event-driven approach to processing information. However, this inherently makes them difficult to train. Often, many SNN-based models circumvent this issue by converting pre-trained ANNs. More recently, attempts have been made to design directly trainable SNN-based adaptations of the Transformer model structure. Although the results showed great promise, the application field was computer vision. Moreover, the proposed model incorporates only encoder blocks. In this paper, we propose SpikeDecoder, a fully SNN-based implementation of the Transformer decoder block, for applications in natural language processing. In a series of experiments, we analyze the impact of exchanging different blocks of the ANN model with spike-based alternatives to identify trade-offs and significant sources of performance loss. We further investigate the role of residual connections and the selection of SNN-compatible normalization techniques. Besides the work on the model architecture, we formulate and compare different embedding methods to project text data into spikes. Finally, we demonstrate that our proposed SNN-based decoder block reduces the theoretical energy consumption by 87% to 93% compared to the ANN baseline.

2606.12286 2026-06-11 cs.CV 新提交

CellNet -- Localizing Cells using Sparse and Noisy Point Annotations

CellNet -- 利用稀疏和噪声点标注定位细胞

Benjamin Eckhardt, Dmytro Fishman, Stuart Fawke, Andrew Curtis, Bo Fussing, Constantin Pape

发表机构 * University of Göttingen(哥廷根大学) Wellcome Sanger Institute(威康桑格研究所) University of Tartu(塔尔图大学)

AI总结 提出基于回归的深度学习算法CellNet,利用稀疏点标注在相位对比显微镜图像中检测和计数细胞,减少标注负担,在低数据场景下优于零样本方法。

详情
Comments
Conference poster at Biology at Scale: From Variants to Cellular Programs and Functions
AI中文摘要

计数活细胞是许多生物学研究工作流程中的重要步骤。我们在Wellcome Sanger研究所的合作者通过大规模饱和基因组编辑筛选研究人类重要基因,这需要反复多次计数细胞。基于计算机视觉的自动化对于高通量和资源效率至关重要。在这项工作中,我们开发了一种基于回归的深度学习计算机视觉算法,用于检测和计数相位对比显微镜图像中的细胞。为了减少标注工作量(这在实际中常成为瓶颈),我们专注于仅使用稀疏点标注来计数细胞,这种标注方式快速且易于获取。通过与最先进的零样本方法比较,我们表明基于回归的计数在低数据场景下是一种有前景的替代方案。通过开发自动计数显微镜图像中活细胞的方法,我们为人类基因组的重要研究做出了贡献。代码可在以下网址获取:https://this https URL。

英文摘要

Counting living cells is an important step in many biological research workflows. Our collaborators at the Wellcome Sanger Institute study vital genes in humans via large scale saturation genome editing screening, which requires repeatedly counting cells a great number of times. Computer Vision based automation is crucial for high throughput and resource efficiency. In this work, we develop a regression-based deep learning computer vision algorithm to detect and count cells in phase-contrast microscopy images. To reduce annotation effort, which in practice often becomes a bottleneck, we focus on counting cells only using sparse point annotations, which are fast and easy to acquire. By comparison to state-of-the-art 0-shot methods, we show that regression-based counting is a promising alternative in low data regimes. Through developing methods to automatically count living cells in microscopy images, we contribute to valuable research on the human genome. The code is available at this https URL.

2606.12285 2026-06-11 cs.CY 新提交

Why AI Slop Matters, but Not Like That

为什么AI垃圾内容重要,但不是那样重要

Sachita Nishal, Marijn Sax, Kimon Kieslich

AI总结 本文回应《为什么垃圾内容重要》一文,通过内在和外部批判,指出其推理忽视了AI垃圾内容的社会技术背景,并基于伦理和社会科学视角,强调应关注其社会功能和审美价值,呼吁进行语境化和文化基础的讨论。

详情
AI中文摘要

这是对论文《为什么垃圾内容重要》的回应。通过提供内在和外部批判,我们认为作者的推理忽视了AI垃圾内容的社会技术背景。我们的论文呈现了一种基于伦理和社会科学的回应,将辩论聚焦于AI垃圾内容的社会功能和审美价值。我们得出结论,AI垃圾内容是一个重要的研究课题,但呼吁对该问题进行语境化和文化基础的讨论。为此,我们讨论了未来研究AI垃圾内容现象议程的一些关键要素。

英文摘要

This is a response to the paper ''Why Slop Matters''. By offering both immanent and external critique, we argue that the authors' reasoning neglects the socio-technical context of AI slop. Our paper presents an ethical and social science informed response that centers the debate on the social function and aesthetic value of AI slop. We conclude that AI slop is an important research subject but call for a contextual and culturally-grounded debate on the issue. To that end, we discuss some key elements of an agenda for future research on the phenomenon of AI slop.

2606.12282 2026-06-11 cs.SD cs.LG 新提交

PianoKontext: Expressive Performance Rendering from Deadpan Context

PianoKontext: 从平淡语境中生成富有表现力的演奏

Dmitrii Gavrilev

AI总结 提出PianoKontext,一种基于流匹配的钢琴演奏渲染模型,通过动态时间规整对齐乐谱与演奏的潜在表示,生成可变长度的表现力演奏。

详情
Comments
ICML 2026 Workshop on Machine Learning for Audio (Oral)
AI中文摘要

表现力演奏渲染(EPR)旨在根据音符序列生成逼真的演奏。然而,流匹配音频编辑模型仅操作相同时长的同步音乐样本,限制了它们对表现力时机的理解。我们提出了PianoKontext,一种针对古典钢琴音乐的流匹配渲染模型,该模型在预训练的Music2Latent模型的潜在空间中生成可变长度的演奏。我们将MIDI乐谱合成为平淡音频,并在潜在空间中使用动态时间规整(DTW)构建用于训练的对齐数据。对齐的嵌入在DiT块中拼接,从而简单有效地学习乐谱与演奏之间的依赖关系。音频样本可在我们的演示页面获取:此https URL。

英文摘要

Expressive performance rendering (EPR) aims to generate realistic performances constrained on sequences of notes. However, flow matching audio editing models manipulate only synchronized music samples of the same duration, limiting their understanding of expressive timing. We introduce PianoKontext, a flow matching rendering model for classical piano music that generates variable-length performances in the latent space of a pretrained Music2Latent model. We synthesize MIDI scores into deadpan audio and employ Dynamic Time Warping (DTW) in the latent space to construct paired data for training. The aligned embeddings are concatenated in DiT blocks, allowing for a simple and effective learning of the dependencies between the score and performances. Audio samples are available at our demo page: this https URL.

2606.12281 2026-06-11 cs.MA cs.AI cs.LG 新提交

CCKS: Consensus-based Communication and Knowledge Sharing

CCKS:基于共识的通信与知识共享

Jinyuan Zu, Xiaowei Lv, Yongcai Wang, Deying Li, Yunjun Han, Wenping Chen, Fengyi Zhang, Naiqi Wu

AI总结 针对多智能体强化学习中动作建议过度依赖教师指导的问题,提出基于共识的通信与知识共享框架,通过对比学习构建共识模型,平衡探索与学习,提升合作效率与性能。

详情
AI中文摘要

在分布式训练和分布式执行(DTDE)的协作多智能体强化学习(MARL)中,基于动作建议的知识共享促进了智能体间的可解释和可扩展合作。然而,当前的动作建议方法往往过于遵循教师的指导,而未评估师生兼容性,导致过度建议、稳定性欠佳和性能下降。为克服这些挑战,本文提出了一种基于共识的通信与知识共享(CCKS)框架,该框架允许智能体基于共识衍生的约束采纳建议,并更智能地遵循教师指令。该机制使智能体能够平衡探索与向经验丰富的教师学习,从而提升整体性能。关键在于共识模型的构建,为此我们提出在智能体训练阶段利用对比学习基于局部观测构建共识模型。在动作选择中,智能体根据共识和共享知识对动作进行评分和选择。CCKS设计为即插即用解决方案,可无缝集成到现有DTDE算法中。在Google Research Football环境和复杂的星际争霸II多智能体挑战中进行的实验表明,与当前的DTDE基线相比,集成CCKS显著提高了合作效率、学习速度和整体性能。代码可从此https URL获取。

英文摘要

In Decentralized Training and Decentralized Execution (DTDE) for cooperative Multi-Agent Reinforcement Learning (MARL), action-advising-based knowledge sharing promotes interpretable and scalable cooperation among agents. However, current action advising approaches often adhere too much to the teacher's guidance without evaluating teacher-student compatibility, which causes excessive advising, suboptimal stability, and degraded performance. To overcome these challenges, this paper presents a Consensus-based Communication and Knowledge Sharing (CCKS) framework, which allows agents to adopt recommendations based on consensus-derived constraints and to follow the teacher's instructions more smartly. This mechanism enables agents to balance exploration and learning from experienced teachers, improving overall performance. The key is the consensus model construction, for which we propose to employ contrastive learning to construct consensus models based on local observations in the agents' training phase. In action selection, agents score and choose actions based on consensus and shared knowledge. Designed as a plug-and-play solution, CCKS integrates seamlessly with existing DTDE algorithms. Experiments conducted in the Google Research Football environment and the complex StarCraft II Multi-Agent Challenge demonstrate that the integration with CCKS significantly improves cooperation efficiency, learning speed, and overall performance compared with current DTDE baselines. The code is available at this https URL.

2606.12280 2026-06-11 cs.LG 新提交

Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs

在8位权重和激活下保持FP8质量上限:Ideogram 4.0面向消费级GPU的INT8与GGUF后训练量化

Deep Gandhi, Ali Asaria, Tony Salomone

发表机构 * Transformer Lab

详情
AI中文摘要

后训练量化使得大型文本到图像扩散变换器能够在消费级GPU上运行,然而硬件特定的权衡很少被直接测量。我们对Ideogram 4.0——一个9.3B流匹配扩散变换器(DiT),以两个独立权重副本的形式部署,用于无分类器引导,并由Qwen3-VL-8B编码器调节——针对缺乏FP8张量核心的Ampere RTX 3090 GPU进行量化。我们的INT8 W8A8方案(逐通道权重、逐token动态激活、SmoothQuant以及对少量高脆弱性层的混合精度保护)保持了FP8的质量上限:在200提示基准上,INT8与FP8的配对同种子bootstrap置信区间在Pick和CLIP指标上均包含零,而INT8相比NF4提升了+1.9 CLIP(95%置信区间[+1.21,+2.64],排除零)。据我们所知,针对此类模型进行的逐类别OCR分析首次确认了文本可读性得以保留,而消融实验将前馈网络下投影的保护隔离为关键质量杠杆。我们的GGUF Q4_K量化在相同磁盘大小下优于NF4,并在质量-内存前沿上成为帕累托最优解,配对置信区间排除零(Q8_0质量中性)。最后,我们描述了8位量化在哪些方面有帮助以及哪些方面没有:INT8的权重与FP8的占用空间相当而非缩小,因此在Ampere上实现速度提升需要融合INT8内核。

英文摘要

Post-training quantization lets large text-to-image diffusion transformers run on consumer GPUs, yet the hardware-specific trade-offs are seldom measured directly. We quantize Ideogram 4.0 - a 9.3B flow-matching diffusion transformer (DiT), shipped as two separate-weight copies of a single-stream 34-layer backbone for classifier-free guidance and conditioned by a Qwen3-VL-8B encoder - for Ampere RTX 3090 GPUs, which lack FP8 tensor cores. Our INT8 W8A8 recipe (per-channel weights, per-token dynamic activations, SmoothQuant, and mixed-precision protection of a small high-fragility layer set) holds the FP8 quality ceiling: on a 200-prompt benchmark the paired same-seed bootstrap CI for INT8-FP8 includes zero on both Pick and CLIP, while INT8 improves on NF4 by $+1.9$ CLIP (95% CI $[+1.21,+2.64]$, excluding zero). A per-category OCR analysis, to our knowledge unreported for this model class, confirms text legibility is preserved, and an ablation isolates protection of the FFN down-projections as the dominant quality lever. Our GGUF Q4_K quantization beats NF4 at equal on-disk size and is the Pareto winner on the quality-memory frontier, with paired confidence intervals excluding zero (Q8_0 is quality neutral). Finally, we characterize where 8-bit quantization helps and where it does not: INT8's weights match FP8's footprint rather than shrink it, so a speed gain on Ampere awaits a fused INT8 kernel.

2606.12279 2026-06-11 cs.NE cs.AI cs.LG 新提交

Mathematical perspective on genetic algorithms with optimization guided operators

遗传算法与优化引导算子的数学视角

Anna Brandenberger, Ilan Doron-Arad, Elchanan Mossel

AI总结 本文从数学角度建模遗传算法,将优化问题转化为查询复杂度问题,并证明某些问题必须依赖生成、变异和重组算子,同时揭示了多样性在解池中的关键作用。

详情
Comments
18 pages, 1 figure
AI中文摘要

近期机器学习工作将遗传算法应用于推理阶段,以迭代改进优化问题的解。所涉及的基本变异和重组算子在性质上不同于经典研究。变异不再是随机的;机器学习算法以改进目标为目的对解进行变异。同样,重组不再基于父代解的随机拼接,而是基于机器学习的优化算子,其目标是从输入中合成改进的解。因此,这些变异和重组算子更有可能改进目标,但其计算成本更高。我们引入了一个遗传算法的通用模型,并使用强化学习的语言将优化问题表述为查询复杂度问题。然后我们研究专门模型。我们证明某些优化问题必须通过生成、变异和重组来解决。接着,我们在此框架内为一类问题获得了定性紧的算法,该算法捕捉了解池中多样性的非平凡作用,这是实际机器学习遗传算法的一个关键特征。

英文摘要

Recent work in ML applies genetic algorithms at inference time to iteratively improve solutions to optimization problems. The basic mutation and recombination operators involved are qualitatively different from those studied classically. Mutations are no longer random; an ML algorithm mutates a solution with the goal of improving an objective. Similarly, recombination is not based on random collages of parent solutions. Instead, it is an ML optimization-based operator whose goal is to synthesize improved solutions from its inputs. Thus, these mutation and recombination operators are more likely to improve the objective, but their computational cost is much higher. We introduce a general model of genetic algorithms and formulating optimization in this model as a query-complexity problem, using the language of reinforcement learning. We then study specialized models. We show that some optimization problems require generation, mutation, and recombination to be solved. We then obtain qualitatively tight algorithms for a family of problems within this framework that captures the nontrivial role of diversity in the solution pool, a key feature of practical ML genetic algorithms.

2606.12278 2026-06-11 cs.CV cs.LG 新提交

Finding Sparse Subnetworks in One Training Cycle via Progressive Magnitude-Based Pruning

通过渐进式幅度剪枝在一个训练周期内找到稀疏子网络

Romana Qureshi, Hafida Benhidour, Said Kerrache, Nahlah Aljeraisy

发表机构 * King Abdullah University of Science and Technology(阿卜杜拉国王科技大学) University of Jeddah(吉达大学) King Fahd University of Petroleum and Minerals(法赫德国王石油矿产大学) King Saud University(沙特国王大学)

AI总结 提出渐进式幅度剪枝方法,在单训练周期内线性增加稀疏度,基于权重幅度更新掩码,在CIFAR-10和MNIST上优于LTH、SNIP和GraSP等基线。

详情
AI中文摘要

神经网络剪枝通过移除不太重要的参数来减小模型大小,同时旨在保持预测性能。尽管彩票假说(LTH)表明,当从合适的初始化训练时,稀疏子网络可以匹配密集网络,但其迭代剪枝过程需要多个完整的训练周期。本工作评估了渐进式幅度剪枝作为一种单周期替代方案。该方法在训练期间使用线性调度逐渐增加稀疏度,并基于活跃权重幅度更新剪枝掩码。我们在CIFAR-10和MNIST上,针对ResNet、VGG风格和LeNet架构进行了系统实验,将所提方法与代表性的迭代和基于初始化的剪枝基线(包括LTH、SNIP和GraSP)进行比较。在CIFAR-10上,该方法在ResNet-18上以72.9%稀疏度达到95.12%的准确率,而LTH报告为90.5%。在极端稀疏度下,它在VGG类架构上以97%稀疏度达到93.13%的准确率,而SNIP约为92.0%;在VGG-19上以97.97%稀疏度达到93.44%的准确率,而GraSP在98%稀疏度下为92.19%。在ResNet-18上的稀疏度-准确率分析进一步表明,在70-85%稀疏度范围内,准确率保持在密集基线的0.1个百分点以内。这些结果表明,在所评估的设置下,渐进式幅度剪枝为神经网络稀疏化提供了一种有效的单周期方法。

英文摘要

Neural network pruning reduces model size by removing less important parameters while aiming to preserve predictive performance. Although the Lottery Ticket Hypothesis (LTH) shows that sparse subnetworks can match dense networks when trained from suitable initializations, its iterative pruning procedure requires multiple complete training cycles. This work evaluates progressive magnitude-based pruning as a single-cycle alternative. The method gradually increases sparsity during training using a linear schedule and updates pruning masks based on active weight magnitudes. We conduct systematic experiments on CIFAR-10 and MNIST across ResNet, VGG-style, and LeNet architectures, comparing the proposed method with representative iterative and initialization-based pruning baselines, including LTH, SNIP, and GraSP. On CIFAR-10, the method achieves 95.12\% accuracy on ResNet-18 at 72.9\% sparsity, compared with 90.5\% reported for LTH. At extreme sparsity, it achieves 93.13\% accuracy on a VGG-like architecture at 97\% sparsity, compared with approximately 92.0\% for SNIP, and 93.44\% accuracy on VGG-19 at 97.97\% sparsity, compared with 92.19\% for GraSP at 98\% sparsity. A sparsity-accuracy analysis on ResNet-18 further shows that accuracy remains within 0.1 percentage points of the dense baseline across 70--85\% sparsity. These results indicate that progressive magnitude-based pruning provides an effective single-cycle approach for neural network sparsification under the evaluated settings.

2606.12277 2026-06-11 cs.LG 新提交

Finding Multiple Interpretations in Datasets

在数据集中寻找多种解释

Matthew Chak, Paul Anderson

发表机构 * Department of Computer Science, California Polytechnic State University(加州州立理工大学计算机科学系)

AI总结 提出一种方法,在保持性能的同时,找到具有不同上下文感知特征但性能相似的模型集,以提取对潜在现象的洞察。

详情
AI中文摘要

在本文中,我们提出了一种方法,用于寻找在损失/准确率测量方面表现相似但具有高度不同上下文感知特征的模型集。通过在METABRIC数据集上的实验,我们表明所提出的方法找到了多个模型,这些模型的基因表达与对照组方法找到的模型高度不同,且没有性能损失。我们认为,只要目标是分析模型的任何全局特征以提取对正在研究的潜在现象的洞察,所提出的方法就很重要。

英文摘要

In this paper, we propose an approach to finding sets of similar-performing models (in terms of loss/accuracy measurements) with highly different context-aware characteristics. Through experiments on the METABRIC dataset, we show that the proposed method finds multiple models with highly different gene expressions than those found by the control methodology without performance penalties. We argue that the proposed methodology is important whenever one aims to analyze any global characteristic of a model to extract insight into the underlying phenomenon being studied.

2606.12273 2026-06-11 cs.CL 新提交

Beyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language Models

超越完全随机掩码:扩散语言模型的注意力引导去噪与优化

Jia Deng, Junyi Li, Wayne Xin Zhao, Jinpeng Wang, Hongyu Lu, Ji-Rong Wen

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China(中国人民大学高瓴人工智能学院) Department of Data Science, City University of Hong Kong(香港城市大学数据科学系) Meituan(美团) WeChat, Tencent(腾讯微信) Beijing Key Laboratory of Research on Large Models and Intelligent Governance(大型模型与智能治理北京市重点实验室)

AI总结 提出AGDO框架,利用注意力结构指导去噪顺序并强化关键令牌,在数学和编码基准上提升扩散语言模型的推理性能。

详情
Comments
13 pages. Accepted to ACL 2026 Main Conference
AI中文摘要

扩散大语言模型(dLLMs)通过并行解码提供了自回归模型的高效替代方案,然而现有的后训练方法大多依赖随机掩码策略,忽略了内在的令牌依赖关系。在这项工作中,我们对dLLMs中的注意力进行了实证分析,表明对未掩码上下文关注更强的令牌表现出更高的生成稳定性,并在推理中发挥关键作用。受这些发现启发,我们提出了AGDO,一种注意力引导的去噪与优化框架,将训练和优化与注意力导出的依赖关系对齐。AGDO基于注意力结构确定去噪顺序,并在监督微调和强化学习过程中强调注意力关键令牌。在数学和编码基准上的实验表明,AGDO持续提升推理性能,优于dLLMs的最先进后训练方法。

英文摘要

Diffusion large language models (dLLMs) offer an efficient alternative to autoregressive models through parallel decoding, yet existing post-training methods largely rely on random masking strategies that overlook intrinsic token dependencies. In this work, we present an empirical analysis of attention in dLLMs and show that tokens attending more strongly to unmasked context exhibit greater generation stability and play a critical role in reasoning. Motivated by these findings, we propose AGDO, an attention-guided denoising and optimization framework that aligns both training and optimization with attention-derived dependencies. AGDO determines the denoising order based on attention structure and emphasizes attention-critical tokens during supervised fine-tuning and reinforcement learning. Experiments on mathematical and coding benchmarks demonstrate that AGDO consistently improves reasoning performance, outperforming state-of-the-art post-training methods for dLLMs.

2606.12268 2026-06-11 cs.AI 新提交

The Impossibility of Eliciting Latent Knowledge

引出潜在知识的不可能性

Korbinian Friedl, Francis Rhys Ward, Paul Yushin Rapoport, Tom Everitt, Jonathan Richens

发表机构 * The London School of Economics and Political Science(伦敦政治经济学院) Independent(独立机构)

AI总结 本文利用因果影响图形式化定义引出潜在知识问题,证明不存在仅依赖行为反馈的训练策略能确保智能体诚实报告其信念。

详情
Comments
24 pages, 3 figures. Includes proofs in appendix
AI中文摘要

高级AI系统对其环境拥有广泛的知识;事实上,它们的知识可能(远远)超过其开发者或用户。因此,AI系统的一个理想属性是诚实——即它准确报告其对世界的信念。设计一个诚实的AI系统可能很困难,特别是当我们想询问关于环境中潜在变量的问题时——这些变量对与之交互的人类是隐藏的。这就引出了引出潜在知识(ELK)问题:训练AI智能体诚实报告其信念的问题。在本文中,我们使用因果影响图(CID)使ELK在形式上精确化。CID可用于描述智能体的训练环境与其主观世界表征之间的关系。我们使用CID来形式化可观测变量和潜在变量之间的区别,明确指定智能体诚实的确切含义,并正式定义目标泛化错误。我们证明,在某些情况下,开发者可以通过在训练期间提供正确的反馈来激励智能体诚实回答问题。然而,智能体泛化的一种自然但不理想的方式是提供人类会评估为真实的答案,而不是诚实的答案。我们证明了一个不可能性定理:不存在仅依赖于智能体行为且能确保产生诚实智能体的基于反馈的训练策略,即使在训练期间反馈是完美的。

英文摘要

Advanced AI systems have extensive knowledge of their environments; in fact, their knowledge may (far) exceed that of their developers or users. Consequently, a desirable property for an AI system is that it is honest -- that it accurately reports its beliefs about the world. Designing an AI system to be honest may be difficult, especially if we want to ask it questions about latent variables in the environment -- variables which are hidden from the human interacting with it. This gives rise to the problem of eliciting latent knowledge (ELK): the problem of training an AI agent to honestly report its beliefs. In this paper, we make ELK formally precise using Causal Influence Diagrams (CIDs). CIDs can be used to describe the relationship between an agent's training environment and its subjective representation of the world. We use CIDs to formalise the distinction between observable and latent variables, to specify what exactly it means for an agent to be honest, and to formally define goal misgeneralisation. We show that, under certain circumstances, developers can incentivise an agent to honestly answer questions by providing correct feedback during training. However, a natural, but undesirable, way for an agent to generalise is to provide answers which humans would evaluate as true, rather than honest answers. We prove an impossibility theorem stating: There is no feedback-based training strategy that depends only on agent behaviour and with certainty produces an honest agent, even if feedback is perfect during training.

2606.12263 2026-06-11 cs.CV 新提交

VOID: Defeating Unauthorized Mimicry in Latent Diffusion Models

VOID: 击败潜在扩散模型中的未授权模仿

Chunlin Qiu, Ang Li, Tianxiao Huang, Ruilin Gan, Yunjie Ge, Shenyi Zhang, Huayi Duan, Lingchen Zhao, Chao Shen, Qian Wang

发表机构 * School of Cyber Science and Engineering, Wuhan University(武汉大学网络空间安全学院) School of Computer Science, Wuhan University(武汉大学计算机学院) Institute for Math&AI, Wuhan University(武汉大学数学与人工智能研究所) The Hong Kong University of Science and Technology (Guangzhou)(香港科技大学(广州)) School of Cyber Science and Engineering, Xi’an Jiaotong University(西安交通大学网络空间安全学院)

AI总结 针对潜在扩散模型被用于未授权模仿的问题,提出VOID防御框架,通过操纵模型内在随机性,放大潜在编码误差并抵消目标引导信号,实现语义破坏,阻止未授权模仿,同时将扰动限制在人眼不可感知区域。

详情
Comments
To appear in the 35th USENIX Security Symposium (USENIX Security 2026)
AI中文摘要

虽然潜在扩散模型(LDM)彻底改变了视觉合成,但它们越来越多地被用于对个人的未授权模仿。现有防御通过注入欺骗性扰动,将生成图像引导至无关目标。然而,这种方法基于一个无根据的假设:微小的扰动能在LDM的整个生成过程中保持其欺骗效果。实际上,模型固有的恢复机制会移除这些扰动,导致个体身份在生成的图像中重新出现。我们提出VOID,一种通过操纵LDM内在随机性克服这一难题的防御框架。VOID以两种新颖方式扰动扩散管道:1)放大潜在编码误差以破坏图像的语义结构,以及2)抵消目标引导信号以抑制模型的恢复能力。这导致语义破坏,阻止任何未授权模仿。值得注意的是,安全增益不以视觉效用为代价,因为VOID同时设法将扰动限制在受保护图像的人眼不可感知区域。我们在5个数据集上对10种模仿攻击的24种最先进防御进行了全面评估,证明了VOID前所未有的保护能力:它将平均Frechet Inception Distance(FID)从113提高到365,比迄今为止最强的防御提升了223%。

英文摘要

While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing defenses inject deceptive perturbations to steer the generated images toward irrelevant targets. However, this approach hinges on an ungrounded assumption: subtle perturbations can maintain their deceptive efficacy throughout an LDM's extensive generation process. In reality, the model's innate restoration mechanism will remove such perturbations and cause individual identities to re-emerge in the images generated. We propose VOID, a defense framework that overcomes this conundrum by manipulating an LDM's intrinsic stochasticity. VOID perturbs the diffusion pipeline in two novel ways: 1) amplifying the latent encoding errors to shatter an image's semantic structure, and 2) counteracting the target guidance signals to suppress the model's restoration capabilities. This results in a semantic corruption that thwarts any unauthorized mimicry. Notably, the security gain does not come at the price of visual utility, as VOID simultaneously manages to confine perturbations to human-imperceptible regions of protected images. Our comprehensive evaluation of 24 state-of-the-art defenses against 10 mimicry attacks on 5 datasets demonstrates VOID's unprecedented protection power: it increases the average Frechet Inception Distance (FID) from 113 to 365, a 223% improvement over the strongest defense to date.

2606.12260 2026-06-11 econ.TH cs.AI cs.GT cs.LG stat.ML 新提交

Market Design for AI: Beyond the Copyright Binary

人工智能的市场设计:超越版权二元论

Yan Dai, Maryam Farboodi, Negin Golrezaei, Sepehr Shahshahani

AI总结 本文通过静态和动态博弈模型,分析AI训练数据市场中“自由使用”与“强知识产权”两种模式的失败,提出通过数据中介内部化外部性并补贴创新贡献的市场设计。

详情
AI中文摘要

我们如何设计一个用于训练AI模型的人类生成内容市场,既能促进技术进步,又能保留个人创作高质量内容的激励?现有方法采取两极立场:基于合理使用的“自由使用”模式和“强知识产权”模式。我们证明两者均失败:自由使用不补偿创作者,而通过建模为静态Stackelberg博弈,强知识产权也削弱了创作激励。我们发现这对更具创新性的创作者尤其如此,我们将此现象称为“原创性惩罚”。将这一见解扩展到动态模型,我们发现另一种市场失灵会损害AI模型性能,即使对于初始良好的模型也是如此:此类模型导致人类更依赖AI辅助创作,导致同质化内容反馈到训练中,从而降低模型性能——即“精确性诅咒”。我们进一步提出一种市场设计,通过数据中介内部化跨创作者外部性并补贴创新贡献,从而恢复效率。

英文摘要

How can we design a market of human-generated content for use in training AI models that both enables technological progress and preserves individual incentives for high-quality content creation? Existing approaches take polar positions: a "free-for-all" model based on fair use and a "strong intellectual property rights" model. We show that both fail: Free-for-all does not compensate creators, and -- by modeling as a static Stackelberg game -- strong intellectual property rights also underpower creative incentives. We find this especially true for more innovative creators, a phenomenon we term the "originality penalty." Extending this insight to a dynamic model, we find another market failure undermining AI model performance, even for an initially good model: Such a model induces greater reliance by humans on AI-assisted creation, resulting in homogenized content feeding back into training, which degrades the model performance -- a "curse of precision." We further propose a market design with a data intermediary internalizing cross-creator externalities and subsidizing innovative contributions, thereby restoring efficiency.

2606.12259 2026-06-11 cs.CR cs.AR 新提交

Partitioned Tags, Shared Data: Reconciling Strict Cache Isolation with Write-Shared Coherence

分区标签,共享数据:严格缓存隔离与写共享一致性的调和

Kartik Ramkrishnan, Stephen McCamant, Antonia Zhai, Pen Chung Yew

AI总结 提出SCP方法,通过仅分区标签、共享数据池并调整大小避免容量驱逐,结合时序混淆和写泄漏阈值,在严格隔离下实现写共享一致性,有效防御Prime+Probe和Flush+Reload攻击。

详情
AI中文摘要

缓存分区是针对基于驱逐的缓存侧信道攻击最强大的结构性防御之一,然而一个存在十年的设计问题阻碍了其在安全共享操作系统环境中的广泛部署。该问题是写共享一致性在严格分区下会崩溃。我们提出SCP(安全且一致的分区),它通过仅分区标签、共享单个数据池,并调整数据池大小以避免容量驱动的跨分区驱逐,从而将严格的驱逐隔离与写共享一致性结合起来。时序混淆将保护扩展到分区间的查找路径。通过将写操作在泄漏阈值超过后路由到LLC,减轻了共享可写行上的基于一致性的泄漏,这使得攻击者的写探测延迟与受害者活动无关。使用gem5实现,SCP缓解了Prime+Probe和Flush+Reload攻击,这些是更复杂缓存攻击的基础。我们还展示了一个共享可写行攻击被缓解。所有这些攻击的结果都不优于随机猜测。SCP的硬件成本是LLC SRAM适度增加2.8%。在我们评估的SPEC CPU2017基准测试中,性能在IPC上与DAWG相差在0.3%以内。共享密集型微基准测试展示了基于系统指定泄漏阈值的可调安全-性能权衡。

英文摘要

Cache partitioning is among the strongest structural defenses against eviction-based cache side channels, yet a decade-old design issue has blocked its widespread deployment in secure shared-OS settings. The issue is that write-shared coherence collapses under strict partitioning. We present SCP (Secure and Coherent Partitioning), which combines strict eviction isolation with write-shared coherence by partitioning only the tags, sharing a single data pool, and sizing the data pool so capacity-driven cross-partition eviction cannot occur. Timing obfuscation extends protections to the inter-partition lookup path. Coherence-based leakage on shared-writeable lines is mitigated by routing those writes through to the LLC once a leakage threshold is crossed, which makes attacker write probe latency independent of victim activity. Using gem5 for implementation, SCP mitigates Prime+Probe and Flush+Reload, which are the basis for more sophisticated cache attacks. We also demonstrate that a shared-writeable-line attack is mitigated. All these attacks yield results no better than random guessing. SCP's hardware cost is a modest +2.8% LLC SRAM. Performance matches DAWG within 0.3% IPC on the SPEC CPU2017 benchmarks that we evaluated. Sharing-intensive microbenchmarks demonstrate a tunable security-performance tradeoff based on a system-specified leakage threshold.

2606.12258 2026-06-11 cs.CV 新提交

Bridging Day and Night: Unsupervised Cross-Domain Re-Identification with Synergistic Prompt and Prototype Learning

连接昼夜:基于协同提示与原型学习的无监督跨域重识别

Jiyang Xu, Rui Liu, Hang Dai

发表机构 * School of Computer Science, Wuhan University(武汉大学计算机学院)

AI总结 提出无监督昼夜重识别框架,结合提示学习和原型表示学习,通过两阶段训练实现无标注跨域身份关联,性能媲美全监督方法。

详情
AI中文摘要

跨域昼夜重识别(ReID)面临昼夜场景间显著视觉外观差异的根本挑战。现有的全监督方法严重依赖劳动密集型标注,成本高昂且跨域泛化能力有限。本文研究无监督昼夜重识别,提出一种新颖框架,协同结合提示学习和基于原型的表示学习,无需人工标注即可关联跨域身份。我们的方法采用渐进式两阶段训练策略。第一阶段,利用视觉语言模型以无标注方式生成实例特定的文本提示。我们采用实例级对齐机制,将视觉特征和文本提示嵌入统一语义空间,通过实例感知的动态偏差适应将未标注的昼夜图像与可学习提示对齐。第二阶段,构建域特定原型记忆库,并引入两个互补模块:i) 域内身份关联模块,增强每个域内的特征判别性;ii) 跨域原型匹配模块,可靠识别正负原型对,从而建立昼夜间的鲁棒身份对应关系。在公开基准上的大量实验验证了方法的有效性。在无监督设置下,我们的框架取得了与最先进全监督方法相当的Rank-1准确率。

英文摘要

Cross-domain day-night re-identification (ReID) is fundamentally challenged by the substantial visual appearance discrepancies between daytime and nighttime scenes. Existing fully supervised methods rely heavily on labor-intensive annotations, which are costly and exhibit limited generalization across domains. In this work, we investigate unsupervised day-night ReID and propose a novel framework that synergistically combines prompt learning and prototype-based representation learning to associate identities across domains without requiring manual labels. Our approach follows a progressive two-stage training strategy. In the first stage, we exploit the vision-language model to generate instance-specific textual prompts in an annotation-free manner. We employ an instance-level alignment mechanism to embed visual features and textual prompts into a unified semantic space, aligning unlabeled day/night images with learnable prompts via instance-aware dynamic-bias adaptation. In the second stage, we construct domain-specific prototype memory banks and introduce two complementary modules: i) an intra-domain identity association module to enhance feature discriminability within each domain, and ii) a cross-domain prototype matching module to reliably identify positive and negative prototype pairs, thereby establishing robust identity correspondences across day and night. Extensive experiments on public benchmarks validate the effectiveness of our method. Under the unsupervised setting, our framework attains Rank-1 accuracy comparable to state-of-the-art fully supervised methods.

2606.12252 2026-06-11 cs.LG cs.AI 新提交

Using Explainability as a Training-Time Reliability Signal for Efficient ECG Classification

使用可解释性作为训练时可靠性信号实现高效心电图分类

Veerendhra Kumar Dangeti, Xiao Gu, Ying Weng, Shreyank N Gowda

发表机构 * School of Computer Science, University of Nottingham(诺丁汉大学计算机科学学院) Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford(牛津大学工程科学系生物医学工程研究所) School of Computer Science, University of Nottingham Ningbo China(宁波诺丁汉大学计算机科学学院)

AI总结 提出ERTS方法,利用训练中的解释质量(Grad-CAM注意力图)区分信息性和不可靠不确定性,过滤低聚焦样本,在三个ECG数据集上提升macro-F1并降低训练成本。

详情
AI中文摘要

训练用于临床时间序列分析的深度神经网络计算需求高,但许多医疗环境缺乏重复模型开发和部署所需的资源。这一挑战在心电图分类中尤为明显,大数据集和长训练计划使效率变得重要。渐进式数据丢弃通过从梯度更新中排除已学习的样本来降低训练成本,但它依赖模型置信度,可能保留因噪声或歧义而难以处理而非有用信号的样本。在这项工作中,我们引入了ERTS,一种基于可解释性的可靠性训练信号,用于高效心电图分类。ERTS在训练期间利用解释质量来区分信息性和不可靠的不确定性。基于渐进式数据选择,我们计算候选样本的Grad-CAM注意力图,并推导出一个聚焦分数,衡量模型预测是否得到连贯且局部化模式的支持。低聚焦样本被过滤掉,而具有有意义注意力的样本优先进行梯度更新。我们在三个ECG数据集和多个骨干架构上评估ERTS,显示macro-F1的一致提升以及有效训练成本的降低。这些结果表明,解释质量可以作为改善临床时间序列学习中效率和可靠性的实用信号。代码将发布。

英文摘要

Training deep neural networks for clinical time-series analysis is computationally demanding, yet many healthcare settings lack the resources required for repeated model development and deployment. This challenge is particularly evident in electrocardiogram classification, where large datasets and long training schedules make efficiency practically important. Progressive Data Dropout reduces training cost by excluding samples from gradient updates once they are learned, but it relies on model confidence and may retain samples that are difficult due to noise or ambiguity rather than useful signal. In this work, we introduce ERTS, an explainability-based reliability training signal for efficient ECG classification. ERTS uses explanation quality during training to distinguish between informative and unreliable uncertainty. Building on progressive data selection, we compute Grad-CAM attention maps for candidate samples and derive a focus score that measures whether model predictions are supported by coherent and localised patterns. Samples with low focus are filtered out, while those with meaningful attention are prioritised for gradient updates. We evaluate ERTS across three ECG datasets and multiple backbone architectures, showing consistent improvements in macro-F1 alongside reduced effective training cost. These results suggest that explanation quality can serve as a practical signal for improving both efficiency and reliability in clinical time-series learning. Code will be released.

2606.12251 2026-06-11 cs.LG cs.AI cs.CR 新提交

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

强化学习破坏基于梯度的对抗优化

Xinhai Zou, Chang Zhao, Alireza Aghabagherloo, Dave Singelée, Robin Degraeve, Bart Preneel

发表机构 * COSIC, KU Leuven(鲁汶大学COSIC) Imec Brubotics, VUB(布鲁塞尔自由大学Brubotics) DistriNet, KU Leuven(鲁汶大学DistriNet)

AI总结 研究通过强化学习训练图像分类器以破坏攻击者使用的梯度结构,发现RL作为隐式正则化器产生不稳定梯度方向和较小梯度幅度,使基于梯度的攻击失效,并与对抗训练结合实现双重防御。

详情
AI中文摘要

基于梯度的对抗攻击仍然是对深度神经网络(DNN)的主要威胁,因为它们利用梯度信息高效优化对抗扰动。为了解决这个问题,我们研究了强化学习(RL)训练是否可以通过使用策略梯度目标和epsilon-贪婪探索来训练图像分类器,从而破坏攻击者使用的梯度结构。通过在CIFAR-10、CIFAR-100和ImageNet-100上使用多种架构进行系统实验,我们发现RL训练的分类器显著破坏了基于梯度的对抗优化。为了解释这一点,我们使用损失景观可视化、静态和动态梯度指标以及预测熵进行了全面的机制分析。我们的分析揭示,RL充当隐式正则化器,产生具有高度不稳定梯度方向和较小梯度幅度的模型。这种组合使得每个PGD步骤在方向上不可靠且幅度有限,导致基于梯度的攻击在实际迭代预算内失败。我们进一步表明,将RL与对抗训练(RL-adv)结合提供了在两个互补层面运作的双层防御:RL退化攻击者可用的梯度信息(梯度级防御),而对抗训练强化决策边界(边界级防御)。RL-adv在所有评估的主要攻击类型(包括基于梯度的PGD、AutoAttack、基于迁移和基于查询的攻击)中实现了最高的鲁棒性,显著优于SL-adv。这些发现将RL诱导的梯度破坏识别为一种互补的鲁棒性机制,并激励未来研究结合SL效率与RL梯度正则化特性的混合SL-RL训练调度。

英文摘要

Gradient-based adversarial attacks remain a dominant threat to deep neural networks (DNNs), as they exploit gradient information to efficiently optimize adversarial perturbations. To address this, we investigate whether reinforcement learning (RL) training can disrupt the gradient structure used by attackers by training image classifiers with policy-gradient objectives and epsilon-greedy exploration. Through systematic experiments across CIFAR-10, CIFAR-100, and ImageNet-100 with multiple architectures, we find that RL-trained classifiers significantly disrupt gradient-based adversarial optimization. To explain this, we conduct a comprehensive mechanism analysis using loss landscape visualization, static and dynamic gradient indicators, and predictive entropy. Our analysis reveals that RL acts as an implicit regularizer, producing models with highly unstable gradient directions and smaller gradient magnitudes. This combination makes each PGD step both unreliable in direction and limited in magnitude, causing gradient-based attacks to fail within practical iteration budgets. We further show that combining RL with adversarial training (RL-adv) provides a dual-layer defense operating at two complementary levels: RL degrades gradient information available to attackers (gradient-level defense), while adversarial training strengthens decision boundaries (boundary-level defense). RL-adv achieves the highest robustness across all major attack types evaluated, including gradient-based (PGD, AutoAttack), transfer-based, and query-based attacks, outperforming SL-adv by a significant margin. These findings identify RL-induced gradient disruption as a complementary robustness mechanism and motivate future research on hybrid SL-RL training schedules that combine SL's efficiency with RL's gradient-regularization properties.

2606.12250 2026-06-11 cs.CL 新提交

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

重新评估高性能大语言模型在波兰医学考试中的表现:真实能力还是偏差驱动?

Antoni Lasik, Jakub Pokrywka, Łukasz Grzybowski, Jeremi Ignacy Kaczmarek, Gabriela Korzańska, Janusz Świeczkowski-Feiz, Oskar Pastuszek, Paulina Hoffman, Jakub Tomasz Dąbrowski, Wojciech Kusa

发表机构 * NASK National Research Institute(NASK国家研究所) Adam Mickiewicz University(亚当·密茨凯维奇大学) ARAAI Poland(ARAAI波兰) Poznań University of Medical Sciences(波兹南医科大学) Centre of Postgraduate Medical Education, Poland(波兰研究生医学教育中心) T. Marciniak Lower Silesian Specialist Hospital(T. 马尔奇尼亚克下西里西亚专科医院) Medical University of Warsaw(华沙医科大学)

AI总结 通过引入扩展和更具挑战性的波兰医学考试基准,减少MCQA伪影,发现标准MCQA分数高估了LLM的真实临床能力,最佳模型在更难的设置下分数下降28.4和31个百分点。

详情
Comments
26 pages total with references and appendix, preprint
AI中文摘要

医学领域的大语言模型(LLM)主要通过多项选择题问答(MCQA)进行评估,但由于猜测策略和答案偏差,这种方法可能高估真实的临床能力。为解决这些局限性,我们引入了一个基于波兰医学考试的扩展且更具挑战性的基准,增加了超过15,000道题目、两个新领域和四项结构修改,以减少MCQA特定伪影并更好地测试推理能力。我们评估了21个LLM,结果表明评估设计对结果影响很大。在我们的更难设置下,最佳模型(Qwen3.5-122B)在英语和波兰语考试中分别下降了28.4和31个百分点。尽管数据污染证据不足,但标准MCQA分数并不能可靠地反映真实的医学能力。为促进进一步研究,我们公开了该基准。

英文摘要

Large language models (LLMs) in medicine are mainly evaluated using multiple-choice question answering (MCQA), which can overestimate real clinical ability due to guessing strategies and answer biases. To address these limitations, we introduce an expanded and more challenging benchmark based on Polish medical exams, adding over 15,000 questions, two new domains, and four structural modifications that reduce MCQA-specific artifacts and better test reasoning. We evaluate 21 LLMs and show that evaluation design strongly affects results. Under our harder setup, the best model (Qwen3.5-122B) drops by 28.4 and 31 pp on English and Polish exams, respectively. Despite low evidence of data contamination, standard MCQA scores do not reliably reflect true medical competence. To facilitate further research, we make our benchmark publicly available.

2606.12248 2026-06-11 cs.CV 新提交

Damage-TriageFormer: A Foundation-Model Framework for Typology-Based Building Damage Assessment from Mono-Temporal Imagery

Damage-TriageFormer:基于类型学的单时相影像建筑损伤评估的基础模型框架

Yiming Xiao, Yu-Hsuan Ho, Sanjay Thasma, Junwei Ma, Ali Mostafavi

发表机构 * Texas A&M University(德克萨斯A&M大学) Resilitix Intelligence LLC Institute for a Disaster Resilient Texas(德克萨斯灾害韧性研究所)

AI总结 提出Damage-TriageFormer,一种基于单张灾后影像的建筑损伤类型学评估模型,通过扩展DINOv3 ViT-L骨干网络和两阶段门控损伤头,在三个灾害数据集上实现了宏观F1约0.62,无需灾前影像即可支持应急响应。

详情
AI中文摘要

决策相关的建筑损伤评估对于灾后资源优先分配和恢复至关重要,但大多数自动化方法要么将损伤扁平化为单一严重程度等级(无损伤、轻微、严重、摧毁),要么需要成对的灾前和灾后影像,而这对于突发灾害通常不可用。本文提出了Damage-TriageFormer,一种基于单张灾后影像、足迹条件化的模型,它生成损伤类型学而非严重程度等级。我们的贡献包括:(1)DamageTriage-Bench,一个基于NOAA应急响应影像(涵盖2018年迈克尔飓风、2024年海伦飓风和2025年洛杉矶野火复合灾害)构建的新基准,包含五个类型学类别,区分屋顶损伤和结构损伤,并在每个类别内区分部分和全部范围;(2)Damage-TriageFormer,它扩展了DINOv3 ViT-L骨干网络,结合简单特征金字塔进行更高分辨率的实例池化、两阶段门控损伤头以及辅助严重程度回归目标。我们的模型在验证集上达到宏观F1为0.624,在保留的分层测试集上为0.619,在运营分类最需要的地方表现最强,无损伤建筑和完全结构倒塌的每类F1分别为0.91和0.84。尽管罕见的完全屋顶损伤类别由于样本有限和固有的模糊标签边界仍然困难,但我们的结果表明,单张灾后影像可以支持可操作的建筑损伤分类,无需灾前参考即可实现有针对性的应急响应和资源分配。

英文摘要

Decision-relevant building damage assessment is critical for prioritizing resources and recovery after a disaster, yet most automated methods either flatten damage into a single severity scale (no damage, minor, major, destroyed) or require paired pre- and post-event imagery that is often unavailable for emerging hazards. This paper presents Damage-TriageFormer, a single-image, post-event, footprint-conditioned model that produces a damage typology rather than a severity scale. We contribute: (1) DamageTriage-Bench, a new benchmark built from NOAA Emergency Response Imagery across Hurricane Michael (2018), Hurricane Helene (2024), and the 2025 Los Angeles wildfire complex, with five typology classes that distinguish roof damage from structural damage and, within each, partial from total extent; and (2) Damage-TriageFormer, which extends a DINOv3 ViT-L backbone with a Simple Feature Pyramid for higher-resolution instance pooling, a two-stage gated damage head, and an auxiliary severity-regression objective. Our model achieves macro F1 of 0.624 on validation and 0.619 on a held-out stratified test set, performing strongest where operational triage needs it most, with per-class F1 of 0.91 and 0.84 on undamaged buildings and total structural collapse, respectively. While the rare Total Roof Damage class remains difficult due to its limited examples and an inherently ambiguous label boundary, our results show that single-image post-event imagery can support actionable building damage typing, enabling targeted emergency response and resource allocation without a pre-event reference.

2606.12247 2026-06-11 cs.CY cs.CL 新提交

Beyond Third-Person Audits: Situated Interaction Auditing for User-Centered LLM Bias Research

超越第三人称审计:以用户为中心的LLM偏见研究的场景交互审计

Andrés Abeliuk, Cinthia Sanchez Macias, Valentina Alarcón, Álvaro Madariaga, Claudia Lopez

AI总结 提出场景交互审计(SIA)框架,通过分析用户画像信号(如社会人口统计标记、写作风格和身份陈述)如何系统性地影响LLM响应质量、内容和语气,以用户为中心研究LLM偏见。

详情
AI中文摘要

大型语言模型(LLM)的偏见研究主要集中在第三人称审计上,即研究模型如何作为外部主体表征或评估人口群体。然而,这种范式忽略了一个结构性盲点:用户不在审计中。在实践中,LLM用于开放式的个人交互,在此过程中模型隐式地代表用户并相应调整其响应。当相同的请求因提问者不同而产生不同响应时,偏见不仅体现在模型如何描述他人,还体现在它如何对待对话者。我们提出场景交互审计(SIA),这是一个以用户为中心的框架,用于研究用户画像信号——隐式社会人口统计标记、写作风格和陈述身份——如何系统性地塑造LLM响应质量、内容和语气。我们通过一个案例研究来展示该框架,该案例研究跨多个任务领域交叉了性别和社会经济地位信号,并概述了SIA作为自然语言处理新使命的研究议程。

英文摘要

Research on bias in large language models (LLMs) has predominantly focused on third-person audits, which study how models represent or evaluate demographic groups as external subjects. However, this paradigm overlooks a structural blind spot because the user is absent from the audit. In practice, LLMs are used in open-ended, personal interactions, during which the model implicitly represents the user and adjusts its responses accordingly. When identical requests yield different responses depending on who is asking, bias manifests not in how the model describes others but in how it treats its interlocutor. We propose Situated Interaction Auditing (SIA), a user-centered framework for studying how user profile signals -- implicit sociodemographic markers, writing style, and stated identity -- systematically shape LLM response quality, content, and tone. We demonstrate the framework through a case study that intersects gender and socioeconomic status signals across multiple task domains and outline a research agenda for SIA as a new mission for natural language processing.

2606.12246 2026-06-11 cs.DC cs.IR 新提交

Efficient and Robust Online Learning to Rank in Decentralized Systems

去中心化系统中高效且鲁棒的在线学习排序

Marcel Gregoriadis, Martijn de Vos, Sayan Biswas, Anne-Marie Kermarrec, Johan Pouwelse

AI总结 提出RankGuard框架,通过用户间直接交换模型更新并利用私有点击历史防御投毒攻击,首次给出去中心化在线学习排序的收敛性证明,效率最高提升62倍。

详情
AI中文摘要

在在线学习排序(OLTR)中,排序模型直接从实时用户交互中训练,但现有系统依赖可信中央服务器来收集和处理这些交互。这使得操作者可以自由引入与用户利益冲突的偏见。去中心化学习提供了一种有吸引力的替代方案,允许用户通过直接相互交换模型更新来协作训练共享排序模型,无需任何中央权威。然而,在这种设置中,恶意节点可以发送投毒模型更新,降低诚实节点的排序质量。我们引入了RankGuard,一个去中心化OLTR框架,其中用户协作训练排序模型并直接与其他节点交换模型更新。RankGuard通过仔细评估传入模型与用户自己的私有点击历史(经位置偏差校正)来防御投毒攻击。仅当传入模型比当前本地模型更好地解释用户过去交互时,才进行聚合,这使得恶意节点极难构造出能通过此测试而又不真正帮助用户的更新。我们推导了RankGuard的理论收敛保证。据我们所知,这是去中心化OLTR算法的首次形式化收敛分析。我们使用四个标准基准和三个点击模型,针对四种投毒攻击(包括一种强大的自适应攻击)评估了RankGuard。在大多数设置中,RankGuard优于所有基线,同时效率比最接近的竞争者高出62倍。

英文摘要

In Online Learning to Rank (OLTR), ranking models are trained directly from live user interactions, but existing systems rely on a trusted central server to collect and process these interactions. This leaves operators free to introduce biases that conflict with user interests. Decentralized learning offers an attractive alternative, allowing users to collaboratively train a shared ranking model by exchanging model updates directly with one another, without any central authority. In such settings, however, malicious nodes can send poisoned model updates that degrade the ranking quality of honest nodes. We introduce RankGuard, a decentralized OLTR framework in which users collaboratively train ranking models and exchange model updates directly with other nodes. RankGuard defends against poisoning attacks by carefully evaluating incoming models against the user's own private click history, corrected for position bias. An incoming model is only aggregated if it better explains the user's past interactions than the current local model, making it fundamentally hard for malicious nodes to craft updates that pass this test without also genuinely helping the user. We derive a theoretical convergence guarantee of RankGuard. To the best of our knowledge, this is the first formal convergence analysis of a decentralized OLTR algorithm. We evaluate RankGuard against four poisoning attacks, including a powerful adaptive attack, using four standard benchmarks and three click models. RankGuard outperforms all baselines in most settings while being up to 62x more efficient than its closest competitors.

2606.12245 2026-06-11 cs.IR cs.AI 新提交

DiffCold: A Diffusion-based Generative Model for Cold-Start Item Recommendation

DiffCold: 基于扩散的生成模型用于冷启动物品推荐

Kangning Zhang, Yingjie Qin, Weinan Zhang, Yong Yu, Jianghao Lin

AI总结 针对冷启动物品推荐中的跷跷板困境,提出基于条件扩散的生成模型DiffCold,通过从内容重建温物品嵌入并保持流形结构,结合检索增强聚合器和模拟表示对齐模块,统一冷热物品表示。

详情
Comments
Accepted by ECML-PKDD 2026
AI中文摘要

冷启动物品推荐由于缺乏交互历史,在现实系统中仍然是一个持续的挑战。虽然先前的模型尝试利用物品内容特征来弥合这一差距,但它们普遍遭受\textbf{跷跷板困境}:提升冷物品的性能不可避免地会降低温物品的性能,反之亦然。我们发现这一困境源于根本的\textbf{分布差异}:温物品嵌入占据由丰富交互信号塑造的复杂“行为流形”,而冷物品嵌入则被限制在仅从辅助内容导出的“语义流形”上。现有方法通常强制在这些不一致空间之间进行刚性映射,导致模型为了适应冷物品而牺牲温表示的精度。为了解决这个问题,我们提出\textbf{DiffCold},一种基于扩散的生成模型,统一了温表示和冷表示。与GAN或VAE不同,DiffCold利用条件扩散从内容重建温物品嵌入,保留底层流形结构而不退化。我们进一步针对这一范式设计了两个特定模块:一个\textbf{检索增强聚合器},利用语义相似的温物品初始化生成,以绕过低效的噪声;以及一个\textbf{基于模拟的表示对齐}模块,通过对比学习强制生成嵌入与真实嵌入之间的分布一致性。在三个基准上的实验证实,DiffCold解决了跷跷板困境,在所有指标上持续优于最先进的方法。

英文摘要

Cold-start item recommendation remains a persistent challenge in real-world systems due to the absence of interaction histories. While prior models attempt to bridge this gap using item content features, they universally suffer from the \textbf{seesaw dilemma}: enhancing performance for cold items inevitably degrades performance for warm items, and vice versa. We identify that this dilemma stems from a fundamental \textbf{distributional disparity}: warm item embeddings occupy a complex ``behavioral manifold" shaped by rich interaction signals, whereas cold item embeddings are constrained to a ``semantic manifold" derived solely from auxiliary content. Existing methods often force a rigid mapping between these inconsistent spaces, causing the model to sacrifice the precision of warm representations to accommodate cold ones. To address this, we propose \textbf{DiffCold}, a diffusion-based generative model that unifies warm and cold representations. Unlike GANs or VAEs, DiffCold leverages conditional diffusion to reconstruct warm item embeddings from content, preserving the underlying manifold structure without degradation. We further tailor this paradigm with two specific designs: a \textbf{Retrieval-enhanced Aggregator} that initializes generation using semantically similar warm items to bypass inefficient noise, and a \textbf{Simulation-based Representation Alignment} module that enforces distribution consistency between generated and real embeddings via contrastive learning. Experiments on three benchmarks confirm that DiffCold resolves the seesaw dilemma, consistently outperforming state-of-the-art methods across all metrics.

2606.12243 2026-06-11 cs.CL cs.AI 新提交

VIA-SD: Verification via Intra-Model Routing for Speculative Decoding

VIA-SD:通过模型内路由进行推测解码的验证

Yuchen Xian, Yang He, Yunqiu Xu, Yi Yang

AI总结 提出VIA-SD多级验证框架,利用从完整验证器派生的精简验证器处理中等置信度令牌,减少大模型调用,在多个任务上实现10-20%加速。

详情
Comments
Accepted at the 43rd International Conference on Machine Learning (ICML 2026)
AI中文摘要

推测解码(SD)通过让轻量级草稿模型生成候选,由大型验证器并行验证,解决了LLM的高推理成本问题。现有的草稿-验证方法使用二元决策:接受或完全重新计算。然而,我们发现许多被拒绝的令牌可以通过从完整验证器通过模型内路由派生的精简子模型正确验证,而不是完整验证器。这促使我们使用精简验证器来处理需要中等验证资源的令牌,减少昂贵的大模型调用。我们提出了VIA-SD(通过模型内路由进行推测解码的验证),一种使用路由精简验证器的多级框架。草稿令牌分层处理:高置信度情况直接接受,中等置信度情况由精简验证器重新生成,不确定情况由完整模型验证。在四个代表性任务和多个模型家族中,VIA-SD将拒绝率降低了0.10-0.22,并在强SD基线基础上实现了10-20%的加速,同时相比非草稿解码实现了2.5-3倍的加速。此外,VIA-SD与现有SD框架兼容,无需修改其训练过程。我们的结果表明,多级SD是一种可扩展且高效的LLM推理通用范式。项目页面:此https URL

英文摘要

Speculative decoding (SD) addresses the high inference costs of LLMs by having lightweight drafters generate candidates for large verifiers to validate in parallel. Existing draft-verify methods use binary decisions: accept or fully recompute. Yet we find that many rejected tokens can be verified correctly by a slim submodel derived from the full verifier via intra-model routing, instead of the full verifier. This motivates our slim-verifier to handle tokens requiring moderate verification resources, reducing expensive large-model calls. We propose Verification via Intra-Model Routing for Speculative Decoding (VIA-SD), a multi-tier framework using a routed slim-verifier. Draft tokens are processed hierarchically: direct acceptance for high-confidence cases, slim-verifier regeneration for medium-confidence cases, and full-model verification for uncertain cases. Across four representative tasks and multiple model families, VIA-SD reduces rejection rates by 0.10-0.22 and delivers 10-20% speedups over strong SD baselines, while achieving 2.5-3x acceleration over non-drafting decoding. Moreover, VIA-SD is compatible with existing SD frameworks without modifying their training procedures. Our results suggest multi-tier SD as a general paradigm for scalable and efficient LLM inference. Project page: this https URL

2606.12240 2026-06-11 cs.LG cs.AI 新提交

Multi-Rate Mixture of Experts for Accelerating Liquid Neural Network Training

多速率专家混合模型加速液态神经网络训练

Shilong Zong, Almuatazbellah Boker, Hoda Eldardiry

发表机构 * Virginia Tech(弗吉尼亚理工大学)

AI总结 提出多速率专家混合框架,结合液态神经网络的多尺度动态与注意力机制,提升多变量时间序列建模的准确性和效率。

详情
AI中文摘要

多变量时间序列数据通常表现出复杂的时间依赖、不规则采样和跨多个时间尺度的异质动态,使得精确序列建模特别具有挑战性。传统的循环神经网络(RNN),如长短期记忆网络(LSTM),在离散时间下运行,可能难以有效捕捉连续和不规则的时间行为。液态神经网络(LNN)通过连续时间动态解决了其中一些限制,但标准LNN架构通常依赖单一动力系统,限制了其建模异质时间模式的能力。为了解决这些挑战,我们提出了一个基于液态神经网络的多速率专家混合(MR-MoE)框架。在所提出的架构中,多个基于LNN的专家以不同的时间尺度运行,使模型能够明确分离快速变化的动态和缓慢演变的时间趋势。门控网络进一步实现了基于输入条件的自适应专家专业化。此外,我们结合了特征级和时间注意力机制,以提高鲁棒性、可解释性和长程依赖建模能力。特征级注意力抑制噪声或无关变量,而时间注意力则选择性地关注信息丰富的历史状态。我们在一个复杂的多变量时间序列预测任务上评估了所提出的框架,并与强基线模型(包括LSTM、单体LNN和标准MoE模型)进行了比较。实验结果表明,所提出的MR-MoE框架在保持良好计算效率的同时,持续实现了改进的AUROC和AUPRC性能。这些结果突显了结合连续时间动态、多尺度专家分解和自适应注意力机制对时间序列建模的有效性。

英文摘要

Multivariate time-series data often exhibit complex temporal dependencies, irregular sampling, and heterogeneous dynamics across multiple time scales, making accurate sequence modeling particularly challenging. Traditional recurrent neural networks (RNNs), such as Long Short-Term Memory (LSTM) networks, operate in discrete time and may struggle to effectively capture continuous and irregular temporal behaviors. Liquid Neural Networks (LNNs) address some of these limitations through continuous-time dynamics, but standard LNN architectures typically rely on a single dynamical system, limiting their ability to model heterogeneous temporal patterns. To address these challenges, we propose a Multi-Rate Mixture-of-Experts (MR-MoE) framework built on top of Liquid Neural Networks. In the proposed architecture, multiple LNN-based experts operate at distinct time scales, enabling the model to explicitly separate fast-changing dynamics from slow-evolving temporal trends. A gating network further enables adaptive expert specialization based on input conditions. In addition, we incorporate both feature-level and temporal attention mechanisms to improve robustness, interpretability, and long-range dependency modeling. Feature-level attention suppresses noisy or irrelevant variables, while temporal attention selectively focuses on informative historical states. We evaluate the proposed framework on a complex multivariate time-series prediction task and compare it against strong baselines, including LSTM, monolithic LNN, and standard MoE models. Experimental results demonstrate that the proposed MR-MoE framework consistently achieves improved AUROC and AUPRC performance while maintaining favorable computational efficiency. These results highlight the effectiveness of combining continuous-time dynamics, multi-scale expert decomposition, and adaptive attention mechanisms for time-series modeling.

2606.12236 2026-06-11 cs.RO cs.CV 新提交

DrivingAgent: Design and Scheduling Agents for Autonomous Driving Systems

DrivingAgent: 自动驾驶系统的设计与调度智能体

Zhongyu Xia, Wenhao Chen, Yongtao Wang, Ming-Hsuan Yang

发表机构 * Wangxuan Institute of Computer Technology, Peking University(北京大学王选计算机技术研究所) University of California, Merced(加州大学默塞德分校)

AI总结 提出DrivingAgent框架,通过自动化模块开发(设计阶段)和强化学习训练的轻量级LLM实时调度(调度阶段),解决自动驾驶系统集成新模型和满足实时约束的挑战,在nuScenes和Bench2Drive上取得更优速度-精度权衡。

详情
AI中文摘要

许多自动驾驶系统越来越多地整合基础模型以提高泛化能力并处理长尾场景。然而,这一趋势带来了两个关键挑战:(i)设计和集成新模型的手动且劳动密集型过程,以及(ii)缺乏智能、动态的调度机制以满足严格的实时约束。虽然基于大语言模型(LLM)的智能体为自动化提供了有前景的途径,但现有框架并不适合自动驾驶。具体来说,它们未能区分系统设计和实时调度的根本不同需求,将模块视为不透明的黑盒,并且并非为持续运行而设计。为了解决这些局限性,我们提出了DrivingAgent,这是一个针对自动驾驶系统设计和调度双重挑战的新型智能体框架。在设计阶段,DrivingAgent通过解释系统架构、生成代码以及通过超网络训练验证模块来自动化模块开发。在调度阶段,它采用一个通过强化学习训练的轻量级LLM来实时动态编排系统模块,并由一个集成长期存储与带时间戳短期上下文的结构化记忆支持。实验结果表明,DrivingAgent在nuScenes和Bench2Drive基准测试上实现了更优的速度-精度权衡。

英文摘要

Many autonomous driving systems are increasingly incorporating foundation models to improve generalization and handle long-tail scenarios. However, this trend introduces two key challenges: (i) the manual and labor-intensive process of designing and integrating new models, and (ii) the lack of intelligent, dynamic scheduling mechanisms to meet strict real-time constraints. While Large Language Model (LLM)-based agents offer a promising avenue for automation, existing frameworks are ill-suited for autonomous driving. Specifically, they fail to distinguish between the fundamentally different requirements of system design and real-time scheduling, treat modules as opaque black boxes, and are not designed for continuous operation. To address these limitations, we propose DrivingAgent, a novel agent framework tailored to the dual challenges of autonomous driving system design and scheduling. In the design phase, DrivingAgent automates module development by interpreting system architecture, generating code, and validating modules via super-network training. In the scheduling phase, it employs a lightweight LLM trained with reinforcement learning to dynamically orchestrate system modules in real time, supported by a structured memory that integrates long-term storage with timestamped short-term context. Experimental results demonstrate that DrivingAgent achieves a superior speed--accuracy trade-off on both the nuScenes and Bench2Drive benchmarks.

2606.12235 2026-06-11 cs.AR 新提交

BenDi: An Energy-Efficient Quasi-Stochastic Systolic Architecture for Edge Bioelectronics

BenDi: 一种用于边缘生物电子学的节能准随机脉动架构

Bochen Ye, Yihan Pan, Shady Agwa, Themis Prodromakis

AI总结 提出BenDi架构,通过低电压、准随机乘法、脉动数据流和硬件感知量化,在边缘设备上高效运行CNN,相比二进制权重固定脉动架构,面积效率提升3.35倍,能效提升5倍,精度损失仅1%-3.3%。

详情
Comments
Accepted for presentation as a short paper at International Conference on Application-specific Systems, Architectures and Processors (ASAP 2026)
AI中文摘要

对生物医学信号(如心电图)的连续长期监测和诊断有助于减轻对公共健康日益增长的威胁。人工智能模型(如卷积神经网络)能够对相关疾病进行准确监测和分类;然而,它们需要的计算资源超出了传统AI硬件通常所能提供的,尤其是在资源受限的边缘环境中。在这项工作中,我们提出了BenDi,一种用于边缘生物电子系统的节能准随机脉动架构。BenDi利用从电路到软件量化的多个层次的能量和功率优化,包括低供电电压、用于准随机乘法的Ben-t-Pyramid数据格式、DiP脉动数据流以及硬件感知量化,以在有限的硬件预算下在边缘设备上高精度地处理CNN。使用商业22nm技术的硬件实现结果表明,在0.5V电压和100MHz频率下,BenDi架构相比最先进的基于二进制的权重固定脉动架构,面积缩小了3.35倍,能效提高了5倍。对于生物电子边缘系统,BenDi在能效和面积效率上分别比同类架构提高了一个数量级。这种显著的改进是以在MIT-BIH和Apnea-ECG基准测试上分别损失1%至3.3%的精度为代价的,与使用32位浮点格式的传统计算相比。

英文摘要

Continuous long-term monitoring and diagnosis of biomedical signals, such as electrocardiograms (ECGs), can help mitigate an increasing threat to public health. Artificial Intelligence (AI) models, such as Convolutional Neural Networks (CNNs), provide accurate monitoring and classification for relevant diseases; however, they require more computational resources than conventional AI hardware can typically afford, especially for a resource-constrained environment on the edge. In this work, we present BenDi, an energy-efficient quasi-stochastic systolic architecture for bioelectronic systems on the edge. BenDi leverages multiple levels of energy and power optimization, ranging from circuits to software quantization, including low supply voltage, the \underline{Ben}t-Pyramid data format for quasi-stochastic multiplication, the \underline{Di}P systolic dataflow, and hardware-aware quantization, to handle CNNs with high accuracy on the edge within limited hardware budgets. The hardware implementation results, using a commercial 22nm technology, show that BenDi architecture, at 0.5 Voltage and 100 MHz, offers 3.35x smaller area and 5x higher energy efficiency, compared to state-of-the-art binary-based weight-stationary systolic architectures. Regarding Bioelectronic edge systems, BenDi achieves an order-of-magnitude improvement in energy efficiency and another order-of-magnitude improvement in area efficiency, compared to its counterparts. This significant improvement comes at the cost of 1\% to 3.3\% accuracy loss on the MIT-BIH and Apnea-ECG benchmarks, respectively, compared with conventional computing using the 32-bit floating-point format.

2606.12234 2026-06-11 cs.CL 新提交

On The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic Study

论LLM条件控制中的效果-流畅性权衡:一项系统性研究

Iuri Macocco, Pau Rodríguez, Arno Blaas, Luca Zappella, Marco Baroni, Xavier Suau

发表机构 * Universitat Pompeu Fabra(庞培法布拉大学) Apple(苹果公司) ICREA(加泰罗尼亚研究与高级研究所)

AI总结 系统研究LLM条件控制方法在注入和移除目标概念时的效果与流畅性权衡,发现高效引导方法常以牺牲流畅性为代价,且激活引导方法在指令调优模型上效果较差。

详情
Comments
8 pages, 2 figure
AI中文摘要

控制大型语言模型(LLM)的输出是其可靠部署的核心挑战,然而对所涉及权衡的清晰理解仍然难以捉摸。当前的条件控制方法通常在评估时狭隘地关注其注入或移除目标概念的有效性,而忽略了生成质量。我们系统性地研究了注入和移除场景中的一系列条件控制方法。我们发现,高效的引导方法通常以流畅性的大幅损失为代价来实现条件控制。此外,我们识别出一个关键但先前被忽视的与训练范式的交互:激活引导方法在指令调优模型上的效果远不如在基础模型上。另一方面,简单的提示和全面的监督微调是概念注入的可行选择,但在概念移除方面效果不佳。最后,廉价计算的文本指标与昂贵的LLM作为评判者的评分高度相关,并为条件控制方法的行为提供了见解。

英文摘要

Controlling the output of Large Language Models (LLMs) is a central challenge for their reliable deployment, yet a clear understanding of the involved trade-offs remains elusive. Current approaches to conditioning are often evaluated with a narrow focus on their effectiveness at injecting or removing a target concept, neglecting generation quality. We systematically investigate a range of conditioning methods in both injection and removal scenarios. We find that efficient steering methods frequently achieve conditioning at a steep cost to fluency. Furthermore, we identify a critical yet previously overlooked interaction with the training paradigm: activation steering methods are far less effective on instruction-tuned models than on their base counterparts. Simple prompting and full-fledged supervised fine-tuning, on the other hand, are viable options for concept injection, but are not as good at concept removal. Finally, cheaply computed textual metrics highly correlate to costly LLM-as-judge scores, and provide insights on the behavior of conditioning methods.

2606.12232 2026-06-11 cs.LG 新提交

Re-evaluating Confidence Remasking in Masked Diffusion Language Models

重新评估掩蔽扩散语言模型中的置信度重新掩蔽

Stipe Frkovic, Metod Jazbec, Dan Zhang, Christian A. Naesseth, Ilija Bogunovic, Eric Nalisnick

发表机构 * UvA-Bosch Delta Lab, University of Amsterdam(阿姆斯特丹大学UvA-Bosch Delta实验室) Bosch Center for AI(博世人工智能中心) University of Basel(巴塞尔大学) Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文重新评估了掩蔽扩散语言模型中一种无需训练的后验置信度重新掩蔽方法WINO,发现在标准解码设置下其收益甚微,且会加剧多样性坍塌问题。

详情
AI中文摘要

掩蔽扩散语言模型(dLLMs)最近已成为自回归语言模型的有竞争力的替代方案,其通过并行令牌生成实现更快的推理。然而,掩蔽公式的一个显著限制是,一旦令牌被解除掩蔽,就无法再修改,这使得dLLMs容易受到早期采样错误的影响。为了解决这个问题,越来越多的研究试图扩展掩蔽dLLMs,使其具有自我纠正(重新掩蔽)能力。其中一类有吸引力的方法以无需训练、事后方式基于令牌置信度实现,早期报告的结果令人鼓舞。在这项工作中,我们重新审视了代表性事后重新掩蔽方法WINO [Hong et al., 2026]的实证评估,发现在标准解码设置(较短的块长度)下,它相比于仅基于置信度的解除掩蔽 [Wu et al., 2025] 几乎没有带来好处。将评估扩展到非贪婪解码,我们发现虽然基于置信度的重新掩蔽可以在一定程度上减轻由增加随机性引入的错误,但它也加剧了先前报道的基于置信度的解除掩蔽导致的多样性坍塌。总体而言,我们的结果表明,事后基于置信度的重新掩蔽的好处高度依赖于设置,这凸显了需要更全面的评估框架。

英文摘要

Masked diffusion language models (dLLMs) have recently emerged as a competitive alternative to autoregressive language models, with the promise of faster inference via parallel token generation. A notable limitation of the masked formulation, however, is that once a token has been unmasked it can no longer be revised, leaving dLLMs vulnerable to early sampling mistakes. To address this, a growing body of work has sought to extend masked dLLMs with self-correcting (remasking) capabilities. One appealing subset of these methods does so in a training-free, post-hoc manner based on token confidences, with encouraging early reported results. In this work, we revisit the empirical evaluation of a representative post-hoc remasking method, WINO [Hong et al., 2026], and find that under standard decoding settings (shorter block lengths) it brings little-to-no benefit over confidence-based unmasking alone [Wu et al., 2025]. Extending the evaluation to non-greedy decoding, we find that while confidence-based remasking can mitigate errors introduced by increased stochasticity to some extent, it also exacerbates the diversity collapse previously reported for confidence-based unmasking. Overall, our results show that the benefits of post-hoc confidence-based remasking are highly setting-dependent, underscoring the need for a more comprehensive evaluation framework.

2606.12231 2026-06-11 cs.SE cs.AI 新提交

Rule Taxonomy and Evolution in AI IDEs: A Mining and Survey Study

AI IDE中的规则分类与演化:挖掘与调查研究

Guangzong Cai, Ruiyin Li, Peng Liang, Zengyang Li, Mojtaba Shahin

AI总结 通过挖掘83个开源项目中的7310条规则和99份从业者调查,建立了包含5个主类和25个子类的规则分类法,发现开发者重视架构约束但实际配置多为低级工作流和代码格式规则,规则演化主要由建设性上下文扩展和丰富驱动,且更新规则可使工件合规率平均提升22.99%。

详情
Comments
52 pages, 21 images, 8 tables, Manuscript submitted to a Journal (2026)
AI中文摘要

AI驱动的集成开发环境(AI IDE)的采用引入了“规则”作为一种新颖的软件工件,允许开发者将项目特定的约束和架构指导原则持久地注入到大语言模型(LLM)的上下文中。尽管这些规则在使AI行为与开发者意图对齐方面发挥作用,但它们的分类、演化及实际影响仍 largely unexplored。为填补这一空白,我们对AI IDE规则进行了混合方法实证研究。通过挖掘83个开源项目并提取7,310条规则,我们建立了一个包含5个主类和25个子类的全面分类法。随后,我们将这些工件与99名从业者的调查反馈进行三角验证。我们的分析发现开发者优先级与实际配置之间存在反差:虽然从业者认为架构约束非常重要,但仓库中的规则文件主要由低级工作流和代码格式约束组成。此外,我们对1,540个规则演化事件的分析表明,规则更新频繁。仓库数据进一步表明,规则演化主要由建设性上下文扩展(29.17%)和丰富(26.59%)驱动。相比之下,受访开发者报告修改规则主要是为了纠正AI错误(77.78%),通常通过添加新的负面约束而非编辑现有约束。最后,对160个规则演化事件的工件合规性评估显示,更新规则显著提高了软件工件的合规性,更新后平均工件合规率从49.14%提升至72.13%,增加了22.99%。我们的研究提供了实证见解,可帮助开发者优化提示策略,并指导工具构建者为AI IDE设计自动冲突检测和上下文管理机制。

英文摘要

The adoption of AI-powered Integrated Development Environments (AI IDEs) has introduced "Rules" as a novel software artifact, allowing developers to persistently inject project-specific constraints and architectural guidelines into the context of Large Language Models (LLMs). Despite their role in aligning AI behavior with developer intent, the taxonomy, evolution, and practical impact of these rules remain largely unexplored. To bridge this gap, we conducted a mixed-methods empirical study on AI IDE rules. By mining 83 open-source projects and extracting 7,310 rules, we established a comprehensive taxonomy comprising 5 primary and 25 secondary categories. We then triangulated these artifacts with survey responses from 99 practitioners. Our analysis identified a contrast between developer priorities and actual configurations: while practitioners rate architectural constraints as highly important, rule files in repositories primarily consist of low-level workflow and code formatting constraints. Furthermore, our analysis of 1,540 rule evolution events revealed that rules are updated frequently. Repository data further indicate that rule evolution is primarily driven by constructive context expansions (29.17%) and enrichments (26.59%). In contrast, surveyed developers reported modifying rules primarily to correct AI errors (77.78%), typically by adding new negative constraints rather than editing existing ones. Finally, an artifact compliance assessment of 160 rule evolution events revealed that updating rules significantly improves the adherence of software artifacts, with the average artifact compliance rate increasing by 22.99% (from 49.14% to 72.13%) following an update. Our study provides empirical insights that can help developers optimize prompting strategies and guide tool builders in designing automated conflict-detection and context-management mechanisms for AI IDEs.

2606.12226 2026-06-11 cs.CV eess.IV 新提交

An Electric Potential-Augmented Benchmark Dataset for Physics-Guided Image Reconstruction of Electrical Capacitance Tomography

一种电势增强的基准数据集,用于电容层析成像的物理引导图像重建

Xinqi Zhang, Qiming Ma, Lihui Peng

发表机构 * Department of Automation, Tsinghua University(清华大学自动化系)

AI总结 针对电容层析成像(ECT)数据驱动方法忽略电势场的问题,提出一个包含电势图的基准数据集,通过COMSOL-MATLAB管道生成20,000个样本,并验证其提升建模精度和鲁棒性。

详情
AI中文摘要

虽然深度学习显著推进了电容层析成像(ECT)的图像重建,但大多数数据驱动方法直接映射电容和介电常数分布,将传感器视为黑箱。这忽略了电势场——控制非线性和病态“软场”效应的基本物理联系。为解决此问题,我们提出一个电势增强的ECT基准数据集,旨在将ECT背后的潜在物理显式集成到学习过程中。通过COMSOL-MATLAB管道为八电极传感器生成示例,数据集包含20,000个随机样本,涵盖四种典型流型。关键的是,除了传统的电容向量和以图像形式描绘的介电常数分布外,每个样本还保留了八个激励方向的全场电势图。除了数据发布,我们还提供了ECT正问题和逆问题的说明性评估协议。通过在分布内(IID)和分布外(OOD)场景下的全面测试,我们系统地展示了包含电势图如何增强建模精度和鲁棒性。从根本上说,潜在场信息的显式包含显著降低了将物理定律集成到ECT建模中的障碍,从而为未来ECT图像重建的物理引导机器学习建立了标准化基础。

英文摘要

While deep learning has significantly advanced image reconstruction of Electrical Capacitance Tomography (ECT), most data-driven methods map directly between capacitance and permittivity distribution, treating the sensor as a black box. This overlooks the electric potential field -- the fundamental physical link governing the nonlinear and ill-posed ``soft-field'' effect. To address this, we propose an electric potential-augmented ECT benchmark dataset designed to explicitly integrate latent physics behind ECT into the learning process. Generated via a COMSOL-MATLAB pipeline for an eight-electrode sensor as an example, the dataset comprises 20,000 randomized samples across four typical flow patterns. Crucially, alongside the conventional capacitance vectors and permittivity distributions depicted as images, each sample preserves eight excitation-wise full-field potential maps. Beyond data release, we provide illustrative evaluation protocols for both forward and inverse problems of ECT. Through comprehensive testing on both in-distribution (IID) and out-of-distribution (OOD) scenarios, we systematically demonstrate how the inclusion of electric potential maps enhances modeling accuracy and robustness. Fundamentally, the explicit inclusion of latent field information significantly lowers the barrier to integrating physical laws into ECT modeling, thereby establishing a standardized foundation for future physics-guided machine learning of ECT image reconstruction.