2605.27466 2026-05-28 cs.MA cs.AI cs.LG stat.ML

AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems

AgensFlow：多智能体系统的协调策略基础

Nicole Koenigstein

AI总结提出AgensFlow框架，将多智能体协调视为在线策略学习问题，通过可学习路由优化协调流程，在分布式系统事件和安全咨询任务上验证了其优于固定管道基线。

Comments 7 pages, 4 figures, 4 tables. Code and reproducible evaluations available at: https://github.com/Nicolepcx/AgensFlow

详情

AI中文摘要

基于大语言模型（LLM）构建的多智能体系统需要许多难以先验固定的协调选择：调用哪个技能协议、哪个智能体角色应执行子任务、每个角色绑定哪个模型、角色之间如何交互、何时使用检索或验证，以及何时完全省略某个步骤。这些选择与任务机制和操作约束相互影响，因此静态管道和一次性模型比较只能提供设计空间的有限视角。本文介绍AgensFlow，一个开源框架，将多智能体协调视为部分可观测下的在线策略学习问题。该框架使协调决策可观测且可从重复轨迹中学习，而不是将技能、角色、模型、拓扑和评估选择视为固定的管道设计。AgensFlow在两个语料库上进行了评估：分布式系统事件任务和安全咨询任务。评估展示了三个主要结果：在协调密集型任务上，学习路由比固定管道基线达到更高质量的操作点；skip:X将拓扑压缩隔离为基础的有意义部分；热启动策略图可以在保持平台质量的同时减少探索成本。总体而言，结果支持学习型可审计路由可以改善静态布线下的协调密集型多智能体工作流。

英文摘要

Multi-agent systems built on large language models (LLMs) require many coordination choices that are difficult to fix a priori: which skill protocol to invoke, which agent role should perform a subtask, which model to bind to each role, how roles should interact, when to use retrieval or verification, and when to omit a step entirely. These choices interact with task regime and operational constraints, so static pipelines and one-off model comparisons provide only a limited view of the design space. This paper introduces AgensFlow, an open-source framework that treats multi-agent coordination as an online policy-learning problem under partial observability. The framework makes coordination decisions observable and learnable from repeated trajectories, rather than treating skill, role, model, topology, and evaluation choices as fixed pipeline design. AgensFlow is evaluated on two corpora: distributed-systems incident tasks and security-advisory tasks. The evaluation shows three main results: learned routing reaches a higher-quality operating point than a fixed pipeline baseline on coordination-heavy classes; skip:X isolates topology compression as a meaningful part of the substrate; and warm-started policy graphs can reduce exploration cost while preserving plateau quality. Overall, the results support that learned, auditable routing can improve coordination-heavy multi-agent workflows over static wiring.

URL PDF HTML ☆

赞 0 踩 0

2605.27463 2026-05-28 stat.ME cs.AI stat.AP

When prompt perturbations break your A/B test: A valid statistical test for generative surveying

当提示扰动破坏你的A/B测试：一种用于生成式调查的有效统计检验

Hayden Helm, Carey Priebe

AI总结针对生成式调查中LLM对提示设计敏感的问题，提出一种置换检验方法，在包含扰动结构的统计模型下保持有效性，并给出预算分配建议。

详情

AI中文摘要

生成式调查——利用基于LLM的角色集合对消息提供反馈——已成为传统市场研究的廉价且可扩展的替代方案。然而，LLM对提示设计中的微小变化很敏感，从生成式调查中得出的结论可能依赖于任意的措辞选择。控制这种敏感性需要在分析中包含语义等价的扰动。在本文中，我们表明，在包含现实扰动结构的生成式调查统计模型下，标准假设检验（包括符号检验和Wilcoxon符号秩检验）是无效的。我们提出了一种在该模型下有效的置换检验，并正式刻画了标准检验失效的条件。将我们的框架应用于一个简单的生成式调查问题，我们估计了相关参数，刻画了置换检验在现实条件下的功效，并提供了关于在角色、扰动和重复之间分配预算的实用指导。最后，我们表明，即使在同一个模型家族内，估计效应的大小和方向都对模型选择敏感。

英文摘要

Generative surveying -- where collections of LLM-based personas provide feedback on messages -- has emerged as a cheap and scalable alternative to traditional market research. However, LLMs are sensitive to small variations in prompt design and conclusions drawn from generative surveys may depend on arbitrary phrasing choices. Controlling for this sensitivity requires including semantically equivalent perturbations in the analysis. In this paper, we show that standard hypothesis tests, including the sign test and Wilcoxon signed-rank test, are invalid under a statistical model for generative surveying that includes realistic perturbation structure. We propose a permutation test that is valid under this model and formally characterize the conditions under which standard tests fail. Applying our framework to a simple generative surveying problem, we estimate relevant parameters, characterize the power of the permutation test under realistic conditions, and provide practical guidance on budget allocation across personas, perturbations, and replicates. Finally, we show that both the magnitude and direction of the estimated effect are sensitive to the choice of model, even within the same model family.

URL PDF HTML ☆

赞 0 踩 0

2605.27450 2026-05-28 cs.IR cs.LG

Context Features Are Cheap: Rank-Aware Decomposition for Efficient Feature Interaction in Recommender Systems

上下文特征很廉价：用于推荐系统中高效特征交互的秩感知分解

Yevgeny Tkach

AI总结提出一种秩感知分解方法，通过将上下文相关计算从每个候选一次减少为每个请求一次，在不改变模型预测的情况下显著提升工业推荐系统的吞吐量。

详情

AI中文摘要

现代工业推荐系统使用深度排序模型对N个候选与相同的用户和上下文特征进行评分。标准实现在前向传播早期广播上下文特征，每个请求冗余计算N次上下文相关操作。我们提出了一种秩感知分解方法，适用于现代推荐架构中的主要交互机制——因子分解机（FM）成对乘积、深度交叉网络（DCNv2）交叉层、自注意力和全连接（FC）投影层——基于一个简单的代数原理：对秩划分输入的任何线性或双线性操作都允许精确的块分解，将上下文相关计算从每个候选一次移动到每个请求一次，与原始模型恒等等价。闭式分析和受控消融实验验证了节省量随上下文特征数量呈二次方增长。将该分解应用于生产级DLRM风格排序器，无需任何架构更改，在相同模型预测下，每个pod的吞吐量提高了87.5%（峰值pod数量减少47%）。恒等等价分解仅适用于交叉网络和自注意力的第一层，因为每一层在其输出中混合了秩。为了在深度上扩展节省量，我们进一步引入了rDCN，一种DCNv2的架构变体，它在深度上保持秩纪律，并在训练噪声内匹配DCNv2的精度，同时总FLOPs减少67%，并勾勒了自注意力的类似架构变体。

英文摘要

Modern industrial recommender systems use a deep ranking model to score N candidates against the same user and context features. Standard implementations broadcast context features early in the forward pass, redundantly computing context-only operations N times per request. We present a rank-aware decomposition applicable to the dominant interaction mechanisms in modern recommender architectures-Factorization Machine (FM) pairwise products, Deep Cross Network (DCNv2) cross layers, self-attention, and fully connected (FC) projection layers-built on a single algebraic principle: any linear or bilinear operation over a rank-partitioned input admits an exact block decomposition that moves context-only computation from once-per-candidate to once-per-request, identity-equivalent to the original model. Closed-form analysis and controlled ablation verify that savings scale quadratically with the number of context features. Applied to a production DLRM-style ranker without any architectural change, the decomposition increases per-pod throughput by 87.5% (a 47% reduction in peak pod count) at identical model predictions. The identity-equivalent decomposition applies only at the first layer of cross networks and self-attention, since each layer mixes ranks in its output. To extend savings across depth, we further introduce rDCN, an architectural variant of DCNv2 that maintains rank discipline across depth and matches DCNv2 accuracy within training noise at 67% fewer total FLOPs, and sketch an analogous architectural variant for self-attention.

URL PDF HTML ☆

赞 0 踩 0

2605.27449 2026-05-28 cs.IR cs.AI

Checking Fact with Better Retrieval: Dynamic Contrastive Learning for Evidence Retrieval

用更好的检索核查事实：用于证据检索的动态对比学习

Zhongtian Hua, Yi Luo, Meijia Yu, Yingjie Han

AI总结提出动态自适应对比学习方法DACLR，通过事件级特征提取、两阶段检索和动态对比损失优化，提升多模态证据检索的准确性。

详情

AI中文摘要

在多模态事实核查领域，从不同模态检索证据的准确性对下游声明验证过程有显著影响。现有的通用多模态检索方法通常基于语义构建，导致检索到的证据与声明相似但不相关。本文提出了一种用于证据检索的动态自适应对比学习方法（DACLR）来解决这些问题。DACLR首先使用多模态大语言模型（MLLM）将多模态证据和声明统一转换为文本模态，并在事件级别提取这些信息的特征。然后，通过召回-重排序的两阶段检索方法进行证据检索。DACLR通过优化对比损失和挖掘难负样本，增强了检索阶段模型的事件感知能力。具体而言，DACLR基于InfoNCE损失在语义和事件两个层次设计了三个损失函数，并对应设置了三组难负样本候选。模型根据批内样本的准确性监督信号动态调整比例，使模型在不遗忘语义检索能力的情况下，学习声明与正样本在事件层面的相关性。大量的对比和消融实验证明了DACLR及其内部优化方法的有效性。进一步的研究也证明了DACLR在多模态证据检索领域的优势。

英文摘要

In the field of multimodal fact checking, the accuracy of retrieving evidence from different modalities has a significant impact on the downstream claim verification process. Existing general multimodal retrieval methods are often constructed based on semantics, resulting in the retrieved evidence being similar but not relevant to the claim. This paper proposes a \textbf{D}ynamic \textbf{A}daptive \textbf{C}ontrastive \textbf{L}earning method for evidence \textbf{R}etrieval called DACLR to address these issues. DACLR first uses a Multimodal Large Language Model (MLLM) to uniformly convert multimodal evidence and claims into text modalities, and extracts the features of these information at event level. Then, it conducts evidence retrieval through a two-stage retrieval method of recall-rerank. DACLR enhances the model's event perception ability of the retrieval stage by optimizing the contrastive loss and mining hard negative samples. Specifically, DACLR designs three loss functions at two levels (semantic and event) based on the InfoNCE loss.Corresponding to these, three sets of hard negative sample candidates are set up. The model dynamically adjusts the ratio based on the accuracy supervision signal of intra-batch samples, allowing the model to learn the correlation between claims and positive samples at the event level without forgetting the semantic retrieval ability. Extensive comparison and ablation experiments demonstrates the effectiveness of DACLR and its internal optimization methods. Further research also prove the advantages of DACLR in the field of multimodal evidence retrieval.

URL PDF HTML ☆

赞 0 踩 0

2605.27445 2026-05-28 cs.IR cs.AI

RAGe: A Retrieval-Augmented Generation Evaluation Framework

RAGe：一种检索增强生成评估框架

Larissa Guder, João Pedro de Moura, Arthur Accorsi, Gustavo Losch do Amaral, Maurício Cecílio Magnaguagno, Felipe Meneguzzi, Marcio Sorraglia Pinho, Dalvan Griebler

AI总结提出模块化框架RAGe，通过资源遥测和组件推荐，评估检索增强生成应用在准确性、效率和可扩展性之间的权衡，支持领域特定数据集的最佳组件选择。

详情

AI中文摘要

城市交叉口异构移动体的可微分模型预测安全

Wenzhe Song, Hao Zhang

AI总结提出可微分模型预测安全（DMPS）框架，将模型预测控制的前瞻性嵌入数据驱动的端到端强化学习架构，通过可微分安全评价器实现精确在线安全校正，在高密度混合交通仿真中将碰撞率降至5.6%以下。

Comments 6 pages. Published in IEEE IARCE 2025

详情

DOI: 10.1109/IARCE68366.2025.11485680
Journal ref: 2025 IEEE 5th International Conference on Industrial Automation, Robotics and Control Engineering (IARCE), Chongqing, China, 2025, pp. 1-6

AI中文摘要

自动驾驶车辆和移动机器人在城市环境中的即将集成对未来的智能交通系统提出了严峻的安全挑战。本文解决了在无信号交叉口协调具有不同动力学的异构智能体的复杂问题。我们引入了一种新颖的框架，称为可微分模型预测安全（DMPS），它将模型预测控制的前瞻性嵌入到数据驱动的端到端强化学习架构中。DMPS智能体学习一个潜在动力学模型，以预测依赖于其动作的未来轨迹。然后，一个学习到的可微分安全评价器评估这些轨迹的风险。关键的是，通过利用通过整个展开预测模型的反向传播，智能体可以高效地计算未来安全性相对于当前动作的梯度，从而实现最小且精确的在线安全校正。集成到多智能体训练方案中，DMPS在高密度混合车辆-机器人交通仿真中几乎消除了碰撞，碰撞率低于5.6%，在不牺牲能量和交通效率的情况下展示了最先进的安全性。

英文摘要

The imminent integration of autonomous vehicles and mobile robots in urban settings presents a critical safety challenge for future intelligent transportation systems. This paper addresses the complex problem of coordinating heterogeneous agents with disparate dynamics at unregulated intersections. We introduce a novel framework, differentiable model predictive safety (DMPS), which embeds the foresight of model-predictive control into a data-driven, end-to-end reinforcement learning architecture. DMPS agents learn a latent dynamics model to predict future trajectories contingent on their actions. A learned, differentiable safety critic then evaluates the risk of these trajectories. Crucially, by leveraging backpropagation through the entire unrolled predictive model, agents can efficiently compute the gradient of future safety with respect to their current action, enabling a minimal and precise online safety correction. Integrated into a multi-agent training scheme, DMPS virtually eliminates collisions to less than 5.6% in high-density, mixed vehicle-robot traffic simulations, demonstrating state-of-the-art safety without compromising energy and traffic efficiency.

URL PDF HTML ☆

赞 0 踩 0

2605.27417 2026-05-28 quant-ph cs.AI cs.LG

利用循环发放神经元和可学习梯度推进脉冲神经网络的直接训练

Feifan Zhou, Xiang Wei, Yang Liu, Qiang Yu

AI总结提出一种包含循环发放神经元、逐时间步可学习代理梯度和正负平衡损失函数的直接训练算法，以提升脉冲神经网络的信息表示能力和梯度传播精度，在多个数据集上取得竞争性性能并泛化至Transformer架构。

详情

AI中文摘要

脉冲神经网络（SNN）因其节能特性而备受关注，但与人工神经网络（ANN）相比仍存在显著性能差距。这一差距源于至少两个关键限制：首先，传统脉冲神经元的信息表示能力有限，未能充分利用膜电位的丰富动态；其次，固定代理梯度（SG）函数在时间步上导致梯度传播不精确，阻碍了有效的直接训练。为了解决这两个挑战，我们提出了一种新的直接训练算法，包含三个核心创新：第一，一种循环发放脉冲神经元模型，通过更有效地利用膜电位来增强信息表示能力；第二，一种逐时间步可学习的代理梯度函数，能够在反向传播过程中实现精确的梯度估计；第三，一种正负平衡损失函数，以实现正负膜电位之间的平衡，进一步提升SNN性能。大量实验表明，我们的方法在多个数据集上取得了竞争性性能。我们的方法可以无缝泛化到先进的Transformer架构，始终优于现有方法。我们的工作强调了进一步利用SNN内在膜动力学以提升性能的有效性，从而为推进高性能脉冲神经架构开辟了新途径。

英文摘要

Spiking Neural Networks (SNNs) have emerged with promising energy-efficient property, yet a substantial performance gap persists compared to Artificial Neural Networks (ANNs). This gap stems from at least two key limitations: first, conventional spiking neurons offer limited information representation capacity, underutilizing the rich dynamics of membrane potentials; second, fixed surrogate gradient (SG) functions across time steps leads to imprecise gradient propagation, impeding effective direct training. To address these two challenges, we propose a new direct training algorithm with three core innovations: first, a circulate-firing spiking neuron model that enhances information representation capacity by leveraging membrane potentials more effectively; second, a time-step-wise learnable surrogate gradient function, enabling accurate gradient estimation during backpropagation; third, a positive-negative balanced loss function to achieve equilibrium between positive and negative membrane potentials and further boost SNN performance. Extensive experiments demonstrate that our methods achieve competitive performance across multiple datasets. Our methods can generalize seamlessly to advanced architectures of Transformer, consistently outperforming existing methods. Our work highlights the effectiveness of further harnessing intrinsic membrane dynamics of SNNs for performance improvement, and thus open a new avenue for advancing high-performance spiking neural architectures.

URL PDF HTML ☆

赞 0 踩 0

2605.27411 2026-05-28 cs.NE cs.LG

Genetic algorithm vs. gradient descent for training a neural network architecture dedicated to low data regimes in small medical datasets

遗传算法与梯度下降在针对小医学数据集低数据量场景的神经网络架构训练中的比较

Amine Boukhari, Boglarka Ecsedi, Laszlo Papp, Mathieu Hatt

AI总结针对DEBI-NN架构，比较遗传算法与梯度下降在分类任务中的性能，发现遗传算法在决策边界和分类准确率上显著优于梯度下降。

详情

AI中文摘要

目的/引言：距离编码生物形态信息神经网络（DEBI-NN）是一种最近提出的架构，其中连接权重由欧几里得空间中神经元之间的距离定义。与直接训练权重的经典神经网络相比，这种方法大幅减少了可训练参数的数量。DEBI-NN的训练过程基于遗传算法（GA），而非深度学习中最常用的优化算法梯度下降（GD）。我们旨在为DEBI-NN设计并实现一个GD学习器，并评估其与GA相比的性能。材料与方法：我们设计了一种针对DEBI-NN的空间反向传播方案，并在分类任务中比较了GD和GA，使用了合成非线性“双月”数据集、两个临床医学影像放射组学数据集和一个胎儿心宫缩图数据集，样本量从n=85到n=2126。每个优化器通过针对每个数据集调整的超参数搜索进行调优。结果：在所有实验中，GA始终产生更优的决策边界和分类性能（合成：100% vs 83%；DLBCL：83% vs 78%；HECKTOR：80% vs 67%；胎儿：81% vs 66%），而GD表现出不稳定性，未能完全捕捉DEBI-NN空间编码固有的非线性模式。神经元相互依赖导致的纠缠梯度限制了经典反向传播的有效性。结论：这些发现凸显了基于梯度的方法在具有高度相互依赖空间参数的架构中的根本局限性，并确认了进化策略在训练DEBI-NN中的适用性。

英文摘要

Aim/Introduction: Distance-encoding biomorphic-informational neural network (DEBI-NN) is a recently proposed architecture in which connection weights are defined by the distances between neurons positioned in a Euclidian space. This approach drastically reduces the number of trainable parameters compared to classical neural networks in which weights are directly trained. The training process for DEBI-NN is based on a genetic algorithm (GA), rather than gradient descent (GD) which remains the prevailing optimization algorithm in deep learning. We aim to design and implement a GD learner for DEBI-NN and assess its performance compared to GA. Materials and Methods: We designed a spatial backpropagation scheme tailored to DEBI-NN and carried out a comparison between GD and GA for classification tasks, using a synthetic non-linear "two-moons" dataset, two clinical medical imaging radiomic datasets and a fetal cardiotocography dataset with a sample sizes ranging from n=85 to n=2126. Each optimizer was tuned through targeted hyperparameter searches adapted to each dataset. Results: Across all experiments, GA consistently produced superior decision boundaries and classification performance (Synthetic: 100% vs 83%; DLBCL: 83% vs 78%; HECKTOR: 80% vs 67%; Fetal: 81% vs 66%), whereas GD exhibited instability and failed to fully capture the non-linear patterns inherent to DEBI-NN's spatial encoding. The entangled gradients resulting from neuron interdependencies limit the effectiveness of classical backpropagation. Conclusion: These findings highlight fundamental limitations of gradient-based methods in architectures with highly interdependent spatial parameters and confirm the suitability of evolutionary strategies for training DEBI-NN.

URL PDF HTML ☆

赞 0 踩 0

2605.27409 2026-05-28 cs.NE cs.AI cs.LG

STARS: Spike Tail-Aware Relational Synthesis for ANN-to-SNN Data-Free Knowledge Distillation

STARS: 面向ANN到SNN无数据知识蒸馏的尖峰尾部感知关系合成

Shuhan Ye, Yi Yu, Qixin Zhang, Hui Lu, Jiaming He, Qinggang Zhang, Li Shen, Xudong Jiang

AI总结提出STARS方法，通过关系一致性对齐和尾部感知正则化增强BN引导的合成数据，解决SNN学生网络在无数据知识蒸馏中约束不足的问题，在多个数据集上提升性能。

详情

AI中文摘要

SNN有望实现高能效和低延迟推理，但其性能仍落后于ANN。ANN到SNN的知识蒸馏有助于缩小这一差距，但在实际部署中原始训练数据通常不可用。现有的无数据知识蒸馏（DFKD）方法通过匹配教师侧先验（尤其是BN统计量）来合成替代数据，但这些面向ANN的约束主要正则化均值和方差，因此对于响应依赖于阈值穿越动态的SNN学生网络而言，约束不足。本文提出尖峰尾部感知关系合成（STARS），一种用于ANN到SNN DFKD的即插即用方法，通过两个互补目标增强标准BN引导合成：关系一致性对齐（保持教师和学生之间的跨样本关系一致性）和尾部感知正则化（通过软超越教师导出阈值来正则化阈值相关的尾部概率）。这些目标共同生成合成批次，这些批次在保持教师有效性的同时，对SNN学生网络更具信息性。在CIFAR-10、CIFAR-100和Tiny-ImageNet上的多个ANN-SNN对实验表明，我们的方法一致改进了传统DFKD基线，甚至超过了若干KD方法，在CIFAR-10上提升高达4.6%，在CIFAR-100上提升高达6.7%，突显了在面向SNN的DFKD中，用关系约束和尾部感知约束补充BN匹配的重要性。

英文摘要

SNNs promise energy-efficient and low-latency inference, but their performance still trails that of ANNs. ANN-to-SNN knowledge distillation helps narrow this gap, yet the original training data are often unavailable in practical deployment settings. Existing data-free knowledge distillation (DFKD) methods synthesize surrogate data by matching teacher-side priors, especially BN statistics, but these ANN-oriented constraints mainly regularize mean and variance and therefore remain under-constrained for SNN students whose responses depend on threshold-crossing dynamics. In this paper, we propose Spike Tail-Aware Relational Synthesis (STARS), a plug-and-play method for ANN-to-SNN DFKD that augments standard BN-guided synthesis with two complementary objectives: Relational Consistency Alignment, which preserves cross-sample relational consistency between teacher and student, and Tail-Aware Regularization, which regularizes threshold-relevant tail probabilities through soft exceedance over teacher-derived thresholds. Together, these objectives generate synthetic batches that remain teacher-valid while becoming more informative for SNN students. Experiments on CIFAR-10, CIFAR-100, and Tiny-ImageNet across multiple ANN-SNN pairs show that our method consistently improves conventional DFKD baselines and even surpasses several KD methods, with gains of up to 4.6\% on CIFAR-10 and 6.7\% on CIFAR-100, highlighting the importance of complementing BN matching with relational and tail-aware constraints in SNN-oriented DFKD.

URL PDF HTML ☆

赞 0 踩 0

2605.27408 2026-05-28 quant-ph cs.LG cs.NA math.NA

Neural Quantum Spectral Operator Learning for Solving Partial Differential Equations

神经量子谱算子学习求解偏微分方程

Chanyoung Kim, Myeonghwan Seong, Yujin Kim, Daniel K. Park, Youngjoon Hong

AI总结提出首个混合量子-经典无监督算子学习框架NVQLS，利用Legendre-Galerkin弱形式解决VQLS符号歧义并引入神经嵌入编码，在1D/2D参数化PDE上实现高精度求解。

Comments 31 pages (main 9 pages), 17 figures, 8 tables

详情

AI中文摘要

偏微分方程（PDE）是物理和工程系统建模的核心，但重复求解参数化PDE仍然计算成本高昂。算子学习能够实现快速代理推理，但通常需要由昂贵的高保真PDE求解器生成的大规模输入-输出配对数据集。无监督算子学习框架减轻了数据依赖性，但仍受计算瓶颈限制。为解决这一问题，我们提出了神经变分量子线性求解器（NVQLS），这是首个利用Legendre-Galerkin弱形式的混合量子-经典算子学习框架。我们关键性地解决了VQLS能量最小化中的符号歧义，防止了错误的解表示。此外，我们引入了神经嵌入，一种新颖的编码方案，将变化的强迫项和PDE系数映射到参数化量子电路表示中。这些结构创新在高效态制备方案下提供了理论计算复杂度优势，同时相比代表性经典基线实现了更优的精度。在1D和2D参数化PDE上，在不同边界条件下的验证表明，NVQLS能够同时处理变化输入，为量子增强算子学习提供了一种可扩展的无监督方法。

英文摘要

Partial differential equations (PDEs) are central to modeling physical and engineering systems, but repeatedly solving parametric PDEs remains computationally expensive. Operator learning enables fast surrogate inference, yet typically requires large input-output paired datasets generated by costly high-fidelity PDE solvers. Unsupervised operator learning frameworks alleviate data dependency but remain hindered by computational bottlenecks. To address this, we propose Neural Variational Quantum Linear Solver (NVQLS), the first hybrid quantum-classical operator learning framework leveraging the Legendre--Galerkin weak formulation. We critically resolve the sign ambiguity in VQLS energy minimization, preventing erroneous solution representations. Additionally, we introduce a neural embedding, a novel encoding scheme to map varying forcings and PDE coefficients into parameterized quantum circuit representations. These structural innovations provide theoretical computational complexity advantages under efficient state preparation schemes, while achieving superior accuracy compared to a representative classical baseline. Validations on 1D and 2D parametric PDEs under diverse boundary conditions demonstrate NVQLS's capability to simultaneously process varying inputs, offering a scalable unsupervised approach to quantum-enhanced operator learning.

URL PDF HTML ☆

赞 0 踩 0

2605.27407 2026-05-28 cs.NE cs.AI cs.LG

Benchmarking Fairness in Spiking Neural Networks: Data Bias, Spurious Features, and Hardware Effects

脉冲神经网络中的公平性基准测试：数据偏差、虚假特征和硬件效应

Hudi He, Fukun Wang, Zhe Wang, Xinyi Wang, Shuhan Ye, Jiarui Liu, Qing Qing, Ziqi Xu, Xikun Zhang, Renqiang Luo

AI总结本文首次提出脉冲神经网络公平性基准，通过引入人口统计覆盖缺口、虚假特征泄漏和部署环境不匹配三个现实维度，系统评估了12种先进SNN在资源约束下的公平性-性能权衡。

详情

AI中文摘要

评估脉冲神经网络（SNN）的公平性需要反映现实世界复杂性的严格基准，然而现有评估仍受限于肤浅的数据集多样性和理想化的硬件假设。本文首次引入SNN的系统性公平性基准，解决三个关键的现实维度：（1）训练数据中的人口统计覆盖缺口，（2）虚假特征泄漏（例如，肤色作为类别标签的代理），以及（3）部署环境不匹配（例如，具有受限脉冲编码的边缘设备）。我们的框架整合了四个跨人口统计数据集（带有受控偏差注入）和三个神经形态硬件模拟器（Loihi 2、SpiNNaker），从而能够在资源约束下隔离分析公平性-性能权衡。对12种最先进SNN的标准化评估揭示了显著差异：在偏差数据上训练的模型对代表性不足群体的假阳性率高出23%，而硬件限制（例如，降低的脉冲精度）在边缘部署中进一步将准确率差距放大至41%。关键的是，为云端SNN开发的偏差缓解策略在资源约束下通常会退化，这凸显了需要联合优化公平性和硬件效率的协同设计原则。通过连接算法公平性研究与神经形态工程，我们的基准为医疗和自主系统等社会关键应用中的可信SNN奠定了基础。我们的代码可在以下网址获取：https://anonymous.4open.science/r/SNN-Benchmarks-8017。

英文摘要

Evaluating fairness in Spiking Neural Networks (SNNs) demands rigorous benchmarks that reflect real-world complexities, yet existing assessments remain limited by superficial dataset diversity and idealized hardware assumptions. This work introduces the first systematic fairness benchmark for SNNs, addressing three critical dimensions of realism: (1) demographic coverage gaps in training data, (2) spurious feature leakage (e.g., skin tone as a proxy for class labels), and (3) deployment-environment mismatches (e.g., edge devices with constrained spike encoding). Our framework integrates four cross-demographic datasets with controlled bias injections and three neuromorphic hardware simulators (Loihi 2, SpiNNaker), enabling isolated analysis of fairness-performance trade-offs under resource constraints. Standardized evaluations of 12 state-of-the-art SNNs reveal stark disparities: models trained on biased data exhibit 23\% higher false positive rates for underrepresented groups, while hardware limitations (e.g., reduced spike precision) further amplify accuracy gaps by up to 41\% in edge deployments. Critically, bias mitigation strategies developed for cloud-based SNNs often degrade under resource constraints, highlighting the need for co-design principles that jointly optimize fairness and hardware efficiency. By bridging algorithmic fairness research with neuromorphic engineering, our benchmark provides a foundation for trustworthy SNNs in socially critical applications such as healthcare and autonomous systems. Our code is available at: https://anonymous.4open.science/r/SNN-Benchmarks-8017.

URL PDF HTML ☆

赞 0 踩 0