arXivDaily arXiv每日学术速递 周一至周五更新
2606.20151 2026-06-19 cs.NE cs.AI 新提交

Hybrid ANN-SNN Pipeline with Local Plasticity

混合ANN-SNN流水线与局部可塑性

Denis Larionov, Khairutin Shtanchaev, Mikhail Kiselev, Mikhail Korovin, Ivan Tugoy

AI总结 提出一种混合ANN-SNN流水线,利用预训练ANN的丰富嵌入实现高性能SNN,通过速率编码和局部学习规则训练,在64类ImageNet上达到99.09%准确率。

Comments 9 pages, 4 figues, source-code available

详情
AI中文摘要

本文提出了一种混合ANN-SNN流水线,有效利用预训练人工神经网络(ANN)的丰富嵌入来实现高性能脉冲神经网络(SNN)。该架构将预训练的EfficientNet编码器与CoLaNET脉冲分类器耦合。我们通过速率编码将编码器的激活转换为脉冲序列,并使用局部、生物启发的学习规则训练后续的SNN分类器,绕过了端到端的梯度传播。该方法在64类ImageNet基准测试中达到了99.09%的准确率,展现了与传统深度网络相当的性能。该工作为将强大的预训练编码器适应于下游脉冲神经网络任务提供了一种生物上合理且高效的框架。

英文摘要

This work proposes a hybrid ANN-SNN pipeline that effectively leverages the rich embeddings of pretrained artificial neural networks (ANNs) to enable high-performance spiking neural networks (SNNs). The architecture couples a pretrained EfficientNet encoder with a CoLaNET spiking classifier. We convert the encoder's activations into spike trains via rate-coding and train the subsequent SNN classifier using local, biologically inspired learning rules, bypassing end-to-end gradient propagation. This approach achieves 99.09% accuracy on a 64-class ImageNet benchmark, demonstrating performance on par with conventional deep networks. The work presents a biologically plausible and efficient framework for adapting powerful pretrained encoders to downstream spiking neural network tasks.

2606.19861 2026-06-19 cs.NE 新提交

Weight Adaptation for Improving Parallel Performance of Adaptive Stochastic Natural Gradient

权重自适应提升自适应随机自然梯度的并行性能

Yutaro Yamada, Kento Uchida, Shinichi Shirakawa

AI总结 提出WA-ASNG,通过梯度上升自适应更新权重参数,最大化自然梯度信号,在二进制优化问题中优于PBIL和ASNG,并有效处理强噪声。

Comments Accepted at EvoCOP 2026 (Part of EvoStar 2026)

详情
AI中文摘要

基于概率模型的进化算法在黑箱优化中很有前景。具体来说,自适应随机自然梯度(ASNG)自适应地更新其学习率(概率模型进化算法中的典型超参数),从而实现高效且鲁棒的优化。尽管权重参数是常见的超参数,但随着对耗时任务并行评估需求的增加,如何为更大的种群规模设置合适的权重仍不清楚。在本文中,我们提出了权重自适应ASNG(WA-ASNG),它将权重自适应机制融入ASNG。我们从自然梯度的累积中计算更新方向的估计信号。然后,为了最大化该信号,WA-ASNG通过优化上的梯度上升自适应地更新其权重参数。学习率自适应在满足预期目标值单调改进的充分条件方面发挥作用,而权重自适应机制旨在最大化这种改进。实验结果表明,在二进制优化问题中,种群规模从25到100的各种设置下,WA-ASNG优于PBIL和ASNG。此外,WA-ASNG在存在强噪声的情况下也能高效运行。我们的代码可在此https URL获取。

英文摘要

Probabilistic model-based evolutionary algorithms are promising for black-box optimization. Specifically, the adaptive stochastic natural gradient (ASNG) adaptively updates its learning rate, a typical hyperparameter in probabilistic model-based evolutionary algorithms, thereby realizing efficient and robust optimization. Although weight parameters are common hyperparameters, with the increasing demand for parallel evaluation of time-consuming tasks, it remains unclear how to set suitable weights for larger population sizes. In this paper, we propose Weight Adaptation ASNG (WA-ASNG), which incorporates a weight adaptation mechanism into ASNG. We calculated the estimated signal of the update direction from the accumulations of the natural gradient. Then, to maximize the signal, WA-ASNG adaptively updates its weight parameters by a gradient ascent over the optimization. While the learning rate adaptation plays a role in satisfying a sufficient condition for monotonic improvement of the expected objective value, the mechanism of weight adaptation is intended to maximize this improvement. The experimental results demonstrate that WA-ASNG outperforms PBIL and ASNG across various settings with population sizes ranging from 25 to 100 for binary optimization problems. Furthermore, WA-ASNG can perform efficiently in the presence of strong noise. Our code is available at https://github.com/shiralab/WA-ASNG .

2606.20208 2026-06-19 cs.AI cs.DB cs.NE 交叉投稿

Beyond Accuracy: Measuring Logical Compliance of Predictive Models

超越准确性:衡量预测模型的逻辑合规性

Guillaume Olivier Delplanque, Pierre Genevès, Nabil Layaïda, Zephirin Faure

AI总结 提出规则违反分数(RVS),一种独立于预测准确性的评估指标,用于量化预测模型对逻辑规则的遵守程度,并通过实验证明两个准确率相近的模型可能表现出截然不同的逻辑合规性。

详情
AI中文摘要

机器学习模型主要通过预测性能指标进行评估,如排序质量、预测误差或分类准确性。虽然这些指标有效量化了预测与真实值的匹配程度,但它们不评估模型输出是否尊重预定义的逻辑或领域特定约束。在医疗、金融和自主系统等高安全性应用中,逻辑一致性与预测准确性同样关键,但尚无标准指标捕捉这一维度。我们引入了规则违反分数(RVS),这是一种互补的评估指标,独立于预测准确性,量化预测模型对给定逻辑规则集的遵守程度。RVS 对硬规则(严格约束)和软规则(统计规律)区别对待,可在任何数据集和任何在关系词汇上表达的预测模型上进行评估,并可通过为 Horn 规则自动生成的 SQL 查询进行计算。除了评估模型,RVS 还可以评估训练数据集的逻辑一致性,并帮助识别定义不良的规则。我们在三个基准测试上评估了 RVS,涵盖知识图谱链接预测和关系回归,包括基于规则、基于嵌入和神经符号的预测模型。我们的结果表明,两个实现相当预测准确性的模型可能表现出显著不同的逻辑合规性,揭示了标准指标无法捕捉的模型行为差异。

英文摘要

Machine learning models are predominantly evaluated through predictive performance metrics such as ranking quality, prediction error, or classification accuracy. While these metrics effectively quantify how closely predictions match the ground truth, they do not assess whether model outputs respect predefined logical or domain-specific constraints. In high-stakes applications, including healthcare, finance, and autonomous systems, logical consistency can be as critical as predictive accuracy, yet no standard metric captures this dimension. We introduce the Rule Violation Score (RVS), a complementary evaluation metric that quantifies the extent to which a predictive model respects a given set of logical rules, independently of predictive accuracy. RVS treats hard rules (strict constraints) and soft rules (statistical regularities) differently, can be evaluated on any dataset and on any predictive model expressed over a relational vocabulary, and can be computed using SQL queries that are automatically generated for Horn rules. Beyond evaluating models, RVS can also evaluate the logical consistency of training datasets and help identify poorly defined rules. We evaluate RVS on three benchmarks covering knowledge graph link prediction and relational regression, including rule-based, embedding-based, and neuro-symbolic predictive models. Our results demonstrate that two models achieving comparable predictive accuracy can exhibit substantially different levels of logical compliance, revealing differences in model behavior that standard metrics fail to capture.

2606.20442 2026-06-19 cs.LG cs.NA cs.NE math.NA 交叉投稿

Evolutionary Two-Stage Hyperparameter Optimization Strategies for Physics-Informed Neural Networks

物理信息神经网络的进化两阶段超参数优化策略

Fedor Buzaev, Dmitry Efremenko, Egor Bugaev, Andrei Ermakov, Denis Derkach, Daria Pugacheva, Fedor Ratnikov

发表机构 * HSE University(高等经济大学) AXXX

AI总结 针对物理信息神经网络训练不稳定、超参数敏感的问题,提出基于进化算法的两阶段优化策略,先低保真筛选再全训练,在三个PDE问题上显著降低误差。

Comments Equal advising: Daria Pugacheva and Fedor Ratnikov. Accepted to the ICLR 2026 Workshop on AI and PDEs

详情
AI中文摘要

物理信息神经网络(PINNs)通过将物理定律嵌入神经网络训练来求解偏微分方程(PDE)。然而,由于物理信息损失的高度非凸和多项结构,其性能受到不稳定收敛、训练平台期以及对架构和优化超参数的强敏感性的影响。在这种情况下,外循环超参数搜索是一个在异构参数上的噪声黑盒优化问题,经典的局部或基于梯度的策略容易陷入次优区域。进化算法凭借其基于种群的探索能力和处理混合、不可微搜索空间的能力,为发现有前景的配置提供了更稳健的机制。我们提出并研究了一种基于进化算法的两阶段方法,该方法结合了PINNs训练的探索和利用部分,以在固定计算预算下提高解的精度和鲁棒性。在第一阶段,我们执行具有截断轮次的低保真训练运行,以快速筛选候选配置,将超参数选择视为黑盒外循环问题。在第二阶段,只有最有希望的候选者使用标准基于梯度的优化器进行完全训练以细化解。在三个流行问题(即平流方程、Klein-Gordon方程和Helmholtz方程)上评估,我们的方法一致优于标准训练,并在受限计算资源内实现了显著更低的平均误差。

英文摘要

Physics-Informed Neural Networks (PINNs) solve Partial Differential Equations (PDEs) by embedding physical laws into neural network training. However, their performance suffers from unstable convergence, training plateaus, and strong sensitivity to architectural and optimization hyperparameters due to the highly non-convex and multi-term structure of the physics-informed loss. In this setting, the outer-loop hyperparameter search is a noisy and black-box optimization problem over heterogeneous parameters, where classical local or gradient-based strategies are easily trapped in suboptimal regions. Evolutionary algorithms, with their population-based exploration and ability to handle mixed, non-differentiable search spaces, provide a more robust mechanism for discovering promising configurations. We propose and investigate a two-stage approach based on evolutionary algorithms that combines exploration and exploitation parts of PINNs training to improve solution accuracy and robustness under fixed computational budgets. In the first stage, we perform low-fidelity training runs with truncated epochs to rapidly screen candidate configurations, treating hyperparameter selection as a black-box outer-loop problem. In the second stage, only the most promising candidates are fully trained with standard gradient-based optimizers to refine the solution. Evaluated on three popular problems, namely Advection, Klein-Gordon and Helmholtz equations, our method consistently outperforms standard training and achieves significantly lower mean error within constrained computational resources.

2312.15974 2026-06-19 cs.NE 版本更新

Analysing Rescaling, Discretisation, and Linearisation in RNNs for Neural System Modelling

Mariano Caruso, Cecilia Jarne

Comments 2 Figures, 15 pages

详情
英文摘要

Recurrent Neural Networks (RNNs) are widely used to model neural activity in Computational Neuroscience. Here, we explore the mathematical foundations of three fundamental procedures that can be implemented: temporal rescaling, discretisation, and linearisation. These techniques provide crucial tools for characterising the behaviour of RNNs, offering insights into their temporal dynamics, facilitating practical computational implementation, and allowing for linear approximations for analysis. We discuss the flexible order in which these procedures can be applied, emphasising their importance in modelling and analysing RNNs for neuroscience and formally prove that these three operations commute pairwise. We also explicitly describe the conditions under which these procedures can be considered interchangeable. Our findings directly inform the design of biologically plausible $\mathtt{RNN}$ models for simulating neural dynamics observed in decision-making circuits and motor control, where temporal scaling and stability are critical for matching experimental recordings. {Furthermore, we show that this exact commutativity guarantees the structural preservation of the network's controllability, preventing the emergence of inaccessible state-spaces under numerical discretisation or temporal rescaling.

2502.19193 2026-06-19 cs.SI cs.AI cs.NE 版本更新

Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms

受监管社交媒体平台下的语言演化模拟:大语言模型与遗传算法的协同方法

Jinyu Cai, Yusei Ishimizu, Mingyue Zhang, Munan Li, Jialong Li, Kenji Tei

AI总结 提出基于大语言模型的多智能体框架,结合遗传算法模拟用户语言策略在监管下的迭代演化,实验表明对话轮次增加可提升信息传递准确性和对话持续性。

Comments The manuscript has been accepted to IEEE Transactions on Computational Social Systems

详情
AI中文摘要

社交媒体平台经常实施限制性政策来调节用户内容,从而催生出创造性的规避语言策略。本文提出了一个基于大语言模型(LLMs)的多智能体框架,用于模拟在监管约束下语言策略的迭代演化。在该框架中,参与者智能体作为社交媒体用户,不断演化其语言表达,而监管智能体通过评估政策违规来模拟平台级别的监管。为了实现更逼真的模拟,我们采用了语言策略的双重设计(约束和表达)来区分冲突目标,并利用LLM驱动的遗传算法(GA)进行语言策略的选择、变异和交叉。该框架使用两种不同的场景进行评估:一个抽象的密码游戏和一个逼真的模拟非法宠物交易场景。实验结果表明,随着对话轮次的增加,不间断对话轮次的数量和信息传输的准确性都显著提高。此外,一项包含40名参与者的用户研究验证了生成对话和策略的现实相关性。消融研究也验证了GA的重要性,强调了其对长期适应性和整体结果改善的贡献。

英文摘要

Social media platforms frequently impose restrictive policies to moderate user content, prompting the emergence of creative evasion language strategies. This paper presents a multi-agent framework based on Large Language Models (LLMs) to simulate the iterative evolution of language strategies under regulatory constraints. In this framework, participant agents, as social media users, continuously evolve their language expression, while supervisory agents emulate platform-level regulation by assessing policy violations. To achieve a more faithful simulation, we employ a dual design of language strategies (constraint and expression) to differentiate conflicting goals and utilize an LLM-driven GA (Genetic Algorithm) for the selection, mutation, and crossover of language strategies. The framework is evaluated using two distinct scenarios: an abstract password game and a realistic simulated illegal pet trade scenario. Experimental results demonstrate that as the number of dialogue rounds increases, both the number of uninterrupted dialogue turns and the accuracy of information transmission improve significantly. Furthermore, a user study with 40 participants validates the real-world relevance of the generated dialogues and strategies. Moreover, ablation studies validate the importance of the GA, emphasizing its contribution to long-term adaptability and improved overall results.