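arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.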

2510.06719 2026-05-13 cs.CR cs.CL cs.LG

Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)

Junki Mori, Kazuya Kakizaki, Taiki Miyagawa, Jun Sakuma

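AI summary: This paper proposes DP-SynRAG, a privacy-preserving retrieval-augmented generation (RAG) framework that addresses the privacy risks conventional RAG systems face in sensitive domains. Unlike existing methods that apply differential privacy at query time, DP-SynRAG uses a large language model to generate a differentially private synthetic database, which avoids the privacy loss that accumulates when noise is injected repeatedly. Experiments show that, under a fixed privacy budget, it outperforms state-of-the-art private RAG systems, offering a scalable approach to privacy-preserving RAG.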

Comments Accepted to ACL 2026 Findings

2510.05497 2026-05-13 cs.DC cs.AI cs.AR cs.LG

Patterns behind Chaos: Forecasting Data Movement for Efficient Large-Scale MoE LLM Inference

Zhongkai Yu, Yue Guan, Zihao Yu, Chenyang Zhou, Zhengding Hu, Shuyi Pei, Yangwook Kang, Yufei Ding, Po-An Tsai

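AI summary: This paper studies data-movement patterns in large-scale mixture-of-experts (MoE) LLM inference, with the aim of improving execution efficiency on multi-unit systems. By profiling four large MoE models released in 2025 on 24,000 different tasks, it distills six key insights along the temporal and spatial dimensions and derives optimizations for future wafer-scale GPUs and for existing GPU systems, achieving 6.6x and 1.25x speedups respectively. The work is presented as the first systematic analysis and application study of data movement in large-scale MoE models.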

2509.21711 2026-05-13 stat.ML cs.LG

Multi-modal Bayesian Neural Network Surrogates with Conjugate Last-Layer Estimation

Ian Taylor, Juliane Mueller, Julie Bessac

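AI summary: This paper studies how to build efficient surrogate models from multi-modal data to support the modeling and analysis of expensive quantities of interest. The authors propose two multi-modal Bayesian neural network surrogates based on conjugate posterior distributions, with parameters estimated by variational inference, that are particularly suited to settings with partially missing observations. Experiments show higher predictive accuracy and better uncertainty quantification than unimodal models on both scalar and time-series data.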

Comments 47 pages including references and appendix, 9 figures

2509.00931 2026-05-13 stat.ML cs.LG

Semi-Supervised Bayesian GANs with Log-Signatures for Uncertainty-Aware Credit Card Fraud Detection

David Hirnschall

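AI summary: This paper proposes a deep generative framework based on semi-supervised Bayesian generative adversarial networks (GANs) for credit card fraud detection, framed as a time-series classification task. The method uses a conditional GAN for targeted data augmentation, Bayesian inference to quantify predictive uncertainty, and log-signatures to encode transaction histories robustly, together with a Wasserstein-distance-based loss that aligns generated samples with real unlabeled samples. Experiments on the BankSim dataset show that it outperforms existing benchmarks, with strong statistical and domain-specific performance across different label proportions.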

2508.20614 2026-05-13 stat.ML cs.LG stat.CO

Improving the Accuracy of Amortized Model Comparison with Self-Consistency

Šimon Kucharský, Aayush Mishra, Daniel Habermann, Stefan T. Radev, Paul-Christian Bürkner

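AI summary: This paper studies how to improve the accuracy of amortized Bayesian model comparison (BMC), particularly when the simulation models are misspecified. The authors propose a self-consistency loss and train the neural surrogates on unlabeled real data to make model comparison more robust under distribution shift. Experiments show that, in open-world settings, self-consistency training substantially improves the accuracy of BMC estimates, especially under severe model misspecification.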

Comments 22 pages, 14 figures. This version extends our initial results presented at Reliable ML from Unreliable Data Workshop at NeurIPS 2025. Previously, this version appeared as arXiv:2512.14308v2, which has now been withdrawn: the two versions share too much content to be considered separate papers

2508.02455 2026-05-13 cs.SE cs.AI cs.IR

TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs

Daniele Cipollone, Egor Bogomolov, Arie van Deursen, Maliheh Izadi

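AI summary: TreeRanker is a fast, model-agnostic ranking system for code suggestions that aims to improve the relevance of code completion in IDEs. It uses a language model to score static completion candidates by organizing them into a prefix tree and performing a single greedy decoding pass over it, which yields a precise ranking without complex tuning. It is efficient and general, works with existing code completion models, and offers a practical way to integrate language models into IDEs.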

2506.10664 2026-05-13 stat.ML cs.LG

Sequential Off-Policy Learning with Logarithmic Smoothing

Maxime Haddouche, Otmane Sakhi

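AI summary: This paper studies sequential off-policy learning, where policies are repeatedly updated and redeployed in a live system and learning should exploit all historical data. The authors propose a simple algorithm that combines the logarithmic smoothing estimator with online PAC-Bayesian tools, and prove that, under mild conditions, a refinement of logarithmic smoothing improves performance and accelerates convergence. The algorithm matches state-of-the-art off-policy methods in the batch setting and significantly outperforms existing methods under sequential updates, as confirmed by experiments.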

Comments AISTATS 2026

2506.07859 2026-05-13 quant-ph cs.LG

Deep reinforcement learning for near-deterministic preparation of cubic- and quartic-phase gates in photonic quantum computing

Amanuel Anteneh, Léandre Brunel, Carlos González-Arciniegas, Olivier Pfister

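AI summary: This work uses deep reinforcement learning to achieve near-deterministic preparation of cubic- and quartic-phase gates in photonic quantum computing. A deep neural network trained to control a quantum-optical circuit generates cubic-phase states with an average success rate of 96%, using photon-number-resolving measurement as the only non-Gaussian resource. The study also shows that the same resource can generate quartic-phase gates directly, without a cubic-gate decomposition.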

2506.00294 2026-05-13 astro-ph.IM cs.CV

Applying Vision Transformers on Spectral Analysis of Astronomical Objects

Luis Felipe Strano Moraes, Ignacio Becker, Pavlos Protopapas, Guillermo Cabrera-Vives

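AI summary: This paper applies pre-trained Vision Transformers (ViTs) to astronomical spectral analysis by converting 1D spectra into 2D image representations, so that spatial self-attention can capture both local and global spectral features. The ViT is fine-tuned on millions of spectra from the SDSS and LAMOST surveys and performs well on tasks such as stellar classification and redshift estimation: its classification accuracy exceeds that of support vector machines and random forests, and its generalization across object types is comparable to AstroCLIP. The authors present this as the first application of ViTs to large-scale real spectroscopic data, without reliance on synthetic inputs.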

Comments 9 pages, 9 figures

2505.20754 2026-05-13 stat.ML cs.LG

Stationary MMD Points

Zonghao Chen, Toni Karvonen, Heishiro Kanagawa, François-Xavier Briol, Chris. J. Oates

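AI summary: This paper studies how to approximate a target probability distribution with a finite point set, selecting the points by minimizing the maximum mean discrepancy (MMD). Because the MMD objective is non-convex, a global minimizer is hard to find, so the authors study stationary points of the MMD, which can be computed accurately. The theory shows that, for integrands in the associated reproducing kernel Hilbert space, the cubature error of stationary MMD points converges faster than the MMD itself. Building on this, the authors propose MMD gradient flow as a practical way to compute stationary points and give a rigorous convergence analysis with error bounds.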
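To make the objective in this summary concrete, here is a minimal sketch of the squared MMD between a candidate point set and samples from a target. It is not the paper's method: the Gaussian kernel, the `bandwidth` parameter, the biased (V-statistic) estimator, and all function names are assumptions chosen for illustration.

```python
import math


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two points given as equal-length sequences."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd_squared(points, samples, bandwidth=1.0):
    """Biased (V-statistic) estimate of the squared MMD between two empirical distributions."""
    return (
        mean_kernel(points, points, bandwidth)
        - 2.0 * mean_kernel(points, samples, bandwidth)
        + mean_kernel(samples, samples, bandwidth)
    )


# Illustrative one-dimensional data (assumed, not from the paper).
near = [(0.0,), (1.0,)]
far = [(5.0,), (6.0,)]
same = mmd_squared(near, near)
different = mmd_squared(near, far)
```

In this sketch `same` is zero up to rounding, while `different` is clearly positive. Point-selection methods of the kind the summary describes move the points to drive a quantity like this down.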

2505.17907 2026-05-13 stat.ML cs.LG

Approximating Simple ReLU Networks based on Spectral Decomposition of Fisher Information

Ka Long Keith Ho, Yoshinari Takeishi, Junichi Takeuchi

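AI summary: This paper studies the Fisher information matrix of two-layer ReLU networks with random hidden weights. Its eigenvalue distribution is highly concentrated in a few eigenspaces: the eigenvalues of the first three eigenspaces sum to 97.7% of the trace of the Fisher information matrix, independent of the number of parameters. The authors identify the function spaces corresponding to these dominant eigenspaces and find that they consist of spherical harmonics of degree at most 2, a result closely related to the Mercer decomposition of the neural tangent kernel.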

Comments 18 pages, 1 figure, 1 table

2505.16156 2026-05-13 stat.ML cs.LG

Integral Imprecise Probability Metrics

Siu Lun Chau, Michele Caprio, Krikamol Muandet

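AI summary: This paper proposes the integral imprecise probability metric (IIPM) framework, a Choquet-integral-based generalization of classical integral probability metrics for comparing imprecise probability models. It covers many such models, including lower probabilities, probability intervals, and belief functions, and can also quantify epistemic uncertainty. Theoretically, the paper gives conditions under which IIPM is a valid metric and metrizes a form of weak convergence of capacities. Empirically, the resulting uncertainty measure performs strongly in selective classification and outperforms established measures when the number of classes is large.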

Comments 48 pages

English abstract

Quantifying differences between probability distributions is fundamental to statistics and machine learning, primarily for comparing statistical uncertainty. In contrast, epistemic uncertainty -- due to incomplete knowledge -- requires richer representations than those offered by classical probability. Imprecise probability (IP) theory offers such models, capturing ambiguity and partial belief. This has driven growing interest in imprecise probabilistic machine learning (IPML), where inference and decision-making rely on broader uncertainty models -- highlighting the need for metrics beyond classical probability. This work introduces the integral imprecise probability metric (IIPM) framework, a Choquet integral-based generalisation of classical integral probability metrics to the setting of capacities -- a broad class of IP models encompassing many existing ones, including lower probabilities, probability intervals, belief functions, and more. Theoretically, we establish conditions under which IIPM serves as a valid metric and metrises a form of weak convergence of capacities. Practically, IIPM not only enables comparison across different IP models but also supports the quantification of epistemic uncertainty (EU) within a single IP model. In particular, by comparing an IP model with its conjugate, IIPM gives rise to a new class of epistemic uncertainty measures -- Maximum Mean Imprecision (MMI) -- which satisfy key axiomatic properties proposed in the uncertainty quantification literature. We validate MMI through selective classification experiments, demonstrating strong empirical performance against established EU measures, and outperforming them when classical methods struggle to scale to a large number of classes. Our work advances both theory and practice in Imprecise Probabilistic Machine Learning, offering a principled framework for comparing and quantifying epistemic uncertainty under imprecision.
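The Choquet integral that the abstract builds on can be sketched for a finite set as follows. This is the standard discrete formula, not the paper's IIPM itself; the function name, the example capacity, and the example values are assumptions made for illustration.

```python
def choquet_integral(values, capacity):
    """Discrete Choquet integral of a non-negative function with respect to a capacity.

    values: dict mapping each element to f(element) >= 0.
    capacity: callable on frozensets of elements, monotone, with capacity(frozenset()) == 0.
    """
    ordered = sorted(values, key=values.get)  # elements by ascending f
    total = 0.0
    previous = 0.0
    for index, element in enumerate(ordered):
        # Upper level set: every element whose value is at least f(element).
        upper_set = frozenset(ordered[index:])
        total += (values[element] - previous) * capacity(upper_set)
        previous = values[element]
    return total


# Sanity check with an additive capacity (an ordinary probability measure):
# in that special case the Choquet integral is the usual expectation.
probabilities = {"x": 0.2, "y": 0.3, "z": 0.5}
f = {"x": 1.0, "y": 4.0, "z": 2.0}


def additive_capacity(subset):
    return sum(probabilities[element] for element in subset)


expectation = sum(probabilities[k] * f[k] for k in f)
choquet_value = choquet_integral(f, additive_capacity)
```

Here both `expectation` and `choquet_value` come out at about 2.4. With a non-additive capacity, such as a lower probability, the same function still applies, but the result no longer reduces to a single expectation.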

2504.13898 2026-05-13 cs.HC cs.AI

Social Human Robot Embodied Conversation (SHREC) Dataset: Benchmarking Foundational Models' Social Reasoning

Dong Won Lee, Yubin Kim, Denison Guvenoz, Sooyeon Jeong, Parker Malachowsky, Louis-Philippe Morency, Cynthia Breazeal, Hae Won Park

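AI summary: This paper introduces the SHREC dataset for benchmarking the social reasoning of foundation models in real-world human-robot interaction. It contains about 400 real interaction videos and more than 10,000 annotations covering robots' social challenges and errors in areas such as emotion understanding and intent tracking. The authors define eight benchmark tasks. Experiments show that current state-of-the-art models still have a significant performance gap in social reasoning, highlighting both the difficulty of building socially intelligent AI and directions for doing so.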

Comments 23 pages, 11 figures

2504.10428 2026-05-13 stat.ML cs.DS cs.LG math.ST stat.TH

Smoothed Analysis of Learning from Positive Samples

Jane H. Lee, Anay Mehrotra, Manolis Zampetakis

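AI summary: This paper initiates a smoothed analysis of binary classification from positive-only samples, addressing the largely negative learnability results of the worst-case setting. Assuming the true distribution is smooth with respect to a reference distribution, the authors prove that all VC classes are learnable in the smoothed model and give the required sample complexity together with an efficient algorithm. The results also yield improved algorithms for estimation under unknown truncation, truncation detection, and learning from a list of reference distributions.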

Comments Accepted for presentation at the 58th ACM Symposium on Theory of Computing (STOC), 2026; abstract shortened for arXiv

English abstract

Binary classification from positive-only samples is a variant of PAC learning where the learner receives i.i.d. positive samples and aims to learn a classifier with low error. Previous work by Natarajan, Gereb-Graus, and Shvaytser characterized learnability and revealed a largely negative picture: almost no interesting classes, including two-dimensional halfspaces, are learnable. This poses a challenge for applications from bioinformatics to ecology, where practitioners rely on heuristics. In this work, we initiate a smoothed analysis of positive-only learning. We assume samples from a reference distribution $D$ such that the true distribution $D^*$ is smooth with respect to it. In stark contrast to the worst-case setting, we show that all VC classes become learnable in the smoothed model, requiring $O(VC/ε^2)$ positive samples for $ε$ classification error. We also give an efficient algorithm for any class admitting $\mathrm{poly}(ε)$-approximation by degree-$k$ polynomials whose range is lower-bounded by a constant with respect to $D$ in L1 norm. It runs in time $\mathrm{poly}(d^k/ε)$, qualitatively matching L1-regression. Our results also imply faster or more general algorithms for: (1) estimation with unknown truncation, giving the first polynomial-time algorithm for estimating exponential-family parameters from samples truncated to an unknown set approximable by non-negative polynomials in L1 norm, improving on [KTZ FOCS19; LMZ FOCS24], who required strong L2-approximation; (2) truncation detection for broad classes, including non-product distributions, improving on [DLNS STOC24], who required product distributions; and (3) learning from a list of reference distributions, where samples come from $O(1)$ distributions, one of which witnesses smoothness of $D^*$, as arises when list-decoding algorithms learn samplers for $D^*$ from corrupted data.

2502.04763 2026-05-13 cs.GT cs.LG

Shapley Value Approximation Based on k-Additive Games

Guilherme Dean Pelegrina, Patrick Kolpaczki, Eyke Hüllermeier

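AI summary: This paper proposes SVA$k_{\text{ADD}}$, a Shapley value approximation method based on $k$-additive games, to address the computational complexity of fair-allocation problems among multiple agents. It fits a $k$-additive surrogate game whose Shapley values can be computed exactly and uses them as estimates for the original game. Experiments show that it outperforms existing methods in efficiency and accuracy, providing a new tool for quantifying the contribution of features or data points.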
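One reason a $k$-additive surrogate makes Shapley values cheap is a standard identity: if a game is written through its Möbius coefficients $m(S)$, then $\phi_i = \sum_{S \ni i} m(S)/|S|$, and a $k$-additive game has $m(S) = 0$ whenever $|S| > k$. The sketch below illustrates only that identity on a made-up 2-additive game and checks it against a brute-force computation. It does not reproduce the paper's fitting procedure, and all names and coefficients are assumptions.

```python
from itertools import permutations


def game_value(mobius, coalition):
    """v(A) = sum of the Moebius coefficients m(S) over all S contained in A."""
    coalition = frozenset(coalition)
    return sum(m for subset, m in mobius.items() if subset <= coalition)


def shapley_from_mobius(mobius, players):
    """phi_i = sum over the subsets S that contain i of m(S) / |S|."""
    return {
        i: sum(m / len(subset) for subset, m in mobius.items() if i in subset)
        for i in players
    }


def shapley_brute_force(mobius, players):
    """Reference value: average marginal contribution over all player orderings."""
    totals = {i: 0.0 for i in players}
    orders = list(permutations(players))
    for order in orders:
        seen = set()
        for i in order:
            before = game_value(mobius, seen)
            seen.add(i)
            totals[i] += game_value(mobius, seen) - before
    return {i: total / len(orders) for i, total in totals.items()}


# Hypothetical 2-additive game: only singletons and pairs have non-zero coefficients.
players = ["a", "b", "c"]
mobius = {
    frozenset({"a"}): 1.0,
    frozenset({"b"}): 2.0,
    frozenset({"c"}): 0.5,
    frozenset({"a", "b"}): 1.0,
    frozenset({"b", "c"}): -0.5,
}

fast = shapley_from_mobius(mobius, players)
slow = shapley_brute_force(mobius, players)
```

For this toy game both routes give 1.5, 2.25, and 0.25 for players a, b, and c, which sum to the grand-coalition value of 4.0. The closed form touches only the non-zero coefficients, while the brute-force version enumerates every ordering.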

2412.11875 2026-05-13 stat.ML cs.LG

Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy

Philipp Reiser, Paul-Christian Bürkner, Anneli Guthke

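AI summary: This paper proposes a hybrid modeling strategy that combines simulation data with real measurement data to improve surrogate model training. It studies two probabilistic approaches to integrating the data sources during training: one trains a separate surrogate per data source and combines their predictive distributions, and the other trains a single surrogate on all sources jointly. Both use a novel weighting scheme for heterogeneous data that improves predictive accuracy and coverage and helps diagnose potential problems in the simulation model.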

2409.08290 2026-05-13 cs.NE cs.AI cs.LG

Reconsidering the energy efficiency of spiking neural networks

Zhanglu Yan, Zhenyu Bai, Weng-Fai Wong

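AI summary: This paper re-evaluates the energy-efficiency advantage of spiking neural networks (SNNs) over quantized artificial neural networks (QNNs). To make the comparison fair, it maps a rate-coded SNN with $T$ time steps to an equivalent QNN with $\lceil \log_2(T+1) \rceil$ bits, so that the two are comparable in representational capacity and hardware requirements. Using a detailed energy model that covers both computation and data movement, it analyzes the effect of various network and hardware parameters and finds that SNNs are indeed more energy-efficient under specific conditions, such as moderate time windows and low spike rates. A smartwatch case study illustrates the practical energy savings.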
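The bit width in this mapping is easy to tabulate. The sketch below assumes the usual rationale for the formula, namely that a rate-coded neuron can emit between 0 and $T$ spikes over $T$ time steps and so takes $T+1$ distinct values. The function name is hypothetical.

```python
def equivalent_qnn_bits(time_steps):
    """Return ceil(log2(T + 1)), the bits needed to represent the T + 1 spike counts 0..T."""
    if time_steps < 1:
        raise ValueError("time_steps must be a positive integer")
    # For integers T >= 1, ceil(log2(T + 1)) equals T.bit_length();
    # using it avoids floating-point rounding in log2.
    return time_steps.bit_length()


# Bit widths for a few time-window lengths.
table = {t: equivalent_qnn_bits(t) for t in (1, 3, 4, 15, 16)}
```

Under this reading, 3 time steps match a 2-bit QNN and 15 time steps match a 4-bit QNN, while going from 15 to 16 steps adds a fifth bit.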

2406.12017 2026-05-13 stat.ML cs.LG stat.CO

Sparsity-Constraint Optimization via Splicing Iteration

Jin Zhu, Junxian Zhu, Zezhi Wang, Borui Tang, Hongmei Lin, Xueqin Wang

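AI summary: This paper proposes SCOPE, a new algorithm for sparsity-constrained optimization problems arising in signal processing, statistics, and machine learning. It replaces conventional gradient steps with splicing iterations and converges without tuning any continuous hyperparameters. The theory shows that, when the sparsity level is correctly specified, SCOPE converges linearly and exactly recovers the sparse support, and these guarantees do not rely on restricted-isometry-property conditions. Experiments show superior performance on sparse quadratic optimization, sparse classifier learning, and sparse Markov network recovery.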

Comments 35 pages

2310.17025 2026-05-13 cs.NI cs.AI

netFound: Principled Design for Network Foundation Models

Sylee Beltiukov, Satyandra Guthula, Haarika Manda, Jaber Daneshamooz, Wenbo Guo, Walter Willinger, Arpit Gupta, Inder Monga

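AI summary: This paper proposes netFound, a network foundation model designed to address three problems of existing models in traffic analysis tasks: reliance on data shortcuts, degenerate embedding spaces, and failure to capture external network conditions. It formulates four design principles (protocol-aware tokenization, operational-context embeddings, burst-flow hierarchical attention, and privacy-first input design) and builds netFound on them. Experiments show that netFound significantly outperforms existing models in representation quality, alignment with domain-expert features, and identification of external context, while also performing well on privacy protection.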

2110.01729 2026-05-13 stat.ML cs.LG

Stochastic tensor space feature theory with applications to robust machine learning

Julio Enrique Castrillon-Candas, Kaili Shi, Dingning Liu, Sicheng Yang, Xiaoling Zhang, Mark Kon, the Alzheimer's Disease Neuroimaging Initiative

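AI summary: This paper proposes a multilevel orthogonal subspace (MOS) Karhunen-Loeve feature theory based on stochastic tensor spaces for constructing robust machine learning features. Training data are treated as realizations of a random field in a Bochner space, and Karhunen-Loeve and hierarchical expansions are used to build multilevel orthogonal subspaces that detect anomalous signal components, yielding more discriminative features for classification. Experiments on an Alzheimer's disease plasma dataset show classification accuracy significantly higher than that of mainstream methods such as gradient boosting and random forests.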

2605.11829 2026-05-13 physics.optics cs.LG eess.SP physics.med-ph

Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media

Martin Hohmann

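AI summary: This work proposes the Bin Latent Transformer (BiLT), an autoencoder for calibration-free spectral unmixing of turbid media that aims to accurately recover the optical properties of their constituents. Its core idea is to replace the conventional fully connected encoder with a cross-attention-based encoder, which makes the model insensitive to the position of spectral wavelengths and therefore more robust to spectrometer calibration drift or hardware replacement. In liquid simulation experiments the model performs well and generalizes across instrument configurations.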

2605.11784 2026-05-13 cs.CE cs.AI cs.LG

Crash Assessment via Mesh-Based Graph Neural Networks and Physics-Aware Attention

Gabriel Curtosi, Carlos Manuel Ruiz Ruiz, Fabiola Cavaliere, Xabier Larráyoz Izcara

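AI summary: This work proposes hybrid surrogate models that combine mesh-based graph neural networks with physics-aware attention to predict structural deformation fields efficiently in a full-vehicle lateral pole impact. By combining local mesh message passing, geometry-aware global attention, and sparse contact-aware correction, the models capture both short-range structural interactions and long-range deformation patterns while remaining computationally efficient. On the test set, the best hybrid model achieves a temporal mean root-mean-square error of 3.20 mm and offers the best balance between scalar accuracy, survival-space consistency, and physically interpretable displacement fields, enabling fast full-field predictions for industrial crash-engineering analysis.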

Comments 40 pages, 15 figures, 7 tables

English abstract

Full-vehicle crash simulations are computationally expensive, limiting their use in iterative design exploration. This work investigates learned hybrid surrogate models (MeshTransolver, MeshGeoTransolver, and MeshGeoFLARE) for predicting time-resolved structural deformation fields in an industrial lateral pole-impact benchmark. We evaluate whether neural surrogates can reproduce full-field crash kinematics with sufficient accuracy, spatial regularity, and structural plausibility for engineering interpretation. The proposed architectures combine local mesh message passing, geometry-aware global attention, and sparse contact-aware correction for autoregressive crash rollout. We compare mesh-based graph neural networks, attention-based geometric models, and hybrid architectures under a common training and hyperparameter configuration. The hybrid models capture both short-range structural interactions and long-range deformation patterns, while a sparse contact-aware variant assesses the effect of dynamic proximity interactions during rollout. On a 25-sample full-vehicle test set, the best hybrid model achieves a temporal mean root-mean-square error of 3.20 mm. While geometry-aware attention baselines are quantitatively competitive, qualitative side-view inspection shows they can introduce local spatial noise and deformation irregularities that complicate structural interpretation. In contrast, hybrid mesh-attention models provide the best balance between scalar accuracy, survival-space consistency, and physically interpretable displacement fields. These results suggest that crash surrogate assessment should combine global error metrics with downstream safety-relevant quantities and qualitative field inspection. The proposed methodology enables fast full-field predictions while preserving essential structural information for industrial crash-engineering analysis.

2605.11770 2026-05-13 cs.CR cs.AI cs.SY eess.SY

Behavioral Integrity Verification for AI Agent Skills

Yuhao Wu, Tung-Ling Li, Hongliang Liu

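AI summary: This work proposes behavioral integrity verification (BIV), a method for checking whether the actual capabilities of AI agent skills match what they declare, filling a gap in existing security mechanisms, which do not verify the skills themselves. It combines deterministic code analysis with LLM-assisted capability extraction under a unified taxonomy and supports downstream tasks such as deviation classification, root-cause analysis, and malicious-skill detection. Experiments on a large-scale skill dataset reveal a widespread gap between skill descriptions and implementations and show higher malicious-skill detection accuracy than existing methods.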

2605.11759 2026-05-13 cs.CE cs.LG cs.NA math.NA

A nonlinear extension of parametric model embedding for dimensionality reduction in parametric shape design

Andrea Serani, Giorgio Palma, Matteo Diez

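AI summary: In simulation-based shape design, high-dimensional parameterizations limit the efficiency of optimization and design-space exploration. This paper proposes a nonlinear extension of parametric model embedding (NLPME) that retains geometry-driven latent variables and parametric reconstruction but replaces the conventional linear subspace with a nonlinear latent space, which handles nonlinear geometric variation more efficiently. Experiments show that NLPME keeps an explicit backward mapping while reaching the same reconstruction accuracy as the linear method with fewer latent variables, giving higher compression efficiency and better engineering applicability.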

2605.11758 2026-05-13 eess.IV cs.CV

DiffSegLung: Diffusion Radiomic Distillation for Unsupervised Lung Pathology Segmentation

Rezkellah Noureddine Khiati, Pierre-Yves Brillet, Catalin Fetita

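AI summary: This paper proposes DiffSegLung, an unsupervised lung pathology segmentation framework that addresses the lack of annotated CT data and the failure of existing diffusion models to exploit the Hounsfield unit (HU) signal. It introduces diffusion radiomic distillation, in which handcrafted radiomic features serve as a physically grounded "teacher" that guides the bottleneck features of a 3D diffusion U-Net, so that pathology-discriminative structure is learned without annotations. Experiments show significant improvements in segmentation performance and generation quality on multiple heterogeneous CT datasets.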

2605.11720 2026-05-13 cs.SE cs.AI cs.MA

A Research Agenda on Agents and Software Engineering: Outcomes from the Rio A2SE Seminar

Davide Taibi, Henry Muccini, Karthik Vaidhyanathan, Marcos Kalinowski, Michele Albano, Antonio Pedro Santos Alves, Renato Cerqueira, Mateus Devino, Matteo Esposito, Rodrigo Falcão, Vinicius Henning, Foutse Khomh, Valentina Lenarduzzi, Qinghua Lu, Matías Martínez, Henrique Mello, Daniel Mendez, Lucas Romao

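AI summary: With the rise of agentic AI, software engineering faces two intertwined shifts: agents are increasingly used to support software engineering tasks, and agentic AI systems, as complex systems in their own right, call for rethinking existing software engineering practices. Based on the outcomes of the A2SE seminar held in Rio de Janeiro, this paper presents a community-driven research agenda that identifies six thematic areas, with short-term and long-term research directions for each, giving the software engineering community a structured basis for coordinating research efforts.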

Comments 6 pages, 1 table, A2SE meeting, https://sites.google.com/view/a2se2026/home

2605.11718 2026-05-13 q-bio.NC cs.AI cs.NE

Self-organized MT Direction Maps Emerge from Spatiotemporal Contrastive Optimization

Zhaotian Gu, Molan Li, Jie Su, Chang Liu, Tianyi Qian, Dahui Wang

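AI summary: This study investigates the computational origin of direction-selectivity maps in the dorsal stream of primate visual cortex, such as those in area MT. It introduces a spatiotemporal topographic deep artificial neural network (TDANN) that combines self-supervised contrastive learning with a biologically inspired spatial loss. Trained on natural videos, the model spontaneously develops brain-like motion-direction maps and pinwheel structures. The study shows that MT direction selectivity arises from an optimization trade-off between task-driven discriminative pressure and spatial regularization. The learned representations quantitatively match physiological baselines from macaque MT, offering new insight into a unified computational account of the dorsal and ventral visual streams.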

2605.11671 2026-05-13 cs.CR cs.AI cs.SE

Cochise: A Reference Harness for Autonomous Penetration Testing

Andreas Happe, Jürgen Cito

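AI summary: Cochise is a lightweight Python framework for autonomous penetration-testing experiments that provides a standardized platform for evaluating LLM-driven penetration-testing agents. It connects to Linux hosts over SSH, supports controlled target environments, and uses a Planner-Executor architecture that separates long-term state from execution logic, improving controllability and reproducibility. It also provides replay and analysis tools that let researchers visualize runs and evaluate performance, supporting deeper study of agent behavior and efficiency.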

2605.11653 2026-05-13 cs.CR cs.AI

Every Bit, Everywhere, All at Once: A Binomial Multibit LLM Watermark

Thibaud Gloaguen, Robin Staab, Mark Vero, Martin Vechev

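AI summary: As LLM watermarking moves into commercial use, there is growing demand for watermarks that carry richer multibit payloads such as user IDs or timestamps. This paper proposes a new multibit watermark that uses binomial encoding to embed every bit of the payload at every token position, combined with a stateful encoder that dynamically adjusts encoding pressure to improve effectiveness. Experiments show that it outperforms 8 baselines in message accuracy and robustness, with a larger margin for large payloads or low distortion. The paper also introduces a per-bit confidence score as a more practically useful evaluation metric.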

2605.11652 2026-05-13 stat.ML cs.LG math.ST stat.TH

Posterior Contraction Rates for Sparse Kolmogorov-Arnold Networks in Anisotropic Besov Spaces

Jeunghun Oh, Kyeongwon Lee, Jaeyong Lee, Lizhen Lin

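AI summary: This paper studies posterior contraction rates for sparse Bayesian Kolmogorov-Arnold networks (KANs) in anisotropic Besov spaces, giving KANs a statistical foundation from a Bayesian perspective. With a spike-and-slab sparse prior, sparse Bayesian KANs are shown to attain near-optimal posterior contraction rates that depend on the intrinsic anisotropic smoothness of the target function. With a hyperprior on the model-size parameters, the posterior also adapts to unknown anisotropic smoothness while retaining the corresponding near-optimal rate. Compared with sparse-MLP-based models, the depth of a KAN can stay fixed, with complexity controlled through network width, spline grid range, and parameter sparsity, which effectively avoids the curse of dimensionality.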