arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.01813 2026-06-02 cs.CL

OctoT2I：一种自我进化的智能文本到图像路由系统

Xu Jiang, Bin Chen, Gehui Li, Yule Duan, Ronggang Wang, Jian Zhang

发表机构 * School of Electronic and Computer Engineering, Peking University（电子与计算机工程学院，北京大学）； Guangdong Provincial Key Laboratory of Ultra High Definition Immersive Media Technology, Shenzhen Graduate School, Peking University（广东省超高清沉浸媒体技术重点实验室，北京大学深圳研究生院）

AI总结提出OctoT2I框架，通过自进化机制构建知识库并采用状态化多轮路由策略，联合优化生成质量与推理效率，在GenEval上达到0.96性能，同时实现90.3%推理加速和56.6%能效提升。

详情

AI中文摘要

并行异步自适应一阶方法的随机收敛性

Serge Gratton, Philippe L. Toint

发表机构 * Université de Toulouse, INP, IRIT, Toulouse, France（图卢兹大学，INP，IRIT，法国图卢兹）； IA Artificial and Natural Intelligence Toulouse Institute (ANITI)（图卢兹3IA人工智能与自然智能研究所（ANITI））； NAXYS, University of Namur, Namur, Belgium（NAXYS，纳慕尔大学，比利时纳慕尔）

AI总结本文提出一类新的异步自适应一阶优化方法，包括多种流行算法的异步变体，并分析其在非凸函数上的随机收敛性，达到O(1/√t)的收敛速率。

2606.01781 2026-06-02 cs.AI

Structure-Guided Adaptive Propagation for Protein-Protein Interaction Site Prediction

结构引导的自适应传播用于蛋白质-蛋白质相互作用位点预测

Enqiang Zhu, Yizi Liu, Yilong Luo, Yao Chen, Yu Zhang, Baoshan Ma

发表机构 * Institute of Computing Science and Technology, Guangzhou University（广州大学计算机科学与技术学院）； School of Computer Science, Peking University（北京大学计算机科学学院）； Information Science & Technology Department, Beijing Capital International Airport Co., Ltd.（北京首都国际机场有限公司信息科学与技术部）； School of Information Science and Technology, Dalian Maritime University（大连海事大学信息科学与技术学院）

AI总结提出SGAP-PPIS模型，利用等变图神经网络的多尺度几何状态生成残基级传播系数，实现自适应信息扩散，在Test_60上取得竞争性能。

Comments 9 pages, 3 figures

详情

AI中文摘要

准确预测蛋白质-蛋白质相互作用位点（PPIS）对于理解细胞过程、疾病机制和治疗靶点发现至关重要。基于图的深度学习通过整合残基级结构上下文推进了PPIS预测。然而，尽管蛋白质界面存在结构和功能异质性，大多数基于图的模型仍依赖固定传播方案，对所有残基一视同仁。这种传播可能限制信息扩散适应局部几何环境的能力，使得难以区分真正的相互作用位点和结构相似的非相互作用邻居。我们提出SGAP-PPIS，一种用于PPIS预测的结构引导自适应传播模型。SGAP-PPIS不使用固定传播机制，而是利用等变图神经网络的多尺度几何状态生成残基级传播系数。这种设计允许每个残基根据其几何微环境自适应地平衡局部特征保留和邻域扩散。实验结果表明，SGAP-PPIS在Test_60上达到了与最先进方法竞争的性能。消融研究表明，几何条件自适应传播、尺度对齐几何引导和多步传播状态表示共同推动了这些改进。

英文摘要

Accurate prediction of protein-protein interaction sites (PPIS) is essential for understanding cellular processes, disease mechanisms, and therapeutic target discovery. Graph-based deep learning has advanced PPIS prediction by incorporating residue-level structural context. However, most graph-based models still rely on fixed propagation schemes that treat all residues similarly, despite the structural and functional heterogeneity of protein interfaces. Such propagation may limit the ability to adapt information diffusion to local geometric environments, making it difficult to distinguish true interaction sites from structurally similar non-interacting neighbors. We present SGAP-PPIS, a structure-guided adaptive propagation model for PPIS prediction. Rather than using a fixed propagation mechanism, SGAP-PPIS leverages multi-scale geometric states from an equivariant graph neural network to generate residue-wise propagation coefficients. This design allows each residue to adaptively balance local feature preservation and neighborhood diffusion according to its geometric microenvironment. Experimental results show that SGAP-PPIS achieves competitive performance among the state-of-the-art methods on Test\_60. Ablation studies show that geometry-conditioned adaptive propagation, scale-aligned geometric guidance, and multi-step propagation-state representation jointly drive these improvements.

URL PDF HTML ☆

赞 0 踩 0

2606.01779 2026-06-02 cs.CL

HarnessForge: Joint Harness and Policy Evolution for Adaptive Agent Systems

HarnessForge：面向自适应智能体系统的协同框架与策略进化

Mingju Chen, Can Lv, Guibin Zhang, Heng Chang, Shiji Zhou

发表机构 * Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, School of Artificial Intelligence, Beihang University（北京未来区块链与隐私计算先进创新中心，人工智能学院，北京航空航天大学）； Tsinghua University（清华大学）

AI总结提出HarnessForge元自适应框架，通过框架-策略协同进化实现LLM智能体系统的全系统自适应，在多个基准上显著提升性能。

Comments 25 pages, 13 figures

详情

AI中文摘要

LLM智能体越来越需要在需要不同执行范式的异构任务环境中运行。这对固定智能体系统提出了挑战，并推动了超越孤立组件更新的系统级元自适应。虽然现有工作已自适应外部框架或训练底层推理策略，但全系统自适应仍未被充分表征。结构与执行之间的自适应空间很少被明确化，外部框架与内部推理器之间的兼容性也未得到联合优化。我们提出HarnessForge，一个用于进化LLM智能体系统的元自适应框架。HarnessForge将智能体系统形式化为一个框架-策略对，定义了一个稳定的自适应空间，将框架级执行结构与策略级推理行为分离。然后，它通过故障引导的框架裁剪和框架条件化的策略对齐执行框架-策略协同进化。在来自不同领域的五个基准上的实验表明，HarnessForge一致地改进了Qwen3-4B和Qwen3-8B骨干网络，优于仅框架和仅策略的基线，比最强基线提升高达12.0%，并实现了有利的展开效率权衡，证明了框架-策略协同进化是有效的，并且框架与推理策略之间的可执行兼容性对于智能体系统自适应至关重要。代码可在https://github.com/mingju-c/HarnessForge获取。

英文摘要

LLM agents are increasingly expected to operate across heterogeneous task regimes that require distinct execution paradigms. This challenges fixed agent systems and motivates system-level meta-adaptation beyond isolated component updates. While existing works have adapted external harness or trained underlying reasoning policies, full-system adaptation remains insufficiently characterized. The adaptation space between structure and execution is rarely made explicit, and the compatibility between the external harness and the internal reasoner is not optimized jointly. We propose HarnessForge, a meta-adaptive framework for evolving LLM agent systems. HarnessForge formulates an agent system as a harness--policy pair, defining a stable adaptation space that separates harness-level execution structure from policy-level reasoning behavior. It then performs harness--policy co-evolution through fault-guided harness tailoring and harness-conditioned policy alignment. Experiments across five benchmarks from diverse domains show that HarnessForge consistently improves both Qwen3-4B and Qwen3-8B backbones, outperforming harness-only and policy-only baselines with gains of up to 12.0\% over the strongest baseline and achieving favorable rollout-efficiency tradeoffs, demonstrating that harness--policy co-evolution is effective, and that executable compatibility between the harness and reasoning policy is essential for agent-system adaptation. The code is available at https://github.com/mingju-c/HarnessForge.

URL PDF HTML ☆

赞 0 踩 0

2606.01777 2026-06-02 cs.RO

Trans2Occ: Voxel Occupancy Estimation and Grasp for Transparent Objects from Simulation to Reality

Trans2Occ: 从仿真到现实的透明物体体素占用估计与抓取

Yixuan Yang, Sha Zhang, Rui Li, Zhenfei Yin, Xinzhu Ma, Yiran Qin, Lei Bai, Xudong Xu, Shilin Shan, Wangmeng Zuo, Yanyong Zhang, Wanli Ouyang, Feng Zheng, Shixiang Tang, Dongzhan Zhou

发表机构 * Shanghai AI Laboratory（上海人工智能实验室）； SUSTech（南方科技大学）； CUHK（香港中文大学）； Harbin Institute of Technology（哈尔滨工业大学）； University of Oxford（牛津大学）； Beihang University（北京航空航天大学）； Nanyang Technological University（南洋理工大学）； University of Science and Technology of China（中国科学技术大学）

AI总结提出基于单视图RGB输入的体素占用预测框架，结合仿真数据生成与规则抓取策略，实现透明物体的鲁棒3D感知与操作。

详情

AI中文摘要

透明物体由于折射和反射导致的深度感知不可靠，对机器人感知构成挑战。先前的方法依赖多视图重建或深度补全，但往往难以在真实机器人系统中扩展或部署。本文提出一个基于单视图RGB输入的透明物体感知与操作实用框架。我们的方法直接从单张图像预测体素空间占用，提供支持下游机器人抓取的几何感知表示。为实现大规模训练，我们构建了一个仿真流水线，在不同材质和光照条件下生成配对的RGB图像和体素占用标注。我们证明预测的占用表示对领域偏移具有鲁棒性，并能从仿真有效迁移到真实机器人设置，无需微调。基于占用构建的简单规则抓取策略进一步实现了透明物体的可靠抓取性能。在仿真和真实环境中的大量实验表明，我们的框架提供了准确的3D理解，并实现了透明物体的实用操作。这些结果表明，单视图占用预测为机器人中的透明物体感知提供了一种可扩展且有效的解决方案。

英文摘要

Transparent objects remain challenging for robotic perception due to unreliable depth sensing caused by refraction and reflection. While prior approaches rely on multi-view reconstruction or depth completion, they are often difficult to scale or deploy in real-world robotic systems. In this paper, we present a practical framework for transparent object perception and manipulation based on single-view RGB input. Our approach predicts voxel-space occupancy directly from a single image, providing a geometry-aware representation that supports downstream robotic grasping. To enable large-scale training, we construct a simulation pipeline that generates paired RGB images and voxel occupancy annotations under diverse materials and lighting conditions. We demonstrate that the predicted occupancy representation is robust to domain shifts and transfers effectively from simulation to real-world robotic setups without fine-tuning. A simple rule-based grasping strategy built on top of the occupancy further achieves reliable grasp performance on transparent objects. Extensive experiments in both simulation and real-world environments show that our framework provides accurate 3D understanding and enables practical manipulation of transparent objects. These results suggest that single-view occupancy prediction offers a scalable and effective solution for transparent object perception in robotics.

URL PDF HTML ☆

赞 0 踩 0

2606.01774 2026-06-02 cs.LG cs.AI

TriAlign: 迈向个性化大语言模型对齐中的通用真值一致性

Thi-Nhung Nguyen, Linhao Luo, Rollin Omari, Junae Kim, Thuy-Trang Vu, Dinh Phung

发表机构 * Department of Data Science & AI, Monash University（数据科学与人工智能系，墨尔本大学）； Defence Science and Technology Group, Australia（澳大利亚国防科学与技术集团）

AI总结针对个性化大语言模型在不同社会群体间存在的通用真值不一致问题，提出TriAlign框架，通过离线多智能体强化学习联合优化真值准确性、跨群体一致性和个性化，实现公平对齐。

详情

AI中文摘要

个性化大语言模型根据用户的偏好和社会属性调整响应，但可能在不同社会群体间引入显著的通用真值不一致性，即某些群体在客观任务上系统性地获得较不准确的响应。现有的对齐方法要么忽略个性化，要么主要关注主观偏好对齐，很大程度上忽视了通用真值的公平性和一致性。为填补这一空白，我们研究了真值不变对齐（TIA），这是一个针对个性化LLM的对齐问题，旨在确保通用真值在不同社会群体间保持一致，同时保留个性化。我们提出TriAlign，这是首个用于TIA的离线多智能体强化学习（MARL）框架，其中每个社会群体被建模为一个交互的智能体。TriAlign通过一个公平感知目标和一个显式的不一致性惩罚，联合优化通用真值准确性、跨群体真值一致性和个性化。跨多个基准的实验表明，TriAlign在这三个目标之间实现了比强基线更强的平衡，减少了跨社会群体的通用真值差异，同时提高了客观任务性能和个性化质量。

英文摘要

Personalized large language models adapt responses to users' preferences and social attributes, but can introduce substantial universal truth inconsistencies across social groups, where some groups systematically receive less accurate responses on objective tasks. Existing alignment methods either ignore personalization or mainly focus on subjective preference alignment, largely overlooking fairness and consistency in universal truths. To address this gap, we study Truth-Invariant Alignment (TIA), an alignment problem for personalized LLMs that aims to ensure universal truths remain consistent across social groups while preserving personalization. We propose TriAlign, the first offline multi-agent reinforcement learning (MARL) framework for TIA, where each social group is modeled as an agent interacting. TriAlign jointly optimizes universal truth accuracy, cross-group truth consistency, and personalization through a fairness-aware objective and an explicit inconsistency penalty. Experiments across diverse benchmarks demonstrate that TriAlign achieves a stronger balance among these three objectives than strong baselines, reducing universal truth disparities across social groups while improving both objective task performance and personalization quality.

URL PDF HTML ☆

赞 0 踩 0

2606.01753 2026-06-02 cs.CV

Quality-Guided Semi-Supervised Learning for Medical Image Segmentation

质量引导的半监督学习用于医学图像分割

Kumar Abhishek, Ghassan Hamarneh

发表机构 * School of Computing Science, Simon Fraser University, Canada（Simon Fraser大学计算机科学学院）

AI总结提出一种质量引导的半监督学习框架，通过专用网络估计分割质量，并利用质量感知正则化和伪标签重加权提升医学图像分割性能。

Comments Early Accept at MICCAI 2026, 13 pages, 2 figures

详情

AI中文摘要

训练准确的医学图像分割模型需要大量密集标注的数据，这既昂贵又耗时。半监督学习通过从大量未标注数据和少量标注数据中学习来缓解这一问题。然而，大多数现代半监督学习方法依赖未标注数据的伪标签，并通常通过模型置信度或不确定性来评估其可靠性，这些度量是自我指涉的，缺乏对分割质量的明确基础。相反，我们提出了一种质量引导的半监督学习框架，训练一个专用网络从图像-掩膜对中估计分割质量。该预测器在通过合成损坏生成的变质量掩膜上进行训练，这些损坏结合了部分训练分割模型产生的不完美输出，捕捉训练中遇到的真实错误模式。我们通过两种互补机制将质量预测器集成到半监督学习中：质量感知正则化损失和基于质量的伪标签样本重新加权方案。我们表明，我们的方法可以作为现有半监督学习框架的即插即用增强。在五个数据集和多种架构上的大量实验表明，与竞争性的半监督学习方法相比，我们的方法取得了一致的改进，推进了半监督医学图像分割的最新水平。

英文摘要

Training accurate medical image segmentation models requires large amounts of densely annotated data, which is costly and time-consuming to obtain. Semi-supervised learning (SSL) alleviates this by learning from both abundant unlabeled data and limited labeled data. However, most modern SSL methods rely on pseudolabels for unlabeled data, and typically assess their reliability through model confidence or uncertainty, measures that are self-referential and lack explicit grounding in segmentation quality. Instead, we propose a quality-guided SSL framework that trains a dedicated network to estimate segmentation quality from image-mask pairs. The predictor is trained on variable-quality masks generated through synthetic corruptions augmented with imperfect outputs from partially trained segmentation models, capturing realistic error patterns encountered during training. We integrate the quality predictor into SSL through two complementary mechanisms: a quality-aware regularization loss and a quality-based pseudolabel sample reweighting scheme. We show that our method serves as a drop-in enhancement to existing SSL frameworks. Extensive experiments across five datasets and multiple architectures demonstrate consistent improvements over competing SSL methods, advancing the state-of-the-art in semi-supervised medical image segmentation.

URL PDF HTML ☆

赞 0 踩 0

2606.01747 2026-06-02 cs.CL cs.AI

Construction of Historical Knowledge Graphs Based on BERT and Graph Neural Networks

基于BERT和图神经网络的历史知识图谱构建

Ping Li, Bartlomiej Brzozka

发表机构 * Shandong Management University（山东管理大学）； Maria Curie-Sklodowska University（玛丽·居里-斯洛多夫斯卡大学）

AI总结本文提出结合BERT和图神经网络的高层架构，从历史文本中提取实体和关系，构建知识图谱，在精度、召回率和F1分数上优于传统方法和深度学习基线。

Comments 9 pages, 4 figures

详情

AI中文摘要

通过数字人文研究和规模化历史数据分析，大量传统历史文本被转换为结构化知识图谱。本文提出一种结合双向编码器表示（BERT）和图神经网络（GNN）的高层架构，用于从各类历史文本中提取实体和关系。传统历史文本系统地解决了语言歧义、上下文限制的引用以及缺乏既定语法规范的问题。本研究根据上述建议，开发了一种基于FastRQNet和预训练视觉-语言模型Vilt-qaformer+RoBInet的新型图像检索系统。实验充分利用了市政记录、议会文件和历史信函的全面数据集。与传统基于规则的技术和其他流行的深度学习基线相比，联合BERT-GNN系统获得了更高的精度、召回率和F1分数（表2）。该结构在创建知识图谱时能够以足够的准确性和全面性处理复杂的嵌套结构和隐式引用问题。上述实验表明，将关系图学习算法与上下文敏感的语义表示技术相结合，可以自动提取历史数据，为知识库积累累积的智慧。

英文摘要

Through digital humanities research and scale-up historical data analysis, a significant amount of traditional historical text is converted into structured knowledge graphs. This paper provides a high-level architecture that combines bidirectional encoder representations of transformers (BERT) and graph neural networks (GNN) to extract the entities and relationships from various types of historical texts. The texts of traditional history resolve linguistic ambiguities, references limited by context, and a lack of established grammatical norms in a systematic way. This study develops a new image retrieval system based on FastRQNet and pre-trained vision-language model Vilt-qaformer+RoBInet in accordance with the aforementioned recommendations. The experiments make full use of a comprehensive collection of municipal records, parliamentary documents, and historical correspondence. When compared to conventional rule-based techniques and other popular deep-learning baselines, the joint BERT-GNN system obtains greater Precision, Recall, and F1-score (Table 2). Complex nested structures and implicit reference issues can be handled by this structure with sufficient accuracy and thoroughness when creating knowledge graphs. The aforementioned experiments show that combining relational graph learning algorithms with context-sensitive semantic representation techniques can automatically extract historical data to add accumulated wisdom to the knowledge repository.

URL PDF HTML ☆

赞 0 踩 0

2606.01746 2026-06-02 cs.CV cs.LG

Sensitivity as a Double-Edged Sword: A Trade-off Between Discriminability and Adversarial Robustness

敏感性是一把双刃剑：判别性与对抗鲁棒性之间的权衡

Kai Wang

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结本文发现全连接分类器的高敏感性带来判别性但也导致脆弱性，而ℓ2距离分类器的不敏感性带来鲁棒性但限制性能，为此提出基于混合原型混合框架的ℓ2重分类器，通过融合稳定原型和动态原型实现判别性与鲁棒性的平衡，并设计混合替代攻击评估协议。

Comments 13 pages including reference, 4 figures

详情

AI中文摘要

现代神经网络极易受到对抗性扰动的影响。在这项工作中，我们指出这种脆弱性部分源于广泛使用的全连接分类器对此类扰动的敏感性。相比之下，简单的基于ℓ2距离的分类器表现出显著更强的鲁棒性。我们提供了充分的理论和实证分析，表明全连接分类器的高敏感性使其具有判别性，但也使其脆弱；相反，ℓ2分类器的不敏感性赋予了鲁棒性但限制了性能。受这种权衡的启发，我们提出了一种基于混合原型混合框架的新型ℓ2重分类器。该方法保留了全连接分类器的判别能力，同时利用了ℓ2距离的鲁棒性。它通过融合两种原型类型来产生基于ℓ2距离的预测：（1）通过指数移动平均更新的稳定数据集级原型，以及（2）使用直通估计器从全连接分类器预测生成的动态批量级原型。然而，这种基于直通估计器的动态架构给评估带来了重大挑战，例如梯度混淆和前向不连续性。为了解决这个问题，我们提出了一种新的严格评估协议——混合替代攻击，该协议使用多个替代模型以及强大的AutoAttack，以确保公平和稳健的评估。大量实验表明，我们的轻量级即插即用模块只需极少的微调，就能有效增强各种现有最先进对抗训练模型的对抗鲁棒性。

英文摘要

Modern neural networks are highly susceptible to adversarial perturbations. In this work, we identify that part of this vulnerability stems from the sensitivity of the widely used fully connected (FC) classifiers to such perturbations. In contrast, simple $\ell_2$ distance-based classifiers exhibit significantly greater robustness. We provide thorough theoretical and empirical analysis showing that while FC classifiers' high sensitivity makes them discriminative, it also makes them vulnerable. Conversely, $\ell_2$-classifiers' insensitivity grants robustness but limits performance. Motivated by this trade-off, we propose a novel $\ell_2$-reclassifier based on a Hybrid Prototype Mixing (HPM) framework. This method retains the discriminative power of FC classifiers while leveraging the robustness of $\ell_2$ distance. It yields $\ell_2$-distance-based predictions by fusing two prototype types: (1) stable, dataset-level prototypes updated via EMA, and (2) dynamic, batch-level prototypes generated from the FC classifier's predictions using a Straight-Through Estimator (STE). However, this dynamic, STE-based architecture introduces significant challenges for evaluation, such as gradient obfuscation and forward discontinuity. To address this, we propose a new, rigorous evaluation protocol, the Mixed Surrogate Attack (MSA), which uses multiple surrogates along with powerful AutoAttack to ensure a fair and robust assessment. Extensive experiments demonstrate that our lightweight, plug-and-play module, with minimal fine-tuning, effectively enhances the adversarial robustness of various existing SOTA adversarially trained models.

URL PDF HTML ☆

赞 0 踩 0

2606.01738 2026-06-02 cs.CL cs.AI

捷径通往虚无：揭秘深度虚假回归

Guanrong Xu, Jessica Li, Hao Wang, Yuzhe Yang

发表机构 * University of California, Los Angeles（加州大学洛杉矶分校）； Rutgers University（罗格斯大学）； Yang AI Lab（杨人工智能实验室）

AI总结针对连续预测中的虚假相关性，提出利用标签和特征空间中虚假属性的相似性来校准分布，从而提升模型在分布偏移下的泛化能力。

详情

AI中文摘要

现实世界中的回归常常存在捷径：在训练中与连续目标虚假相关的属性，在部署偏移下不可靠；使用此类捷径回归目标可能在测试时灾难性失败。现有关于虚假相关性的研究主要关注分类，其中标签是分类的且组是自然定义的。然而，许多现实任务需要连续预测，其中不存在硬标签边界或离散的组-标签对。我们将深度虚假回归（DSR）定义为从具有属性-标签混淆的回归数据中学习，处理连续虚假相关性，并在测试时泛化到所有属性-标签组合。受分类和回归捷径内在差异的启发，我们提出利用标签和特征空间中虚假属性之间的相似性，从而在跨属性校准标签和学习特征分布时考虑邻近目标和相关组。在涵盖计算机视觉、环境感知和大语言模型（LLM）回归的常见真实世界DSR数据集上的大量实验验证了我们策略的优越性能。我们的工作填补了研究连续预测中虚假相关性的基准和技术空白。

英文摘要

Real-world regression often exhibits shortcuts: attributes that are spuriously correlated with continuous targets in training, yet unreliable under deployment shifts; regressing targets using such shortcuts may fail catastrophically at test time. Existing studies on spurious correlations focus primarily on classification, where labels are categorical and groups are naturally defined. However, many real-world tasks require continuous prediction, where hard label boundaries or discrete group-label pairs do not exist. We define Deep Spurious Regression (DSR) as learning from regression data with attribute-label confounding, addressing continuous spurious correlations, and generalizing to all attribute-label combinations at test time. Motivated by the intrinsic difference between classification and regression shortcuts, we propose to exploit the similarity among spurious attributes in both label and feature spaces, thereby accounting for nearby targets and related groups while calibrating both label and learned feature distributions across attributes. Extensive experiments on common real-world DSR datasets that span computer vision, environmental sensing, and large language model (LLM) regression verify the superior performance of our strategies. Our work fills the gap in benchmarks and techniques for studying spurious correlations in continuous prediction.

URL PDF HTML ☆

赞 0 踩 0

2606.01722 2026-06-02 cs.LG cs.AI cs.DC

Post-Deterministic Distributed Systems: A New Foundation for Trustworthy Autonomous Infrastructure

后确定性分布式系统：可信自主基础设施的新基础

Jun He, Deying Yu

发表机构 * OpenKedge Inc.（OpenKedge公司）

AI总结本文提出后确定性分布式系统（PDDS）模型，以协调确定性代码、随机模型和自主代理共存的异构环境，并定义了五大架构支柱及新的故障分类。

Comments 8 pages, 1 table

详情

AI中文摘要

几十年来，分布式系统通常假设正确的参与者执行协议指定的行为，具有稳定、外部定义和确定性的语义。经典理论广泛参数化了网络时序、通信拓扑和故障域，但参与者模型相对固定。将自主推理引擎、随机模型驱动代理和策略驱动参与者集成到云控制平面、事件响应系统和金融基础设施中，挑战了这一假设的普遍性。这些代理通常产生不同的推理路径、不同的操作轨迹和异构的内部表示，同时实现语义等价且正确的结果。在本文中，我们引入后确定性分布式系统（PDDS）作为研究和工程模型，用于协调确定性代码、随机模型和自主代理共存的异构环境。我们表明，经典分布式计算模型构成了这种参与者通用模型的零歧义特例。我们并非主张确定性系统消失；而是确定性执行不能再作为自主基础设施的通用参与者假设。最后，我们概述了后确定性基础设施的五大架构支柱：协议驱动开发、可验证代理基础设施、自主状态控制平面、语义法定保证和认知状态复制。认知状态复制将持久性和一致性模型从数据可见性扩展到知识可见性，实现代理记忆、可验证语义回滚以及跨推理参与者的连贯性。我们还定义了在此环境中出现的故障类别的分类法。

英文摘要

For decades, distributed systems have typically assumed that correct participants execute protocol-specified behavior with stable, externally defined, and deterministic semantics. Classical theory has extensively parameterized network timing, communication topologies, and failure domains, but this participant model has remained comparatively fixed. The integration of autonomous reasoning engines, stochastic model-driven agents, and policy-driven actors into cloud control planes, incident response systems, and financial infrastructure challenges the universality of this assumption. These agents often produce divergent reasoning paths, distinct operational traces, and heterogeneous internal representations while achieving semantically equivalent and correct outcomes. In this paper, we introduce Post-Deterministic Distributed Systems (PDDS) as a research and engineering model for coordinating heterogeneous environments where deterministic code, stochastic models, and autonomous agents coexist. We show that classical distributed computing models form a zero-ambiguity special case of this participant-general model. We do not argue that deterministic systems disappear; rather, deterministic execution can no longer serve as the universal participant assumption for autonomous infrastructure. Finally, we outline five architectural pillars of post-deterministic infrastructure: Protocol-Driven Development, Verifiable Agentic Infrastructure, Autonomous State Control Planes, Semantic Quorum Assurance, and Epistemic State Replication. Epistemic State Replication extends persistence and consistency models from data visibility to knowledge visibility, enabling agentic memory, Verifiable Semantic Rollback, and coherence across reasoning participants. We also define a taxonomy of failure classes that arise in this setting.

URL PDF HTML ☆

赞 0 踩 0

2606.01720 2026-06-02 cs.LG

A Note on Stability for Orthogonalized Matrix Momentum with Client Sampling

关于带客户端采样的正交化矩阵动量的稳定性注记

Da Chang, Qiankun Shi, Lvgang Zhang, Yu Li, Ruijie Zhang

发表机构 * University of Chinese Academy of Sciences（中国科学院大学）； Sun Yat-sen University（中山大学）； Southern University of Science and Technology（南方科技大学）； George Washington University（乔治华盛顿大学）

AI总结研究带客户端采样的分布式矩阵优化中正交化动量更新的有限样本泛化界，通过耦合邻域稳定性递归和加权集中步骤导出上尾保证。

详情

AI中文摘要

我们研究了带矩阵值参数和正交化动量更新的客户端采样分布式优化方案的有限样本泛化。核心量是当每轮只有一部分客户端参与时，返回模型上总体目标与经验目标之间的差距。在独立异构客户端数据、不等本地样本计数和固定聚合权重下，我们通过耦合邻域稳定性递归和加权集中步骤导出了有限轮上尾保证。该界限通过放大因子 $Y_i(\mathcal C)$ 保留客户端选择计数；在均匀全参与全批次情况下，当控制依赖于时间范围的放大项时，它产生 $\widetilde{\mathcal O}(n^{-1}+n^{-1/2})$ 的缩放。矩阵正交化规则要求沿配对轨迹是Lipschitz的，该条件由正则化极型映射和归一化有限步Newton-Schulz正交化器满足。对于未正则化的矩阵符号，相同的论证需要耦合谱分离，而高斯平滑给出了有限轮平滑变体。一个一维反例说明了为什么间隙、平滑或正则性条件是必要的。

英文摘要

We study finite-sample generalization for a client-sampled distributed optimization scheme with matrix-valued parameters and orthogonalized momentum updates. The central quantity is the gap between the population and empirical objectives at the returned model when only a subset of clients participates in each round. Under independent heterogeneous client data, unequal local sample counts, and fixed aggregation weights, we derive a finite-round upper-tail guarantee from a coupled-neighbor stability recursion and a weighted concentration step. The bound keeps the client-selection counts through the amplification factor $Y_i(\mathcal C)$; in the uniform full-participation full-batch regime, it yields $\widetilde{\mathcal O}(n^{-1}+n^{-1/2})$ scaling whenever the horizon-dependent amplification terms are controlled. The matrix-orthogonalization rule is required to be Lipschitz along paired trajectories, a condition satisfied by regularized polar-type maps and normalized finite-step Newton--Schulz orthogonalizers. For the unregularized matrix sign, the same argument requires coupled spectral separation, whereas Gaussian smoothing gives a finite-round smoothed variant. A one-dimensional counterexample shows why a gap, smoothing, or regularity condition is necessary.

URL PDF HTML ☆

赞 0 踩 0

2606.01719 2026-06-02 cs.LG cs.AI cs.CR

Fair Finetuning Mitigates Distribution Inference Attacks

公平微调缓解分布推断攻击

Rakshit Naidu

发表机构 * Rakshit Naidu

AI总结提出公平微调（FFt）方法，通过在等几率约束下对互补分布样本进行微调，将模型公平性指标与分布推断攻击中的对抗优势联系起来，并给出理论界限，实验证明能有效降低攻击成功率。

Comments 16 pages (11 main, 5 appendix)

详情

AI中文摘要

在敏感数据上训练的机器学习模型可能会无意中泄露其训练分布的群体级信息——这种威胁被称为分布推断攻击（DIA）。具有黑盒访问权限的对手可以在不直接观察任何训练数据的情况下推断敏感的人口统计属性，如子群比例。尽管已经提出了差分隐私和属性遗忘等防御措施，但公平性约束与分布泄漏之间的联系尚未被探索。我们提出了公平微调（FFt）：在等几率（EO）约束下，对来自互补分布的样本进行微调。我们提供了完整的理论刻画，证明了紧界 $ ext{Adv}(\mathcal{A},M_f) \le Δ_{ ext{EO}} \cdot W$，其中 $W$ 量化了两个训练分布通过其敏感属性组成的可区分程度。我们还建立了FFt降低对抗优势的必要条件，并证明了该界的紧性。我们在六个数据集上进行了评估，涵盖表格数据（ACS Income、COMPAS、German Credit）、图像数据（UTKFaces）和自然语言处理数据（Bias in Bios）。基于重演的FFt在所有设置中一致地将对抗准确率差距降低到检测阈值 $τ=0.1$ 以下；在ACS Income上，差距从约15%下降到4%以下。我们的工作提供了第一个将模型测量的EO差异直接与其在DIA博弈中的对抗优势联系起来的正式界限，为统一的公平性和隐私防御开辟了新途径。

英文摘要

Machine learning models trained on sensitive data can inadvertently leak population-level information about their training distributions -- a threat known as distribution inference attack (DIA). An adversary with black-box access can infer sensitive demographic properties, such as subgroup proportions, without observing any training data directly. While defenses such as differential privacy and property unlearning have been proposed, the link between fairness constraints and distributional leakage remains unexplored. We propose Fair Fine-tuning (FFt): a trained model is fine-tuned on samples from the complementary distribution under an Equalized Odds (EO) constraint. We provide a complete theoretical characterization, proving the tight bound $\text{Adv}(\mathcal{A},M_f) \le Δ_{\text{EO}} \cdot W$, where $W$ quantifies how distinguishable the two training distributions are by their sensitive-attribute composition. We also establish a necessary condition for FFt to reduce adversarial advantage and prove tightness of the bound. We evaluate across six datasets spanning tabular (ACS Income, COMPAS, German Credit), image (UTKFaces), and NLP (Bias in Bios) modalities. Rehearsal-based FFt consistently reduces the adversarial accuracy gap below the detection threshold $τ!=!0.1$ across all settings; on ACS Income, the gap falls from $\sim!15%$ to under $4%$. Our work provides the first formal bound connecting a model's measured EO disparity directly to its adversarial advantage in the DIA game, opening a new avenue for unified fairness-and-privacy defenses.

URL PDF HTML ☆

赞 0 踩 0