arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21516
1803.07226 2026-06-04 cs.CV cs.DS cs.NA math.NA

Learning the Hierarchical Parts of Objects by Deep Non-Smooth Nonnegative Matrix Factorization

通过深度非光滑非负矩阵分解学习物体的层次部分

Jinshi Yu, Guoxu Zhou, Andrzej Cichocki, Shengli Xie

发表机构 * RIKEN(日本理化学研究所) SKOLTECH(莫斯科SKOLTECH)

AI总结 本文提出深度非光滑非负矩阵分解方法,通过更深层架构学习复杂数据的层次特征,结合非负约束生成部分特征并提取更高层次抽象特征,实验表明其在聚类分析中表现优异。

详情
AI中文摘要

非光滑非负矩阵分解(nsNMF)能够产生更局部化、重叠更少的特征表示,同时保持对数据的良好拟合。然而,nsNMF及其他现有NMF方法由于其浅层结构无法学习复杂数据的层次特征。为填补这一空白,本文提出了一种深度nsNMF方法,其架构比标准nsNMF更深入。深度nsNMF不仅由于非负约束生成部分特征,还通过组合低层特征生成更高层次的抽象特征。深入描述了深度架构如何帮助在dnsNMF中高效发现抽象特征。此外,本文还表明深度nsNMF与深度自编码器有密切关系,表明所提模型继承了深度学习和NMF的主要优势。大量实验表明,所提方法在聚类分析中表现出色。

英文摘要

Nonsmooth Nonnegative Matrix Factorization (nsNMF) is capable of producing more localized, less overlapped feature representations than other variants of NMF while keeping satisfactory fit to data. However, nsNMF as well as other existing NMF methods is incompetent to learn hierarchical features of complex data due to its shallow structure. To fill this gap, we propose a deep nsNMF method coined by the fact that it possesses a deeper architecture compared with standard nsNMF. The deep nsNMF not only gives parts-based features due to the nonnegativity constraints, but also creates higher-level, more abstract features by combing lower-level ones. The in-depth description of how deep architecture can help to efficiently discover abstract features in dnsNMF is presented. And we also show that the deep nsNMF has close relationship with the deep autoencoder, suggesting that the proposed model inherits the major advantages from both deep learning and NMF. Extensive experiments demonstrate the standout performance of the proposed method in clustering analysis.

1709.05363 2026-06-04 cs.RO cs.GT cs.SY eess.SY

Synthesis of surveillance strategies via belief abstraction

通过信念抽象合成监视策略

Suda Bharadwaj, Rayna Dimitrova, Ufuk Topcu

发表机构 * University of Leicester(莱斯特大学)

AI总结 本文研究了合成具有监视目标的机器人控制器问题,通过将问题建模为单侧部分信息游戏,并利用抽象技术减少状态空间爆炸,从而实现监视策略的合成。

详情
AI中文摘要

我们研究了合成具有监视目标的机器人控制器问题,即机器人需维持对移动目标位置的了解。我们将该问题建模为单侧部分信息游戏,其中代理的胜利条件由时序逻辑公式指定。该规范形式化了用户提供的监视需求,包括额外的非监视任务。为了合成满足规范的监视策略,我们将部分信息游戏转换为完美信息游戏,利用抽象技术缓解此类转换通常导致的指数级状态空间爆炸。这使得可以使用现成的工具进行反应合成。我们使用反例引导的细化技术自动实现足够的抽象精度以合成监视策略。我们在两个案例研究中评估了所提出的方法,展示了其在大规模状态空间和多样化需求中的适用性。

英文摘要

We study the problem of synthesizing a controller for a robot with a surveillance objective, that is, the robot is required to maintain knowledge of the location of a moving, possibly adversarial target. We formulate this problem as a one-sided partial-information game in which the winning condition for the agent is specified as a temporal logic formula. The specification formalizes the surveillance requirement given by the user, including additional non-surveillance tasks. In order to synthesize a surveillance strategy that meets the specification, we transform the partial-information game into a perfect-information one, using abstraction to mitigate the exponential blow-up typically incurred by such transformations. This enables the use of off-the-shelf tools for reactive synthesis. We use counterexample-guided refinement to automatically achieve abstraction precision that is sufficient to synthesize a surveillance strategy. We evaluate the proposed method on two case-studies, demonstrating its applicability to large state-spaces and diverse requirements.

1703.02660 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY

Towards Generalization and Simplicity in Continuous Control

连续控制中的泛化与简洁性

Aravind Rajeswaran, Kendall Lowrey, Emanuel Todorov, Sham Kakade

发表机构 * University of Washington(华盛顿大学)

AI总结 本文展示简单线性与RBF参数化策略可解决多种连续控制任务,性能可与更复杂网络相媲美,且多样初始化提升泛化能力。

Comments NIPS 2017, Project page: https://sites.google.com/view/simple-pol

详情
AI中文摘要

本文表明,简单线性及RBF参数化策略可训练解决多种连续控制任务,包括OpenAI Gym基准。这些策略性能可与更复杂参数化方法相媲美。现有训练测试场景受限且易过拟合,导致仅轨迹中心策略。多样初始化产生更具全局性的策略,允许系统在大扰动下恢复,如补充视频所示。

英文摘要

This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies are competitive with state of the art results, obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, existing training and testing scenarios are shown to be very limited and prone to over-fitting, thus giving rise to only trajectory-centric policies. Training with a diverse initial state distribution is shown to produce more global policies with better generalization. This allows for interactive control scenarios where the system recovers from large on-line perturbations; as shown in the supplementary video.

1803.05026 2026-06-04 cs.LG cs.CV cs.IT cs.NA math.IT math.NA

Principal Component Analysis with Tensor Train Subspace

张量列车子空间下的主成分分析

Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

发表机构 * Purdue University(普渡大学)

AI总结 本文提出TT-PCA算法,通过保持低秩张量结构来估计结构化的张量列车子空间,相比PCA和Tucker-PCA更具鲁棒性,实验验证其有效性。

详情
AI中文摘要

张量列车是一种分层张量网络结构,通过参数化大规模多维数据集来缓解维度灾难。本文提出TT-PCA算法,用于从给定数据中估计这种结构化的张量列车子空间。通过保持低秩张量结构,TT-PCA比PCA或Tucker-PCA更具鲁棒性,这在测试扩展YaleFace数据集B时得到了数值验证。

英文摘要

Tensor train is a hierarchical tensor network structure that helps alleviate the curse of dimensionality by parameterizing large-scale multidimensional data via a set of network of low-rank tensors. Associated with such a construction is a notion of Tensor Train subspace and in this paper we propose a TT-PCA algorithm for estimating this structured subspace from the given data. By maintaining low rank tensor structure, TT-PCA is more robust to noise comparing with PCA or Tucker-PCA. This is borne out numerically by testing the proposed approach on the Extended YaleFace Dataset B.

1803.02553 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Graph Learning from Filtered Signals: Graph System and Diffusion Kernel Identification

基于滤波信号的图学习:图系统与扩散核识别

Hilmi E. Egilmez, Eduardo Pavez, Antonio Ortega

发表机构 * Department of Electrical Engineering, University of Southern California(电气工程系,南加州大学)

AI总结 本文提出一种新的图信号处理框架,用于从滤波信号类中构建图模型。通过将图建模问题转化为图系统识别问题,学习加权图(图拉普拉斯矩阵)和图滤波器(图拉普拉斯矩阵函数)。算法能从多信号观测中联合识别图和图滤波器,适用于学习扩散核,并在真实气候数据集上验证了其有效性。

Comments Submitted to IEEE Trans. on Signal and Information Processing over Networks (13 pages)

详情
AI中文摘要

本文介绍了一种新的图信号处理框架,用于从滤波信号类中构建图模型。在我们的框架中,图建模被公式化为图系统识别问题,目标是学习加权图(图拉普拉斯矩阵)和图滤波器(图拉普拉斯矩阵函数)。为了求解提出的问题,开发了一种算法,从多个信号/数据观测中联合识别图和图滤波器(GBF)。我们的算法在GBF是一一对应函数的假设下有效。所提出的方法可以应用于学习扩散(热)核,这些核在各种领域中用于建模扩散过程。此外,对于特定的图滤波器选择,所提出的问题减少为图拉普拉斯估计问题。我们的实验结果表明,所提出算法优于当前最先进的方法。我们还实现了该框架在一个真实气候数据集上,用于温度信号建模。

英文摘要

This paper introduces a novel graph signal processing framework for building graph-based models from classes of filtered signals. In our framework, graph-based modeling is formulated as a graph system identification problem, where the goal is to learn a weighted graph (a graph Laplacian matrix) and a graph-based filter (a function of graph Laplacian matrices). In order to solve the proposed problem, an algorithm is developed to jointly identify a graph and a graph-based filter (GBF) from multiple signal/data observations. Our algorithm is valid under the assumption that GBFs are one-to-one functions. The proposed approach can be applied to learn diffusion (heat) kernels, which are popular in various fields for modeling diffusion processes. In addition, for specific choices of graph-based filters, the proposed problem reduces to a graph Laplacian estimation problem. Our experimental results demonstrate that the proposed algorithm outperforms the current state-of-the-art methods. We also implement our framework on a real climate dataset for modeling of temperature signals.

1709.04407 2026-06-04 cs.RO cs.LG cs.SY eess.SY

An Inversion-Based Learning Approach for Improving Impromptu Trajectory Tracking of Robots with Non-Minimum Phase Dynamics

基于逆向学习的方法用于改进具有非最小相位动态的机器人即兴轨迹跟踪

Siqi Zhou, Mohamed K. Helwa, Angela P. Schoellig

发表机构 * Dynamic Systems Lab(动态系统实验室) Institute for Aerospace Studies(航空航天研究 institute) University of Toronto(多伦多大学) Cairo University(开罗大学)

AI总结 本文提出一种基于学习的方法,用于改进非最小相位系统的即兴轨迹跟踪,通过直接从输入输出数据学习稳定近似逆向,验证了方法的稳定性与高精度跟踪效果。

Comments Accepted for publication in the IEEE Robotics and Automation Letters (RA-L), July 2018

详情
AI中文摘要

本文提出了一种基于学习的方法,用于非最小相位系统的即兴轨迹跟踪。逆向前馈方法常用于提高跟踪性能,但无法直接应用于非最小相位系统,因为其固有不稳定。为解决此问题,现有方法假设系统模型已知,并使用预动作或逆向近似技术。本文提出了一种从输入输出数据直接学习稳定近似逆向的方法。通过理论讨论、模拟和两种不同平台的实验,展示了所提方法的稳定性及其在高精度即兴跟踪中的有效性。此外,本文还表明,在训练中包含更多信息,尽管通常被认为有用,但未必能提高性能,反而可能引发不稳定性并影响整体方法的效果。

英文摘要

This paper presents a learning-based approach for impromptu trajectory tracking for non-minimum phase systems, i.e., systems with unstable inverse dynamics. Inversion-based feedforward approaches are commonly used for improving tracking performance; however, these approaches are not directly applicable to non-minimum phase systems due to their inherent instability. In order to resolve the instability issue, existing methods have assumed that the system model is known and used pre-actuation or inverse approximation techniques. In this work, we propose an approach for learning a stable, approximate inverse of a non-minimum phase baseline system directly from its input-output data. Through theoretical discussions, simulations, and experiments on two different platforms, we show the stability of our proposed approach and its effectiveness for high-accuracy, impromptu tracking. Our approach also shows that including more information in the training, as is commonly assumed to be useful, does not lead to better performance but may trigger instability and impact the effectiveness of the overall approach.

1802.10275 2026-06-04 cs.LG cs.NA math.NA stat.ML

Solving for high dimensional committor functions using artificial neural networks

利用人工神经网络求解高维承诺函数

Yuehaw Khoo, Jianfeng Lu, Lexing Ying

发表机构 * Department of Mathematics, Stanford University(数学系,斯坦福大学) Department of Mathematics, Department of Chemistry and Department of Physics, Duke University(数学系、化学系和物理系,杜克大学) Department of Mathematics and ICME, Stanford University(数学系和ICME,斯坦福大学)

AI总结 本文提出基于人工神经网络的方法,用于研究由随机过程支配的状态转换。通过变分公式和神经网络参数化,获得高维Fokker-Planck方程的承诺函数数值解,证明在高维问题中可实现中等精度。

Comments 12 pages, 6 figures

详情
AI中文摘要

在本注释中,我们提出了一种基于人工神经网络的方法,用于研究由随机过程支配的状态转换。特别是,我们旨在为过渡路径理论的核心对象——承诺函数,设计数值方案,该函数满足高维Fokker-Planck方程。通过处理此类偏微分方程的变分公式,并将承诺函数参数化为神经网络,可以利用随机算法优化神经网络权重来获得近似解。数值示例表明,对于高维问题可以实现中等精度。

英文摘要

In this note we propose a method based on artificial neural network to study the transition between states governed by stochastic processes. In particular, we aim for numerical schemes for the committor function, the central object of transition path theory, which satisfies a high-dimensional Fokker-Planck equation. By working with the variational formulation of such partial differential equation and parameterizing the committor function in terms of a neural network, approximations can be obtained via optimizing the neural network weights using stochastic algorithms. The numerical examples show that moderate accuracy can be achieved for high-dimensional problems.

1708.08113 2026-06-04 cs.AI cs.SY eess.SY

Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks

新颖的传感器调度方案用于能量高效的入侵者跟踪

Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

发表机构 * Department of Computer Science and Automation, Indian Institute of Science(计算机科学与自动化系,印度科学研究院)

AI总结 本文提出基于POMDP的强化学习算法,用于在能量受限下高效跟踪入侵者,通过UCT方法实现状态和动作空间的扩展,验证了算法在大规模问题中的有效性。

详情
AI中文摘要

我们考虑使用无线传感器网络跟踪入侵者的问题。在每个时间点,必须确定最优数量和正确配置的传感器以供电,因为供电消耗能量,因此需要在准确跟踪入侵者位置和传感器能耗之间进行权衡。该问题在部分可观测马尔可夫决策过程(POMDP)框架中进行建模。即使对于文献中的最先进算法,维度灾难使问题难以处理。本文在POMDP框架下将入侵检测(ID)问题建模为合适的状态-动作空间,并开发一种利用上置信树搜索(UCT)方法的强化学习(RL)算法来解决ID问题。通过仿真,我们证明了我们的算法在状态和动作空间增大时表现良好且可扩展。

英文摘要

We consider the problem of tracking an intruder using a network of wireless sensors. For tracking the intruder at each instant, the optimal number and the right configuration of sensors has to be powered. As powering the sensors consumes energy, there is a trade off between accurately tracking the position of the intruder at each instant and the energy consumption of sensors. This problem has been formulated in the framework of Partially Observable Markov Decision Process (POMDP). Even for the state-of-the-art algorithm in the literature, the curse of dimensionality renders the problem intractable. In this paper, we formulate the Intrusion Detection (ID) problem with a suitable state-action space in the framework of POMDP and develop a Reinforcement Learning (RL) algorithm utilizing the Upper Confidence Tree Search (UCT) method to solve the ID problem. Through simulations, we show that our algorithm performs and scales well with the increasing state and action spaces.

1802.08138 2026-06-04 cs.AI cs.GT cs.SY eess.SY

Reliable Intersection Control in Non-cooperative Environments

非合作环境中的可靠交叉口控制

Muhammed O. Sayin, Chung-Wei Lin, Shinichi Shiraishi, Tamer Başar

发表机构 * University of Illinois at Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校) Toyota InfoTechnology Center(丰田信息技术中心)

AI总结 本文提出一种可靠交叉口控制机制,用于非合作环境中的战略自主和联网车辆。通过分析车辆的战略行为,确定纳什均衡,并识别社会最优均衡以实现公平分配。

Comments Extended version (including proofs of theorems and lemmas) of the paper: M. O. Sayin, C.-W. Lin, S. Shiraishi, and T. Basar, "Reliable intersection control in non-cooperative environments", to appear in the Proceedings of American Control Conference, 2018

详情
AI中文摘要

我们提出了一种可靠的交叉口控制机制,用于战略自主和联网车辆(智能体)在非合作环境中。每个智能体可以获取其最早可能和期望的通过时间,并向交叉口管理者报告通过时间,管理者按先到先得的原则分配交叉口时间。然而,智能体可能有冲突利益并采取策略性行为。为此,我们分析智能体的战略行为,并为所有可能场景制定纳什均衡。此外,在所有纳什均衡中,我们识别出一个社会最优均衡,以实现公平的交叉口分配,并相应地描述一种策略证明的交叉口机制,该机制实现了可靠的交叉口控制,使得策略性智能体没有动机策略性地报告他们的通过时间。

英文摘要

We propose a reliable intersection control mechanism for strategic autonomous and connected vehicles (agents) in non-cooperative environments. Each agent has access to his/her earliest possible and desired passing times, and reports a passing time to the intersection manager, who allocates the intersection temporally to the agents in a First-Come-First-Serve basis. However, the agents might have conflicting interests and can take actions strategically. To this end, we analyze the strategic behaviors of the agents and formulate Nash equilibria for all possible scenarios. Furthermore, among all Nash equilibria we identify a socially optimal equilibrium that leads to a fair intersection allocation, and correspondingly we describe a strategy-proof intersection mechanism, which achieves reliable intersection control such that the strategic agents do not have any incentive to misreport their passing times strategically.

1802.07346 2026-06-04 cs.RO cs.SY eess.SP eess.SY stat.AP

Cooperative Robot Localization Using Event-triggered Estimation

基于事件触发估计的协作机器人定位

Michael Ouimet, David Iglesias, Nisar Ahmed, Sonia Martinez

发表机构 * SPAWAR Systems Center Pacific(SPAWAR太平洋系统中心) University of Colorado Boulder(科罗拉多大学博尔德分校) University of California San Diego(加州大学圣地亚哥分校)

AI总结 本文提出一种低通信开销的协作定位算法,通过事件触发机制仅在状态估计创新度高时发送测量,结合协方差交叠机制实现网络状态同步,实验验证了其在多种动态模型下的高效定位性能。

Comments Revised submission in review with AIAA Journal of Aerospace Information Systems (JAIS), submitted February 17, 2018

详情
AI中文摘要

本文描述了一种新颖的低通信开销协作定位算法,适用于移动无人机器人团队。利用基于事件的估计范式,机器人仅在状态估计创新度较高时向邻居发送测量。由于代理已知触发测量的条件,缺少的测量信息也被融合到状态估计中。机器人使用协方差交叠(CI)机制偶尔同步其对完整网络状态的局部估计。此外,启发式平衡动态确保在大直径网络中,局部误差协方差始终保持在预期范围内。线性和非线性动态/测量模型的仿真表明,事件触发方法在广泛的操作条件下实现了接近最优的状态估计性能,即使仅使用传统全数据共享所需通信开销的小部分。还检验了所提方法对丢包通信的鲁棒性以及网络拓扑与基于CI的同步需求之间的关系。

英文摘要

This paper describes a novel communication-spare cooperative localization algorithm for a team of mobile unmanned robotic vehicles. Exploiting an event-based estimation paradigm, robots only send measurements to neighbors when the expected innovation for state estimation is high. Since agents know the event-triggering condition for measurements to be sent, the lack of a measurement is thus also informative and fused into state estimates. The robots use a Covariance Intersection (CI) mechanism to occasionally synchronize their local estimates of the full network state. In addition, heuristic balancing dynamics on the robots' CI-triggering thresholds ensure that, in large diameter networks, the local error covariances remains below desired bounds across the network. Simulations on both linear and nonlinear dynamics/measurement models show that the event-triggering approach achieves nearly optimal state estimation performance in a wide range of operating conditions, even when using only a fraction of the communication cost required by conventional full data sharing. The robustness of the proposed approach to lossy communications, as well as the relationship between network topology and CI-based synchronization requirements, are also examined.

1702.03258 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Adaptive and Resilient Soft Tensegrity Robots

自适应且具有韧性的软 tensegrity 机器人

John Rieffel, Jean-Baptiste Mouret

发表机构 * Union College(联合学院) Inria Nancy Grand - Est CNRS, Loria, UMR 7503(法国国家科学研究中心(CNRS)、洛里亚实验室(Loria)、UMR 7503)

AI总结 本文提出一种易于组装的基于 tensegrity 的软机器人,能产生高动态运动步态,并在物理损伤下表现出结构和行为韧性,通过机器学习算法实现有效步态发现。

Comments video: https://youtu.be/SuLQDhrk9tQ

详情
AI中文摘要

生物体结合软(如肌肉)和硬(如骨骼)材料,赋予其内在灵活性和韧性,这在传统刚性机器人中往往缺失。新兴的软机器人领域试图利用这些特性创造韧性机器。然而,软材料的性质给设计、制造和控制带来了重大挑战,迄今为止,大多数软机器人的步态都是通过经验试错法手动设计的。本文描述了一种易于组装的基于 tensegrity 的软机器人,能够产生高度动态的运动步态,并在面对物理损伤时表现出结构和行为的韧性。使这一成果成为可能的是使用一种机器学习算法,能够以最少的物理试验发现有效的步态。这些结果进一步支持了软机器人方法,旨在利用复杂材料动力学的相互作用,以生成丰富的动态行为。

英文摘要

Living organisms intertwine soft (e.g., muscle) and hard (e.g., bones) materials, giving them an intrinsic flexibility and resiliency often lacking in conventional rigid robots. The emerging field of soft robotics seeks to harness these same properties in order to create resilient machines. The nature of soft materials, however, presents considerable challenges to aspects of design, construction, and control -- and up until now, the vast majority of gaits for soft robots have been hand-designed through empirical trial-and-error. This manuscript describes an easy-to-assemble tensegrity-based soft robot capable of highly dynamic locomotive gaits and demonstrating structural and behavioral resilience in the face of physical damage. Enabling this is the use of a machine learning algorithm able to discover effective gaits with a minimal number of physical trials. These results lend further credence to soft-robotic approaches that seek to harness the interaction of complex material dynamics in order to generate a wealth of dynamical behaviors.

1802.06314 2026-06-04 cs.RO cs.AI cs.SY eess.SY

Autonomous Vehicle Speed Control for Safe Navigation of Occluded Pedestrian Crosswalk

自动驾驶车辆速度控制:安全通过遮挡人行横道

Sarah Thornton

发表机构 * Dynamic Design Lab(动态设计实验室)

AI总结 本文提出基于部分可观测马尔可夫决策过程的速度控制方法,用于安全通过遮挡人行横道,通过动态规划计算控制策略以应对感知限制。

Comments 6 pages, 9 figures

详情
AI中文摘要

人类和自动驾驶车辆传感器的感知能力有限。当这些限制与涉及易受伤害道路使用者的场景重合时,必须在运动规划器中考虑这些限制。在遮挡人行横道的场景中,接近车辆的速度应是道路上不确定性量的函数。在本工作中,纵向控制器被建模为部分可观测马尔可夫决策过程,并使用动态规划计算控制策略。该控制策略将速度剖面传递给模型预测转向控制器。

英文摘要

Both humans and the sensors on an autonomous vehicle have limited sensing capabilities. When these limitations coincide with scenarios involving vulnerable road users, it becomes important to account for these limitations in the motion planner. For the scenario of an occluded pedestrian crosswalk, the speed of the approaching vehicle should be a function of the amount of uncertainty on the roadway. In this work, the longitudinal controller is formulated as a partially observable Markov decision process and dynamic programming is used to compute the control policy. The control policy scales the speed profile to be used by a model predictive steering controller.

1705.08435 2026-06-04 cs.LG cs.CR cs.DC cs.SY eess.SY stat.ML

Personalized and Private Peer-to-Peer Machine Learning

个性化与隐私保护的点对点机器学习

Aurélien Bellet, Rachid Guerraoui, Mahsa Taziki, Marc Tommasi

发表机构 * INRIA EPFL(瑞士联邦理工学院) Université de Lille(里尔大学)

AI总结 本文提出一种高效算法,实现去中心化且异步的个性化机器学习,在强隐私要求下保证收敛性。通过差分隐私保护数据隐私,并分析隐私与效用的平衡。实验表明,在非隐私情况下优于先前方法,隐私约束下可显著提升模型性能。

Comments 20 pages, to appear in the Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS 2018)

详情
AI中文摘要

随着连接个人设备的兴起和隐私问题的出现,需要能够利用大量代理数据学习个性化模型的机器学习算法,同时满足严格的隐私要求。本文介绍了一种高效的算法,以完全去中心化(点对点)和异步方式解决上述问题,并具有可证明的收敛速度。我们展示了如何使算法具有差分隐私性,以保护个人数据集信息的泄露,并正式分析效用与隐私之间的权衡。我们的实验表明,在非隐私情况下,我们的方法显著优于先前工作,在隐私约束下,我们可以在孤立学习的模型上取得显著改进。

英文摘要

The rise of connected personal devices together with privacy concerns call for machine learning algorithms capable of leveraging the data of a large number of agents to learn personalized models under strong privacy requirements. In this paper, we introduce an efficient algorithm to address the above problem in a fully decentralized (peer-to-peer) and asynchronous fashion, with provable convergence rate. We show how to make the algorithm differentially private to protect against the disclosure of information about the personal datasets, and formally analyze the trade-off between utility and privacy. Our experiments show that our approach dramatically outperforms previous work in the non-private case, and that under privacy constraints, we can significantly improve over models learned in isolation.

1705.01292 2026-06-04 cs.RO cs.SY eess.SY

A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems

一种用于不确定机器人系统中基于学习的控制的通用安全框架

Jaime F. Fisac, Anayo K. Akametalu, Melanie N. Zeilinger, Shahab Kaynama, Jeremy Gillula, Claire J. Tomlin

发表机构 * Department of Mechanical and Process Engineering, ETH Zurich(瑞士苏黎世联邦理工学院机械与过程工程系) Clearpath Robotics(Clearpath机器人公司) Electronic Frontier Foundation(电子前沿基金会)

AI总结 本文提出一种通用安全框架,结合Hamilton-Jacobi可达性方法和任意学习算法,通过近似系统动力学知识确保约束满足,同时减少对学习过程的干扰,并引入贝叶斯机制提升安全分析。

Comments Accepted for publication in IEEE Transactions on Automatic Control. Video with experiments: https://youtu.be/WAAxyeSk2bw

详情
AI中文摘要

基于学习的控制方案在物理世界中应用的有效性强烈推动了其在机器人系统中的使用。然而,保证学习过程中的正确操作目前仍是一个未解决的问题,这对安全关键系统至关重要。我们提出了一种基于Hamilton-Jacobi可达性方法的通用安全框架,可以与任意学习算法协同工作。该方法利用对系统动力学的近似知识来保证约束满足,同时尽可能减少对学习过程的干扰。我们进一步引入了贝叶斯机制,通过系统获取的新证据来细化安全分析,从而在适当的时候减少初始保守性,同时通过实时验证加强保证。结果是一种最不具限制性的、安全保持的控制律,仅在(a)计算的安全保证要求时或(b)由于新观察到的置信度下降时介入。我们通过概率和最坏情况分析相结合的理论方法证明了安全保证,并在四旋翼飞行器上进行了实验验证。尽管安全分析基于一个简单的质点模型,四旋翼通过策略梯度强化学习成功到达了合适的控制器,从未发生碰撞,并在飞行过程中安全地远离强外部干扰。

英文摘要

The proven efficacy of learning-based control schemes strongly motivates their application to robotic systems operating in the physical world. However, guaranteeing correct operation during the learning process is currently an unresolved issue, which is of vital importance in safety-critical systems. We propose a general safety framework based on Hamilton-Jacobi reachability methods that can work in conjunction with an arbitrary learning algorithm. The method exploits approximate knowledge of the system dynamics to guarantee constraint satisfaction while minimally interfering with the learning process. We further introduce a Bayesian mechanism that refines the safety analysis as the system acquires new evidence, reducing initial conservativeness when appropriate while strengthening guarantees through real-time validation. The result is a least-restrictive, safety-preserving control law that intervenes only when (a) the computed safety guarantees require it, or (b) confidence in the computed guarantees decays in light of new observations. We prove theoretical safety guarantees combining probabilistic and worst-case analysis and demonstrate the proposed framework experimentally on a quadrotor vehicle. Even though safety analysis is based on a simple point-mass model, the quadrotor successfully arrives at a suitable controller by policy-gradient reinforcement learning without ever crashing, and safely retracts away from a strong external disturbance introduced during flight.

1802.03981 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Spectral Filtering for General Linear Dynamical Systems

谱滤波用于通用线性动态系统

Elad Hazan, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

发表机构 * Department of Computer Science, Princeton University(普林斯顿大学计算机科学系) Google Brain(谷歌大脑) Department of Mathematics, Princeton University(普林斯顿大学数学系)

AI总结 本文提出一种多项式时间算法,用于学习无系统识别假设的隐状态线性动态系统,无需假设系统转移矩阵的谱半径。该算法扩展了谱滤波技术,通过新的凸松弛方法高效识别相位。

详情
AI中文摘要

我们提出了一种多项式时间算法,用于学习隐状态线性动态系统,而无需系统识别,也不假设系统转移矩阵的谱半径。该算法扩展了最近引入的谱滤波技术,该技术先前仅应用于具有对称转移矩阵的系统,通过新的凸松弛方法允许高效识别相位。

英文摘要

We give a polynomial-time algorithm for learning latent-state linear dynamical systems without system identification, and without assumptions on the spectral radius of the system's transition matrix. The algorithm extends the recently introduced technique of spectral filtering, previously applied only to systems with a symmetric transition matrix, using a novel convex relaxation to allow for the efficient identification of phases.

1605.00031 2026-06-04 cs.LG cs.CV cs.NA math.NA stat.ML

Deep Convolutional Neural Networks on Cartoon Functions

深度卷积神经网络在卡通函数上的应用

Philipp Grohs, Thomas Wiatowski, Helmut Bölcskei

发表机构 * 1 Dept. Math., ETH Zurich, Switzerland

AI总结 本文研究深度卷积神经网络在卡通函数上的变形稳定性,提出考虑结构特性的新结果,适用于具有尖锐和弯曲不连续性的信号。

Comments This is a slightly updated version of the paper published in the ISIT proceedings. Specifically, we corrected errors in the arguments on the volume of tubes. Note that this correction does not affect the main statements of the paper

详情
Journal ref
Proc. of IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, pp. 1163-1167, July 2016
AI中文摘要

Wiatowski和Bölcskei, 2015证明了深度卷积神经网络基于的特征提取器的变形稳定性和垂直平移不变性由网络结构本身保证,而非特定卷积核和非线性。虽然平移不变性结果适用于平方可积函数,变形稳定性界仅适用于带限函数。然而,许多实际相关信号(如自然图像)表现出尖锐和弯曲的不连续性,因此不是带限的。本文的主要贡献是针对Donoho, 2001引入的卡通函数类建立变形稳定性界。

英文摘要

Wiatowski and Bölcskei, 2015, proved that deformation stability and vertical translation invariance of deep convolutional neural network-based feature extractors are guaranteed by the network structure per se rather than the specific convolution kernels and non-linearities. While the translation invariance result applies to square-integrable functions, the deformation stability bound holds for band-limited functions only. Many signals of practical relevance (such as natural images) exhibit, however, sharp and curved discontinuities and are, hence, not band-limited. The main contribution of this paper is a deformation stability result that takes these structural properties into account. Specifically, we establish deformation stability bounds for the class of cartoon functions introduced by Donoho, 2001.

1705.07364 2026-06-04 cs.LG cs.CV cs.NA math.NA

Stabilizing Adversarial Nets With Prediction Methods

用预测方法稳定对抗网络

Abhay Yadav, Sohil Shah, Zheng Xu, David Jacobs, Tom Goldstein

发表机构 * University of Maryland, College Park(马里兰大学 College Park 分校)

AI总结 本文提出一种改进的随机梯度下降方法,通过稳定对抗网络的训练过程,使其更可靠地收敛到鞍点,提高训练稳定性与效率。

Comments Accepted at ICLR 2018

详情
AI中文摘要

对抗神经网络在数据科学中解决了很多重要问题,但训练却极具挑战性。这些困难源于对抗网络的最优权重对应于损失函数的鞍点而非极小值。通常用于此类问题的交替随机梯度方法难以可靠收敛到鞍点,且当收敛时对学习率极为敏感。本文提出一种简单的随机梯度下降修改方法,以稳定对抗网络。理论和实践中均表明,所提方法可靠收敛到鞍点,并在更宽的训练参数范围内保持稳定。这使对抗网络更少出现'崩溃'现象,并允许使用更大的学习率进行更快的训练。

英文摘要

Adversarial neural networks solve many important problems in data science, but are notoriously difficult to train. These difficulties come from the fact that optimal weights for adversarial nets correspond to saddle points, and not minimizers, of the loss function. The alternating stochastic gradient methods typically used for such problems do not reliably converge to saddle points, and when convergence does happen it is often highly sensitive to learning rates. We propose a simple modification of stochastic gradient descent that stabilizes adversarial networks. We show, both in theory and practice, that the proposed method reliably converges to saddle points, and is stable with a wider range of training parameters than a non-prediction method. This makes adversarial networks less likely to "collapse," and enables faster training with larger learning rates.

1802.00922 2026-06-04 cs.RO cs.NI cs.OS cs.SY eess.SY

Realizing Uncertainty-Aware Timing Stack in Embedded Operating System

在嵌入式操作系统中实现不确定性感知的定时栈

Amr Alanwar, Fatima M. Anwar, Joao P Hespanha, Mani Srivastava

发表机构 * University of California(加州大学) University of California, Santa Barbara(加州大学圣巴巴拉分校) Los Angeles(洛杉矶)

AI总结 本文提出了一种基于卡尔曼滤波的时间同步协议,用于在嵌入式系统中处理时间不确定性,通过标准嵌入式Linux平台实现不确定性感知的时钟模型。

Comments In Proc. of the Embedded Operating Systems Workshop, 2016

详情
AI中文摘要

时间感知对于运行在商用平台和操作系统上的广泛新兴应用至关重要,包括网络物理系统和物联网。传统上,通过最佳努力的后台服务在设备间同步时间,其性能既不可观察也不可控制,从而消耗系统资源,而不管应用需求如何,同时不允许应用程序和操作系统服务适应系统时间中的不确定性变化。本文提倡重新思考系统堆栈中时间的管理方式。在本文中,我们提出了一种新的时钟模型,该模型真实地描述了各种时间不确定性的来源。然后,我们提出了一种基于卡尔曼滤波的时间同步协议,该协议能够适应由时钟模型暴露的不确定性。我们对不确定性感知时钟模型和同步协议的实现是基于标准嵌入式Linux平台的。

英文摘要

Time awareness is critical to a broad range of emerging applications -- in Cyber-Physical Systems and Internet of Things -- running on commodity platforms and operating systems. Traditionally, time is synchronized across devices through a best-effort background service whose performance is neither observable nor controllable, thus consuming system resources independently of application needs while not allowing the applications and OS services to adapt to changes in uncertainty in system time. We advocate for rethinking how time is managed in a system stack. In this paper, we propose a new clock model that characterizes various sources of timing uncertainties in true time. We then present a Kalman filter based time synchronization protocol that adapts to the uncertainties exposed by the clock model. Our realization of a uncertainty-aware clock model and synchronization protocol is based on a standard embedded Linux platform.

1704.00227 2026-06-04 cs.LG cs.NA math.NA

Online and Stable Learning of Analysis Operators

在线和稳定的分析算子学习

Michael Sandbichler, Karin Schnass

发表机构 * Department of Mathematics, University of Innsbruck(因斯布鲁克大学数学系)

AI总结 本文提出四种在线学习分析算子的迭代算法,基于优化原则,改进了分析K-SVD和分析SimCO,通过投影梯度下降、隐式欧拉方案和奇异值策略,在合成和图像数据上表现出更好的恢复率和更快的收敛速度。

Comments 21 pages, 12 figures, 6 tables

详情
AI中文摘要

本文提出了四种用于学习分析算子的迭代算法。它们基于分析K-SVD和分析SimCO所依赖的相同优化原则。FAOL和SAOL算法基于投影梯度下降法,使用最优步长;IAOL算法受隐式欧拉方案启发,无需选择步长;SVAOL算法采用类似分析K-SVD的策略,但避免其高计算成本。所有算法在每一步都证明能减少或保持目标函数,并提供了其平稳点的特征描述。进一步在合成和图像数据上测试,与分析SimCO相比,显示出更好的恢复率和更快的目标函数衰减。在最终的去噪实验中,所提算法的表现与最先进的ASimCO算法相当或更优。

英文摘要

In this paper four iterative algorithms for learning analysis operators are presented. They are built upon the same optimisation principle underlying both Analysis K-SVD and Analysis SimCO. The Forward and Sequential Analysis Operator Learning (FAOL and SAOL) algorithms are based on projected gradient descent with optimally chosen step size. The Implicit AOL (IAOL) algorithm is inspired by the implicit Euler scheme for solving ordinary differential equations and does not require to choose a step size. The fourth algorithm, Singular Value AOL (SVAOL), uses a similar strategy as Analysis K-SVD while avoiding its high computational cost. All algorithms are proven to decrease or preserve the target function in each step and a characterisation of their stationary points is provided. Further they are tested on synthetic and image data, compared to Analysis SimCO and found to give better recovery rates and faster decay of the objective function respectively. In a final denoising experiment the presented algorithms are again shown to perform similar to or better than the state-of-the-art algorithm ASimCO.

1801.09877 2026-06-04 cs.RO cs.SY eess.SY

On the Use of the Observability Gramian for Partially Observed Robotic Path Planning Problems

关于可观测性格拉姆矩阵在部分观测机器人路径规划问题中的应用

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系)

AI总结 本文探讨了利用可观测性格拉姆矩阵作为估计性能代理进行优化的局限性,指出其可能产生误导性的路径规划结果。

Comments 6 pages, 9 figures. CDC 2017

详情
Journal ref
2017 IEEE 56th Annual Conference on Decision and Control (CDC)
AI中文摘要

优化可观测性格拉姆矩阵作为估计性能的代理可能在存在观测不确定性的情况下提供无关或误导性的轨迹,这对路径规划提出了挑战。

英文摘要

Optimizing measures of the observability Gramian as a surrogate for the estimation performance may provide irrelevant or misleading trajectories for planning under observation uncertainty.

1705.07252 2026-06-04 cs.LG cs.NA math.NA

SVM via Saddle Point Optimization: New Bounds and Distributed Algorithms

通过鞍点优化实现SVM:新的界和分布式算法

Yifei Jin, Lingxiao Huang, Jian Li

发表机构 * Tsinghua University(清华大学) EPFL(苏黎世联邦理工学院)

AI总结 本文提出基于鞍点优化的新算法,为硬边距SVM和ν-SVM提供近线性时间复杂度的解决方案,并在分布式环境下实现高效通信。

详情
AI中文摘要

我们研究了两种重要的SVM变体:硬边距SVM(用于线性可分情况)和ν-SVM(用于线性不可分情况)。我们从鞍点优化的角度提出新算法。我们的算法在两种变体上均能实现(1-ε)近似解,运行时间为~O(nd +n√(d/ε)),其中n是点数,d是维度。目前最好的ν-SVM算法基于二次规划方法,最坏情况下需要Ω(n²d)时间~\cite{joachims1998making,platt199912}。本文为ν-SVM提供了首个近线性时间算法。硬边距SVM的最佳算法由Gilbert算法~\cite{gartner2009coresets}实现,需要O(nd/ε)时间。我们的算法将运行时间提高了√d/√ε倍。此外,我们的算法可以自然地在分布式设置中实现。我们证明我们的算法需要~O(k(d +√(d/ε)))的通信成本,其中k是客户端数量,这几乎接近理论下界。数值实验支持我们的理论,并显示我们的算法在高维、大规模和密集数据集上比先前方法收敛更快。

英文摘要

We study two important SVM variants: hard-margin SVM (for linearly separable cases) and $ν$-SVM (for linearly non-separable cases). We propose new algorithms from the perspective of saddle point optimization. Our algorithms achieve $(1-ε)$-approximations with running time $\tilde{O}(nd+n\sqrt{d / ε})$ for both variants, where $n$ is the number of points and $d$ is the dimensionality. To the best of our knowledge, the current best algorithm for $ν$-SVM is based on quadratic programming approach which requires $Ω(n^2 d)$ time in worst case~\cite{joachims1998making,platt199912}. In the paper, we provide the first nearly linear time algorithm for $ν$-SVM. The current best algorithm for hard margin SVM achieved by Gilbert algorithm~\cite{gartner2009coresets} requires $O(nd / ε)$ time. Our algorithm improves the running time by a factor of $\sqrt{d}/\sqrtε$. Moreover, our algorithms can be implemented in the distributed settings naturally. We prove that our algorithms require $\tilde{O}(k(d +\sqrt{d/ε}))$ communication cost, where $k$ is the number of clients, which almost matches the theoretical lower bound. Numerical experiments support our theory and show that our algorithms converge faster on high dimensional, large and dense data sets, as compared to previous methods.

1705.07262 2026-06-04 cs.LG cs.AI cs.NE cs.SY eess.SY

Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

批量强化学习在工业基准上的应用:初步经验

Daniel Hein, Steffen Udluft, Michel Tokic, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing

发表机构 * Technical University of Munich, Department of Informatics(慕尼黑技术大学信息学院) Siemens AG, Corporate Technology(西门子股份公司企业技术部)

AI总结 本文研究了粒子群优化策略在工业基准上的表现,展示了其在真实应用场景中的有效性,相比传统方法,PSO-P在性能和鲁棒性上表现突出。

详情
Journal ref
2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp. 4214-4221
AI中文摘要

粒子群优化策略(PSO-P)近期被引入并证明在与学术强化学习基准的非策略、批量设置中产生了显著成果。为进一步研究其在真实应用中的性质和可行性,本文在所谓的工业基准(IB)上研究PSO-P,这是一个旨在通过包含工业应用中发现的各种方面(如连续状态和动作空间、高维部分可观测状态空间、延迟效应和复杂随机性)而变得真实的新强化学习(RL)基准。PSO-P在IB上的实验结果与基于模型的递归控制神经网络(RCNN)和基于模型的神经拟合Q迭代(NFQ)推导出的闭式控制策略的结果进行比较。实验表明,PSO-P不仅对学术基准感兴趣,也对真实世界工业应用感兴趣,因为它在我们的IB设置中也产生了最佳表现的策略。与其它已建立的RL技术相比,PSO-P在性能和鲁棒性上表现出色,仅需相对较低的努力来找到合适的参数或做出复杂的设计决策。

英文摘要

The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement learning (RL) benchmark that aims at being realistic by including a variety of aspects found in industrial applications, like continuous state and action spaces, a high dimensional, partially observable state space, delayed effects, and complex stochasticity. The experimental results of PSO-P on IB are compared to results of closed-form control policies derived from the model-based Recurrent Control Neural Network (RCNN) and the model-free Neural Fitted Q-Iteration (NFQ). Experiments show that PSO-P is not only of interest for academic benchmarks, but also for real-world industrial applications, since it also yielded the best performing policy in our IB setting. Compared to other well established RL techniques, PSO-P produced outstanding results in performance and robustness, requiring only a relatively low amount of effort in finding adequate parameters or making complex design decisions.

1701.08092 2026-06-04 cs.CV cs.NA math.NA

Double-sided probing by map of Asplund's distances using Logarithmic Image Processing in the framework of Mathematical Morphology

通过使用对数图像处理的Asplund距离映射实现双面探测

Guillaume Noyel, Michel Jourlin

发表机构 * International Prevention Research Institute(国际预防研究所) Lab. H. Curien(H. Curien实验室) UMR CNRS 5516(CNRS 5516联合研究单位)

AI总结 本文在数学形态学框架下,利用对数图像处理的标量乘法建立数学形态学与探针与灰度函数之间Asplund距离映射的联系,并通过实例展示该方法在模式匹配中的应用。

Comments The final publication is available at link.springer.com

详情
Journal ref
13th International Symposium on Mathematical Morphology, ISMM 2017, May 2017, Fontainebleau, France. Springer International Publishing, pp.408-420, 2017, Mathematical Morphology and Its Applications to Signal and Image Processing: 13th International Symposium, ISMM 2017, Fontainebleau, France, May 15--17, 2017, Proceedings. http://cmm.ensmp.fr/ismm2017/
AI中文摘要

我们通过使用对数图像处理的标量乘法,建立了数学形态学与探针和灰度函数之间Asplund距离映射之间的联系。我们证明该映射是函数通过结构函数(即探针)进行膨胀和腐蚀的比值的对数。膨胀和腐蚀是将图像的格映射到正函数的格中的映射。使用平坦的结构元素,可以通过图像的膨胀和腐蚀来简化Asplund距离映射的表达,这些映射仍保留在图像的格中。我们通过一个使用非平坦结构函数的模式匹配示例来展示我们的方法。

英文摘要

We establish the link between Mathematical Morphology and the map of Asplund's distances between a probe and a grey scale function, using the Logarithmic Image Processing scalar multiplication. We demonstrate that the map is the logarithm of the ratio between a dilation and an erosion of the function by a structuring function: the probe. The dilations and erosions are mappings from the lattice of the images into the lattice of the positive functions. Using a flat structuring element, the expression of the map of Asplund's distances can be simplified with a dilation and an erosion of the image; these mappings stays in the lattice of the images. We illustrate our approach by an example of pattern matching with a non-flat structuring function.

1607.07942 2026-06-04 cs.AI cs.IT cs.SY eess.SY math.IT

Multiple scan data association by convex variational inference

通过凸变分推断实现多扫描数据关联

Jason L. Williams, Roslyn A. Lau

发表机构 * Defence Science and Technology Group, Australia(澳大利亚国防科学与技术集团) National Security, Intelligence, Surveillance and Reconnaissance Division(国家安全、情报、监视与侦察部门) Queensland University of Technology, Australia(澳大利亚昆士兰理工大学) Maritime Division, Defence Science and Technology Group, Australia(澳大利亚国防科学与技术集团的海军部门)

AI总结 本文研究多扫描数据关联问题,提出基于分数自由能的凸优化方法,改进了传统信念传播算法,提升目标跟踪精度。

详情
AI中文摘要

数据关联,即对目标与测量之间的对应关系进行推理,是目标跟踪中的基础问题。最近,信念传播(BP)作为一种估计测量与目标关联的边缘概率的有希望方法出现,提供了快速且准确的估计。BP在特定形式中的出色表现可能归因于其隐含优化的底层自由能的凸性。本文研究多扫描数据关联问题,即对目标与多个测量集之间的对应关系进行推理的问题,这可能对应于不同的传感器或不同的时间步。我们发现单扫描BP形式的多扫描扩展是非凸的,并展示了由此产生的不良行为。使用最近提出的分数自由能(FFE)构建了凸自由能。为单扫描FFE提供了一个收敛的、类似BP的算法,并用于通过对偶坐标上升优化多扫描自由能。最后,基于联合概率数据关联(JPDA)的变分解释,我们开发了一个类似于JPDA的序列变体算法,但保留了来自先前扫描的一致性约束。所提出方法的性能在仅靠方位角的目标定位问题上得到验证。

英文摘要

Data association, the reasoning over correspondence between targets and measurements, is a problem of fundamental importance in target tracking. Recently, belief propagation (BP) has emerged as a promising method for estimating the marginal probabilities of measurement to target association, providing fast, accurate estimates. The excellent performance of BP in the particular formulation used may be attributed to the convexity of the underlying free energy which it implicitly optimises. This paper studies multiple scan data association problems, i.e., problems that reason over correspondence between targets and several sets of measurements, which may correspond to different sensors or different time steps. We find that the multiple scan extension of the single scan BP formulation is non-convex and demonstrate the undesirable behaviour that can result. A convex free energy is constructed using the recently proposed fractional free energy (FFE). A convergent, BP-like algorithm is provided for the single scan FFE, and employed in optimising the multiple scan free energy using primal-dual coordinate ascent. Finally, based on a variational interpretation of joint probabilistic data association (JPDA), we develop a sequential variant of the algorithm that is similar to JPDA, but retains consistency constraints from prior scans. The performance of the proposed methods is demonstrated on a bearings only target localisation problem.

1402.6964 2026-06-04 cs.LG cs.DC cs.NA math.NA stat.ML

Scalable methods for nonnegative matrix factorizations of near-separable tall-and-skinny matrices

可扩展的非负矩阵分解方法用于近可分离的高瘦矩阵

Austin R. Benson, Jason D. Lee, Bartek Rajwa, David F. Gleich

发表机构 * Stanford University(斯坦福大学) Purdue University(普渡大学) Purdue University Institute for Computational and Bindley Biosciences Center(普渡大学计算与Bindley生物科学中心) Computer Science Mathematical Engineering(计算机科学数学工程)

AI总结 本文提出高效算法处理高瘦矩阵的非负矩阵分解,通过正交变换保持分离性,适用于流式数据和分布式计算环境。

详情
Journal ref
Proceedings of Neural Information Processing Systems, 2014
AI中文摘要

许多算法在假设矩阵近可分离的情况下用于非负矩阵分解。本文展示如何使这些算法高效处理行数远多于列数的高瘦矩阵。改进方法的关键是保持NMF问题分离性的正交矩阵变换。最终方法只需单次遍历数据矩阵,适用于流式、多核和MapReduce架构。我们在TB级合成矩阵和科学计算、生物信息学的真实数据上验证了算法的有效性。

英文摘要

Numerous algorithms are used for nonnegative matrix factorization under the assumption that the matrix is nearly separable. In this paper, we show how to make these algorithms efficient for data matrices that have many more rows than columns, so-called "tall-and-skinny matrices". One key component to these improved methods is an orthogonal matrix transformation that preserves the separability of the NMF problem. Our final methods need a single pass over the data matrix and are suitable for streaming, multi-core, and MapReduce architectures. We demonstrate the efficacy of these algorithms on terabyte-sized synthetic matrices and real-world matrices from scientific computing and bioinformatics.

2606.05150 2026-06-04 cs.NE cs.AI

Multi-Column RBF Neural Network Using Adaptive and Non-Adaptive Particle Swarm Optimization

使用自适应和非自适应粒子群优化的多列RBF神经网络

Ammar Hoori, Yuichi Motai

发表机构 * Department of Biomedical Engineering, Case Western Reserve University(生物医学工程系,凯斯西储大学) Department of Electrical and Computer Engineering, Virginia Commonwealth University(电气与计算机工程系,弗吉尼亚 Commonwealth 大学)

AI总结 针对大规模数据集下RBF神经网络训练的可扩展性问题,提出基于粒子群优化(PSO)和自适应PSO(APSO)的多列RBF网络(MC-PSO和MC-APSO),通过并行训练多个RBFN并利用子集专门化提高精度和速度。

Comments 15 Page, Under Review

详情
AI中文摘要

使用梯度下降算法训练的径向基函数神经网络(RBFN)在浅层和深层网络中提供了有效的全连接结构。误差校正(ErrCor)是一种先进的基于梯度的训练方法,它选择最优隐藏单元以提高精度。另外,作为基于种群的算法,粒子群优化算法(PSO)利用群体经验优化RBFN参数,提供全局搜索和对局部最小值的鲁棒性。自适应PSO(APSO)作为PSO的改进变体出现。APSO算法通过在优化过程中动态调整群体参数来提高收敛速度。ErrCor和PSO都显示出改进的结果和有竞争力的收敛性。然而,对于大规模数据集,这些方法面临可扩展性挑战,如过多的核计算和大的隐藏层结构。最近的多列RBFN方法(MCRN)通过在并行系统中部署小型RBFN来提高ErrCor性能。受MCRN成功的启发,我们提出了两种改进PSO性能的新方法:使用PSO的多列RBFN(MC-PSO)和使用APSO的多列RBFN(MC-APSO)。这些方法引入了使用进化群方法训练的并行RBFN结构。每个RBFN独立地在数据集的特定空间子集上使用PSO或APSO算法进行训练。这些经过专门训练的RBFN针对各自的子集进行了定制。在测试期间,只有测试实例邻居所在的选定RBFN对多列输出有贡献。这种专门化提高了精度,而并行性提高了速度。我们在各种基准数据集上评估了所提出的方法。MC-PSO和MC-APSO在精度和召回率方面优于ErrCor、PSO、APSO和MCRN。在大多数实验中,它们还表现出更快的训练和测试时间。

英文摘要

The radial basis function neural network (RBFN) trained with a gradient descending algorithm provides an effective fully connected structure in both shallow and deep networks. The error correction (ErrCor), a state-of-the-art gradient-based training method, selects optimal hidden units to improve accuracy. Alternatively, as a population-based algorithm, the particle swarm optimization algorithm (PSO) uses the swarm experience to optimize RBFN parameters, offering global search and robustness to local minima. Adaptive PSO (APSO) has emerged as an improved variant of PSO. APSO algorithm improves convergence speed by dynamically adjusting swarm parameters during optimization. Both ErrCor and PSO demonstrate improved results and competitive convergence. However, with large datasets, these methods face scalability challenges such as excessive kernel computations and large hidden layer structures. A recent multi-column RBFN approach (MCRN) improves ErrCor performance by deploying small RBFNs in a parallel system. Inspired by MCRN's success, we propose two novel approaches to improve PSO performance: the multi-column RBFN with PSO (MC-PSO) and the multi-column RBFN with APSO (MC-APSO). These methods introduce parallel RBFN structures trained using evolutionary swarm methods. Each RBFN is independently trained on a specific spatial subset of the dataset using either PSO or APSO algorithms. These resulting specialist-trained RBFNs are tailored to their respective subsets. During testing, only selected RBFNs, where the test instance neighbors are located, contribute to the multi-column output. This specialization improves accuracy, while parallelism enhances speed. We evaluate the proposed methods on various benchmark datasets. The MC-PSO and MC-APSO outperform ErrCor, PSO, APSO, and MCRN in terms of accuracy and recall. They also demonstrate faster training and testing times in most experiments.

2606.05129 2026-06-04 cs.CR cs.LG

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

在全同态加密下学习因果结构时保护数据隐私

Jian Yang, Yuan Tong, Qinbin Li, Zeyi Wen, Xiaofang Zhou

发表机构 * Hong Kong University of Science and Technology (Guangzhou)(香港理工大学(广州)) Hong Kong University of Science and Technology(香港理工大学) University of California, Berkeley(加州大学伯克利分校)

AI总结 针对分布式因果结构学习中的隐私泄露问题,提出基于全同态加密的方法,通过电路简化、除法和对数近似以及SIMD批处理技术,在加密数据上高效完成因果结构学习,并支持扩展到差分隐私。

详情
AI中文摘要

保护数据隐私是结构数据管理和数据挖掘中的重要课题。然而,分布式因果结构学习中的隐私泄露问题是一个持续的挑战,特别是在需要数据传输和计算的情况下。在本文中,我们提出了一种基于全同态加密(FHE)的方法,该方法在密文上进行计算,保持数据在传输和计算过程中加密。然而,由于FHE计算成本高且对除法和对数运算的支持有限,将FHE应用于因果结构学习具有挑战性。为了应对这一挑战,我们提出了一系列新颖的技术,包括(i)电路简化以提高效率,(ii)通过牛顿-拉夫森倒数和泰勒展开近似除法和对数,以及(iii)使用SIMD加速的批处理技术来增强整个学习过程。此外,我们的方法可以轻松扩展到FHE之外,通过展示其可移植性来支持差分隐私。实验结果表明,我们的方法在测试的数据集上实现了与明文版本高度一致且可比的因果结构。最后,即使在FHE的隐私保护下,我们的方法也能在几十分钟内高效且实际地完成因果结构学习。

英文摘要

Preserving data privacy is an important topic in structural data management and data mining. However, the issue of privacy leakage in distributed causal structure learning is a persistent challenge, especially in cases where data transmission and computation are required. In this paper, we propose a method based on fully homomorphic encryption (FHE) that performs calculations on ciphertexts, keeping data encrypted in transition and computation. Nevertheless, adopting FHE to causal structure learning is challenging due to the high computation cost and limited support on division as well as logarithm operations in FHE. To tackle this challenge, we propose a series of novel techniques including (i) circuit simplification for better efficiency, (ii) approximation of division and logarithm through Newton-Raphson Reciprocal and Taylor expansion, and (iii) a batching technique with SIMD-acceleration to enhance the whole learning process. Additionally, our method can be easily extended beyond FHE by demonstration of its portability to support differential privacy. Empirical results show that our method achieves high consistency and comparable causal structure with the plaintext version in the datasets tested. Last, our method is efficient and practical to complete learning causal structures in tens of minutes even under the privacy protection of FHE.

2606.05124 2026-06-04 cs.GR cs.CV cs.LG

Geometry Gaussians: Decoupling Appearance and Geometry in Gaussian Splatting

几何高斯:在高斯泼溅中解耦外观与几何

Hongyu Zhou, Zorah Lähner

发表机构 * University of Bonn(波恩大学) Lamarr Institut(拉马尔研究所)

AI总结 针对3D高斯泼溅在几何表示与外观渲染间的冲突,提出通过为每个溅射添加几何不透明度参数并配合透明度优化流程,实现几何与外观的解耦,提升复杂场景(尤其是透明物体)的渲染与几何性能。

详情
AI中文摘要

在3D高斯泼溅(3DGS)成功用于新视角合成后,许多工作探索了如何将其用于几何表面表示。然而,直接从3DGS中提取准确的几何信息仍然具有挑战性,且往往会降低外观渲染质量。在这项工作中,我们通过使用完整的地面真值纹理和几何信息进行训练,证明了默认形式的3DGS本质上不适合同时表示纹理和几何。我们还提出了一种简单的解决方案,即为每个溅射应用一个额外的几何不透明度参数,并配合可选的透明度策划优化流程。我们的实验,无论是使用地面真值还是视觉基础模型的几何输入,都表明这一改变在多种数据集上提高了渲染和几何性能,尤其是对于包含透明物体的复杂场景,我们的方法带来了显著提升。

英文摘要

After the success of 3D Gaussian Splatting (3DGS) for novel view synthesis, many works have explored how to also use it for geometric surface representation. However, extracting accurate geometric information directly from 3DGS remains challenging and can often reduce the appearance rendering quality. In this work, we show that 3DGS in its default form is inheritedly unsuited to represent texture and geometry at the same time, by training with complete ground-truth texture and geometry information. We also propose a simple solution by applying a single additional geometry opacity parameter to each splat, together with an optional transparency-curated optimization pipeline. Our experiments, both with ground-truth and vision foundation model geometric input, show that this change leads to improved rendering and geometry performance on a wide variety of dataset, and especially complex scenes with transparent objects benefit significantly from our method.

2606.05045 2026-06-04 math.DS cs.LG

Learning Control-Affine Reduced-Order Models via Autoencoders

通过自编码器学习控制仿射降阶模型

Ali Mjalled, Martin Mönnigmann

发表机构 * Automatic Control and Systems Theory Ruhr-Universität Bochum(自动控制与系统理论 梅尔恩大学波恩分校)

AI总结 提出一种利用自编码器同时学习降阶潜在空间和控制仿射状态空间动力学的框架,并扩展为序列模型以提高预测精度,通过反馈线性化验证其有效性。

详情
AI中文摘要

本文提出了一种用于识别控制仿射降阶模型(ROM)的框架。该方法利用自编码器(AE)将高维状态以及潜在的高维输入变换为适合控制仿射状态空间动力学的降维潜在变量。这是通过同时训练AE和状态空间模型实现的。此外,我们将离散ROM公式扩展为基于序列的模型,该模型处理状态和输入历史以提高预测精度,同时保持控制仿射结构。我们通过对导出的模型应用反馈线性化来激励我们的框架,并提出了有效使用它的指南。所提出的框架在两个数值示例上进行了评估,并将其性能与基线模型(其中AE识别具有线性状态空间动力学的潜在空间)进行了比较。评估涉及测试数据上ROM的预测精度及其将系统控制到期望状态或轨迹的有效性。

英文摘要

We present in this paper a framework for the identification of control-affine reduced-order models (ROMs). The proposed method utilizes autoencoders (AEs) to transform the high-dimensional states, and potentially the high-dimensional inputs, into reduced latent ones suitable for control-affine state-space dynamics. This is achieved by simultaneous training of the AE and the state-space model. In addition, we extend the discrete ROM formulation to a sequence-based model, which processes state and input histories to improve prediction accuracy while preserving the control-affine structure. We motivate our framework by applying feedback linearization to the derived models, and we present guidelines for its efficient use. The proposed framework is assessed on two numerical examples and its performance is compared to a baseline model, where the AE identifies a latent space with linear state-space dynamics. The assessment involves evaluating the prediction accuracy of the ROM on test data and its effectiveness in controlling the system to a desired state or trajectory.

2606.05037 2026-06-04 cs.SE cs.AI

Self-Reflective APIs: Structure Beats Verbosity for AI Agent Recovery

自反式API:结构优于冗长,助力AI代理恢复

Arquimedes Canedo, Grama Chethan

发表机构 * Siemens Digital Industries Software, USA(西门子数字工业软件公司)

AI总结 提出自反式API,在验证失败时返回机器可读的结构化建议,使AI代理无需外部推理即可修复请求并重试,在Anthropic模型上将任务完成率提升36.7-40.0个百分点,且每成功令牌效率提升1.8-2.2倍。

详情
AI中文摘要

当AI代理调用API并遇到验证错误时,它需要的不仅仅是哪里出错了——它需要下一步该做什么。自反式API在验证失败时返回一个机器可读的 recovery_feedback.suggestions[] 负载,足以让代理修复请求并在无需外部推理的情况下重试。在一个经过泄露审计的试点实验(每单元N=30,3个LLM,10个对抗性任务)中,结构化建议在Anthropic模型上将任务完成率提升了+36.7至40.0个百分点(Fisher精确检验 p ≤ 0.0022),每成功令牌效率提高了1.8至2.2倍。在gpt-4o-mini上提升不显著(p=0.435);在计费API上的第二个领域复制确认了这一模式。该比较仅在审计了LLM基准测试中两个未记录的答案泄露类别后才成立。我们提供了 audit_prompt_leakage.py 作为可重用的CI基础设施。代码和数据:https://github.com/arquicanedo/self-reflective-apis。

英文摘要

When an AI agent calls an API and hits a validation error, it needs more than what went wrong -- it needs what to do next. A self-reflective API returns, on validation failure, a machine-readable recovery\_feedback.suggestions[] payload sufficient for the agent to repair the request and retry without external reasoning. On a leak-audited pilot ($N{=}30$ per cell, 3 LLMs, 10 adversarial tasks), structured suggestions lift task-completion rate by $+36.7$--$40.0$pp over plain-English diagnoses on Anthropic models (Fisher's exact $p \le 0.0022$), at $1.8$--$2.2\times$ better per-success token efficiency. The lift is not significant on gpt-4o-mini ($p{=}0.435$); a second-domain replication on a billing API confirms the pattern. The comparison only holds after auditing two undocumented classes of answer leakage in LLM benchmarks. We shipaudit\_prompt\_leakage.py as reusable CI infrastructure. Code and data: https://github.com/arquicanedo/self-reflective-apis.