arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1612.01597 2026-06-04 math.NA cs.IT cs.LG cs.NA math.IT

Deterministic and Probabilistic Conditions for Finite Completability of Low-Tucker-Rank Tensor

低 Tucker 等秩张量有限可补全的确定性和概率条件

Morteza Ashraphijuo, Vaneet Aggarwal, Xiaodong Wang

发表机构 * Department of Electrical Engineering, Columbia University(哥伦比亚大学电气工程系) School of IE, Purdue University(普渡大学工业工程学院)

AI总结 本文研究了在给定某些 Tucker 等秩组件的情况下,张量有限可补全的采样模式的基本条件。通过在 Tucker 流形上进行代数几何分析,提出了确定性必要和充分条件,同时研究了概率条件并给出了采样概率的下界,以确保所提出的确定性条件在高概率下成立。此外,利用所提出的几何方法,提出了一个保证采样张量有唯一补全的采样模式充分条件。

详情
AI中文摘要

我们研究了在给定某些 Tucker 等秩组件的情况下,张量有限可补全的采样模式的基本条件。为了找到确定性必要和充分条件,我们提出了一种在 Tucker 流形上的代数几何分析,这与传统在 Grassmannian 流形上的几何方法不同,允许在分析中纳入多个秩组件。这种分析刻画了一组基于采样模式定义的多项式的代数独立性,这与有限补全密切相关。然后研究了概率条件,并给出了采样概率的下界,该下界保证所提出的关于采样模式的确定性条件在有限补全中以高概率成立。此外,利用所提出的有限补全几何方法,我们提出了一种关于采样模式的充分条件,该条件保证采样张量存在唯一的补全。

英文摘要

We investigate the fundamental conditions on the sampling pattern, i.e., locations of the sampled entries, for finite completability of a low-rank tensor given some components of its Tucker rank. In order to find the deterministic necessary and sufficient conditions, we propose an algebraic geometric analysis on the Tucker manifold, which allows us to incorporate multiple rank components in the proposed analysis in contrast with the conventional geometric approaches on the Grassmannian manifold. This analysis characterizes the algebraic independence of a set of polynomials defined based on the sampling pattern, which is closely related to finite completion. Probabilistic conditions are then studied and a lower bound on the sampling probability is given, which guarantees that the proposed deterministic conditions on the sampling patterns for finite completability hold with high probability. Furthermore, using the proposed geometric approach for finite completability, we propose a sufficient condition on the sampling pattern that ensures there exists exactly one completion for the sampled tensor.

1905.03349 2026-06-04 eess.SY cs.RO cs.SY

A New Hybrid Control Architecture to Attenuate Large Horizontal Wind Disturbance for a Small-Scale Unmanned Helicopter

一种新型混合控制架构用于抑制小型无人驾驶直升机的大水平风扰动

Xiaorui Zhu, Wenwu Zeng, Zexiang Li, Chunyang Zheng

发表机构 * School of Mechanical Engineering and Automation, Harbin Institute of Technology Shenzhen Graduate School(哈尔滨工业大学机械工程与自动化学院深圳研究生院) Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology(香港理工大学电子与计算机工程系)

AI总结 本文提出了一种结合风洞实验数据和反步算法的新方法,用于抑制小型无人驾驶直升机的大水平风扰动,通过混合控制架构实现更精确快速的扰动抑制。

详情
AI中文摘要

本文提出了一种新颖的方法,用于抑制小型无人驾驶自主直升机的大水平风扰动,结合风洞实验数据和反步算法。大水平风扰动对自主直升机有害,尤其是小型直升机,由于其低惯性以及多输入间的强交叉耦合效应。为实现更精确和快速的扰动抑制,提出了一种新的混合控制架构,利用基于风洞实验数据的直接力/力矩补偿。在该架构中,大水平风扰动被视为控制系统的附加输入,而非平衡状态附近的微小扰动。然后设计反步算法以保证直升机稳定收敛至期望位置。所提出的技术最终在HIROBO Eagle平台上通过仿真进行评估,并与传统风速补偿方法进行比较。

英文摘要

This paper presents a novel method to attenuate large horizontal wind disturbance for a small-scale unmanned autonomous helicopter combining wind tunnel-based experimental data and a backstepping algorithm. Large horizontal wind disturbance is harmful to autonomous helicopters, especially to small ones because of their low inertia and the high cross-coupling effects among the multiple inputs. In order to achieve more accurate and faster attenuation of large wind disturbance, a new hybrid control architecture is proposed to take advantage of the direct force/moment compensation based on the wind tunnel experimental data. In this architecture, large horizontal wind disturbance is treated as an additional input to the control system instead of a small perturbation around the equilibrium state. A backstepping algorithm is then designed to guarantee the stable convergence of the hilicopter to the desired position. The proposed technique is finally evaluated in simulation on the platform, HIROBO Eagle, compared with a traditional wind velocity compensation method.

1905.02606 2026-06-04 eess.SY cs.AI cs.SY

Optimal Control of Complex Systems through Variational Inference with a Discrete Event Decision Process

通过变分推断优化复杂系统的控制:离散事件决策过程

Wen Dong, Bo Liu, Fan Yang

发表机构 * University at Buffalo(布法罗大学) Auburn University(阿伯伯大学)

AI总结 本文提出了一种基于变分推断的方法,将复杂社会网络决策问题建模为离散事件决策过程,以解决高维状态-动作空间中的维度灾难问题,从而在现实交通场景中实现更高的系统预期奖励、更快的收敛速度和更低的价值函数方差。

详情
AI中文摘要

复杂社会系统由相互关联的个体组成,其相互作用导致群体行为。现实复杂系统的最优控制有广泛的应用,包括道路交通管理、流行病预防和信息传播。然而,由于高维和非线性系统动态以及决策者面临的爆炸性状态和动作空间,实现此类现实复杂系统控制具有挑战性。现有方法可分为基于模拟和解析两类。现有的模拟方法在蒙特卡洛积分中具有高方差,而解析方法则面临建模不准确的问题。我们采用模拟建模来指定复杂系统的复杂动态,并为在具有高维状态-动作空间的复杂网络中搜索最优策略开发了解析解。为了捕捉复杂系统的动态,我们将复杂社会网络决策问题建模为离散事件决策过程。为了解决复杂系统中的维度灾难和在高维状态-动作空间中的搜索问题,我们将复杂系统的控制减少到变分推断和参数学习,引入Bethe熵近似,并开发了期望传播算法。我们提出的方法在现实交通场景中比最先进的解析和采样方法在系统预期奖励、收敛速度和价值函数方差方面表现更优。

英文摘要

Complex social systems are composed of interconnected individuals whose interactions result in group behaviors. Optimal control of a real-world complex system has many applications, including road traffic management, epidemic prevention, and information dissemination. However, such real-world complex system control is difficult to achieve because of high-dimensional and non-linear system dynamics, and the exploding state and action spaces for the decision maker. Prior methods can be divided into two categories: simulation-based and analytical approaches. Existing simulation approaches have high-variance in Monte Carlo integration, and the analytical approaches suffer from modeling inaccuracy. We adopted simulation modeling in specifying the complex dynamics of a complex system, and developed analytical solutions for searching optimal strategies in a complex network with high-dimensional state-action space. To capture the complex system dynamics, we formulate the complex social network decision making problem as a discrete event decision process. To address the curse of dimensionality and search in high-dimensional state action spaces in complex systems, we reduce control of a complex system to variational inference and parameter learning, introduce Bethe entropy approximation, and develop an expectation propagation algorithm. Our proposed algorithm leads to higher system expected rewards, faster convergence, and lower variance of value function in a real-world transportation scenario than state-of-the-art analytical and sampling approaches.

1905.01028 2026-06-04 eess.SY cs.MA cs.RO cs.SY

Robust Cooperative Formation Control of Fixed-Wing Unmanned Aerial Vehicles

固定翼无人飞行器鲁棒协同编队控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * Institute for Aerospace Studies, University of Toronto(航空航天研究学院,多伦多大学)

AI总结 本文研究了固定翼无人机在紧密编队飞行中用于节能的鲁棒协同编队控制问题,提出了一种新的协同控制方法,通过虚拟结构概念解决大规模无人机编队中虚拟领导者设计的难题,并采用协作者滤波器生成平滑的参考信号,同时利用不确定性与扰动观测器补偿气动耦合导致的模型不确定性,最终实现了鲁棒的编队控制。

详情
AI中文摘要

本文研究了固定翼无人飞行器在紧密编队飞行中用于节能的鲁棒协同编队控制问题。提出了一种新的协同控制方法。采用虚拟结构的概念来解决在编队飞行中为大量无人机设计虚拟领导者所面临的困难。为了提高暂态性能,期望轨迹通过一组协作者滤波器生成平滑的参考信号,即虚拟领导者的状态。利用不确定性与扰动观测器估计并补偿由于无人机之间气动耦合导致的模型不确定性。因此,整个设计包含三个主要组成部分:用于运动规划的协作者滤波器、基准协同控制以及不确定性与扰动观测。所提出的编队控制器至少可以保证编队跟踪的最终有界控制性能。如果满足某些条件,则可以实现渐近编队跟踪控制。本文的主要贡献在于两个方面:1)通过虚拟结构概念解决了设计虚拟领导者的问题;2)提出了一种针对大量无人机之间存在气动耦合的紧密编队飞行的鲁棒协同控制器。所提出设计的效率将通过五架无人机紧密编队飞行的数值模拟来展示。

英文摘要

Robust cooperative formation control is investigated in this paper for fixed-wing unmanned aerial vehicles in close formation flight to save energy. A novel cooperative control method is developed. The concept of virtual structure is employed to resolve the difficulty in designing virtual leaders for a large number of UAVs in formation flight. To improve the transient performance, desired trajectories are passed through a group of cooperative filters to generate smooth reference signals, namely the states of the virtual leaders. Model uncertainties due to aerodynamic couplings among UAVs are estimated and compensated using uncertainty and disturbance observers. The entire design, therefore, contains three major components: cooperative filters for motion planning, baseline cooperative control, and uncertainty and disturbance observation. The proposed formation controller could at least secure ultimate bounded control performance for formation tracking. If certain conditions are satisfied, asymptotic formation tracking control could be obtained. Major contributions of this paper lie in two aspects: 1) the difficulty in designing virtual leaders is resolved in terms of the virtual structure concept; 2) a robust cooperative controller is proposed for close formation flight of a large number of UAVs suffering from aerodynamic couplings in between. The efficiency of the proposed design will be demonstrated using numerical simulations of five UAVs in close formation flight.

1809.07874 2026-06-04 math.OC cs.RO cs.SY eess.SY

Task-Driven Estimation and Control via Information Bottlenecks

基于信息瓶颈的任务驱动估计与控制

Vincent Pacelli, Anirudha Majumdar

发表机构 * Princeton University(普林斯顿大学)

AI总结 本文提出了一种基于信息瓶颈理论的任务驱动估计与控制框架,通过任务相关的变量构建高效表示,并设计了在线算法进行状态估计以实现鲁棒的控制策略。

Comments 9 pages, 4 figures, abridged version accepted to ICRA2019; Incorporates changes in final conference submission

详情
AI中文摘要

我们的目标是开发一种原理明确且通用的算法框架,用于机器人系统的任务驱动估计与控制。最先进的机器人控制系统通常依赖于准确估计机器人完整状态(例如,运行中的机器人可能估计关节角度和速度、躯干状态以及相对于目标的位置)。然而,完整状态表示通常对于特定任务来说过于丰富,可能导致显著的计算不高效和对状态估计误差的脆弱性。相反,我们提出了一种方法,避免使用这种丰富的表示,并试图创建任务驱动的表示。关键的技术洞察是利用信息瓶颈理论来形式化“任务驱动表示”的概念,以信息理论量度来衡量表示的最小性。我们提出了新的迭代算法,用于自动合成(离线)任务驱动表示(以一组任务相关变量(TRVs)给出)和一个以TRVs为函数的高效控制策略。我们还提出了在线算法来估计TRVs,以便应用控制策略。我们证明了我们的方法在理论和彻底的模拟实验(包括向目标位置奔跑的弹簧加载倒立摆)中都对未建模的测量不确定性具有显著的鲁棒性。

英文摘要

Our goal is to develop a principled and general algorithmic framework for task-driven estimation and control for robotic systems. State-of-the-art approaches for controlling robotic systems typically rely heavily on accurately estimating the full state of the robot (e.g., a running robot might estimate joint angles and velocities, torso state, and position relative to a goal). However, full state representations are often excessively rich for the specific task at hand and can lead to significant computational inefficiency and brittleness to errors in state estimation. In contrast, we present an approach that eschews such rich representations and seeks to create task-driven representations. The key technical insight is to leverage the theory of information bottlenecks}to formalize the notion of a "task-driven representation" in terms of information theoretic quantities that measure the minimality of a representation. We propose novel iterative algorithms for automatically synthesizing (offline) a task-driven representation (given in terms of a set of task-relevant variables (TRVs)) and a performant control policy that is a function of the TRVs. We present online algorithms for estimating the TRVs in order to apply the control policy. We demonstrate that our approach results in significant robustness to unmodeled measurement uncertainty both theoretically and via thorough simulation experiments including a spring-loaded inverted pendulum running to a goal location.

1905.02176 2026-06-04 math.NA cs.CG cs.CV cs.NA math.DG

Computation of Circular Area and Spherical Volume Invariants via Boundary Integrals

通过边界积分计算圆面积和球体积不变量

Riley O'Neill, Pedro Angulo-Umana, Jeff Calder, Bo Hessburg, Peter J. Olver, Chehrzad Shakiban, Katrina Yezzi-Woodley

发表机构 * Department of Mathematics, University of St. Thomas(圣托马斯大学数学系) School of Mathematics, University of Minnesota(明尼苏达大学数学学院) Department of Anthropology, University of Minnesota(明尼苏达大学人类学系)

AI总结 本文提出通过边界积分计算平面曲线的圆面积不变量和曲面的球体积不变量,利用散度定理将积分转化为边界积分,并扩展到高维超曲面,为三角化曲面提供了一种无需离散环境空间的计算方法,应用于考古学中骨折碎片的特征检测。

详情
AI中文摘要

我们展示如何通过线积分和表面积分分别计算平面曲线的圆面积不变量和曲面的球体积不变量。我们利用散度定理将面积和体积积分分别表示为特定核的线积分和表面积分;我们的结果也扩展到高维超曲面。所得到的表面积分可以在三角化网格上解析计算。这为三角化曲面计算球体积不变量提供了一种简单的计算算法,无需离散环境空间。我们讨论了该方法在考古学中对感兴趣骨折碎片特征检测的潜在应用。

英文摘要

We show how to compute the circular area invariant of planar curves, and the spherical volume invariant of surfaces, in terms of line and surface integrals, respectively. We use the Divergence Theorem to express the area and volume integrals as line and surface integrals, respectively, against particular kernels; our results also extend to higher dimensional hypersurfaces. The resulting surface integrals are computable analytically on a triangulated mesh. This gives a simple computational algorithm for computing the spherical volume invariant for triangulated surfaces that does not involve discretizing the ambient space. We discuss potential applications to feature detection on broken bone fragments of interest in anthropology.

1903.10623 2026-06-04 eess.SY cs.RO cs.SY

Attitude- and Cruise Control of a VTOL Tiltwing UAV

VTOL 倾转翼无人机的姿态与巡航控制

David Rohr, Thomas Stastny, Sebastian Verling, Roland Siegwart

发表机构 * IEEE

AI总结 本文提出了一种过量驱动的垂直起降(VTOL)倾转翼无人机的数学建模、控制器设计和飞行测试方法,通过简化空气动力学和第一原理,建立了能够捕捉关键空气动力学效应的动态模型,包括螺旋桨滑流对机翼的影响和机翼后失速特性。通过优化如功率最优调平等方法解决了无人机的过量驱动问题,并开发了由低层姿态控制器和高层巡航控制器组成的控制系统,通过系统线性化和查找表确定飞行包线内调平的强非线性变化,通过广泛的飞行测试验证了控制系统在所有飞行阶段(悬停、过渡、巡航)的性能。

Comments 8 pages

详情
AI中文摘要

本文提出了一种垂直起降(VTOL)倾转翼无人机的数学建模、控制器设计和飞行测试方法。基于简化空气动力学和第一原理,建立了能够捕捉关键空气动力学效应的动态模型,包括螺旋桨滑流对机翼的影响和机翼后失速特性。基于该模型分析了稳态飞行包线和相应的调平作用,并通过优化如功率最优调平等方法解决了无人机的过量驱动问题。所开发的控制系统由两个控制器组成:首先是一个基于动态反向和串行方法的低层姿态控制器,用于处理冗余执行器的分配。其次是一个高层巡航控制器,用于跟踪所需的垂直速度。该控制器基于系统线性化和查找表,以确定飞行包线内调平的强非线性变化。我们通过广泛的飞行测试验证了该控制系统的性能,适用于所有飞行阶段(悬停、过渡、巡航)。

英文摘要

This paper presents the mathematical modeling, controller design, and flight-testing of an over-actuated Vertical Take-off and Landing (VTOL) tiltwing Unmanned Aerial Vehicle (UAV). Based on simplified aerodynamics and first-principles, a dynamical model of the UAV is developed which captures key aerodynamic effects including propeller slipstream on the wing and post-stall characteristics of the airfoils. The model-based steady-state flight envelope and the corresponding trim-actuation is analyzed and the overactuation of the UAV solved by optimizing for, e.g., power-optimal trims. The developed control system is composed of two controllers: First, a low-level attitude controller based on dynamic inversion and a daisy-chaining approach to handle allocation of redundant actuators. Secondly, a higher-level cruise controller to track a desired vertical velocity. It is based on a linearization of the system and look-up tables to determine the strong and nonlinear variation of the trims throughout the flight-envelope. We demonstrate the performance of the control-system for all flight phases (hover, transition, cruise) in extensive flight-tests.

1706.07119 2026-06-04 eess.SY cs.LG cs.SY

"Parallel Training Considered Harmful?": Comparing series-parallel and parallel feedforward network training

并行训练是否有害?:比较系列-并行与并行前馈网络训练

Antônio H. Ribeiro, Luis A. Aguirre

发表机构 * Department of Electronic Engineering at Universidade Federal de Minas Gerais (UFMG) - Av. Ant\ o nio Carlos 6627, 31270-901, Belo Horizonte, MG, Brazil

AI总结 本文比较了系列-并行和并行前馈网络训练方法,探讨了其在鲁棒性、计算成本和收敛性方面的表现,发现并行训练在更现实的场景中表现更优。

详情
Journal ref
Neurocomputing 316:222--231, (2018)
AI中文摘要

动态系统神经网络模型可以并行或系列-并行配置进行训练。受早期论述影响,一些论文认为系列-并行配置比并行配置更优,因其计算成本更低、训练稳定性更好且结果更准确。另一方面,其他研究则认为并行训练更稳健,能产生更准确的长期预测。本文的主要贡献是通过统一框架比较两种方法。我们关注三个方面:i)在噪声下的估计鲁棒性;ii)计算成本;iii)收敛性。统一的数学框架和模拟研究显示了每种训练方法在不同情境下的验证结果,发现并行训练在更现实的场景中表现更优。一个使用测量数据的例子似乎支持这一结论。我们还通过新的复杂度分析和数值示例表明,两种方法的计算成本相似,但系列-并行训练更易于并行化。一些关于稳定性和收敛性性质的非正式讨论也在示例中进行了探讨。

英文摘要

Neural network models for dynamic systems can be trained either in parallel or in series-parallel configurations. Influenced by early arguments, several papers justify the choice of series-parallel rather than parallel configuration claiming it has a lower computational cost, better stability properties during training and provides more accurate results. Other published results, on the other hand, defend parallel training as being more robust and capable of yielding more accu- rate long-term predictions. The main contribution of this paper is to present a study comparing both methods under the same unified framework. We focus on three aspects: i) robustness of the estimation in the presence of noise; ii) computational cost; and, iii) convergence. A unifying mathematical framework and simulation studies show situations where each training method provides better validation results, being parallel training better in what is believed to be more realistic scenarios. An example using measured data seems to reinforce such claim. We also show, with a novel complexity analysis and numerical examples, that both methods have similar computational cost, being series series-parallel training, however, more amenable to parallelization. Some informal discussion about stability and convergence properties is presented and explored in the examples.

1803.10986 2026-06-04 math.NA cs.LG cs.NA stat.ML

Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

卷积在深度神经网络中的误差分析及精度提升

Barbara Barabasz, Andrew Anderson, Kirk M. Soodhalter, David Gregg

发表机构 * School of Computer Science and Statistics(计算机科学与统计学学院) Trinity College Dublin(都柏林三一学院) School of Mathematics(数学学院)

AI总结 本文分析了Winograd卷积算法的误差,并提出改进方法以提高其精度,通过Huffman编码优化求和误差,实验选择采样点并探索混合精度卷积等方法以减少浮点误差。

详情
AI中文摘要

流行的深度神经网络(DNNs)大部分执行时间用于计算卷积。Winograd算法族可以显著减少所需的算术运算次数,并存在于许多DNN软件框架中。然而,性能提升是以浮点(FP)数值精度的降低为代价。在本文中,我们分析了最坏情况下的FP误差并证明了算法的范数和条件数的估计。我们证明误差界随卷积大小的增加呈指数增长,但改进后的算法的误差界比原始算法更小。我们提出几种减少FP误差的方法。我们提出基于Huffman编码的通用评估顺序以减少求和误差。我们通过实验研究采样

英文摘要

Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is present in many DNN software frameworks. However, the performance gain is at the expense of a reduction in floating point (FP) numerical accuracy. In this paper, we analyse the worst case FP error and prove the estimation of norm and conditioning of the algorithm. We show that the bound grows exponentially with the size of the convolution, but the error bound of the \textit{modified} algorithm is smaller than the original one. We propose several methods for reducing FP error. We propose a canonical evaluation ordering based on Huffman coding that reduces summation error. We study the selection of sampling "points" experimentally and find empirically good points for the most important sizes. We identify the main factors associated with good points. In addition, we explore other methods to reduce FP error, including mixed-precision convolution, and pairwise summation across DNN channels. Using our methods we can significantly reduce FP error for a given block size, which allows larger block sizes and reduced computation.

1904.12394 2026-06-04 math.DS cs.NA cs.RO math.NA

Stability conditions of an ODE arising in human motion and its numerical simulation

来自人体运动的ODE的稳定性条件及其数值模拟

Takahiro Kosugi, Hitoshi Kino, Masaaki Goto, Yuki Matsutani

发表机构 * Department of Intelligent Mechanical Engineering, Faculty of Engineering, Fukuoka Institute of Technology(福冈技术学院智能机械工程系) Department of Robotics, Faculty of Engineering, Kindai University(近畿大学机器人系)

AI总结 本文研究了一个来自肌骨系统前馈位置控制的ODE的平衡点稳定性,提出了一种渐近稳定的充分条件,并通过惩罚ODE的数值模拟和实验结果验证了该条件。

Comments 15 pages, 7 figures

详情
AI中文摘要

本文讨论了一个来自肌骨系统前馈位置控制的常微分方程(ODE)的平衡点稳定性。所研究的系统包含一个连杆、一个关节和两条具有路由点的肌肉。系统的运动收敛性强烈依赖于肌骨系统中肌肉的排列方式。本文获得了渐近稳定的充分条件。此外,还描述了惩罚ODE的数值模拟和实验结果。

英文摘要

This paper discusses the stability of an equilibrium point of an ordinary differential equation (ODE) arising from a feed-forward position control for a musculoskeletal system. The studied system has a link, a joint and two muscles with routing points. The motion convergence of the system strongly depends on the muscular arrangement of the musculoskeletal system. In this paper, a sufficient condition for asymptotic stability is obtained. Furthermore, numerical simulations of the penalized ODE and experimental results are described.

1904.11352 2026-06-04 math.NA cs.LG cs.NA

Construction of the similarity matrix for the spectral clustering method: numerical experiments

谱聚类方法相似性矩阵的构造:数值实验

Paola Favati, Grazia Lotti, Ornella Menchi, Francesco Romani

发表机构 * IIT - CNR(意大利国家研究 council(CNR)- 国立应用科学研究院(IIT)) Dipartimento di Scienze Matematiche, Fisiche e Informatiche, University of Parma(帕尔马大学数学、物理和信息科学系) Dipartimento di Informatica, University of Pisa(比萨大学信息科学系)

AI总结 本文研究了谱聚类中相似性矩阵的构造问题,通过直接基于数据集关联图或其最小生成树(MST)来考虑稀疏性和尺度参数σ的选择,进行人工和真实数据集的数值实验以比较方法性能。

Comments Submitted to Journal of Computational and Applied Mathematics

详情
AI中文摘要

谱聚类是一种通过相似性矩阵的特征向量来发现数据集结构的强大方法。当个体聚类结构高度非凸时,它通常优于传统聚类算法如k-均值。其准确性取决于如何定义数据点对之间的相似性。两个重要因素影响相似性矩阵的构造:底层加权图的稀疏性,这主要取决于数据点之间的距离,以及相似性函数。当使用高斯相似性函数时,尺度参数σ的选择可能至关重要。本文基于数据集关联图或其最小生成树(MST)直接或间接地考察了稀疏性和σ的选择。为了比较方法性能,已进行了大量人工和真实数据集的数值实验。

英文摘要

Spectral clustering is a powerful method for finding structure in a dataset through the eigenvectors of a similarity matrix. It often outperforms traditional clustering algorithms such as $k$-means when the structure of the individual clusters is highly non-convex. Its accuracy depends on how the similarity between pairs of data points is defined. Two important items contribute to the construction of the similarity matrix: the sparsity of the underlying weighted graph, which depends mainly on the distances among data points, and the similarity function. When a Gaussian similarity function is used, the choice of the scale parameter $σ$ can be critical. In this paper we examine both items, the sparsity and the selection of suitable $σ$'s, based either directly on the graph associated to the dataset or on the minimal spanning tree (MST) of the graph. An extensive numerical experimentation on artificial and real-world datasets has been carried out to compare the performances of the methods.

1904.10597 2026-06-04 eess.SY cs.LG cs.SY

Autonomous Voltage Control for Grid Operation Using Deep Reinforcement Learning

利用深度强化学习实现电网运行的自主电压控制

Ruisheng Diao, Zhiwei Wang, Di Shi, Qianyun Chang, Jiajun Duan, Xiaohu Zhang

发表机构 * GEIRI North America(GEIRI北美中心) State Grid Corporation of China(国家电网公司)

AI总结 本文提出Grid Mind框架,通过深度强化学习实现自主电网控制,解决传统方法在处理可再生能源和需求响应动态性带来的挑战,提升电网运行的安全性和经济性。

Comments To be published (Accepted) in: Proceedings of the Power and Energy Society General Meeting (PESGM), Atlanta, GA, 2019

详情
AI中文摘要

现代电力电网正面临由快速增长的可再生能源和需求响应的随机性和动态性带来的巨大挑战。传统理论假设和运营规则可能被违反,而现有控制系统由于缺乏计算能力和准确的电网模型,在实时应用中难以适应,导致电网安全和经济运行日益受到关注。现有运营控制措施通常是离线确定的,优化程度较低。本文提出了一种新的范式Grid Mind,用于利用深度强化学习实现自主电网运营控制。所提出的AI代理可通过与大量离线模拟的交互学习其控制策略,并适应新的变化,包括负载/发电变化以及拓扑变化。经过适当训练的代理在IEEE 14节点系统上测试了数万种场景,并在应用自主电压控制以实现安全电网运行方面展示了有希望的性能。

英文摘要

Modern power grids are experiencing grand challenges caused by the stochastic and dynamic nature of growing renewable energy and demand response. Traditional theoretical assumptions and operational rules may be violated, which are difficult to be adapted by existing control systems due to the lack of computational power and accurate grid models for use in real time, leading to growing concerns in the secure and economic operation of the power grid. Existing operational control actions are typically determined offline, which are less optimized. This paper presents a novel paradigm, Grid Mind, for autonomous grid operational controls using deep reinforcement learning. The proposed AI agent for voltage control can learn its control policy through interactions with massive offline simulations, and adapts its behavior to new changes including not only load/generation variations but also topological changes. A properly trained agent is tested on the IEEE 14-bus system with tens of thousands of scenarios, and promising performance is demonstrated in applying autonomous voltage controls for secure grid operation.

1904.07479 2026-06-04 eess.SY cs.MA cs.RO cs.SY

Robust nonlinear control of close formation flight

紧密编队飞行的鲁棒非线性控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * University of Toronto(多伦多大学)

AI总结 本文研究了在动态飞行操作中,为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制的问题,提出了一种鲁棒非线性编队控制算法,通过基线控制律和扰动观测器实现精确编队跟踪控制,通过数值模拟验证了设计的有效性。

Comments 33 pages, 20 figures

详情
AI中文摘要

本文研究了紧密编队飞行的鲁棒非线性控制问题。其目标是在动态飞行操作中,为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制。一个关键问题是如何确保在存在不确定性和扰动的情况下,位置误差保持有界。本文提出了一种鲁棒非线性编队控制算法,以实现精确的紧密编队跟踪控制。所提出的控制算法由基线控制律和扰动观测器组成。基线控制律用于稳定紧密编队飞行的非线性动力学,而扰动观测器用于补偿系统不确定性和编队相关的空气动力学扰动。通过所提出的设计,可以保证位置控制性能在期望的有界范围内,从而为在紧密编队中使用该设计获得足够的阻力减少。所提出设计的有效性通过两架飞行器的紧密编队飞行数值模拟得以验证。

英文摘要

This paper investigates the robust nonlinear close formation control problem. It aims to achieve precise position control at dynamic flight operation for a follower aircraft under the aerodynamic impact due to the trailing vortices generated by a leader aircraft. One crucial concern is the control robustness that ensures the boundedness of position error subject to uncertainties and disturbances to be regulated with accuracy. This paper develops a robust nonlinear formation control algorithm to fulfill precise close formation tracking control. The proposed control algorithm consists of baseline control laws and disturbance observers. The baseline control laws are employed to stabilize the nonlinear dynamics of close formation flight, while the disturbance observers are introduced to compensate system uncertainties and formation-related aerodynamic disturbances. The position control performance can be guaranteed within the desired boundedness to harvest enough drag reduction for a follower aircraft in close formation using the proposed design. The efficacy of the proposed design is demonstrated via numerical simulations of close formation flight of two aircraft.

1812.03467 2026-06-04 math.NA cs.LG cs.MS cs.NA math.OC

A note on solving nonlinear optimization problems in variable precision

关于在变量精度下求解非线性优化问题的注记

S. Gratton, Ph. L. Toint

发表机构 * NAXYS, University of Namur(NAXYS,纳慕尔大学)

AI总结 本文提出一种高效的信赖域算法变体,用于高性能计算,通过多精度计算有效降低目标函数和梯度评估的能耗。

Comments 11 pages, 2 figures

详情
AI中文摘要

本文简要考虑了Carter (1993)和Conn, Gould和Toint (2000)提出的动态精度信赖域算法变体,作为非常高性能计算领域的一个工具,该领域中允许多精度计算对于控制能量耗散至关重要。所呈现的数值实验表明,使用该方法可以显著降低目标函数和梯度评估的'能耗',通过高效利用多精度计算。

英文摘要

This short note considers an efficient variant of the trust-region algorithm with dynamic accuracy proposed Carter (1993) and Conn, Gould and Toint (2000) as a tool for very high-performance computing, an area where it is critical to allow multi-precision computations for keeping the energy dissipation under control. Numerical experiments are presented indicating that the use of the considered method can bring substantial savings in objective function's and gradient's evaluation "energy costs" by efficiently exploiting multi-precision computations.

1904.05732 2026-06-04 math.NA cs.LG cs.NA

A Kaczmarz Algorithm for Solving Tree Based Distributed Systems of Equations

一种用于求解树结构分布式方程组的Kaczmarz算法

Chinmay Hegde, Fritz Keinert, Eric S. Weber

发表机构 * Electrical and Computer Engineering, Iowa State University, Ames, IA 50011(电子与计算机工程系,爱荷华州立大学,爱荷华州阿姆斯,IA 50011) Department of Mathematics, Iowa State University, 396 Carver Hall, Ames, IA 50011(数学系,爱荷华州立大学,396号Carver大楼,爱荷华州阿姆斯,IA 50011)

AI总结 本文提出了一种改进的Kaczmarz算法,用于求解分布式环境中的线性方程组,该算法适用于具有树结构的网络,并在方程组一致时收敛到解,在不一致时收敛到加权最小二乘解。

详情
AI中文摘要

Kaczmarz算法是一种用于求解线性方程组的迭代方法。我们介绍了一种改进的Kaczmarz算法,用于在分布式环境中求解线性方程组,即方程组中的方程分布在网络中的多个节点上。我们引入的修改是为了一个具有树结构的网络,允许节点之间传递解的估计值。我们证明了在不增加对方程的额外假设的情况下,改进的算法收敛。我们展示了当系统一致时,该算法收敛到解或最小范数解。我们还展示了在方程组不一致的情况下,改进的放松Kaczmarz算法当松弛参数接近0时收敛到加权最小二乘解。

英文摘要

The Kaczmarz algorithm is an iterative method for solving systems of linear equations. We introduce a modified Kaczmarz algorithm for solving systems of linear equations in a distributed environment, i.e. the equations within the system are distributed over multiple nodes within a network. The modification we introduce is designed for a network with a tree structure that allows for passage of solution estimates between the nodes in the network. We prove that the modified algorithm converges under no additional assumptions on the equations. We demonstrate that the algorithm converges to the solution, or the solution of minimal norm, when the system is consistent. We also demonstrate that in the case of an inconsistent system of equations, the modified relaxed Kaczmarz algorithm converges to a weighted least squares solution as the relaxation parameter approaches $0$.

1904.05423 2026-06-04 eess.SY cs.RO cs.SY

Game-Theoretic Modeling of Multi-Vehicle Interactions at Uncontrolled Intersections

多车辆在无信号交叉口交互的博弈建模

Nan Li, Yu Yao, Ilya Kolmanovsky, Ella Atkins, Anouck Girard

发表机构 * Robotics Institute, University of Michigan(密歇根大学机器人研究所)

AI总结 本文提出了一种基于博弈论的框架,用于建模自动驾驶和人工驾驶车辆在无信号交叉口的交互行为,通过参数化交叉口布局和几何结构,展示了模型在交通场景中的合理性和计算效率。

Comments 18 pages, 13 figures, 1 table

详情
AI中文摘要

受开发用于验证和验证自动驾驶系统在包含自动驾驶和人工驾驶车辆的交通中的仿真工具的需要所驱动,我们提出了一种用于建模无信号交叉口车辆交互的框架。所提出的交互建模方法基于博弈论,包含多个并发的领导者-追随者对,并考虑了常见的交通规则。我们参数化交叉口布局和几何结构以建模具有各种配置的无信号交叉口,并应用所提出的方法来建模这些交叉口处车辆的交互行为。基于各种交通场景的仿真结果,我们表明该模型表现出预期的交通行为,包括能够再现从真实世界交通数据中提取的场景以及在解决交通冲突方面的合理性能。该模型进一步基于服务水平交通质量评级系统进行验证,并展示了与传统多玩家博弈论模型相比的可管理计算复杂性。

英文摘要

Motivated by the need to develop simulation tools for verification and validation of autonomous driving systems operating in traffic consisting of both autonomous and human-driven vehicles, we propose a framework for modeling vehicle interactions at uncontrolled intersections. The proposed interaction modeling approach is based on game theory with multiple concurrent leader-follower pairs, and accounts for common traffic rules. We parameterize the intersection layouts and geometries to model uncontrolled intersections with various configurations, and apply the proposed approach to model the interactive behavior of vehicles at these intersections. Based on simulation results in various traffic scenarios, we show that the model exhibits reasonable behavior expected in traffic, including the capability of reproducing scenarios extracted from real-world traffic data and reasonable performance in resolving traffic conflicts. The model is further validated based on the level-of-service traffic quality rating system and demonstrates manageable computational complexity compared to traditional multi-player game-theoretic models.

1904.04685 2026-06-04 math.NA cs.LG cs.NA math.OC

On the approximation of the solution of partial differential equations by artificial neural networks trained by a multilevel Levenberg-Marquardt method

利用多级Levenberg-Marquardt方法训练人工神经网络近似偏微分方程解

Henri Calandra, Serge Gratton, Elisa Riccietti, Xavier Vasseur

发表机构 * INPT-IRIT, University of Toulouse and ENSEEIHT(INPT-IRIT,图卢兹大学和ENSEEIHT) ISAE-SUPAERO, University of Toulouse(ISAE-SUPAERO,图卢兹大学)

AI总结 本文研究了利用人工神经网络近似偏微分方程解的问题,提出了一种多级Levenberg-Marquardt方法用于训练,通过数值实验展示了该方法在训练人工神经网络时的高效性。

详情
AI中文摘要

本文关注利用人工神经网络近似偏微分方程解的问题。这里使用前馈神经网络来近似偏微分方程的解。将学习问题公式化为最小二乘问题,选择偏微分方程的残差作为损失函数,同时采用多级Levenberg-Marquardt方法作为训练方法。这种设置使我们能够进一步了解多级方法的潜力。确实,当最小二乘问题来源于人工神经网络的训练时,需要优化的变量并不受任何几何约束,标准的插值和限制算子不能再被使用。因此,提出了一种受代数多重网格方法启发的启发式方法,用于构造多级转移算子。数值实验显示,与标准的一级过程相比,新的多级优化方法在训练人工神经网络时表现出令人鼓舞的结果。

英文摘要

This paper is concerned with the approximation of the solution of partial differential equations by means of artificial neural networks. Here a feedforward neural network is used to approximate the solution of the partial differential equation. The learning problem is formulated as a least squares problem, choosing the residual of the partial differential equation as a loss function, whereas a multilevel Levenberg-Marquardt method is employed as a training method. This setting allows us to get further insight into the potential of multilevel methods. Indeed, when the least squares problem arises from the training of artificial neural networks, the variables subject to optimization are not related by any geometrical constraints and the standard interpolation and restriction operators cannot be employed any longer. A heuristic, inspired by algebraic multigrid methods, is then proposed to construct the multilevel transfer operators. Numerical experiments show encouraging results related to the efficiency of the new multilevel optimization method for the training of artificial neural networks, compared to the standard corresponding one-level procedure.

1904.03742 2026-06-04 eess.SY cs.RO cs.SY

Nonlinear Model Predictive Control for 3D Formation of Multirotor Micro Aerial Vehicles with Relative Sensing in Local Coordinates

多旋翼微型飞行器在局部坐标系中基于相对感知的三维编队非线性模型预测控制

I. Kagan Erunsal, Rodrigo Ventura, Alcherio Martinoli

发表机构 * Institute for Systems and Robotics (ISR), IST, Lisbon, Portugal(系统与机器人研究所(ISR),IST,里斯本,葡萄牙)

AI总结 本文提出了一种基于相对感知信息的多旋翼微型飞行器三维编队控制方法,采用集中式非线性模型预测控制策略,通过引入六自由度数学模型实现了对编队的鲁棒控制。

Comments 8 pages, 10 figures, IROS'2019 (submitted)

详情
AI中文摘要

复杂的任务如监视、建设、搜索和救援可以受益于多旋翼微型飞行器(MAVs)的机动性,以获得稳健、协作的系统行为和编队控制是这些复杂任务的重要组成部分。本文聚焦于利用仅有的相对传感信息实现多旋翼MAVs的三维编队控制。它提出了一种集中式非线性模型预测控制(NMPC)方法,在领导者-追随者方案中。介绍了一个现实的六自由度数学模型,并用于控制律。问题的制定基于NMPC和相对传感框架,相对于机器人的局部坐标系。这种制定使编队不依赖于全局或共同参考系的完整知识以及昂贵的全局定位传感器。通过考虑新的制定,提出了基于实时迭代(RTI)的最优控制问题(OCP)的解决方案。设计了一个广泛的场景来测试和验证该策略。结果评估表明,在模型不确定性和本地传感器噪声以及编队动态突然变化的情况下,取得了令人满意的鲁棒性能。

英文摘要

The complex tasks such as surveillance, construction, search and rescue can benefit of the maneuverability of multirotor Micro Aerial Vehicles (MAVs) to obtain robust, cooperative system behavior and formation control is a prominent component of the these complex tasks. This work focuses on the problem of three-dimensional formation control of multirotor MAVs by using exclusively relative sensory information. It proposes a centralized Nonlinear Model Predictive Control (NMPC) approach in a leader-follower scheme. A realistic six degrees of freedom mathematical model of a multirotor MAVs is introduced and leveraged in the control laws. The formulation of the problem is performed based on NMPC and relative sensing framework with respect to local coordinate frames of the robots. This type of formulation makes the formation independent of the full knowledge of global or common reference frames and the utilization of expensive global localization sensors. Real-time Iteration (RTI) based solution to optimal control problem (OCP) is proposed by taking the novel formulation into account. An extensive scenario is designed to test and validate the strategy. Evaluation of the results suggests that satisfactory robust performance is achieved and maintained under model uncertainty and noise in local sensors and even in cases where the dynamics of the formation suddenly changes.

1803.08552 2026-06-04 eess.SY cs.LG cs.SY

Linear model predictive safety certification for learning-based control

基于线性模型的预测安全认证用于基于学习的控制

Kim P. Wabersich, Melanie N. Zeilinger

发表机构 * ETH Zurich(苏黎世联邦理工学院)

AI总结 本文提出了一种模型预测安全认证(MPSC)方案,用于具有加性扰动的多边形线性系统,以解决基于学习的控制器缺乏安全保证的问题。通过引入MPC来确保系统在安全目标集内运行,并通过场景优化提出了一种实用的数据驱动设计方法。

详情
AI中文摘要

尽管已多次证明基于学习的控制器可以提供优越的性能,但它们通常缺乏安全保证。本文旨在通过引入一种模型预测安全认证(MPSC)方案来解决这一问题,该方案适用于具有加性扰动的多边形线性系统。该方案验证所提出的学习输入的安全性,并尽可能最小地修改以保持系统在给定约束集内。安全因此与模型预测控制器(MPC)提供可行轨迹以达到安全目标集的存在相关。一种鲁棒的MPC公式考虑了学习环境中模型通常不确定的事实,从而在所提出的MPSC策略下始终保证约束满足。MPSC方案可用于扩展任何潜在保守的安全状态集用于学习,并证明了一种迭代技术用于扩大安全集。最后,提出了一种使用场景优化的实用数据驱动设计方法用于MPSC。

英文摘要

While it has been repeatedly shown that learning-based controllers can provide superior performance, they often lack of safety guarantees. This paper aims at addressing this problem by introducing a model predictive safety certification (MPSC) scheme for polytopic linear systems with additive disturbances. The scheme verifies safety of a proposed learning-based input and modifies it as little as necessary in order to keep the system within a given set of constraints. Safety is thereby related to the existence of a model predictive controller (MPC) providing a feasible trajectory towards a safe target set. A robust MPC formulation accounts for the fact that the model is generally uncertain in the context of learning, which allows proving constraint satisfaction at all times under the proposed MPSC strategy. The MPSC scheme can be used in order to expand any potentially conservative set of safe states for learning and we prove an iterative technique for enlarging the safe set. Finally, a practical data-based design procedure for MPSC is proposed using scenario optimization.

1708.01945 2026-06-04 stat.ML cs.LG cs.NA math.NA

A Bootstrap Method for Error Estimation in Randomized Matrix Multiplication

一种用于随机矩阵乘法误差估计的自助法

Miles E. Lopes, Shusen Wang, Michael W. Mahoney

发表机构 * Department of Statistics University of California at Davis(加州大学戴维斯分校统计学系) Department of Computer Science Stevens Institute of Technology(史蒂文斯理工学院计算机科学系) International Computer Science Institute and Department of Statistics University of California at Berkeley(伯克利大学国际计算机科学研究所和统计学系)

AI总结 本文提出了一种自助方法,用于直接估计随机矩阵乘法(降维)的准确性,作为解决一般问题的原型设置。该方法在计算上不显著增加标准降维方法的成本,并通过插值技术实现,同时提供了理论和实证结果以证明其有效性。

详情
Journal ref
Journal of Machine Learning Research, 20(39): 1-40, 2019
AI中文摘要

近年来,随机方法在数值线性代数中受到越来越多的关注,作为解决大规模问题的一般方法。通常,这些方法的核心成分是某种形式的随机降维,这加速了计算,但也引入了随机近似误差。在这种情况下,降维步骤编码了成本与精度之间的权衡。然而,成本与精度之间的精确数值关系通常未知,因此用户可能难以准确知道(1)给定解的准确性,或(2)为了达到给定的准确性水平需要多少计算。在本文中,我们研究随机矩阵乘法(草图)作为解决这些问题的原型设置。作为解决方案,我们开发了一种自助方法,用于直接估计准确性作为降维函数的函数(而不是导出降维的最坏情况界限)。从计算角度来看,所提出的方法不显著增加标准草图方法的成本,并且这得益于一种“插值”技术。此外,我们提供了理论和实证结果,以证明所提出方法的有效性。

英文摘要

In recent years, randomized methods for numerical linear algebra have received growing interest as a general approach to large-scale problems. Typically, the essential ingredient of these methods is some form of randomized dimension reduction, which accelerates computations, but also creates random approximation error. In this way, the dimension reduction step encodes a tradeoff between cost and accuracy. However, the exact numerical relationship between cost and accuracy is typically unknown, and consequently, it may be difficult for the user to precisely know (1) how accurate a given solution is, or (2) how much computation is needed to achieve a given level of accuracy. In the current paper, we study randomized matrix multiplication (sketching) as a prototype setting for addressing these general problems. As a solution, we develop a bootstrap method for \emph{directly estimating} the accuracy as a function of the reduced dimension (as opposed to deriving worst-case bounds on the accuracy in terms of the reduced dimension). From a computational standpoint, the proposed method does not substantially increase the cost of standard sketching methods, and this is made possible by an "extrapolation" technique. In addition, we provide both theoretical and empirical results to demonstrate the effectiveness of the proposed method.

1904.01855 2026-06-04 math.OC cs.LG cs.SY eess.SY stat.ML

A Stochastic Interpretation of Stochastic Mirror Descent: Risk-Sensitive Optimality

随机镜像下降的随机解释:风险敏感最优性

Navid Azizan, Babak Hassibi

发表机构 * California Institute of Technology(加州理工学院)

AI总结 本文提出随机镜像下降(SMD)是一种风险敏感最优估计器,适用于非高斯分布的未知权重向量和加性噪声,同时引入了对称SMD(SSMD)的改进版本。

详情
AI中文摘要

随机镜像下降(SMD)是一种相对较新的算法家族,最近在优化、机器学习和控制领域得到了广泛应用。它可以被视为经典随机梯度算法(SGD)的推广,其中权重向量的更新不是沿着随机梯度的负方向进行,而是在一个由梯度的(严格凸)势函数定义的“镜像域”中进行。这种势函数及其产生的镜像域相比SGD提供了更大的算法灵活性。尽管许多SMD的性质已经在文献中得到研究,但本文提出了SMD的一个新解释,即当未知权重向量和加性噪声非高斯且属于指数分布族时,SMD是一个风险敏感最优估计器。分析还建议了SMD的一种改进版本,称为对称SMD(SSMD)。证明依赖于Bregman散度的一些简单性质,使我们能够将结果从二次函数和高斯分布扩展到某些凸函数和指数分布族,方式较为流畅。

英文摘要

Stochastic mirror descent (SMD) is a fairly new family of algorithms that has recently found a wide range of applications in optimization, machine learning, and control. It can be considered a generalization of the classical stochastic gradient algorithm (SGD), where instead of updating the weight vector along the negative direction of the stochastic gradient, the update is performed in a "mirror domain" defined by the gradient of a (strictly convex) potential function. This potential function, and the mirror domain it yields, provides considerable flexibility in the algorithm compared to SGD. While many properties of SMD have already been obtained in the literature, in this paper we exhibit a new interpretation of SMD, namely that it is a risk-sensitive optimal estimator when the unknown weight vector and additive noise are non-Gaussian and belong to the exponential family of distributions. The analysis also suggests a modified version of SMD, which we refer to as symmetric SMD (SSMD). The proofs rely on some simple properties of Bregman divergence, which allow us to extend results from quadratics and Gaussians to certain convex functions and exponential families in a rather seamless way.

1903.09698 2026-06-04 math.NA cs.LG cs.NA

CUR Decompositions, Approximations, and Perturbations

CUR分解、近似与扰动

Keaton Hamm, Longxiu Huang

发表机构 * Department of Mathematics, University of Arizona, Tucson, AZ 85719 USA(亚利桑那大学数学系,图森,亚利桑那州,85719 USA) Department of Mathematics, Vanderbilt University, Nashville, TN 37240 USA(范德比尔特大学数学系,纳什维尔,田纳西州,37240 USA)

AI总结 本文探讨了用于降维和低秩矩阵近似的CUR分解方法,综述并比较了文献中的不同观点,提出了一种新的精确CUR分解特征,并对噪声低秩矩阵的CUR近似进行了新颖的扰动分析,同时给出了新的列和行采样结果,证明了低秩矩阵的CUR分解在高概率下得以实现,并展示了这些采样方法在之前研究的扰动下的稳定性以及相关方法和界限的数值示例。

Comments 40 pages

详情
AI中文摘要

本文讨论了用于降维和低秩矩阵近似的CUR分解这一有用的工具。文献中关于该方法的各种观点被综合并进行比较和对比;其中包括对精确CUR分解的新特征。对噪声低秩矩阵的CUR近似的新型扰动分析被进行,该分析将这些近似与底层低秩部分的潜在CUR分解进行比较。此外,我们给出了新的列和行采样结果,允许得出结论:低秩矩阵的CUR分解以高概率获得。然后,我们展示了这些采样方法在之前研究的扰动下的稳定性,并提供了所讨论的方法和界限的数值示例。

英文摘要

This article discusses a useful tool in dimensionality reduction and low-rank matrix approximation called the CUR decomposition. Various viewpoints of this method in the literature are synergized and are compared and contrasted; included in this is a new characterization of exact CUR decompositions. A novel perturbation analysis is performed on CUR approximations of noisy versions of low-rank matrices, which compares them with the putative CUR decomposition of the underlying low-rank part. Additionally, we give new column and row sampling results which allow one to conclude that a CUR decomposition of a low-rank matrix is attained with high probability. We then illustrate the stability of these sampling methods under the perturbations studied before, and provide numerical illustrations of the methods and bounds discussed.

1810.04351 2026-06-04 math.AP cs.LG cs.NA math.NA math.PR

Properly-weighted graph Laplacian for semi-supervised learning

带权图拉普拉斯算子用于半监督学习

Jeff Calder, Dejan Slepcev

发表机构 * Department of Mathematics, University of Minnesota(明尼苏达大学数学系) Department of Mathematical Sciences, Carnegie Mellon University(卡内基梅隆大学数学科学系)

AI总结 本文提出了一种带权图拉普拉斯算子的方法,以解决传统半监督学习方法在标签数据与未标签数据比例降低时性能下降的问题,通过在拉普拉斯正则化中正确设置权重,使估计器在大样本极限下保持良好和稳定,证明了所提出的方法在无限样本极限下收敛于连续变分问题的光滑解。

详情
AI中文摘要

传统图拉普拉斯方法在半监督学习中,当标签数据与未标签数据的比例降低时,性能显著下降,这是由于图拉普拉斯的退化。最近有几种方法被提出以解决这个问题,然而我们表明其中一些方法在大数据极限下仍然不恰当。在本文中,我们展示了一种正确设置拉普拉斯正则化中权重的方法,使得估计器在大样本极限下保持良好和稳定。我们证明了我们的半监督学习算法在无限样本极限下收敛于一个连续变分问题的光滑解,该解连续地达到标签值。我们的方法快速且易于实现。

英文摘要

The performance of traditional graph Laplacian methods for semi-supervised learning degrades substantially as the ratio of labeled to unlabeled data decreases, due to a degeneracy in the graph Laplacian. Several approaches have been proposed recently to address this, however we show that some of them remain ill-posed in the large-data limit. In this paper, we show a way to correctly set the weights in Laplacian regularization so that the estimator remains well posed and stable in the large-sample limit. We prove that our semi-supervised learning algorithm converges, in the infinite sample size limit, to the smooth solution of a continuum variational problem that attains the labeled values continuously. Our method is fast and easy to implement.

1808.10788 2026-06-04 stat.ML cs.LG cs.NA math.NA

Data-driven discovery of PDEs in complex datasets

基于数据的复杂数据集中的PDE发现

Jens Berg, Kaj Nyström

发表机构 * Department of Mathematics, Uppsala University(乌普萨拉大学数学系)

AI总结 本文通过机器学习方法从复杂数据集中发现隐藏的偏微分方程,展示了如何通过数据转换和特征选择来揭示物理过程的PDE,并在非线性二次PDE和瑞典温度分布模拟中验证了该方法的有效性。

详情
AI中文摘要

许多科学和工程中的过程可以用偏微分方程(PDEs)来描述。传统上,PDEs是通过考虑物理基本原理来推导感兴趣的物理量之间的关系。另一种方法是测量感兴趣的量并使用深度学习来逆向工程描述物理过程的PDEs。本文使用机器学习,特别是深度学习,来从测量数据中发现复杂数据集中的PDEs。我们包括来自已知模型问题的数据示例和来自气象站的实测数据。我们展示了输入数据的必要转换相当于发现的PDE中的坐标转换,并详细阐述了特征和模型选择。证明了非线性二次PDE的动力学可以被普通微分方程准确描述,该方程由我们的深度学习算法自动发现。更有趣的是,我们在瑞典温度分布更复杂的模拟中也展示了类似的结果。

英文摘要

Many processes in science and engineering can be described by partial differential equations (PDEs). Traditionally, PDEs are derived by considering first principles of physics to derive the relations between the involved physical quantities of interest. A different approach is to measure the quantities of interest and use deep learning to reverse engineer the PDEs which are describing the physical process. In this paper we use machine learning, and deep learning in particular, to discover PDEs hidden in complex data sets from measurement data. We include examples of data from a known model problem, and real data from weather station measurements. We show how necessary transformations of the input data amounts to coordinate transformations in the discovered PDE, and we elaborate on feature and model selection. It is shown that the dynamics of a non-linear, second order PDE can be accurately described by an ordinary differential equation which is automatically discovered by our deep learning algorithm. Even more interestingly, we show that similar results apply in the context of more complex simulations of the Swedish temperature distribution.

1810.08754 2026-06-04 math.NA cs.LG cs.NA eess.SP

BCR-Net: a neural network based on the nonstandard wavelet form

BCR-Net: 一种基于非标准小波形式的神经网络

Yuwei Fan, Cindy Orozco Bohorquez, Lexing Ying

发表机构 * Department of Mathematics, Stanford University(斯坦福大学数学系) Institute for Computational and Mathematical Engineering, Stanford University(斯坦福大学计算与数学工程研究所) Department of Mathematics and ICME, Stanford University(斯坦福大学数学系和计算与数学工程研究所) Facebook AI Research, Menlo Park, CA(脸书人工智能研究(Menlo Park, CA))

AI总结 本文提出了一种基于非标准小波形式的神经网络架构,该架构通过将非标准形式的矩阵向量乘法算法表示为线性神经网络,其中每个多分辨率计算的尺度都由局部连接的线性子网络完成,并通过用更深层次和强大的非线性子网络替换线性子网络来扩展以解决非线性问题。

Comments 17 pages and 9 figures

详情
AI中文摘要

本文提出了一种新颖的神经网络架构,灵感来源于Beylkin、Coifman和Rokhlin在[Communications on Pure and Applied Mathematics, 44(2), 141-183]中提出的一种非标准形式。非标准形式是一种高效的基于小波的压缩方案,用于线性积分算子。在本文中,我们首先将非标准形式的矩阵向量乘法算法表示为线性神经网络,其中每个多分辨率计算的尺度都通过局部连接的线性子网络完成。为了处理非线性问题,我们提出了一种扩展,称为BCR-Net,通过将每个线性子网络替换为更深层次和更强大的非线性子网络。数值结果展示了新架构的有效性,通过近似出现在均质理论和随机计算中的非线性映射。

英文摘要

This paper proposes a novel neural network architecture inspired by the nonstandard form proposed by Beylkin, Coifman, and Rokhlin in [Communications on Pure and Applied Mathematics, 44(2), 141-183]. The nonstandard form is a highly effective wavelet-based compression scheme for linear integral operators. In this work, we first represent the matrix-vector product algorithm of the nonstandard form as a linear neural network where every scale of the multiresolution computation is carried out by a locally connected linear sub-network. In order to address nonlinear problems, we propose an extension, called BCR-Net, by replacing each linear sub-network with a deeper and more powerful nonlinear one. Numerical results demonstrate the efficiency of the new architecture by approximating nonlinear maps that arise in homogenization theory and stochastic computation.

1902.06094 2026-06-04 cs.NE cs.LG cs.SY eess.SY

Differentiable reservoir computing

可微 reservoir 计算

Lyudmila Grigoryeva, Juan-Pablo Ortega

发表机构 * Department of Mathematics and Statistics(数学与统计学系) Centre National de la Recherche Scientifique (CNRS)(国家科学研究中心(CNRS))

AI总结 本文研究了 reservoir 计算系统在不同可微性条件下的特性,提出了一种新的方法来分析 reservoir 过滤器的可微性,并展示了其在混沌动力系统学习中的应用。

Comments 60 pages

详情
AI中文摘要

在过去二十年中,大量努力致力于确定 reservoir 计算系统在所谓的回声状态(ESP)和衰减记忆(FMP)特性下的情况。这些重要特性在数学上相当于全局 reservoir 系统解的存在性和连续性。本文通过为非常一般类别的离散时间确定性输入刻画 reservoir 过滤器的可微性,从而补充了这一研究。这构成了对长期研究 ESP 和 FMP 的重要贡献,并特别与现有研究中的 ESP 输入依赖性相关联。文献中已证明可微性是学习混沌动力系统吸引子的关键特征。在分析情况下,利用泰勒定理构造了 reservoir 过滤器的 Volterra 型级数表示,并提供了相应的近似界。最后,这些结果的推论表明,任何衰减记忆过滤器都可以通过具有有限记忆的有限 Volterra 级数均匀近似。

英文摘要

Much effort has been devoted in the last two decades to characterize the situations in which a reservoir computing system exhibits the so-called echo state (ESP) and fading memory (FMP) properties. These important features amount, in mathematical terms, to the existence and continuity of global reservoir system solutions. That research is complemented in this paper with the characterization of the differentiability of reservoir filters for very general classes of discrete-time deterministic inputs. This constitutes a novel strong contribution to the long line of research on the ESP and the FMP and, in particular, links to existing research on the input-dependence of the ESP. Differentiability has been shown in the literature to be a key feature in the learning of attractors of chaotic dynamical systems. A Volterra-type series representation for reservoir filters with semi-infinite discrete-time inputs is constructed in the analytic case using Taylor's theorem and corresponding approximation bounds are provided. Finally, it is shown as a corollary of these results that any fading memory filter can be uniformly approximated by a finite Volterra series with finite memory.

1903.09136 2026-06-04 stat.ML cs.LG cs.SY eess.SP eess.SY

On Approximate Nonlinear Gaussian Message Passing On Factor Graphs

关于因子图上的近似非线性高斯信息传递

Eike Petersen, Christian Hoffmann, Philipp Rostalski

发表机构 * Institute for Electrical Engineering in Medicine, University of Lübeck(医学电气工程研究所,吕贝克大学)

AI总结 本文提出了一种基于因子图的近似高斯信息传递规则,用于处理确定性非线性变换节点,通过数值求积和Rauch-Tung-Striebel型近似方法,为非线性问题的求解提供了新的算法框架。

详情
Journal ref
2018 IEEE Statistical Signal Processing Workshop (SSP)
AI中文摘要

因子图近年来因其作为信号处理、估计和控制算法表示和构建的统一框架而受到越来越多的关注。因子图工具包中似乎未充分探索的一个能力是利用表格信息传递规则处理确定性非线性变换,如非线性滤波和平滑问题中的变换。在本贡献中,我们基于前向传递的数值求积过程和后向传递的Rauch-Tung-Striebel型近似方法,为满足马尔可夫性质的任意因子图中的确定性非线性变换节点提供了通用的前向(滤波)和后向(平滑)近似高斯信息传递规则。这些信息传递规则可用于推导许多使用因子图求解非线性问题的算法,如基于所提出信息传递规则的非线性修改Bryson-Frazier(MBF)平滑器的提出。

英文摘要

Factor graphs have recently gained increasing attention as a unified framework for representing and constructing algorithms for signal processing, estimation, and control. One capability that does not seem to be well explored within the factor graph tool kit is the ability to handle deterministic nonlinear transformations, such as those occurring in nonlinear filtering and smoothing problems, using tabulated message passing rules. In this contribution, we provide general forward (filtering) and backward (smoothing) approximate Gaussian message passing rules for deterministic nonlinear transformation nodes in arbitrary factor graphs fulfilling a Markov property, based on numerical quadrature procedures for the forward pass and a Rauch-Tung-Striebel-type approximation of the backward pass. These message passing rules can be employed for deriving many algorithms for solving nonlinear problems using factor graphs, as is illustrated by the proposition of a nonlinear modified Bryson-Frazier (MBF) smoother based on the presented message passing rules.

1903.08781 2026-06-04 cs.AR cs.DC cs.RO cs.SY eess.SY

Fault-Tolerant Nanosatellite Computing on a Budget

在预算内实现容错的纳卫星计算

Christian M. Fuchs, Nadia Murillo, Aske Plaat, Erik Van der Kouwe, Daniel Harsono, Todor Stefanov

发表机构 * Leiden Institute of Advanced Computer Science(莱顿先进计算机科学研究所) Leiden Observatory(莱顿天文台) Leiden University(莱顿大学) European Space Agency(欧洲航天局) Netherlands Research School for Astronomy(荷兰天文研究学校) Royal Netherlands Academy of Arts and Sciences(荷兰皇家艺术与科学学院)

AI总结 本文提出了一种基于线程级粗粒度锁步的软件容错方法,通过故障注入验证,利用FPGA实现 tiled MPSoC 架构,以满足未来科学和商业航天任务的高性能需求,同时提供强故障覆盖保障。

详情
Journal ref
Conference on Radiation Effects on Components and Systems 2018 (RADECS)
AI中文摘要

微卫星和纳卫星已成为各种商业和科学应用的流行平台,但目前主要适用于短时和低优先级空间任务,由于其可靠性较低。部分原因在于其依赖于便宜、低功能尺寸的COTS组件,这些组件最初为嵌入式和移动市场设计,传统硬件投票概念对此无效。软件容错概念已被证明对这类系统有效,但因成熟度低而被航天行业忽视,因为大多数仅在理论中研究。在实践中,载荷仪器和微型卫星的设计者通常被迫牺牲可靠性以满足尖端科学和创新商业应用所需的性能水平。因此,我们开发了一种基于线程级粗粒度锁步的软件容错方法,通过故障注入验证。为了提供强长期故障覆盖,我们的架构实现为FPGA上的tiled MPSoC,利用部分重新配置以及混合关键性。该架构可以在极低成本下满足当前和未来科学和商业航天任务的高性能需求,同时为长期任务提供必要的强故障覆盖保障。该架构是为一个为期4年的ESA项目开发的。与两家工业合作伙伴一起,我们正在开发原型,然后进行辐射测试。

英文摘要

Micro- and nanosatellites have become popular platforms for a variety of commercial and scientific applications, but today are considered suitable mainly for short and low-priority space missions due to their low reliability. In part, this can be attributed to their reliance upon cheap, low-feature size, COTS components originally designed for embedded and mobile-market applications, for which traditional hardware-voting concepts are ineffective. Software-fault-tolerance concepts have been shown effective for such systems, but have largely been ignored by the space industry due to low maturity, as most have only been researched in theory. In practice, designers of payload instruments and miniaturized satellites are usually forced to sacrifice reliability in favor deliver the level of performance necessary for cutting-edge science and innovative commercial applications. Thus, we developed a software-fault-tolerance-approach based upon thread-level coarse-grain lockstep, which was validated using fault-injection. To offer strong long-term fault coverage, our architecture is implemented as tiled MPSoC on an FPGA, utilizing partial reconfiguration, as well as mixed criticality. This architecture can satisfy the high performance requirements of current and future scientific and commercial space missions at very low cost, while offering the strong fault-coverage guarantees necessary for platform control even for missions with a long duration. This architecture was developed for a 4-year ESA project. Together with two industrial partners, we are developing a prototype to then undergo radiation testing.

1903.07353 2026-06-04 eess.SY cs.RO cs.SY

Joint axis estimation for fast and slow movements using weighted gyroscope and acceleration constraints

利用加权陀螺仪和加速度约束的快速和慢速运动联合轴估计

Fredrik Olsson, Thomas Seel, Dustin Lehmann, Kjartan Halvorsen

发表机构 * 1 4 Department of Information Technology, Uppsala University, Uppsala, Sweden 2 3 Department of Electrical Engineering 4 Department of Mecatronics, Tecnol \'o gico de Monterrey, Mexico City, Mexico

AI总结 本文提出了一种结合加速度和陀螺仪约束的新型方法,用于准确估计关节轴,通过调整残差权重和非线性加速度范数差,实现快速和慢速运动中的高精度估计。

Comments 8 pages, 4 figures, 1 table

详情
AI中文摘要

传感器到肢体的校准是惯性运动跟踪中的关键步骤。当两个肢体通过铰链关节连接时,例如在人体膝关节和手指关节以及许多机械臂中,必须在内在传感器坐标系中确定关节轴向量。存在通过求解基于运动关节约束的优化问题来确定这些坐标的办法,这些约束涉及测量的加速度或角速度。在本文中,我们证明仅使用其中一个约束会导致在快速或慢速运动中估计不准确。我们提出了一种基于结合两种约束的成本函数的新方法。通过仅使用加速度计和陀螺仪读数,避免了对均匀磁场的严格假设。为了结合两种传感器的优点,残差权重根据估计信号方差和非线性加速度范数差自动调整。该方法使用来自上肢外骨骼九种不同运动的真实数据进行评估。结果表明,与以往方法不同,所提出的方法在仅五秒后即可对所有快速和慢速运动实现准确的关节轴估计。

英文摘要

Sensor-to-segment calibration is a crucial step in inertial motion tracking. When two segments are connected by a hinge joint, for example in human knee and finger joints as well as in many robotic limbs, then the joint axis vector must be identified in the intrinsic sensor coordinate systems. There exist methods that identify these coordinates by solving an optimization problem that is based on kinematic joint constraints, which involve either the measured accelerations or the measured angular rates. In the current paper we demonstrate that using only one of these constraints leads to inaccurate estimates at either fast or slow motions. We propose a novel method based on a cost function that combines both constraints. The restrictive assumption of a homogeneous magnetic field is avoided by using only accelerometer and gyroscope readings. To combine the advantages of both sensor types, the residual weights are adjusted automatically based on the estimated signal variances and a nonlinear weighting of the acceleration norm difference. The method is evaluated using real data from nine different motions of an upper limb exoskeleton. Results show that, unlike previous approaches, the proposed method yields accurate joint axis estimation after only five seconds for all fast and slow motions.

1903.05817 2026-06-04 eess.SY cs.LG cs.SY

A New Approach for Distributed Hypothesis Testing with Extensions to Byzantine-Resilience

分布式假设检验的一种新方法及其对拜占庭容错的扩展

Aritra Mitra, John A. Richards, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University(普渡大学电气与计算机工程学院) Sandia National Laboratories(桑迪亚国家实验室)

AI总结 本文提出了一种新的分布式学习规则,用于在时间序列中联合观察资料下学习真实的状态,该方法不采用信念平均,且能扩展到处理网络中某些代理的恶意行为。

Comments To appear in the Proceedings of the American Control Conference, 2019

详情
AI中文摘要

我们研究了一个场景,其中一组代理各自接收部分信息的私人观察,试图协作学习能够解释他们随时间变化的联合观察资料的真实状态(在一组假设中)。为了解决这个问题,我们提出了一种分布式学习规则,与现有方法不同,它不采用任何形式的“信念平均”。具体来说,每个代理维护一个本地信念(对每个假设),该信念以贝叶斯方式更新,不受网络影响,同时维护一个实际信念,该信念在更新(除归一化外)时是其自身本地信念和邻居实际信念的最小值。在对代理信号结构和底层通信图的最小要求下,我们建立了所提出信念更新规则的一致性,即我们证明了代理的实际信念几乎必然渐近地集中在真实状态上。作为我们方法的一个关键好处,我们展示了我们的学习规则可以扩展到捕捉网络中某些代理的恶意行为,通过拜占庭对手模型。特别是,我们证明在适当的观察模型和网络拓扑条件下,每个非恶意代理几乎必然渐近地学习世界的真实状态。

英文摘要

We study a setting where a group of agents, each receiving partially informative private observations, seek to collaboratively learn the true state (among a set of hypotheses) that explains their joint observation profiles over time. To solve this problem, we propose a distributed learning rule that differs fundamentally from existing approaches, in the sense, that it does not employ any form of "belief-averaging". Specifically, every agent maintains a local belief (on each hypothesis) that is updated in a Bayesian manner without any network influence, and an actual belief that is updated (up to normalization) as the minimum of its own local belief and the actual beliefs of its neighbors. Under minimal requirements on the signal structures of the agents and the underlying communication graph, we establish consistency of the proposed belief update rule, i.e., we show that the actual beliefs of the agents asymptotically concentrate on the true state almost surely. As one of the key benefits of our approach, we show that our learning rule can be extended to scenarios that capture misbehavior on the part of certain agents in the network, modeled via the Byzantine adversary model. In particular, we prove that each non-adversarial agent can asymptotically learn the true state of the world almost surely, under appropriate conditions on the observation model and the network topology.