arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1811.06350 2026-06-04 eess.SY cs.MA cs.RO cs.SC cs.SY math.OC

Temporal viability regulation for control affine systems with applications to mobile vehicle coordination under time-varying motion constraints

时间可行性调节用于控制仿射系统及其在时间变化运动约束下移动车辆协调中的应用

Marcus Greiff, Zhiyong Sun, Anders Robertsson, Rolf Johansson

发表机构 * LCCC Linnaeus Center(LCCC 林纳厄中心) ELLIIT Excellence Center(ELLIIT 卓越中心) Lund University(Lund 大学)

AI总结 本文提出了一种时间可行性调节理论,用于一般动态控制系统,特别是控制仿射系统,通过时间变化约束函数参数化时间变化可行集,以确保动态控制系统在时间变化可行集中不变,从而执行时间依赖状态约束。同时,本文还给出了控制仿射系统可行控制输入存在的充分条件,并将该理论应用于移动车辆协调。

Comments 7 pages, 3 figures. Submitted to a conference for publication

详情
AI中文摘要

受控不变集和动态控制系统的可行性调节在许多控制和协调应用中发挥了重要作用。本文开发了一种针对一般动态控制系统的时序可行性调节理论,特别是针对控制仿射系统。时间变化可行集由时间变化约束函数参数化,旨在将动态控制系统调节为在时间变化可行集中不变,以执行时间依赖的状态约束。我们考虑了在定义时间可行集时的时间变化等式和不等式约束。我们还提出了控制仿射系统可行控制输入存在的充分条件。所开发的时序可行性调节理论应用于移动车辆协调。

英文摘要

Controlled invariant set and viability regulation of dynamical control systems have played important roles in many control and coordination applications. In this paper we develop a temporal viability regulation theory for general dynamical control systems, and in particular for control affine systems. The time-varying viable set is parameterized by time-varying constraint functions, with the aim to regulate a dynamical control system to be invariant in the time-varying viable set so that temporal state-dependent constraints are enforced. We consider both time-varying equality and inequality constraints in defining a temporal viable set. We also present sufficient conditions for the existence of feasible control input for the control affine systems. The developed temporal viability regulation theory is applied to mobile vehicle coordination.

1804.06128 2026-06-04 math.NA cs.CV cs.NA

Fast and Accurate Tensor Completion with Total Variation Regularized Tensor Trains

快速且准确的张量补全与总变分正则化张量列车

Ching-Yun Ko, Kim Batselier, Wenjian Yu, Ngai Wong

发表机构 * Department of Electrical and Electronic Engineering, The University of Hong Kong(香港大学电子工程系) Delft Center for Systems and Control, Delft University of Technology(代尔夫特理工大学系统与控制中心)

AI总结 本文提出了一种基于张量列车的新型张量补全方法,通过总变分和Tikhonov正则化提升了补全速度和可扩展性,尤其在已知数据极少时表现优异。

Comments 13 pages. Source code and supplemental materials are available via: https://github.com/IRENEKO/TTC Updates 11/13: included more comparisons and experimental results

详情
AI中文摘要

我们提出了一种基于张量列车的新张量补全方法。待补全的张量被建模为低秩张量列车,其中利用已知的张量条目及其坐标来更新张量列车。为图像和视频补全专门提出了一种新的张量列车初始化程序,已被证明能确保补全算法的快速收敛。张量列车框架还显示出能够轻松容纳总变分和Tikhonov正则化,因为它们具有低秩张量列车表示。图像和视频修复实验验证了所提方案在速度和可扩展性方面的优越性,在相似精度下比现有张量补全方法快了高达155倍。此外,我们展示了所提方案在已知数据极少时(例如,1%)相比现有算法具有显著优势。

英文摘要

We propose a new tensor completion method based on tensor trains. The to-be-completed tensor is modeled as a low-rank tensor train, where we use the known tensor entries and their coordinates to update the tensor train. A novel tensor train initialization procedure is proposed specifically for image and video completion, which is demonstrated to ensure fast convergence of the completion algorithm. The tensor train framework is also shown to easily accommodate Total Variation and Tikhonov regularization due to their low-rank tensor train representations. Image and video inpainting experiments verify the superiority of the proposed scheme in terms of both speed and scalability, where a speedup of up to 155X is observed compared to state-of-the-art tensor completion methods at a similar accuracy. Moreover, we demonstrate the proposed scheme is especially advantageous over existing algorithms when only tiny portions (say, 1%) of the to-be-completed images/videos are known.

1808.00113 2026-06-04 eess.SY cs.LG cs.RO cs.SY math.OC

Learning Stabilizable Dynamical Systems via Control Contraction Metrics

通过控制收缩度量学习可稳定化的动态系统

Sumeet Singh, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

发表机构 * Dept. of Aeronautics and Astronautics, Stanford University(航空航天系,斯坦福大学) Google Brain Robotics, New York(谷歌大脑机器人,纽约) Dept. of Mechanical Engineering, Massachusetts Institute of Technology(机械工程系,麻省理工学院)

AI总结 本文提出了一种新的框架,用于学习可稳定化的非线性动态系统,以实现机器人连续控制任务。核心方法是开发一种基于稳定性的控制理论正则化器,以确保学习到的系统可以配备一个稳健的控制器,能够稳定任何系统生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具,我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统,可以应用于复杂的欠驱动系统。在模拟平面四旋翼系统上验证了所提算法,并观察到与传统回归技术学习的模型相比,使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进,尤其是在使用少量示范示例时。结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性,以提高可靠性。

Comments To appear at WAFR 2018. v2: re-structured Sections 3 & 4 to improve clarity; expanded discussion on limitations & future work in Section 5; added details on training & validation, significantly expanded experiments

详情
AI中文摘要

我们提出了一种新的框架,用于学习可稳定化的非线性动态系统,以实现机器人连续控制任务。关键思想是开发一种基于稳定性的控制理论正则化器,用于动态拟合,该正则化器保证所学习的系统可以配备一个稳健的控制器,能够稳定任何系统可能生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具,我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统,可以应用于复杂的欠驱动系统。我们在模拟平面四旋翼系统上验证了所提算法,并观察到与传统回归技术学习的模型相比,使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进,尤其是在使用少量示范示例时。所呈现的结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性,以提高可靠性。

英文摘要

We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may generate. By leveraging tools from contraction theory, statistical learning, and convex optimization, we provide a general and tractable semi-supervised algorithm to learn stabilizable dynamics, which can be applied to complex underactuated systems. We validated the proposed algorithm on a simulated planar quadrotor system and observed notably improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when using a small number of demonstration examples. The results presented illustrate the need to infuse standard model-based reinforcement learning algorithms with concepts drawn from nonlinear control theory for improved reliability.

1811.03621 2026-06-04 cs.HC cs.CV cs.LG cs.SY eess.SY stat.ML

Satyam: Democratizing Groundtruth for Machine Vision

Satyam: 机器视觉领域地面真实数据的民主化

Hang Qiu, Krishna Chintalapudi, Ramesh Govindan

发表机构 * University of Southern California(南加州大学) Microsoft Research(微软研究院)

AI总结 本文提出Satyam系统,通过简化流程使非专业人员能够高效收集机器视觉的地面真实数据,从而提升自动驾驶、交通监控和视频监控系统的性能。

详情
AI中文摘要

机器学习的民主化已经导致了用于自动驾驶、交通监控和视频监控的基于机器学习的机器视觉系统。然而,没有大大简化收集地面真实数据的过程,真正的民主化就无法实现。这种地面真实数据的收集对于确保在不同条件下具有良好的性能是必要的。在本文中,我们提出了Satyam系统的设计和评估,这是一个首次出现的系统,使非专业人士能够以最小的努力启动机器视觉的地面真实数据收集任务。Satyam利用一个众包平台,亚马逊机械 Turk,并自动化了地面真实数据收集的几个具有挑战性的方面:创建和启动定制的网页用户界面任务以获取所需的真实数据,控制结果质量以应对垃圾邮件发送者和未经训练的工人,根据任务复杂性调整价格,过滤表现差的垃圾邮件发送者和工人,以及处理工人的报酬。我们通过几种流行的基准视觉数据集验证了Satyam,并展示了通过Satyam获得的真实数据与由训练专家获得的数据相当,并且在用于训练时提供匹配的机器学习性能。

英文摘要

The democratization of machine learning (ML) has led to ML-based machine vision systems for autonomous driving, traffic monitoring, and video surveillance. However, true democratization cannot be achieved without greatly simplifying the process of collecting groundtruth for training and testing these systems. This groundtruth collection is necessary to ensure good performance under varying conditions. In this paper, we present the design and evaluation of Satyam, a first-of-its-kind system that enables a layperson to launch groundtruth collection tasks for machine vision with minimal effort. Satyam leverages a crowdtasking platform, Amazon Mechanical Turk, and automates several challenging aspects of groundtruth collection: creating and launching of custom web-UI tasks for obtaining the desired groundtruth, controlling result quality in the face of spammers and untrained workers, adapting prices to match task complexity, filtering spammers and workers with poor performance, and processing worker payments. We validate Satyam using several popular benchmark vision datasets, and demonstrate that groundtruth obtained by Satyam is comparable to that obtained from trained experts and provides matching ML performance when used for training.

1803.08287 2026-06-04 eess.SY cs.AI cs.LG cs.RO cs.SY

Learning-based Model Predictive Control for Safe Exploration

基于学习的模型预测控制用于安全探索

Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

发表机构 * Vector Institute(向量研究所) Max Planck ETH Center for Learning Systems(马克斯·普朗克-ETH学习系统中心)

AI总结 本文提出了一种基于学习的模型预测控制方法,通过高斯过程先验假设构建可证明准确的轨迹置信区间,从而提供可证明的高概率安全保证,用于动态系统的安全高效探索和学习。

Comments Proc. of the Conference on Decision and Control, 2018

详情
AI中文摘要

基于学习的方法在没有显著系统先验知识的情况下成功解决了复杂控制任务。然而,这些方法通常不提供任何安全保证,这限制了它们在安全关键的现实应用中的使用。在本文中,我们提出了一种基于学习的模型预测控制方案,可以提供可证明的高概率安全保证。为此,我们利用高斯过程先验对动态特性进行正则性假设,以构建可证明准确的预测轨迹置信区间。与以往的方法不同,我们不假设模型不确定性是独立的。基于这些预测,我们保证轨迹满足安全约束。此外,我们使用终端集约束递归地保证在每个迭代中都存在安全的控制动作。在我们的实验中,我们展示了所提出算法可以安全且高效地探索和学习动态系统。

英文摘要

Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety guarantees. To this end, we exploit regularity assumptions on the dynamics in terms of a Gaussian process prior to construct provably accurate confidence intervals on predicted trajectories. Unlike previous approaches, we do not assume that model uncertainties are independent. Based on these predictions, we guarantee that trajectories satisfy safety constraints. Moreover, we use a terminal set constraint to recursively guarantee the existence of safe control actions at every iteration. In our experiments, we show that the resulting algorithm can be used to safely and efficiently explore and learn about dynamic systems.

1811.02052 2026-06-04 eess.SY cs.LG cs.MA cs.SY

Managing engineering systems with large state and action spaces through deep reinforcement learning

通过深度强化学习管理具有大状态和动作空间的工程系统

C. P. Andriotis, K. G. Papakonstantinou

发表机构 * Department of Civil & Environmental Engineering(土木与环境工程系) The Pennsylvania State University(宾夕法尼亚州立大学) University Park(大学公园) USA(美国)

AI总结 本文提出了一种集成的深度强化学习框架,用于管理具有大状态和动作空间的多组件工程系统,通过开发深度集中多智能体Actor-Critic(DCMAC)方法,提供高效生命周期策略,以应对高维空间中的复杂决策问题。

详情
AI中文摘要

工程系统的决策可以高效地建模为马尔可夫决策过程(MDP)或部分可观测马尔可夫决策过程(POMDP)。典型的MDP和POMDP解决方案利用离线环境知识,为相对较小的状态和动作空间提供详细策略。然而,在大型多组件系统中,这些空间的规模容易爆炸,因为系统状态和动作随着组件数量的增加呈指数级增长,而整个系统的环境动态难以用显式形式描述,只能通过数值模拟器获取。在本工作中,为了解决这些问题,引入了一个集成的深度强化学习(DRL)框架。开发了深度集中多智能体Actor-Critic(DCMAC),一种离线策略的Actor-Critic DRL方法,为在高维空间中运行的大型多组件系统提供高效的生命周期策略。除了深度函数近似,用于参数化大型状态空间外,DCMAC还采用了动作的因子化表示,能够指定个体化的组件级和子系统级决策,同时保持整个系统的集中价值函数。DCMAC在与深度Q网络(DQN)解决方案和精确策略相比时表现良好,并在基于时间、基于条件和周期性策略的优化基线之上表现更优。

英文摘要

Decision-making for engineering systems can be efficiently formulated as a Markov Decision Process (MDP) or a Partially Observable MDP (POMDP). Typical MDP and POMDP solution procedures utilize offline knowledge about the environment and provide detailed policies for relatively small systems with tractable state and action spaces. However, in large multi-component systems the sizes of these spaces easily explode, as system states and actions scale exponentially with the number of components, whereas environment dynamics are difficult to be described in explicit forms for the entire system and may only be accessible through numerical simulators. In this work, to address these issues, an integrated Deep Reinforcement Learning (DRL) framework is introduced. The Deep Centralized Multi-agent Actor Critic (DCMAC) is developed, an off-policy actor-critic DRL approach, providing efficient life-cycle policies for large multi-component systems operating in high-dimensional spaces. Apart from deep function approximations that parametrize large state spaces, DCMAC also adopts a factorized representation of the system actions, being able to designate individualized component- and subsystem-level decisions, while maintaining a centralized value function for the entire system. DCMAC compares well against Deep Q-Network (DQN) solutions and exact policies, where applicable, and outperforms optimized baselines that are based on time-based, condition-based and periodic policies.

1811.02033 2026-06-04 stat.ML cs.LG cs.NA math.AP math.NA

Physics-Informed Generative Adversarial Networks for Stochastic Differential Equations

基于随机微分方程的物理信息生成对抗网络

Liu Yang, Dongkun Zhang, George Em Karniadakis

发表机构 * Division of Applied Mathematics, Brown University, Providence, RI 02912, USA(应用数学系,布朗大学,普罗维德恩,罗德岛州,02912,美国)

AI总结 本文提出了一种新的物理信息生成对抗网络(PI-GANs),通过统一解决随机问题中的正向、逆向和混合问题,利用自动微分将物理定律编码到GAN架构中,展示了PI-GANs在高维随机微分方程求解中的准确性和有效性。

详情
AI中文摘要

我们开发了一种新的物理信息生成对抗网络(PI-GANs),以统一的方式解决基于有限散射测量的正向、逆向和混合随机问题。与仅依赖数据训练的常规GANs不同,我们通过自动微分将 governing 物理定律以随机微分方程(SDEs)的形式编码到GANs的架构中。特别地,我们应用了Wasserstein GANs with gradient penalty(WGAN-GP),因为其比vanilla GANs具有更强的稳定性。我们首先测试了WGAN-GP在基于来自稀疏放置传感器的同时读取数据中不同相关长度的高斯过程的近似能力。我们获得了良好的生成随机过程对目标过程的近似,即使输入噪声维度与目标随机过程的有效维度不匹配。我们还研究了判别器和生成器的过拟合问题,并发现生成器也出现过拟合,除了判别器之外。随后,我们考虑了解决需要近似三个随机过程(即解、激励和扩散系数)的椭圆SDEs。我们使用了三个生成器,其中两个是前馈深度神经网络(DNNs),另一个是由SDE诱导的神经网络。根据数据,我们使用一个或多个前馈DNNs作为PI-GANs中的判别器。在此,我们展示了PI-GANs在解决最多30维的SDEs中的准确性和有效性,但原则上,PI-GANs可以处理非常高的维数问题,只要有更多的传感器数据,并且计算成本具有低多项式增长。

英文摘要

We developed a new class of physics-informed generative adversarial networks (PI-GANs) to solve in a unified manner forward, inverse and mixed stochastic problems based on a limited number of scattered measurements. Unlike standard GANs relying only on data for training, here we encoded into the architecture of GANs the governing physical laws in the form of stochastic differential equations (SDEs) using automatic differentiation. In particular, we applied Wasserstein GANs with gradient penalty (WGAN-GP) for its enhanced stability compared to vanilla GANs. We first tested WGAN-GP in approximating Gaussian processes of different correlation lengths based on data realizations collected from simultaneous reads at sparsely placed sensors. We obtained good approximation of the generated stochastic processes to the target ones even for a mismatch between the input noise dimensionality and the effective dimensionality of the target stochastic processes. We also studied the overfitting issue for both the discriminator and generator, and we found that overfitting occurs also in the generator in addition to the discriminator as previously reported. Subsequently, we considered the solution of elliptic SDEs requiring approximations of three stochastic processes, namely the solution, the forcing, and the diffusion coefficient. We used three generators for the PI-GANs, two of them were feed forward deep neural networks (DNNs) while the other one was the neural network induced by the SDE. Depending on the data, we employed one or multiple feed forward DNNs as the discriminators in PI-GANs. Here, we have demonstrated the accuracy and effectiveness of PI-GANs in solving SDEs for up to 30 dimensions, but in principle, PI-GANs could tackle very high dimensional problems given more sensor data with low-polynomial growth in computational cost.

1811.01774 2026-06-04 cs.SE cs.RO cs.SY eess.SY

SCAV'18: Report of the 2nd International Workshop on Safe Control of Autonomous Vehicles

SCAV'18: 第二届安全控制自主车辆国际研讨会报告

Mario Gleirscher, Sven Linker, Stefan Kugele

发表机构 * University of York, UK(英国约克大学) Technical University of Munich, Germany(德国慕尼黑技术大学) University of Liverpool, UK(英国利兹大学)

AI总结 本文总结了第二届SCAV研讨会的讨论、开放问题、关键信息和结论,探讨了自主车辆安全控制的核心挑战与未来方向。

Comments 3 pages, 1 table

详情
AI中文摘要

本报告总结了第二届SCAV研讨会的讨论、开放问题、关键信息和结论。

英文摘要

This report summarizes the discussions, open issues, take-away messages, and conclusions of the 2nd SCAV workshop.

1811.01721 2026-06-04 math.NA cs.LG cs.NA

Rethinking floating point for deep learning

重新思考深度学习中的浮点运算

Jeff Johnson

发表机构 * Facebook AI Research(脸书人工智能研究)

AI总结 本文提出了一种新的混合对数乘/线性加、Kulisch累加和渐缩编码的8位对数浮点格式,以提高能效并保持精度,同时在不重新训练网络的情况下,实现了与原始float32 ResNet-50模型在ImageNet上的高精度性能。

详情
AI中文摘要

减少神经网络硬件开销以实现更快或更低功耗的推理和训练是研究的活跃领域。使用整数乘加的统一量化已得到充分研究,这需要学习许多量化参数、微调训练或其他先决条件。很少有努力致力于改进浮点相对于此基准线;它仍然效率低下,字大小减少导致所需动态范围的剧烈损失。我们通过一种新的混合对数乘/线性加、Kulisch累加和Gustafson的正数格式的渐缩编码,将浮点改进为在28nm ASIC工艺上比等效位宽的整数硬件更节能,同时在8位中保持精度。通过仅使用四舍五入到最近的偶数,无需网络重新训练,所有数学和float32参数的替换都可以直接使用。此开源的8位对数浮点在ImageNet上达到原始float32 ResNet-50 CNN模型的top-1精度为0.9%和top-5精度为0.2%。与int8量化不同,它仍然是通用的浮点运算,可以即开即用。我们的8/38位对数浮点乘加在28nm工艺上综合并功率分析,其功率为8/32位整数乘加的1.12倍,面积为0.96倍。在16位时,我们的对数浮点乘加的功率为IEEE 754 float16融合乘加的0.59倍,面积为0.68倍,保持相同的显著位精度和动态范围,证明了其在训练ASICs中的实用性。

英文摘要

Reducing hardware overhead of neural networks for faster or lower power inference and training is an active area of research. Uniform quantization using integer multiply-add has been thoroughly investigated, which requires learning many quantization parameters, fine-tuning training or other prerequisites. Little effort is made to improve floating point relative to this baseline; it remains energy inefficient, and word size reduction yields drastic loss in needed dynamic range. We improve floating point to be more energy efficient than equivalent bit width integer hardware on a 28 nm ASIC process while retaining accuracy in 8 bits with a novel hybrid log multiply/linear add, Kulisch accumulation and tapered encodings from Gustafson's posit format. With no network retraining, and drop-in replacement of all math and float32 parameters via round-to-nearest-even only, this open-sourced 8-bit log float is within 0.9% top-1 and 0.2% top-5 accuracy of the original float32 ResNet-50 CNN model on ImageNet. Unlike int8 quantization, it is still a general purpose floating point arithmetic, interpretable out-of-the-box. Our 8/38-bit log float multiply-add is synthesized and power profiled at 28 nm at 0.96x the power and 1.12x the area of 8/32-bit integer multiply-add. In 16 bits, our log float multiply-add is 0.59x the power and 0.68x the area of IEEE 754 float16 fused multiply-add, maintaining the same signficand precision and dynamic range, proving useful for training ASICs as well.

1605.03364 2026-06-04 math.NA cs.LG cs.NA stat.ML

Active Uncertainty Calibration in Bayesian ODE Solvers

在贝叶斯微分方程求解器中的主动不确定性校准

Hans Kersting, Philipp Hennig

发表机构 * Max-Planck-Institute for Intelligent Systems(马克斯·普朗克智能系统研究所)

AI总结 本文研究了如何在贝叶斯微分方程求解器中平衡计算成本与概率校准,提出了一种基于过滤的方法Bayesian Quadrature filtering (BQF),通过主动学习梯度测量的不精确性来提高不确定性校准。

Comments 10 pages, 3 figures, published at UAI 2016. Changes for Version 3: fixed minor index mistake in equation (14) (q-1-i instead of q+1-i on top of the product)

详情
Journal ref
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence (UAI2016) 309--3018
AI中文摘要

在统计学和机器学习中,对微分方程(ODEs)求解器的兴趣正在重新增长,这些求解器返回概率测度而非点估计。最近,Conrad等人引入了一种基于采样的方法类,这些方法在特定意义上是'well-calibrated'的。但是,这些方法的计算成本显著高于经典方法。另一方面,Schober等人指出经典Runge-Kutta ODE求解器与高斯滤波器之间存在精确的联系,这只能提供粗糙的概率校准,但计算开销可忽略不计。通过将ODE的解视为线性高斯SDE中的近似推断,我们研究了一类概率ODE求解器,这些求解器在计算成本和概率校准之间取得了平衡,并识别出不准确的梯度测量是不确定性的关键来源。我们提出了一种新的基于过滤的方法Bayesian Quadrature filtering (BQF),该方法利用贝叶斯二次法主动学习梯度测量的不精确性,通过收集多个梯度评估来提高不确定性校准。

英文摘要

There is resurging interest, in statistics and machine learning, in solvers for ordinary differential equations (ODEs) that return probability measures instead of point estimates. Recently, Conrad et al. introduced a sampling-based class of methods that are 'well-calibrated' in a specific sense. But the computational cost of these methods is significantly above that of classic methods. On the other hand, Schober et al. pointed out a precise connection between classic Runge-Kutta ODE solvers and Gaussian filters, which gives only a rough probabilistic calibration, but at negligible cost overhead. By formulating the solution of ODEs as approximate inference in linear Gaussian SDEs, we investigate a range of probabilistic ODE solvers, that bridge the trade-off between computational cost and probabilistic calibration, and identify the inaccurate gradient measurement as the crucial source of uncertainty. We propose the novel filtering-based method Bayesian Quadrature filtering (BQF) which uses Bayesian quadrature to actively learn the imprecision in the gradient measurement by collecting multiple gradient evaluations.

1810.08907 2026-06-04 math.OC cs.LG cs.NA math.CA math.NA stat.ML

Understanding the Acceleration Phenomenon via High-Resolution Differential Equations

通过高分辨率微分方程理解加速现象

Bin Shi, Simon S. Du, Michael I. Jordan, Weijie J. Su

发表机构 * Florida International University(佛罗里达国际大学) Carnegie Mellon University(卡内基梅隆大学) University of California, Berkeley(加州大学伯克利分校) University of Pennsylvania(宾夕法尼亚大学)

AI总结 本文通过高分辨率微分方程研究优化算法的加速现象,提出了一种新的极限过程,能够区分Nesterov加速梯度法和Polyak重力球方法,并揭示了NAG-C在非强凸函数下的收敛特性。

Comments 82 pages, 11 figures

详情
AI中文摘要

基于梯度的优化算法可以从极限常微分方程(ODEs)的角度进行研究。受现有ODEs无法区分Nesterov加速梯度法(用于强凸函数)和Polyak重力球方法的启发,我们研究了一种替代的极限过程,以获得高分辨率的ODEs。我们证明这些ODEs允许一个通用的Lyapunov函数框架,用于连续和离散时间下的收敛分析。我们还证明这些ODEs是底层算法更准确的替代品;特别是,它们不仅区分NAG-SC和Polyak重力球方法,还允许识别一个称为“梯度修正”的项,该项存在于NAG-SC中但不在重力球方法中,并负责两种方法收敛性质的差异。我们还利用高分辨率ODE框架研究Nesterov加速梯度法用于(非强凸)函数,揭示了一个此前未知的结果——NAG-C以反立方速率最小化平方梯度范数。最后,通过修改NAG-C的高分辨率ODE,我们获得了一族新的优化方法,这些方法被证明在光滑凸函数上保持NAG-C的加速收敛率。

英文摘要

Gradient-based optimization algorithms can be studied from the perspective of limiting ordinary differential equations (ODEs). Motivated by the fact that existing ODEs do not distinguish between two fundamentally different algorithms---Nesterov's accelerated gradient method for strongly convex functions (NAG-SC) and Polyak's heavy-ball method---we study an alternative limiting process that yields high-resolution ODEs. We show that these ODEs permit a general Lyapunov function framework for the analysis of convergence in both continuous and discrete time. We also show that these ODEs are more accurate surrogates for the underlying algorithms; in particular, they not only distinguish between NAG-SC and Polyak's heavy-ball method, but they allow the identification of a term that we refer to as "gradient correction" that is present in NAG-SC but not in the heavy-ball method and is responsible for the qualitative difference in convergence of the two methods. We also use the high-resolution ODE framework to study Nesterov's accelerated gradient method for (non-strongly) convex functions, uncovering a hitherto unknown result---that NAG-C minimizes the squared gradient norm at an inverse cubic rate. Finally, by modifying the high-resolution ODE of NAG-C, we obtain a family of new optimization methods that are shown to maintain the accelerated convergence rates of NAG-C for smooth convex functions.

1810.13084 2026-06-04 math.OC cs.DC cs.LG cs.MA cs.SY eess.SY

Provably Accelerated Randomized Gossip Algorithms

可证明加速的随机传话算法

Nicolas Loizou, Michael Rabbat, Peter Richtárik

发表机构 * The University of Edinburgh, UK(爱丁堡大学) Facebook AI Research, Montreal(脸书人工智能研究) KAUST, KSA(王国立科技大学)

AI总结 本文提出了一种可证明加速的随机传话算法,用于解决平均一致性问题。该算法受到最近开发的加速随机Kaczmarz方法的启发,该方法是解决线性系统问题的流行方法。在每次传话迭代中,网络中的所有节点都会更新它们的值,但只有成对的节点交换私有信息。还展示了在流行无线传感器网络上的数值实验,展示了我们协议的优势。

详情
AI中文摘要

在本文中,我们提出了新的可证明加速的传话算法,用于解决平均一致性问题。所提出的协议受到最近开发的加速随机Kaczmarz方法的启发,这是一种用于解决线性系统问题的流行方法。在每次传话迭代中,网络中的所有节点都会更新它们的值,但只有成对的节点交换它们的私有信息。还展示了在流行无线传感器网络上的数值实验,展示了我们协议的优势。

英文摘要

In this work we present novel provably accelerated gossip algorithms for solving the average consensus problem. The proposed protocols are inspired from the recently developed accelerated variants of the randomized Kaczmarz method - a popular method for solving linear systems. In each gossip iteration all nodes of the network update their values but only a pair of them exchange their private information. Numerical experiments on popular wireless sensor networks showing the benefits of our protocols are also presented.

1806.03085 2026-06-04 stat.ML cs.LG cs.NA math.NA

A Stein variational Newton method

一种Stein变分牛顿方法

Gianluca Detommaso, Tiangang Cui, Alessio Spantini, Youssef Marzouk, Robert Scheichl

发表机构 * University of Bath(巴斯大学) The Alan Turing Institute(艾伦·图灵研究所) Monash University(莫纳什大学) Massachusetts Institute of Technology(麻省理工学院) Heidelberg University(海德堡大学)

AI总结 本文提出了一种基于Stein变分梯度下降(SVGD)的改进方法,通过引入二阶信息加速并推广了该算法,实现了函数空间中的牛顿迭代,并展示了在多个测试案例中显著的计算效率提升。

Comments 18 pages, 7 figures

详情
Journal ref
NIPS 2018
AI中文摘要

Stein变分梯度下降(SVGD)最近被提出作为一种通用的非参数变分推断算法 [Liu & Wang, NIPS 2016]:它通过在再生核希尔伯特空间上实现一种函数梯度下降的形式来最小化目标分布与其近似分布之间的Kullback-Leibler散度。在本文中,我们通过引入二阶信息来加速和推广SVGD算法,从而在函数空间中近似出一种牛顿迭代。我们还展示了二阶信息如何导致更有效的核选择。我们在多个测试案例中观察到相对于原始SVGD算法有显著的计算效率提升。

英文摘要

Stein variational gradient descent (SVGD) was recently proposed as a general purpose nonparametric variational inference algorithm [Liu & Wang, NIPS 2016]: it minimizes the Kullback-Leibler divergence between the target distribution and its approximation by implementing a form of functional gradient descent on a reproducing kernel Hilbert space. In this paper, we accelerate and generalize the SVGD algorithm by including second-order information, thereby approximating a Newton-like iteration in function space. We also show how second-order information can lead to more effective choices of kernel. We observe significant computational gains over the original SVGD algorithm in multiple test cases.

1810.11505 2026-06-04 eess.SY cs.LG cs.SY

Stability-certified reinforcement learning: A control-theoretic perspective

具有稳定性认证的强化学习:控制论视角

Ming Jin, Javad Lavaei

发表机构 * Department of Industrial Engineering and Operations Research, University of California, Berkeley(工业工程与运营管理系,加州大学伯克利分校) Department of Industrial Engineering and Operations Research, and the Tsinghua-Berkeley Shenzhen Institute, University of California, Berkeley(工业工程与运营管理系,以及清华-伯克利深圳研究院,加州大学伯克利分校)

AI总结 本文从控制论角度研究强化学习策略与非线性动力系统连接时的稳定性认证问题,提出了一种基于半定规划可行性的方法,通过调节策略的输入输出梯度来获得鲁棒稳定性保证,并通过实验验证了该方法在多飞行编队和电力系统频率调节任务中的有效性。

详情
AI中文摘要

我们研究了强化学习策略与非线性动力系统连接时稳定性认证的重要问题。我们证明,通过调节策略的输入输出梯度,可以基于所提出的半定规划可行性问题获得强鲁棒稳定性保证。该方法能够通过利用问题特定的结构来认证一组稳定的控制器;进一步地,我们分析并建立了其(非)保守性。在两个分布式控制任务,即多飞行编队和电力系统频率调节上的实证评估表明,强化学习代理在稳定性认证的参数空间内能够表现出高性能,并且在长期内表现出稳定的学習行为。

英文摘要

We investigate the important problem of certifying stability of reinforcement learning policies when interconnected with nonlinear dynamical systems. We show that by regulating the input-output gradients of policies, strong guarantees of robust stability can be obtained based on a proposed semidefinite programming feasibility problem. The method is able to certify a large set of stabilizing controllers by exploiting problem-specific structures; furthermore, we analyze and establish its (non)conservatism. Empirical evaluations on two decentralized control tasks, namely multi-flight formation and power system frequency regulation, demonstrate that the reinforcement learning agents can have high performance within the stability-certified parameter space, and also exhibit stable learning behaviors in the long run.

1804.04310 2026-06-04 cs.IT cs.LG cs.NA math.IT math.NA math.OC

Exact Reconstruction of Euclidean Distance Geometry Problem Using Low-rank Matrix Completion

利用低秩矩阵补全进行欧几里得距离几何问题的精确重构

Abiy Tasissa, Rongjie Lai

发表机构 * Department of Mathematics, Rensselaer Polytechnic Institute(罗切斯特理工学院数学系)

AI总结 本文提出了一种利用低秩矩阵补全方法来解决欧几里得距离几何问题的框架,通过引入双基方法理论分析重构问题,并在不同三维数据和蛋白质分子上的数值测试验证了算法的有效性和效率。

Comments 28 pages, revised proof of Theorem 1, added proof of form of $H^{-1}$, presentation improved

详情
AI中文摘要

欧几里得距离几何问题出现在许多应用中,从计算化学中确定分子构象到传感器网络中的定位。当距离信息不完整时,该问题可以被公式化为核范数最小化问题。在本文中,该最小化程序被重新表述为一个低秩r Gram矩阵相对于合适基底的矩阵补全问题。众所周知的限制等距性质在此场景中无法满足。相反,引入了双基方法来理论分析重构问题。如果Gram矩阵满足某些与参数ν相关的相干条件,则主要结果表明,从O(nrνlog²(n))均匀随机样本中可以以很高的概率恢复n个点的底层配置。计算上,设计了简单且快速的算法来解决欧几里得距离几何问题。在不同三维数据和蛋白质分子上的数值测试验证了所提算法的有效性和效率。

英文摘要

The Euclidean distance geometry problem arises in a wide variety of applications, from determining molecular conformations in computational chemistry to localization in sensor networks. When the distance information is incomplete, the problem can be formulated as a nuclear norm minimization problem. In this paper, this minimization program is recast as a matrix completion problem of a low-rank $r$ Gram matrix with respect to a suitable basis. The well known restricted isometry property can not be satisfied in this scenario. Instead, a dual basis approach is introduced to theoretically analyze the reconstruction problem. If the Gram matrix satisfies certain coherence conditions with parameter $ν$, the main result shows that the underlying configuration of $n$ points can be recovered with very high probability from $O(nrν\log^{2}(n))$ uniformly random samples. Computationally, simple and fast algorithms are designed to solve the Euclidean distance geometry problem. Numerical tests on different three dimensional data and protein molecules validate effectiveness and efficiency of the proposed algorithms.

1810.11178 2026-06-04 eess.SY cs.LG cs.SY math.OC

Using solar and load predictions in battery scheduling at the residential level

在住宅层面利用太阳能和负载预测进行电池调度

Richard Bean, Hina Khan

发表机构 * Redback Technologies Brisbane, Australia(红背技术布里斯班,澳大利亚) School of ITEE The University of Queensland, Australia(信息技术工程学院昆士兰大学,澳大利亚)

AI总结 本文提出了一种新的电池调度算法,通过预测负载和太阳能发电量来优化住宅用户的电力成本,该算法在不同电价下可实现1%至10%的节能效果。

Comments This paper was presented at the 8th Solar Integration Workshop and published in the workshop's proceedings

详情
AI中文摘要

智能太阳能逆变器可用于存储、监控和管理家庭的太阳能能源。我们描述了一种带有电池的智能太阳能逆变器系统,该系统可以自动运行或通过网络接收命令以在给定速率充电和放电。为了使电池存储在财务上可行并对消费者有利,可以采用有效的电池调度算法。特别是当地区内实施分时电价时,在某些情况下可以调度电池以节省个人客户的费用,相比自动模式。因此,本文提出了并评估了为太阳能能源住宅消费者设计的新电池调度算法。所提出的电池调度算法优化了住宅用户的下一24小时电力成本。通过根据负载和太阳能发电量的预测来控制电池存储系统的充放电,实现成本最小化。调度问题被公式化为一个线性规划问题。我们使用几个月的每小时负载和光伏数据对83个逆变器进行了计算机模拟。模拟结果表明,影响优化可行性的关键因素是电价和每个逆变器的光伏与负载比率。根据电价,可以预期比自动方法节省1%至10%。本文中所用的预测方法也显示出优于基本“持久性”预测方法。我们还检查了提高预测准确性和优化有效性的方法。

英文摘要

Smart solar inverters can be used to store, monitor and manage a home's solar energy. We describe a smart solar inverter system with battery which can either operate in an automatic mode or receive commands over a network to charge and discharge at a given rate. In order to make battery storage financially viable and advantageous to the consumers, effective battery scheduling algorithms can be employed. Particularly, when time-of-use tariffs are in effect in the region of the inverter, it is possible in some cases to schedule the battery to save money for the individual customer, compared to the "automatic" mode. Hence, this paper presents and evaluates the performance of a novel battery scheduling algorithm for residential consumers of solar energy. The proposed battery scheduling algorithm optimizes the cost of electricity over next 24 hours for residential consumers. The cost minimization is realized by controlling the charging/discharging of battery storage system based on the predictions for load and solar power generation values. The scheduling problem is formulated as a linear programming problem. We performed computer simulations over 83 inverters using several months of hourly load and PV data. The simulation results indicate that key factors affecting the viability of optimization are the tariffs and the PV to Load ratio at each inverter. Depending on the tariff, savings of between 1% and 10% can be expected over the automatic approach. The prediction approach used in this paper is also shown to out-perform basic "persistence" forecasting approaches. We have also examined the approaches for improving the prediction accuracy and optimization effectiveness.

1810.09675 2026-06-04 math.NA cs.LG cs.NA

SwitchNet: a neural network model for forward and inverse scattering problems

SwitchNet: 一种用于正反散射问题的神经网络模型

Yuehaw Khoo, Lexing Ying

发表机构 * Department of Mathematics, Stanford University, Stanford, CA 94305.(斯坦福大学数学系) Department of Mathematics(数学系) ICME, Stanford University, Stanford, CA 94305. Facebook AI Research, Menlo Park, CA 94025.(斯坦福大学计算数学与工程系,Facebook人工智能研究实验室)

AI总结 SwitchNet通过建立散射体与散射场之间的映射,解决基于波方程的反散射问题,利用低秩结构和稀疏连接的切换层减少参数量并提升训练效率。

Comments 19 pages, 7 figures

详情
AI中文摘要

我们提出了一种新颖的神经网络架构SwitchNet,用于通过建立散射体与散射场(反之亦然)之间的映射来解决基于波方程的反散射问题。使用神经网络解决此问题的主要困难在于,散射体对散射波场有全局影响,使得通常具有局部连接的卷积神经网络不适用。虽然可以使用全连接网络来处理此类问题,但参数数量与输入和输出数据的大小呈二次增长。通过利用散射问题固有的低秩结构,并引入一种具有稀疏连接的新型切换层,SwitchNet架构使用了更少的参数,并促进了训练过程。数值实验显示在学习散射体与散射波场之间的正反映射方面具有令人鼓舞的准确性。

英文摘要

We propose a novel neural network architecture, SwitchNet, for solving the wave equation based inverse scattering problems via providing maps between the scatterers and the scattered field (and vice versa). The main difficulty of using a neural network for this problem is that a scatterer has a global impact on the scattered wave field, rendering typical convolutional neural network with local connections inapplicable. While it is possible to deal with such a problem using a fully connected network, the number of parameters grows quadratically with the size of the input and output data. By leveraging the inherent low-rank structure of the scattering problems and introducing a novel switching layer with sparse connections, the SwitchNet architecture uses much fewer parameters and facilitates the training process. Numerical experiments show promising accuracy in learning the forward and inverse maps between the scatterers and the scattered wave field.

1810.10078 2026-06-04 stat.ML cs.LG cs.NA math.NA

Model Selection for Nonnegative Matrix Factorization by Support Union Recovery

通过支持联合恢复进行非负矩阵分解的模型选择

Zhaoqiang Liu

发表机构 * Department of Electrical and Computer Engineering(电子工程系)

AI总结 本文提出了一种通过计算经验二阶矩并恢复与经验二阶矩相关的矩阵中非零行的索引集来自动选择非负矩阵分解的潜在维度的算法,该算法在理论上有保证地检测出真实的潜在维度。

详情
AI中文摘要

非负矩阵分解(NMF)因其非减性和部分基性质而被广泛应用于机器学习和信号处理,因为它增强了可解释性。通常假设潜在维度(或组件数量)是给定的。尽管已经设计了大量NMF算法,但关于具有理论保证的自动NMF模型选择的文献却很少。在本文中,我们提出了一种算法,首先从经验四阶累积张量中计算经验二阶矩,然后通过恢复与经验二阶矩相关的矩阵中非零行的索引集(即非零行的索引集)来估计潜在维度。通过假设数据的生成模型并加入额外的温和条件,我们的算法可以证明性地检测到真实的潜在维度。我们在合成示例上展示了所提出的算法能够找到近似正确的组件数量。

英文摘要

Nonnegative matrix factorization (NMF) has been widely used in machine learning and signal processing because of its non-subtractive, part-based property which enhances interpretability. It is often assumed that the latent dimensionality (or the number of components) is given. Despite the large amount of algorithms designed for NMF, there is little literature about automatic model selection for NMF with theoretical guarantees. In this paper, we propose an algorithm that first calculates an empirical second-order moment from the empirical fourth-order cumulant tensor, and then estimates the latent dimensionality by recovering the support union (the index set of non-zero rows) of a matrix related to the empirical second-order moment. By assuming a generative model of the data with additional mild conditions, our algorithm provably detects the true latent dimensionality. We show on synthetic examples that our proposed algorithm is able to find an approximately correct number of components.

1810.09929 2026-06-04 eess.SP cs.RO cs.SY eess.SY

Teleoperated Robotic Arm Movement Using EMG Signal With Wearable MYO Armband

使用可穿戴MYO臂带的电信号实现远程操控机械臂运动

Hussein F. Hassan, Sadiq J. Abou-Loukh, Ibraheem Kasim Ibraheem

发表机构 * University of Baghdad, College of Engineering, Department of Electrical Engineering(巴格达大学,工程学院,电气工程系)

AI总结 本研究通过分析表面肌电信号,利用可穿戴MYO臂带区分七种手部运动,采用模式识别系统实现机械臂的实时控制,其中SVM分类器在准确率上达到96.57%。

详情
AI中文摘要

本研究的主要目的是基于通过无线Myo手势臂带获取的表面肌电信号(sEMG)实时控制五自由度机械臂,以区分七种手部运动。sEMG信号是生物电信号,用于估计和记录肌肉收缩和放松过程中产生的电信号,代表神经肌肉活动。因此,通过人体手臂肌肉利用sEMG信号来控制机械臂被视为一种重要的方法。无线Myo手势臂带用于从前臂记录sEMG信号。为了分析这些信号,采用了模式识别系统,该系统由三个主要部分组成:分段、特征提取和分类。重叠技术用于分段信号。从每个分段中提取六个时域特征(MAV、WL、RMS、AR、ZC和SSC)。采用分类器(SVM、LDA和KNN)以比较它们,以获得系统的最佳准确率。结果表明,SVM在准确率上达到96.57%,优于LDA的96.01%和KNN的92.67%。

英文摘要

The main purpose of this research is to move the robotic arm (5DoF) in real-time, based on the surface Electromyography (sEMG) signals, as obtained from the wireless Myo gesture armband to distinguish seven hand movements. The sEMG signals are biomedical signals that estimate and record the electrical signals produced in muscles through their contraction and relaxation, representing neuromuscular activities. Therefore, controlling the robotic arm via the muscles of the human arm using sEMG signals is considered to be one of the most significant methods. The wireless Myo gesture armband is used to record sEMG signals from the forearm. In order to analyze these signals, the pattern recognition system is employed, which consists of three main parts: segmentation, feature extraction, and classification. Overlap technique is chosen for segmenting part of the signal. Six time domain features (MAV, WL, RMS, AR, ZC, and SSC) are extracted from each segment. The classifiers (SVM, LDA, and KNN) are employed to enable comparison between them in order to obtain optimum accuracy of the system. The results show that the SVM achieves higher system accuracy at 96.57 %, compared to LDA reaching 96.01 %, and 92.67 % accuracy achieved by KNN.

1807.03475 2026-06-04 math.OC cs.RO cs.SY eess.SY

On Controller Design for Systems on Manifolds in Euclidean Space

在欧几里得空间中系统控制器设计方法

Dong Eui Chang

发表机构 * Electrical Engineering(电气工程) Korea Advanced Institute of Science(韩国科学技术院)

AI总结 本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法,通过将状态空间流形嵌入到欧几里得空间中,并在该空间中扩展系统以增加最终动态中的横截稳定性,从而设计出适用于原始系统的控制器。

Comments International Journal of Robust and Nonlinear Control (Accepted July 2018

详情
Journal ref
International J of Robust and Nonlinear Control, 28(16), 4981--4998, 2018
AI中文摘要

本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法。该方法的思路是将给定控制系统的状态空间流形M嵌入到某个欧几里得空间R^n中,将系统从M扩展到环境空间R^n,并在该空间中修改系统以在R^n中的最终动态中增加M的横截稳定性。在环境空间R^n中为最终系统设计控制器,然后将其限制到M上,从而得到原始系统在M上的控制器。该方法的优点是仅使用一个单一的全局笛卡尔坐标系在环境空间R^n中进行控制器合成,并且任何在R^n中的控制器设计方法,如线性化方法,都可以全局应用于控制器合成。所提出的方法成功应用于以下两个基准系统的跟踪问题:完全驱动的刚体系统和四旋翼无人机系统。

英文摘要

A new method is developed to design controllers in Euclidean space for systems defined on manifolds. The idea is to embed the state-space manifold $M$ of a given control system into some Euclidean space $\mathbb R^n$, extend the system from $M$ to the ambient space $\mathbb R^n$, and modify it outside $M$ to add transversal stability to $M$ in the final dynamics in $\mathbb R^n$. Controllers are designed for the final system in the ambient space $\mathbb R^n$. Then, their restriction to $M$ produces controllers for the original system on $M$. This method has the merit that only one single global Cartesian coordinate system in the ambient space $\mathbb R^n$ is used for controller synthesis, and any controller design method in $\mathbb R^n$, such as the linearization method, can be globally applied for the controller synthesis. The proposed method is successfully applied to the tracking problem for the following two benchmark systems: the fully actuated rigid body system and the quadcopter drone system.

1810.09000 2026-06-04 eess.SY cs.RO cs.SY

Safe Adaptive Cruise Control with Road Grade Preview and V2V Communication

安全自适应巡航控制系统与道路坡度预览及车对车通信

Roya Firoozi, Shima Nazari, Jacopo Guanetti, Ryan O'Gorman, Francesco Borrelli

发表机构 * University of Michigan(密歇根大学)

AI总结 本文提出了一种安全自适应巡航控制系统,利用道路坡度和前车运动预览,通过模型预测控制框架优化舒适性、安全性和能耗,采用新颖的方法计算鲁棒不变终端集以确保车辆间安全距离,仿真结果验证了该控制算法的有效性。

详情
AI中文摘要

我们提出了一种安全的自适应巡航控制系统(ACC),该系统利用道路坡度和前车运动预览。ACC控制器通过模型预测控制(MPC)框架设计,以优化舒适性、安全性和能耗以及速度跟踪精度。安全通过计算鲁棒不变终端集来实现。本文提出了一种新颖的方法来计算此类集合,该方法比现有方法更少保守。所提出的控制器在道路坡度变化和前车运动预测不确定性的情况下,始终确保车辆间安全距离。仿真结果将所提控制器与不考虑先前坡度知识的控制器在车-following和自动驾驶交叉场景中进行比较。结果证明了所提控制算法的有效性。

英文摘要

We present the design of a safe Adaptive Cruise Control (ACC) which uses road grade and lead vehicle motion preview. The ACC controller is designed by using a Model Predictive Control (MPC) framework to optimize comfort, safety, energy-efficiency and speed tracking accuracy. Safety is achieved by computing a robust invariant terminal set. The paper presents a novel approach to compute such set which is less conservative than existing methods. The proposed controller ensures safe inter-vehicle spacing at all times despite changes in the road grade and uncertainty in the predicted motion of the lead vehicle. Simulation results compare the proposed controller with a controller that does not incorporate prior grade knowledge on two scenarios including car-following and autonomous intersection crossing. The results demonstrate the effectiveness of the proposed control algorithm.

1810.08759 2026-06-04 eess.SY cs.AI cs.SY

Design of robust H_inf fuzzy output feedback controller for affine nonlinear systems:Fuzzy Lyapunov function approach

面向仿真非线性系统的鲁棒H_∞模糊输出反馈控制器设计:模糊Lyapunov函数方法

Leila Rajabpour, Mokhtar Shasadeghi, Alireza Barzegar

发表机构 * University of Technology Malaysia(技术大学马来西亚) Shiraz University of Technology(谢尔兹技术大学) Nanyang Technological University(南洋理工大学)

AI总结 本文提出了一种基于非二次Lyapunov函数和引入松弛矩阵技术的新系统方法,用于具有扰动的仿真非线性系统。首先,将仿真非线性系统表示为Takagi-Sugeno(T-S)模糊双线性模型。随后,基于并行分布式补偿(PDC)方案设计鲁棒H_∞控制器。通过利用Lyapunov函数推导出稳定性条件,以线性矩阵不等式(LMIs)形式表达。此外,提出一些松弛矩阵以减少LMIs稳定性条件的保守性。最后,通过详细讨论等温连续搅拌釜反应器(CSTR)用于Van de Vusse反应器的应用,来说明所提方法的优点并验证其有效性。

详情
AI中文摘要

本文提出了一种基于非二次Lyapunov函数和引入松弛矩阵技术的新系统方法,用于具有扰动的仿真非线性系统。为实现目标,首先将仿真非线性系统表示为Takagi-Sugeno(T-S)模糊双线性模型。随后,基于并行分布式补偿(PDC)方案设计鲁棒H_∞控制器。然后,通过利用Lyapunov函数推导出稳定性条件,以线性矩阵不等式(LMIs)形式表达。此外,提出一些松弛矩阵以减少LMIs稳定性条件的保守性。最后,通过详细讨论等温连续搅拌釜反应器(CSTR)用于Van de Vusse反应器的应用,来说明所提方法的优点并验证其有效性。

英文摘要

In this paper, we propose a new systematic approach based on nonquadratic Lyapunov function and technique of introducing slack matrices, for a class of affine nonlinear systems with disturbance. To achieve the goal, first, the affine nonlinear system is represented via Takagi-Sugeno (T-S) fuzzy bilinear model. Subsequently, the robust H_inf controller is designed based on parallel distributed compensation (PDC) scheme. Then, the stability conditions are derived in terms of linear matrix inequalities (LMIs) by utilizing Lyapunov function. Moreover, some slack matrices are proposed to reduce the conservativeness of the LMI stability conditions. Finally, for illustrating the merits and verifying the effectiveness of the proposed approach, the application of an isothermal continuous stirred tank reactor (CSTR) for Van de Vusse reactor is discussed in details.

1709.00483 2026-06-04 math.NA cs.CV cs.NA math.OC stat.ML

Iteratively Linearized Reweighted Alternating Direction Method of Multipliers for a Class of Nonconvex Problems

迭代线性化加权交替方向乘子法用于一类非凸问题

Tao Sun, Hao Jiang, Lizhi Cheng, Wei Zhu

发表机构 * Department of Mathematics, National University of Defense Technology(国防科技大学数学系) College of Computer, National University of Defense Technology(国防科技大学计算机学院) The State Key Laboratory for High Performance Computation, National University of Defense Technology(国防科技大学高性能计算国家重点实验室) Hunan Key Laboratory for Computation and Simulation in Science and Engineering, School of Mathematics and Computational Science, Xiangtan University(湖南计算与模拟科学工程重点实验室,湘潭大学数学与计算科学学院)

AI总结 本文提出了一种迭代线性化加权交替方向乘子法,用于解决信号处理和机器学习中常见的非凸和非光滑问题,该方法通过将子问题转化为凸问题以提高求解效率,并证明了算法的全局收敛性。

详情
AI中文摘要

在本文中,我们考虑解决在信号处理和机器学习研究中频繁出现的一类非凸和非光滑问题。传统的交替方向乘子法在数学和计算上解决非凸和非光滑子问题时遇到了困难。为此,我们提出了一种加权交替方向乘子法。在该算法中,所有子问题都是凸的,易于求解。我们还提供了几种保证以确保收敛性,并利用Kurdyka-Łojasiewicz性质证明了该算法全局收敛到辅助函数的临界点。展示了几个数值结果以证明所提算法的有效性。

英文摘要

In this paper, we consider solving a class of nonconvex and nonsmooth problems frequently appearing in signal processing and machine learning research. The traditional alternating direction method of multipliers encounters troubles in both mathematics and computations in solving the nonconvex and nonsmooth subproblem. In view of this, we propose a reweighted alternating direction method of multipliers. In this algorithm, all subproblems are convex and easy to solve. We also provide several guarantees for the convergence and prove that the algorithm globally converges to a critical point of an auxiliary function with the help of the Kurdyka-Łojasiewicz property. Several numerical results are presented to demonstrate the efficiency of the proposed algorithm.

1810.04859 2026-06-04 cs.IT cs.AI cs.LG cs.SY eess.SY math.IT math.ST stat.TH

Policy Design for Active Sequential Hypothesis Testing using Deep Learning

使用深度学习的主动顺序假设检验政策设计

Dhruva Kartik, Ekraam Sabir, Urbashi Mitra, Prem Natarajan

发表机构 * USC Information Sciences Institute(美国南加州大学信息科学研究所)

AI总结 本文研究了如何利用深度学习设计更有效的主动顺序假设检验策略,通过比较新提出的启发式方法与现有方法,展示了在某些场景下性能的显著提升。

Comments Accepted at 56th Annual Allerton Conference on Communication, Control, and Computing

详情
AI中文摘要

信息论在通信、压缩和假设检验等各类问题中取得了很大的成功,而随机控制理论则通过动态规划对部分可观测马尔可夫决策过程(POMDPs)的最优策略进行表征。然而,一般情况下找到这些问题的最优策略是计算上困难的,因此在实践中通常采用启发式方法。深度学习可以作为一种工具,用于设计更好的启发式方法。本文考虑了主动顺序假设检验问题,目标是通过自适应选择适当的查询来以最少的样本量可靠地推断真实假设。该问题可以建模为POMDP,并且文献中已存在其价值函数的界。然而,最优策略尚未被识别,各种启发式方法被使用。本文提出了两种新的启发式方法:一种基于深度强化学习,另一种基于KL散度零和博弈。这些启发式方法与最先进的解决方案进行了比较,并通过数值实验表明,在某些场景下,所提出的启发式方法能够显著优于现有方法。

英文摘要

Information theory has been very successful in obtaining performance limits for various problems such as communication, compression and hypothesis testing. Likewise, stochastic control theory provides a characterization of optimal policies for Partially Observable Markov Decision Processes (POMDPs) using dynamic programming. However, finding optimal policies for these problems is computationally hard in general and thus, heuristic solutions are employed in practice. Deep learning can be used as a tool for designing better heuristics in such problems. In this paper, the problem of active sequential hypothesis testing is considered. The goal is to design a policy that can reliably infer the true hypothesis using as few samples as possible by adaptively selecting appropriate queries. This problem can be modeled as a POMDP and bounds on its value function exist in literature. However, optimal policies have not been identified and various heuristics are used. In this paper, two new heuristics are proposed: one based on deep reinforcement learning and another based on a KL-divergence zero-sum game. These heuristics are compared with state-of-the-art solutions and it is demonstrated using numerical experiments that the proposed heuristics can achieve significantly better performance than existing methods in some scenarios.

1803.01066 2026-06-04 eess.SY cs.LG cs.SY math.OC

Specialized Interior Point Algorithm for Stable Nonlinear System Identification

用于稳定非线性系统辨识的专用内点算法

Jack Umenberger, Ian R. Manchester

发表机构 * Australian Centre for Field Robotics(澳大利亚机器人场实验室) The University of Sydney(悉尼大学)

AI总结 本文提出了一种专用内点算法,通过利用问题中的特殊结构,将计算复杂度从数据集长度的三次方降低到线性增长,从而提高了非线性系统辨识的效率,并展示了其在新数据上的优越泛化能力。

Comments accepted to IEEE Transactions on Automatic Control

详情
AI中文摘要

从数据估计非线性动态模型面临着许多挑战,包括模型不稳定性和长期仿真保真度的非凸性。最近,拉格朗日松弛法被提出作为近似仿真保真度和保证稳定性的方法,通过半正定规划(SDP),然而由此产生的SDP具有较大的维度,限制了其在实际问题中的应用。在本文中,我们开发了一种路径跟随内点算法,利用问题中的特殊结构,将计算复杂度从数据集长度的三次方降低到线性增长。新的算法使经验比较成为可能,包括非线性ARX方法,并展示了对新数据的优越泛化能力。我们还探讨了稳定性约束的“正则化”效应,作为替代回归子集选择的方法。

英文摘要

Estimation of nonlinear dynamic models from data poses many challenges, including model instability and non-convexity of long-term simulation fidelity. Recently Lagrangian relaxation has been proposed as a method to approximate simulation fidelity and guarantee stability via semidefinite programming (SDP), however the resulting SDPs have large dimension, limiting their utility in practical problems. In this paper we develop a path-following interior point algorithm that takes advantage of special structure in the problem and reduces computational complexity from cubic to linear growth with the length of the data set. The new algorithm enables empirical comparisons to established methods including Nonlinear ARX, and we demonstrate superior generalization to new data. We also explore the "regularizing" effect of stability constraints as an alternative to regressor subset selection.

1801.08383 2026-06-04 eess.SY cs.LG cs.SY stat.ML

Data-Driven Impulse Response Regularization via Deep Learning

基于深度学习的数据驱动脉冲响应正则化

Carl Andersson, Niklas Wahlström, Thomas B. Schön

发表机构 * Department of Information Technology, Uppsala University(信息技术系,乌普萨拉大学)

AI总结 本文提出了一种新的数据驱动模型,用于稳定线性单输入单输出系统的脉冲响应估计,该模型在利用输入输出数据中的隐藏模式方面优于非参数模型。

详情
AI中文摘要

我们考虑了稳定线性单输入单输出系统脉冲响应估计的问题。这是一个已广泛研究的问题,其中灵活的非参数模型最近在性能上超越了传统的有限维模型结构。受这一发展和深度学习的成功启发,我们提出了一种新的灵活的数据驱动模型。我们的实验表明,新模型能够比非参数模型更充分地利用输入输出数据中的隐藏模式。

英文摘要

We consider the problem of impulse response estimation of stable linear single-input single-output systems. It is a well-studied problem where flexible non-parametric models recently offered a leap in performance compared to the classical finite-dimensional model structures. Inspired by this development and the success of deep learning we propose a new flexible data-driven model. Our experiments indicate that the new model is capable of exploiting even more of the hidden patterns that are present in the input-output data as compared to the non-parametric models.

1810.03733 2026-06-04 math.NA cs.LG cs.NA

Find the dimension that counts: Fast dimension estimation and Krylov PCA

找出计数的维度:快速维度估计和Krylov PCA

Shashanka Ubaru, Abd-Krim Seghouane, Yousef Saad

发表机构 * IBM T. J. Watson Research Center(IBM T.J.沃森研究中心) The University of Melbourne(墨尔本大学) University of Minnesota(明尼苏达大学)

AI总结 本文提出了一种新的方法,用于同时估计协方差矩阵主子空间的维度并获得子空间的近似值,该方法结合了Krylov子空间方法,避免了显式计算样本协方差矩阵和完整的特征分解,从而在大规模数据应用中具有成本效益。

详情
AI中文摘要

高维数据和具有许多自由度的系统通常由协方差矩阵来表征。在本文中,我们考虑同时估计这些协方差矩阵主子空间的维度并获得子空间近似值的问题。这个问题出现在流行的主成分分析(PCA)中,并在许多机器学习、数据分析、信号和图像处理等应用中出现。我们首先提出了一种新的方法来估计主子空间的维度。然后展示如何将该方法与Krylov子空间方法结合,以同时估计维度并获得子空间的近似。维度估计无需额外成本。所提出的方法基于模型选择框架,其中新的选择标准是基于随机矩阵扰动理论思想推导的。我们进行了理论分析,(a) 显示所提出的方法在数据点数量 $n ightarrow \infty$ 时具有强一致性(即得出最优解),(b) 分析了有限 $n$ 情况下的精确维度估计条件。利用最近的结果,我们展示了所提算法也能产生近最优的PCA。所提出的方法避免显式形成样本协方差矩阵(与数据相关)并计算完整的特征分解。因此,该方法成本低廉,这在现代数据应用中特别有利,因为协方差矩阵可能非常大。数值实验展示了所提方法在各种应用中的性能。

英文摘要

High dimensional data and systems with many degrees of freedom are often characterized by covariance matrices. In this paper, we consider the problem of simultaneously estimating the dimension of the principal (dominant) subspace of these covariance matrices and obtaining an approximation to the subspace. This problem arises in the popular principal component analysis (PCA), and in many applications of machine learning, data analysis, signal and image processing, and others. We first present a novel method for estimating the dimension of the principal subspace. We then show how this method can be coupled with a Krylov subspace method to simultaneously estimate the dimension and obtain an approximation to the subspace. The dimension estimation is achieved at no additional cost. The proposed method operates on a model selection framework, where the novel selection criterion is derived based on random matrix perturbation theory ideas. We present theoretical analyses which (a) show that the proposed method achieves strong consistency (i.e., yields optimal solution as the number of data-points $n\rightarrow \infty$), and (b) analyze conditions for exact dimension estimation in the finite $n$ case. Using recent results, we show that our algorithm also yields near optimal PCA. The proposed method avoids forming the sample covariance matrix (associated with the data) explicitly and computing the complete eigen-decomposition. Therefore, the method is inexpensive, which is particularly advantageous in modern data applications where the covariance matrices can be very large. Numerical experiments illustrate the performance of the proposed method in various applications.

1810.03275 2026-06-04 math.NA cs.CV cs.NA

TV-regularized CT Reconstruction and Metal Artifact Reduction Using Inequality Constraints with Preconditioning

基于不等式约束与预条件的TV正则化CT重建及金属伪影消除

Clemens Schiffer

发表机构 * Karl-Franzens-Universität Graz(格拉茨卡尔-弗里德里希大学)

AI总结 本文提出了一种结合不等式约束与预条件的TV正则化方法,用于CT重建中减少金属伪影,通过Chambolle-Pock算法和预条件的Douglas-Rachford分裂法及ADMM算法实现快速收敛,验证了模型在真实和合成数据中的有效性。

Comments Master's Thesis, as submitted at the University of Graz

详情
AI中文摘要

总变分(TV)正则化被应用于X射线计算机断层扫描(CT)以减少金属伪影。本新型模型通过在受金属影响的sinogram数据上引入不等式约束,增强了Tikhonov正则化(具有L²数据保真项)和总变分正则化,以建模金属引起的误差。所提出的优化问题通过Chambolle-Pock算法进行离散化和求解。通过预条件的Douglas-Rachford分裂法以及高级方向乘子法(ADMM)实现了更快的收敛。该方法被应用于真实和合成数据,证明了模型在减少金属伪影方面的可行性。CT数据的技术细节及其处理在附录中给出。

英文摘要

Total variation(TV) regularization is applied to X-Ray computed tomography(CT) in an effort to reduce metal artifacts. Tikhonov regularization with $L^2$ data fidelity term and total variation regularization is augmented in this novel model by inequality constraints on sinogram data affected by metal to model errors caused by metal. The formulated problem is discretized and solved using the Chambolle-Pock algorithm. Faster convergence is achieved using preconditioning in a Douglas-Rachford spitting method as well as Advanced Direction Method of Multipliers(ADMM). The methods are applied to real and synthetic data demonstrating feasibility of the model to reduce metal artifacts. Technical details of CT data used and its processing are given in the appendix.

1810.03025 2026-06-04 stat.ML cs.AI cs.LG cs.SY eess.SY

Discretizing Logged Interaction Data Biases Learning for Decision-Making

对记录交互数据进行离散化会偏学习决策制定

Peter Schulam, Suchi Saria

发表机构 * Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文研究了对非等间隔时间序列数据进行离散化对决策制定模型训练的影响,指出离散化引入了偏差,并提出使用连续时间模型来避免这一问题。

Comments This is a standalone short paper describing a new type of bias that can arise when learning from time series data for sequential decision-making problems

详情
AI中文摘要

时间序列数据通常在非等间隔时间点测量,常通过离散化作为预处理步骤。例如,客户到达时间的数据可能通过将每小时内的到达次数相加来简化,从而生成更易建模的离散时间序列。在本文摘要中,我们展示离散化引入了影响决策制定模型训练的偏差。我们称这种现象为离散化偏差,并表明可以通过使用连续时间模型来避免它。

英文摘要

Time series data that are not measured at regular intervals are commonly discretized as a preprocessing step. For example, data about customer arrival times might be simplified by summing the number of arrivals within hourly intervals, which produces a discrete-time time series that is easier to model. In this abstract, we show that discretization introduces a bias that affects models trained for decision-making. We refer to this phenomenon as discretization bias, and show that we can avoid it by using continuous-time models instead.

1810.02866 2026-06-04 eess.SP cs.LG cs.SY eess.SY

Artificial Intelligence Assisted Power Grid Hardening in Response to Extreme Weather Events

人工智能辅助的电网加固以应对极端天气事件

Rozhin Eskandarpour, Amin Khodaei, A. Paaso, N. M. Abdullah

发表机构 * University of Denver(丹佛大学) ComEd(ComEd公司) US National Committee(美国国家委员会)

AI总结 本文提出了一种基于人工智能的电网加固模型,旨在提高电网在极端天气事件中的韧性。首先,提出一个机器学习模型来预测组件状态(运行或停电),然后将这些预测输入到加固模型中,确定分布式发电(DG)单元的放置位置。与现有文献不同,本文通过考虑两个目标的复杂依赖关系,共同优化电网经济性和韧性。在标准IEEE 118节点测试系统上的数值模拟展示了所提加固模型的优势和适用性。结果表明,通过去中心化和分布式本地能源资源,所提加固模型可以产生更稳健的解决方案,显著保护系统免受多个组件因极端事件而停电的影响。

详情
Journal ref
2018 Grid of the Future Symposium
AI中文摘要

本文提出了一种基于人工智能的电网加固模型,旨在提高电网在极端天气事件中的韧性。首先,提出一个机器学习模型来预测组件状态(运行或停电)。然后,将这些预测输入到加固模型中,确定分布式发电(DG)单元的战略放置位置。与现有文献不同,本文通过考虑两个目标的复杂依赖关系,共同优化电网的经济性和韧性。在标准IEEE 118节点测试系统上的数值模拟展示了所提加固模型的优势和适用性。结果表明,通过去中心化和分布式本地能源资源,所提加固模型可以产生更稳健的解决方案,显著保护系统免受多个组件因极端事件而停电的影响。

英文摘要

In this paper, an artificial intelligence based grid hardening model is proposed with the objective of improving power grid resilience in response to extreme weather events. At first, a machine learning model is proposed to predict the component states (either operational or outage) in response to the extreme event. Then, these predictions are fed into a hardening model, which determines strategic locations for placement of distributed generation (DG) units. In contrast to existing literature in hardening and resilience enhancement, this paper co-optimizes grid economic and resilience objectives by considering the intricate dependencies of the two. The numerical simulations on the standard IEEE 118-bus test system illustrate the merits and applicability of the proposed hardening model. The results indicate that the proposed hardening model through decentralized and distributed local energy resources can produce a more robust solution that can protect the system significantly against multiple component outages due to an extreme event.