arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1710.02066 2026-06-04 eess.SY cs.RO cs.SY

Feedback Regularization and Geometric PID Control for Robust Stabilization of a Planar Three-link Hybrid Bipedal Walking Model

反馈校正与几何PID控制用于平面三连杆混合双足步行模型的鲁棒稳定化

W. M. L. T. Weerakoon, T. W. U. Madhushani, D. H. S. Maithripala, J. M. Berg

发表机构 * Department of Mechanical Engineering, University of Peradeniya（珀斯德尼亚大学机械工程系）； Postgraduate and Research Unit, Sri Lanka Technological Campus（斯里兰卡科技校园研究生与研究单位）； Department of Mechanical Engineering, Texas Tech University（德克萨斯技术大学机械工程系）

AI总结本文应用一种 recently 开发的几何PID控制器来稳定一个平面三连杆混合动态步行模型。该模型有三个连杆，代表机器人躯干和两个无膝腿，每个髋关节有独立的控制力矩。几何PID控制器是为完全驱动的机械系统开发的，但在摆动相中，三连杆双足机器人有三个自由度但只有两个控制输入。通过选择两个“虚拟约束”来强制执行，解决欠驱动问题，并验证所得到的二维零动力学的稳定性。所得到的受控动力学不具有机械系统的结构，但通过“反馈校正”恢复了这种结构，随后使用几何PID控制来提供对虚拟约束的鲁棒渐近调节。所提出的方法可以容忍更大的倾斜变化，展示了几何方法的价值和积分作用的益处。

Comments Preprint submitted to 2018 American Control Conference

详情

DOI: 10.23919/ACC.2018.8431829

AI中文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

英文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

URL PDF HTML ☆

赞 0 踩 0

1812.03434 2026-06-04 cs.CG cs.CV cs.NA math.NA physics.med-ph q-bio.QM

Area-preserving mapping of 3D ultrasound carotid artery images using density-equalizing reference map

利用密度相等参考图进行3D超声颈动脉图像的面积保持映射

Gary P. T. Choi, Bernard Chiu, Chris H. Rycroft

发表机构 * Mathematics Group, Lawrence Berkeley National Laboratory（伯克利国家实验室数学组）

AI总结本文提出了一种新的密度相等参考图（DERM）方法，用于将3D颈动脉表面映射到标准化的2D颈动脉模板，重点是通过最小化局部面积变形来保持局部几何结构，从而提高对血管壁加斑块厚度（VWT）的定量监测和比较能力。

详情

DOI: 10.1109/TBME.2019.2963783
Journal ref: IEEE Transactions on Biomedical Engineering, 67(9), 1507-1517 (2020)

AI中文摘要

颈动脉动脉粥样斑块是一种发生在颈动脉分叉处的局部疾病。为了定量监测血管壁加斑块厚度（VWT）的局部变化，并比较不同患者或同一患者在不同超声扫描会话中的VWT分布，需要一种映射技术来调整不同颈动脉模型的几何变化。在本工作中，我们提出了一种新的方法，称为密度相等参考图（DERM），用于将3D颈动脉表面映射到标准化的2D颈动脉模板，重点是通过最小化局部面积变形来保持颈动脉表面的局部几何结构。初始映射是由之前描述的弧长缩放（ALS）映射方法生成的，该方法将3D颈动脉表面投影到2D非凸L形域。随后通过变形ALS映射，利用所提出的结合密度相等映射和参考映射技术的算法，构建出平滑且面积保持的扁平化映射。这种结合使首次实现了将3D表面映射到标准化的非凸平面域的1:1映射，且以面积保持的方式。使用20个颈动脉表面模型的评估显示，与ALS映射方法相比，所提出的方法将扁平化映射的面积变形减少了超过80%。

英文摘要

Carotid atherosclerosis is a focal disease at the bifurcations of the carotid artery. To quantitatively monitor the local changes in the vessel-wall-plus-plaque thickness (VWT) and compare the VWT distributions for different patients or for the same patients at different ultrasound scanning sessions, a mapping technique is required to adjust for the geometric variability of different carotid artery models. In this work, we propose a novel method called density-equalizing reference map (DERM) for mapping 3D carotid surfaces to a standardized 2D carotid template, with an emphasis on preserving the local geometry of the carotid surface by minimizing the local area distortion. The initial map was generated by a previously described arc-length scaling (ALS) mapping method, which projects a 3D carotid surface onto a 2D non-convex L-shaped domain. A smooth and area-preserving flattened map was subsequently constructed by deforming the ALS map using the proposed algorithm that combines the density-equalizing map and the reference map techniques. This combination allows, for the first time, one-to-one mapping from a 3D surface to a standardized non-convex planar domain in an area-preserving manner. Evaluations using 20 carotid surface models show that the proposed method reduced the area distortion of the flattening maps by over 80% as compared to the ALS mapping method.

URL PDF HTML ☆

赞 0 踩 0

1711.03127 2026-06-04 eess.SY cs.LG cs.SY math.OC

Energy Storage Arbitrage in Real-Time Markets via Reinforcement Learning

通过强化学习实现实时市场的能源存储套利

Hao Wang, Baosen Zhang

发表机构 * Department of Electrical Engineering, University of Washington（华盛顿大学电气工程系）

AI总结本文通过强化学习设计了一个时间套利策略，用于能源存储，解决了实时价格套利中价格高度不确定带来的策略设计难题，通过设计奖励函数提升性能。

详情

DOI: 10.1109/PESGM.2018.8586321
Journal ref: 2018 IEEE Power & Energy Society General Meeting (PESGM)

AI中文摘要

在本文中，我们通过强化学习推导出一个时间套利策略用于存储。实时价格套利是存储单元的重要收入来源，但设计良好的策略 proved 难以实现，因为价格的高不确定性。我们采用强化学习来设计一个最优的套利策略。该策略通过存储单元重复的充放电操作，通过更新价值矩阵来学习。我们设计了一个奖励函数，不仅反映了充放电决策的即时利润，还结合了历史信息。仿真结果表明，与现有算法相比，我们设计的奖励函数导致了显著的性能提升。

英文摘要

In this paper, we derive a temporal arbitrage policy for storage via reinforcement learning. Real-time price arbitrage is an important source of revenue for storage units, but designing good strategies have proven to be difficult because of the highly uncertain nature of the prices. Instead of current model predictive or dynamic programming approaches, we use reinforcement learning to design an optimal arbitrage policy. This policy is learned through repeated charge and discharge actions performed by the storage unit through updating a value matrix. We design a reward function that does not only reflect the instant profit of charge/discharge decisions but also incorporate the history information. Simulation results demonstrate that our designed reward function leads to significant performance improvement compared with existing algorithms.

URL PDF HTML ☆

赞 0 踩 0

1710.09691 2026-06-04 eess.SY cs.LG cs.RO cs.SY

Iterative Machine Learning for Precision Trajectory Tracking with Series Elastic Actuators

迭代机器学习用于系列弹性执行器的高精度轨迹跟踪

Nathan Banka, W. Tony Piaskowy, Joseph Garbini, Santosh Devasia

发表机构 * Ultra-Precision Controls Lab（超精密控制实验室）； University of Washington（华盛顿大学）

AI总结本文研究了在系列弹性执行器中使用迭代学习方法提高位置跟踪精度的问题，通过迭代学习生成前馈命令，利用复值高斯过程回归技术估计局部系统模型，从而减少跟踪误差。

Comments 9 pages, 16 figure. Submitted to AMC Workshop

详情

DOI: 10.1109/AMC.2019.8371094
Journal ref: 2018 IEEE 15th International Workshop on Advanced Motion Control (AMC), Tokyo, 2018, pp. 234-239

AI中文摘要

当机器人在未知环境中操作时，位置的小误差可能导致接触力的大幅变化，尤其是对于典型的高阻抗设计。这可能会损坏周围环境或机器人本身。系列弹性执行器（SEAs）是一种减少机器人手臂输出阻抗以提高对环境施加力的控制能力的流行方法。然而，这种增加的力控制能力伴随着较低的位置精度和带宽。本文探讨了使用迭代学习的前馈命令来改进使用SEAs时的位置跟踪。在每次迭代中，系统对量化输入的输出响应被用来估计线性化的局部系统模型。这些估计的模型是通过复值高斯过程回归（cGPR）技术获得的，然后用于基于前一次迭代的误差生成新的前馈输入命令。本文展示了该迭代机器学习（IML）技术在双自由度（2-DOF）机器人手臂上的应用，并证明了IML方法能够成功收敛以减少跟踪误差。

英文摘要

When robots operate in unknown environments small errors in postions can lead to large variations in the contact forces, especially with typical high-impedance designs. This can potentially damage the surroundings and/or the robot. Series elastic actuators (SEAs) are a popular way to reduce the output impedance of a robotic arm to improve control authority over the force exerted on the environment. However this increased control over forces with lower impedance comes at the cost of lower positioning precision and bandwidth. This article examines the use of an iteratively-learned feedforward command to improve position tracking when using SEAs. Over each iteration, the output responses of the system to the quantized inputs are used to estimate a linearized local system models. These estimated models are obtained using a complex-valued Gaussian Process Regression (cGPR) technique and then, used to generate a new feedforward input command based on the previous iteration's error. This article illustrates this iterative machine learning (IML) technique for a two degree of freedom (2-DOF) robotic arm, and demonstrates successful convergence of the IML approach to reduce the tracking error.

URL PDF HTML ☆

赞 0 踩 0

1812.07725 2026-06-04 math.OC cs.LG cs.NA math.NA math.PR stat.ML

Breaking Reversibility Accelerates Langevin Dynamics for Global Non-Convex Optimization

打破可逆性加速Langevin动力学用于全局非凸优化

Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

发表机构 * Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T. Hong Kong（系统工程与工程管理系，香港中文大学（深圳））； Department of Management Science and Information Systems and the DIMACS Institute, Rutgers University, Piscataway, NJ-08854, United States of America（管理科学与信息系统系及DIMACS研究所，罗杰斯大学）； Department of Mathematics, Florida State University, 1017 Academic Way, Tallahassee, FL-32306, United States of America（数学系，佛罗里达州立大学）

AI总结本文研究了非可逆Langevin动力学在全局非凸优化中的应用，通过分析非可逆动力学算法的收敛性和混合速率，证明了非可逆算法在寻找局部极小值和探索状态空间方面的效率提升。

详情

AI中文摘要

Langevin动力学（LD）已被证明是一种强大的技术，用于优化非凸目标，作为一种高效的算法来寻找局部极小值，而最终在更长的时间尺度上访问全局极小值。LD基于一阶Langevin扩散，其时间是可逆的。我们研究了两种基于非可逆Langevin扩散的变种：欠阻尼Langevin动力学（ULD）和具有非对称漂移的Langevin动力学（NLD）。采用Tzen、Liang和Raginsky（2018）为LD到非可逆扩散的技术，我们证明了对于给定的局部极小值，其在初始化点任意距离内，以高概率，ULD轨迹会在依赖于局部极小值Hessian最小特征值的复发时间内结束于该局部极小值的小邻域之外，或者在复发时间内进入该邻域并停留可能极长的逃逸时间。ULD算法在Hessian最小特征值的依赖性方面优于Tzen、Liang和Raginsky（2018）中LD的复发时间。对于NLD算法也获得了相似的结果和改进。我们还展示了非可逆变种在离散时间中能够更快地退出局部极小值的吸引盆地，当目标函数有两个局部极小值被鞍点分隔时，并量化了改进的幅度。我们的分析表明，非可逆Langevin算法在寻找局部极小值和探索状态空间方面更有效。我们的分析基于在局部极小值周围对目标函数的二次近似。作为我们分析的副产品，我们获得了两个非可逆Langevin算法在2-Wasserstein距离下的最优混合速率。

英文摘要

Langevin dynamics (LD) has been proven to be a powerful technique for optimizing a non-convex objective as an efficient algorithm to find local minima while eventually visiting a global minimum on longer time-scales. LD is based on the first-order Langevin diffusion which is reversible in time. We study two variants that are based on non-reversible Langevin diffusions: the underdamped Langevin dynamics (ULD) and the Langevin dynamics with a non-symmetric drift (NLD). Adopting the techniques of Tzen, Liang and Raginsky (2018) for LD to non-reversible diffusions, we show that for a given local minimum that is within an arbitrary distance from the initialization, with high probability, either the ULD trajectory ends up somewhere outside a small neighborhood of this local minimum within a recurrence time which depends on the smallest eigenvalue of the Hessian at the local minimum or they enter this neighborhood by the recurrence time and stay there for a potentially exponentially long escape time. The ULD algorithms improve upon the recurrence time obtained for LD in Tzen, Liang and Raginsky (2018) with respect to the dependency on the smallest eigenvalue of the Hessian at the local minimum. Similar result and improvement are obtained for the NLD algorithm. We also show that non-reversible variants can exit the basin of attraction of a local minimum faster in discrete time when the objective has two local minima separated by a saddle point and quantify the amount of improvement. Our analysis suggests that non-reversible Langevin algorithms are more efficient to locate a local minimum as well as exploring the state space. Our analysis is based on the quadratic approximation of the objective around a local minimum. As a by-product of our analysis, we obtain optimal mixing rates for quadratic objectives in the 2-Wasserstein distance for two non-reversible Langevin algorithms we consider.

URL PDF HTML ☆

赞 0 踩 0

1905.03419 2026-06-04 eess.SY cs.RO cs.SY

Testing Scenario Library Generation for Connected and Automated Vehicles, Part I: Methodology

连接和自动化车辆测试场景库生成方法研究，第一部分：方法论

Shuo Feng, Yiheng Feng, Chunhui Yu, Yi Zhang, Henry X. Liu

发表机构 * University of Michigan（密歇根大学）

AI总结本文提出了一种系统框架，用于生成连接和自动化车辆（CAVs）的测试场景库（TSLG），考虑不同的操作设计域（ODDs）、CAV模型和性能指标，通过引入新的场景关键性度量标准和多起始优化方法，提高测试效率。

Comments 11 pages,3 figures

详情

DOI: 10.1109/TITS.2020.2972211
Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2020

AI中文摘要

在连接和自动化车辆（CAVs）的发展和部署过程中，测试和评估是一个关键步骤，但目前尚无系统框架用于生成测试场景库。本研究旨在提供一种通用框架，用于解决测试场景库生成（TSLG）问题，考虑不同的操作设计域（ODDs）、CAV模型和性能指标。给定一个ODD，测试场景库被定义为一组关键场景，可用于CAV测试。每个测试场景通过新提出的度量标准——场景关键性进行评估，该度量标准可以计算为机动挑战和暴露频率的组合。为了寻找关键场景，设计了辅助目标函数，并应用了多起始优化方法和种子填充技术。所提出的框架在理论上被证明可以以远少的测试次数获得准确的评估结果，与道路测试方法相比。在本研究的第二部分中，通过三个案例研究来演示所提出的方法。基于强化学习的技术被应用于在高维场景下增强搜索方法。

英文摘要

Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs), and yet there is no systematic framework to generate testing scenario library. This study aims to provide a general framework for the testing scenario library generation (TSLG) problem with different operational design domains (ODDs), CAV models, and performance metrics. Given an ODD, the testing scenario library is defined as a critical set of scenarios that can be used for CAV test. Each testing scenario is evaluated by a newly proposed measure, scenario criticality, which can be computed as a combination of maneuver challenge and exposure frequency. To search for critical scenarios, an auxiliary objective function is designed, and a multi-start optimization method along with seed-filling is applied. The proposed framework is theoretically proved to obtain accurate evaluation results with much fewer number of tests, if compared with the on-road test method. In part II of the study, three case studies are investigated to demonstrate the proposed methodologies. Reinforcement learning based technique is applied to enhance the searching method under high-dimensional scenarios.

URL PDF HTML ☆

赞 0 踩 0

1810.00182 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC

Collaborative target-tracking control using multiple autonomous fixed-wing UAVs with constant speeds

使用多架自主固定翼无人机进行协同目标跟踪控制

Zhiyong Sun, Hector Garcia de Marina, Brian D. O. Anderson, Changbin Yu

发表机构 * Department of Electrical Engineering, Eindhoven University of Technology（埃因霍温理工大学电子工程系）； Universidad Complutense de Madrid（马德里complutense大学）； Research School of Electrical, Energy and Material Engineering, Australian National University（澳大利亚国立大学电气、能源和材料工程研究学校）； Optus-Curtin Centre of Excellence in Artificial Intelligence, Curtin University（Curtin大学人工智能卓越中心）

AI总结本文研究了使用多架固定翼无人机进行协同跟踪控制的问题，通过设计控制器使无人机群体能够协同跟踪目标的位置和速度，提出了相对速度条件和参考速度构造方法，并讨论了常速约束下的控制器设计和性能限制。

Comments 33 pages (single column). To be published in the AIAA Journal of Guidance, Dynamics, and Control

详情

AI中文摘要

本文考虑了使用一组具有恒定且非相同速度的固定翼无人飞行器（UAV）进行协同跟踪控制的问题。固定翼UAV的动力学通过具有非完整约束的单轮型方程建模，假设UAV在名义操作模式下以恒定高度飞行。控制器设计使得所有固定翼UAV作为群体能够协同跟踪目标的位置和速度。我们首先提出了跟踪UAV与目标之间相对速度的条件，以确保在UAV受恒定速度约束时跟踪目标的可行性。我们构造了一个包含目标速度和位置作为反馈的参考速度，该参考速度由群体质心跟踪。通过这种方式，所有车辆的航向被控制，使群体质心沿着成功跟踪目标轨迹的参考轨迹移动。进一步设计了一个间距控制器，以确保所有车辆靠近群体质心轨迹。还详细讨论了控制器设计中的权衡以及由于恒速约束导致的目标跟踪控制性能限制。提供了三架固定翼UAV跟踪目标旋翼机的实验结果。

英文摘要

This paper considers a collaborative tracking control problem using a group of fixed-wing unmanned aerial vehicles (UAVs) with constant and non-identical speeds. The dynamics of fixed-wing UAVs are modelled by unicycle-type equations with nonholonomic constraints, assuming that UAVs fly at constant altitudes in the nominal operation mode. The controller is designed such that all fixed-wing UAVs as a group can collaboratively track a desired target's position and velocity. We first present conditions on the relative speeds of tracking UAVs and the target to ensure that the tracking objective can be achieved when UAVs are subject to constant speed constraints. We construct a reference velocity that includes both the target's velocity and position as feedback, which is to be tracked by the group centroid. In this way, all vehicles' headings are controlled such that the group centroid follows a reference trajectory that successfully tracks the target's trajectory. A spacing controller is further devised to ensure that all vehicles stay close to the group centroid trajectory. Trade-offs in the controller design and performance limitations of the target tracking control due to the constant-speed constraint are also discussed in detail. Experimental results with three fixed-wing UAVs tracking a target rotorcraft are provided.

URL PDF HTML ☆

赞 0 踩 0

1802.09099 2026-06-04 math.OC cs.RO cs.SY eess.SY

Pareto optimal multi-robot motion planning

Pareto最优多机器人运动规划

Guoxiang Zhao, Minghui Zhu

发表机构 * School of Electrical Engineering and Computer Science, Pennsylvania State University（宾夕法尼亚州立大学电气工程与计算机科学学院）

AI总结本文研究了一类多机器人协调问题，目标是使机器人以最短时间到达目标区域并避免碰撞。提出了一种新的数值算法来识别Pareto最优解，确保没有机器人可以单方面减少旅行时间而不延长其他机器人的。通过室内多机器人平台和计算机模拟实验，展示了该算法的 anytime 特性。

1709.01610 2026-06-04 math.OC cs.AI cs.SY eess.SY nlin.AO

A second order primal-dual method for nonsmooth convex composite optimization

一种用于非光滑凸复合优化的二阶对偶方法

Neil K. Dhingra, Sei Zhen Khong, Mihailo R. Jovanović

发表机构 * Numerica Corporation（Numerica公司）； University of Southern California（南加州大学）

AI总结本文提出了一种二阶对偶方法，用于求解目标函数为强凸二次可微项与可能非可微凸正则化项之和的优化问题。通过引入辅助变量，利用非光滑正则化项的近似算子将增强拉格朗日函数转换为一次但非二次连续可微的函数，其鞍点对应于原始优化问题的解。进一步开发了全局收敛的定制算法，利用对偶增强拉格朗日函数作为 merit 函数，并证明了搜索方向可高效计算且具有二次/超线性渐近收敛性。

Comments 32 pages, 8 figures

详情

AI中文摘要

我们开发了一种二阶对偶方法，用于求解目标函数由强凸二次可微项和可能非可微凸正则化项之和构成的优化问题。在引入辅助变量后，我们利用非光滑正则化项的近似算子，将相应的增强拉格朗日函数转换为一次但非二次连续可微的函数。该函数的鞍点对应于原始优化问题的解。我们通过一般化的Hessian来定义该函数上的二阶更新，并证明了相应微分包含的全局指数稳定性。此外，我们开发了一种全局收敛的定制算法，利用对偶增强拉格朗日函数作为 merit 函数。我们证明了搜索方向可以高效计算，并证明了二次/超线性渐近收敛性。我们使用 $\ell_1$-正则化的模型预测控制问题和设计空间不变系统分布式控制器的问题来展示本方法的优越性和有效性。

英文摘要

We develop a second order primal-dual method for optimization problems in which the objective function is given by the sum of a strongly convex twice differentiable term and a possibly nondifferentiable convex regularizer. After introducing an auxiliary variable, we utilize the proximal operator of the nonsmooth regularizer to transform the associated augmented Lagrangian into a function that is once, but not twice, continuously differentiable. The saddle point of this function corresponds to the solution of the original optimization problem. We employ a generalization of the Hessian to define second order updates on this function and prove global exponential stability of the corresponding differential inclusion. Furthermore, we develop a globally convergent customized algorithm that utilizes the primal-dual augmented Lagrangian as a merit function. We show that the search direction can be computed efficiently and prove quadratic/superlinear asymptotic convergence. We use the $\ell_1$-regularized model predictive control problem and the problem of designing a distributed controller for a spatially-invariant system to demonstrate the merits and the effectiveness of our method.

URL PDF HTML ☆

赞 0 踩 0

1905.00820 2026-06-04 eess.SY cs.LG cs.SY math.OC

On the smoothness of nonlinear system identification

关于非线性系统辨识的光滑性

Antônio H. Ribeiro, Koen Tiels, Jack Umenberger, Thomas B. Schön, Luis A. Aguirre

发表机构 * Dept. of Information Technology, Uppsala University, Sweden（信息科技系，乌普萨拉大学，瑞典）； Dept. of Mechanical Engineering, Eindhoven University of Technology, The Netherlands（机械工程系，埃因霍温理工大学，荷兰）

AI总结本文研究了预测误差参数估计中线性和非线性系统优化问题的光滑性，提出通过多阶段法解决参数空间中模型非合同区域导致的Lipschitz常数和β-光滑性指数指数级增长的问题。

详情

DOI: 10.1016/j.automatica.2020.109158
Journal ref: Automatica, vol. 121, 109158, Nov. 2020

AI中文摘要

我们从新的角度探讨了在预测误差参数估计中线性和非线性系统出现的优化问题的光滑性。我们证明，在参数空间中模型非合同的区域，目标函数的Lipschitz常数和β-光滑性可能会随着仿真长度指数级增长，使得在这些区域内难以数值地找到极小值，甚至难以逃离这些区域。除了对这一问题提供理论理解外，本文还提出了多阶段法作为可行的解决方案。所提出的方法最小化预测模型与观测值之间的误差。与其在整个数据集上运行预测模型不同，多阶段法将数据分成更小的子集，并在每个子集上运行预测模型，使仿真长度成为设计参数，并使使用标准方法不可行的问题变得可行。通过在优化中包含约束条件，获得了与原问题的等价性。新方法通过估计具有混沌或不稳定行为的非线性系统的参数以及神经网络的参数进行了说明。我们还比较了所提出的方法与多步预测误差最小化方法的性能。

英文摘要

We shed new light on the \textit{smoothness} of optimization problems arising in prediction error parameter estimation of linear and nonlinear systems. We show that for regions of the parameter space where the model is not contractive, the Lipschitz constant and $β$-smoothness of the objective function might blow up exponentially with the simulation length, making it hard to numerically find minima within those regions or, even, to escape from them. In addition to providing theoretical understanding of this problem, this paper also proposes the use of multiple shooting as a viable solution. The proposed method minimizes the error between a prediction model and the observed values. Rather than running the prediction model over the entire dataset, multiple shooting splits the data into smaller subsets and runs the prediction model over each subset, making the simulation length a design parameter and making it possible to solve problems that would be infeasible using a standard approach. The equivalence to the original problem is obtained by including constraints in the optimization. The new method is illustrated by estimating the parameters of nonlinear systems with chaotic or unstable behavior, as well as neural networks. We also present a comparative analysis of the proposed method with multi-step-ahead prediction error minimization.

URL PDF HTML ☆

赞 0 踩 0

1501.07242 2026-06-04 math.NA cs.LG cs.NA math.OC

Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions

通过模拟退火逃离局部极小值：近似凸函数的优化

Alexandre Belloni, Tengyuan Liang, Hariharan Narayanan, Alexander Rakhlin

发表机构 * The Fuqua School of Business, Duke University（德克萨斯大学福克商学院）； Department of Statistics, The Wharton School, University of Pennsylvania（宾夕法尼亚大学沃顿商学院统计系）； Department of Statistics and Department of Mathematics, University of Washington（华盛顿大学统计系和数学系）

AI总结本文研究了如何通过模拟退化方法优化近似凸函数，提出了一种基于Hit-and-Run方法的采样算法，能够有效避免局部极小值问题，并在零阶随机凸优化中实现了高效的ε-极小值求解。

Comments 27 pages

详情

Journal ref: Proceedings of the 28th Conference on Learning Theory 40 (2015) 240-265

AI中文摘要

我们考虑在$\mathbb{R}^n$中有限凸集上仅使用函数评估来优化近似凸函数的问题。该问题被转化为使用Hit-and-Run方法从近似对数凹分布中采样，证明其具有与对数凹分布采样相同的$\mathcal{O}^*$复杂度。除了将对数凹分布的分析扩展到近似对数凹分布外，Hit-and-Run漫步的一维采样器的实现需要新的方法和分析。该算法基于模拟退火，不依赖一阶条件，从而本质上免疫于局部极小值。然后，我们将该方法应用于不同的激励问题。在零阶随机凸优化的背景下，所提出的方法在诱导一个$\mathcal{O}(ε/n)$-近似对数凹分布后，通过$\mathcal{O}^*(n^{7.5}ε^{-2})$的噪声函数评估产生一个$ε$-极小值。我们还详细考虑了当“非凸性程度”向函数最优解衰减时的情况。本文讨论的方法还包括隐私计算经验风险最小化、两阶段随机规划以及在线学习中的近似动态规划应用。

英文摘要

We consider the problem of optimizing an approximately convex function over a bounded convex set in $\mathbb{R}^n$ using only function evaluations. The problem is reduced to sampling from an \emph{approximately} log-concave distribution using the Hit-and-Run method, which is shown to have the same $\mathcal{O}^*$ complexity as sampling from log-concave distributions. In addition to extend the analysis for log-concave distributions to approximate log-concave distributions, the implementation of the 1-dimensional sampler of the Hit-and-Run walk requires new methods and analysis. The algorithm then is based on simulated annealing which does not relies on first order conditions which makes it essentially immune to local minima. We then apply the method to different motivating problems. In the context of zeroth order stochastic convex optimization, the proposed method produces an $ε$-minimizer after $\mathcal{O}^*(n^{7.5}ε^{-2})$ noisy function evaluations by inducing a $\mathcal{O}(ε/n)$-approximately log concave distribution. We also consider in detail the case when the "amount of non-convexity" decays towards the optimum of the function. Other applications of the method discussed in this work include private computation of empirical risk minimizers, two-stage stochastic programming, and approximate dynamic programming for online learning.

URL PDF HTML ☆

赞 0 踩 0

1707.02568 2026-06-04 math.NA cs.LG cs.NA math.OC math.PR

Solving high-dimensional partial differential equations using deep learning

利用深度学习解决高维偏微分方程

Jiequn Han, Arnulf Jentzen, Weinan E

发表机构 * Program in Applied and Computational Mathematics, Princeton University（普林斯顿大学应用与计算数学项目）； Department of Mathematics, Princeton University（普林斯顿大学数学系）； Beijing Institute of Big Data Research, Beijing（北京大数据研究院）

AI总结本文提出了一种基于深度学习的方法，用于解决高维抛物型偏微分方程，通过将偏微分方程转化为反向随机微分方程，并利用神经网络近似未知解的梯度，有效提高了高维问题的准确性和效率。

Comments 13 pages, 6 figures

详情

DOI: 10.1073/pnas.1718942115
Journal ref: Proceedings of the National Academy of Sciences, 115(34), 8505-8510 (2018)

AI中文摘要

开发用于求解高维偏微分方程（PDEs）的算法长期以来一直是一个极具挑战性的问题，由于著名的“维度灾难”问题。本文介绍了一种基于深度学习的方法，能够处理一般的高维抛物型PDEs。为此，PDEs被重新表述为反向随机微分方程，并且未知解的梯度通过神经网络近似，这在很大程度上类似于深度强化学习，其中梯度作为策略函数。在非线性Black-Scholes方程、Hamilton-Jacobi-Bellman方程和Allen-Cahn方程等示例上的数值结果表明，所提出的算法在高维情况下在准确性和成本方面都非常有效。这为经济学、金融学、运筹学和物理学开辟了新的可能性，通过同时考虑所有参与的代理、资产、资源或粒子，而不是对它们之间的相互关系做出任意假设。

英文摘要

Developing algorithms for solving high-dimensional partial differential equations (PDEs) has been an exceedingly difficult task for a long time, due to the notoriously difficult problem known as the "curse of dimensionality". This paper introduces a deep learning-based approach that can handle general high-dimensional parabolic PDEs. To this end, the PDEs are reformulated using backward stochastic differential equations and the gradient of the unknown solution is approximated by neural networks, very much in the spirit of deep reinforcement learning with the gradient acting as the policy function. Numerical results on examples including the nonlinear Black-Scholes equation, the Hamilton-Jacobi-Bellman equation, and the Allen-Cahn equation suggest that the proposed algorithm is quite effective in high dimensions, in terms of both accuracy and cost. This opens up new possibilities in economics, finance, operational research, and physics, by considering all participating agents, assets, resources, or particles together at the same time, instead of making ad hoc assumptions on their inter-relationships.

URL PDF HTML ☆

赞 0 踩 0

1709.05963 2026-06-04 math.NA cs.LG cs.NA cs.NE math.PR stat.ML

Machine learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations

基于机器学习的近似算法用于高维非线性偏微分方程和二阶反向随机微分方程

Christian Beck, Weinan E, Arnulf Jentzen

发表机构 * ETH Zurich（苏黎世联邦理工学院）； Beijing Institute of Big Data Research（北京大数据研究院）； Princeton University（普林斯顿大学）； Peking University（北京大学）

AI总结本文提出了一种基于机器学习的高维非线性二阶偏微分方程的求解方法，通过将非线性偏微分方程与二阶反向随机微分方程联系起来，并利用深度神经网络进行空间近似和随机梯度下降优化，展示了该方法在高维Black-Scholes-Barenblatt方程、Hamilton-Jacobi-Bellman方程和非线性期望问题中的高效性和准确性。

Comments 56 pages, 12 figures

详情

DOI: 10.1007/s00332-018-9525-3
Journal ref: J. Nonlinear Sci. 29, 1563-1619 (2019)

AI中文摘要

高维偏微分方程（PDE）出现在金融行业的多个模型中，例如衍生品定价模型、信用估值调整（CVA）模型或投资组合优化模型。这些应用中的PDE通常是高维的，因为维度对应于投资组合中的金融资产数量。此外，由于需要在模型中纳入某些非线性现象，如违约风险、交易成本、波动率不确定性（Knightian不确定性）或交易限制，这些PDE往往是完全非线性的。此类高维完全非线性PDE的求解极具挑战性，因为标准近似方法的计算努力随着维度呈指数增长。在本工作中，我们提出了一种新的方法来求解高维完全非线性二阶PDE。该方法可以特别用于采样高维非线性期望。该方法基于（i）完全非线性二阶PDE与二阶反向随机微分方程（2BSDE）之间的联系，（ii）PDE和2BSDE问题的合并公式，（iii）2BSDE的时间前向离散化和通过深度神经网络的空间近似，以及（iv）随机梯度下降型优化过程。使用Python中的TENSORFLOW获得的数值结果展示了该方法在100维Black-Scholes-Barenblatt方程、100维Hamilton-Jacobi-Bellman方程和100维G-布朗运动的非线性期望问题中的效率和准确性。

英文摘要

High-dimensional partial differential equations (PDE) appear in a number of models from the financial industry, such as in derivative pricing models, credit valuation adjustment (CVA) models, or portfolio optimization models. The PDEs in such applications are high-dimensional as the dimension corresponds to the number of financial assets in a portfolio. Moreover, such PDEs are often fully nonlinear due to the need to incorporate certain nonlinear phenomena in the model such as default risks, transaction costs, volatility uncertainty (Knightian uncertainty), or trading constraints in the model. Such high-dimensional fully nonlinear PDEs are exceedingly difficult to solve as the computational effort for standard approximation methods grows exponentially with the dimension. In this work we propose a new method for solving high-dimensional fully nonlinear second-order PDEs. Our method can in particular be used to sample from high-dimensional nonlinear expectations. The method is based on (i) a connection between fully nonlinear second-order PDEs and second-order backward stochastic differential equations (2BSDEs), (ii) a merged formulation of the PDE and the 2BSDE problem, (iii) a temporal forward discretization of the 2BSDE and a spatial approximation via deep neural nets, and (iv) a stochastic gradient descent-type optimization procedure. Numerical results obtained using ${\rm T{\small ENSOR}F{\small LOW}}$ in ${\rm P{\small YTHON}}$ illustrate the efficiency and the accuracy of the method in the cases of a $100$-dimensional Black-Scholes-Barenblatt equation, a $100$-dimensional Hamilton-Jacobi-Bellman equation, and a nonlinear expectation of a $ 100 $-dimensional $ G $-Brownian motion.

URL PDF HTML ☆

赞 0 踩 0

1706.04702 2026-06-04 math.NA cs.LG cs.NA cs.NE math.PR stat.ML

Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations

基于深度学习的高维抛物型偏微分方程和反向随机微分方程的数值方法

Weinan E, Jiequn Han, Arnulf Jentzen

发表机构 * Beijing Institute of Big Data Research (China)（北京大数据研究院（中国））； Princeton University (USA)（普林斯顿大学（美国））； Peking University (China)（北京大学（中国））； ETH Zurich (Switzerland)（苏黎世联邦理工学院（瑞士））

AI总结本文提出了一种基于深度学习的算法，通过将反向随机微分方程与强化学习类比，利用解的梯度作为策略函数，采用神经网络近似策略函数，有效解决了高维非线性偏微分方程和反向随机微分方程的问题。

Comments 39 pages, 15 figures

详情

DOI: 10.1007/s40304-017-0117-6
Journal ref: Commun. Math. Stat. 5, 349-380 (2017)

AI中文摘要

我们提出了一种新的算法，用于求解高维抛物型偏微分方程（PDEs）和反向随机微分方程（BSDEs），通过将BSDE与强化学习进行类比，将解的梯度作为策略函数，损失函数由给定的终端条件与BSDE解之间的误差构成。策略函数随后通过神经网络进行近似，如深度强化学习中所做的那样。使用TensorFlow进行的数值结果展示了所提出算法在解决物理和金融领域中多个100维非线性PDEs方面的效率和准确性，例如Allen-Cahn方程、Hamilton-Jacobi-Bellman方程以及金融衍生品的非线性定价模型。

英文摘要

We propose a new algorithm for solving parabolic partial differential equations (PDEs) and backward stochastic differential equations (BSDEs) in high dimension, by making an analogy between the BSDE and reinforcement learning with the gradient of the solution playing the role of the policy function, and the loss function given by the error between the prescribed terminal condition and the solution of the BSDE. The policy function is then approximated by a neural network, as is done in deep reinforcement learning. Numerical results using TensorFlow illustrate the efficiency and accuracy of the proposed algorithms for several 100-dimensional nonlinear PDEs from physics and finance such as the Allen-Cahn equation, the Hamilton-Jacobi-Bellman equation, and a nonlinear pricing model for financial derivatives.

URL PDF HTML ☆

赞 0 踩 0

1902.09135 2026-06-04 math.NA cs.CV cs.NA eess.IV

A Dual Symmetric Gauss-Seidel Alternating Direction Method of Multipliers for Hyperspectral Sparse Unmixing

一种双对称Gauss-Seidel交替方向乘子法用于超光谱稀疏解混

Longfei Ren, Chengjing Wang, Peipei Tang, Zheng Ma

发表机构 * School of Information Science and technology, and the Provincial Key Lab of Information Coding and Trans- mission, Southwest Jiaotong University（信息科学与技术学院，信息编码与传输省重点实验室，西南交通大学）； School of Mathematics, Southwest Jiaotong University（数学学院，西南交通大学）； School of Computer and Computing Science, Zhejiang University City College（计算机与计算科学学院，浙江大学城市学院）

AI总结本文提出了一种高效的双对称Gauss-Seidel交替方向乘子法（sGS-ADMM）用于带有总变分正则化的超光谱稀疏解混，解决了传统ADMM在计算效率和收敛性方面的不足，并通过实验验证了该方法在解混效率和图像质量上的优越性。

Comments 30 pages, 6 figures

详情

AI中文摘要

由于稀疏解混已成为超光谱解混的有前景的方法，最近一些空间上下文信息已被用来提高解混性能。总变分（TV）已被广泛用于促进空间均匀性和相邻像素之间的平滑性。然而，带有TV正则项的超光谱稀疏解混的计算任务很重。此外，对于带有TV正则项的超光谱稀疏解混的原始交替方向乘子法（ADMM）的收敛性尚未详细解释。在本文中，我们设计了一种高效的、收敛的双对称Gauss-Seidel ADMM（sGS-ADMM）用于带有TV正则项的超光谱稀疏解混。我们还对这种算法进行了全局收敛性和局部线性收敛率的分析。如数值实验所示，我们的算法在解混效率上明显优于最先进的算法。更重要的是，我们能够获得质量更高的图像。

英文摘要

Since sparse unmixing has emerged as a promising approach to hyperspectral unmixing, some spatial-contextual information in the hyperspectral images has been exploited to improve the performance of the unmixing recently. The total variation (TV) has been widely used to promote the spatial homogeneity as well as the smoothness between adjacent pixels. However, the computation task for hyperspectral sparse unmixing with a TV regularization term is heavy. Besides, the convergence of the primal alternating direction method of multipliers (ADMM) for the hyperspectral sparse unmixing with a TV regularization term has not been explained in details. In this paper, we design an efficient and convergent dual symmetric Gauss-Seidel ADMM (sGS-ADMM) for hyperspectral sparse unmixing with a TV regularization term. We also present the global convergence and local linear convergence rate analysis for this algorithm. As demonstrated in numerical experiments, our algorithm can obviously improve the efficiency of the unmixing compared with the state-of-the-art algorithm. More importantly, we can obtain images with higher quality.

URL PDF HTML ☆

赞 0 踩 0

1808.07452 2026-06-04 math.NA cs.LG cs.NA

Generalized Canonical Polyadic Tensor Decomposition

广义规范张量分解

David Hong, Tamara G. Kolda, Jed A. Duersch

发表机构 * Sandia National Laboratories（桑迪亚国家实验室）； University of Michigan（密歇根大学）

AI总结本文提出了一种广义规范张量分解（GCP），能够使用除均方误差外的其他损失函数，如逻辑损失或KL散度，从而适用于二分类或计数数据，并展示了其在社交网络互动、小鼠神经活动和印度月降雨量等真实数据集上的灵活性。

详情

DOI: 10.1137/18M1203626
Journal ref: SIAM Review, Vol. 62, No. 1, pp. 133-163, 2020

AI中文摘要

张量分解是数据科学中一种基本的无监督机器学习方法，应用于网络分析和传感器数据处理等领域。本文开发了一种广义规范（GCP）低秩张量分解，允许使用除均方误差外的其他损失函数。例如，我们可以使用逻辑损失或Kullback-Leibler散度，从而实现二分类或计数数据的张量分解。我们为各种场景提出了多种统计动机的损失函数。我们提供了一个通用的框架，用于计算梯度和处理缺失数据，使标准优化方法能够用于拟合模型。我们展示了GCP在多个真实世界示例中的灵活性，包括社交网络中的互动、小鼠神经活动以及印度的月降雨量测量。

英文摘要

Tensor decomposition is a fundamental unsupervised machine learning method in data science, with applications including network analysis and sensor data processing. This work develops a generalized canonical polyadic (GCP) low-rank tensor decomposition that allows other loss functions besides squared error. For instance, we can use logistic loss or Kullback-Leibler divergence, enabling tensor decomposition for binary or count data. We present a variety statistically-motivated loss functions for various scenarios. We provide a generalized framework for computing gradients and handling missing data that enables the use of standard optimization methods for fitting the model. We demonstrate the flexibility of GCP on several real-world examples including interactions in a social network, neural activity in a mouse, and monthly rainfall measurements in India.

URL PDF HTML ☆

赞 0 踩 0

1904.01514 2026-06-04 math.NA cs.LG cs.NA

Data driven approximation of parametrized PDEs by Reduced Basis and Neural Networks

基于降阶基和神经网络的数据驱动参数化PDE近似

Niccolò Dal Santo, Simone Deparis, Luca Pegolotti

发表机构 * SCI-SB-SD, École Polytechnique Fédérale de Lausanne (EPFL), Station 8, 1015 Lausanne, Switzerland（SCI-SB-SD，瑞士联邦理工学院（洛桑联邦理工学院），8号站，1015洛桑，瑞士）

AI总结本文提出一种结合降阶基方法和神经网络的数据驱动方法，用于近似参数化偏微分方程，通过减少物理参数的计算成本来估计感兴趣的场，如材料样本的温度或流体的速度。

详情

DOI: 10.1016/j.jcp.2020.109550

AI中文摘要

我们致力于利用基于降阶基方法和机器学习的数据驱动方法来近似偏微分方程。我们假设感兴趣的物理现象可以由参数化偏微分方程建模，但物理参数的值未知或难以直接测量。我们的方法允许在域内少量点的数据基础上估计感兴趣的场，例如材料样本的温度或流体的速度。我们提出使用神经网络嵌入降阶基求解器作为最后一层的 exotic 激活函数来完成此任务。降阶基求解器考虑了底层的物理现象，并从随机选择的物理参数值期间获得的快照中构建。随后，相同的全阶解用于训练神经网络。事实上，所选架构类似于一个不对称自动编码器，其中解码器是降阶基求解器，因此不包含可训练参数。所得到的自动编码器的潜在空间包括参数依赖的量，这些量为降阶基求解器提供输入，这取决于所考虑的偏微分方程，可能是物理参数本身或微分算子的仿射分解系数。

英文摘要

We are interested in the approximation of partial differential equations with a data-driven approach based on the reduced basis method and machine learning. We suppose that the phenomenon of interest can be modeled by a parametrized partial differential equation, but that the value of the physical parameters is unknown or difficult to be directly measured. Our method allows to estimate fields of interest, for instance temperature of a sample of material or velocity of a fluid, given data at a handful of points in the domain. We propose to accomplish this task with a neural network embedding a reduced basis solver as exotic activation function in the last layer. The reduced basis solver accounts for the underlying physical phenomenonon and it is constructed from snapshots obtained from randomly selected values of the physical parameters during an expensive offline phase. The same full order solutions are then employed for the training of the neural network. As a matter of fact, the chosen architecture resembles an asymmetric autoencoder in which the decoder is the reduced basis solver and as such it does not contain trainable parameters. The resulting latent space of our autoencoder includes parameter-dependent quantities feeding the reduced basis solver, which -- depending on the considered partial differential equation -- are the values of the physical parameters themselves or the affine decomposition coefficients of the differential operators.

URL PDF HTML ☆

赞 0 踩 0

1810.05947 2026-06-04 eess.SY cs.AI cs.SY math.OC

Robust Model Predictive Control of Irrigation Systems with Active Uncertainty Learning and Data Analytics

具有主动不确定性学习和数据分析的灌溉系统鲁棒模型预测控制

Chao Shang, Wei-Han Chen, Abraham Duncan Stroock, Fengqi You

发表机构 * Department of Automation, Tsinghua University（自动化系，清华大学）

AI总结本文提出了一种数据驱动的鲁棒模型预测控制方法，结合机理模型和数据驱动模型，通过构建不确定性集来提高灌溉系统的控制效率和可靠性，实验证明该方法能显著减少用水量并提升控制性能。

详情

DOI: 10.1109/TCST.2019.2916753
Journal ref: IEEE Transactions on Control Systems Technology, vol. 28, no. 4, pp. 1493-1504, 2020

AI中文摘要

我们开发了一种新型的数据驱动鲁棒模型预测控制（DDRMPC）方法，用于自动控制灌溉系统。核心思想是将机理模型（描述土壤含水量变化的动力学）和数据驱动模型（表征蒸散发和降水预测误差的不确定性）整合到一个系统控制框架中。为了更好地捕捉不确定性分布的支持，我们采用了一种基于学习的新方法，通过历史数据构建不确定性集。对于蒸散发预测误差，采用基于支持向量聚类的不确定性集，该方法可以方便地从历史数据中构建。而对于降水预测误差，我们分析了其分布对预测值的依赖性，并进一步设计了基于此类不确定性的特性定制的不确定性集。这样，整体不确定性分布可以被详细描述，最终有助于做出合理且高效的控制决策。为了确保数据驱动不确定性集的质量，采用训练-校准方案以提供理论性能保证。采用广义仿射决策规则以获得最优控制问题的可计算近似，从而确保DDRMPC的实用性。使用真实数据的案例研究显示，DDRMPC能够可靠地保持土壤含水量在安全水平以上并避免作物破坏。所提出的DDRMPC方法相比精细调优的开环控制策略，总用水量减少了40%。与精心调优的规则基控制和确定性等价模型预测控制相比，所提出的DDRMPC方法可以显著减少总用水量并提高控制性能。

英文摘要

We develop a novel data-driven robust model predictive control (DDRMPC) approach for automatic control of irrigation systems. The fundamental idea is to integrate both mechanistic models, which describe dynamics in soil moisture variations, and data-driven models, which characterize uncertainty in forecast errors of evapotranspiration and precipitation, into a holistic systems control framework. To better capture the support of uncertainty distribution, we take a new learning-based approach by constructing uncertainty sets from historical data. For evapotranspiration forecast error, the support vector clustering-based uncertainty set is adopted, which can be conveniently built from historical data. As for precipitation forecast errors, we analyze the dependence of their distribution on forecast values, and further design a tailored uncertainty set based on the properties of this type of uncertainty. In this way, the overall uncertainty distribution can be elaborately described, which finally contributes to rational and efficient control decisions. To assure the quality of data-driven uncertainty sets, a training-calibration scheme is used to provide theoretical performance guarantees. A generalized affine decision rule is adopted to obtain tractable approximations of optimal control problems, thereby ensuring the practicability of DDRMPC. Case studies using real data show that, DDRMPC can reliably maintain soil moisture above the safety level and avoid crop devastation. The proposed DDRMPC approach leads to a 40% reduction of total water consumption compared to the fine-tuned open-loop control strategy. In comparison with the carefully tuned rule-based control and certainty equivalent model predictive control, the proposed DDRMPC approach can significantly reduce the total water consumption and improve the control performance.

URL PDF HTML ☆

赞 0 踩 0

1904.02209 2026-06-04 math.OC cs.RO cs.SY eess.SY

The Green Choice: Learning and Influencing Human Decisions on Shared Roads

绿色选择：学习和影响共享道路上的人类决策

Erdem Bıyık, Daniel A. Lazar, Dorsa Sadigh, Ramtin Pedarsani

发表机构 * Department of Electrical Engineering, Stanford University（斯坦福大学电气工程系）； Department of Electrical and Computer Engineering, UC Santa Barbara（加州大学圣芭芭拉分校电气与计算机工程系）； Department of Computer Science, Stanford University（斯坦福大学计算机科学系）

AI总结本文研究如何通过设计价格策略来影响人类在共享道路上的决策，以最大化道路使用效率并减少交通延误，核心方法是基于用户偏好算法和交通基本图模型的规划优化。

Comments Submitted to CDC 2019

详情

DOI: 10.1109/CDC40024.2019.9030169

AI中文摘要

自动驾驶车辆通过编队有潜力提高道路容量，即使在与人类驾驶员共享道路的情况下。然而，当道路网络用户自私地选择路线时，所产生的交通配置可能非常低效。为此，我们考虑如何影响人类决策以减少道路上的拥堵。我们考虑一个由平行道路组成的网络，有两种交通模式：(i) 人类驾驶员会选择他们可用的最快路线，(ii) 滚动出行服务提供一系列自动驾驶车辆出行选项，每种选项有不同的价格。在本工作中，我们试图设计这些价格，使得当自动驾驶服务用户选择这些选项，而人类驾驶员自私地选择他们的路线时，道路使用最大化，交通延误最小化。为此，我们正式化了自动驾驶服务用户在不同价格/延误值路线之间做出选择的模型。开发基于偏好的算法来学习用户的偏好，并使用与交通基本图相关的车辆流模型，我们制定了一种规划优化以最大化社会目标，并展示了所提出路线和学习方案的好处。

英文摘要

Autonomous vehicles have the potential to increase the capacity of roads via platooning, even when human drivers and autonomous vehicles share roads. However, when users of a road network choose their routes selfishly, the resulting traffic configuration may be very inefficient. Because of this, we consider how to influence human decisions so as to decrease congestion on these roads. We consider a network of parallel roads with two modes of transportation: (i) human drivers who will choose the quickest route available to them, and (ii) ride hailing service which provides an array of autonomous vehicle ride options, each with different prices, to users. In this work, we seek to design these prices so that when autonomous service users choose from these options and human drivers selfishly choose their resulting routes, road usage is maximized and transit delay is minimized. To do so, we formalize a model of how autonomous service users make choices between routes with different price/delay values. Developing a preference-based algorithm to learn the preferences of the users, and using a vehicle flow model related to the Fundamental Diagram of Traffic, we formulate a planning optimization to maximize a social objective and demonstrate the benefit of the proposed routing and learning scheme.

URL PDF HTML ☆

赞 0 踩 0

1902.02095 2026-06-04 eess.SY cs.LG cs.SY

Space Navigator: a Tool for the Optimization of Collision Avoidance Maneuvers

空间导航：碰撞规避 maneuver 优化工具

Leonid Gremyachikh, Dmitrii Dubov, Nikita Kazeev, Andrey Kulibaba, Andrey Skuratov, Anton Tereshkin, Andrey Ustyuzhanin, Lubov Shiryaeva, Sergej Shishkin

发表机构 * National Research University Higher School of Economics, Laboratory of Methods for Big Data Analysis（俄罗斯国家研究大学高等经济学院，大数据分析方法实验室）； Yandex School of Data Analysis（Yandex数据科学学院）； Phygitalism

AI总结本文提出了一种名为“空间导航”的模块化自主碰撞规避系统，通过结合领域知识与强化学习方法，解决千级卫星星座的碰撞规避 maneuver 优化问题。

Comments Submitted to AAS Advances in the Astronautical Sciences, presented at IAA SciTech Forum 2018

1711.11417 2026-06-04 eess.SY cs.LG cs.SY

Scalable synthesis of safety certificates from data with application to learning-based control

可扩展的数据合成安全证书及在基于学习的控制中的应用

Kim P. Wabersich, Melanie N. Zeilinger

发表机构 * Institute for Dynamic Systems and Control, ETH Zurich（动态系统与控制研究所，苏黎世联邦理工学院）

AI总结本文提出了一种高效的方法来合成安全集和控制律，通过基于凸优化问题的近似方法，提高了可扩展性，同时利用高斯过程先验减少保守性，应用于自动驾驶车队等场景。

1804.08871 2026-06-04 eess.SY cs.RO cs.SY

Representing the Unknown - Impact of Uncertainty on the Interaction between Decision Making and Trajectory Generation

表示未知 - 不确定性对决策制定与轨迹生成交互的影响

Marcus Nolte, Susanne Ernst, Jan Richelmann, Markus Maurer

发表机构 * Institute of Control Engineering（控制工程研究所）

AI总结本文探讨了不确定性对自动驾驶车辆运动规划问题参数和环境模型要求的影响，强调了决策制定与轨迹生成之间明确接口的重要性。

1904.03152 2026-06-04 eess.SY cs.CL cs.NE cs.SY

Data-driven Modelling of Dynamical Systems Using Tree Adjoining Grammar and Genetic Programming

基于树附加语法学和遗传编程的数据驱动动态系统建模

Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

发表机构 * Department of Electrical Engineering（电气工程系）； Eindhoven University of Technology（埃因霍温理工大学）

AI总结本文提出了一种利用树附加语法学和遗传编程进行非线性动态系统数据驱动建模的方法，通过自动化建模过程并分析不同挑战下的性能。

Comments Paper accepted at IEEE CEC 2019

详情

DOI: 10.1109/CEC.2019.8790250

AI中文摘要

最先进的数据驱动非线性动态系统建模方法通常需要与专家用户交互。为了部分自动化从数据中建模物理系统的过程，许多基于进化算法的方法被提出用于模型结构选择，特别是针对非线性系统。最近，一种利用遗传编程（GP）进行非线性动态系统数据驱动建模的方法被提出。该方法的创新点在于对噪声的建模以及使用树附加语法来塑造GP探索的搜索空间。在本文中，我们报告了该方法在三个案例研究中的结果。每个案例研究均基于真实的物理系统。这些案例研究提出了各种挑战。特别是，这些挑战涵盖了对真实系统先验知识的不同程度、可用数据量、系统动态的复杂性以及系统中非线性特性。基于案例研究中取得的结果，我们对所提出的方法的性能进行了批判性分析。

英文摘要

State-of-the-art methods for data-driven modelling of non-linear dynamical systems typically involve interactions with an expert user. In order to partially automate the process of modelling physical systems from data, many EA-based approaches have been proposed for model-structure selection, with special focus on non-linear systems. Recently, an approach for data-driven modelling of non-linear dynamical systems using Genetic Programming (GP) was proposed. The novelty of the method was the modelling of noise and the use of Tree Adjoining Grammar to shape the search-space explored by GP. In this paper, we report results achieved by the proposed method on three case studies. Each of the case studies considered here is based on real physical systems. The case studies pose a variety of challenges. In particular, these challenges range over varying amounts of prior knowledge of the true system, amount of data available, the complexity of the dynamics of the system, and the nature of non-linearities in the system. Based on the results achieved for the case studies, we critically analyse the performance of the proposed method.

URL PDF HTML ☆

赞 0 踩 0

1607.01027 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Accelerate Stochastic Subgradient Method by Leveraging Local Growth Condition

通过利用局部增长条件加速随机子梯度方法

Yi Xu, Qihang Lin, Tianbao Yang

发表机构 * Department of Computer Science（计算机科学系）； Department of Management Sciences（管理科学系）； The University of Iowa（爱荷华大学）

AI总结本文提出了一种新的理论，表明在最优解邻域内目标函数的局部增长率足以量化一阶随机凸优化的全局收敛率，通过局部区域逐步缩小的方法改进了加速随机子梯度方法的收敛性，并在实践中提出了无需知道乘法增长常数和增长率的实用变体。

详情

AI中文摘要

在本文中，我们为一阶随机凸优化开发了一种新理论，表明全局收敛率足以由最优解邻域内目标函数的局部增长率量化。具体而言，如果目标函数F(w)在ε子水平集内以速度‖w - w*‖_2^{1/θ}增长，其中w*是w最近的最优解，θ∈(0,1]表示局部增长率，则达到ε最优解的一阶随机优化迭代复杂度可以为~O(1/ε^{2(1-θ)})，这在至多对数因子范围内是最佳的。为了实现更快的全局收敛，我们通过在历史解的局部区域中迭代求解原始问题，开发了两种不同的加速随机子梯度方法，该局部区域的大小随着解接近最优集而逐渐减小。除了理论改进外，这项工作还包含了使所提算法实用的新贡献：(i) 我们提出了可以运行而无需知道乘法增长常数和增长率θ的加速随机子梯度方法的实用变体；(ii) 我们考虑了机器学习中的广泛问题集，以证明所提算法比传统随机子梯度方法具有更快的收敛速度。我们还表征了所提算法的复杂性，以确保在不假设光滑性的情况下梯度较小。

英文摘要

In this paper, a new theory is developed for first-order stochastic convex optimization, showing that the global convergence rate is sufficiently quantified by a local growth rate of the objective function in a neighborhood of the optimal solutions. In particular, if the objective function $F(\mathbf w)$ in the $ε$-sublevel set grows as fast as $\|\mathbf w - \mathbf w_*\|_2^{1/θ}$, where $\mathbf w_*$ represents the closest optimal solution to $\mathbf w$ and $θ\in(0,1]$ quantifies the local growth rate, the iteration complexity of first-order stochastic optimization for achieving an $ε$-optimal solution can be $\widetilde O(1/ε^{2(1-θ)})$, which is optimal at most up to a logarithmic factor. To achieve the faster global convergence, we develop two different accelerated stochastic subgradient methods by iteratively solving the original problem approximately in a local region around a historical solution with the size of the local region gradually decreasing as the solution approaches the optimal set. Besides the theoretical improvements, this work also includes new contributions towards making the proposed algorithms practical: (i) we present practical variants of accelerated stochastic subgradient methods that can run without the knowledge of multiplicative growth constant and even the growth rate $θ$; (ii) we consider a broad family of problems in machine learning to demonstrate that the proposed algorithms enjoy faster convergence than traditional stochastic subgradient method. We also characterize the complexity of the proposed algorithms for ensuring the gradient is small without the smoothness assumption.

URL PDF HTML ☆

赞 0 踩 0

1809.09170 2026-06-04 math.NA cs.LG cs.NA math.DS stat.ML

Numerical Aspects for Approximating Governing Equations Using Data

利用数据近似求解方程的数值方面

Kailiang Wu, Dongbin Xiu

发表机构 * Department of Mathematics, The Ohio State University, Columbus, OH 43210, USA.（数学系，俄亥俄州立大学，哥伦布，OH 43210，USA）

AI总结本文提出了一种有效的数值算法，用于从测量数据中局部恢复未知的偏微分方程，通过使用多项式等标准基函数进行高精度近似，并讨论了准确近似的关键因素，如使用大量短轨迹数据而非单一长轨迹数据，以及展示了线性和非线性系统的数值示例。

Comments 26 pages, 17 figures

详情

DOI: 10.1016/j.jcp.2019.01.030
Journal ref: Journal of Computational Physics, 384, 200-221, 2019

AI中文摘要

我们提出了有效的数值算法，用于从测量数据中局部恢复未知的偏微分方程。我们采用一组标准基函数，例如多项式，来高精度地近似求解方程。在将问题转化为函数近似问题后，我们讨论了几个重要的方面以确保准确的近似。最值得注意的是，我们讨论了使用大量短轨迹数据burst而非单一长轨迹数据的重要性。随后，我们提出了几种数值算法以实现准确的近似，并给出了最终方程近似的误差估计。然后，我们展示了线性和非线性系统的一系列广泛数值示例，以展示我们方程恢复算法的性质和有效性。

英文摘要

We present effective numerical algorithms for locally recovering unknown governing differential equations from measurement data. We employ a set of standard basis functions, e.g., polynomials, to approximate the governing equation with high accuracy. Upon recasting the problem into a function approximation problem, we discuss several important aspects for accurate approximation. Most notably, we discuss the importance of using a large number of short bursts of trajectory data, rather than using data from a single long trajectory. Several options for the numerical algorithms to perform accurate approximation are then presented, along with an error estimate of the final equation approximation. We then present an extensive set of numerical examples of both linear and nonlinear systems to demonstrate the properties and effectiveness of our equation recovery algorithms.

URL PDF HTML ☆

赞 0 踩 0

1902.09964 2026-06-04 eess.SY cs.LG cs.SY stat.ML

A Neural-Network-Based Model Predictive Control of Three-Phase Inverter With an Output LC Filter

基于神经网络的三相逆变器模型预测控制及输出LC滤波器

Ihab S. Mohamed, Stefano Rovetta, Ton Duc Do, Tomislav Dragicevic, Ahmed A. Zaki Diab

发表机构 * 1 INRIA Sophia Antipolis - M\'editerran\'ee, University C\ ote d'Azur, France (e-mail: ) ； 2 Department of Informatics, Bioengineering, Robotics ； Systems Engineering, University of Genoa, Italy (e-mail: ) ； 3 Department of Robotics ； Mechatronics, School of Science ； Technology (SST), Nazarbayev University, Astana Z05H0P9, Republic of Kazakhstan (e-mail: ) ； 4 Department of Energy Technology, Aalborg University, Denmark (e-mail: ) ； 5 Electrical Engineering Department, Faculty of Engineering, Minia University, Egypt (e-mail: )

AI总结本文提出了一种结合模型预测控制（MPC）和前馈人工神经网络（ANN）的双级逆变器控制方案，旨在降低总谐波失真（THD）并提高系统在不同负载类型下的稳态和动态性能。通过MPC生成神经网络训练数据，利用训练好的ANN实现无MPC的电压跟踪，通过MATLAB/Simulink仿真验证了该策略的优越性能。

Comments 13 pages, 15 figures, 3 tables. This article has been submitted to IEEE Access

详情

DOI: 10.1109/ACCESS.2019.2938220

AI中文摘要

模型预测控制（MPC）已成为一种well-established的现代控制方法，用于具有输出LC滤波器的三相逆变器，其中需要高质量电压和低总谐波失真（THD）。尽管MPC是一种直观的控制器，易于理解和实现，但它有显著缺点，即需要大量的在线计算来解决优化问题。另一方面，在电力电子和驱动领域，基于人工神经网络的无模型方法的应用正在迅速增长。本文提出了一种新的双级逆变器控制方案，结合MPC和前馈ANN，旨在降低THD并提高系统在不同负载类型下的稳态和动态性能。首先，MPC在训练阶段用于生成用于训练所提出神经网络所需的数据。然后，一旦神经网络经过微调，就可以在不需要使用MPC的情况下在线用于电压跟踪目的。所提出的基于ANN的控制策略通过MATLAB/Simulink工具进行仿真验证，考虑了不同的负载条件。此外，评估了基于ANN的控制器在多种线性和非线性负载下的不同运行条件下性能，并与MPC的性能进行比较，证明了所提出基于ANN的控制策略在稳态和动态性能方面的优异表现。

英文摘要

Model predictive control (MPC) has become one of the well-established modern control methods for three-phase inverters with an output LC filter, where a high-quality voltage with low total harmonic distortion (THD) is needed. Although it is an intuitive controller, easy to understand and implement, it has the significant disadvantage of requiring a large number of online calculations for solving the optimization problem. On the other hand, the application of model-free approaches such as those based on artificial neural networks approaches is currently growing rapidly in the area of power electronics and drives. This paper presents a new control scheme for a two-level converter based on combining MPC and feed-forward ANN, with the aim of getting lower THD and improving the steady and dynamic performance of the system for different types of loads. First, MPC is used, as an expert, in the training phase to generate data required for training the proposed neural network. Then, once the neural network is fine-tuned, it can be successfully used online for voltage tracking purpose, without the need of using MPC. The proposed ANN-based control strategy is validated through simulation, using MATLAB/Simulink tools, taking into account different loads conditions. Moreover, the performance of the ANN-based controller is evaluated, on several samples of linear and non-linear loads under various operating conditions, and compared to that of MPC, demonstrating the excellent steady-state and dynamic performance of the proposed ANN-based control strategy.

URL PDF HTML ☆

赞 0 踩 0

1904.05856 2026-06-04 math.OC cs.LG cs.SY eess.SY

Connections Between Adaptive Control and Optimization in Machine Learning

适应控制与机器学习中优化方法之间的联系

Joseph E. Gaudio, Travis E. Gibson, Anuradha M. Annaswamy, Michael A. Bolender, Eugene Lavretsky

发表机构 * Massachusetts Institute of Technology（麻省理工学院）； Brigham and Women’s Hospital and Harvard Medical School（布莱尔妇女医院和哈佛医学院）； Air Force Research Laboratory（空军研究实验室）； The Boeing Company（波音公司）

AI总结本文探讨了适应控制与机器学习中常用优化方法之间的联系，通过分析更新法则的相似性，讨论了稳定性、性能和学习等共同概念，并提出了新的交集和改进算法分析的机会，特别是通过这些交集的见解解决了高阶学习问题。

Comments 18 pages

1712.01491 2026-06-04 eess.SY cs.RO cs.SY

TrackerBots: Autonomous Unmanned Aerial Vehicle for Real-Time Localization and Tracking of Multiple Radio-Tagged Animals

TrackerBots: 用于实时定位和跟踪多个无线电标签动物的自主无人机

Hoa Van Nguyen, Michael Chesser, Lian Pin Koh, S. Hamid Rezatofighi, Damith C. Ranasinghe

发表机构 * Wiley（威立出版集团）

AI总结本文提出了一种名为TrackerBots的自主无人机系统，利用RSSI测量实现低成本轻量级的多只无线电标签动物的实时定位和跟踪，通过粒子滤波和POMDP动态路径规划提高导航效率并节省电量。

Comments The accepted version to the Journal of Field Robotics, published after the embargo period (12 months)

详情

DOI: 10.1002/rob.21857
Journal ref: Journal of Field Robotics. 2019; 36: 617 - 635

AI中文摘要

自主空中机器人为通过高效收集时间空间粒度信息研究濒危物种的栖息地和行为提供了新可能。我们提出了一种新的自主空中车辆系统-TrackerBots，用于跟踪和定位多个无线电标签动物。利用非常规场常用的高频（VHF）无线电颈环接收信号强度指示（RSSI）值的简单性，实现了一种低成本轻量级的跟踪平台，适合与无人机（UAV）集成。由于基于RSSI测量的系统存在不确定性和非线性，我们的跟踪和规划方法整合了粒子滤波用于跟踪和定位；部分可观测马尔可夫决策过程（POMDP）用于动态路径规划。这种方法允许无人机在最大信息增益方向上自主导航以定位多个移动动物并减少探索时间；从而节省机载电池电量。我们还采用了搜索终止标准的概念以在空中系统的功率限制内最大化所定位动物的数量。我们通过广泛的模拟和使用两个移动VHF无线电标签的实地实验验证了我们的实时和在线方法。

英文摘要

Autonomous aerial robots provide new possibilities to study the habitats and behaviors of endangered species through the efficient gathering of location information at temporal and spatial granularities not possible with traditional manual survey methods. We present a novel autonomous aerial vehicle system-TrackerBots-to track and localize multiple radio-tagged animals. The simplicity of measuring the received signal strength indicator (RSSI) values of very high frequency (VHF) radio-collars commonly used in the field is exploited to realize a low cost and lightweight tracking platform suitable for integration with unmanned aerial vehicles (UAVs). Due to uncertainty and the nonlinearity of the system based on RSSI measurements, our tracking and planning approaches integrate a particle filter for tracking and localizing; a partially observable Markov decision process (POMDP) for dynamic path planning. This approach allows autonomous navigation of a UAV in a direction of maximum information gain to locate multiple mobile animals and reduce exploration time; and, consequently, conserve onboard battery power. We also employ the concept of a search termination criteria to maximize the number of located animals within power constraints of the aerial system. We validated our real-time and online approach through both extensive simulations and field experiments with two mobile VHF radio-tags.

URL PDF HTML ☆

赞 0 踩 0

1812.08625 2026-06-04 math.NA cs.LG cs.NA physics.comp-ph stat.ML

Deep Theory of Functional Connections: A New Method for Estimating the Solutions of PDEs

深度函数连接理论：一种用于估计偏微分方程解的新方法

Carl Leake

发表机构 * Ph.D. Student, Aerospace Engineering, Texas A\&M University, College Station, TX（航空航天工程博士研究生，德克萨斯A&M大学，学院站，德克萨斯）

AI总结本文提出了一种名为深度函数连接理论（TFC）的新方法，通过将神经网络与TFC结合来估计偏微分方程（PDEs）的解。该方法将带有边界条件的PDEs转换为无约束优化问题，利用神经网络作为自由函数来求解无约束优化问题，并通过残差平方作为损失函数进行无监督训练。与传统方法相比，该方法无需离散化域，且能提供整个训练域的闭合形式解析解。

Comments 14 pages, 7 figures

详情

DOI: 10.3390/make2010004
Journal ref: Mach. Learn. Knowl. Extr. 2020, 2(1), 37-55

AI中文摘要

本文提出了一种名为深度函数连接理论（TFC）的新方法，通过将神经网络与TFC结合来估计偏微分方程（PDEs）的解。TFC用于将带有边界条件的PDEs转换为无约束优化问题，通过将边界条件嵌入到一个“约束表达式”中。在本工作中，神经网络被选为自由函数，并用于求解现在无约束的优化问题。损失函数取为PDE残差的平方。然后，神经网络以无监督的方式训练以解决无约束优化问题。与用于估计PDE解的流行方法相比，该方法有两个主要区别。首先，该方法不需要将域离散化为网格，而是在线性训练阶段随机采样域中的点。其次，训练后，该方法在整个训练域内提供闭合形式、解析、可微的解的近似。相比之下，其他流行方法如果需要在不在离散化网格上的点上估计解，则需要插值。深度TFC方法用于解决四个具有各种边界条件的问题。

英文摘要

This article presents a new methodology called deep Theory of Functional Connections (TFC) that estimates the solutions of partial differential equations (PDEs) by combining neural networks with TFC. TFC is used to transform PDEs with boundary conditions into unconstrained optimization problems by embedding the boundary conditions into a "constrained expression." In this work, a neural network is chosen as the free function, and used to solve the now unconstrained optimization problem. The loss function is taken as the square of the residual of the PDE. Then, the neural network is trained in an unsupervised manner to solve the unconstrained optimization problem. This methodology has two major differences when compared with popular methods used to estimate the solutions of PDEs. First, this methodology does not need to discretize the domain into a grid, rather, this methodology randomly samples points from the domain during the training phase. Second, after training, this methodology represents a closed form, analytical, differentiable approximation of the solution throughout the entire training domain. In contrast, other popular methods require interpolation if the estimated solution is desired at points that do not lie on the discretized grid. The deep TFC method for estimating the solution of PDEs is demonstrated on four problems with a variety of boundary conditions.

URL PDF HTML ☆

赞 0 踩 0

1809.02341 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

A Fast Anderson-Chebyshev Acceleration for Nonlinear Optimization

非线性优化中的快速安德森-切比雪夫加速方法

Zhize Li, Jian Li

发表机构 * King Abdullah University of Science and Technology（卡布斯大学）； Tsinghua University（清华大学）

AI总结本文提出了一种快速安德森-切比雪夫加速方法，用于非线性优化问题，该方法在二次函数上实现了最优收敛率O(√κ ln(1/ε))，并提供了通用非线性问题的收敛分析，同时提出了动态猜测超参数的算法。

Comments To appear in AISTATS 2020

详情

AI中文摘要

安德森加速（或安德森混合）是一种高效的固定点迭代方法$ x_{t+1}=G(x_t) $，例如梯度下降可以视为迭代应用操作$ G(x) riangleq x-α abla f(x) $。本文表明，安德森加速结合切比雪夫多项式可以实现最优收敛率$ O(\sqrtκ\ln rac{1}ε) $，这改进了之前对于二次函数提供的结果$ O(κ\ln rac{1}ε) $（Toth and Kelley, 2015）。此外，我们为一般非线性问题提供了收敛分析。此外，如果超参数（例如Lipschitz光滑参数$ L $）不可用，我们提出了一种猜测算法来动态猜测它们，并证明了类似的收敛率。最后，实验结果表明，所提出的安德森-切比雪夫加速方法比其他算法如普通梯度下降（GD）、Nesterov加速GD收敛更快。此外，这些算法结合所提出的猜测算法（动态猜测超参数）实现了更好的性能。

英文摘要

Anderson acceleration (or Anderson mixing) is an efficient acceleration method for fixed point iterations $x_{t+1}=G(x_t)$, e.g., gradient descent can be viewed as iteratively applying the operation $G(x) \triangleq x-α\nabla f(x)$. It is known that Anderson acceleration is quite efficient in practice and can be viewed as an extension of Krylov subspace methods for nonlinear problems. In this paper, we show that Anderson acceleration with Chebyshev polynomial can achieve the optimal convergence rate $O(\sqrtκ\ln\frac{1}ε)$, which improves the previous result $O(κ\ln\frac{1}ε)$ provided by (Toth and Kelley, 2015) for quadratic functions. Moreover, we provide a convergence analysis for minimizing general nonlinear problems. Besides, if the hyperparameters (e.g., the Lipschitz smooth parameter $L$) are not available, we propose a guessing algorithm for guessing them dynamically and also prove a similar convergence rate. Finally, the experimental results demonstrate that the proposed Anderson-Chebyshev acceleration method converges significantly faster than other algorithms, e.g., vanilla gradient descent (GD), Nesterov's Accelerated GD. Also, these algorithms combined with the proposed guessing algorithm (guessing the hyperparameters dynamically) achieve much better performance.

URL PDF HTML ☆

赞 0 踩 0