arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1801.08995 2026-06-04 eess.SY cs.RO cs.SY

Trajectory Generation using Sharpness Continuous Dubins-like Paths with Applications in Control of Heavy Duty Vehicles

利用连续尖锐度的Dubins-like路径生成轨迹，应用于重载车辆控制

Rui Oliveira, Pedro F. Lima, Marcello Cirillo, Jonas Mårtensson, Bo Wahlberg

发表机构 * Scania, Autonomous Transport Solutions（斯堪尼亚，自主运输解决方案）

AI总结本文提出了一种考虑转向执行器速率和扭矩限制的轨迹生成框架，通过连续尖锐度曲线提升重载车辆的自主驾驶性能。

Comments 18 pages, 7 figures, accepted for publication at ECC 2018 - European Control Conference

详情

AI中文摘要

我们提出了一种用于控制轮式车辆的轨迹生成框架，考虑转向执行器的速率和扭矩限制。关键思想是直接考虑转向执行器的速率和扭矩限制，而传统方法仅考虑曲率速率限制。我们提出新的尖锐度连续曲线概念，结合三次和Sigmoid曲率轨迹以及圆弧来引导车辆。所获得的轨迹具有平滑且连续可微的转向角剖面。这些轨迹为低层控制器提供更易跟踪的参考信号，从而提高性能。所获得的转向剖面的平滑性也提高了乘客舒适度。该方法具有快速的计算时间，可通过简单预计算进一步加速。我们详细讨论了该方法的路径规划应用，并通过仿真展示了其优势和实时能力。

英文摘要

We present a trajectory generation framework for control of wheeled vehicles under steering actuator constraints. The motivation is smooth autonomous driving of heavy vehicles. The key idea is to take into account rate, and additionally, torque limitations of the steering actuator directly. Previous methods only take into account curvature rate limitations, which deal indirectly with steering rate limitations. We propose the new concept of Sharpness Continuous curves, which uses cubic and sigmoid curvature trajectories together with circular arcs to steer the vehicle. The obtained trajectories are characterized by a smooth and continuously differentiable steering angle profile. These trajectories provide low-level controllers with reference signals which are easier to track, resulting in improved performance. The smoothness of the obtained steering profiles also results in increased passenger comfort. The method is characterized by a fast computation time, which can be further speeded up through the use of simple pre-computations. We detail possible path planning applications of the method, and conduct simulations that show its advantages and real time capabilities.

URL PDF HTML ☆

赞 0 踩 0

1702.04837 2026-06-04 stat.ML cs.LG cs.NA math.NA

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

草图岭回归：优化视角、统计视角和模型平均

Shusen Wang, Alex Gittens, Michael W. Mahoney

发表机构 * International Computer Science Institute and Department of Statistics University of California at Berkeley（国际计算机科学研究所和统计学系加州大学伯克利分校）； Computer Science Department Rensselaer Polytechnic Institute（计算机科学系拉特格斯理工学院）

AI总结本文从优化和统计角度研究了草图和Hessian草图在矩阵岭回归中的影响，发现经典草图能近似最优解，而Hessian草图则不同。通过理论和实验表明，模型平均可显著降低真实与草图解间的风险差距。

Comments To appear in Journal of Machine Learning Research, 2018. A short version has appeared in International Conference on Machine Learning (ICML), 2017

详情

Journal ref: Journal of Machine Learning Research, 19, pp1-50, 2018

AI中文摘要

我们探讨了经典草图和Hessian草图在近似求解矩阵岭回归（MRR）问题中的统计和优化影响。先前研究量化了经典草图对更简单的最小二乘回归（LSR）问题的影响。我们证明经典草图对MRR的优化属性的影响与对LSR的影响类似：即恢复近似最优解。相反，Hessian草图没有这种保证，其近似误差由响应中的“质量”与最优目标值之间的微妙交互决定。对于两种类型的近似，sketched MRR中的正则化导致与sketched LSR不同的统计特性。特别是，在sketched MRR中存在偏误-方差权衡，这在sketched LSR中不存在。我们提供了sketched MRR的偏误和方差的上界和下界，这些界限表明经典草图显著增加方差，而Hessian草图显著增加偏误。经验上，sketched MRR的解的风险可能比最优MRR解高一个数量级。我们理论和实证表明，模型平均显著降低真实解与sketched解风险之间的差距。因此，在并行或分布式设置中，草图结合模型平均是一种强大的技术，能够快速获得近似最优解，同时大幅减轻草图带来的统计风险增加。

英文摘要

We address the statistical and optimization impacts of the classical sketch and Hessian sketch used to approximately solve the Matrix Ridge Regression (MRR) problem. Prior research has quantified the effects of classical sketch on the strictly simpler least squares regression (LSR) problem. We establish that classical sketch has a similar effect upon the optimization properties of MRR as it does on those of LSR: namely, it recovers nearly optimal solutions. By contrast, Hessian sketch does not have this guarantee, instead, the approximation error is governed by a subtle interplay between the "mass" in the responses and the optimal objective value. For both types of approximation, the regularization in the sketched MRR problem results in significantly different statistical properties from those of the sketched LSR problem. In particular, there is a bias-variance trade-off in sketched MRR that is not present in sketched LSR. We provide upper and lower bounds on the bias and variance of sketched MRR, these bounds show that classical sketch significantly increases the variance, while Hessian sketch significantly increases the bias. Empirically, sketched MRR solutions can have risks that are higher by an order-of-magnitude than those of the optimal MRR solutions. We establish theoretically and empirically that model averaging greatly decreases the gap between the risks of the true and sketched solutions to the MRR problem. Thus, in parallel or distributed settings, sketching combined with model averaging is a powerful technique that quickly obtains near-optimal solutions to the MRR problem while greatly mitigating the increased statistical risk incurred by sketching.

URL PDF HTML ☆

赞 0 踩 0

1801.07745 2026-06-04 math.OC cs.AI cs.CG cs.NA math.NA

Optimal Transport on Discrete Domains

离散域上的最优传输

Justin Solomon

发表机构 * MIT Department of Electrical Engineering and Computer Science（麻省理工学院电气工程与计算机科学系）； MIT Department of Electrical Engineering（麻省理工学院电气工程系）

AI总结本文探讨了离散最优传输的最新进展，结合偏微分方程与凸分析，提出理论支持的模型，适用于数万到数百万顶点的领域。

详情

AI中文摘要

受物流问题中供需匹配的启发，最优传输（或蒙特卡洛问题）涉及在几何域上定义的概率分布的匹配。在最明显的离散化中，最优传输成为大规模线性规划问题，通常在三角网格、图、点云等图形和机器学习中遇到的域上难以高效求解。然而，最近的数值最优传输突破使可扩展性达到数量级更大的问题，可在几秒钟内解决。本文讨论了利用离散和光滑问题方面理解的数值最优传输进展。最先进的离散最优传输技术结合了偏微分方程（PDE）与凸分析的洞察，以重新公式化、离散化和优化运输问题。最终结果是一组理论上支持的模型，适用于具有数万或数百万顶点的领域。由于数值最优传输是一个相对较新的学科，特别强调了识别和解释需要数学洞察和额外研究的开放问题。

英文摘要

Inspired by the matching of supply to demand in logistical problems, the optimal transport (or Monge--Kantorovich) problem involves the matching of probability distributions defined over a geometric domain such as a surface or manifold. In its most obvious discretization, optimal transport becomes a large-scale linear program, which typically is infeasible to solve efficiently on triangle meshes, graphs, point clouds, and other domains encountered in graphics and machine learning. Recent breakthroughs in numerical optimal transport, however, enable scalability to orders-of-magnitude larger problems, solvable in a fraction of a second. Here, we discuss advances in numerical optimal transport that leverage understanding of both discrete and smooth aspects of the problem. State-of-the-art techniques in discrete optimal transport combine insight from partial differential equations (PDE) with convex analysis to reformulate, discretize, and optimize transportation problems. The end result is a set of theoretically-justified models suitable for domains with thousands or millions of vertices. Since numerical optimal transport is a relatively new discipline, special emphasis is placed on identifying and explaining open problems in need of mathematical insight and additional research.

URL PDF HTML ☆

赞 0 踩 0

1804.10432 2026-06-04 math.NA cs.CV cs.NA math.DG

Variational Regularization of Inverse Problems for Manifold-Valued Data

变分正则化用于流形值数据的反问题

Martin Storath, Andreas Weinmann

发表机构 * Image Analysis and Learning Group, Interdisciplinary Center for Scientific Computing, Universität Heidelberg, Germany（图像分析与学习组，跨学科科学计算中心，海德堡大学，德国）； Department of Mathematics and Natural Sciences, Hochschule Darmstadt, and Institute of Computational Biology, Helmholtz Zentrum München, Germany（数学与自然科学系，达姆施塔特应用科学大学，以及计算生物学研究所，海德堡研究中心，德国）

AI总结本文研究流形值数据的变分正则化反问题，提出TV和TGV正则化方法，并通过合成和真实数据验证其有效性。

1804.07010 2026-06-04 stat.ML cs.LG cs.SY eess.SY math.AP math.OC

Forward-Backward Stochastic Neural Networks: Deep Learning of High-dimensional Partial Differential Equations

前向-后向随机神经网络：高维偏微分方程的深度学习

Maziar Raissi

发表机构 * Division of Applied Mathematics, Brown University（布朗大学应用数学系）

AI总结本文提出一种高维偏微分方程求解方法，利用深度神经网络和随机微分方程的联系，避免数值离散化限制，解决维度诅咒问题。

1804.06586 2026-06-04 eess.SY cs.RO cs.SY

Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

双向往复操作系统无持续激励的复合自适应控制

Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

发表机构 * School of Automation and Electrical Engineering, University of Science and Technology Beijing（自动化与电气工程学院，北京科技大学）； Department of Automatic Control, Lund University, P.O. Box 118, 22100 Lund, Sweden（自动控制系，卢德大学）

AI总结本文提出一种无需持续激励条件的复合自适应控制方法，用于解决非线性双向往复操作系统中参数收敛问题，通过线性矩阵不等式给出闭环系统稳定性准则，并通过仿真验证了方法的有效性。

Comments 21 pages, 9 figures, submitted to Journal of The Franklin Institute

1804.04349 2026-06-04 eess.SY cs.RO cs.SE cs.SY

On the Application of ISO 26262 in Control Design for Automated Vehicles

在自动驾驶车辆控制设计中应用ISO 26262

Georg Schildbach

发表机构 * Institute for Electrical Engineering in Medicine, University of Luebeck（医学电气工程研究所，吕贝克大学）

AI总结本文探讨了ISO 26262标准在自动驾驶车辆控制设计中的应用，分析了其在高自动化车辆中的适用性与争议，并总结了该标准的安全设计步骤。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情

DOI: 10.4204/EPTCS.269.7
Journal ref: EPTCS 269, 2018, pp. 74-82

AI中文摘要

过去十年间，自动驾驶车辆的研究经历了爆炸性增长。然而，其实际应用的主要障碍是一个令人信服的安全概念。随着算法日益复杂和车辆自动化水平提高，这一问题愈发重要。功能安全领域提供了一种系统方法，用于识别潜在风险源并提高车辆安全性。该领域基于航空航天、过程和其他行业数十年的实践经验。这些经验汇总成汽车领域的功能安全标准ISO 26262，已被广泛采用。然而，其在高自动化车辆中的适用性和相关性却存在争议。本文对这一讨论进行了批判性分析，并总结了ISO 26262的主要步骤，以实现自动驾驶车辆的安全控制设计。

英文摘要

Research on automated vehicles has experienced an explosive growth over the past decade. A main obstacle to their practical realization, however, is a convincing safety concept. This question becomes ever more important as more sophisticated algorithms are used and the vehicle automation level increases. The field of functional safety offers a systematic approach to identify possible sources of risk and to improve the safety of a vehicle. It is based on practical experience across the aerospace, process and other industries over multiple decades. This experience is compiled in the functional safety standard for the automotive domain, ISO 26262, which is widely adopted throughout the automotive industry. However, its applicability and relevance for highly automated vehicles is subject to a controversial debate. This paper takes a critical look at the discussion and summarizes the main steps of ISO 26262 for a safe control design for automated vehicles.

URL PDF HTML ☆

赞 0 踩 0

1804.03036 2026-06-04 eess.IV cs.RO cs.SY eess.SY

Image Moment Models for Extended Object Tracking

图像矩模型用于扩展目标跟踪

Gang Yao, Ashwin Dani

发表机构 * Department of Electrical and Computer Engineering, University of Connecticut Storrs（电气与计算机工程系，康涅狄格大学斯托尔斯分校）

AI总结本文提出基于图像矩的新型模型，用于估计和跟踪复杂轨迹下的目标形状。通过无迹卡尔曼滤波-交互多模型算法估计目标形状及位置速度，结合IOU和RMSE指标进行验证。

详情

Journal ref: IEEE Transactions on Aerospace and Electronic Systems, 2018

AI中文摘要

本文提出了一种基于图像矩的新型模型，用于估计和跟踪具有复杂轨迹的物体形状。假设相机静止注视移动物体，内部点特征作为测量值。假设椭圆近似为基本形状，利用图像矩组合估计椭圆形状。推导了在恒速或协调转弯运动模型下图像矩的动态模型，作为物体形状估计的函数。应用无迹卡尔曼滤波-交互多模型（UKF-IMM）滤波算法估计目标形状（近似为椭圆）并跟踪其位置和速度。基于平均对数似然推导IMM滤波器的似然函数。展示了所提UKF-IMM算法与图像矩模型的仿真结果，显示了在复杂轨迹中移动物体的估计结果。通过交并比（IOU）和位置和速度均方根误差（RMSE）等指标与文献中的基准算法进行比较。还展示了从四旋翼无人机捕获的真实图像数据结果。

英文摘要

In this paper, a novel image moments based model for shape estimation and tracking of an object moving with a complex trajectory is presented. The camera is assumed to be stationary looking at a moving object. Point features inside the object are sampled as measurements. An ellipsoidal approximation of the shape is assumed as a primitive shape. The shape of an ellipse is estimated using a combination of image moments. Dynamic model of image moments when the object moves under the constant velocity or coordinated turn motion model is derived as a function for the shape estimation of the object. An Unscented Kalman Filter-Interacting Multiple Model (UKF-IMM) filter algorithm is applied to estimate the shape of the object (approximated as an ellipse) and track its position and velocity. A likelihood function based on average log-likelihood is derived for the IMM filter. Simulation results of the proposed UKF-IMM algorithm with the image moments based models are presented that show the estimations of the shape of the object moving in complex trajectories. Comparison results, using intersection over union (IOU), and position and velocity root mean square errors (RMSE) as metrics, with a benchmark algorithm from literature are presented. Results on real image data captured from the quadcopter are also presented.

URL PDF HTML ☆

赞 0 踩 0

1804.03973 2026-06-04 eess.SY cs.AI cs.SY

Reasoning about Safety of Learning-Enabled Components in Autonomous Cyber-physical Systems

关于自主机电系统中学习组件安全性的推理

Cumhur Erkan Tuncali, James Kapinski, Hisahiro Ito, Jyotirmoy V. Deshmukh

发表机构 * Toyota Research Institute of North America（丰田北美研究院）； University of Southern California（南加州大学）

AI总结本文提出基于模拟的方法生成屏障证书函数，用于验证包含神经网络控制器的机电系统安全性。通过线性规划求解器从随机初始状态获得的模拟轨迹中找到候选生成函数，并利用SMT求解器验证其安全性。

Comments Invited paper in conference: Design Automation Conference (DAC) 2018

1804.02814 2026-06-04 eess.SY cs.RO cs.SY

Nonlinear Unknown Input and State Estimation Algorithm in Mobile Robots

移动机器人中的非线性未知输入和状态估计算法

Pinyao Guo, Hunmin Kim, Nurali Virani, Jun Xu, Minghui Zhu, Peng Liu

发表机构 * College of Information Sciences（信息科学学院）； School of Electrical Engineering（电气工程学院）； GE Global Research（GE全球研究）

AI总结本文提出一种适用于非线性动态模型和随机噪声的移动机器人未知输入和状态估计算法，通过传感器数据和控制指令检测并量化传感器和执行器的异常。

Comments arXiv admin note: text overlap with arXiv:1708.01834

1610.02967 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Distributed Convex Optimization with Many Convex Constraints

具有许多凸约束的分布式凸优化

Joachim Giesen, Sören Laue

发表机构 * Friedrich-Schiller-University Jena（耶拿弗里德里希-施特劳斯大学）

AI总结本文提出一种扩展的ADMM方法，用于解决具有众多凸约束的分布式凸优化问题，继承了ADMM和增广拉格朗日方法的收敛性保证。

1712.08585 2026-06-04 math.OC cs.CV cs.NA math.NA

Denoising of image gradients and total generalized variation denoising

图像梯度去噪与总泛化变分去噪

Birgit Komander, Dirk A. Lorenz, Lena Vestweber

发表机构 * Institute of Analysis and Algebra（分析与代数研究所）； TU Braunschweig（布拉unsch维格技术大学）

AI总结本文重新审视全变分去噪，提出增强模型假设已获得图像梯度估计，改进了图像重建质量，并推导出与总泛化变分去噪方法相似的模型，提出约束去噪模型和参数自由的变分去噪模型，采用Chambolle-Pock和Douglas-Rachford方法进行数值实验，验证了预处理对收敛速度的提升。

详情

AI中文摘要

我们重新审视全变分去噪，并研究一个增强模型，其中假设已获得图像梯度的估计。我们证明这会提高图像重建质量，并推导出所得到的模型类似于总泛化变分去噪方法，从而为该模型提供了新的动机。进一步，我们提出使用约束去噪模型，并开发一个基本无参数的变分去噪模型，即所有模型参数都直接从噪声图像中估计。此外，我们使用Chambolle-Pock的对偶方法以及Douglas-Rachford方法用于新模型。对于后者，必须解决大规模的偏微分方程离散化。我们提出以不精确的方式使用预条件共轭梯度法进行处理，并为此推导出预条件器。数值实验表明，所得到的方法具有良好的去噪性能，并且预处理显著提高了收敛速度。最后我们分析了不同TGV去噪问题形式的对偶间隙，并推导出一个简单的停止准则。

英文摘要

We revisit total variation denoising and study an augmented model where we assume that an estimate of the image gradient is available. We show that this increases the image reconstruction quality and derive that the resulting model resembles the total generalized variation denoising method, thus providing a new motivation for this model. Further, we propose to use a constraint denoising model and develop a variational denoising model that is basically parameter free, i.e. all model parameters are estimated directly from the noisy image. Moreover, we use Chambolle-Pock's primal dual method as well as the Douglas-Rachford method for the new models. For the latter one has to solve large discretizations of partial differential equations. We propose to do this in an inexact manner using the preconditioned conjugate gradients method and derive preconditioners for this. Numerical experiments show that the resulting method has good denoising properties and also that preconditioning does increase convergence speed significantly. Finally we analyze the duality gap of different formulations of the TGV denoising problem and derive a simple stopping criterion.

URL PDF HTML ☆

赞 0 踩 0

1710.10781 2026-06-04 math.NA cs.CV cs.LG cs.NA stat.ML

Stochastic variance reduced multiplicative update for nonnegative matrix factorization

随机方差缩减乘法更新用于非负矩阵分解

Hiroyuki Kasai

发表机构 * Graduate School of Informatics and Engineering, The University of Electro-Communications（信息与工程研究生院，东京电波通信大学）

AI总结本文提出一种随机方差缩减乘法更新算法，改进非负矩阵分解的收敛速度，通过数值实验验证其在不同数据集上的优越性。

Comments IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)

1803.11411 2026-06-04 eess.SY cs.LG cs.MA cs.SY

Observer-based Adaptive Optimal Output Containment Control problem of Linear Heterogeneous Multi-agent Systems with Relative Output Measurements

基于观测器的自适应最优输出包容控制问题：线性异构多智能体系统中的相对输出测量

Majid Mazouchi, Mohammad Bagher Naghibi-Sistani, Seyed Kamal Hosseini Sani, Farzaneh Tatari, Hamidreza Modares

发表机构 * Department of Electrical Engineering, Ferdowsi University of Mashhad, Mashhad, Iran（马什哈德法尔多西大学电气工程系）； Department of Electrical Engineering, University of Semnan, Semnan, Iran（塞姆南大学电气工程系）； Missouri University of Science（密苏里科技大学）

AI总结本文提出了一种基于相对输出反馈的最优解法，用于线性异构多智能体系统的包容控制问题，通过分布式最优控制协议确保跟随器输出处于领导者输出的凸包内并优化暂态性能，采用分布式观测器估计不可用的状态和凸包。

详情

AI中文摘要

本文开发了一种基于最优相对输出反馈的解决方案，用于解决线性异构多智能体系统的包容控制问题。提出了一种分布式最优控制协议，使跟随器的输出落在领导者输出的凸包内（即期望或安全区域），并优化其暂态性能。所提出的最优控制解决方案由反馈部分和前馈部分组成，反馈部分依赖于跟随器的状态，前馈部分依赖于领导者状态的凸包。为了符合大多数实际应用，假设反馈和前馈状态不可用，并使用两个分布式观测器进行估计。即，由于跟随器无法直接感知其绝对状态，设计了一个分布式观测器，仅使用相对于邻居的输出测量（例如通过机器人中的范围传感器测量）和邻居广播的信息来估计其状态。此外，还设计了一个自适应分布式观测器，通过在通信网络上交换信息来估计领导者状态的凸包。所提出的观测器放松了所有跟随器必须知道领导者动态完整知识的严格要求。接下来，开发了一种基于Actor-Critic结构的离策略强化学习算法，用于在线解决最优包容控制问题，使用相对输出测量，无需所有跟随器知道领导者动态。最后，通过数值模拟验证了理论结果。

英文摘要

This paper develops an optimal relative output-feedback based solution to the containment control problem of linear heterogeneous multi-agent systems. A distributed optimal control protocol is presented for the followers to not only assure that their outputs fall into the convex hull of the leaders' output (i.e., the desired or safe region), but also optimizes their transient performance. The proposed optimal control solution is composed of a feedback part, depending of the followers' state, and a feed-forward part, depending on the convex hull of the leaders' state. To comply with most real-world applications, the feedback and feed-forward states are assumed to be unavailable and are estimated using two distributed observers. That is, since the followers cannot directly sense their absolute states, a distributed observer is designed that uses only relative output measurements with respect to their neighbors (measured for example by using range sensors in robotic) and the information which is broadcasted by their neighbors to estimate their states. Moreover, another adaptive distributed observer is designed that uses exchange of information between followers over a communication network to estimate the convex hull of the leaders' state. The proposed observer relaxes the restrictive requirement of knowing the complete knowledge of the leaders' dynamics by all followers. An off-policy reinforcement learning algorithm on an actor-critic structure is next developed to solve the optimal containment control problem online, using relative output measurements and without requirement of knowing the leaders' dynamics by all followers. Finally, the theoretical results are verified by numerical simulations.

URL PDF HTML ☆

赞 0 踩 0

1705.10887 2026-06-04 stat.ML cs.CV cs.LG cs.NA math.NA

Efficient, sparse representation of manifold distance matrices for classical scaling

高效表示经典标度中的流形距离矩阵

Javier S. Turek, Alexander Huth

发表机构 * Intel Labs（英特尔实验室）； The University of Texas at Austin（得克萨斯大学奥斯汀分校）

AI总结本文提出一种基于双调和插值的稀疏方法，用于高效表示流形距离矩阵，相比现有方法速度快2倍，内存占用低20倍，能处理大规模点集。

Comments Conference CVPR 2018

1701.01394 2026-06-04 cs.DS cs.LG cs.NA math.NA stat.ML

On spectral partitioning of signed graphs

关于带符号图的谱划分

Andrew V. Knyazev

发表机构 * Mitsubishi Electric Research Laboratories (MERL)（三菱电机研究实验室（MERL））

AI总结本文讨论了带符号图谱划分中标准图拉普拉斯矩阵优于符号拉普拉斯矩阵，指出基于符号拉普拉斯矩阵主特征向量的划分方法更有效，负特征值有助于提高计算效率。

Comments 12 pages, 10 figures. Rev 2 to appear in proceedings of the SIAM Workshop on Combinatorial Scientific Computing 2018 (CSC18)

1803.09792 2026-06-04 cs.MA cs.RO cs.SY eess.SY

Min-Max Tours for Task Allocation to Heterogeneous Agents

为异构智能体分配任务的最优化路线

Amritha Prasad, Han-Lim Choi, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University（普渡大学电气与计算机工程学院）

AI总结研究如何为异构移动智能体分配任务，以最小化任何智能体完成任务并返回仓库的最大成本，提出了一种三阶段算法，提供5倍近似比。

详情

AI中文摘要

我们考虑一个场景，其中一组异构移动智能体位于仓库，一组任务分散在地理区域中。智能体分为不同类型，任务分为专用任务（只能由特定类型的智能体完成）和通用任务（可由任何智能体完成）。每对任务之间的距离已指定，并满足三角不等式。给定此场景，我们研究如何将这些任务分配给可用智能体（受类型兼容性约束），以最小化任何智能体完成任务并返回仓库的最大成本。该问题属于NP难问题，我们提出了一种三阶段算法来解决该问题，无论总智能体数量和每种类型智能体数量如何，该算法提供5倍的近似比。我们还证明，在仅有一种类型智能体的情况下，该算法的近似因子为4。

英文摘要

We consider a scenario consisting of a set of heterogeneous mobile agents located at a depot, and a set of tasks dispersed over a geographic area. The agents are partitioned into different types. The tasks are partitioned into specialized tasks that can only be done by agents of a certain type, and generic tasks that can be done by any agent. The distances between each pair of tasks are specified, and satisfy the triangle inequality. Given this scenario, we address the problem of allocating these tasks among the available agents (subject to type compatibility constraints) while minimizing the maximum cost to tour the allocation by any agent and return to the depot. This problem is NP-hard, and we give a three phase algorithm to solve this problem that provides 5-factor approximation, regardless of the total number of agents and the number of agents of each type. We also show that in the special case where there is only one agent of each type, the algorithm has an approximation factor of 4.

URL PDF HTML ☆

赞 0 踩 0

1711.02271 2026-06-04 math.NA cs.LG cs.NA

High-order Tensor Completion for Data Recovery via Sparse Tensor-train Optimization

高阶张量补全：通过稀疏张量-列车优化实现数据恢复

Longhao Yuan, Qibin Zhao, Jianting Cao

发表机构 * Graduate School of Engineering, Saitama Institute of Technology, Japan（埼玉工科大学工学研究科）； Tensor Learning Unit, RIKEN Center for Advanced Intelligence Project (AIP), Japan（日本先进情报项目（AIP）RIKEN智能学习单元）； School of Automation, Guangdong University of Technology, China（广东技术大学自动化学院）； School of Computer Science and Technology, Hangzhou Dianzi University, China（杭州电子科技大学计算机科学与技术学院）

AI总结本文提出稀疏张量-列车优化算法，通过将缺失数据视为稀疏张量并利用一阶优化方法求解张量-列车分解因子，有效提升低阶和高阶张量补全性能，尤其在高缺失率下表现优异。

Comments 5 pages (include 1 page of reference) ICASSP 2018

1707.03770 2026-06-04 eess.SY cs.LG cs.SY math.OC

Fastest Convergence for Q-learning

Q-learning 最快收敛算法

Adithya M. Devraj, Sean P. Meyn

发表机构 * University of Florida（佛罗里达大学）； University of California, Berkeley（加州大学伯克利分校）

AI总结本文提出Zap Q-learning算法，通过矩阵增益设计实现渐近方差最优，并通过ODE分析证明其瞬态行为接近确定性牛顿-拉夫森法，实验验证其在非理想参数化设置下的快速收敛性。

1612.05971 2026-06-04 eess.SY cs.AI cs.GT cs.SY math.OC

An Integrated Optimization + Learning Approach to Optimal Dynamic Pricing for the Retailer with Multi-type Customers in Smart Grids

在智能电网中面向多类型顾客的零售商最优动态定价集成优化与学习方法

Fanlin Meng, Xiao-Jun Zeng, Yan Zhang, Chris J. Dent, Dunwei Gong

发表机构 * School of Engineering and Computing Sciences, Durham University（工程与计算科学学院，达勒姆大学）； School of Computer Science, The University of Manchester（计算机科学学院，曼彻斯特大学）； College of Information System and Management, National University of Defense Technology（信息系统与管理学院，国防科技大学）； School of Mathematics, University of Edinburgh（数学学院，爱丁堡大学）； School of Information and Control Engineering, China University of Mining and Technology（信息与控制工程学院，中国矿业大学）

AI总结本文针对智能电网中零售商服务三种不同类型的顾客问题，提出两级决策框架，结合优化与学习方法实现动态定价优化，通过仿真实验验证模型的有效性。

Comments 38 pages, 6 figures

详情

DOI: 10.1016/j.ins.2018.03.039

AI中文摘要

本文考虑智能电网中零售商服务三种不同类型的顾客的现实场景，即具有嵌入智能电表的最优家庭能源管理系统顾客（C-HEMS）、仅具有智能电表的顾客（C-SM）以及无智能电表的顾客（C-NONE）。本文的主要目标是支持零售商在混合顾客群体中做出最优的日提前动态定价决策。为此，我们提出一个两级决策框架，其中零售商作为上层代理首先宣布未来24小时的电力价格，顾客作为下层代理随后根据价格调度其能源使用。对于下层问题，我们根据不同顾客的独特特征建模其价格响应性。对于上层问题，我们优化动态价格以最大化零售商利润，同时满足现实市场约束。上述两级模型通过基于遗传算法（GA）的分布式优化方法解决，其可行性和有效性通过仿真结果得到验证。

英文摘要

In this paper, we consider a realistic and meaningful scenario in the context of smart grids where an electricity retailer serves three different types of customers, i.e., customers with an optimal home energy management system embedded in their smart meters (C-HEMS), customers with only smart meters (C-SM), and customers without smart meters (C-NONE). The main objective of this paper is to support the retailer to make optimal day-ahead dynamic pricing decisions in such a mixed customer pool. To this end, we propose a two-level decision-making framework where the retailer acting as upper-level agent firstly announces its electricity prices of next 24 hours and customers acting as lower-level agents subsequently schedule their energy usages accordingly. For the lower level problem, we model the price responsiveness of different customers according to their unique characteristics. For the upper level problem, we optimize the dynamic prices for the retailer to maximize its profit subject to realistic market constraints. The above two-level model is tackled by genetic algorithms (GA) based distributed optimization methods while its feasibility and effectiveness are confirmed via simulation results.

URL PDF HTML ☆

赞 0 踩 0

1803.06989 2026-06-04 math.ST cs.LG cs.NA math.NA stat.ML stat.TH

Numerical Integration on Graphs: where to sample and how to weigh

图上的数值积分：在哪里采样和如何加权

George C. Linderman, Stefan Steinerberger

发表机构 * Program in Applied Mathematics, Yale University, New Haven, CT 06511, USA（应用数学项目，耶鲁大学，新 Haven, CT 06511, USA）； Department of Mathematics, Yale University, New Haven, CT 06511, USA（数学系，耶鲁大学，新 Haven, CT 06511, USA）

AI总结研究图上数值积分问题，通过热球最优打包几何问题重构积分，提出采样策略与加权方法，验证方法效率。

1803.06775 2026-06-04 quant-ph cs.AI cs.ET cs.SY eess.SY

Comparing and Integrating Constraint Programming and Temporal Planning for Quantum Circuit Compilation

比较和整合约束编程与时间规划用于量子电路编译

Kyle E. C. Booth, Minh Do, J. Christopher Beck, Eleanor Rieffel, Davide Venturelli, Jeremy Frank

发表机构 * Quantum Artificial Intelligence Laboratory, NASA Ames Research Center（量子人工智能实验室，美国国家航空航天局阿姆斯特朗研究中心）； Planning and Scheduling Group, NASA Ames Research Center（规划与调度组，美国国家航空航天局阿姆斯特朗研究中心）； USRA Research Institute for Advanced Computer Science（美国宇航局高级计算机科学研究所）； Stinger Ghaffarian Technologies, Inc.（Stinger Ghaffarian技术公司）； Department of Mechanical & Industrial Engineering, University of Toronto（多伦多大学机械与工业工程系）

AI总结本文比较了约束编程与时间规划在量子电路编译中的应用，提出混合方法提升求解质量，证明混合方法在多数问题中优于单独使用时间规划。

Comments 9 pages, 2 figures, Proceedings of the 28th International Conference of Automated Planning and Scheduling 2018 (ICAPS-18)

详情

AI中文摘要

最近，将一般量子算法编译为近期量子处理器的makespan最小化问题被引入人工智能社区。研究显示时间规划是量子电路编译（QCC）问题的一种强大方法。本文探讨了约束编程（CP）作为时间规划的替代和补充方法。我们通过引入两个新的问题变体扩展了先前工作，这些变体结合了量子计算社区识别的重要特征。我们应用时间规划和CP解决基准和扩展的QCC问题，作为单独和混合方法。我们的混合方法利用时间规划找到的解决方案预热CP，利用前者在任务选项性高的问题中找到满意解的能力，而CP通常难以处理。CP模型受益于预热提供的推断边界，从而找到更高质量的解。实证评估表明，虽然单独使用CP仅在最小问题中具有竞争力，但CP与时间规划的混合方法在多数问题类别中表现优于单独使用时间规划。

英文摘要

Recently, the makespan-minimization problem of compiling a general class of quantum algorithms into near-term quantum processors has been introduced to the AI community. The research demonstrated that temporal planning is a strong approach for a class of quantum circuit compilation (QCC) problems. In this paper, we explore the use of constraint programming (CP) as an alternative and complementary approach to temporal planning. We extend previous work by introducing two new problem variations that incorporate important characteristics identified by the quantum computing community. We apply temporal planning and CP to the baseline and extended QCC problems as both stand-alone and hybrid approaches. Our hybrid methods use solutions found by temporal planning to warm start CP, leveraging the ability of the former to find satisficing solutions to problems with a high degree of task optionality, an area that CP typically struggles with. The CP model, benefiting from inferred bounds on planning horizon length and task counts provided by the warm start, is then used to find higher quality solutions. Our empirical evaluation indicates that while stand-alone CP is only competitive for the smallest problems, CP in our hybridization with temporal planning out-performs stand-alone temporal planning in the majority of problem classes.

URL PDF HTML ☆

赞 0 踩 0

1703.02419 2026-06-04 stat.CO cs.LG cs.SY eess.SY

Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo

利用序贯蒙特卡洛方法进行非线性动力系统概率学习

Thomas B. Schön, Andreas Svensson, Lawrence Murray, Fredrik Lindsten

发表机构 * Department of Information Technology, Uppsala University（乌普萨拉大学信息科技系）

AI总结本文提出基于序贯蒙特卡洛方法的概率非线性状态空间模型学习方法，通过粒子Metropolis-Hastings算法实现参数空间的高效采样，并展示了该方法在动态系统建模中的应用。

Comments Thomas B. Schön, Andreas Svensson, Lawrence Murray and Fredrik Lindsten, 2018. Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo. In Mechanical Systems and Signal Processing, Volume 104, pp. 866-883

详情

DOI: 10.1016/j.ymssp.2017.10.033

AI中文摘要

概率建模能够表示和操纵数据、模型、预测和决策中的不确定性。本文关注从测量数据中学习动态系统概率模型的问题，特别是非线性状态空间模型的学习。由于该问题没有闭式解，因此必须使用近似方法。本文提供了一个自包含的介绍，介绍了一种最先进的方法——粒子Metropolis-Hastings算法，该算法已被证明能提供实用的近似。这是一种基于蒙特卡洛的方法，其中粒子滤波用于引导马尔可夫链蒙特卡洛方法通过参数空间。粒子Metropolis-Hastings算法的一个关键优点是，在温和的假设下，它保证收敛到“真实解”，尽管它基于仅有限数量粒子的粒子滤波。本文还提供了一个数值示例，通过为序贯蒙特卡洛方法量身定制的建模语言来展示该方法。此类建模语言的目的是将高级蒙特卡洛方法（包括粒子Metropolis-Hastings）的威力带给大量用户，而无需他们了解所有底层数学细节。

英文摘要

Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we are forced to use approximations. In this tutorial we will provide a self-contained introduction to one of the state-of-the-art methods---the particle Metropolis--Hastings algorithm---which has proven to offer a practical approximation. This is a Monte Carlo based method, where the particle filter is used to guide a Markov chain Monte Carlo method through the parameter space. One of the key merits of the particle Metropolis--Hastings algorithm is that it is guaranteed to converge to the "true solution" under mild assumptions, despite being based on a particle filter with only a finite number of particles. We will also provide a motivating numerical example illustrating the method using a modeling language tailored for sequential Monte Carlo methods. The intention of modeling languages of this kind is to open up the power of sophisticated Monte Carlo methods---including particle Metropolis--Hastings---to a large group of users without requiring them to know all the underlying mathematical details.

URL PDF HTML ☆

赞 0 踩 0

1803.03104 2026-06-04 eess.SY cs.CV cs.SY math.DS stat.ML

Applicability and interpretation of the deterministic weighted cepstral distance

确定性加权谱距的应用与解释

Oliver Lauwers, Bart De Moor

发表机构 * KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing（库勒文大学电子工程系（ESAT）、动态系统信号处理与数据分析中心）

AI总结本文结合系统理论和机器学习，研究了加权谱距在确定性线性时不变单输入单输出模型中的应用，提出了一种基于输入输出信号信息评估系统稳定性和相位类型的纯数据驱动方法。

Comments 18 pages, 5 figures, submitted for review to Automatica

详情

AI中文摘要

量化数据对象之间的相似性是现代数据科学的重要部分。决定使用哪种相似性度量非常依赖于具体应用。本文结合系统理论和机器学习的见解，研究了之前为ARMA模型信号定义的加权谱距。我们将其扩展到可逆的确定性线性时不变单输入单输出模型，并评估其适用性。我们证明了该距离总能以底层模型的极点和零点进行解释，并在稳定、最小相位或不稳定、最大相位模型的情况下，可以以子空间角度进行几何解释。然后，我们提出了一种仅使用输入/输出信号信息的方法来评估生成模型的稳定性和相位类型。通过这种方式，我们证明了扩展的加权谱距与加权谱模型范数之间的联系。通过这种方式，我们提供了一种纯数据驱动的方法来评估输入/输出信号对的不同底层动态，而无需任何系统识别步骤。这在时间序列聚类等机器学习任务中可能很有用。本文还发布了一个iPython教程，包含各种方法和算法的实现，以及一些证明等价性的数值示例。

英文摘要

Quantifying similarity between data objects is an important part of modern data science. Deciding what similarity measure to use is very application dependent. In this paper, we combine insights from systems theory and machine learning, and investigate the weighted cepstral distance, which was previously defined for signals coming from ARMA models. We provide an extension of this distance to invertible deterministic linear time invariant single input single output models, and assess its applicability. We show that it can always be interpreted in terms of the poles and zeros of the underlying model, and that, in the case of stable, minimum-phase, or unstable, maximum-phase models, a geometrical interpretation in terms of subspace angles can be given. We then devise a method to assess stability and phase-type of the generating models, using only input/output signal information. In this way, we prove a connection between the extended weighted cepstral distance and a weighted cepstral model norm. In this way, we provide a purely data-driven way to assess different underlying dynamics of input/output signal pairs, without the need for any system identification step. This can be useful in machine learning tasks such as time series clustering. An iPython tutorial is published complementary to this paper, containing implementations of the various methods and algorithms presented here, as well as some numerical illustrations of the equivalences proven here.

URL PDF HTML ☆

赞 0 踩 0

1803.01626 2026-06-04 stat.ML cs.LG cs.SY eess.SY

Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs

考虑平均回报准则下未知离散马尔可夫决策过程（MDP）中的强化学习的方差意识后悔界

Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

发表机构 * KTH Royal Institute of Technology（皇家理工学院）； INRIA Lille – Nord Europe（里尔-北欧洲研究所）

AI总结本文基于平均回报准则，重新审视未知离散MDP中的强化学习问题，通过引入局部方差代替MDP直径，改进KL-UCRL算法的后悔界，提供更优的性能保证。

Comments To appear in Proceedings of the 29th International Conference on Algorithmic Learning Theory (ALT 2018)

详情

AI中文摘要

在未知和离散的马尔可夫决策过程（MDP）中，考虑在单一流观测下进行强化学习的问题，当学习者从初始状态开始与系统交互时。我们通过引入偏倚函数的局部方差代替MDP的直径，重新审视该问题的最小最大下界。此外，我们提供了KL-UCRL算法的新型分析，建立了高概率的后悔界，其规模为$\widetilde {\mathcal O}\Bigl({\textstyle \sqrt{S\sum_{s,a}{\bf V}^\star_{s,a}T}}\Big)$，适用于周期性MDP。该界优于之前已知的$\widetilde {\mathcal O}(DS\sqrt{AT})$界，其中$A$和$D$分别表示每个状态的最大动作数和MDP的直径。我们最终在一些基准MDP中比较了两个界的主导项，表明在某些情况下，所推导的界可以提供一个数量级的改进。我们的分析利用了运输引理的新变体结合KL集中不等式，我们认为这些方法具有独立的兴趣。

英文摘要

The problem of reinforcement learning in an unknown and discrete Markov Decision Process (MDP) under the average-reward criterion is considered, when the learner interacts with the system in a single stream of observations, starting from an initial state without any reset. We revisit the minimax lower bound for that problem by making appear the local variance of the bias function in place of the diameter of the MDP. Furthermore, we provide a novel analysis of the KL-UCRL algorithm establishing a high-probability regret bound scaling as $\widetilde {\mathcal O}\Bigl({\textstyle \sqrt{S\sum_{s,a}{\bf V}^\star_{s,a}T}}\Big)$ for this algorithm for ergodic MDPs, where $S$ denotes the number of states and where ${\bf V}^\star_{s,a}$ is the variance of the bias function with respect to the next-state distribution following action $a$ in state $s$. The resulting bound improves upon the best previously known regret bound $\widetilde {\mathcal O}(DS\sqrt{AT})$ for that algorithm, where $A$ and $D$ respectively denote the maximum number of actions (per state) and the diameter of MDP. We finally compare the leading terms of the two bounds in some benchmark MDPs indicating that the derived bound can provide an order of magnitude improvement in some cases. Our analysis leverages novel variations of the transportation lemma combined with Kullback-Leibler concentration inequalities, that we believe to be of independent interest.

URL PDF HTML ☆

赞 0 踩 0

1803.01579 2026-06-04 eess.SY cs.RO cs.SY

Motion and Cooperative Transportation Planning for Multi-Agent Systems under Temporal Logic Formulas

多智能体系统在时序逻辑公式下的运动与协作运输规划

Christos K. Verginis, Dimos V. Dimarogonas

发表机构 * KTH Center for Autonomous Systems（KTH 自主系统中心）

AI总结本文提出一种混合控制框架，用于在高阶目标表达为线性时序逻辑（LTL）公式的情况下，多智能体系统的运动规划。设计控制协议以实现智能体在预定义兴趣区域间的过渡及协作运输物体。通过抽象智能体与物体的耦合行为为有限状态转移系统，设计满足智能体和物体规格的高层多智能体计划。

Comments Submitted to IEEE Transactions on Automation Science and Engineering. arXiv admin note: text overlap with arXiv:1611.05186

1803.00491 2026-06-04 stat.ML cs.LG cs.NA math.NA

The Power Mean Laplacian for Multilayer Graph Clustering

多层图聚类的幂均值拉普拉斯

Pedro Mercado, Antoine Gautier, Francesco Tudisco, Matthias Hein

发表机构 * Department of Mathematics and Computer Science, Saarland University（萨尔兰大学数学与计算机科学系）； Department of Mathematics and Statistics, University of Strathclyde（斯特拉斯克莱德大学数学与统计学系）

AI总结本文提出一种参数化的矩阵幂均值方法，用于融合多层图的拉普拉斯矩阵，分析其在随机块模型中的期望性能，并在真实数据中验证其在不同设置下恢复真实聚类的能力。

Comments 19 pages, 3 figures. Accepted in Artificial Intelligence and Statistics (AISTATS), 2018

1707.01625 2026-06-04 eess.SY cs.AI cs.GT cs.SY

Optimal Vehicle Dispatching Schemes via Dynamic Pricing

通过动态定价实现最优车辆调度方案

Mengjing Chen, Weiran Shen, Pingzhong Tang, Song Zuo

发表机构 * IIIS, Tsinghua University（清华大学信息科学与技术学院）

AI总结本文通过经济方法解决网约车平台在地理和时间信息下的最优定价和车辆调度问题，提出高效算法计算最优随机定价方案，并通过实验证明其优于固定定价和涨价机制。

1802.00930 2026-06-04 cs.NE cs.LG cs.NA math.NA

Mixed Precision Training of Convolutional Neural Networks using Integer Operations

使用整数运算进行卷积神经网络的混合精度训练

Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesus Corbal, Nikita Shustrov, Roma Dubtsov, Evarist Fomenko, Vadim Pirogov

发表机构 * Parallel Computing Lab（并行计算实验室）； Intel Labs, India（英特尔实验室，印度）； Product Architecture Group（产品架构组）； Intel Labs, SC Intel, OR（英特尔实验室，SC英特尔，美国）； Software Services Group（软件服务组）； Intel, OR（英特尔，美国）

AI总结本文提出了一种基于整数运算的混合精度训练方法，在ImageNet-1K数据集上训练了ResNet-50、GoogLeNet-v1等SOTA网络，实现了比FP32更高的训练吞吐量和相同精度下的最高准确率。

Comments Published as a conference paper at ICLR 2018

详情

AI中文摘要

当前混合精度训练的SOTA主要基于低精度浮点运算，如FP16累积到FP32的变种。然而，在低精度和混合精度整数训练领域，已有研究要么针对非SOTA网络（如仅AlexNet用于ImageNet-1K），要么针对较小的数据集（如CIFAR-10）。本文在通用硬件上训练了SOTA视觉理解神经网络，使用整数运算。特别关注整数融合乘加（FMA）运算，其输入为两个INT16操作数，输出为INT32。我们提出了张量的共享指数表示，并开发了适用于常见神经网络操作的动态定点（DFP）方案。研究了高效整数卷积核的开发细节，包括处理INT32累加器溢出的方法。我们实现了ResNet-50、GoogLeNet-v1、VGG-16和AlexNet的CNN训练，这些网络在相同迭代次数下达到或超过FP32的SOTA准确率，无需改变超参数，并在端到端训练吞吐量上提高了1.8倍。据我们所知，这些结果是首次在通用硬件上使用SOTA CNNs在ImageNet-1K数据集上实现INT16训练的结果，并实现了最高报告的准确率。

英文摘要

The state-of-the-art (SOTA) for mixed precision training is dominated by variants of low precision floating point operations, and in particular, FP16 accumulating into FP32 Micikevicius et al. (2017). On the other hand, while a lot of research has also happened in the domain of low and mixed-precision Integer training, these works either present results for non-SOTA networks (for instance only AlexNet for ImageNet-1K), or relatively small datasets (like CIFAR-10). In this work, we train state-of-the-art visual understanding neural networks on the ImageNet-1K dataset, with Integer operations on General Purpose (GP) hardware. In particular, we focus on Integer Fused-Multiply-and-Accumulate (FMA) operations which take two pairs of INT16 operands and accumulate results into an INT32 output.We propose a shared exponent representation of tensors and develop a Dynamic Fixed Point (DFP) scheme suitable for common neural network operations. The nuances of developing an efficient integer convolution kernel is examined, including methods to handle overflow of the INT32 accumulator. We implement CNN training for ResNet-50, GoogLeNet-v1, VGG-16 and AlexNet; and these networks achieve or exceed SOTA accuracy within the same number of iterations as their FP32 counterparts without any change in hyper-parameters and with a 1.8X improvement in end-to-end training throughput. To the best of our knowledge these results represent the first INT16 training results on GP hardware for ImageNet-1K dataset using SOTA CNNs and achieve highest reported accuracy using half-precision

URL PDF HTML ☆

赞 0 踩 0

1708.07827 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study

非凸机器学习中的二阶优化：一项实证研究

Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney

发表机构 * Institute for Computational and Mathematical Engineering, Stanford University（计算与数学工程研究所，斯坦福大学）； School of Mathematics and Physics, University of Queensland（数学与物理学院，昆士兰大学）； International Computer Science Institute, Berkeley, USA（国际计算机科学研究所，伯克利，美国）； International Computer Science Institute and Department of Statistics, University of California at Berkeley（国际计算机科学研究所和统计学系，加州大学伯克利分校）

AI总结本文通过实证研究评估了非凸机器学习问题中牛顿型方法的性能，证明其在泛化性能和超参数鲁棒性方面优于传统SGD，能有效逃离平坦区域和鞍点。

Comments 21 pages, 11 figures. Restructure the paper and add experiments

详情

AI中文摘要

尽管随机梯度下降（SGD）等一阶优化方法在机器学习（ML）中广泛应用，但它们存在收敛速度慢、超参数设置敏感、易陷入高训练误差和难以逃离平坦区域及鞍点等缺陷。在高度非凸设置（如神经网络中）尤为明显。受此启发，近期有研究关注二阶方法，旨在通过捕捉曲率信息缓解这些不足。本文报告了针对非凸ML问题的一类牛顿型方法——信任区域（TR）和自适应三次正则化（ARC）算法的子采样变体的详细实证评估。在此过程中，我们证明这些方法不仅在计算上与手工调优的SGD加动量方法具有竞争力，泛化性能可比或更优，而且对超参数设置具有高度鲁棒性。此外，与SGD加动量相比，这些牛顿型方法利用曲率信息的方式使其能够无缝逃离平坦区域和鞍点。

英文摘要

While first-order optimization methods such as stochastic gradient descent (SGD) are popular in machine learning (ML), they come with well-known deficiencies, including relatively-slow convergence, sensitivity to the settings of hyper-parameters such as learning rate, stagnation at high training errors, and difficulty in escaping flat regions and saddle points. These issues are particularly acute in highly non-convex settings such as those arising in neural networks. Motivated by this, there has been recent interest in second-order methods that aim to alleviate these shortcomings by capturing curvature information. In this paper, we report detailed empirical evaluations of a class of Newton-type methods, namely sub-sampled variants of trust region (TR) and adaptive regularization with cubics (ARC) algorithms, for non-convex ML problems. In doing so, we demonstrate that these methods not only can be computationally competitive with hand-tuned SGD with momentum, obtaining comparable or better generalization performance, but also they are highly robust to hyper-parameter settings. Further, in contrast to SGD with momentum, we show that the manner in which these Newton-type methods employ curvature information allows them to seamlessly escape flat regions and saddle points.

URL PDF HTML ☆

赞 0 踩 0