arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1801.08995 2026-06-04 eess.SY cs.RO cs.SY

Trajectory Generation using Sharpness Continuous Dubins-like Paths with Applications in Control of Heavy Duty Vehicles

利用连续尖锐度的Dubins-like路径生成轨迹,应用于重载车辆控制

Rui Oliveira, Pedro F. Lima, Marcello Cirillo, Jonas Mårtensson, Bo Wahlberg

发表机构 * Scania, Autonomous Transport Solutions(斯堪尼亚,自主运输解决方案)

AI总结 本文提出了一种考虑转向执行器速率和扭矩限制的轨迹生成框架,通过连续尖锐度曲线提升重载车辆的自主驾驶性能。

Comments 18 pages, 7 figures, accepted for publication at ECC 2018 - European Control Conference

详情
AI中文摘要

我们提出了一种用于控制轮式车辆的轨迹生成框架,考虑转向执行器的速率和扭矩限制。关键思想是直接考虑转向执行器的速率和扭矩限制,而传统方法仅考虑曲率速率限制。我们提出新的尖锐度连续曲线概念,结合三次和Sigmoid曲率轨迹以及圆弧来引导车辆。所获得的轨迹具有平滑且连续可微的转向角剖面。这些轨迹为低层控制器提供更易跟踪的参考信号,从而提高性能。所获得的转向剖面的平滑性也提高了乘客舒适度。该方法具有快速的计算时间,可通过简单预计算进一步加速。我们详细讨论了该方法的路径规划应用,并通过仿真展示了其优势和实时能力。

英文摘要

We present a trajectory generation framework for control of wheeled vehicles under steering actuator constraints. The motivation is smooth autonomous driving of heavy vehicles. The key idea is to take into account rate, and additionally, torque limitations of the steering actuator directly. Previous methods only take into account curvature rate limitations, which deal indirectly with steering rate limitations. We propose the new concept of Sharpness Continuous curves, which uses cubic and sigmoid curvature trajectories together with circular arcs to steer the vehicle. The obtained trajectories are characterized by a smooth and continuously differentiable steering angle profile. These trajectories provide low-level controllers with reference signals which are easier to track, resulting in improved performance. The smoothness of the obtained steering profiles also results in increased passenger comfort. The method is characterized by a fast computation time, which can be further speeded up through the use of simple pre-computations. We detail possible path planning applications of the method, and conduct simulations that show its advantages and real time capabilities.

1702.04837 2026-06-04 stat.ML cs.LG cs.NA math.NA

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

草图岭回归:优化视角、统计视角和模型平均

Shusen Wang, Alex Gittens, Michael W. Mahoney

发表机构 * International Computer Science Institute and Department of Statistics University of California at Berkeley(国际计算机科学研究所和统计学系加州大学伯克利分校) Computer Science Department Rensselaer Polytechnic Institute(计算机科学系拉特格斯理工学院)

AI总结 本文从优化和统计角度研究了草图和Hessian草图在矩阵岭回归中的影响,发现经典草图能近似最优解,而Hessian草图则不同。通过理论和实验表明,模型平均可显著降低真实与草图解间的风险差距。

Comments To appear in Journal of Machine Learning Research, 2018. A short version has appeared in International Conference on Machine Learning (ICML), 2017

详情
Journal ref
Journal of Machine Learning Research, 19, pp1-50, 2018
AI中文摘要

我们探讨了经典草图和Hessian草图在近似求解矩阵岭回归(MRR)问题中的统计和优化影响。先前研究量化了经典草图对更简单的最小二乘回归(LSR)问题的影响。我们证明经典草图对MRR的优化属性的影响与对LSR的影响类似:即恢复近似最优解。相反,Hessian草图没有这种保证,其近似误差由响应中的“质量”与最优目标值之间的微妙交互决定。对于两种类型的近似,sketched MRR中的正则化导致与sketched LSR不同的统计特性。特别是,在sketched MRR中存在偏误-方差权衡,这在sketched LSR中不存在。我们提供了sketched MRR的偏误和方差的上界和下界,这些界限表明经典草图显著增加方差,而Hessian草图显著增加偏误。经验上,sketched MRR的解的风险可能比最优MRR解高一个数量级。我们理论和实证表明,模型平均显著降低真实解与sketched解风险之间的差距。因此,在并行或分布式设置中,草图结合模型平均是一种强大的技术,能够快速获得近似最优解,同时大幅减轻草图带来的统计风险增加。

英文摘要

We address the statistical and optimization impacts of the classical sketch and Hessian sketch used to approximately solve the Matrix Ridge Regression (MRR) problem. Prior research has quantified the effects of classical sketch on the strictly simpler least squares regression (LSR) problem. We establish that classical sketch has a similar effect upon the optimization properties of MRR as it does on those of LSR: namely, it recovers nearly optimal solutions. By contrast, Hessian sketch does not have this guarantee, instead, the approximation error is governed by a subtle interplay between the "mass" in the responses and the optimal objective value. For both types of approximation, the regularization in the sketched MRR problem results in significantly different statistical properties from those of the sketched LSR problem. In particular, there is a bias-variance trade-off in sketched MRR that is not present in sketched LSR. We provide upper and lower bounds on the bias and variance of sketched MRR, these bounds show that classical sketch significantly increases the variance, while Hessian sketch significantly increases the bias. Empirically, sketched MRR solutions can have risks that are higher by an order-of-magnitude than those of the optimal MRR solutions. We establish theoretically and empirically that model averaging greatly decreases the gap between the risks of the true and sketched solutions to the MRR problem. Thus, in parallel or distributed settings, sketching combined with model averaging is a powerful technique that quickly obtains near-optimal solutions to the MRR problem while greatly mitigating the increased statistical risk incurred by sketching.

1801.07745 2026-06-04 math.OC cs.AI cs.CG cs.NA math.NA

Optimal Transport on Discrete Domains

离散域上的最优传输

Justin Solomon

发表机构 * MIT Department of Electrical Engineering and Computer Science(麻省理工学院电气工程与计算机科学系) MIT Department of Electrical Engineering(麻省理工学院电气工程系)

AI总结 本文探讨了离散最优传输的最新进展,结合偏微分方程与凸分析,提出理论支持的模型,适用于数万到数百万顶点的领域。

详情
AI中文摘要

受物流问题中供需匹配的启发,最优传输(或蒙特卡洛问题)涉及在几何域上定义的概率分布的匹配。在最明显的离散化中,最优传输成为大规模线性规划问题,通常在三角网格、图、点云等图形和机器学习中遇到的域上难以高效求解。然而,最近的数值最优传输突破使可扩展性达到数量级更大的问题,可在几秒钟内解决。本文讨论了利用离散和光滑问题方面理解的数值最优传输进展。最先进的离散最优传输技术结合了偏微分方程(PDE)与凸分析的洞察,以重新公式化、离散化和优化运输问题。最终结果是一组理论上支持的模型,适用于具有数万或数百万顶点的领域。由于数值最优传输是一个相对较新的学科,特别强调了识别和解释需要数学洞察和额外研究的开放问题。

英文摘要

Inspired by the matching of supply to demand in logistical problems, the optimal transport (or Monge--Kantorovich) problem involves the matching of probability distributions defined over a geometric domain such as a surface or manifold. In its most obvious discretization, optimal transport becomes a large-scale linear program, which typically is infeasible to solve efficiently on triangle meshes, graphs, point clouds, and other domains encountered in graphics and machine learning. Recent breakthroughs in numerical optimal transport, however, enable scalability to orders-of-magnitude larger problems, solvable in a fraction of a second. Here, we discuss advances in numerical optimal transport that leverage understanding of both discrete and smooth aspects of the problem. State-of-the-art techniques in discrete optimal transport combine insight from partial differential equations (PDE) with convex analysis to reformulate, discretize, and optimize transportation problems. The end result is a set of theoretically-justified models suitable for domains with thousands or millions of vertices. Since numerical optimal transport is a relatively new discipline, special emphasis is placed on identifying and explaining open problems in need of mathematical insight and additional research.

1804.10432 2026-06-04 math.NA cs.CV cs.NA math.DG

Variational Regularization of Inverse Problems for Manifold-Valued Data

变分正则化用于流形值数据的反问题

Martin Storath, Andreas Weinmann

发表机构 * Image Analysis and Learning Group, Interdisciplinary Center for Scientific Computing, Universität Heidelberg, Germany(图像分析与学习组,跨学科科学计算中心,海德堡大学,德国) Department of Mathematics and Natural Sciences, Hochschule Darmstadt, and Institute of Computational Biology, Helmholtz Zentrum München, Germany(数学与自然科学系,达姆施塔特应用科学大学,以及计算生物学研究所,海德堡研究中心,德国)

AI总结 本文研究流形值数据的变分正则化反问题,提出TV和TGV正则化方法,并通过合成和真实数据验证其有效性。

详情
AI中文摘要

本文考虑了在反问题设置中流形值数据的变分正则化。特别是,我们考虑了带有间接测量算子的TV和TGV正则化。我们提供了关于正则化问题良定性的结果,并给出了在流形设置中实现这些模型的算法。此外,我们通过合成和真实数据的实验结果,展示了所提出方案的应用潜力。

英文摘要

In this paper, we consider the variational regularization of manifold-valued data in the inverse problems setting. In particular, we consider TV and TGV regularization for manifold-valued data with indirect measurement operators. We provide results on the well-posedness and present algorithms for a numerical realization of these models in the manifold setup. Further, we provide experimental results for synthetic and real data to show the potential of the proposed schemes for applications.

1804.07010 2026-06-04 stat.ML cs.LG cs.SY eess.SY math.AP math.OC

Forward-Backward Stochastic Neural Networks: Deep Learning of High-dimensional Partial Differential Equations

前向-后向随机神经网络:高维偏微分方程的深度学习

Maziar Raissi

发表机构 * Division of Applied Mathematics, Brown University(布朗大学应用数学系)

AI总结 本文提出一种高维偏微分方程求解方法,利用深度神经网络和随机微分方程的联系,避免数值离散化限制,解决维度诅咒问题。

详情
AI中文摘要

经典偏微分方程数值方法因依赖精细的时空网格而受维度诅咒限制。受现代深度学习技术启发,本文提出一种可扩展的算法,通过深度神经网络近似未知解,并利用自动微分优势。通过将高维偏微分方程与前向-后向随机微分方程联系起来,利用布朗运动独立实现作为训练数据,测试了Black-Scholes-Barenblatt和Hamilton-Jacobi-Bellman方程等100维基准问题的有效性。

英文摘要

Classical numerical methods for solving partial differential equations suffer from the curse dimensionality mainly due to their reliance on meticulously generated spatio-temporal grids. Inspired by modern deep learning based techniques for solving forward and inverse problems associated with partial differential equations, we circumvent the tyranny of numerical discretization by devising an algorithm that is scalable to high-dimensions. In particular, we approximate the unknown solution by a deep neural network which essentially enables us to benefit from the merits of automatic differentiation. To train the aforementioned neural network we leverage the well-known connection between high-dimensional partial differential equations and forward-backward stochastic differential equations. In fact, independent realizations of a standard Brownian motion will act as training data. We test the effectiveness of our approach for a couple of benchmark problems spanning a number of scientific domains including Black-Scholes-Barenblatt and Hamilton-Jacobi-Bellman equations, both in 100-dimensions.

1804.06586 2026-06-04 eess.SY cs.RO cs.SY

Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

双向往复操作系统无持续激励的复合自适应控制

Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

发表机构 * School of Automation and Electrical Engineering, University of Science and Technology Beijing(自动化与电气工程学院,北京科技大学) Department of Automatic Control, Lund University, P.O. Box 118, 22100 Lund, Sweden(自动控制系,卢德大学)

AI总结 本文提出一种无需持续激励条件的复合自适应控制方法,用于解决非线性双向往复操作系统中参数收敛问题,通过线性矩阵不等式给出闭环系统稳定性准则,并通过仿真验证了方法的有效性。

Comments 21 pages, 9 figures, submitted to Journal of The Franklin Institute

详情
AI中文摘要

复合自适应控制方案利用系统跟踪误差和预测误差驱动更新律,已成为提高系统性能的常用方法。然而,为保证参数收敛需满足强持续激励(PE)条件。本文提出一种新颖的复合自适应控制,用于非线性双向往复操作系统,无需PE条件即可实现参数收敛。闭环双向往复系统稳定性准则以线性矩阵不等式形式给出。新的跟踪性能指标用于评估主从之间位置跟踪性能。仿真研究展示了所提方法的有效性。

英文摘要

Composite adaptive control schemes, which use both the system tracking errors and the prediction error to drive the update laws, have become widespread in achieving an improvement of system performance. However, a strong persistent-excitation (PE) condition should be satisfied to guarantee the parameter convergence. This paper proposes a novel composite adaptive control to guarantee parameter convergence without PE condition for nonlinear teleoperation systems with dynamic uncertainties and time-varying communication delays. The stability criteria of the closed-loop teleoperation system are given in terms of linear matrix inequalities. New tracking performance measures are proposed to evaluate the position tracking between the master and the slave. Simulation studies are given to show the effectiveness of the proposed method.

1804.04349 2026-06-04 eess.SY cs.RO cs.SE cs.SY

On the Application of ISO 26262 in Control Design for Automated Vehicles

在自动驾驶车辆控制设计中应用ISO 26262

Georg Schildbach

发表机构 * Institute for Electrical Engineering in Medicine, University of Luebeck(医学电气工程研究所,吕贝克大学)

AI总结 本文探讨了ISO 26262标准在自动驾驶车辆控制设计中的应用,分析了其在高自动化车辆中的适用性与争议,并总结了该标准的安全设计步骤。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情
Journal ref
EPTCS 269, 2018, pp. 74-82
AI中文摘要

过去十年间,自动驾驶车辆的研究经历了爆炸性增长。然而,其实际应用的主要障碍是一个令人信服的安全概念。随着算法日益复杂和车辆自动化水平提高,这一问题愈发重要。功能安全领域提供了一种系统方法,用于识别潜在风险源并提高车辆安全性。该领域基于航空航天、过程和其他行业数十年的实践经验。这些经验汇总成汽车领域的功能安全标准ISO 26262,已被广泛采用。然而,其在高自动化车辆中的适用性和相关性却存在争议。本文对这一讨论进行了批判性分析,并总结了ISO 26262的主要步骤,以实现自动驾驶车辆的安全控制设计。

英文摘要

Research on automated vehicles has experienced an explosive growth over the past decade. A main obstacle to their practical realization, however, is a convincing safety concept. This question becomes ever more important as more sophisticated algorithms are used and the vehicle automation level increases. The field of functional safety offers a systematic approach to identify possible sources of risk and to improve the safety of a vehicle. It is based on practical experience across the aerospace, process and other industries over multiple decades. This experience is compiled in the functional safety standard for the automotive domain, ISO 26262, which is widely adopted throughout the automotive industry. However, its applicability and relevance for highly automated vehicles is subject to a controversial debate. This paper takes a critical look at the discussion and summarizes the main steps of ISO 26262 for a safe control design for automated vehicles.

1804.03036 2026-06-04 eess.IV cs.RO cs.SY eess.SY

Image Moment Models for Extended Object Tracking

图像矩模型用于扩展目标跟踪

Gang Yao, Ashwin Dani

发表机构 * Department of Electrical and Computer Engineering, University of Connecticut Storrs(电气与计算机工程系,康涅狄格大学斯托尔斯分校)

AI总结 本文提出基于图像矩的新型模型,用于估计和跟踪复杂轨迹下的目标形状。通过无迹卡尔曼滤波-交互多模型算法估计目标形状及位置速度,结合IOU和RMSE指标进行验证。

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems, 2018
AI中文摘要

本文提出了一种基于图像矩的新型模型,用于估计和跟踪具有复杂轨迹的物体形状。假设相机静止注视移动物体,内部点特征作为测量值。假设椭圆近似为基本形状,利用图像矩组合估计椭圆形状。推导了在恒速或协调转弯运动模型下图像矩的动态模型,作为物体形状估计的函数。应用无迹卡尔曼滤波-交互多模型(UKF-IMM)滤波算法估计目标形状(近似为椭圆)并跟踪其位置和速度。基于平均对数似然推导IMM滤波器的似然函数。展示了所提UKF-IMM算法与图像矩模型的仿真结果,显示了在复杂轨迹中移动物体的估计结果。通过交并比(IOU)和位置和速度均方根误差(RMSE)等指标与文献中的基准算法进行比较。还展示了从四旋翼无人机捕获的真实图像数据结果。

英文摘要

In this paper, a novel image moments based model for shape estimation and tracking of an object moving with a complex trajectory is presented. The camera is assumed to be stationary looking at a moving object. Point features inside the object are sampled as measurements. An ellipsoidal approximation of the shape is assumed as a primitive shape. The shape of an ellipse is estimated using a combination of image moments. Dynamic model of image moments when the object moves under the constant velocity or coordinated turn motion model is derived as a function for the shape estimation of the object. An Unscented Kalman Filter-Interacting Multiple Model (UKF-IMM) filter algorithm is applied to estimate the shape of the object (approximated as an ellipse) and track its position and velocity. A likelihood function based on average log-likelihood is derived for the IMM filter. Simulation results of the proposed UKF-IMM algorithm with the image moments based models are presented that show the estimations of the shape of the object moving in complex trajectories. Comparison results, using intersection over union (IOU), and position and velocity root mean square errors (RMSE) as metrics, with a benchmark algorithm from literature are presented. Results on real image data captured from the quadcopter are also presented.

1804.03973 2026-06-04 eess.SY cs.AI cs.SY

Reasoning about Safety of Learning-Enabled Components in Autonomous Cyber-physical Systems

关于自主机电系统中学习组件安全性的推理

Cumhur Erkan Tuncali, James Kapinski, Hisahiro Ito, Jyotirmoy V. Deshmukh

发表机构 * Toyota Research Institute of North America(丰田北美研究院) University of Southern California(南加州大学)

AI总结 本文提出基于模拟的方法生成屏障证书函数,用于验证包含神经网络控制器的机电系统安全性。通过线性规划求解器从随机初始状态获得的模拟轨迹中找到候选生成函数,并利用SMT求解器验证其安全性。

Comments Invited paper in conference: Design Automation Conference (DAC) 2018

详情
AI中文摘要

我们提出一种基于模拟的方法,用于生成用于验证包含神经网络控制器的机电系统(CPS)安全性的屏障证书函数。利用线性规划求解器,从通过随机选择初始状态获得的CPS模型模拟轨迹中找到候选生成函数。然后选择生成函数的水平集作为屏障证书,表示从给定初始状态集无法到达不安全系统状态。通过SMT求解器验证屏障证书属性。该方法在自主车辆的Dubins车模型上进行了案例研究,该模型由神经网络控制以跟随给定路径。

英文摘要

We present a simulation-based approach for generating barrier certificate functions for safety verification of cyber-physical systems (CPS) that contain neural network-based controllers. A linear programming solver is utilized to find a candidate generator function from a set of simulation traces obtained by randomly selecting initial states for the CPS model. A level set of the generator function is then selected to act as a barrier certificate for the system, meaning it demonstrates that no unsafe system states are reachable from a given set of initial states. The barrier certificate properties are verified with an SMT solver. This approach is demonstrated on a case study in which a Dubins car model of an autonomous vehicle is controlled by a neural network to follow a given path.

1804.02814 2026-06-04 eess.SY cs.RO cs.SY

Nonlinear Unknown Input and State Estimation Algorithm in Mobile Robots

移动机器人中的非线性未知输入和状态估计算法

Pinyao Guo, Hunmin Kim, Nurali Virani, Jun Xu, Minghui Zhu, Peng Liu

发表机构 * College of Information Sciences(信息科学学院) School of Electrical Engineering(电气工程学院) GE Global Research(GE全球研究)

AI总结 本文提出一种适用于非线性动态模型和随机噪声的移动机器人未知输入和状态估计算法,通过传感器数据和控制指令检测并量化传感器和执行器的异常。

Comments arXiv admin note: text overlap with arXiv:1708.01834

详情
AI中文摘要

本技术报告描述并推导了一种新型非线性未知输入和状态估计算法(NUISE),用于移动机器人。该算法针对具有非线性动态模型且受传感器和执行器随机噪声影响的真实机器人设计。利用传感器读数和计划的控制指令,该算法能够检测并量化传感器和执行器的异常。随后,我们阐述了两种不同移动机器人的动态模型,以展示NUISE的应用。本报告作为[1]的补充文档。

英文摘要

This technical report provides the description and the derivation of a novel nonlinear unknown input and state estimation algorithm (NUISE) for mobile robots. The algorithm is designed for real-world robots with nonlinear dynamic models and subject to stochastic noises on sensing and actuation. Leveraging sensor readings and planned control commands, the algorithm detects and quantifies anomalies on both sensors and actuators. Later, we elaborate the dynamic models of two distinctive mobile robots for the purpose of demonstrating the application of NUISE. This report serves as a supplementary document for [1].

1610.02967 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Distributed Convex Optimization with Many Convex Constraints

具有许多凸约束的分布式凸优化

Joachim Giesen, Sören Laue

发表机构 * Friedrich-Schiller-University Jena(耶拿弗里德里希-施特劳斯大学)

AI总结 本文提出一种扩展的ADMM方法,用于解决具有众多凸约束的分布式凸优化问题,继承了ADMM和增广拉格朗日方法的收敛性保证。

详情
AI中文摘要

我们解决在分布式环境下求解具有许多凸约束的凸优化问题。我们的方法基于一种扩展的交替方向乘子法(ADMM),该方法近期在大数据领域受到广泛关注。尽管ADMM早在数十年前就被发明,但迄今为止只能应用于无约束问题或具有线性等式或不等式约束的问题。我们的扩展方法能够直接处理任意不等式约束。它结合了ADMM在分布式环境下求解凸优化问题的能力,以及增广拉格朗日方法求解约束优化问题的能力,并且我们证明它继承了ADMM和增广拉格朗日方法的收敛保证。

英文摘要

We address the problem of solving convex optimization problems with many convex constraints in a distributed setting. Our approach is based on an extension of the alternating direction method of multipliers (ADMM) that recently gained a lot of attention in the Big Data context. Although it has been invented decades ago, ADMM so far can be applied only to unconstrained problems and problems with linear equality or inequality constraints. Our extension can handle arbitrary inequality constraints directly. It combines the ability of ADMM to solve convex optimization problems in a distributed setting with the ability of the Augmented Lagrangian method to solve constrained optimization problems, and as we show, it inherits the convergence guarantees of ADMM and the Augmented Lagrangian method.

1712.08585 2026-06-04 math.OC cs.CV cs.NA math.NA

Denoising of image gradients and total generalized variation denoising

图像梯度去噪与总泛化变分去噪

Birgit Komander, Dirk A. Lorenz, Lena Vestweber

发表机构 * Institute of Analysis and Algebra(分析与代数研究所) TU Braunschweig(布拉unsch维格技术大学)

AI总结 本文重新审视全变分去噪,提出增强模型假设已获得图像梯度估计,改进了图像重建质量,并推导出与总泛化变分去噪方法相似的模型,提出约束去噪模型和参数自由的变分去噪模型,采用Chambolle-Pock和Douglas-Rachford方法进行数值实验,验证了预处理对收敛速度的提升。

详情
AI中文摘要

我们重新审视全变分去噪,并研究一个增强模型,其中假设已获得图像梯度的估计。我们证明这会提高图像重建质量,并推导出所得到的模型类似于总泛化变分去噪方法,从而为该模型提供了新的动机。进一步,我们提出使用约束去噪模型,并开发一个基本无参数的变分去噪模型,即所有模型参数都直接从噪声图像中估计。此外,我们使用Chambolle-Pock的对偶方法以及Douglas-Rachford方法用于新模型。对于后者,必须解决大规模的偏微分方程离散化。我们提出以不精确的方式使用预条件共轭梯度法进行处理,并为此推导出预条件器。数值实验表明,所得到的方法具有良好的去噪性能,并且预处理显著提高了收敛速度。最后我们分析了不同TGV去噪问题形式的对偶间隙,并推导出一个简单的停止准则。

英文摘要

We revisit total variation denoising and study an augmented model where we assume that an estimate of the image gradient is available. We show that this increases the image reconstruction quality and derive that the resulting model resembles the total generalized variation denoising method, thus providing a new motivation for this model. Further, we propose to use a constraint denoising model and develop a variational denoising model that is basically parameter free, i.e. all model parameters are estimated directly from the noisy image. Moreover, we use Chambolle-Pock's primal dual method as well as the Douglas-Rachford method for the new models. For the latter one has to solve large discretizations of partial differential equations. We propose to do this in an inexact manner using the preconditioned conjugate gradients method and derive preconditioners for this. Numerical experiments show that the resulting method has good denoising properties and also that preconditioning does increase convergence speed significantly. Finally we analyze the duality gap of different formulations of the TGV denoising problem and derive a simple stopping criterion.

1710.10781 2026-06-04 math.NA cs.CV cs.LG cs.NA stat.ML

Stochastic variance reduced multiplicative update for nonnegative matrix factorization

随机方差缩减乘法更新用于非负矩阵分解

Hiroyuki Kasai

发表机构 * Graduate School of Informatics and Engineering, The University of Electro-Communications(信息与工程研究生院,东京电波通信大学)

AI总结 本文提出一种随机方差缩减乘法更新算法,改进非负矩阵分解的收敛速度,通过数值实验验证其在不同数据集上的优越性。

Comments IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)

详情
AI中文摘要

非负矩阵分解(NMF)是一种降维和因子分析方法,其因子矩阵具有低秩非负约束。考虑到NMF中的随机学习,本文特别针对最流行的乘法更新(MU)规则,该规则收敛速度较慢。本文提出一种随机梯度的方差缩减技术,数值比较表明,所提出的算法在不同合成和实际数据集上均优于现有算法。

英文摘要

Nonnegative matrix factorization (NMF), a dimensionality reduction and factor analysis method, is a special case in which factor matrices have low-rank nonnegative constraints. Considering the stochastic learning in NMF, we specifically address the multiplicative update (MU) rule, which is the most popular, but which has slow convergence property. This present paper introduces on the stochastic MU rule a variance-reduced technique of stochastic gradient. Numerical comparisons suggest that our proposed algorithms robustly outperform state-of-the-art algorithms across different synthetic and real-world datasets.

1803.11411 2026-06-04 eess.SY cs.LG cs.MA cs.SY

Observer-based Adaptive Optimal Output Containment Control problem of Linear Heterogeneous Multi-agent Systems with Relative Output Measurements

基于观测器的自适应最优输出包容控制问题:线性异构多智能体系统中的相对输出测量

Majid Mazouchi, Mohammad Bagher Naghibi-Sistani, Seyed Kamal Hosseini Sani, Farzaneh Tatari, Hamidreza Modares

发表机构 * Department of Electrical Engineering, Ferdowsi University of Mashhad, Mashhad, Iran(马什哈德法尔多西大学电气工程系) Department of Electrical Engineering, University of Semnan, Semnan, Iran(塞姆南大学电气工程系) Missouri University of Science(密苏里科技大学)

AI总结 本文提出了一种基于相对输出反馈的最优解法,用于线性异构多智能体系统的包容控制问题,通过分布式最优控制协议确保跟随器输出处于领导者输出的凸包内并优化暂态性能,采用分布式观测器估计不可用的状态和凸包。

详情
AI中文摘要

本文开发了一种基于最优相对输出反馈的解决方案,用于解决线性异构多智能体系统的包容控制问题。提出了一种分布式最优控制协议,使跟随器的输出落在领导者输出的凸包内(即期望或安全区域),并优化其暂态性能。所提出的最优控制解决方案由反馈部分和前馈部分组成,反馈部分依赖于跟随器的状态,前馈部分依赖于领导者状态的凸包。为了符合大多数实际应用,假设反馈和前馈状态不可用,并使用两个分布式观测器进行估计。即,由于跟随器无法直接感知其绝对状态,设计了一个分布式观测器,仅使用相对于邻居的输出测量(例如通过机器人中的范围传感器测量)和邻居广播的信息来估计其状态。此外,还设计了一个自适应分布式观测器,通过在通信网络上交换信息来估计领导者状态的凸包。所提出的观测器放松了所有跟随器必须知道领导者动态完整知识的严格要求。接下来,开发了一种基于Actor-Critic结构的离策略强化学习算法,用于在线解决最优包容控制问题,使用相对输出测量,无需所有跟随器知道领导者动态。最后,通过数值模拟验证了理论结果。

英文摘要

This paper develops an optimal relative output-feedback based solution to the containment control problem of linear heterogeneous multi-agent systems. A distributed optimal control protocol is presented for the followers to not only assure that their outputs fall into the convex hull of the leaders' output (i.e., the desired or safe region), but also optimizes their transient performance. The proposed optimal control solution is composed of a feedback part, depending of the followers' state, and a feed-forward part, depending on the convex hull of the leaders' state. To comply with most real-world applications, the feedback and feed-forward states are assumed to be unavailable and are estimated using two distributed observers. That is, since the followers cannot directly sense their absolute states, a distributed observer is designed that uses only relative output measurements with respect to their neighbors (measured for example by using range sensors in robotic) and the information which is broadcasted by their neighbors to estimate their states. Moreover, another adaptive distributed observer is designed that uses exchange of information between followers over a communication network to estimate the convex hull of the leaders' state. The proposed observer relaxes the restrictive requirement of knowing the complete knowledge of the leaders' dynamics by all followers. An off-policy reinforcement learning algorithm on an actor-critic structure is next developed to solve the optimal containment control problem online, using relative output measurements and without requirement of knowing the leaders' dynamics by all followers. Finally, the theoretical results are verified by numerical simulations.

1705.10887 2026-06-04 stat.ML cs.CV cs.LG cs.NA math.NA

Efficient, sparse representation of manifold distance matrices for classical scaling

高效表示经典标度中的流形距离矩阵

Javier S. Turek, Alexander Huth

发表机构 * Intel Labs(英特尔实验室) The University of Texas at Austin(得克萨斯大学奥斯汀分校)

AI总结 本文提出一种基于双调和插值的稀疏方法,用于高效表示流形距离矩阵,相比现有方法速度快2倍,内存占用低20倍,能处理大规模点集。

Comments Conference CVPR 2018

详情
AI中文摘要

Geodesic距离矩阵可以揭示对非刚性变形不敏感的形状特性,因此常用于分析和表示3-D形状。然而,这些矩阵随点数的平方增长,因此对于大规模点集常用低秩近似来存储和分析。本文提出了一种新颖的稀疏方法,利用双调和插值高效表示流形距离矩阵。该方法利用数据流形的知识,学习一个稀疏插值算子,通过部分点近似距离。我们证明,与现有方法相比,该方法在处理大规模点集的MDS问题时速度快2倍,内存占用低20倍,质量相似。这使得分析之前不可行的大规模点集成为可能。

英文摘要

Geodesic distance matrices can reveal shape properties that are largely invariant to non-rigid deformations, and thus are often used to analyze and represent 3-D shapes. However, these matrices grow quadratically with the number of points. Thus for large point sets it is common to use a low-rank approximation to the distance matrix, which fits in memory and can be efficiently analyzed using methods such as multidimensional scaling (MDS). In this paper we present a novel sparse method for efficiently representing geodesic distance matrices using biharmonic interpolation. This method exploits knowledge of the data manifold to learn a sparse interpolation operator that approximates distances using a subset of points. We show that our method is 2x faster and uses 20x less memory than current leading methods for solving MDS on large point sets, with similar quality. This enables analyses of large point sets that were previously infeasible.

1701.01394 2026-06-04 cs.DS cs.LG cs.NA math.NA stat.ML

On spectral partitioning of signed graphs

关于带符号图的谱划分

Andrew V. Knyazev

发表机构 * Mitsubishi Electric Research Laboratories (MERL)(三菱电机研究实验室(MERL))

AI总结 本文讨论了带符号图谱划分中标准图拉普拉斯矩阵优于符号拉普拉斯矩阵,指出基于符号拉普拉斯矩阵主特征向量的划分方法更有效,负特征值有助于提高计算效率。

Comments 12 pages, 10 figures. Rev 2 to appear in proceedings of the SIAM Workshop on Combinatorial Scientific Computing 2018 (CSC18)

详情
AI中文摘要

我们主张标准图拉普拉斯矩阵比符号拉普拉斯矩阵更适合带符号图的谱划分。简单例子表明,基于符号拉普拉斯矩阵主特征向量的划分可能无意义,而基于标准图拉普拉斯矩阵Fiedler向量的划分更有效。我们观察到负特征值对带符号图的谱划分有益,使Fiedler向量更容易计算。

英文摘要

We argue that the standard graph Laplacian is preferable for spectral partitioning of signed graphs compared to the signed Laplacian. Simple examples demonstrate that partitioning based on signs of components of the leading eigenvectors of the signed Laplacian may be meaningless, in contrast to partitioning based on the Fiedler vector of the standard graph Laplacian for signed graphs. We observe that negative eigenvalues are beneficial for spectral partitioning of signed graphs, making the Fiedler vector easier to compute.

1803.09792 2026-06-04 cs.MA cs.RO cs.SY eess.SY

Min-Max Tours for Task Allocation to Heterogeneous Agents

为异构智能体分配任务的最优化路线

Amritha Prasad, Han-Lim Choi, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University(普渡大学电气与计算机工程学院)

AI总结 研究如何为异构移动智能体分配任务,以最小化任何智能体完成任务并返回仓库的最大成本,提出了一种三阶段算法,提供5倍近似比。

详情
AI中文摘要

我们考虑一个场景,其中一组异构移动智能体位于仓库,一组任务分散在地理区域中。智能体分为不同类型,任务分为专用任务(只能由特定类型的智能体完成)和通用任务(可由任何智能体完成)。每对任务之间的距离已指定,并满足三角不等式。给定此场景,我们研究如何将这些任务分配给可用智能体(受类型兼容性约束),以最小化任何智能体完成任务并返回仓库的最大成本。该问题属于NP难问题,我们提出了一种三阶段算法来解决该问题,无论总智能体数量和每种类型智能体数量如何,该算法提供5倍的近似比。我们还证明,在仅有一种类型智能体的情况下,该算法的近似因子为4。

英文摘要

We consider a scenario consisting of a set of heterogeneous mobile agents located at a depot, and a set of tasks dispersed over a geographic area. The agents are partitioned into different types. The tasks are partitioned into specialized tasks that can only be done by agents of a certain type, and generic tasks that can be done by any agent. The distances between each pair of tasks are specified, and satisfy the triangle inequality. Given this scenario, we address the problem of allocating these tasks among the available agents (subject to type compatibility constraints) while minimizing the maximum cost to tour the allocation by any agent and return to the depot. This problem is NP-hard, and we give a three phase algorithm to solve this problem that provides 5-factor approximation, regardless of the total number of agents and the number of agents of each type. We also show that in the special case where there is only one agent of each type, the algorithm has an approximation factor of 4.

1711.02271 2026-06-04 math.NA cs.LG cs.NA

High-order Tensor Completion for Data Recovery via Sparse Tensor-train Optimization

高阶张量补全:通过稀疏张量-列车优化实现数据恢复

Longhao Yuan, Qibin Zhao, Jianting Cao

发表机构 * Graduate School of Engineering, Saitama Institute of Technology, Japan(埼玉工科大学工学研究科) Tensor Learning Unit, RIKEN Center for Advanced Intelligence Project (AIP), Japan(日本先进情报项目(AIP)RIKEN智能学习单元) School of Automation, Guangdong University of Technology, China(广东技术大学自动化学院) School of Computer Science and Technology, Hangzhou Dianzi University, China(杭州电子科技大学计算机科学与技术学院)

AI总结 本文提出稀疏张量-列车优化算法,通过将缺失数据视为稀疏张量并利用一阶优化方法求解张量-列车分解因子,有效提升低阶和高阶张量补全性能,尤其在高缺失率下表现优异。

Comments 5 pages (include 1 page of reference) ICASSP 2018

详情
AI中文摘要

本文针对张量数据补全问题,采用张量-列车分解因其强大的表示能力和线性可扩展性。我们提出稀疏张量-列车优化(STTO)算法,将不完整数据视为稀疏张量,并使用一阶优化方法求解张量-列车分解因子。我们的算法在低阶和高阶情况的模拟实验中表现良好。我们还采用张量化方法将数据转换为高阶形式以提升算法性能。各种图像恢复实验的结果表明,我们的方法优于其他补全算法。尤其是在缺失率非常高时,例如90%到99%,我们的方法显著优于最先进的方法。

英文摘要

In this paper, we aim at the problem of tensor data completion. Tensor-train decomposition is adopted because of its powerful representation ability and linear scalability to tensor order. We propose an algorithm named Sparse Tensor-train Optimization (STTO) which considers incomplete data as sparse tensor and uses first-order optimization method to find the factors of tensor-train decomposition. Our algorithm is shown to perform well in simulation experiments at both low-order cases and high-order cases. We also employ a tensorization method to transform data to a higher-order form to enhance the performance of our algorithm. The results of image recovery experiments in various cases manifest that our method outperforms other completion algorithms. Especially when the missing rate is very high, e.g., 90\% to 99\%, our method is significantly better than the state-of-the-art methods.

1707.03770 2026-06-04 eess.SY cs.LG cs.SY math.OC

Fastest Convergence for Q-learning

Q-learning 最快收敛算法

Adithya M. Devraj, Sean P. Meyn

发表机构 * University of Florida(佛罗里达大学) University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出Zap Q-learning算法,通过矩阵增益设计实现渐近方差最优,并通过ODE分析证明其瞬态行为接近确定性牛顿-拉夫森法,实验验证其在非理想参数化设置下的快速收敛性。

详情
AI中文摘要

本文提出的Zap Q-learning算法是对Watkins原始算法及近期竞争对手的改进,是一种设计使得渐近方差最优的矩阵增益算法。此外,通过ODE分析表明,其瞬态行为与确定性牛顿-拉夫森实现接近,这得益于对矩阵增益序列的两时间尺度更新方程。分析表明,该方法即使在非理想参数化设置下也能实现稳定高效的计算。数值实验验证了其在非理想情况下的快速收敛性。本文的次要目标是教程性的,前半部分对强化学习算法进行了综述,重点在于最小方差算法。

英文摘要

The Zap Q-learning algorithm introduced in this paper is an improvement of Watkins' original algorithm and recent competitors in several respects. It is a matrix-gain algorithm designed so that its asymptotic variance is optimal. Moreover, an ODE analysis suggests that the transient behavior is a close match to a deterministic Newton-Raphson implementation. This is made possible by a two time-scale update equation for the matrix gain sequence. The analysis suggests that the approach will lead to stable and efficient computation even for non-ideal parameterized settings. Numerical experiments confirm the quick convergence, even in such non-ideal cases. A secondary goal of this paper is tutorial. The first half of the paper contains a survey on reinforcement learning algorithms, with a focus on minimum variance algorithms.

1612.05971 2026-06-04 eess.SY cs.AI cs.GT cs.SY math.OC

An Integrated Optimization + Learning Approach to Optimal Dynamic Pricing for the Retailer with Multi-type Customers in Smart Grids

在智能电网中面向多类型顾客的零售商最优动态定价集成优化与学习方法

Fanlin Meng, Xiao-Jun Zeng, Yan Zhang, Chris J. Dent, Dunwei Gong

发表机构 * School of Engineering and Computing Sciences, Durham University(工程与计算科学学院,达勒姆大学) School of Computer Science, The University of Manchester(计算机科学学院,曼彻斯特大学) College of Information System and Management, National University of Defense Technology(信息系统与管理学院,国防科技大学) School of Mathematics, University of Edinburgh(数学学院,爱丁堡大学) School of Information and Control Engineering, China University of Mining and Technology(信息与控制工程学院,中国矿业大学)

AI总结 本文针对智能电网中零售商服务三种不同类型的顾客问题,提出两级决策框架,结合优化与学习方法实现动态定价优化,通过仿真实验验证模型的有效性。

Comments 38 pages, 6 figures

详情
AI中文摘要

本文考虑智能电网中零售商服务三种不同类型的顾客的现实场景,即具有嵌入智能电表的最优家庭能源管理系统顾客(C-HEMS)、仅具有智能电表的顾客(C-SM)以及无智能电表的顾客(C-NONE)。本文的主要目标是支持零售商在混合顾客群体中做出最优的日提前动态定价决策。为此,我们提出一个两级决策框架,其中零售商作为上层代理首先宣布未来24小时的电力价格,顾客作为下层代理随后根据价格调度其能源使用。对于下层问题,我们根据不同顾客的独特特征建模其价格响应性。对于上层问题,我们优化动态价格以最大化零售商利润,同时满足现实市场约束。上述两级模型通过基于遗传算法(GA)的分布式优化方法解决,其可行性和有效性通过仿真结果得到验证。

英文摘要

In this paper, we consider a realistic and meaningful scenario in the context of smart grids where an electricity retailer serves three different types of customers, i.e., customers with an optimal home energy management system embedded in their smart meters (C-HEMS), customers with only smart meters (C-SM), and customers without smart meters (C-NONE). The main objective of this paper is to support the retailer to make optimal day-ahead dynamic pricing decisions in such a mixed customer pool. To this end, we propose a two-level decision-making framework where the retailer acting as upper-level agent firstly announces its electricity prices of next 24 hours and customers acting as lower-level agents subsequently schedule their energy usages accordingly. For the lower level problem, we model the price responsiveness of different customers according to their unique characteristics. For the upper level problem, we optimize the dynamic prices for the retailer to maximize its profit subject to realistic market constraints. The above two-level model is tackled by genetic algorithms (GA) based distributed optimization methods while its feasibility and effectiveness are confirmed via simulation results.

1803.06989 2026-06-04 math.ST cs.LG cs.NA math.NA stat.ML stat.TH

Numerical Integration on Graphs: where to sample and how to weigh

图上的数值积分:在哪里采样和如何加权

George C. Linderman, Stefan Steinerberger

发表机构 * Program in Applied Mathematics, Yale University, New Haven, CT 06511, USA(应用数学项目,耶鲁大学,新 Haven, CT 06511, USA) Department of Mathematics, Yale University, New Haven, CT 06511, USA(数学系,耶鲁大学,新 Haven, CT 06511, USA)

AI总结 研究图上数值积分问题,通过热球最优打包几何问题重构积分,提出采样策略与加权方法,验证方法效率。

详情
AI中文摘要

设G=(V,E,w)为有限连通加权图。我们关注寻找顶点子集W⊆V和权重a_w,使得1/|V|∑_{v∈V}f(v)≈∑_{w∈W}a_wf(w),其中f:V→R是图几何下光滑的函数。主要应用是当f依赖于图结构但单点评估成本高的问题。证明不等式显示积分问题可转化为几何问题(最优热球打包)。讨论如何构造热球打包近似解;数值示例展示方法效率。

英文摘要

Let $G=(V,E,w)$ be a finite, connected graph with weighted edges. We are interested in the problem of finding a subset $W \subset V$ of vertices and weights $a_w$ such that $$ \frac{1}{|V|}\sum_{v \in V}^{}{f(v)} \sim \sum_{w \in W}{a_w f(w)}$$ for functions $f:V \rightarrow \mathbb{R}$ that are `smooth' with respect to the geometry of the graph. The main application are problems where $f$ is known to somehow depend on the underlying graph but is expensive to evaluate on even a single vertex. We prove an inequality showing that the integration problem can be rewritten as a geometric problem (`the optimal packing of heat balls'). We discuss how one would construct approximate solutions of the heat ball packing problem; numerical examples demonstrate the efficiency of the method.

1803.06775 2026-06-04 quant-ph cs.AI cs.ET cs.SY eess.SY

Comparing and Integrating Constraint Programming and Temporal Planning for Quantum Circuit Compilation

比较和整合约束编程与时间规划用于量子电路编译

Kyle E. C. Booth, Minh Do, J. Christopher Beck, Eleanor Rieffel, Davide Venturelli, Jeremy Frank

发表机构 * Quantum Artificial Intelligence Laboratory, NASA Ames Research Center(量子人工智能实验室,美国国家航空航天局阿姆斯特朗研究中心) Planning and Scheduling Group, NASA Ames Research Center(规划与调度组,美国国家航空航天局阿姆斯特朗研究中心) USRA Research Institute for Advanced Computer Science(美国宇航局高级计算机科学研究所) Stinger Ghaffarian Technologies, Inc.(Stinger Ghaffarian技术公司) Department of Mechanical & Industrial Engineering, University of Toronto(多伦多大学机械与工业工程系)

AI总结 本文比较了约束编程与时间规划在量子电路编译中的应用,提出混合方法提升求解质量,证明混合方法在多数问题中优于单独使用时间规划。

Comments 9 pages, 2 figures, Proceedings of the 28th International Conference of Automated Planning and Scheduling 2018 (ICAPS-18)

详情
AI中文摘要

最近,将一般量子算法编译为近期量子处理器的makespan最小化问题被引入人工智能社区。研究显示时间规划是量子电路编译(QCC)问题的一种强大方法。本文探讨了约束编程(CP)作为时间规划的替代和补充方法。我们通过引入两个新的问题变体扩展了先前工作,这些变体结合了量子计算社区识别的重要特征。我们应用时间规划和CP解决基准和扩展的QCC问题,作为单独和混合方法。我们的混合方法利用时间规划找到的解决方案预热CP,利用前者在任务选项性高的问题中找到满意解的能力,而CP通常难以处理。CP模型受益于预热提供的推断边界,从而找到更高质量的解。实证评估表明,虽然单独使用CP仅在最小问题中具有竞争力,但CP与时间规划的混合方法在多数问题类别中表现优于单独使用时间规划。

英文摘要

Recently, the makespan-minimization problem of compiling a general class of quantum algorithms into near-term quantum processors has been introduced to the AI community. The research demonstrated that temporal planning is a strong approach for a class of quantum circuit compilation (QCC) problems. In this paper, we explore the use of constraint programming (CP) as an alternative and complementary approach to temporal planning. We extend previous work by introducing two new problem variations that incorporate important characteristics identified by the quantum computing community. We apply temporal planning and CP to the baseline and extended QCC problems as both stand-alone and hybrid approaches. Our hybrid methods use solutions found by temporal planning to warm start CP, leveraging the ability of the former to find satisficing solutions to problems with a high degree of task optionality, an area that CP typically struggles with. The CP model, benefiting from inferred bounds on planning horizon length and task counts provided by the warm start, is then used to find higher quality solutions. Our empirical evaluation indicates that while stand-alone CP is only competitive for the smallest problems, CP in our hybridization with temporal planning out-performs stand-alone temporal planning in the majority of problem classes.

1703.02419 2026-06-04 stat.CO cs.LG cs.SY eess.SY

Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo

利用序贯蒙特卡洛方法进行非线性动力系统概率学习

Thomas B. Schön, Andreas Svensson, Lawrence Murray, Fredrik Lindsten

发表机构 * Department of Information Technology, Uppsala University(乌普萨拉大学信息科技系)

AI总结 本文提出基于序贯蒙特卡洛方法的概率非线性状态空间模型学习方法,通过粒子Metropolis-Hastings算法实现参数空间的高效采样,并展示了该方法在动态系统建模中的应用。

Comments Thomas B. Schön, Andreas Svensson, Lawrence Murray and Fredrik Lindsten, 2018. Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo. In Mechanical Systems and Signal Processing, Volume 104, pp. 866-883

详情
AI中文摘要

概率建模能够表示和操纵数据、模型、预测和决策中的不确定性。本文关注从测量数据中学习动态系统概率模型的问题,特别是非线性状态空间模型的学习。由于该问题没有闭式解,因此必须使用近似方法。本文提供了一个自包含的介绍,介绍了一种最先进的方法——粒子Metropolis-Hastings算法,该算法已被证明能提供实用的近似。这是一种基于蒙特卡洛的方法,其中粒子滤波用于引导马尔可夫链蒙特卡洛方法通过参数空间。粒子Metropolis-Hastings算法的一个关键优点是,在温和的假设下,它保证收敛到“真实解”,尽管它基于仅有限数量粒子的粒子滤波。本文还提供了一个数值示例,通过为序贯蒙特卡洛方法量身定制的建模语言来展示该方法。此类建模语言的目的是将高级蒙特卡洛方法(包括粒子Metropolis-Hastings)的威力带给大量用户,而无需他们了解所有底层数学细节。

英文摘要

Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we are forced to use approximations. In this tutorial we will provide a self-contained introduction to one of the state-of-the-art methods---the particle Metropolis--Hastings algorithm---which has proven to offer a practical approximation. This is a Monte Carlo based method, where the particle filter is used to guide a Markov chain Monte Carlo method through the parameter space. One of the key merits of the particle Metropolis--Hastings algorithm is that it is guaranteed to converge to the "true solution" under mild assumptions, despite being based on a particle filter with only a finite number of particles. We will also provide a motivating numerical example illustrating the method using a modeling language tailored for sequential Monte Carlo methods. The intention of modeling languages of this kind is to open up the power of sophisticated Monte Carlo methods---including particle Metropolis--Hastings---to a large group of users without requiring them to know all the underlying mathematical details.

1803.03104 2026-06-04 eess.SY cs.CV cs.SY math.DS stat.ML

Applicability and interpretation of the deterministic weighted cepstral distance

确定性加权谱距的应用与解释

Oliver Lauwers, Bart De Moor

发表机构 * KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing(库勒文大学电子工程系(ESAT)、动态系统信号处理与数据分析中心)

AI总结 本文结合系统理论和机器学习,研究了加权谱距在确定性线性时不变单输入单输出模型中的应用,提出了一种基于输入输出信号信息评估系统稳定性和相位类型的纯数据驱动方法。

Comments 18 pages, 5 figures, submitted for review to Automatica

详情
AI中文摘要

量化数据对象之间的相似性是现代数据科学的重要部分。决定使用哪种相似性度量非常依赖于具体应用。本文结合系统理论和机器学习的见解,研究了之前为ARMA模型信号定义的加权谱距。我们将其扩展到可逆的确定性线性时不变单输入单输出模型,并评估其适用性。我们证明了该距离总能以底层模型的极点和零点进行解释,并在稳定、最小相位或不稳定、最大相位模型的情况下,可以以子空间角度进行几何解释。然后,我们提出了一种仅使用输入/输出信号信息的方法来评估生成模型的稳定性和相位类型。通过这种方式,我们证明了扩展的加权谱距与加权谱模型范数之间的联系。通过这种方式,我们提供了一种纯数据驱动的方法来评估输入/输出信号对的不同底层动态,而无需任何系统识别步骤。这在时间序列聚类等机器学习任务中可能很有用。本文还发布了一个iPython教程,包含各种方法和算法的实现,以及一些证明等价性的数值示例。

英文摘要

Quantifying similarity between data objects is an important part of modern data science. Deciding what similarity measure to use is very application dependent. In this paper, we combine insights from systems theory and machine learning, and investigate the weighted cepstral distance, which was previously defined for signals coming from ARMA models. We provide an extension of this distance to invertible deterministic linear time invariant single input single output models, and assess its applicability. We show that it can always be interpreted in terms of the poles and zeros of the underlying model, and that, in the case of stable, minimum-phase, or unstable, maximum-phase models, a geometrical interpretation in terms of subspace angles can be given. We then devise a method to assess stability and phase-type of the generating models, using only input/output signal information. In this way, we prove a connection between the extended weighted cepstral distance and a weighted cepstral model norm. In this way, we provide a purely data-driven way to assess different underlying dynamics of input/output signal pairs, without the need for any system identification step. This can be useful in machine learning tasks such as time series clustering. An iPython tutorial is published complementary to this paper, containing implementations of the various methods and algorithms presented here, as well as some numerical illustrations of the equivalences proven here.

1803.01626 2026-06-04 stat.ML cs.LG cs.SY eess.SY

Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs

考虑平均回报准则下未知离散马尔可夫决策过程(MDP)中的强化学习的方差意识后悔界

Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

发表机构 * KTH Royal Institute of Technology(皇家理工学院) INRIA Lille – Nord Europe(里尔-北欧洲研究所)

AI总结 本文基于平均回报准则,重新审视未知离散MDP中的强化学习问题,通过引入局部方差代替MDP直径,改进KL-UCRL算法的后悔界,提供更优的性能保证。

Comments To appear in Proceedings of the 29th International Conference on Algorithmic Learning Theory (ALT 2018)

详情
AI中文摘要

在未知和离散的马尔可夫决策过程(MDP)中,考虑在单一流观测下进行强化学习的问题,当学习者从初始状态开始与系统交互时。我们通过引入偏倚函数的局部方差代替MDP的直径,重新审视该问题的最小最大下界。此外,我们提供了KL-UCRL算法的新型分析,建立了高概率的后悔界,其规模为$\widetilde {\mathcal O}\Bigl({\textstyle \sqrt{S\sum_{s,a}{\bf V}^\star_{s,a}T}}\Big)$,适用于周期性MDP。该界优于之前已知的$\widetilde {\mathcal O}(DS\sqrt{AT})$界,其中$A$和$D$分别表示每个状态的最大动作数和MDP的直径。我们最终在一些基准MDP中比较了两个界的主导项,表明在某些情况下,所推导的界可以提供一个数量级的改进。我们的分析利用了运输引理的新变体结合KL集中不等式,我们认为这些方法具有独立的兴趣。

英文摘要

The problem of reinforcement learning in an unknown and discrete Markov Decision Process (MDP) under the average-reward criterion is considered, when the learner interacts with the system in a single stream of observations, starting from an initial state without any reset. We revisit the minimax lower bound for that problem by making appear the local variance of the bias function in place of the diameter of the MDP. Furthermore, we provide a novel analysis of the KL-UCRL algorithm establishing a high-probability regret bound scaling as $\widetilde {\mathcal O}\Bigl({\textstyle \sqrt{S\sum_{s,a}{\bf V}^\star_{s,a}T}}\Big)$ for this algorithm for ergodic MDPs, where $S$ denotes the number of states and where ${\bf V}^\star_{s,a}$ is the variance of the bias function with respect to the next-state distribution following action $a$ in state $s$. The resulting bound improves upon the best previously known regret bound $\widetilde {\mathcal O}(DS\sqrt{AT})$ for that algorithm, where $A$ and $D$ respectively denote the maximum number of actions (per state) and the diameter of MDP. We finally compare the leading terms of the two bounds in some benchmark MDPs indicating that the derived bound can provide an order of magnitude improvement in some cases. Our analysis leverages novel variations of the transportation lemma combined with Kullback-Leibler concentration inequalities, that we believe to be of independent interest.

1803.01579 2026-06-04 eess.SY cs.RO cs.SY

Motion and Cooperative Transportation Planning for Multi-Agent Systems under Temporal Logic Formulas

多智能体系统在时序逻辑公式下的运动与协作运输规划

Christos K. Verginis, Dimos V. Dimarogonas

发表机构 * KTH Center for Autonomous Systems(KTH 自主系统中心)

AI总结 本文提出一种混合控制框架,用于在高阶目标表达为线性时序逻辑(LTL)公式的情况下,多智能体系统的运动规划。设计控制协议以实现智能体在预定义兴趣区域间的过渡及协作运输物体。通过抽象智能体与物体的耦合行为为有限状态转移系统,设计满足智能体和物体规格的高层多智能体计划。

Comments Submitted to IEEE Transactions on Automation Science and Engineering. arXiv admin note: text overlap with arXiv:1611.05186

详情
AI中文摘要

本文提出了一种混合控制框架,用于多智能体系统(包括N个机器人和M个物体)在高阶目标表达为线性时序逻辑(LTL)公式的情况下进行运动规划。特别地,我们设计了控制协议,使智能体能够在预定义的兴趣区域之间过渡,并通过智能体协作运输物体。这允许将智能体和物体的耦合行为抽象为有限状态转移系统,并设计一个满足智能体和物体规格的高层多智能体计划,这些规格以时序逻辑公式给出。仿真结果验证了所提框架。

英文摘要

This paper presents a hybrid control framework for the motion planning of a multi-agent system including N robotic agents and M objects, under high level goals expressed as Linear Temporal Logic (LTL) formulas. In particular, we design control protocols that allow the transition of the agents as well as the cooperative transportation of the objects by the agents, among predefined regions of interest in the workspace. This allows to abstract the coupled behavior of the agents and the objects as a finite transition system and to design a high-level multi-agent plan that satisfies the agents' and the objects' specifications, given as temporal logic formulas. Simulation results verify the proposed framework.

1803.00491 2026-06-04 stat.ML cs.LG cs.NA math.NA

The Power Mean Laplacian for Multilayer Graph Clustering

多层图聚类的幂均值拉普拉斯

Pedro Mercado, Antoine Gautier, Francesco Tudisco, Matthias Hein

发表机构 * Department of Mathematics and Computer Science, Saarland University(萨尔兰大学数学与计算机科学系) Department of Mathematics and Statistics, University of Strathclyde(斯特拉斯克莱德大学数学与统计学系)

AI总结 本文提出一种参数化的矩阵幂均值方法,用于融合多层图的拉普拉斯矩阵,分析其在随机块模型中的期望性能,并在真实数据中验证其在不同设置下恢复真实聚类的能力。

Comments 19 pages, 3 figures. Accepted in Artificial Intelligence and Statistics (AISTATS), 2018

详情
AI中文摘要

多层图编码了相同实体集之间的不同种类的相互作用。当对这样的多层图进行聚类时,自然的问题是应如何融合不同层的信息。本文介绍了一种参数化的矩阵幂均值家族,用于融合不同层的拉普拉斯矩阵,并在随机块模型中分析其期望性能。我们证明该家族在不同设置下能够恢复真实聚类,并在真实世界数据中验证了这一点。尽管计算矩阵幂均值对于大图来说可能非常昂贵,我们引入了一种数值方案,以高效计算大规模稀疏图的幂均值的特征向量。

英文摘要

Multilayer graphs encode different kind of interactions between the same set of entities. When one wants to cluster such a multilayer graph, the natural question arises how one should merge the information different layers. We introduce in this paper a one-parameter family of matrix power means for merging the Laplacians from different layers and analyze it in expectation in the stochastic block model. We show that this family allows to recover ground truth clusters under different settings and verify this in real world data. While computing the matrix power mean can be very expensive for large graphs, we introduce a numerical scheme to efficiently compute its eigenvectors for the case of large sparse graphs.

1707.01625 2026-06-04 eess.SY cs.AI cs.GT cs.SY

Optimal Vehicle Dispatching Schemes via Dynamic Pricing

通过动态定价实现最优车辆调度方案

Mengjing Chen, Weiran Shen, Pingzhong Tang, Song Zuo

发表机构 * IIIS, Tsinghua University(清华大学信息科学与技术学院)

AI总结 本文通过经济方法解决网约车平台在地理和时间信息下的最优定价和车辆调度问题,提出高效算法计算最优随机定价方案,并通过实验证明其优于固定定价和涨价机制。

详情
AI中文摘要

近年来,拼车服务已成为缓解交通拥堵的有效方式。这些平台的关键问题是如何制定收入最优(或GMV最优)的定价方案和诱导的车辆调度策略,以整合地理和时间信息。本文通过经济方法解决此问题。简单建模下,底层优化问题可能非凸,难以计算。为此,我们使用所谓的

英文摘要

Over the past few years, ride-sharing has emerged as an effective way to relieve traffic congestion. A key problem for these platforms is to come up with a revenue-optimal (or GMV-optimal) pricing scheme and an induced vehicle dispatching policy that incorporate geographic and temporal information. In this paper, we aim to tackle this problem via an economic approach. Modeled naively, the underlying optimization problem may be non-convex and thus hard to compute. To this end, we use a so-called "ironing" technique to convert the problem into an equivalent convex optimization one via a clean Markov decision process (MDP) formulation, where the states are the driver distributions and the decision variables are the prices for each pair of locations. Our main finding is an efficient algorithm that computes the exact revenue-optimal (or GMV-optimal) randomized pricing schemes. We characterize the optimal solution of the MDP by a primal-dual analysis of a corresponding convex program. We also conduct empirical evaluations of our solution through real data of a major ride-sharing platform and show its advantages over fixed pricing schemes as well as several prevalent surge-based pricing schemes.

1802.00930 2026-06-04 cs.NE cs.LG cs.NA math.NA

Mixed Precision Training of Convolutional Neural Networks using Integer Operations

使用整数运算进行卷积神经网络的混合精度训练

Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesus Corbal, Nikita Shustrov, Roma Dubtsov, Evarist Fomenko, Vadim Pirogov

发表机构 * Parallel Computing Lab(并行计算实验室) Intel Labs, India(英特尔实验室,印度) Product Architecture Group(产品架构组) Intel Labs, SC Intel, OR(英特尔实验室,SC英特尔,美国) Software Services Group(软件服务组) Intel, OR(英特尔,美国)

AI总结 本文提出了一种基于整数运算的混合精度训练方法,在ImageNet-1K数据集上训练了ResNet-50、GoogLeNet-v1等SOTA网络,实现了比FP32更高的训练吞吐量和相同精度下的最高准确率。

Comments Published as a conference paper at ICLR 2018

详情
AI中文摘要

当前混合精度训练的SOTA主要基于低精度浮点运算,如FP16累积到FP32的变种。然而,在低精度和混合精度整数训练领域,已有研究要么针对非SOTA网络(如仅AlexNet用于ImageNet-1K),要么针对较小的数据集(如CIFAR-10)。本文在通用硬件上训练了SOTA视觉理解神经网络,使用整数运算。特别关注整数融合乘加(FMA)运算,其输入为两个INT16操作数,输出为INT32。我们提出了张量的共享指数表示,并开发了适用于常见神经网络操作的动态定点(DFP)方案。研究了高效整数卷积核的开发细节,包括处理INT32累加器溢出的方法。我们实现了ResNet-50、GoogLeNet-v1、VGG-16和AlexNet的CNN训练,这些网络在相同迭代次数下达到或超过FP32的SOTA准确率,无需改变超参数,并在端到端训练吞吐量上提高了1.8倍。据我们所知,这些结果是首次在通用硬件上使用SOTA CNNs在ImageNet-1K数据集上实现INT16训练的结果,并实现了最高报告的准确率。

英文摘要

The state-of-the-art (SOTA) for mixed precision training is dominated by variants of low precision floating point operations, and in particular, FP16 accumulating into FP32 Micikevicius et al. (2017). On the other hand, while a lot of research has also happened in the domain of low and mixed-precision Integer training, these works either present results for non-SOTA networks (for instance only AlexNet for ImageNet-1K), or relatively small datasets (like CIFAR-10). In this work, we train state-of-the-art visual understanding neural networks on the ImageNet-1K dataset, with Integer operations on General Purpose (GP) hardware. In particular, we focus on Integer Fused-Multiply-and-Accumulate (FMA) operations which take two pairs of INT16 operands and accumulate results into an INT32 output.We propose a shared exponent representation of tensors and develop a Dynamic Fixed Point (DFP) scheme suitable for common neural network operations. The nuances of developing an efficient integer convolution kernel is examined, including methods to handle overflow of the INT32 accumulator. We implement CNN training for ResNet-50, GoogLeNet-v1, VGG-16 and AlexNet; and these networks achieve or exceed SOTA accuracy within the same number of iterations as their FP32 counterparts without any change in hyper-parameters and with a 1.8X improvement in end-to-end training throughput. To the best of our knowledge these results represent the first INT16 training results on GP hardware for ImageNet-1K dataset using SOTA CNNs and achieve highest reported accuracy using half-precision

1708.07827 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study

非凸机器学习中的二阶优化:一项实证研究

Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney

发表机构 * Institute for Computational and Mathematical Engineering, Stanford University(计算与数学工程研究所,斯坦福大学) School of Mathematics and Physics, University of Queensland(数学与物理学院,昆士兰大学) International Computer Science Institute, Berkeley, USA(国际计算机科学研究所,伯克利,美国) International Computer Science Institute and Department of Statistics, University of California at Berkeley(国际计算机科学研究所和统计学系,加州大学伯克利分校)

AI总结 本文通过实证研究评估了非凸机器学习问题中牛顿型方法的性能,证明其在泛化性能和超参数鲁棒性方面优于传统SGD,能有效逃离平坦区域和鞍点。

Comments 21 pages, 11 figures. Restructure the paper and add experiments

详情
AI中文摘要

尽管随机梯度下降(SGD)等一阶优化方法在机器学习(ML)中广泛应用,但它们存在收敛速度慢、超参数设置敏感、易陷入高训练误差和难以逃离平坦区域及鞍点等缺陷。在高度非凸设置(如神经网络中)尤为明显。受此启发,近期有研究关注二阶方法,旨在通过捕捉曲率信息缓解这些不足。本文报告了针对非凸ML问题的一类牛顿型方法——信任区域(TR)和自适应三次正则化(ARC)算法的子采样变体的详细实证评估。在此过程中,我们证明这些方法不仅在计算上与手工调优的SGD加动量方法具有竞争力,泛化性能可比或更优,而且对超参数设置具有高度鲁棒性。此外,与SGD加动量相比,这些牛顿型方法利用曲率信息的方式使其能够无缝逃离平坦区域和鞍点。

英文摘要

While first-order optimization methods such as stochastic gradient descent (SGD) are popular in machine learning (ML), they come with well-known deficiencies, including relatively-slow convergence, sensitivity to the settings of hyper-parameters such as learning rate, stagnation at high training errors, and difficulty in escaping flat regions and saddle points. These issues are particularly acute in highly non-convex settings such as those arising in neural networks. Motivated by this, there has been recent interest in second-order methods that aim to alleviate these shortcomings by capturing curvature information. In this paper, we report detailed empirical evaluations of a class of Newton-type methods, namely sub-sampled variants of trust region (TR) and adaptive regularization with cubics (ARC) algorithms, for non-convex ML problems. In doing so, we demonstrate that these methods not only can be computationally competitive with hand-tuned SGD with momentum, obtaining comparable or better generalization performance, but also they are highly robust to hyper-parameter settings. Further, in contrast to SGD with momentum, we show that the manner in which these Newton-type methods employ curvature information allows them to seamlessly escape flat regions and saddle points.