arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1701.02440 2026-06-04 cs.LG cs.NA math.NA stat.ML

Machine Learning of Linear Differential Equations using Gaussian Processes

利用高斯过程学习线性微分方程

Maziar Raissi, George Em. Karniadakis

AI总结 本文利用概率机器学习最新进展,通过高斯过程先验发现参数化的线性守恒律方程,包括常微分、偏微分、积分微分及分数阶算子。

详情
AI中文摘要

本工作利用概率机器学习最新进展,发现由参数化线性方程表达的守恒律。此类方程包括但不限于常微分、偏微分、积分微分和分数阶算子。此处,根据此类算子的特定形式修改高斯过程先验,并用于从稀疏且可能含噪声的观测中推断线性方程的参数。此类观测可能来自实验或

英文摘要

This work leverages recent advances in probabilistic machine learning to discover conservation laws expressed by parametric linear equations. Such equations involve, but are not limited to, ordinary and partial differential, integro-differential, and fractional order operators. Here, Gaussian process priors are modified according to the particular form of such operators and are employed to infer parameters of the linear equations from scarce and possibly noisy observations. Such observations may come from experiments or "black-box" computer simulations.

1709.03426 2026-06-04 cs.RO cs.SY eess.SY

Trajectory Synthesis for Fisher Information Maximization

信息最大化的轨迹合成

Andrew D. Wilson, Jarvis A. Schultz, Todd D. Murphey

AI总结 本文提出一种连续时间优化方法,通过改进Fisher信息矩阵的范数来生成局部最优轨迹,用于动态系统参数估计,实验验证显示轨迹优化显著提升了参数估计精度。

Comments 12 pages

详情
Journal ref
IEEE Transactions on Robotics, vol. 30, no. 6, pp. 1358-1370, 2014
AI中文摘要

动态系统模型参数估计可通过实验轨迹的选择得到显著提升。对于一般的非线性动态系统,找到全局最优轨迹通常不可行;然而,在给定初始模型参数估计和初始轨迹的情况下,本文提出了一种连续时间优化方法,生成在存在测量噪声下的局部最优轨迹。该优化算法旨在寻找能改进Fisher信息矩阵范数的系统轨迹。通过双臂小车装置进行数值和实验验证。在模拟中,优化轨迹使Fisher信息矩阵的最小特征值比初始轨迹提高了三个数量级。实验结果表明,优化轨迹在实践中使参数估计误差改善了一个数量级。

英文摘要

Estimation of model parameters in a dynamic system can be significantly improved with the choice of experimental trajectory. For general, nonlinear dynamic systems, finding globally "best" trajectories is typically not feasible; however, given an initial estimate of the model parameters and an initial trajectory, we present a continuous-time optimization method that produces a locally optimal trajectory for parameter estimation in the presence of measurement noise. The optimization algorithm is formulated to find system trajectories that improve a norm on the Fisher information matrix. A double-pendulum cart apparatus is used to numerically and experimentally validate this technique. In simulation, the optimized trajectory increases the minimum eigenvalue of the Fisher information matrix by three orders of magnitude compared to the initial trajectory. Experimental results show that this optimized trajectory translates to an order of magnitude improvement in the parameter estimate error in practice.

1709.02435 2026-06-04 cs.AI cs.LG cs.SE cs.SY eess.SY

An Analysis of ISO 26262: Using Machine Learning Safely in Automotive Software

ISO 26262分析:在汽车软件中安全使用机器学习

Rick Salay, Rodrigo Queiroz, Krzysztof Czarnecki

AI总结 本文分析了在汽车软件中使用机器学习对ISO 26262安全生命周期的影响,并提出适应该标准以容纳机器学习的建议。

Comments 6 pages, 3 figures

详情
AI中文摘要

机器学习(ML)在高级驾驶辅助和自动驾驶功能中的作用日益增加;然而,其在安全认证方面的充分性仍存在争议。本文分析了将ML作为实现方法对ISO 26262安全生命周期的影响,并探讨了如何解决这些问题。我们随后提供了一套建议,说明如何调整标准以适应机器学习。

英文摘要

Machine learning (ML) plays an ever-increasing role in advanced automotive functionality for driver assistance and autonomous operation; however, its adequacy from the perspective of safety certification remains controversial. In this paper, we analyze the impacts that the use of ML as an implementation approach has on ISO 26262 safety lifecycle and ask what could be done to address them. We then provide a set of recommendations on how to adapt the standard to accommodate ML.

1709.01237 2026-06-04 cs.CV cs.LG cs.NA math.NA

Newton-type Methods for Inference in Higher-Order Markov Random Fields

牛顿型方法在高阶马尔可夫随机场推断中的应用

Hariprasad Kannan, Nikos Komodakis, Nikos Paragios

AI总结 本文研究了在高阶马尔可夫随机场推断中使用牛顿型方法求解拉格朗日对偶问题的益处,提出了一种收敛性可证且高效的框架,包含Hessian矩阵构建的计算复杂度与精度的平衡策略、阻尼策略、截断策略与通用预条件器的结合,以及稀疏团势能的高效求和-乘积计算。

Comments 10 pages, 3 figures, 3 tables, CVPR 2017

详情
Journal ref
Poster at IEEE International Conference on Computer Vision and Pattern Recognition 2017
AI中文摘要

线性规划松弛是离散马尔可夫随机场MAP推断中的核心方法。正确求解拉格朗日对偶问题的能力是此类方法的关键组成部分。本文研究了使用牛顿型方法求解平滑版本问题的拉格朗日对偶问题的益处。我们探讨了其在实现更优收敛行为和更好地处理公式中的病态性质方面的能力,与一阶方法相比。我们证明了确实可以高效地应用信任区域牛顿方法,以解决广泛MAP推断问题。本文提出了一种可证收敛且高效的框架,包括(i)在Hessian矩阵构建方面计算复杂度和精度之间的良好平衡,(ii)一种有助于高效优化的阻尼策略,(iii)一种与通用共轭梯度预条件器结合的截断策略,(iv)稀疏团势能的高效求和-乘积计算。高阶马尔可夫随机场的结果展示了这种方法的潜力。

英文摘要

Linear programming relaxations are central to {\sc map} inference in discrete Markov Random Fields. The ability to properly solve the Lagrangian dual is a critical component of such methods. In this paper, we study the benefit of using Newton-type methods to solve the Lagrangian dual of a smooth version of the problem. We investigate their ability to achieve superior convergence behavior and to better handle the ill-conditioned nature of the formulation, as compared to first order methods. We show that it is indeed possible to efficiently apply a trust region Newton method for a broad range of {\sc map} inference problems. In this paper we propose a provably convergent and efficient framework that includes (i) excellent compromise between computational complexity and precision concerning the Hessian matrix construction, (ii) a damping strategy that aids efficient optimization, (iii) a truncation strategy coupled with a generic pre-conditioner for Conjugate Gradients, (iv) efficient sum-product computation for sparse clique potentials. Results for higher-order Markov Random Fields demonstrate the potential of this approach.

1607.03428 2026-06-04 cs.LG cs.SY eess.SY quant-ph stat.ML

Learning in Quantum Control: High-Dimensional Global Optimization for Noisy Quantum Dynamics

量子控制中的学习:用于噪声量子动力学的高维全局优化

Pantita Palittapongarnpim, Peter Wittek, Ehsan Zahedinejad, Shakib Vedaie, Barry C. Sanders

AI总结 本文提出使用差分进化算法解决高维量子系统中非凸优化问题,通过改进控制保真度和引入启发式方法提升计算效率,展示在量子相位估计和量子门设计中的优越性能。

Comments 32 pages, 4 figures, extension of proceedings in ESANN 2016 conference submitted to Neurocomputing

详情
Journal ref
Neurocomputing 268 (2017) 116-126
AI中文摘要

量子控制在多种量子技术中具有重要价值,如通用量子计算中的高保真门、自适应量子增强计量和超冷原子操控。尽管监督学习和强化学习广泛用于优化经典系统的控制参数,但量子参数优化主要通过基于梯度的贪心算法进行。虽然量子适应度景观通常与贪心算法兼容,但在高维量子系统中贪心算法可能表现不佳。本文采用差分进化算法克服非凸优化的停滞问题,通过平均目标函数提升噪声系统中的量子控制保真度。为减少计算成本,引入了运行终止的启发式方法和自适应搜索子空间选择。我们的实现是大规模并行和向量化的,以进一步减少运行时间。通过量子相位估计和量子门设计两个示例,我们展示了在保真度和可扩展性方面优于贪心算法的结果。

英文摘要

Quantum control is valuable for various quantum technologies such as high-fidelity gates for universal quantum computing, adaptive quantum-enhanced metrology, and ultra-cold atom manipulation. Although supervised machine learning and reinforcement learning are widely used for optimizing control parameters in classical systems, quantum control for parameter optimization is mainly pursued via gradient-based greedy algorithms. Although the quantum fitness landscape is often compatible with greedy algorithms, sometimes greedy algorithms yield poor results, especially for large-dimensional quantum systems. We employ differential evolution algorithms to circumvent the stagnation problem of non-convex optimization. We improve quantum control fidelity for noisy system by averaging over the objective function. To reduce computational cost, we introduce heuristics for early termination of runs and for adaptive selection of search subspaces. Our implementation is massively parallel and vectorized to reduce run time even further. We demonstrate our methods with two examples, namely quantum phase estimation and quantum gate design, for which we achieve superior fidelity and scalability than obtained using greedy algorithms.

1611.02862 2026-06-04 cs.CV cs.NA math.NA

The Little Engine that Could: Regularization by Denoising (RED)

那辆小引擎也能:通过去噪进行正则化(RED)

Yaniv Romano, Michael Elad, Peyman Milanfar

AI总结 本文提出了一种更强大灵活的框架,通过去噪引擎定义逆问题的正则化,以提升图像去模糊和超分辨率的性能。

详情
AI中文摘要

图像去噪是图像处理中广泛研究的问题。确实,最近高级且高效的去噪算法的出现使一些人相信现有的方法在去噪性能上已接近极限。我们能否利用这一显著成就来处理图像处理中的其他任务?最近的工作对此问题给出了肯定的回答,形式为Plug-and-Play Prior($P^3$)方法,表明通过依次应用图像去噪步骤可以处理任何逆问题。这严重依赖于ADMM优化技术,以获得这种连续去噪解释。这是否是图像处理任务中唯一能利用图像去噪引擎的方法?在本文中,我们提供了一种更强大、更灵活的框架来实现相同的目标。与$P^3$方法不同,我们提出了正则化通过去噪(RED):利用去噪引擎定义逆问题的正则化。我们提出了一种显式的图像自适应拉普拉斯基正则化函数,使整体目标函数更清晰且更明确。通过完全灵活地选择迭代优化过程来最小化上述函数,RED能够结合任何图像去噪算法,非常有效地处理一般逆问题,并保证收敛到全局最优解。我们测试了这种方法,并在图像去模糊和超分辨率问题中展示了最先进的结果。

英文摘要

Removal of noise from an image is an extensively studied problem in image processing. Indeed, the recent advent of sophisticated and highly effective denoising algorithms lead some to believe that existing methods are touching the ceiling in terms of noise removal performance. Can we leverage this impressive achievement to treat other tasks in image processing? Recent work has answered this question positively, in the form of the Plug-and-Play Prior ($P^3$) method, showing that any inverse problem can be handled by sequentially applying image denoising steps. This relies heavily on the ADMM optimization technique in order to obtain this chained denoising interpretation. Is this the only way in which tasks in image processing can exploit the image denoising engine? In this paper we provide an alternative, more powerful and more flexible framework for achieving the same goal. As opposed to the $P^3$ method, we offer Regularization by Denoising (RED): using the denoising engine in defining the regularization of the inverse problem. We propose an explicit image-adaptive Laplacian-based regularization functional, making the overall objective functional clearer and better defined. With a complete flexibility to choose the iterative optimization procedure for minimizing the above functional, RED is capable of incorporating any image denoising algorithm, treat general inverse problems very effectively, and is guaranteed to converge to the globally optimal result. We test this approach and demonstrate state-of-the-art results in the image deblurring and super-resolution problems.

1708.09347 2026-06-04 cs.RO cs.SY eess.SY

Sequential Action Control: Closed-Form Optimal Control for Nonlinear and Nonsmooth Systems

顺序动作控制:非线性与非光滑系统的闭环最优控制

Alex Ansari, Todd Murphey

AI总结 本文提出一种基于模型的算法,用于在线闭环计算非线性系统的预测最优控制。通过高阶模型和轨迹目标控制混合脉冲、欠驱动和约束系统,采用闭式表达式优化短期控制动作,保证全局唯一性和稳定性。

Comments 19 pages

详情
Journal ref
IEEE Transactions on Robotics, vol. 32, no. 5, pp. 1196-1214, 2016
AI中文摘要

本文提出了一种新的基于模型的算法,用于在线闭环计算传统上具有挑战性的非线性系统的预测最优控制。示例展示了该算法如何使用仅有的高层模型和轨迹目标来控制混合脉冲、欠驱动和约束系统。与迭代优化有限时间域控制序列不同,本文推导出个体控制动作的闭式表达式,即可以应用于短时间的控制值,以在长时间域内最优改善跟踪目标。在温和假设下,这些动作在平衡点附近变为线性反馈律,允许稳定性分析和基于性能的参数选择。全局上,最优动作保证存在性和唯一性。通过在线递推时间域的方式对这些动作进行序列化,所提出的控制器提供了一种最小-最大约束响应,避免了通常需要施加控制约束的开销。基准示例显示,该方法能够避免局部极小值,并在跟踪性能方面优于非线性最优控制器和最近的特定案例方法,且速度比传统方法快多个数量级。

英文摘要

This paper presents a new model-based algorithm that computes predictive optimal controls on-line and in closed loop for traditionally challenging nonlinear systems. Examples demonstrate the same algorithm controlling hybrid impulsive, underactuated, and constrained systems using only high-level models and trajectory goals. Rather than iteratively optimize finite horizon control sequences to minimize an objective, this paper derives a closed-form expression for individual control actions, i.e., control values that can be applied for short duration, that optimally improve a tracking objective over a long time horizon. Under mild assumptions, actions become linear feedback laws near equilibria that permit stability analysis and performance-based parameter selection. Globally, optimal actions are guaranteed existence and uniqueness. By sequencing these actions on-line, in receding horizon fashion, the proposed controller provides a min-max constrained response to state that avoids the overhead typically required to impose control constraints. Benchmark examples show the approach can avoid local minima and outperform nonlinear optimal controllers and recent, case-specific methods in terms of tracking performance, and at speeds orders of magnitude faster than traditionally achievable.

1708.08552 2026-06-04 cs.LG cs.NA math.NA stat.ML

An inexact subsampled proximal Newton-type method for large-scale machine learning

一种用于大规模机器学习的近似子采样近端牛顿型方法

Xuanqing Liu, Cho-Jui Hsieh, Jason D. Lee, Yuekai Sun

AI总结 本文提出一种快速近端牛顿型算法,通过子采样构造牛顿子问题,提升大规模优化效率,实验验证其在ℓ₁正则化逻辑回归中的优越性。

详情
AI中文摘要

我们提出了一种快速的近端牛顿型算法,用于最小化带有正则化的有限和。该算法能够在$\tilde{\mathcal{O}}(d(n + \sqrt{κd})\log(\frac{1}ε))$ FLOPS内返回一个$ε$-次优解,其中$n$是样本数,$d$是特征维度,$κ$是条件数。只要$n > d$,该方法比最先进的加速随机一阶方法更高效,后者需要$\tilde{\mathcal{O}}(d(n + \sqrt{κn})\log(\frac{1}ε))$ FLOPS。关键思想是通过子采样构造牛顿子问题,以保持目标函数的有限和结构,从而利用最近的随机一阶方法进展来求解子问题。实验结果验证了所提算法在真实数据集上的ℓ₁正则化逻辑回归任务中优于先前算法。

英文摘要

We propose a fast proximal Newton-type algorithm for minimizing regularized finite sums that returns an $ε$-suboptimal point in $\tilde{\mathcal{O}}(d(n + \sqrt{κd})\log(\frac{1}ε))$ FLOPS, where $n$ is number of samples, $d$ is feature dimension, and $κ$ is the condition number. As long as $n > d$, the proposed method is more efficient than state-of-the-art accelerated stochastic first-order methods for non-smooth regularizers which requires $\tilde{\mathcal{O}}(d(n + \sqrt{κn})\log(\frac{1}ε))$ FLOPS. The key idea is to form the subsampled Newton subproblem in a way that preserves the finite sum structure of the objective, thereby allowing us to leverage recent developments in stochastic first-order methods to solve the subproblem. Experimental results verify that the proposed algorithm outperforms previous algorithms for $\ell_1$-regularized logistic regression on real datasets.

1708.07850 2026-06-04 cs.LG cs.CV cs.NA math.NA

Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications

结构低秩矩阵分解:全局最优性、算法与应用

Benjamin D. Haeffele, Rene Vidal

AI总结 本文提出一种适用于大规模数据集的矩阵分解技术,通过特定正则化形式捕捉额外结构,证明在因子规模足够时局部极小值即为全局极小值,并展示在神经钙成像视频分割和高光谱压缩恢复中的优势。

详情
AI中文摘要

近年来,低秩矩阵分解问题凸形式在机器学习中受到广泛关注。然而,此类形式往往需要求解与数据矩阵同样大小的矩阵,难以应用于大规模数据集。此外,在许多应用中,数据可能表现出超越单纯低秩的结构,例如图像和视频呈现复杂的时空结构,而标准低秩方法大多忽略这些结构。本文研究了一种适用于大规模数据集的矩阵分解技术,通过特定形式的正则化捕捉额外结构,该正则化包括总变分和核范数等已知正则化器作为特例。尽管所得优化问题非凸,我们证明在因子规模足够时,若满足某些条件,则因子的任何局部极小值即为全局极小值。此外,本文还提供了几种实用算法来解决矩阵分解问题,并推导了近似解到全局最优解距离的界。神经钙成像视频分割和高光谱压缩恢复的示例展示了该方法在高维数据集中的优势。

英文摘要

Recently, convex formulations of low-rank matrix factorization problems have received considerable attention in machine learning. However, such formulations often require solving for a matrix of the size of the data matrix, making it challenging to apply them to large scale datasets. Moreover, in many applications the data can display structures beyond simply being low-rank, e.g., images and videos present complex spatio-temporal structures that are largely ignored by standard low-rank methods. In this paper we study a matrix factorization technique that is suitable for large datasets and captures additional structure in the factors by using a particular form of regularization that includes well-known regularizers such as total variation and the nuclear norm as particular cases. Although the resulting optimization problem is non-convex, we show that if the size of the factors is large enough, under certain conditions, any local minimizer for the factors yields a global minimizer. A few practical algorithms are also provided to solve the matrix factorization problem, and bounds on the distance from a given approximate solution of the optimization problem to the global optimum are derived. Examples in neural calcium imaging video segmentation and hyperspectral compressed recovery show the advantages of our approach on high-dimensional datasets.

1708.06345 2026-06-04 cs.RO cs.SY eess.SY

Robust Optimal Planning and Control of Non-Periodic Bipedal Locomotion with A Centroidal Momentum Model

非周期双足运动的鲁棒最优规划与控制:基于质心动量模型

Ye Zhao, Benito R. Fernandez, Luis Sentis

AI总结 本文提出基于质心动量动力学的混合相空间规划与控制方法,通过非周期关键帧状态的鲁棒跟踪实现双足运动的稳健控制,重点解决非周期步态生成和扰动鲁棒性问题。

Comments 43 pages, 22 figures, journal, International Journal of Robotics Research, 2017. arXiv admin note: substantial text overlap with arXiv:1701.05929, arXiv:1511.04628

详情
AI中文摘要

本文提出基于质心动量动力学的混合相空间规划与控制方法,通过非周期关键帧状态的鲁棒跟踪实现双足运动的稳健控制,重点解决非周期步态生成和扰动鲁棒性问题。

英文摘要

This study presents a theoretical method for planning and controlling agile bipedal locomotion based on robustly tracking a set of non-periodic keyframe states. Based on centroidal momentum dynamics, we formulate a hybrid phase-space planning and control method which includes the following key components: (i) a step transition solver that enables dynamically tracking non-periodic keyframe states over various types of terrains, (ii) a robust hybrid automaton to effectively formulate planning and control algorithms, (iii) a steering direction model to control the robot's heading, (iv) a phase-space metric to measure distance to the planned locomotion manifolds, and (v) a hybrid control method based on the previous distance metric to produce robust dynamic locomotion under external disturbances. Compared to other locomotion methodologies, we have a large focus on non-periodic gait generation and robustness metrics to deal with disturbances. Such focus enables the proposed control method to robustly track non-periodic keyframe states over various challenging terrains and under external disturbances as illustrated through several simulations.

1708.04790 2026-06-04 cs.RO cs.SY eess.SY

Evaluation of Human-Robot Collaboration Models for Fluent Operations in Industrial Tasks

工业任务中流畅操作的人机协作模型评估

Lior Sayfeld, Ygal Peretz, Roy Someshwar, Yael Edan

AI总结 本文评估了人机协作模型在工业任务中的性能,比较了基于定时和传感器的模型与自适应模型,发现自适应模型在总装配时间和空闲时间上有显著提升。

Comments Robotics: Science and Systems, Human-Robot Hand-Over Workshop 2015

详情
AI中文摘要

在本研究中,我们评估了集成人机操作系统中的人机协作模型。设计了一个包含协作机器人臂和人类工人的集成工作单元,用于执行实时装配任务。80名22-27岁的工业工程学生参与了实验,比较了基于定时和传感器的模型与在此框架内开发的自适应模型。性能指标包括总装配时间和总空闲时间。结果显示,自适应系统显著提高了所检查的参数,并与基于定时和传感器的模型相比,总装配时间减少了7%,总空闲时间减少了60%。

英文摘要

In this study we evaluated human-robot collaboration models in an integrated human-robot operational system. An integrated work cell which includes a robotic arm working collaboratively with a human worker was specially designed for executing a real-time assembly task. Eighty industrial engineering students aged 22-27 participated in experiments in which timing and sensor based models were compared to an adaptive model developed within this framework. Performance measures included total assembly time and total idle time. The results showed conclusively that the adaptive system improved the examined parameters and provided an improvement of 7% in total assembly time and 60% in total idle time when compared to timing and sensory based models.

1708.03366 2026-06-04 cs.LG cs.AI cs.CR cs.SY eess.SY

Resilient Linear Classification: An Approach to Deal with Attacks on Training Data

鲁棒线性分类:一种应对训练数据攻击的方法

Sangdon Park, James Weimer, Insup Lee

AI总结 本文提出一种鲁棒线性分类方法,通过引入多数约束,提高对抗训练数据攻击的鲁棒性,验证了传统算法在攻击下的脆弱性。

Comments Accepted as a conference paper at ICCPS17

详情
AI中文摘要

数据驱动技术用于控制自动驾驶车辆、处理能源管理的需求响应以及建模人体生理学用于医疗设备。这些技术从训练数据中提取模型,其性能通常基于训练数据中的随机误差进行分析。然而,如果训练数据被攻击者恶意篡改,这些攻击对数据驱动CPS底层学习算法的影响尚未被考虑。本文分析了分类算法对训练数据攻击的鲁棒性。具体而言,提出了一种通用度量标准,用于衡量分类算法对训练数据最坏情况篡改的鲁棒性。使用该度量标准,我们显示传统线性分类算法在受限条件下具有鲁棒性。为克服这些限制,我们提出了一种具有多数约束的线性分类算法,并证明其比传统算法更鲁棒。在合成数据和一个现实世界的回顾性心律失常医疗案例研究中的评估显示,传统算法对篡改的训练数据易受攻击,而所提算法更具鲁棒性(以最坏情况篡改衡量)。

英文摘要

Data-driven techniques are used in cyber-physical systems (CPS) for controlling autonomous vehicles, handling demand responses for energy management, and modeling human physiology for medical devices. These data-driven techniques extract models from training data, where their performance is often analyzed with respect to random errors in the training data. However, if the training data is maliciously altered by attackers, the effect of these attacks on the learning algorithms underpinning data-driven CPS have yet to be considered. In this paper, we analyze the resilience of classification algorithms to training data attacks. Specifically, a generic metric is proposed that is tailored to measure resilience of classification algorithms with respect to worst-case tampering of the training data. Using the metric, we show that traditional linear classification algorithms are resilient under restricted conditions. To overcome these limitations, we propose a linear classification algorithm with a majority constraint and prove that it is strictly more resilient than the traditional algorithms. Evaluations on both synthetic data and a real-world retrospective arrhythmia medical case-study show that the traditional algorithms are vulnerable to tampered training data, whereas the proposed algorithm is more resilient (as measured by worst-case tampering).

1708.03055 2026-06-04 cs.RO cs.SY eess.SY

Optimal Control for Constrained Coverage Path Planning

受约束覆盖路径规划的最优控制

Ankit Manerikar, Debasmit Das, Pranay Banerjee

AI总结 本文研究了在地图中存在障碍物约束下,如何通过改进线性扫掠覆盖方法实现最小能量/时间最优和最大面积覆盖,并分析不同参数对性能的影响。

Comments Report for AAE 568 (Applied Optimal Control) at Purdue

详情
AI中文摘要

受约束覆盖路径规划问题涉及机器人在地图中存在某些作为障碍物出现的约束条件下,试图覆盖最大面积。在多种覆盖路径规划方法中,我们考虑将线性扫掠覆盖方法进行增强,以实现最小能量/时间最优的同时实现最大面积覆盖。此外,我们还研究了不同参数变化对改进方法性能的影响。

英文摘要

The problem of constrained coverage path planning involves a robot trying to cover maximum area of an environment under some constraints that appear as obstacles in the map. Out of the several coverage path planning methods, we consider augmenting the linear sweep-based coverage method to achieve minimum energy/ time optimality along with maximum area coverage. In addition, we also study the effects of variation of different parameters on the performance of the modified method.

1608.02193 2026-06-04 cs.AI cs.SY eess.SY

Spacetimes with Semantics (III) - The Structure of Functional Knowledge Representation and Artificial Reasoning

具有语义的空间(III)- 功能知识表示与人工推理的结构

Mark Burgess

AI总结 本文探讨了知识表示作为语义系统的结构,基于承诺理论框架,提出概念、关联知识和情境意识的解释,强调语义时空属性对学习和智能系统的影响。

Comments 122 pages, builiding on parts I and II Minor updates and corrections added to current version

详情
AI中文摘要

利用先前发展的语义时空概念,本文在承诺理论框架内探讨了知识表示及其结构作为语义系统的解释。通过为现象赋予解释,从观察者到被观察者,可以接近基于功能的系统简单描述,并具有直接实用价值。重点在于概念、关联知识和情境意识的解释。推断认为,大多数或所有这些概念源于纯粹的语义时空属性,这为更广泛理解学习或智能系统的构成提供了可能。一些关键原则浮现:1)时空尺度分离,2)四种不可约简关联类型的重复出现,通过意图传播:聚合、因果、合作和相似性,3)身份的辨别需求(离散),通过区分时间线同时性与顺序事件,4)学习(记忆)的能力。至少合理推测,涌现的知识抽象能力起源于基本的时空结构。这些笔记呈现了大部分已知结果的统一观点;它们使信息模型、知识表示、机器学习和语义网络(运输和信息基础)在共同框架下得以理解。'智能空间'的概念涵盖了人工系统和生物系统,跨越许多不同尺度,例如智能城市和组织。

英文摘要

Using the previously developed concepts of semantic spacetime, I explore the interpretation of knowledge representations, and their structure, as a semantic system, within the framework of promise theory. By assigning interpretations to phenomena, from observers to observed, we may approach a simple description of knowledge-based functional systems, with direct practical utility. The focus is especially on the interpretation of concepts, associative knowledge, and context awareness. The inference seems to be that most if not all of these concepts emerge from purely semantic spacetime properties, which opens the possibility for a more generalized understanding of what constitutes a learning, or even `intelligent' system. Some key principles emerge for effective knowledge representation: 1) separation of spacetime scales, 2) the recurrence of four irreducible types of association, by which intent propagates: aggregation, causation, cooperation, and similarity, 3) the need for discrimination of identities (discrete), which is assisted by distinguishing timeline simultaneity from sequential events, and 4) the ability to learn (memory). It is at least plausible that emergent knowledge abstraction capabilities have their origin in basic spacetime structures. These notes present a unified view of mostly well-known results; they allow us to see information models, knowledge representations, machine learning, and semantic networking (transport and information base) in a common framework. The notion of `smart spaces' thus encompasses artificial systems as well as living systems, across many different scales, e.g. smart cities and organizations.

1603.04179 2026-06-04 cs.SD cs.IT cs.SY eess.SY math.IT

Performance Analysis of Source Image Estimators in Blind Source Separation

源图像估计器在盲源分离中的性能分析

Zbyněk Koldovský, Francesco Nesta

AI总结 本文分析了两种常用的传感器响应计算方法,探讨其在正交信号子空间下的等价性及应用差异。

Comments 24 pages

详情
AI中文摘要

盲方法通常在未知缩放因子下分离或识别信号或信号子空间。有时需要处理缩放模糊性,可通过重建传感器接收到的信号来解决,因为传感器响应的尺度有已知的物理解释。本文分析了两种广泛用于计算传感器响应的方法,特别是频域独立成分分析。一种方法是最小二乘投影,另一种假设正则混叠矩阵并计算其逆。这两种估计器对未知缩放因子具有不变性。尽管经常使用,但它们的差异尚未被研究。本文的目标是填补这一空白。通过理论研究、扰动分析和模拟比较这两种估计器。我们指出,当分离的信号子空间正交时,估计器等价,反之亦然。展示了两个应用,其中一个案例显示估计器产生显著不同的结果。

英文摘要

Blind methods often separate or identify signals or signal subspaces up to an unknown scaling factor. Sometimes it is necessary to cope with the scaling ambiguity, which can be done through reconstructing signals as they are received by sensors, because scales of the sensor responses (images) have known physical interpretations. In this paper, we analyze two approaches that are widely used for computing the sensor responses, especially, in Frequency-Domain Independent Component Analysis. One approach is the least-squares projection, while the other one assumes a regular mixing matrix and computes its inverse. Both estimators are invariant to the unknown scaling. Although frequently used, their differences were not studied yet. A goal of this work is to fill this gap. The estimators are compared through a theoretical study, perturbation analysis and simulations. We point to the fact that the estimators are equivalent when the separated signal subspaces are orthogonal, and vice versa. Two applications are shown, one of which demonstrates a case where the estimators yield substantially different results.

1707.08689 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Multi-Robot Transfer Learning: A Dynamical System Perspective

多机器人迁移学习:动态系统视角

Mohamed K. Helwa, Angela P. Schoellig

AI总结 本文从动态系统角度研究多机器人迁移学习中的最优转移映射性质,提出无需详细动力学知识的算法,通过实验验证该算法在四旋翼平台间迁移学习中减少60-70%的误差。

Comments 7 pages, 6 figures, accepted at the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems

详情
AI中文摘要

多机器人迁移学习允许一个机器人利用第二个相似机器人生成的数据来改进自身行为。潜在优势是减少训练时间并降低训练阶段不可避免的风险。迁移学习算法旨在找到不同机器人之间的最优转移映射。本文通过单输入单输出(SISO)系统的理论研究,探讨了此类最优转移映射的性质。我们首先证明最优迁移学习映射通常是一个动态系统。本文的主要贡献是提供一种确定该最优动态映射性质的算法,包括其阶数和回归器(即它所依赖的变量)。所提出的算法不需要详细的机器人动力学知识,但依赖于通过简单实验测试可获得的基本系统属性。我们通过两个不同四旋翼平台间的迁移学习示例验证了所提算法。实验结果表明,通过我们的算法获得的最优动态映射在减少迁移学习误差方面比直接转移数据或使用最优静态映射的情况减少了60-70%。

英文摘要

Multi-robot transfer learning allows a robot to use data generated by a second, similar robot to improve its own behavior. The potential advantages are reducing the time of training and the unavoidable risks that exist during the training phase. Transfer learning algorithms aim to find an optimal transfer map between different robots. In this paper, we investigate, through a theoretical study of single-input single-output (SISO) systems, the properties of such optimal transfer maps. We first show that the optimal transfer learning map is, in general, a dynamic system. The main contribution of the paper is to provide an algorithm for determining the properties of this optimal dynamic map including its order and regressors (i.e., the variables it depends on). The proposed algorithm does not require detailed knowledge of the robots' dynamics, but relies on basic system properties easily obtainable through simple experimental tests. We validate the proposed algorithm experimentally through an example of transfer learning between two different quadrotor platforms. Experimental results show that an optimal dynamic map, with correct properties obtained from our proposed algorithm, achieves 60-70% reduction of transfer learning error compared to the cases when the data is directly transferred or transferred using an optimal static map.

1707.08369 2026-06-04 cs.LG cs.NA math.NA

Updating Singular Value Decomposition for Rank One Matrix Perturbation

针对秩一矩阵扰动的奇异值分解更新

Ratnik Gandhi, Amoli Rajgor

AI总结 本文提出一种高效算法,用于在O(n² log(1/ε))时间内更新秩一扰动矩阵的奇异值分解,利用快速多极子方法在O(n log(1/ε))时间内更新奇异向量。

详情
AI中文摘要

一个高效的奇异值分解(SVD)算法是大数据问题中分布式和流式计算的重要工具。观察到秩一扰动矩阵的奇异向量更新类似于Cauchy矩阵-向量乘积。基于此观察,本文提出一种高效的算法,用于在O(n² log(1/ε))时间内更新秩一扰动矩阵的SVD。该方法利用快速多极子方法(FMM)在O(n log(1/ε))时间内更新奇异向量,其中ε是计算精度。

英文摘要

An efficient Singular Value Decomposition (SVD) algorithm is an important tool for distributed and streaming computation in big data problems. It is observed that update of singular vectors of a rank-1 perturbed matrix is similar to a Cauchy matrix-vector product. With this observation, in this paper, we present an efficient method for updating Singular Value Decomposition of rank-1 perturbed matrix in $O(n^2 \ \text{log}(\frac{1}ε))$ time. The method uses Fast Multipole Method (FMM) for updating singular vectors in $O(n \ \text{log} (\frac{1}ε))$ time, where $ε$ is the precision of computation.

1610.06283 2026-06-04 cs.RO cs.LG cs.NE cs.SY eess.SY

Deep Neural Networks for Improved, Impromptu Trajectory Tracking of Quadrotors

用于四旋翼机即时轨迹跟踪的深度神经网络

Qiyang Li, Jingxing Qian, Zining Zhu, Xuchan Bao, Mohamed K. Helwa, Angela P. Schoellig

AI总结 本文提出基于深度神经网络的算法,通过提供定制参考输入提升经典反馈控制器的轨迹跟踪性能,实验表明该方法能有效减少跟踪误差,适用于实时轨迹跟踪应用。

Comments 7 pages, 8 figures. Accepted final version. To appear in the proc. of the 2017 IEEE International Conference on Robotics and Automation

详情
AI中文摘要

四旋翼机的轨迹跟踪控制对于应用范围从勘测和检查到影视制作都至关重要。然而,设计和调优经典控制器,如比例-积分-微分(PID)控制器,以实现高跟踪精度可能耗时且困难,由于隐藏动态和其他非理想因素。深度神经网络(DNN)凭借其卓越的近似抽象、非线性函数的能力,提出了一种增强轨迹跟踪控制的新方法。本文提出了一种基于DNN的算法作为附加模块,以提高经典反馈控制器的跟踪性能。给定期望轨迹,DNNs根据其获得的经验为控制器提供定制参考输入。输入旨在实现期望轨迹与输出轨迹之间的单位映射。这项工作的动机是交互式“画即飞”应用,用户在移动设备上绘制轨迹,四旋翼机即时飞越该轨迹,通过DNN增强的控制系统。实验结果表明,所提出的方法在DNNs在选定的周期轨迹上训练后,能够提高用户绘制轨迹的跟踪精度,表明该方法在现实应用中的潜力。跟踪误差在训练和测试轨迹上分别减少约40-50%,突显了DNNs在知识泛化方面的能力。

英文摘要

Trajectory tracking control for quadrotors is important for applications ranging from surveying and inspection, to film making. However, designing and tuning classical controllers, such as proportional-integral-derivative (PID) controllers, to achieve high tracking precision can be time-consuming and difficult, due to hidden dynamics and other non-idealities. The Deep Neural Network (DNN), with its superior capability of approximating abstract, nonlinear functions, proposes a novel approach for enhancing trajectory tracking control. This paper presents a DNN-based algorithm as an add-on module that improves the tracking performance of a classical feedback controller. Given a desired trajectory, the DNNs provide a tailored reference input to the controller based on their gained experience. The input aims to achieve a unity map between the desired and the output trajectory. The motivation for this work is an interactive "fly-as-you-draw" application, in which a user draws a trajectory on a mobile device, and a quadrotor instantly flies that trajectory with the DNN-enhanced control system. Experimental results demonstrate that the proposed approach improves the tracking precision for user-drawn trajectories after the DNNs are trained on selected periodic trajectories, suggesting the method's potential in real-world applications. Tracking errors are reduced by around 40-50% for both training and testing trajectories from users, highlighting the DNNs' capability of generalizing knowledge.

1707.05828 2026-06-04 cs.LG cs.NA math.NA

A deep learning approach to diabetic blood glucose prediction

基于深度学习的糖尿病血血糖预测方法

H. N. Mhaskar, S. V. Pereverzyev, M. D. van der Walt

AI总结 本文提出利用深度学习对糖尿病患者血糖进行30分钟预测,通过选取部分患者数据训练模型,验证深度学习在该任务中的优越性,并展示如何利用领域知识构建简洁的深度表示。

详情
Journal ref
Front. Appl. Math. Stat., 14 July 2017
AI中文摘要

我们考虑利用连续血糖监测设备测量的血糖水平进行30分钟预测,使用临床数据。虽然大多数此类研究针对单个患者,我们选取数据集中的一定比例患者作为训练数据,其余作为测试对象;即模型无需在数据集中的新患者上重新校准。我们展示了深度学习如何在该示例中优于浅层网络。一个创新点是展示如何利用领域知识构建简洁的深度表示。

英文摘要

We consider the question of 30-minute prediction of blood glucose levels measured by continuous glucose monitoring devices, using clinical data. While most studies of this nature deal with one patient at a time, we take a certain percentage of patients in the data set as training data, and test on the remainder of the patients; i.e., the machine need not re-calibrate on the new patients in the data set. We demonstrate how deep learning can outperform shallow networks in this example. One novelty is to demonstrate how a parsimonious deep representation can be constructed using domain knowledge.

1605.07246 2026-06-04 cs.LG cs.AI cs.NA math.NA

Adaptive ADMM with Spectral Penalty Parameter Selection

自适应ADMM与谱惩罚参数选择

Zheng Xu, Mario A. T. Figueiredo, Tom Goldstein

AI总结 本文提出自适应ADMM算法,通过自适应调整惩罚参数实现快速收敛,提高算法鲁棒性与易用性。

Comments AISTATS 2017

详情
AI中文摘要

交替方向乘子法(ADMM)是一种解决广泛约束优化问题的 versatile 工具,适用于可微或非可微的目标函数。不幸的是,其性能高度敏感于惩罚参数,使ADMM往往不可靠且难以自动化。我们通过提出自适应调整惩罚参数的方法来克服这一缺点,得到的自适应ADMM(AADMM)算法受成功Barzilai-Borwein谱方法启发,实现快速收敛和对初始步长和问题规模的相对不敏感性。

英文摘要

The alternating direction method of multipliers (ADMM) is a versatile tool for solving a wide range of constrained optimization problems, with differentiable or non-differentiable objective functions. Unfortunately, its performance is highly sensitive to a penalty parameter, which makes ADMM often unreliable and hard to automate for a non-expert user. We tackle this weakness of ADMM by proposing a method to adaptively tune the penalty parameters to achieve fast convergence. The resulting adaptive ADMM (AADMM) algorithm, inspired by the successful Barzilai-Borwein spectral method for gradient descent, yields fast convergence and relative insensitivity to the initial stepsize and problem scaling.

1707.05458 2026-06-04 cs.RO cs.SY eess.SY

Stabilization Control of the Differential Mobile Robot Using Lyapunov Function and Extended Kalman Filter

微分移动机器人稳定控制的Lyapunov函数与扩展卡尔曼滤波应用

T. T. Hoang, P. M. Duong, N. T. T. Van, T. Q. Vinh

AI总结 本文设计了导航微分移动机器人到达目标位置的控制模型,采用扩展卡尔曼滤波进行状态估计,结合Lyapunov函数实现稳定控制,确保闭环系统渐近稳定与鲁棒性。

Comments arXiv admin note: text overlap with arXiv:1611.07112, arXiv:1611.07114

详情
Journal ref
Journal of Science and Technology, pp.441-452, Vol. 50 no.4, 2012
AI中文摘要

本文提出了一种控制模型,用于使微分移动机器人从任意初始姿态导航至目标位置。该模型分为两个阶段:状态估计和稳定控制。在状态估计中,采用扩展卡尔曼滤波来最优结合系统动力学和测量信息。构造了两个Lyapunov函数,允许混合反馈控制律执行机器人运动。闭环系统的渐近稳定性和鲁棒性得到保证。通过仿真和实验验证了所提方法的有效性和实用性。

英文摘要

This paper presents the design of a control model to navigate the differential mobile robot to reach the desired destination from an arbitrary initial pose. The designed model is divided into two stages: the state estimation and the stabilization control. In the state estimation, an extended Kalman filter is employed to optimally combine the information from the system dynamics and measurements. Two Lyapunov functions are constructed that allow a hybrid feedback control law to execute the robot movements. The asymptotical stability and robustness of the closed loop system are assured. Simulations and experiments are carried out to validate the effectiveness and applicability of the proposed approach.

1707.05456 2026-06-04 cs.RO cs.NI cs.SY eess.SY

Control of an Internet-based Robot System Using the Real-time Transport Protocol

基于实时传输协议的互联网机器人系统控制

P. M. Duong, T. T. Hoang, T. Q. Vinh

AI总结 本文提出利用实时传输协议替代传统TCP和UDP进行机器人系统控制,通过理论分析、仿真和实验验证其可行性与有效性。

Comments in Proceeding of The 5th Vietnam Conference on Mechatronics, Ho chi minh City, Vietnam, 2010

详情
AI中文摘要

本文介绍了一种新的方法,用于在互联网上控制机器人系统。实时传输协议(RTP)被用作通信协议,而不是传统上使用TCP和UDP。理论分析、仿真研究和实验实现已执行,以评估所提出方法在实际应用中的可行性和有效性。

英文摘要

In this paper, we introduce a novel approach in controlling robot systems over the Internet. The Real-time Transport Protocol (RTP) is used as the communication protocol instead of traditionally using TCP and UDP. The theoretic analyses, the simulation studies and the experimental implementation have been performed to evaluate the feasibility and effectiveness of the proposed approach for practical uses.

1705.05065 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

AirSim:面向自动驾驶车辆的高保真视觉与物理模拟

Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

AI总结 本文提出基于Unreal引擎的AirSim模拟器,用于高效开发和测试自动驾驶算法,支持高频率物理模拟和多种协议,通过四旋翼实验验证其有效性。

Comments Accepted for Field and Service Robotics conference 2017 (FSR 2017)

详情
AI中文摘要

为自动驾驶车辆开发和测试算法在现实世界中成本高且耗时。为利用最新机器智能和深度学习进展,需收集大量标注训练数据。本文提出基于Unreal引擎的新模拟器,提供真实的物理和视觉模拟。模拟器包含可实现实时硬件在环(HITL)模拟的物理引擎,支持MavLink等流行协议。模拟器从零开始设计,可扩展以适应新车辆类型、硬件平台和软件协议。模块化设计使各组件可独立用于其他项目。通过实现四旋翼自动驾驶车辆并实验性比较软件组件与真实飞行,验证了模拟器的有效性。

英文摘要

Developing and testing algorithms for autonomous vehicles in real world is an expensive and time consuming process. Also, in order to utilize recent advances in machine intelligence and deep learning we need to collect a large amount of annotated training data in a variety of conditions and environments. We present a new simulator built on Unreal Engine that offers physically and visually realistic simulations for both of these goals. Our simulator includes a physics engine that can operate at a high frequency for real-time hardware-in-the-loop (HITL) simulations with support for popular protocols (e.g. MavLink). The simulator is designed from the ground up to be extensible to accommodate new types of vehicles, hardware platforms and software protocols. In addition, the modular design enables various components to be easily usable independently in other projects. We demonstrate the simulator by first implementing a quadrotor as an autonomous vehicle and then experimentally comparing the software components with real-world flights.

1707.02515 2026-06-04 cs.AI cs.LG cs.SY eess.SY

A Fast Integrated Planning and Control Framework for Autonomous Driving via Imitation Learning

一种通过模仿学习的快速集成规划与控制系统用于自动驾驶

Liting Sun, Cheng Peng, Wei Zhan, Masayoshi Tomizuka

AI总结 本文提出一种结合学习与优化方法的两层框架,通过神经网络学习长期最优策略并结合短期优化控制器提升自动驾驶的安全性和效率。

详情
AI中文摘要

为实现自动驾驶中的安全高效规划与控制,需要一种能够长期 horizon 内实现良好驾驶质量且保证安全可行的驾驶策略。基于优化的方法,如模型预测控制(MPC),可以提供此类最优策略,但其计算复杂度通常无法满足实时实现的需求。为解决此问题,我们提出了一种快速集成规划与控制系统,该系统通过在两层分层结构中结合学习与优化方法。第一层定义为“策略层”,由神经网络建立,学习由MPC生成的长期最优驾驶策略。第二层称为“执行层”,是一个基于优化的短期控制器,能够跟踪由“策略层”提供的参考轨迹,并保证短期的安全性和可行性。此外,通过高效且高度代表性的特征,小尺寸的神经网络足以处理许多复杂的驾驶场景。这使得在线模仿学习与数据集聚合(DAgger)成为可能,从而能够快速且持续地提升“策略层”的性能。几个驾驶场景的例子被演示以验证所提框架的有效性和效率。

英文摘要

For safe and efficient planning and control in autonomous driving, we need a driving policy which can achieve desirable driving quality in long-term horizon with guaranteed safety and feasibility. Optimization-based approaches, such as Model Predictive Control (MPC), can provide such optimal policies, but their computational complexity is generally unacceptable for real-time implementation. To address this problem, we propose a fast integrated planning and control framework that combines learning- and optimization-based approaches in a two-layer hierarchical structure. The first layer, defined as the "policy layer", is established by a neural network which learns the long-term optimal driving policy generated by MPC. The second layer, called the "execution layer", is a short-term optimization-based controller that tracks the reference trajecotries given by the "policy layer" with guaranteed short-term safety and feasibility. Moreover, with efficient and highly-representative features, a small-size neural network is sufficient in the "policy layer" to handle many complicated driving scenarios. This renders online imitation learning with Dataset Aggregation (DAgger) so that the performance of the "policy layer" can be improved rapidly and continuously online. Several exampled driving scenarios are demonstrated to verify the effectiveness and efficiency of the proposed framework.

1707.02201 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Learning human behaviors from motion capture by adversarial imitation

通过对抗模仿学习学习人类行为

Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess

AI总结 本文提出利用生成对抗模仿学习训练神经网络策略,从有限的不完全观测状态特征中生成人类化运动模式,即使演示来自不同物理参数的躯体,也能通过子技能策略解决任务。

详情
AI中文摘要

深度强化学习的快速进展使训练高维人形身体控制器变得越来越可行。然而,纯强化学习方法使用简单的奖励函数往往会产生非人类化且过于刻板的运动行为。在本文中,我们扩展了生成对抗模仿学习,以使训练通用神经网络策略成为可能,从而从仅包含部分观测状态特征的有限演示中生成人类化运动模式,即使在没有动作信息且演示来自具有不同且未知物理参数的躯体时也是如此。我们利用这种方法从动作捕捉数据构建子技能策略,并展示这些策略在由更高层次控制器控制时可以用于解决任务。

英文摘要

Rapid progress in deep reinforcement learning has made it increasingly feasible to train controllers for high-dimensional humanoid bodies. However, methods that use pure reinforcement learning with simple reward functions tend to produce non-humanlike and overly stereotyped movement behaviors. In this work, we extend generative adversarial imitation learning to enable training of generic neural network policies to produce humanlike movement patterns from limited demonstrations consisting only of partially observed state features, without access to actions, even when the demonstrations come from a body with different and unknown physical parameters. We leverage this approach to build sub-skill policies from motion capture data and show that they can be reused to solve tasks when controlled by a higher level controller.

1705.09415 2026-06-04 cs.RO cs.SY eess.SY

Near-Optimal Belief Space Planning via T-LQG

通过T-LQG实现接近最优的信念空间规划

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结 本文提出T-LQG方法,用于非线性机器人系统在观测和运动不确定性下的规划问题,提供近优反馈控制策略,解决POMDP问题。

Comments 3 pages, 3 figures, In Robotics: Science and Systems (RSS) 2017 Workshop of "POMDPs in Robotics: State of The Art, Challenges, and Opportunities"

详情
AI中文摘要

本文考虑了在观测和运动不确定性下非线性机器人系统的规划问题。通常将此问题形式化为部分观察马尔可夫决策过程(POMDP),确定最优解在计算上是不可行的。我们提出了一种轨迹优化线性二次高斯(T-LQG)方法,该方法能够为POMDP问题提供可量化近优的解决方案。我们提供了一个新的'分离原理',用于设计一个最优的开环轨迹随后是最佳反馈控制律,这为涉及多项式阶数最小阶数的信念空间规划问题提供了一个近优反馈控制策略。

英文摘要

We consider the problem of planning under observation and motion uncertainty for nonlinear robotics systems. Determining the optimal solution to this problem, generally formulated as a Partially Observed Markov Decision Process (POMDP), is computationally intractable. We propose a Trajectory-optimized Linear Quadratic Gaussian (T-LQG) approach that leads to quantifiably near-optimal solutions for the POMDP problem. We provide a novel "separation principle" for the design of an optimal nominal open-loop trajectory followed by an optimal feedback control law, which provides a near-optimal feedback control policy for belief space planning problems involving a polynomial order of calculations of minimum order.

1611.05963 2026-06-04 cs.CV cs.NA math.NA

Reweighted Low-Rank Tensor Decomposition based on t-SVD and its Applications in Video Denoising

基于t-SVD的加权低秩张量分解及其在视频去噪中的应用

M. Baburaj, Sudhish N. George

AI总结 本文提出基于t-SVD的加权低秩张量分解方法,通过改进张量多秩和稀疏成分恢复,提升视频去噪性能。

Comments Algorithm 1 is inefficient since line 2 is processed n 3 times need to be changed There are inconsistent notations throughout the manuscript Unitary Tensor are not defined

详情
AI中文摘要

基于t-SVD的张量鲁棒主成分分析(TRPCA)通过同时最小化张量核范数和l1范数,将低秩多线性信号分解为低多秩和稀疏成分。但当信号多秩较大或噪声较多时,TRPCA性能下降。为解决此问题,本文提出一种新的高效迭代加权张量分解方案,显著提升TRPCA的张量多秩。此外,通过加权l1范数恢复张量稀疏成分,提高分解精度。通过应用于视频去噪问题,实验结果表明所提算法优于其他方法。

英文摘要

The t-SVD based Tensor Robust Principal Component Analysis (TRPCA) decomposes low rank multi-linear signal corrupted by gross errors into low multi-rank and sparse component by simultaneously minimizing tensor nuclear norm and l 1 norm. But if the multi-rank of the signal is considerably large and/or large amount of noise is present, the performance of TRPCA deteriorates. To overcome this problem, this paper proposes a new efficient iterative reweighted tensor decomposition scheme based on t-SVD which significantly improves tensor multi-rank in TRPCA. Further, the sparse component of the tensor is also recovered by reweighted l 1 norm which enhances the accuracy of decomposition. The effectiveness of the proposed method is established by applying it to the video denoising problem and the experimental results reveal that the proposed algorithm outperforms its counterparts.

1707.01945 2026-06-04 cs.LG cs.NA math.NA stat.ML

Simple Classification using Binary Data

基于二进制数据的简单分类

Deanna Needell, Rayan Saab, Tina Woolf

AI总结 本文研究了从二进制数据进行分类的问题,提出了一种计算和资源消耗低的框架,并通过实验和理论分析验证其有效性。

详情
AI中文摘要

二进制数据在许多应用中自然出现,并在硬件实现和算法设计中具有吸引力。本文研究了从二进制数据进行分类的问题,提出了一种计算和资源消耗低的框架。我们通过 stylized 和 realistic 的数值实验展示了所提方法的实用性,并为简单情况提供了理论分析。我们希望我们的框架和分析能为研究类似方法提供基础。

英文摘要

Binary, or one-bit, representations of data arise naturally in many applications, and are appealing in both hardware implementations and algorithm design. In this work, we study the problem of data classification from binary data and propose a framework with low computation and resource costs. We illustrate the utility of the proposed approach through stylized and realistic numerical experiments, and provide a theoretical analysis for a simple case. We hope that our framework and analysis will serve as a foundation for studying similar types of approaches.

1707.01530 2026-06-04 cs.CV cs.NA math.NA

On the Fusion of Compton Scatter and Attenuation Data for Limited-view X-ray Tomographic Applications

在有限视角X射线断层成像应用中融合康普顿散射与衰减数据

Hamideh Rezaee, Brian Tracey, Eric L. Miller

AI总结 本文提出一种融合康普顿散射数据与传统衰减数据的方法,用于恢复材料密度和光电吸收,通过变分方法和正则化技术提升成像精度。

详情
AI中文摘要

本文演示了在有限视角X射线断层成像应用中,融合能量分辨的康普顿散射光子观测与传统衰减数据,用于联合恢复材料密度和光电吸收的实用性。我们首先开发了康普顿散射过程的物理和相关数值模型。利用该模型,我们提出了一种变分方法来恢复这两种材料属性。除了典型的数据保真项外,优化功能还包含对质量和光电系数的正则化。我们还考虑了质量密度情况下的新型边缘保持方法。为了帮助恢复光电信息,我们借鉴了最近的方法,并采用非局部正则化方案,利用质量密度更稳定成像的事实。模拟结果展示了同时使用散射光子数据和能量分辨信息在映射两种材料属性方面的明显优势。具体而言,比较仅使用传统衰减数据获得的图像与仅使用康普顿散射光子或两种数据结合形成的图像,显示同时利用两种数据进行重建能提供更准确的结果。

英文摘要

In this paper we demonstrate the utility of fusing energy-resolved observations of Compton scattered photons with traditional attenuation data for the joint recovery of mass density and photoelectric absorption in the context of limited view tomographic imaging applications. We begin with the development of a physical and associated numerical model for the Compton scatter process. Using this model, we propose a variational approach recovering these two material properties. In addition to the typical data-fidelity terms, the optimization functional includes regularization for both the mass density and photoelectric coefficients. We consider a novel edge-preserving method in the case of mass density. To aid in the recovery of the photoelectric information, we draw on our recent method in \cite{r15} and employ a non-local regularization scheme that builds on the fact that mass density is more stably imaged. Simulation results demonstrate clear advantages associated with the use of both scattered photon data and energy resolved information in mapping the two material properties of interest. Specifically, comparing images obtained using only conventional attenuation data with those where we employ only Compton scatter photons and images formed from the combination of the two, shows that taking advantage of both types of data for reconstruction provides far more accurate results.

1707.01322 2026-06-04 cs.LG cs.LO cs.SY eess.SY

Automated Experiment Design for Data-Efficient Verification of Parametric Markov Decision Processes

数据高效验证参数马尔可夫决策过程的自动化实验设计

Elizabeth Polgreen, Viraj Wijesuriya, Sofie Haesaert, Alessandro Abate

AI总结 本文提出一种利用参数模型和实验数据进行统计验证的新方法,通过参数综合确定可行参数集,主动合成实验提高数据相关性,并传播信息以获得验证结果。

Comments QEST 2017, 18 pages, 7 figures

详情
AI中文摘要

我们提出了一种新的方法,用于在部分未知系统上对定量属性进行统计验证,利用参数模型(本文中为参数马尔可夫决策过程)和从底层系统中收集的实验数据。我们获得底层系统满足给定属性的信心,并展示该方法使用数据高效,因此对可用数据量具有鲁棒性。这些特性通过首先利用参数综合确定可行参数集,其次主动合成实验以增加与属性相关的数据信息,最后将此信息传播到模型参数,从而获得反映我们对系统参数是否在可行集中的信心,从而解决验证问题。

英文摘要

We present a new method for statistical verification of quantitative properties over a partially unknown system with actions, utilising a parameterised model (in this work, a parametric Markov decision process) and data collected from experiments performed on the underlying system. We obtain the confidence that the underlying system satisfies a given property, and show that the method uses data efficiently and thus is robust to the amount of data available. These characteristics are achieved by firstly exploiting parameter synthesis to establish a feasible set of parameters for which the underlying system will satisfy the property; secondly, by actively synthesising experiments to increase amount of information in the collected data that is relevant to the property; and finally propagating this information over the model parameters, obtaining a confidence that reflects our belief whether or not the system parameters lie in the feasible set, thereby solving the verification problem.