arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
2411.05591 2026-06-04 stat.ML cs.LG

Decentralized EM Algorithm for Gaussian Mixtures under Data Heterogeneity and Partial Labeling

数据异质性和部分标记下高斯混合的分布式EM算法

Xuetong Li, Shuyuan Wu, Bin Du, Hansheng Wang

发表机构 * School of Mathematics and Statistics(数学与统计学学院) School of Statistics and Data Science(统计与数据科学学院) Guanghua School of Management(光华管理学院)

AI总结 针对分布式联邦学习中数据异质性导致经典EM算法估计有偏的问题,提出动量网络EM(MNEM)算法和半监督MNEM(semi-MNEM)算法,实现渐近有效估计并加速收敛。

详情
AI中文摘要

我们系统研究了分布式联邦学习(DFL)中高斯混合模型的几种基于网络的期望最大化(EM)算法。理论研究表明,当数据在不同站点间异质分布时,直接将经典EM算法扩展到DFL会导致有偏估计。为解决这一问题,我们引入了动量网络EM(MNEM)算法,该算法整合了当前和先前DFL迭代的历史估计信息。我们进一步开发了半监督MNEM(semi-MNEM)算法,利用部分标记数据提供的信息。严格的理论分析表明,在适当的正则条件下,即使数据异质,MNEM估计器也能达到与全样本估计器相同的渐近效率。此外,即使不同混合成分分离较差,semi-MNEM估计器也能显著提高MNEM算法的收敛速度。进行了大量模拟,并分析了一个广泛使用的胸部X射线数据集,以证明所提出方法的有限样本性能。

英文摘要

We systematically study several network-based Expectation-Maximization (EM) algorithms for the Gaussian mixture model within decentralized federated learning (DFL). Our theoretical investigation shows that directly extending the classic EM algorithm to DFL leads to a biased estimator when data are heterogeneously distributed across sites. To address this, we introduce a momentum network EM (MNEM) algorithm, which integrates information from both current and historical estimators from previous DFL iterations. We further develop a semi-supervised MNEM (semi-MNEM) algorithm, which utilizes information provided by partially labeled data. Rigorous theoretical analysis demonstrates that the MNEM estimator can achieve the same asymptotic efficiency as the whole-sample estimator under appropriate regularity conditions, even with heterogeneous data. Moreover, the semi-MNEM estimator significantly improves the convergence speed of the MNEM algorithm, even if different mixture components are poorly separated. Extensive simulations are conducted, and a widely used chest X-ray dataset is analyzed to demonstrate the finite-sample performance of the proposed methods.

2205.08609 2026-06-04 stat.ML cs.LG stat.ME

Bagged Polynomial Regression and Neural Networks

Bagged Polynomial Regression and Neural Networks

Sylvia Klosin, Jaume Vives-i-Bastida

发表机构 * Department of Agricultural and Resource Economics, UC Davis(加州大学戴维斯分校农业与资源经济学系) Stanford Graduate School of Business(斯坦福商学院)

AI总结 针对高维预测问题,提出基于随机投影的袋装多项式回归(BPR),在保持与神经网络相当精度的同时提供可解释性和诊断工具。

详情
AI中文摘要

气候和环境应用越来越依赖于从遥感和其他科学数据中进行高维预测。神经网络(NN)在这些场景中能够提供强大的准确性,但往往难以审计且难以与领域知识对齐。作为替代方案,我们提出了基于随机投影的袋装多项式回归(BPR),这是一种计量经济学原生的集成方法,它对在随机选择的协变量组上拟合的多个正则化低次多项式模型进行平均。我们提供了新颖的有限样本和渐近风险界,并展示了协变量划分如何通过控制字典基增长来改善光滑目标函数的速率。速率改进对于边际效应的估计可能尤其重要。在使用光学和雷达图像进行基于卫星的作物分类的应用中,BPR 在保持易于诊断的同时达到了与 NN 相当的准确性。我们提供了实用的透明度工具、系数汇总和偏依赖诊断,表明 BPR 捕捉到了 NN 未能捕捉到的直观特征关系。

英文摘要

Climate and environmental applications increasingly rely on high-dimensional prediction from remote sensing and other scientific data. Neural networks (NN) can deliver strong accuracy in these settings, but they are often hard to audit and hard to align with domain knowledge. As an alternative, we propose bagged polynomial regression with random projections (BPR), an econometrics-native ensemble that averages many regularized low-degree polynomial models fit on randomly selected covariate groups. We provide novel finite-sample and asymptotic risk bounds and show how covariate partitioning can improve rates for smooth target functions by controlling dictionary basis growth. Rate improvements may be particularly relevant for the estimation of marginal effects. In an application to satellite-based crop classification using optical and radar imagery, BPR matches NN accuracy while remaining straightforward to diagnose. We provide practical transparency tools, coefficient summaries and partial-dependence diagnostics, that show BPR captures intuitive feature relationships that NNs do not.

2004.10846 2026-06-04 cs.CY cs.LG

Reducing the Filtering Effect in Public School Admissions: A Bias-aware Analysis for Targeted Interventions

减少公立学校招生中的过滤效应:面向针对性干预的偏差感知分析

Yuri Faenza, Swati Gupta, Aapeli Vuorinen, Xuan Zhang

发表机构 * Columbia University(哥伦比亚大学) Massachusetts Institute of Technology(麻省理工学院)

AI总结 本研究采用运筹学方法,通过分析纽约市教育部数据,将弱势学生分数分布偏移建模为偏差,并证明针对中等成绩弱势学生的集中干预(如奖学金或培训)可显著降低偏差影响。

详情
AI中文摘要

问题定义:传统上,纽约市顶尖的8所公立学校仅根据学生在特殊高中入学考试(SHSAT)中的成绩选拔候选人。这些成绩已知受到学生社会经济地位和初中所接受的考试准备的影响,导致教育管道中产生巨大的过滤效应。经典的学校分配机制并未自然解决学校隔离和班级多样性等问题,这些问题近年来日益恶化。包括政策制定者在内的科学界通过引入群体特定配额和比例约束来应对,但结果好坏参半。寻找有效且公平的方法以扩大优质教育机会的问题仍未解决。 方法/结果:我们采用与大多数现有文献不同的运筹学方法,目标是增加经济需求高的学生的机会。利用纽约市教育部(DOE)的数据,我们展示了被DOE归类为“弱势”(主要基于经济因素的标准)的学生所获分数的分布存在偏移。我们将这种偏移建模为“偏差”,源于对弱势学生真实潜力的低估。我们分析了这种偏差对分类匹配市场的影响。我们表明,当针对中等成绩的弱势学生群体时,通过奖学金或培训进行的集中干预可以显著降低偏差的影响。

英文摘要

Problem definition: Traditionally, New York City's top 8 public schools have selected candidates solely based on their scores in the Specialized High School Admissions Test (SHSAT). These scores are known to be impacted by socioeconomic status of students and test preparation received in middle schools, leading to a massive filtering effect in the education pipeline. The classical mechanisms for assigning students to schools do not naturally address problems like school segregation and class diversity, which have worsened over the years. The scientific community, including policymakers, have reacted by incorporating group-specific quotas and proportionality constraints, with mixed results. The problem of finding effective and fair methods for broadening access to top-notch education is still unsolved. Methodology/results: We take an operations approach to the problem different from most established literature, with the goal of increasing opportunities for students with high economic needs. Using data from the Department of Education (DOE) in New York City, we show that there is a shift in the distribution of scores obtained by students that the DOE classifies as "disadvantaged" (following criteria mostly based on economic factors). We model this shift as a "bias" that results from an underestimation of the true potential of disadvantaged students. We analyze the impact this bias has on an assortative matching market. We show that centrally planned interventions can significantly reduce the impact of bias through scholarships or training, when they target the segment of disadvantaged students with average performance.

2407.03956 2026-06-04 cs.MA cs.CL

Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems

使用约束引导的多智能体系统解决斑马谜题

Shmuel Berman, Kathleen McKeown, Baishakhi Ray

发表机构 * Princeton University(普林斯顿大学) Columbia University(哥伦比亚大学)

AI总结 提出一种多智能体系统ZPS,结合大语言模型与定理证明器,通过分解问题、生成SMT代码和智能体间反馈,显著提升复杂逻辑谜题的解决能力。

详情
AI中文摘要

先前的研究通过链式思维提示或引入符号表示等技术,增强了大语言模型(LLMs)解决逻辑谜题的能力。然而,由于将自然语言线索翻译为逻辑语句的固有复杂性,这些框架通常仍不足以解决复杂的逻辑问题,例如斑马谜题。我们引入了一个多智能体系统ZPS,它将LLMs与现成的定理证明器集成在一起。该系统通过将问题分解为更小、更易管理的部分,生成SMT(可满足性模理论)代码以使用定理证明器求解,并利用智能体之间的反馈来反复改进答案,从而处理复杂的谜题求解任务。我们还引入了一个自动网格谜题评分器来评估我们谜题解决方案的正确性,并通过用户研究评估了该自动评分器的可靠性。我们的方法在我们测试的所有三个LLM中均显示出改进,其中GPT-4的完全正确解决方案数量提高了166%。

英文摘要

Prior research has enhanced the ability of Large Language Models (LLMs) to solve logic puzzles using techniques such as chain-of-thought prompting or introducing a symbolic representation. These frameworks are still usually insufficient to solve complicated logical problems, such as Zebra puzzles, due to the inherent complexity of translating natural language clues into logical statements. We introduce a multi-agent system, ZPS, that integrates LLMs with an off the shelf theorem prover. This system tackles the complex puzzle-solving task by breaking down the problem into smaller, manageable parts, generating SMT (Satisfiability Modulo Theories) code to solve them with a theorem prover, and using feedback between the agents to repeatedly improve their answers. We also introduce an automated grid puzzle grader to assess the correctness of our puzzle solutions and show that the automated grader is reliable by evaluating it in a user-study. Our approach shows improvement in all three LLMs we tested, with GPT-4 showing 166% improvement in the number of fully correct solutions.

1710.04238 2026-06-04 stat.ME cs.LG cs.NA math.NA

Regression-aware decompositions

回归感知的分解

Mark Tygert

发表机构 * Facebook Artificial Intelligence Research(脸书人工智能研究)

AI总结 本文提出了一种回归感知的分解方法,通过结合线性最小二乘回归模型与插值分解,实现了对矩阵B的监督降维,从而揭示了B中与A回归相关的结构。

Comments 19 pages, 9 figures, 2 tables

详情
Journal ref
Linear Algebra and Its Applications, 565 (6): 208-224, 2019
AI中文摘要

线性最小二乘回归通过设计矩阵A来近似给定矩阵B,通过最小化谱范数或Frobenius范数的差异||AX-B||来实现。另一种流行的近似方法是通过主成分分析(PCA)进行低秩近似,即奇异值分解(SVD)或插值分解(ID)。传统上,PCA/SVD和ID仅使用被近似的矩阵B,而不受任何辅助矩阵A的监督。然而,线性最小二乘回归模型可以指导ID,从而产生回归感知的ID。作为额外的好处,这为一种典型的判别分析(A和B之间的相关性)提供了解释。回归感知的分解有效使监督信息能够指导经典的降维方法,而经典降维方法历来是完全无监督的。回归感知的分解揭示了B中与A回归相关的结构。

英文摘要

Linear least-squares regression with a "design" matrix A approximates a given matrix B via minimization of the spectral- or Frobenius-norm discrepancy ||AX-B|| over every conformingly sized matrix X. Another popular approximation is low-rank approximation via principal component analysis (PCA) -- which is essentially singular value decomposition (SVD) -- or interpolative decomposition (ID). Classically, PCA/SVD and ID operate solely with the matrix B being approximated, not supervised by any auxiliary matrix A. However, linear least-squares regression models can inform the ID, yielding regression-aware ID. As a bonus, this provides an interpretation as regression-aware PCA for a kind of canonical correlation analysis between A and B. The regression-aware decompositions effectively enable supervision to inform classical dimensionality reduction, which classically has been totally unsupervised. The regression-aware decompositions reveal the structure inherent in B that is relevant to regression against A.

1803.03724 2026-06-04 math.DG cs.CG cs.CV cs.NA math.AP math.NA

Contour Parametrization via Anisotropic Mean Curvature Flows

通过各向异性均 curvature 流进行轮廓参数化

P. Suárez-Serrato, E. I. Velázquez Richards

发表机构 * Department of Mathematics, University of California, Santa Barbara, on leave from , Instituto de Matem\'aticas, Instituto de Matem\'aticas, Universidad Nacional Aut\'onoma de M\'exico, Mexico City

AI总结 本文提出了一种新的各向异性均 curvature 流实现,用于轮廓识别。通过将平面闭合光滑曲线的均 curvature 流与外部场相结合,该方法利用点电荷势场约束曲线运动,从而实现轮廓的参数化。

Comments 30 pages, 20 images, source code for our numerical implementation is available in this URL https://github.com/V3du4rd0/AMCF

详情
Journal ref
Applied Mathematics and Computation, Volume 441, 2023, 127699
AI中文摘要

我们提出了一种新的各向异性均 curvature 流实现,用于轮廓识别。我们的方法将平面闭合光滑曲线的均 curvature 流与来自点电荷势场的外部场相结合。这种耦合在曲线与背景图像匹配时约束其运动。我们还包含了用于数值近似的稳定性准则。

英文摘要

We present a new implementation of anisotropic mean curvature flow for contour recognition. Our procedure couples the mean curvature flow of planar closed smooth curves, with an external field from a potential of point-wise charges. This coupling constrains the motion when the curve matches a picture placed as background. We include a stability criteria for our numerical approximation.

2312.08472 2026-06-04 cs.NE cs.LG cs.NA math.NA

AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

AutoNumerics-Zero:自动发现最先进的数学函数

Esteban Real, Mirko Rossini, Connal de Souza, Manav Garg, Moritz Firsching, Quoc V. Le, Yao Chen, Akhil Verghese, Ekin Dogus Cubuk, David H. Park

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文通过符号回归的进化方法,在不依赖任意精度的情况下,自动发现比传统方法更高效的数学函数近似程序,例如一个10操作的程序逼近指数函数达到14位有效数字。

Comments v2: Accepted to the International Conference on Machine Learning (ICML 2026); added results, clarified framing, and added proofs

详情
AI中文摘要

超越函数(如指数函数)是科学计算的核心,但数字硬件无法原生计算它们。相反,计算机必须通过组合基本运算(如$\{+, -, \times, ÷\}$)使用泰勒级数等方法近似这些函数。这些方法由数学家经过几个世纪发展而来,专注于能够达到任意精度的途径。然而,计算机通过仅使用有限精度类型(如float32)即可处理大多数应用,超出该类型精度的任何精度实际上都被丢弃了。因此,我们探索放弃任意精度是否能够发现更高效的近似。符号回归的进化方法特别适合,因为它可以搜索任意操作组合,并优化不可微的目标(如使用的操作数)。我们的结果表明,进化能够发现在此设置下优于已有方法的计算机程序,尽管除了基本运算的计算外没有先前的数学知识。从空代码开始,符号回归构建表示新颖数学表达式的程序。特别地,我们发现了10个操作的逼近指数函数达到14位有效数字的程序,其精度比此前已知的同等规模近似高出超过6个数量级。

英文摘要

Transcendental functions, such as the exponential, are central to scientific computing, yet they cannot be natively calculated by digital hardware. Instead, computers must approximate these functions by combining basic operations, such as $\{+, -, \times, ÷\}$, using methods like Taylor series. These methods were developed over centuries by mathematicians, who focused on approaches that could attain arbitrary accuracy. However, computers can handle most applications by using only finite-precision types, like float32, where any accuracy beyond the type's precision is effectively discarded. We explore, therefore, whether forgoing arbitrary accuracy can lead to the discovery of more efficient approximations. The evolutionary method of symbolic regression is particularly suitable, as it can search for arbitrary operation combinations and can optimize non-differentiable objectives, such as the number of operations used. Our results show that evolution can discover computer programs that outperform established methods in this setting, despite having no prior mathematical knowledge beyond the calculation of the basic operations. Starting from empty code, symbolic regression constructs programs representing novel mathematical expressions. In particular, we discovered a 10-operation program that approximates the exponential function to 14 significant figures, exceeding the accuracy of previously known approximations of this size by more than 6 orders of magnitude.

1702.03433 2026-06-04 eess.SY cs.RO cs.SY

Path Assignment Techniques For Vehicle Tracking

车辆跟踪中的路径分配技术

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW(驾驶辅助系统,ZF TRW) Deutsches Krebsforschungszentrum(德国癌症研究中心)

AI总结 本文提出两种路径分配方法,旨在通过延迟处理阶段来过滤测量数据,以避免延迟和其他中间滤波器的伪影,通过生成离散后验概率分布并使用中位数估计器提取路径或车道索引,通过ROC曲线展示方法性能。

Comments 6 pages, 9 figures

详情
Journal ref
Proceedings of the IEEE Intelligent Vehicles Symposium (2014) 1451-1456
AI中文摘要

许多驾驶员辅助系统,如自适应巡航控制系统,需要识别处于主机车辆路径中的最近车辆。这涉及将检测到的车辆分配给主机车辆的路径或邻近路径。在回顾了主机车辆路径估计和车道分配技术的方法后,我们介绍了两种方法,这些方法受到在尽可能晚的处理阶段过滤测量数据的动机,以避免延迟和其他中间滤波器的伪影。这些滤波器生成离散后验概率分布,从中通过中位数估计器提取路径或“车道”索引。通过使用实验数据和标记的地面真实数据,展示了这些方法的相对性能。

英文摘要

Many driver assistance systems such as Adaptive Cruise Control require the identification of the closest vehicle that is in the host vehicle's path. This entails an assignment of detected vehicles to the host vehicle path or neighboring paths. After reviewing approaches to the estimation of the host vehicle path and lane assignment techniques we introduce two methods that are motivated by the rationale to filter measured data as late in the processing stages as possible in order to avoid delays and other artifacts of intermediate filters. These filters generate discrete posterior probability distributions from which a path or "lane" index is extracted by a median estimator. The relative performance of those methods is illustrated by a ROC using experimental data and labeled ground truth data.

1508.04124 2026-06-04 eess.SY cs.RO cs.SY

A Complete Derivation Of The Association Log-Likelihood Distance For Multi-Object Tracking

多目标跟踪中关联对数似然距离的完整推导

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW(ZF TRW驾驶辅助系统) German Cancer Research Center(德国癌症研究中心)

AI总结 本文基于多目标跟踪中的关联问题,推导了关联对数似然距离,并通过蒙特卡洛模拟验证了其在关联关系准确性上的优越性。

Comments 7 pages, 3 figures

详情
Journal ref
2016 IEEE Intelligent Vehicles Symposium (IV)
AI中文摘要

Mahalanobis距离常用于多目标跟踪中的测量-跟踪关联。从Mahalanobis距离的原始定义出发,我们回顾了其在关联中的应用。由于多目标跟踪中没有原则将Mahalanobis距离视为一种独特的统计距离,我们重新审视了多假设跟踪中的全局关联假设作为最通用的关联设置。这些关联假设诱导出一种用于分配的距离似然量,我们称之为关联对数似然距离。我们比较了Mahalanobis距离与关联对数似然距离在蒙特卡洛模拟中产生正确关联关系的能力。结果表明,基于关联对数似然的距离在平均上比Mahalanobis距离表现更好,证实了最大化全局关联假设比最小化特定统计距离度量是一种更根本的关联方法。

英文摘要

The Mahalanobis distance is commonly used in multi-object trackers for measurement-to-track association. Starting with the original definition of the Mahalanobis distance we review its use in association. Given that there is no principle in multi-object tracking that sets the Mahalanobis distance apart as a distinguished statistical distance we revisit the global association hypotheses of multiple hypothesis tracking as the most general association setting. Those association hypotheses induce a distance-like quantity for assignment which we refer to as association log-likelihood distance. We compare the ability of the Mahalanobis distance to the association log-likelihood distance to yield correct association relations in Monte-Carlo simulations. It turns out that on average the distance based on association log-likelihood performs better than the Mahalanobis distance, confirming that the maximization of global association hypotheses is a more fundamental approach to association than the minimization of a certain statistical distance measure.

1409.6111 2026-06-04 math.OC cs.LG cs.MA cs.SY eess.SY stat.ML

Distributed Clustering and Learning Over Networks

网络上的分布式聚类与学习

Xiaochuan Zhao, Ali H. Sayed

发表机构 * Department of Electrical Engineering, University of California, Los Angeles(加州大学洛杉矶分校电气工程系)

AI总结 本文提出了一种自适应的聚类和学习方案,使智能体能够学习应与哪些邻居合作以及哪些邻居应忽略,从而在网络中实现更准确的学习和估计。通过详细的均方分析,评估了聚类机制的一阶和二阶误差概率,并证明这些概率随步长指数衰减,从而可以将正确聚类的概率任意接近于一。

Comments 47 pages, 6 figures

详情
AI中文摘要

网络上的分布式处理依赖于节点间的在网处理和邻近智能体之间的合作。当智能体共享共同目标时,合作是有益的。然而,在许多应用中,智能体可能属于不同的集群,追求不同的目标。因此,无差别合作会导致不期望的结果。在本文中,我们提出了一种自适应的聚类和学习方案,使智能体能够学习应与哪些邻居合作以及哪些其他邻居应忽略。通过这样做,所得到的算法使智能体能够识别其集群,并在网络中实现改进的学习和估计准确性。我们进行了详细的均方分析,并评估了聚类机制的一阶和二阶误差概率,即虚警和误检概率。此外,我们证明这些概率随着步长指数衰减,从而使正确聚类的概率可以任意接近于一。

英文摘要

Distributed processing over networks relies on in-network processing and cooperation among neighboring agents. Cooperation is beneficial when agents share a common objective. However, in many applications agents may belong to different clusters that pursue different objectives. Then, indiscriminate cooperation will lead to undesired results. In this work, we propose an adaptive clustering and learning scheme that allows agents to learn which neighbors they should cooperate with and which other neighbors they should ignore. In doing so, the resulting algorithm enables the agents to identify their clusters and to attain improved learning and estimation accuracy over networks. We carry out a detailed mean-square analysis and assess the error probabilities of Types I and II, i.e., false alarm and mis-detection, for the clustering mechanism. Among other results, we establish that these probabilities decay exponentially with the step-sizes so that the probability of correct clustering can be made arbitrarily close to one.

1809.07870 2026-06-04 eess.SY cs.RO cs.SY math.OC

Suspended Load Path Tracking Control Using a Tilt-rotor UAV Based on Zonotopic State Estimation

基于退化拓扑状态估计的倾斜旋翼无人机悬浮负载路径跟踪控制

Brenner S. Rego, Guilherme V. Raffo

发表机构 * Graduate Program in Electrical Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil(巴西联邦矿务工程师学院电气工程研究生项目) Department of Electronics Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil(巴西联邦矿务工程师学院电子工程系)

AI总结 本文研究了利用倾斜旋翼无人机进行悬浮负载路径跟踪控制的问题,通过建立多体机械系统的动力学模型,提出退化拓扑状态估计器来估计负载位置和姿态,并设计了具有极点放置约束的离散时间混合H2/H∞控制器以实现鲁棒的路径跟踪控制。

详情
AI中文摘要

本文解决了一种倾斜旋翼无人机悬浮负载路径跟踪控制的问题。主要挑战来自于负载动态行为,通常通过绳索与无人机连接,增加了系统的未驱动自由度。此外,为了执行负载运输任务,通常需要知道负载的位置信息。由于可用传感器通常嵌入在移动平台上,负载位置的信息可能无法直接获取。为了解决这个问题,本文首先从负载的角度出发,推导了多体机械系统的运动学,利用欧拉-拉格朗日方法推导出详细的动力学模型,得到一个高度耦合、非线性的状态空间表示,输入是仿射的,负载的位置和姿态直接由状态变量表示。提出了一种退化拓扑状态估计器来解决负载位置和姿态的估计问题,该估计器基于飞机上的传感器,具有不同的采样时间,并且测量噪声是未知但有界的。为了解决路径跟踪问题,设计了一个具有极点放置约束的离散时间混合H2/H∞控制器,具有保证的时间响应特性,并对未建模动态、参数不确定性以及外部干扰具有鲁棒性。通过在基于Gazebo模拟器的平台上进行的数值实验以及系统计算机辅助设计(CAD)模型上的实验,验证了退化拓扑状态估计器和设计的控制器的性能。

英文摘要

This work addresses the problem of path tracking control of a suspended load using a tilt-rotor UAV. The main challenge in controlling this kind of system arises from the dynamic behavior imposed by the load, which is usually coupled to the UAV by means of a rope, adding unactuated degrees of freedom to the whole system. Furthermore, to perform the load transportation it is often needed the knowledge of the load position to accomplish the task. Since available sensors are commonly embedded in the mobile platform, information on the load position may not be directly available. To solve this problem in this work, initially, the kinematics of the multi-body mechanical system are formulated from the load's perspective, from which a detailed dynamic model is derived using the Euler-Lagrange approach, yielding a highly coupled, nonlinear state-space representation of the system, affine in the inputs, with the load's position and orientation directly represented by state variables. A zonotopic state estimator is proposed to solve the problem of estimating the load position and orientation, which is formulated based on sensors located at the aircraft, with different sampling times, and unknown-but-bounded measurement noise. To solve the path tracking problem, a discrete-time mixed $\mathcal{H}_2/\mathcal{H}_\infty$ controller with pole-placement constraints is designed with guaranteed time-response properties and robust to unmodeled dynamics, parametric uncertainties, and external disturbances. Results from numerical experiments, performed in a platform based on the Gazebo simulator and on a Computer Aided Design (CAD) model of the system, are presented to corroborate the performance of the zonotopic state estimator along with the designed controller.

2107.01629 2026-06-04 stat.ML cs.LG econ.GN q-fin.EC stat.AP

From Live to Recording: Consumer Demand and Response to Price Across the Livestreaming Lifecycle

从直播到录制:消费者对直播生命周期中价格的需求与响应

Ziwei Cong, Jia Liu, Puneet Manchanda

发表机构 * Georgetown University(乔治·华盛顿大学) Hong Kong University of Science and Technology(香港科学与技术大学) University of Michigan(密歇根大学) Stephen M. Ross School of Business, University of Michigan(密歇根大学罗斯商学院)

AI总结 利用大型直播平台数据,研究消费者在直播前后对价格敏感性的差异,发现直播前需求价格弹性更高,主要由消费者自选择和质量不确定性驱动。

Comments An earlier version of this paper was distributed under the title "The Role of 'Live' in Livestreaming Markets: Evidence Using Orthogonal Random Forest."

详情
AI中文摘要

直播已发展成为一个蓬勃发展的行业,创作者可以直接从中获利并与观众和粉丝互动。在实践中,创作者和平台通常将营销工作集中在直播前的时期。然而,直播活动在结束后自然过渡到录制格式,创造了潜在的“剩余”变现机会。本研究利用一个大型直播平台的数据,系统性地考察了消费者在整个直播生命周期中对直播活动的需求,该平台允许消费者在直播结束后购买付费直播活动的录制版本。我们发现,与直播后时期相比,直播前时期的需求对价格更敏感。这部分由两种机制驱动:消费者自选择(不常消费的消费者可能错过了直播活动,对录制版本表现出更高的支付意愿)和质量不确定性(消费者在直播前时期面临的事件质量不确定性高于直播后时期)。我们的研究结果为直播市场的定价和定向策略提供了启示。

英文摘要

Livestreaming has evolved into a thriving industry where creators can directly monetize and engage with their audiences and followers. In practice, creators and platforms typically concentrate their marketing efforts on the period leading up to the livestream. However, livestreaming events naturally transition into recorded formats once the event concludes, creating potential "residual" opportunities for monetization. This study systematically examines consumer demand for live events throughout the entire livestream life-cycle, using data from a large livestreaming platform that allows consumers to purchase the recorded version of a paid live event after the livestream ends. We find that the demand is surprisingly more price-sensitive during the pre-livestream period compared to the post-period. This is partly driven by two mechanisms: consumer self-selection (infrequent consumers who may have missed the live events exhibit a higher willingness to pay for recorded versions) and quality uncertainty (consumers face higher uncertainty in event quality during the pre-period than in the post-period). Our findings generate implications for the pricing and targeting strategies in livestreaming markets.

2006.04013 2026-06-04 cs.CY cs.AI cs.LG

AI from concrete to abstract: demystifying artificial intelligence to the general public

从具体到抽象的人工智能:向公众揭秘人工智能

Rubens Lacerda Queiroz, Fábio Ferrentini Sampaio, Cabral Lima, Priscila Machado Vieira Lima

发表机构 * Federal University of Rio de Janeiro – UFRJ – Brazil(巴西联邦大学里约热内卢分校) InovLabs – Portugal(葡萄牙InovLabs) Atlantica University – Portugal(葡萄牙Atlantica大学) PESC/COPPE Tercio Pacitti Institute (NCE)(Tercio Pacitti研究所(NCE))

AI总结 本文提出一种结合可视化编程与WiSARD无权重人工神经网络的新方法AIcon2abs,通过实践开发学习机器并观察其学习过程,帮助普通大众(包括儿童)理解人工智能的基本概念。

Comments 23 pages; 2 tables; 47 figures; review comment: Included references for the final published peer-reviewed version of this pre-print: https://doi.org/10.1007/s00146-021-01151-x and https://rdcu.be/cihdO; typos corrected

详情
Journal ref
AI & SOCIETY, 36 877-893 (2021)
AI中文摘要

人工智能(AI)已被广泛应用于众多领域,这表明迫切需要开发手段,使普通大众对AI的含义有最基本的理解。本文结合可视化编程与WiSARD无权重人工神经网络,提出了一种新方法——从具体到抽象的人工智能(AIcon2abs),使普通人(包括儿童)能够实现这一目标。该方法的主要策略是通过与学习机器开发相关的实践活动,以及观察其学习过程,来促进对人工智能的去神秘化。因此,它能够使受训者获得技能,从而在涉及采用人工智能机制的辩论和决策中成为有洞察力的参与者。目前,通过编程教授基本AI概念的现有方法将机器智能视为外部元素/模块。经过训练后,该外部模块被耦合到学习者正在开发的主应用程序中。而在本文提出的方法中,训练和分类任务都是构成主程序的模块,就像其他编程结构一样。作为AIcon2abs的一个有益副作用,能够从数据中学习的程序与常规计算机程序之间的区别变得更加明显。此外,WiSARD无权重人工神经网络模型的简单性使得训练和分类任务的内部实现易于可视化和理解。

英文摘要

Artificial Intelligence (AI) has been adopted in a wide range of domains. This shows the imperative need to develop means to endow common people with a minimum understanding of what AI means. Combining visual programming and WiSARD weightless artificial neural networks, this article presents a new methodology, AI from concrete to abstract (AIcon2abs), to enable general people (including children) to achieve this goal. The main strategy adopted by is to promote a demystification of artificial intelligence via practical activities related to the development of learning machines, as well as through the observation of their learning process. Thus, it is possible to provide subjects with skills that contributes to making them insightful actors in debates and decisions involving the adoption of artificial intelligence mechanisms. Currently, existing approaches to the teaching of basic AI concepts through programming treat machine intelligence as an external element/module. After being trained, that external module is coupled to the main application being developed by the learners. In the methodology herein presented, both training and classification tasks are blocks that compose the main program, just as the other programming constructs. As a beneficial side effect of AIcon2abs, the difference between a program capable of learning from data and a conventional computer program becomes more evident. In addition, the simplicity of the WiSARD weightless artificial neural network model enables easy visualization and understanding of training and classification tasks internal realization.

1610.04091 2026-06-04 eess.SY cs.DC cs.RO cs.SY math.OC

Optimizing Communication and Computation for Multi-UAV Information Gathering Applications

为多UAV信息采集应用优化通信与计算

Mason Thammawichai, Sujit P. Baliyarasimhuni, Eric C. Kerrigan, João B. Sousa

发表机构 * Department of Aeronautics, Imperial College London(帝国理工学院伦敦校区航空系) Department of Electronics and Communications Engineering, Indraprastha Institute of Information Technology(印度拉普拉兹信息技术学院电子与通信工程系) Department of Electrical & Electronic Engineering and the Department of Aeronautics, Imperial College London(帝国理工学院伦敦校区电子与电气工程系及航空系) Department of Electrical and Computer Engineering, University of Porto(葡萄牙波尔图大学电气与计算机工程系)

AI总结 本文针对多UAV系统中通信与计算能耗的优化问题,提出了一种混合整数优化方法,通过数据聚合和多跳分层聚类实现高效的路由方案,以延长系统寿命。

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems ( Volume: 54, Issue: 2, April 2018)
AI中文摘要

移动代理网络,如多UAV系统,受到资源限制的约束。特别是,有限的能源直接影响系统性能,如系统寿命。在无线传感器网络文献中已证明,通信能耗主导了计算和传感能耗。因此,通过优化通信数据量可以显著延长多UAV系统的寿命,但会增加计算成本。在本文中,我们旨在取得通信与计算能耗之间的最佳权衡。具体而言,我们提出了一种混合整数优化公式,用于多跳分层聚类基于自组织UAV网络的数据聚合,以获得节能的信息路由方案。所提出的框架在两个应用上进行了测试,即目标跟踪和区域映射。基于仿真结果,我们的方法相比没有数据聚合和聚类方案的基线策略,能显著节省能量。

英文摘要

Mobile agent networks, such as multi-UAV systems, are constrained by limited resources. In particular, limited energy affects system performance directly, such as system lifetime. It has been demonstrated in the wireless sensor network literature that the communication energy consumption dominates the computational and the sensing energy consumption. Hence, the lifetime of the multi-UAV systems can be extended significantly by optimizing the amount of communication data, at the expense of increasing computational cost. In this work, we aim at attaining an optimal trade-off between the communication and the computational energy. Specifically, we propose a mixed-integer optimization formulation for a multi-hop hierarchical clustering-based self-organizing UAV network incorporating data aggregation, to obtain an energy-efficient information routing scheme. The proposed framework is tested on two applications, namely target tracking and area mapping. Based on simulation results, our method can significantly save energy compared to a baseline strategy, where there is no data aggregation and clustering scheme.

1806.06723 2026-06-04 eess.SY cs.RO cs.SY math.OC

Towards Manipulability of Interactive Lagrangian Systems

面向交互拉格朗日系统的可操作性

Hanlei Wang

发表机构 * Science and Technology on Space Intelligent Control Laboratory(航天智能控制实验室)

AI总结 本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性,提出了一种新的动态反馈方法,设计了能够实现无限可操作性和对通信/传感约束鲁棒性的自适应控制器,解决了非线性双侧远程操作中的任意未知时变延迟问题。

Comments 15 pages, 15 figures

详情
Journal ref
Automatica, 119: 108913, 2020
AI中文摘要

本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性。两个标准例子是主从远程操作和机器人教学操作。我们系统地提出了通用动力系统中无限可操作性的概念,并探讨了这种统一动机如何形成一种设计范式,以保证交互动力系统的无限可操作性,并特别促进了交互拉格朗日系统的非线性自适应控制器的设计和分析。具体而言,基于一个新的动态反馈类,我们提出了一种自适应控制器,能够实现受控拉格朗日系统的无限可操作性和对通信/传感约束的鲁棒性,主要归因于由此产生的动态级联框架。所提出的范式在人机交互的网络耦合要求和受控动力学之间实现了理想的平衡。我们还证明了我们主要结果的一个特殊情况解决了长期存在的非线性双侧远程操作问题,即任意未知时变延迟。仿真结果展示了所提出自适应控制器下交互机器人系统的性能。

英文摘要

This paper investigates manipulability of interactive Lagrangian systems with parametric uncertainty and communication/sensing constraints. Two standard examples are teleoperation with a master-slave system and teaching operation of robots. We here systematically formulate the concept of infinite manipulability for general dynamical systems, and investigate how such a unified motivation yields a design paradigm towards guaranteeing the infinite manipulability of interactive dynamical systems and in particular facilitates the design and analysis of nonlinear adaptive controllers for interactive Lagrangian systems. Specifically, based on a new class of dynamic feedback, we propose adaptive controllers that achieve both the infinite manipulability of the controlled Lagrangian systems and the robustness with respect to the communication/sensing constraints, mainly owing to the resultant dynamic-cascade framework. The proposed paradigm yields the desirable balance between network coupling requirements and controlled dynamics of human-system interaction. We also show that a special case of our main result resolves the longstanding nonlinear bilateral teleoperation problem with arbitrary unknown time-varying delay. Simulation results show the performance of the interactive robotic systems under the proposed adaptive controllers.

1803.00638 2026-06-04 math.NA cs.CV cs.NA

Fast and accurate computation of orthogonal moments for texture analysis

快速且准确的正交矩计算用于纹理分析

C. Di Ruberto, L. Putzu, G. Rodriguez

发表机构 * Department of Mathematics and Computer Science, University of Cagliari(卡利里大学数学与计算机科学系) Department of Electrical and Electronic Engineering, University of Cagliari(卡利里大学电子与电气工程系)

AI总结 本文提出了一种快速稳定的算法,用于图像正交矩的计算,通过递推关系方法优化了Matlab实现,以提高计算效率和重建精度,并在纹理分析中展示了优于传统描述符的性能。

Comments 29 pages, 9 figures

详情
Journal ref
Pattern Recongnit. 83 (2018) 498-510
AI中文摘要

在本文中,我们描述了一种快速且稳定的算法,用于计算图像的正交矩。正交矩具有高区分能力,但某些公式形式计算复杂度较高,限制了实时应用。本文详细描述了基于递推关系的方法,并提出了一种优化的Matlab实现,旨在解决上述限制,并向社区提供高效易用的软件。在实验中,我们评估了递推公式的有效性及其在重建任务中的性能,与文献中常用的闭式表示进行比较。结果表明,计算复杂度显著降低,同时重建精度更高。为了评估和比较计算矩在纹理分析中的准确性,我们在六个著名的纹理图像数据库上进行了分类实验。再次,递推公式在分类任务中优于闭式表示。更重要的是,如果使用所提出的稳定过程从图像的GLCM计算正交矩,则在某些情况下,正交矩优于一些最流行的纹理分类状态-of-the-art描述符。

英文摘要

In this work we describe a fast and stable algorithm for the computation of the orthogonal moments of an image. Indeed, orthogonal moments are characterized by a high discriminative power, but some of their possible formulations are characterized by a large computational complexity, which limits their real-time application. This paper describes in detail an approach based on recurrence relations, and proposes an optimized Matlab implementation of the corresponding computational procedure, aiming to solve the above limitations and put at the community's disposal an efficient and easy to use software. In our experiments we evaluate the effectiveness of the recurrence formulation, as well as its performance for the reconstruction task, in comparison to the closed form representation, often used in the literature. The results show a sensible reduction in the computational complexity, together with a greater accuracy in reconstruction. In order to assess and compare the accuracy of the computed moments in texture analysis, we perform classification experiments on six well-known databases of texture images. Again, the recurrence formulation performs better in classification than the closed form representation. More importantly, if computed from the GLCM of the image using the proposed stable procedure, the orthogonal moments outperform in some situations some of the most diffused state-of-the-art descriptors for texture classification.

1904.08962 2026-06-04 eess.SY cs.LG cs.SY

Constrained Restless Bandits for Dynamic Scheduling in Cyber-Physical Systems

用于网络物理系统动态调度的受限 restless 扩展老虎机

Kesav Kaza, Rahul Meshram, Varun Mehta, S. N. Merchant

发表机构 * Department of Electrical Engineering, Indian Institute of Technology Bombay(印度理工学院班加罗尔电子工程系) Polytechnique Montreal(蒙特利尔理工学院) IIIT Allahabad(阿哈迈德纳巴德印度理工学院) University of Ottawa(渥太华大学) IIT Bombay(印度理工学院班加罗尔)

AI总结 本文研究了一类受约束的 restless 多臂老虎机(CRMAB),其中约束以时间变化的动作集(可用臂集)形式存在。该变化可以是随机的或半确定性的。给定一组臂,每个决策区间内固定数量的臂可以被选择进行播放。每个臂的播放会产生依赖于当前状态的奖励。当前臂的状态通过二进制反馈信号部分可观察,而当前可用臂的状态则是完全可观察的。目标是最大化长期累积奖励。未来臂可用性的不确定性以及部分状态信息使这一目标具有挑战性。CRMAB的应用可以发现于涉及时间变化可用性的网络物理系统中的资源分配。首先,通过 Whittle 的指数策略分析该优化问题。为此,研究了一个受约束的 restless 单臂老虎机。证明其具有阈值型最优策略,并且是可指数化的。提出了一种计算 Whittle 指数的算法。还提出了一种复杂度更低的替代解决方案方法,以在线滚动策略的形式呈现。还详细讨论了这两种方案的复杂性,表明具有短前瞻的在线滚动策略比 Whittle 指数计算更容易实施。进一步,推导了价值函数的上界,以估计各种解决方案的次优程度。模拟研究比较了 Whittle 指数、在线滚动、贪心和修改 Whittle 指数策略的性能。

Comments 17 pages, 2 figures

详情
AI中文摘要

本文研究了一类受约束的 restless 多臂老虎机(CRMAB)。约束以时间变化的动作集(可用臂集)形式存在。这种变化可以是随机的或半确定性的。给定一组臂,每个决策区间内固定数量的臂可以被选择进行播放。每个臂的播放会产生依赖于当前状态的奖励。当前臂的状态通过二进制反馈信号部分可观察,而当前可用臂的状态则是完全可观察的。目标是最大化长期累积奖励。未来臂可用性的不确定性以及部分状态信息使这一目标具有挑战性。CRMAB的应用可以发现于涉及时间变化可用性的网络物理系统中的资源分配。首先,通过 Whittle 的指数策略分析该优化问题。为此,研究了一个受约束的 restless 单臂老虎机。证明其具有阈值型最优策略,并且是可指数化的。提出了一种计算 Whittle 指数的算法。还提出了一种复杂度更低的替代解决方案方法,以在线滚动策略的形式呈现。还详细讨论了这两种方案的复杂性,表明具有短前瞻的在线滚动策略比 Whittle 指数计算更容易实施。进一步,推导了价值函数的上界,以估计各种解决方案的次优程度。模拟研究比较了 Whittle 指数、在线滚动、贪心和修改 Whittle 指数策略的性能。

英文摘要

This paper studies a class of constrained restless multi-armed bandits (CRMAB). The constraints are in the form of time varying set of actions (set of available arms). This variation can be either stochastic or semi-deterministic. Given a set of arms, a fixed number of them can be chosen to be played in each decision interval. The play of each arm yields a state dependent reward. The current states of arms are partially observable through binary feedback signals from arms that are played. The current availability of arms is fully observable. The objective is to maximize long term cumulative reward. The uncertainty about future availability of arms along with partial state information makes this objective challenging. Applications for CRMAB can be found in resource allocation in cyber-physical systems involving components with time varying availability. First, this optimization problem is analyzed using Whittle's index policy. To this end, a constrained restless single-armed bandit is studied. It is shown to admit a threshold-type optimal policy and is also indexable. An algorithm to compute Whittle's index is presented. An alternate solution method with lower complexity is also presented in the form of an online rollout policy. A detailed discussion on the complexity of both these schemes is also presented, which suggests that online rollout policy with short look ahead is simpler to implement than Whittle's index computation. Further, upper bounds on the value function are derived in order to estimate the degree of sub-optimality of various solutions. The simulation study compares the performance of Whittle's index, online rollout, myopic and modified Whittle's index policies.

1903.11734 2026-06-04 math.OC cs.LG cs.SY eess.SY

A Posteriori Probabilistic Bounds of Convex Scenario Programs with Validation Tests

凸场景程序后验概率界限的后验概率界限与验证测试

Chao Shang, Fengqi You

发表机构 * College of Engineering, Cornell University, Ithaca, New York(工程学院,康奈尔大学,伊萨卡,纽约)

AI总结 本文提出了一种新的后验界限,用于凸场景程序的后验概率评估,结合支持约束的实现和样本外验证数据的表现,以提高随机解的风险评估。

详情
Journal ref
IEEE Transactions on Automatic Control, Sept. 2021, Volume 66, Issue 9, Pages 4015 - 4028
AI中文摘要

场景程序已建立为在不确定性下做出决策的有效工具。为了评估基于场景的解决方案的质量,后验验证测试基于伯努利试验已被广泛采用。然而,为了达到理论上可靠的风险判断,通常需要收集大量的验证样本。在本文中,我们提出了一种新的后验界限,用于凸场景程序的后验概率评估,这些界限依赖于支持约束的实现和样本外验证数据的表现。所提出的界限具有广泛的通用性,因为许多现有的理论结果可以作为特殊情况被纳入其中。为了便于实际应用,还开发了一种系统的方法来参数化后验概率界限,该方法被证明具有多种有利的属性,允许易于实施和清晰的解释。通过综合支持约束和验证测试的全面信息,可以比现有的后验界限更有效地评估随机解的风险。对飞机侧向运动控制器设计的案例研究被提出以验证所提出的后验界限的有效性。

英文摘要

Scenario programs have established themselves as efficient tools towards decision-making under uncertainty. To assess the quality of scenario-based solutions a posteriori, validation tests based on Bernoulli trials have been widely adopted in practice. However, to reach a theoretically reliable judgement of risk, one typically needs to collect massive validation samples. In this work, we propose new a posteriori bounds for convex scenario programs with validation tests, which are dependent on both realizations of support constraints and performance on out-of-sample validation data. The proposed bounds enjoy wide generality in that many existing theoretical results can be incorporated as particular cases. To facilitate practical use, a systematic approach for parameterizing a posteriori probability bounds is also developed, which is shown to possess a variety of desirable properties allowing for easy implementations and clear interpretations. By synthesizing comprehensive information about support constraints and validation tests, improved risk evaluation can be achieved for randomized solutions in comparison with existing a posteriori bounds. Case studies on controller design of aircraft lateral motion are presented to validate the effectiveness of the proposed a posteriori bounds.

1711.05519 2026-06-04 cs.IT cs.LG cs.NA math.IT math.NA math.OC

Accelerated Alternating Projections for Robust Principal Component Analysis

加速交替投影用于鲁棒主成分分析

HanQin Cai, Jian-Feng Cai, Ke Wei

发表机构 * Department of Mathematics, University of California, Los Angeles(加州大学洛杉矶分校数学系) Department of Mathematics, Hong Kong University of Science and Technology(香港理工大学数学系) School of Data Science, Fudan University(复旦大学数据科学学院)

AI总结 本文提出了一种加速交替投影算法,用于鲁棒主成分分析,显著提高了现有交替投影方法在更新低秩因子时的计算效率,并证明了该算法的精确恢复保证和线性收敛性。

详情
Journal ref
Journal of Machine Learning Research, 20 (2019): 685-717
AI中文摘要

我们研究了完全观测设置下的鲁棒PCA,即从其总和D=L+S中分离低秩矩阵L和稀疏矩阵S。在本文中,提出了一种新的算法,称为加速交替投影,用于鲁棒PCA,显著提高了现有在[Netrapalli, Praneeth, et al., 2014]中提出的交替投影方法在更新低秩因子时的计算效率。通过首先将矩阵投影到某些低维子空间,然后通过截断SVD获得低秩矩阵的新估计,实现了加速。精确恢复保证已经建立,证明了所提出算法的线性收敛性。经验性能评估证明了我们的算法在鲁棒PCA中的优势。

英文摘要

We study robust PCA for the fully observed setting, which is about separating a low rank matrix $\boldsymbol{L}$ and a sparse matrix $\boldsymbol{S}$ from their sum $\boldsymbol{D}=\boldsymbol{L}+\boldsymbol{S}$. In this paper, a new algorithm, dubbed accelerated alternating projections, is introduced for robust PCA which significantly improves the computational efficiency of the existing alternating projections proposed in [Netrapalli, Praneeth, et al., 2014] when updating the low rank factor. The acceleration is achieved by first projecting a matrix onto some low dimensional subspace before obtaining a new estimate of the low rank matrix via truncated SVD. Exact recovery guarantee has been established which shows linear convergence of the proposed algorithm. Empirical performance evaluations establish the advantage of our algorithm over other state-of-the-art algorithms for robust PCA.

1807.06172 2026-06-04 eess.SY cs.RO cs.SY

Experimental Resilience Assessment of An Open-Source Driving Agent

开放源代码驾驶代理的实验韧性评估

Abu Hasnat Mohammad Rubaiyat, Yongming Qin, Homa Alemzadeh

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系)

AI总结 本文提出基于系统理论过程分析(STPA)的故障注入框架,用于评估开放源代码驾驶代理openpilot在不同环境条件和传感器数据故障下的韧性,通过战略性软件故障注入方法提高不安全场景的覆盖率,从而更有效地模拟安全关键故障并测试自动驾驶车辆。

Comments 10 pages, 7 figures

详情
AI中文摘要

自动驾驶车辆(AV)依赖雷达和相机等传感器进行环境感知、路径规划和控制。随着自主性提高和与复杂环境的交互增加,对AV安全性和可靠性的关注日益增长。本文提出一种基于系统理论过程分析(STPA)的故障注入框架,用于评估开放源代码驾驶代理openpilot在不同环境条件和影响传感器数据的故障下的韧性。为了增加测试期间不安全场景的覆盖率,我们采用战略性软件故障注入方法,其中触发故障注入的触发器来源于系统高级危险分析中识别出的不安全场景。实验结果表明,所提出的战略性故障注入方法相比随机故障注入提高了危险覆盖率,从而有助于更有效地模拟安全关键故障并测试AV。此外,本文还提供了关于openpilot安全机制性能及其在及时检测和恢复故障输入能力方面的见解。

英文摘要

Autonomous vehicles (AV) depend on the sensors like RADAR and camera for the perception of the environment, path planning, and control. With the increasing autonomy and interactions with the complex environment, there have been growing concerns regarding the safety and reliability of AVs. This paper presents a Systems-Theoretic Process Analysis (STPA) based fault injection framework to assess the resilience of an open-source driving agent, called openpilot, under different environmental conditions and faults affecting sensor data. To increase the coverage of unsafe scenarios during testing, we use a strategic software fault-injection approach where the triggers for injecting the faults are derived from the unsafe scenarios identified during the high-level hazard analysis of the system. The experimental results show that the proposed strategic fault injection approach increases the hazard coverage compared to random fault injection and, thus, can help with more effective simulation of safety-critical faults and testing of AVs. In addition, the paper provides insights on the performance of openpilot safety mechanisms and its ability in timely detection and recovery from faulty inputs.

1605.04344 2026-06-04 eess.SY cs.RO cs.SY

On the Effects of Measurement Uncertainty in Optimal Control of Contact Interactions

关于测量不确定性在接触相互作用最优控制中的影响

Brahayam Ponton, Stefan Schaal, Ludovic Righetti

发表机构 * Max-Planck Institute for Intelligent Systems, Tuebingen-Germany(图灵智能研究所,图宾根-德国) University of Southern California, Los Angeles-USA(南加州大学,洛杉矶-美国)

AI总结 本文研究了在机器人应用中,接触相互作用的不确定性不仅来自过程模型的噪声,还来自对世界知识的不精确,提出了一种基于风险敏感控制的SOC算法,通过引入观测器动态来显式依赖当前测量不确定性,仿真结果显示测量不确定性导致低阻尼行为,与过程噪声导致的刚性行为形成对比。

Comments 17 pages, 5 figures - this version is the one published at WAFR 2016 to fulfill the open access requirements of the EU commission, please refer to the previous version for the complete derivation of the algorithm

详情
AI中文摘要

随机最优控制(SOC)通常仅考虑过程模型中的噪声,即未知干扰。然而,在许多涉及与环境交互的机器人应用中,如运动和操作,不确定性还来自对世界的不精确知识,这并非实际干扰。我们通过开发基于风险敏感控制的SOC算法,分析同时考虑测量模型中的噪声的影响,该算法将观测器动态纳入其中,使得控制律显式依赖于当前的测量不确定性。在简单2D机械臂的仿真结果中,我们观察到测量不确定性导致低阻尼行为,这一结果与过程噪声产生刚性行为的效果形成对比。这表明考虑测量不确定性可能是解决涉及不确定接触相互作用问题的一种很有前途的方法。

英文摘要

Stochastic Optimal Control (SOC) typically considers noise only in the process model, i.e. unknown disturbances. However, in many robotic applications involving interaction with the environment, such as locomotion and manipulation, uncertainty also comes from lack of precise knowledge of the world, which is not an actual disturbance. We analyze the effects of also considering noise in the measurement model, by developing a SOC algorithm based on risk-sensitive control, that includes the dynamics of an observer in such a way that the control law explicitly depends on the current measurement uncertainty. In simulation results on a simple 2D manipulator, we have observed that measurement uncertainty leads to low impedance behaviors, a result in contrast with the effects of process noise that creates stiff behaviors. This suggests that taking into account measurement uncertainty could be a potentially very interesting way to approach problems involving uncertain contact interactions.

1811.01220 2026-06-04 math.OC cs.AI cs.CC cs.NA math.NA

Sharp worst-case evaluation complexity bounds for arbitrary-order nonconvex optimization with inexpensive constraints

任意阶非凸优化的最坏情况评估复杂度界限

Coralia Cartis, Nick I. M. Gould, Philippe L. Toint

发表机构 * Mathematical Institute, Oxford University(牛津大学数学研究所) Computational Mathematics Group, STFC-Rutherford Appleton Laboratory(STFC-拉瑟福德苹果顿实验室计算数学组) Namur Center for Complex Systems (naXys), University of Namur(纳慕尔复杂系统中心(naXys),纳慕尔大学)

AI总结 本文研究了具有低成本约束的任意阶非凸优化问题的最坏情况评估复杂度界限,提出了一种概念性正则化算法,能够在给定精度和最优阶数的情况下,以O(ε^(- (p+1)/(p-q+1)))的次数评估目标函数及其导数,计算出合适的q阶近似极小值点。

Comments 30 pages

详情
Journal ref
SIAM Journal on Optimization,, vol. 30(1), pp. 513-541, 2020
AI中文摘要

我们为非凸最小化问题提供了精确的最坏情况评估复杂度界限,这些问题是具有通用低成本约束的问题,即约束的评估/执行成本相对于目标函数的评估成本可以忽略不计。这些界限统一、扩展或改进了所有已知的无约束和凸约束问题的上界和下界复杂度界限。证明了,在给定精度水平ε,最高可用Lipschitz连续导数阶数p和期望最优阶数q(介于1和p之间)的情况下,一个概念性正则化算法需要不超过O(ε^(- (p+1)/(p-q+1)))次目标函数及其导数的评估,以计算一个合适的q阶近似极小值点。通过适当选择正则化,如果p阶导数仅仅是Hölder连续而非Lipschitz连续,则也得出类似的结果。我们提供了一个例子,说明上述复杂度界限对于无约束和广泛类别的约束问题都是精确的,并且从最坏情况复杂度的角度解释了正则化方法的最优性,限于一大类使用相同导数信息的算法。

英文摘要

We provide sharp worst-case evaluation complexity bounds for nonconvex minimization problems with general inexpensive constraints, i.e.\ problems where the cost of evaluating/enforcing of the (possibly nonconvex or even disconnected) constraints, if any, is negligible compared to that of evaluating the objective function. These bounds unify, extend or improve all known upper and lower complexity bounds for unconstrained and convexly-constrained problems. It is shown that, given an accuracy level $ε$, a degree of highest available Lipschitz continuous derivatives $p$ and a desired optimality order $q$ between one and $p$, a conceptual regularization algorithm requires no more than $O(ε^{-\frac{p+1}{p-q+1}})$ evaluations of the objective function and its derivatives to compute a suitably approximate $q$-th order minimizer. With an appropriate choice of the regularization, a similar result also holds if the $p$-th derivative is merely Hölder rather than Lipschitz continuous. We provide an example that shows that the above complexity bound is sharp for unconstrained and a wide class of constrained problems, we also give reasons for the optimality of regularization methods from a worst-case complexity point of view, within a large class of algorithms that use the same derivative information.

1903.00979 2026-06-04 math.OC cs.LG cs.SY eess.SY math.DS stat.ML

Analysis of a Generalized Expectation-Maximization Algorithm for Gaussian Mixture Models: A Control Systems Perspective

Gaussian混合模型中通用期望-最大化算法的分析:控制系统的视角

Sarthak Chatterjee, Orlando Romero, Sérgio Pequito

发表机构 * Department of Electrical, Computer, and Systems Engineering, Rensselaer Polytechnic Institute(雷德利尔理工学院电子工程与计算机系统系) Department of Industrial and Systems Engineering, Rensselaer Polytechnic Institute(雷德利尔理工学院工业与系统工程系)

AI总结 本文从控制系统的角度分析了Gaussian混合模型中的一种通用期望-最大化算法,探讨了其收敛性质,并通过示例展示了该方法的优势。

Comments 17 pages, 7 figures

详情
AI中文摘要

期望-最大化(EM)算法是无监督学习中解决参数分布基于聚类问题最流行的算法之一。在本文中,我们提出在高斯混合模型的背景下分析一种通用的EM(GEM)算法,其中EM中的最大化步骤被替换为递增步骤。我们证明这种GEM算法可以被理解为具有反馈非线性的线性时不变(LTI)系统。因此,我们利用鲁棒控制理论的工具来探索其收敛性质。最后,我们解释了如何设计所提出的GEM,并通过一个教学示例来理解所提出方法的优势。

英文摘要

The Expectation-Maximization (EM) algorithm is one of the most popular methods used to solve the problem of parametric distribution-based clustering in unsupervised learning. In this paper, we propose to analyze a generalized EM (GEM) algorithm in the context of Gaussian mixture models, where the maximization step in the EM is replaced by an increasing step. We show that this GEM algorithm can be understood as a linear time-invariant (LTI) system with a feedback nonlinearity. Therefore, we explore some of its convergence properties by leveraging tools from robust control theory. Lastly, we explain how the proposed GEM can be designed, and present a pedagogical example to understand the advantages of the proposed approach.

1812.05506 2026-06-04 eess.SY cs.LG cs.SY

A predictive safety filter for learning-based control of constrained nonlinear dynamical systems

基于约束非线性动力学系统的预测安全过滤器

Kim P. Wabersich, Melanie N. Zeilinger

发表机构 * Institute for Dynamic Systems and Control, ETH Zurich, Zurich, Switzerland(动态系统与控制研究所,苏黎世联邦理工学院,瑞士苏黎世)

AI总结 本文提出了一种预测安全过滤器,用于基于学习的控制中处理物理限制下的安全问题,通过将约束动力学系统转换为无约束安全系统,使任何强化学习算法都能直接应用。

详情
AI中文摘要

将强化学习(RL)技术转移到现实应用中的挑战在于存在物理限制下的安全要求。大多数RL方法,特别是最流行的算法,不支持显式考虑状态和输入约束。在本文中,我们针对具有连续状态和输入空间的非线性系统,引入了一种预测安全过滤器,能够将约束动力学系统转换为无约束安全系统,并且任何RL算法都可以直接应用。预测安全过滤器接收提出的控制输入,并基于当前系统状态决定是否可以安全地应用于实际系统,或者是否需要进行修改。安全通过一个不断更新的安全策略来建立,该策略基于数据驱动的系统模型和考虑状态和输入依赖的不确定性,采用模型预测控制的公式。

英文摘要

The transfer of reinforcement learning (RL) techniques into real-world applications is challenged by safety requirements in the presence of physical limitations. Most RL methods, in particular the most popular algorithms, do not support explicit consideration of state and input constraints. In this paper, we address this problem for nonlinear systems with continuous state and input spaces by introducing a predictive safety filter, which is able to turn a constrained dynamical system into an unconstrained safe system and to which any RL algorithm can be applied `out-of-the-box'. The predictive safety filter receives the proposed control input and decides, based on the current system state, if it can be safely applied to the real system, or if it has to be modified otherwise. Safety is thereby established by a continuously updated safety policy, which is based on a model predictive control formulation using a data-driven system model and considering state and input dependent uncertainties.

1902.02311 2026-06-04 cs.MA cs.AI cs.LG cs.SY eess.SY

Decentralized Multi-Agents by Imitation of a Centralized Controller

通过模仿集中控制器实现去中心化多智能体

Alex Tong Lin, Mark J. Debord, Katia Estabridis, Gary Hewer, Guido Montufar, Stanley Osher

发表机构 * UCLA(加州大学洛杉矶分校) Max Planck Institute, Leipzig(莱比锡马克斯·普朗克研究所) University of California, Los Angeles(加州大学洛杉矶分校)

AI总结 本文提出了一种基于集中训练、去中心执行框架的新型算法,通过模仿学习生成去中心化多智能体,解决了多智能体强化学习中非平稳和部分可观测环境下的协作问题。

详情
AI中文摘要

我们考虑了一个多智能体强化学习问题,其中每个智能体试图在与其他智能体交互时最大化共享奖励,且可能无法通信。通常,智能体无法访问其他智能体的策略,因此每个智能体都处于非平稳和部分可观测的环境中。为了获得去中心化作用的多智能体,我们引入了一种新的算法,该算法基于流行的集中训练、去中心执行框架。该训练框架首先通过单一集中联合空间学习者解决多智能体问题,然后用于指导模仿学习以生成独立的去中心化多智能体。该框架具有灵活性,可以使用任何强化学习算法来获得专家,以及任何模仿学习算法来获得去中心化智能体。这与其它多智能体学习算法不同,例如可能需要更具体的结构。我们为该方法提供了一些理论界限,并展示了通过模仿学习可以获得多智能体问题的去中心化解决方案。

英文摘要

We consider a multi-agent reinforcement learning problem where each agent seeks to maximize a shared reward while interacting with other agents, and they may or may not be able to communicate. Typically the agents do not have access to other agent policies and thus each agent is situated in a non-stationary and partially-observable environment. In order to obtain multi-agents that act in a decentralized manner, we introduce a novel algorithm under the popular framework of centralized training, but decentralized execution. This training framework first obtains solutions to a multi-agent problem with a single centralized joint-space learner, which is then used to guide imitation learning for independent decentralized multi-agents. This framework has the flexibility to use any reinforcement learning algorithm to obtain the expert as well as any imitation learning algorithm to obtain the decentralized agents. This is in contrast to other multi-agent learning algorithms that, for example, can require more specific structures. We present some theoretical bounds for our method, and we show that one can obtain decentralized solutions to a multi-agent problem through imitation learning.

1701.00178 2026-06-04 math.OC cs.AI cs.LG cs.SY eess.SY stat.ML

Lazily Adapted Constant Kinky Inference for Nonparametric Regression and Model-Reference Adaptive Control

惰性适应的常数Kinky推断用于非参数回归和模型参考自适应控制

Jan-Peter Calliess

发表机构 * Dept. of Engineering Science University of Oxford, UK(工程科学系 奥克斯福德大学 英国)

AI总结 本文提出了一种惰性适应的常数Kinky推断方法,用于非参数回归和模型参考自适应控制,通过在线估计Hölder常数并建立强通用逼近保证,展示了在密集数据下学习任意连续函数的能力。

详情
AI中文摘要

非线性集合成员预测、Lipschitz插值或Kinky推断是机器学习中利用预设Lipschitz性质来计算未观测函数值推断的方法。在已知目标函数真实最佳Lipschitz常数的上界时,这些方法提供收敛保证和预测的界限。考虑一个更一般的设置,该设置基于相对于伪度量的Hölder连续性,我们提出了一种在线方法,用于从可能受有界观测误差影响的函数值观测中估计Hölder常数。利用此方法在Kinky推断规则中计算自适应参数,从而得到一种非参数机器学习方法,我们为此建立了强通用逼近保证。也就是说,我们证明我们的预测规则在数据越来越密集的情况下,可以学习任意连续函数,其最坏误差界取决于观测不确定性水平。我们在非参数模型参考自适应控制(MRAC)的背景下应用了我们的方法。在一系列模拟飞机滚动动力学和性能指标中,我们的方法优于基于高斯过程和RBF神经网络最近提出的方法。对于离散时间系统,我们为我们的基于学习的控制器在批量学习和在线学习设置下的跟踪成功率提供了保证。

英文摘要

Techniques known as Nonlinear Set Membership prediction, Lipschitz Interpolation or Kinky Inference are approaches to machine learning that utilise presupposed Lipschitz properties to compute inferences over unobserved function values. Provided a bound on the true best Lipschitz constant of the target function is known a priori they offer convergence guarantees as well as bounds around the predictions. Considering a more general setting that builds on Hoelder continuity relative to pseudo-metrics, we propose an online method for estimating the Hoelder constant online from function value observations that possibly are corrupted by bounded observational errors. Utilising this to compute adaptive parameters within a kinky inference rule gives rise to a nonparametric machine learning method, for which we establish strong universal approximation guarantees. That is, we show that our prediction rule can learn any continuous function in the limit of increasingly dense data to within a worst-case error bound that depends on the level of observational uncertainty. We apply our method in the context of nonparametric model-reference adaptive control (MRAC). Across a range of simulated aircraft roll-dynamics and performance metrics our approach outperforms recently proposed alternatives that were based on Gaussian processes and RBF-neural networks. For discrete-time systems, we provide guarantees on the tracking success of our learning-based controllers both for the batch and the online learning setting.

1903.08818 2026-06-04 eess.SY cs.RO cs.SY

Contingency Model Predictive Control for Automated Vehicles

应急模型预测控制用于自动驾驶车辆

John P. Alsterda, Matthew Brown, J. Christian Gerdes

发表机构 * Department of Mechanical Engineering, Stanford University(斯坦福大学机械工程系)

AI总结 本文提出了一种新的应急模型预测控制(CMPC)框架,该框架在跟踪期望路径的同时,维护一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon,CMPC能够预见可能发生的情况,而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC,展示了CMPC在应对潜在摩擦损失(如冰面)时的有效性。

Comments American Control Conference, July 2019; 6 pages

详情
Journal ref
IEEE American Control Conference (ACC) (2019) 717-722
AI中文摘要

我们提出了应急模型预测控制(CMPC),一种新颖且可实施的控制框架,该框架在跟踪期望路径的同时,同时维护一个应急计划——一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon,CMPC能够预见可能发生的情况,而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC,展示了CMPC在应对潜在摩擦损失(如冰面)时的有效性。

英文摘要

We present Contingency Model Predictive Control (CMPC), a novel and implementable control framework which tracks a desired path while simultaneously maintaining a contingency plan -- an alternate trajectory to avert an identified potential emergency. In this way, CMPC anticipates events that might take place, instead of reacting when emergencies occur. We accomplish this by adding an additional prediction horizon in parallel to the classical receding MPC horizon. The contingency horizon is constrained to maintain a feasible avoidance solution; as such, CMPC is selectively robust to this emergency while tracking the desired path as closely as possible. After defining the framework mathematically, we demonstrate its effectiveness experimentally by comparing its performance to a state-of-the-art deterministic MPC. The controllers drive an automated research platform through a left-hand turn which may be covered by ice. Contingency MPC prepares for the potential loss of friction by purposefully and intuitively deviating from the prescribed path to approach the turn more conservatively; this deviation significantly mitigates the consequence of encountering ice.

1905.04403 2026-06-04 eess.SY cs.LG cs.SY

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games

PAC统计模型检验用于马尔可夫决策过程和随机游戏

Pranav Ashok, Jan Křetínský, Maximilian Weininger

发表机构 * Technical University of Munich, Germany(慕尼黑技术大学)

AI总结 本文提出了一种用于马尔可夫决策过程和随机游戏的PAC统计模型检验算法,该算法在不完全了解转移函数的情况下,能够提供概率近似正确性保证,且在实际应用中效率较高。

详情
AI中文摘要

统计模型检验(SMC)是一种用于分析概率系统的技术,这些系统可能(部分)未知。我们提出了一种用于无界可达性的SMC算法,该算法能够提供概率近似正确(PAC)的保证。我们考虑了两种情况:(i)没有转移函数的知识(仅需一个转移概率的下界)和(ii)了解底层图的拓扑结构。一方面,这是首个针对随机游戏的算法;另一方面,即使对于马尔可夫决策过程,这也是首个实用的算法。与之前需要运行时间超过宇宙年龄的方法相比,我们的算法通常可以在几分钟内得到合理精确的结果,不需要了解混合时间或整个模型的拓扑结构。

英文摘要

Statistical model checking (SMC) is a technique for analysis of probabilistic systems that may be (partially) unknown. We present an SMC algorithm for (unbounded) reachability yielding probably approximately correct (PAC) guarantees on the results. We consider both the setting (i) with no knowledge of the transition function (with the only quantity required a bound on the minimum transition probability) and (ii) with knowledge of the topology of the underlying graph. On the one hand, it is the first algorithm for stochastic games. On the other hand, it is the first practical algorithm even for Markov decision processes. Compared to previous approaches where PAC guarantees require running times longer than the age of universe even for systems with a handful of states, our algorithm often yields reasonably precise results within minutes, not requiring the knowledge of mixing time or the topology of the whole model.

1904.09841 2026-06-04 cs.DS cs.LG cs.NA math.NA

Simple Heuristics Yield Provable Algorithms for Masked Low-Rank Approximation

简单的启发式方法可证明算法用于带掩码的低秩近似

Cameron Musco, Christopher Musco, David P. Woodruff

发表机构 * UMass Amherst(马萨诸塞大学阿默斯特分校) New York University(纽约大学) Carnegie Mellon University(卡内基梅隆大学)

AI总结 本文研究了带掩码的低秩近似问题,提出了一种简单的启发式方法,通过将掩码为0的区域设为0,然后求解标准低秩近似,从而得到具有双标准逼近保证的算法。

Comments ITCS 2021

详情
AI中文摘要

在$masked\ low-rank\ approximation$中,给定$A \in \mathbb{R}^{n imes n}$和二进制掩码矩阵$W \in \{0,1\}^{n imes n}$。目标是找到一个秩为$k$的矩阵$L$,使得$$cost(L) = \sum_{i=1}^{n} \sum_{j = 1}^{n} W_{i,j} \cdot (A_{i,j} - L_{i,j} )^2 \leq OPT + ε\|A\|_F^2 ,$$其中$OPT = \min_{rank-k\ \hat{L}} cost(\hat L)$,$ε$是一个给定的误差参数。根据$W$的不同选择,该问题捕捉到因子分析、低秩加对角分解、鲁棒PCA、低秩矩阵补全、低秩加块矩阵近似以及许多问题。许多这些问题都是NP难的,尽管已有一些具有证明保证的算法,但它们要么1) 运行时间是$n^{Ω(k^2/ε)}$,要么2) 做出强假设,例如$A$是不相干的或$W$是随机的。在本工作中,我们证明了一个常见的多项式时间启发式方法,即简单地将$W$为0的区域设为0,然后找到标准低秩近似,可以为该问题提供双标准逼近保证。特别是,对于秩为$k' > k$,取决于$W$的public\ coin\ partition\ number,该启发式方法输出秩为$k'$的$L$,其成本$(L) \leq OPT + ε\|A\|_F^2$。这个partition number反过来由$W$作为两个玩家通信矩阵时的randomized\ communication\ complexity$所限制。对于许多重要的带掩码低秩近似示例,包括上述所有问题,该结果提供了具有$k' = k \cdot poly(\log n/ε)$的双标准逼近保证。此外,我们还显示了不同的通信模型为带掩码低秩近似的自然变种提供了算法。例如,多玩家number-in-hand通信复杂度与带掩码张量分解相关,而非确定性通信复杂度与带掩码布尔低秩分解相关。

英文摘要

In $masked\ low-rank\ approximation$, one is given $A \in \mathbb{R}^{n \times n}$ and binary mask matrix $W \in \{0,1\}^{n \times n}$. The goal is to find a rank-$k$ matrix $L$ for which: $$cost(L) = \sum_{i=1}^{n} \sum_{j = 1}^{n} W_{i,j} \cdot (A_{i,j} - L_{i,j} )^2 \leq OPT + ε\|A\|_F^2 ,$$ where $OPT = \min_{rank-k\ \hat{L}} cost(\hat L)$ and $ε$ is a given error parameter. Depending on the choice of $W$, this problem captures factor analysis, low-rank plus diagonal decomposition, robust PCA, low-rank matrix completion, low-rank plus block matrix approximation, and many problems. Many of these problems are NP-hard, and while some algorithms with provable guarantees are known, they either 1) run in time $n^{Ω(k^2/ε)}$ or 2) make strong assumptions, e.g., that $A$ is incoherent or that $W$ is random. In this work, we show that a common polynomial time heuristic, which simply sets $A$ to $0$ where $W$ is $0$, and then finds a standard low-rank approximation, yields bicriteria approximation guarantees for this problem. In particular, for rank $k' > k$ depending on the $public\ coin\ partition\ number$ of $W$, the heuristic outputs rank-$k'$ $L$ with cost$(L) \leq OPT + ε\|A\|_F^2$. This partition number is in turn bounded by the $randomized\ communication\ complexity$ of $W$, when interpreted as a two-player communication matrix. For many important examples of masked low-rank approximation, including all those listed above, this result yields bicriteria approximation guarantees with $k' = k \cdot poly(\log n/ε)$. Further, we show that different models of communication yield algorithms for natural variants of masked low-rank approximation. For example, multi-player number-in-hand communication complexity connects to masked tensor decomposition and non-deterministic communication complexity to masked Boolean low-rank factorization.

1903.07214 2026-06-04 eess.SY cs.LG cs.SY

A Control Lyapunov Perspective on Episodic Learning via Projection to State Stability

从控制李雅普诺夫视角看通过投影到状态稳定性进行片段学习

Andrew J. Taylor, Victor D. Dorobantu, Meera Krishnamoorthy, Hoang M. Le, Yisong Yue, Aaron D. Ames

发表机构 * California Institute of Technology(加州理工学院)

AI总结 本文从李雅普诺夫函数视角探讨学习对控制合成的影响,提出投影到状态稳定性(PSS)概念,用于表征CLF对系统不确定数据的鲁棒性,并展示如何利用PSS在仿射控制中限制不确定性,实现鲁棒控制合成。

详情
AI中文摘要

本文的目标是从李雅普诺夫函数视角理解学习对控制合成的影响。具体而言,而不是考虑完整系统动态中的不确定性,我们采用控制李雅普诺夫函数(CLFs)作为低维投影。为了理解和表征这些投影动态引入的不确定性,我们引入了一个新概念:投影到状态稳定性(PSS)。PSS可以看作是定义在投影动态上的输入到状态稳定性变种,能够表征CLF对用于学习系统不确定性的数据的鲁棒性。我们使用PSS来限制仿射控制中的不确定性,并展示一种实用的片段学习方法可以利用PSS来表征CLF中的不确定性,以实现鲁棒控制合成。

英文摘要

The goal of this paper is to understand the impact of learning on control synthesis from a Lyapunov function perspective. In particular, rather than consider uncertainties in the full system dynamics, we employ Control Lyapunov Functions (CLFs) as low-dimensional projections. To understand and characterize the uncertainty that these projected dynamics introduce in the system, we introduce a new notion: Projection to State Stability (PSS). PSS can be viewed as a variant of Input to State Stability defined on projected dynamics, and enables characterizing robustness of a CLF with respect to the data used to learn system uncertainties. We use PSS to bound uncertainty in affine control, and demonstrate that a practical episodic learning approach can use PSS to characterize uncertainty in the CLF for robust control synthesis.