arXivDaily arXiv每日学术速递 周一至周五更新

AI 大模型

AI Agent

智能体、工具调用、规划、工作流、多智能体和自主任务执行。

今日/当前日期收录 91 信号源:cs.AI, cs.CL, cs.LG, cs.SE
2505.03863 2026-06-18 cs.CR cs.AI 55%

Data-Driven Falsification of Cyber-Physical Systems

数据驱动的物理系统验证

Atanu Kundu, Sauvik Gon, Rajarshi Ray

发表机构 * Indian Association for the Cultivation of Science(印度科学培养协会)

专题命中 其他Agent :数据驱动验证物理系统,涉及智能体验证

AI总结 本文提出一种框架,将物理系统验证与深度神经网络验证联系起来,并利用决策树的可解释性加速验证过程,展示了在ARCH-COMP 2024基准测试中高效发现多个反例的潜力。

详情
AI中文摘要

物理系统(CPS)在医疗、航空电子和自动驾驶等安全关键领域中普遍存在。因此,对其操作安全性的形式验证至关重要。本文针对验证问题,即寻找系统中的不安全执行而非证明其不存在。本文的贡献是提出一个框架,将CPS的验证与深度神经网络(DNN)的验证联系起来,并利用决策树的内在可解释性加速CPS的验证。这通过构建被测CPS的替代模型(作为DNN模型或决策树),应用各种DNN验证工具来验证CPS,并通过从其决策树替代模型中提取的安全违规解释来指导新的验证算法实现。所提出的框架有潜力利用一系列设计用于验证DNN鲁棒性属性的对抗攻击算法,以及最先进的DNN验证算法。尽管所提出的 methodology 可应用于可以执行或模拟的一般系统,但我们特别展示了其在CPS中的有效性。我们展示了我们的框架,作为工具FlexiFal,能够检测具有线性和非线性动态的CPS中难以发现的反例。决策树引导的验证在ARCH-COMP 2024验证基准测试中显示出有希望的结果。

英文摘要

Cyber-Physical Systems (CPS) are abundant in safety-critical domains such as healthcare, avionics, and autonomous vehicles. Formal verification of their operational safety is, therefore, of utmost importance. In this paper, we address the falsification problem, where the focus is on searching for an unsafe execution in the system instead of proving their absence. The contribution of this paper is a framework that (a) connects the falsification of CPS with the falsification of deep neural networks (DNNs) and (b) leverages the inherent interpretability of Decision Trees for faster falsification of CPS. This is achieved by: (1) building a surrogate model of the CPS under test, either as a DNN model or a Decision Tree, (2) application of various DNN falsification tools to falsify CPS, and (3) a novel falsification algorithm guided by the explanations of safety violations of the CPS model extracted from its Decision Tree surrogate. The proposed framework has the potential to exploit a repertoire of \emph{adversarial attack} algorithms designed to falsify robustness properties of DNNs, as well as state-of-the-art falsification algorithms for DNNs. Although the presented methodology is applicable to systems that can be executed/simulated in general, we demonstrate its effectiveness, particularly in CPS. We show that our framework, implemented as a tool \textsc{FlexiFal}, can detect hard-to-find counterexamples in CPS that have linear and non-linear dynamics. Decision tree-guided falsification shows promising results in efficiently finding multiple counterexamples in the ARCH-COMP 2024 falsification benchmarks~\cite{khandait2024arch}.