Carbon-Aware Intrusion Detection: A Comparative Study of Supervised and Unsupervised DRL for Sustainable IoT Edge Gateways
碳感知入侵检测:监督与非监督DRL在可持续物联网边缘网关中的比较研究
Saeid Jamshidi, Foutse Khomh, Kawser Wazed Nafi, Amin Nikanjam, Samira Keivanpour, Omar Abdul-Wahab, Martine Bellaiche
AI总结 本文提出两种基于深度强化学习的入侵检测系统,通过理论分析和实验评估验证其在边缘网关上的有效性,同时引入碳感知的多目标奖励机制,以实现可持续的实时入侵检测。
详情
物联网的快速扩展加剧了网络安全挑战,尤其是在网络边缘缓解分布式拒绝服务(DDoS)攻击方面。传统入侵检测系统(IDS)面临显著局限,包括对演变和零日攻击适应性差、依赖静态签名和标记数据集以及在资源受限的边缘网关上效率低下。此外,大多数现有基于DRL的IDS研究忽略了可持续性因素,如能源效率和碳影响。为了解决这些挑战,本文提出了两种新型基于深度强化学习(DRL)的IDS:DeepEdgeIDS,一种无标签的自编码器-DRL混合模型,以及AutoDRL-IDS,一种监督的LSTM-DRL模型。这两种基于DRL的IDS通过理论分析和在边缘网关上的实验评估进行了验证。结果表明,AutoDRL-IDS在使用标记数据时达到94%的检测准确率,而DeepEdgeIDS通过无标签的异常检测和在线缓解反馈达到98%的离线评估准确率。本研究引入了碳感知的多目标奖励公式,支持对AutoDRL-IDS的监督奖励优化和对DeepEdgeIDS的无标签在线奖励学习,从而在动态物联网网络中实现可持续的实时IDS操作。
The rapid expansion of the Internet of Things (IoT) has intensified cybersecurity challenges, particularly in mitigating Distributed Denial-of-Service (DDoS) attacks at the network edge. Traditional Intrusion Detection Systems (IDSs) face significant limitations, including poor adaptability to evolving and zero-day attacks, reliance on static signatures and labeled datasets, and inefficiency on resource-constrained edge gateways. Moreover, most existing DRL-based IDS studies overlook sustainability factors such as energy efficiency and carbon impact. To address these challenges, this paper proposes two novel Deep Reinforcement Learning (DRL)-based IDS: DeepEdgeIDS, a label-free Autoencoder-DRL hybrid, and AutoDRL-IDS, a supervised LSTM--DRL model. Both DRL-based IDS are validated through theoretical analysis and experimental evaluation on edge gateways. Results demonstrate that AutoDRL-IDS achieves 94% detection accuracy using labeled data, while DeepEdgeIDS attains 98% offline evaluation accuracy through label-free anomaly detection and online mitigation feedback. This study introduces a carbon-aware, multi-objective reward formulation that supports supervised reward optimization for AutoDRL-IDS and label-free online reward learning for DeepEdgeIDS, enabling sustainable real-time IDS operation in dynamic IoT networks.