arXivDaily arXiv每日学术速递 周一至周五更新
2606.19968 2026-06-19 cs.GT 新提交

Beyond Lower Quota: Avoiding Overrepresentation in Multi-Winner Voting

超越最低配额:避免多赢者投票中的过度代表

Anton Baychkov, Martin Lackner, Jan Maly, Oliviero Nardi, Jannik Peters

AI总结 本文提出避免过度代表的公理JUQ,引入复合Thiele规则并刻画满足该公理的Adams-AV规则,同时提出平衡避免不足与过度代表的公理JNQ。

Comments This is an extended version of the publication with the same name in the proceedings of EC 2026

详情
AI中文摘要

最近,在社会选择文献中,避免基于批准的多赢者投票中代表不足的问题受到了广泛关注。本文探讨了被广泛忽视的互补问题——避免过度代表。尽管这是一个具有具体应用的理想性质,但尚未被系统研究。直观上,过度代表发生在一个群体决定了委员会中不成比例的大部分席位,从而超过了该群体的配额。我们提出了一个强且吸引人的避免过度代表的公理,称为可证明的上限配额(JUQ)。我们引入了Thiele规则的一个推广——复合Thiele规则,并刻画了该类中满足我们公理的唯一规则。该规则Adams-AV自然地扩展了Adams分配方法,此前未被研究。此外,我们引入了一个满足JUQ的多项式时间规则。进一步,我们引入了有理由的接近配额(JNQ),这是一个平衡避免不足和过度代表的公理。它刻画了扩展Sainte-Laguë分配方法的唯一Thiele规则。最后,我们分析了我们的公理与已建立的比例性概念(如EJR+)的兼容性。

英文摘要

Recently, in the social choice literature, much attention has been given to the question of avoiding underrepresentation in approval-based multi-winner voting. In this paper, we explore the largely overlooked complementary question of avoiding overrepresentation. This has not been explored systematically, despite being a desirable property with concrete applications. Intuitively, overrepresentation happens when a group determines a disproportionately large part of the committee, thereby exceeding the group's quota. We formulate a strong and appealing axiom for avoiding overrepresentation, called justifiable upper quota (JUQ). We introduce a generalization of Thiele rules, composite Thiele rules, and characterize the unique rule in this class satisfying our axiom. This rule, Adams-AV, which naturally extends Adams' apportionment method, has not been studied before. Additionally, we introduce a polynomial-time rule that satisfies JUQ. Furthermore, we introduce justified near quota, an axiom that balances avoiding under- and overrepresentation. It characterizes the unique Thiele rule extending the Sainte-Laguë apportionment method. Finally, we analyze the compatibility of our axioms with established proportionality notions such as EJR+.

2606.19618 2026-06-19 cs.GT 新提交

Joint-task truthfulness of the DMI mechanism

DMI机制的联合任务真实性

Rafael Frongillo

AI总结 研究DMI机制在联合任务策略下的真实性,证明当其他代理使用一致策略时,真实报告仍是贝叶斯-纳什均衡,但无限制时主导真实性和知情真实性均失效。

详情
AI中文摘要

Kong (2020, 2024) 的 Determinant Mutual Information (DMI) 机制在*一致*报告策略类(即对每个任务应用相同单任务策略)中是主导真实的。在代理在报告前看到多个任务的环境中,如同行评分或同行评审,考虑*联合任务*策略(可能根据完整信号向量调整报告)是自然的。令人惊讶的是,我们证明当其他代理使用一致策略时,DMI机制在所有联合任务策略中保留真实报告作为最佳响应,因此真实性在联合任务类中仍然是贝叶斯-纳什均衡。然而,如果没有同行限制为一致策略,主导真实性和知情真实性均无法对抗联合任务同行策略。

英文摘要

The Determinant Mutual Information (DMI) mechanism of Kong (2020, 2024) is dominantly truthful within the class of *consistent* reporting strategies, those that apply the same single-task strategy to every task. In settings where agents see multiple tasks before reporting, such as peer grading or peer review, it is natural to consider *joint-task* strategies that may condition reports on the full signal vector. Perhaps surprisingly, we show that the DMI mechanism preserves truthful reporting as a best response among all joint-task strategies when other agents play consistent strategies, so that truthfulness remains a Bayes--Nash equilibrium in the joint-task class. Without the restriction of peers to consistent strategies, however, both dominant truthfulness and informed truthfulness fail against joint-task peer strategies.

2606.16326 2026-06-19 cs.GT cs.AI q-fin.RM 新提交

Gaming-Resistant Insurance Contracts for Autonomous AI Agents: Strategy-Proof Toll Mechanism Design

自主AI代理的抗博弈保险合约:策略证明的通行费机制设计

Hao-Hsuan Chen

发表机构 * Hao-Hsuan Chen(何浩轩)

AI总结 本文扩展了时间一致精算运行时的框架,使运营商策略化,刻画了自主AI代理保险合约的五种攻击空间,并证明了精算运行时的抗博弈性,通过新合约条款实现激励兼容。

Comments 29 pages. Companion to arXiv:2605.26508 (Paper A, foundations) and arXiv:2605.25632 (Paper B, empirical)

详情
AI中文摘要

论文A定义了一个时间一致的精算运行时,该运行时根据合约固定的安全默认值对每个产生副作用的行动定价,并针对储备预算门控执行。它将运营商视为被动。本文使运营商策略化。我们刻画了自主AI代理保险合约的五种攻击空间,并证明了精算运行时何时具有抗博弈性。两种攻击面——通行费后的安全默认选择以及边界内的行动分割——通过论文A的最小权限和无分割条款得以关闭。其余三种需要新的合约条款。首先,公共控制聚合防止跨边界重新路由将通行费降低到应用于总暴露的边界潜力以下。其次,接口故障(如无效JSON)是合约相关事件,而非安全胜利:将其视为零通行费安全默认值可能奖励不可靠的模型,而升级费用则逆转了激励。我们通过来自配套实证论文的跨模型轨迹验证了这一接口合规定理。第三,一个带有分量最小惩罚计划的模型身份菜单使得部署模型的真实报告成为弱占优策略。然后,我们将这些条款与论文A的运行时保证组合,以获得在五种攻击空间上的联合激励兼容性。最后,一个双参数保费族在真实均衡下满足了运营商个体理性和弱预算平衡。结果是为自主代理副作用的精算控制提供了一个激励兼容层。

英文摘要

Paper A defines a time-consistent actuarial runtime that prices each side-effect-bearing action against a contractually fixed safe default and gates execution against a reserve budget. It treats the operator as passive. This paper makes the operator strategic. We characterise a five-attack space for autonomous AI-agent insurance contracts and prove when the actuarial runtime is gaming-resistant. Two attack surfaces -- post-toll safe-default selection and within-boundary action splitting -- are closed by Paper A's minimal-authority and no-splitting clauses. The remaining three require new contract clauses. First, common-control aggregation prevents cross-boundary re-routing from reducing toll below the boundary potential applied to total exposure. Second, interface failures such as invalid JSON are contract-relevant events, not safety wins: treating them as zero-toll safe defaults can reward unreliable models, while escalation fees reverse the incentive. We validate this interface-compliance theorem on committed cross-model traces from the companion empirical paper. Third, a model-identity menu with a componentwise-minimum penalty schedule makes truthful reporting of the deployed model weakly dominant. We then compose these clauses with Paper A's runtime guarantees to obtain joint incentive compatibility over the five-attack space. Finally, a two-parameter premium family discharges operator individual rationality and weak budget balance at the truthful equilibrium. The result is an incentive-compatibility layer for actuarial control of autonomous-agent side effects.

2606.20232 2026-06-19 cs.RO cs.GT 交叉投稿

Mobile Target Search with Imperfect Perception: A Partially Observable Stochastic Game Theoretical Approach

不完美感知下的移动目标搜索:一种部分可观测随机博弈论方法

Hanzheng Zhang, Shu Liang, Shuyu Liu

发表机构 * Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University(同济大学上海自主智能无人系统科学中心) Department of Control Science and Engineering, Tongji University(同济大学控制科学与工程系)

AI总结 针对传感器限制、恶意干扰或通信噪声导致的不完美感知,采用部分可观测随机博弈(POSG)框架建模搜索者与目标间的对抗互动,提出可检测性概念和基于随机递归分析的充分判据,并开发服务器辅助分布式算法。

详情
AI中文摘要

本文研究了在传感器限制、恶意干扰或通信噪声导致的不完美感知下的移动目标搜索问题。搜索者和目标在具有有限移动性的网格状区域中运行,导致搜索与逃避之间的动态相互作用。为了捕捉不完美感知下的这种对抗互动,我们采用部分可观测随机博弈(POSG)方法,该方法通过引入目标智能来推广部分可观测马尔可夫决策过程(POMDP)。为了处理感知不确定性引起的虚警和漏检,我们提出了一种新颖的可检测性概念,以确定搜索策略是否能保证最终检测,并基于随机递归分析提供了充分的可检测性准则。我们进一步开发了一种服务器辅助的分布式算法,该算法利用搜索者的聚合势博弈结构和基于KL散度的目标预测约简。数值模拟验证了所提算法的有效性,并支持了可检测性分析。

英文摘要

This paper investigates mobile target search under imperfect perceptions caused by sensor limitations, malicious jamming, or communication noise. Searchers and targets operate in a grid-shaped area with bounded mobility, leading to a dynamic interplay between search and evasion. To capture this adversarial interaction under imperfect perceptions, we adopt the partially observable stochastic game (POSG) approach, which generalizes partially observable Markov decision processes (POMDPs) by incorporating target intelligence. To handle false alarms and missed detections caused by perceptual uncertainties, we propose a novel detectability concept to determine whether a search strategy guarantees eventual detection, and provide sufficient detectability criteria based on stochastic recurrence analysis. We further develop a server-assisted distributed algorithm that utilizes the aggregative potential game structure for searchers and a KL-divergence-based reduction for target prediction. Numerical simulations validate the effectiveness of the proposed algorithm and support the detectability analysis.

2606.19695 2026-06-19 eess.SY cs.GT cs.SY math.OC 交叉投稿

A Unified Framework for Joint Sensor Placement and Scheduling for Intrusion Detection

入侵检测中联合传感器放置与调度的统一框架

Jayanth Bhargav, Mahsa Ghasemi, Shreyas Sundaram

AI总结 提出一个统一框架,将传感器放置与方向调度联合优化,通过博弈论设计效用函数并利用弱子模性实现近最优检测性能。

Comments 27 pages, 4 figures

详情
AI中文摘要

我们考虑一个入侵检测任务,其中防御者必须联合优化传感器放置位置和方向,以最小化入侵者穿越受保护环境时被漏检的概率。我们将此问题分解为一个元问题(称为SensorPlacement)和一个嵌入的子问题(称为OrientationScheduling)。对于固定的传感器放置,OrientationScheduling子问题被建模为防御者和入侵者之间的两人零和博弈,其中防御者寻求已部署传感器的方向策略以最小化漏检概率,而入侵者则寻求路径选择策略以最大化该概率。由于防御者的策略空间随传感器数量和方向组合增长,通过标准线性规划求解博弈变得不可行。为此,我们开发了一种迭代且高效的均衡求解算法,该算法利用博弈收益函数的结构,并建立了收敛到博弈纳什均衡(NE)的理论保证。该NE值随后被用作SensorPlacement元问题中的效用度量。我们证明了这个基于博弈值的效用函数在传感器放置集合上是弱子模的,并提出了一个具有近最优性保证的贪婪放置算法。据我们所知,这是第一个将博弈论效用设计与(弱)子模优化相结合的统一框架,实现了传感器放置和方向调度的原则性联合优化。通过大量仿真,我们证明所提出的方法实现了近最优的检测性能,同时与基线相比显著减少了计算时间。

英文摘要

We consider an intrusion detection task in which a defender must jointly optimize sensor placement locations and orientations to minimize the probability of missed detection of an intruder traversing a protected environment. We decompose this problem into a meta problem, termed SensorPlacement, and an embedded subproblem, termed OrientationScheduling. The OrientationScheduling subproblem, for a fixed sensor placement, is modeled as a 2-player zero-sum game between the defender and the intruder, where the defender seeks an orientation strategy for the deployed sensors to minimize the probability of missed detection, while the intruder seeks a path selection strategy to maximize it. Since the defender's strategy space grows combinatorially with the number of sensors and orientations, solving the game via standard linear programming becomes prohibitive. To this end, we develop an iterative and efficient equilibrium-seeking algorithm that exploits the structure of the game's payoff function and establishes theoretical guarantees for convergence to the Nash equilibrium (NE) of the game. This NE value is then used as a utility measure in the SensorPlacement meta problem. We show that this game-value-based utility function is weakly submodular over the set of sensor placements and propose a greedy placement algorithm with near-optimality guarantees. To our knowledge, this is the first unified framework to integrate game-theoretic utility design with (weak) submodular optimization, enabling principled joint optimization of sensor placement and orientation scheduling. Through extensive simulations, we demonstrate that the proposed approach achieves near-optimal detection performance while significantly reducing computation time compared to baselines.

2606.18679 2026-06-19 cs.DS cs.GT cs.LG math.OC 交叉投稿

Fair Online Resource Allocation

公平在线资源分配

Christopher En, Yuri Faenza, Andrea Lodi, Gonzalo Muñoz

发表机构 * Columbia University, IEOR Department(哥伦比亚大学工业工程与运营研究系) Cornell Tech(康奈尔科技学院) Universidad de Chile(智利大学)

AI总结 研究在线资源分配中的公平性问题,提出基于对偶镜像下降的算法,在批次内强制执行公平约束,实现亚线性遗憾,并通过难民数据验证了福利与公平的权衡。

Comments 30 pages, 4 figures. To appear in the proceedings of EC 2026

详情
AI中文摘要

我们研究公平在线资源分配问题,其动机源于难民安置和航班调度等应用,其中代理顺序到达并必须分配到容量有限的设施。我们引入一个模型,在资源约束和Lipschitz公平性要求下最大化整体福利,该要求确保同一批次中到达的相似代理获得相似的预期结果。我们首先分析离线问题,证明最优公平分配的价值至少是最优不公平分配的$\Omega(1/\gamma)$倍,其中$\gamma$是公平系数,从而界定了公平的代价。对于在线设置,我们提出一种基于对偶镜像下降的算法,该算法在估计最优对偶变量的同时,在批次内强制执行公平约束。我们证明该算法相对于最优离线流体基准实现了亚线性遗憾。最后,我们使用难民经济项目的真实数据验证了理论结果,展示了算法的性能,并考察了福利最大化与公平执行之间的权衡。

英文摘要

We study the problem of fair online resource allocation, motivated by applications such as refugee resettlement and airline scheduling, where agents arrive sequentially and must be assigned to facilities with limited capacities. We introduce a model that maximizes the overall welfare subject to resource constraints and a Lipschitz fairness requirement, which ensures that similar agents arriving in the same batch receive similar expected outcomes. We first analyze the offline problem, proving that the value of the optimal fair allocation is at least an $Ω(1/γ)$ fraction of the optimal unfair allocation, where $γ$ is the fairness coefficient, thereby bounding the price of fairness. For the online setting, we propose an algorithm based on dual mirror descent that enforces fairness constraints within batches while estimating optimal dual variables. We prove that this algorithm achieves sublinear regret relative to the optimal offline fluid benchmark. Finally, we validate our theoretical results using real-world data from the Refugee Economies Programme, demonstrating the algorithm's performance and examining the trade-offs between welfare maximization and fairness enforcement.

2604.27276 2026-06-19 cs.GT cs.CC 版本更新

Fisher Markets with Approximately Optimal Bundles and the Need for a PCP Theorem for PPAD

具有近似最优束的Fisher市场与PPAD的PCP定理的必要性

Argyrios Deligkas, John Fearnley, Alexandros Hollender, Themistoklis Melissourgos

AI总结 研究在SPLC效用函数的Fisher市场中计算具有近似最优束的竞争均衡的PPAD困难性,证明在PCP-for-PPAD猜想下存在常数δ>0使得问题为PPAD难,且该猜想对证明困难性是必要的。

详情
AI中文摘要

我们研究了在具有可分分段线性凹(SPLC)效用函数的Fisher市场中计算具有近似最优束的竞争均衡的问题,即每个买家收到一个$(1-\delta)$-最优束,而不是完全最优的束。我们首次建立了该问题的难解性结果,通过证明在PCP-for-PPAD猜想下,对于某个常数$\delta > 0$,该问题是PPAD难的。即使所有买家具有相同的预算(等收入竞争均衡)、线性上限效用函数,并且即使我们允许$\varepsilon$-近似清算而不是完全清算,对于任何常数$\varepsilon < 1/9$,该困难性结果仍然成立。重要的是,我们表明PCP-for-PPAD猜想实际上对于证明常数$\delta$的困难性是必要的:在包含我们困难性结果所生成市场的一类广泛市场中,展示寻找此类近似市场均衡的PPAD困难性将证明该猜想。这是第一个自然问题,其中该猜想被证明是建立其困难性所必需的。

英文摘要

We study the problem of computing a competitive equilibrium with approximately optimal bundles in Fisher markets with separable piecewise-linear concave (SPLC) utility functions, meaning that every buyer receives a $(1-δ)$-optimal bundle, instead of a perfectly optimal one. We establish the first intractability result for the problem by showing that it is PPAD-hard for some constant $δ> 0$, assuming the PCP-for-PPAD conjecture. This hardness result holds even if all buyers have identical budgets (competitive equilibrium with equal incomes), linear capped utilities, and even if we also allow $\varepsilon$-approximate clearing instead of perfect clearing, for any constant $\varepsilon < 1/9$. Importantly, we show that the PCP-for-PPAD conjecture is in fact required to show hardness for constant $δ$: showing PPAD-hardness for finding such approximate market equilibria in a broad class of markets encompassing those generated by our hardness result would prove the conjecture. This is the first natural problem where the conjecture is provably required to establish hardness for it.

2602.04115 2026-06-19 cs.GT 版本更新

Robustness of Stable Matchings When Attributes and Salience Determine Preferences

当属性和显著性决定偏好时稳定匹配的鲁棒性

Amit Ronen, S. S. Ravi, Sarit Kraus

AI总结 研究匹配市场中属性向量和显著性向量扰动下稳定匹配的鲁棒性,提出多项式时间算法验证和计算鲁棒半径,并设计近似最鲁棒匹配的搜索算法。

Comments Accepted to AAMAS 2026. This arXiv version contains the full appendix. Version 2 removes two appendix sections containing an incorrect auxiliary argument. All main results remain unchanged

详情
AI中文摘要

在许多匹配市场中——例如运动员招募或学术招生——一方的参与者通过另一方已知的属性向量进行评估,而另一方则应用个体的显著性向量来赋予这些属性相对重要性。由于显著性在实践中会发生变化,一个核心问题随之产生:稳定匹配对此类扰动的鲁棒性如何?我们在此背景下解决了几个基本问题。首先,我们将鲁棒性形式化为一个半径,在该半径内,稳定匹配在显著性向量的任何可容许扰动下(假设已归一化)仍能免疫于阻塞对。给定一个稳定匹配和一个半径,我们提出一个多项式时间算法来验证该匹配是否在指定半径内保持稳定。我们还给出了一个多项式时间算法来计算给定稳定匹配的最大鲁棒半径。此外,我们设计了一种随时搜索算法,利用认证的下界和上界来近似最鲁棒的稳定匹配,并通过可高效计算的界来刻画鲁棒性与成本之间的关系,这些界描述了鲁棒性与成本之间可实现的权衡。最后,我们证明,对于每个稳定匹配,保持其稳定性的显著性轮廓集是单纯形内低维多面体的乘积。这种几何结构精确刻画了每个鲁棒区域的多面体形状;其体积可以高效计算,随着维度增加可采用近似方法,从而将匹配市场中的鲁棒性分析与凸几何的经典工具联系起来。

英文摘要

In many matching markets--such as athlete recruitment or academic admissions--participants on one side are evaluated by attribute vectors known to the other side, which in turn applies individual \emph{salience vectors} to assign relative importance to these attributes. Since saliences are known to change in practice, a central question arises: how robust is a stable matching to such perturbations? We address several fundamental questions in this context. First, we formalize robustness as a radius within which a stable matching remains immune to blocking pairs under any admissible perturbation of salience vectors (which are assumed to be normalized). Given a stable matching and a radius, we present a polynomial-time algorithm to verify whether the matching is stable within the specified radius. We also give a polynomial-time algorithm for computing the maximum robustness radius of a given stable matching. Further, we design an anytime search algorithm that uses certified lower and upper bounds to approximate the most robust stable matching, and we characterize the robustness-cost relationship through efficiently computable bounds that delineate the achievable tradeoff between robustness and cost. Finally, we show that for each stable matching, the set of salience profiles that preserve its stability factors is a product of low-dimensional polytopes within the simplex. This geometric structure precisely characterizes the polyhedral shape of each robustness region; its volume can then be computed efficiently, with approximate methods available as the dimension grows, thereby linking robustness analysis in matching markets with classical tools from convex geometry.

2511.17625 2026-06-19 cs.MA cs.GT 版本更新

Iterative Negotiation and Oversight: A Case Study in Decentralized Air Traffic Management

迭代协商与监督:去中心化空中交通管理案例研究

Jaehan Im, John-Paul Clarke, Ufuk Topcu, David Fridovich-Keil

AI总结 提出一种受监管的去中心化协商框架,通过交易拍卖实现共识,并引入税收式监督机制引导系统效率和公平性,理论保证有限时间终止,案例验证了框架在去中心化空中交通管理中的有效性。

详情
AI中文摘要

在去中心化多智能体系统中,自利智能体通常具有冲突偏好,达成共识仍然具有挑战性。现有的协调方法使智能体无需中央协调员即可达成共识,但无法对系统级目标(如效率或公平性)提供正式保证。为解决这一局限,我们提出一个受监管的去中心化协商框架,该框架通过有限的监管监督增强去中心化协商机制。该框架基于交易拍卖达成共识,使具有冲突偏好的自利智能体能够通过资产交易进行协商,同时避免直接披露私有资产估值。我们引入一种监督机制,实施类似税收的干预,引导去中心化协商走向系统高效和公平的结果,同时调节框架的收敛速度。我们建立了有限时间终止的理论保证,并推导出系统效率和收敛速度与监管干预水平相关的界限。基于美国空中交通管理中的协作航迹选项计划(一个改道倡议)的案例研究表明,该框架能够可靠地在自利空域扇区管理者之间达成共识,并揭示了监管干预水平如何调节系统效率与收敛速度之间的关系。综合理论和实验结果表明,所提出的框架提供了一种受监管的去中心化协调机制,在维护非合作最终选择的同时保障系统级目标。

英文摘要

Achieving consensus among self-interested agents remains challenging in decentralized multi-agent systems, where agents often have conflicting preferences. Existing coordination methods enable agents to reach consensus without a centralized coordinator, but do not provide formal guarantees on system-level objectives such as efficiency or fairness. To address this limitation, we propose a regulated decentralized negotiation framework that augments a decentralized negotiation mechanism with limited regulatory oversight. The framework builds upon the trading auction for consensus, enabling self-interested agents with conflicting preferences to negotiate through asset trading while avoiding direct disclosure of private asset valuations. We introduce an oversight mechanism, which implements a taxation-like intervention that guides decentralized negotiation toward system-efficient and equitable outcomes while also regulating how fast the framework converges. We establish theoretical guarantees of finite-time termination and derive bounds linking system efficiency and convergence rate to the level of regulatory intervention. A case study based on the collaborative trajectory options program, a rerouting initiative in U.S. air traffic management, demonstrates that the framework can reliably achieve consensus among self-interested airspace sector managers, and reveals how the level of regulatory intervention regulates the relationship between system efficiency and convergence speed. Taken together, the theoretical and experimental results indicate that the proposed framework provides a mechanism for regulated decentralized coordination that preserves noncooperative final selection while safeguarding system-level objectives.

2507.19712 2026-06-19 cs.DC cs.AI cs.GT cs.LG cs.NI 版本更新

Oranits: Mission Assignment and Task Offloading in Open RAN-based ITS using Metaheuristic and Deep Reinforcement Learning

Oranits: 基于Open RAN的智能交通系统中的任务分配与卸载——元启发式与深度强化学习方法

Ngoc Hung Nguyen, Nguyen Van Thieu, Quang-Trung Luu, Anh Tuan Nguyen, Senura Wanasekara, Nguyen Cong Luong, Fatemeh Kavehmadavani, Van-Dinh Nguyen

发表机构 * Department of Smart City, Hanyang University(翰阳大学智能城市系)

AI总结 提出Oranits系统模型,通过元启发式算法CGG-ARO和深度强化学习框架MA-DDQN优化车辆协作中的任务依赖与卸载成本,分别提升任务完成率7.7%和12.5%。

Comments 16 pages, 13 figures

Journal ref IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2026

详情
AI中文摘要

本文研究了基于开放无线接入网(Open RAN)的智能交通系统(ITS)中的任务分配与卸载问题,其中自动驾驶车辆利用移动边缘计算进行高效处理。现有研究常忽视任务之间的复杂依赖关系以及将任务卸载到边缘服务器的成本,导致决策次优。为弥补这一不足,我们引入了Oranits,一种新颖的系统模型,明确考虑了任务依赖性和卸载成本,同时通过车辆协作优化性能。为此,我们提出了一种双重优化方法。首先,我们开发了一种基于元启发式的进化计算算法,即混沌高斯全局ARO(CGG-ARO),作为单时隙优化的基线。其次,我们设计了一种增强的基于奖励的深度强化学习(DRL)框架,称为多智能体双深度Q网络(MA-DDQN),该框架集成了多智能体协调和多动作选择机制,显著减少了任务分配时间并提高了对基线方法的适应性。大量仿真表明,CGG-ARO将完成任务数量和总体收益分别提高了约7.1%和7.7%。同时,MA-DDQN在任务完成率和总体收益方面分别实现了11.0%和12.5%的更大提升。这些结果凸显了Oranits在动态ITS环境中实现更快、更自适应、更高效任务处理的有效性。

英文摘要

In this paper, we explore mission assignment and task offloading in an Open Radio Access Network (Open RAN)-based intelligent transportation system (ITS), where autonomous vehicles leverage mobile edge computing for efficient processing. Existing studies often overlook the intricate interdependencies between missions and the costs associated with offloading tasks to edge servers, leading to suboptimal decision-making. To bridge this gap, we introduce Oranits, a novel system model that explicitly accounts for mission dependencies and offloading costs while optimizing performance through vehicle cooperation. To achieve this, we propose a twofold optimization approach. First, we develop a metaheuristic-based evolutionary computing algorithm, namely the Chaotic Gaussian-based Global ARO (CGG-ARO), serving as a baseline for one-slot optimization. Second, we design an enhanced reward-based deep reinforcement learning (DRL) framework, referred to as the Multi-agent Double Deep Q-Network (MA-DDQN), that integrates both multi-agent coordination and multi-action selection mechanisms, significantly reducing mission assignment time and improving adaptability over baseline methods. Extensive simulations reveal that CGG-ARO improves the number of completed missions and overall benefit by approximately 7.1% and 7.7%, respectively. Meanwhile, MA-DDQN achieves even greater improvements of 11.0% in terms of mission completions and 12.5% in terms of the overall benefit. These results highlight the effectiveness of Oranits in enabling faster, more adaptive, and more efficient task processing in dynamic ITS environments.