arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.15186 2026-04-17 cs.DC cs.AI

Scepsy: Serving Agentic Workflows Using Aggregate LLM Pipelines

Marcel Wagenländer, Otto White, Britannio Jarrett, Pedro Silvestre, Yanda Tao, Guo Li, Huanzhou Zhu, Llúis Vilanova, Peter Pietzuch

详情

英文摘要

Agentic workflows carry out complex tasks by orchestrating multiple large language models (LLMs) and tools. Serving such workflows at a target throughput with low latency is challenging because they can be defined using arbitrary agentic frameworks and exhibit unpredictable execution times: execution may branch, fan-out, or recur in data-dependent ways. Since LLMs in workflows often outnumber available GPUs, their execution also leads to GPU oversubscription. We describe Scepsy, a new agentic serving system that efficiently schedules arbitrary multi-LLM agentic workflows onto a GPU cluster. Scepsy exploits the insight that, while agentic workflows have unpredictable end-to-end latencies, the shares of each LLM's total execution times are comparatively stable across executions. Scepsy decides on GPU allocations based on these aggregate shares: first, it profiles the LLMs under different parallelism degrees. It then uses these statistics to construct an Aggregate LLM Pipeline, which is a lightweight latency/throughput predictor for allocations. To find a GPU allocation that minimizes latency while achieving a target throughput, Scepsy uses the Aggregate LLM Pipeline to explore a search space over fractional GPU shares, tensor parallelism degrees, and replica counts. It uses a hierarchical heuristic to place the best allocation onto the GPU cluster, minimizing fragmentation, while respecting network topology constraints. Our evaluation on realistic agentic workflows shows that Scepsy achieves up to 2.4x higher throughput and 27x lower latency compared to systems that optimize LLMs independently or rely on user-specified allocations.

URL PDF HTML ☆

赞 0 踩 0

2604.15143 2026-04-17 cs.NE cs.AI cs.LG

Structure as Computation: Developmental Generation of Minimal Neural Circuits

Duan Zhou

2604.15114 2026-04-17 stat.ML cs.AI cs.LG

Amortized Optimal Transport from Sliced Potentials

Minh-Phuc Truong, Khai Nguyen

Comments 26 pages, 11 figures, 10 tables

2604.15107 2026-04-17 stat.ML cs.LG

MinShap: A Modified Shapley Value Approach for Feature Selection

Chenghui Zheng, Garvesh Raskutti

2604.15101 2026-04-17 cs.IR cs.LG

Metric-agnostic Learning-to-Rank via Boosting and Rank Approximation

Camilo Gomez, Pengyang Wang, Yanjie Fu

Comments Published in IEEE ICDM 2023. 6 pages

2604.15086 2026-04-17 cs.MM cs.CV cs.SD

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

Jianxuan Yang, Xinyue Guo, Zhi Cheng, Kai Wang, Lipan Zhang, Jinjie Hu, Qiang Ji, Yihua Cao, Yihao Meng, Zhaoyue Cui, Mengmei Liu, Meng Meng, Jian Luan

2604.15082 2026-04-17 cs.AR cs.AI

Autonomous Evolution of EDA Tools: Multi-Agent Self-Evolved ABC

Cunxi Yu, Haoxing Ren

Comments 7 pages; To appear at DAC 2026

2604.15075 2026-04-17 cs.SE cs.LG

Atropos: Improving Cost-Benefit Trade-off of LLM-based Agents under Self-Consistency with Early Termination and Model Hotswap

Naryeong Kim, Shin Yoo

Comments Will appear at ISSTA 2026

2604.15055 2026-04-17 eess.SP cs.SD

Enhancing time-frequency resolution with optimal transport and barycentric fusion of multiple spectrogram

David Valdivia, Elsa Cazelles, Cédric Févotte

Comments main text: 13 pages, 8 figures. supplementary material: 3 pages, 3 figures

2604.15044 2026-04-17 cs.HC cs.AI

CoGrid & the Multi-User Gymnasium: A Framework for Multi-Agent Experimentation

Chase McDonald, Cleotilde Gonzalez

Comments 36 pages, 11 figures

2604.15022 2026-04-17 cs.CR cs.AI cs.CL cs.LG

Route to Rome Attack: Directing LLM Routers to Expensive Models via Adversarial Suffix Optimization

Haochun Tang, Yuliang Yan, Jiahua Lu, Huaxiao Liu, Enyan Dai

2604.14984 2026-04-17 cs.HC cs.AI

Agentic Explainability at Scale: Between Corporate Fears and XAI Needs

Yomna Elsayed, Cecily Jones

Comments Presented at Human-centered Explainable AI Workshop (HCXAI) @ CHI 2026, Barcelona, Spain, 2026

2604.14973 2026-04-17 cs.CR cs.CV

Robustness of Vision Foundation Models to Common Perturbations

Hongbin Liu, Zhengyuan Jiang, Cheng Hong, Neil Zhenqiang Gong

Comments Accepted by CVPR 2026 Workshop

2604.14957 2026-04-17 cs.NI cs.CR cs.LG

MLDAS: Machine Learning Dynamic Algorithm Selection for Software-Defined Networking Security

Pablo Benlloch, Oscar Romero, Antonio Leon, Jaime Lloret

Comments 22 pages, 15 figures, 12 tables

2604.14931 2026-04-17 quant-ph cs.LG

Learning to Concatenate Quantum Codes

Nico Meyer, Christopher Mutschler, Dominik Seuß, Andreas Maier, Daniel D. Scherer

Comments 7 pages, 5 figures, 1 table

2604.13466 2026-04-17 cs.HC cs.AI cs.CL cs.LG

Functional Emotions or Situational Contexts? A Discriminating Test from the Mythos Preview System Card

Hiranya V. Peiris

Comments 7 pages. v2: supplementary analysis added, references updated

2604.10681 2026-04-17 cs.CR cs.AI

Critical-CoT: A Robust Defense Framework against Reasoning-Level Backdoor Attacks in Large Language Models

Vu Tuan Truong, Long Bao Le

2604.10427 2026-04-17 cs.CR cs.AI cs.LG cs.SY eess.SY math.OC

A Queueing-Theoretic Framework for Dynamic Attack Surfaces: Data-Integrated Risk Analysis and Adaptive Defense

Jihyeon Yun, Abdullah Yasin Etcibasi, Ming Shi, C. Emre Koksal

详情

英文摘要

We develop a queueing-theoretic framework to model the temporal evolution of cyber-attack surfaces, where the number of active vulnerabilities is represented as the backlog of a queue. Vulnerabilities arrive as they are discovered or created, and leave the system when they are patched or successfully exploited. Building on this model, we study how automation affects attack and defense dynamics by introducing an AI amplification factor that scales arrival, exploit, and patching rates. Our analysis shows that even symmetric automation can increase the rate of successful exploits. We validate the model using vulnerability data collected from an open source software supply chain and show that it closely matches real-world attack surface dynamics. Empirical results reveal heavy-tailed patching times, which we prove induce long-range dependence in vulnerability backlog and help explain persistent cyber risk. Utilizing our queueing abstraction for the attack surface, we develop a systematic approach for cyber risk mitigation. We formulate the dynamic defense problem as a constrained Markov decision process with resource-budget and switching-cost constraints, and develop a reinforcement learning (RL) algorithm that achieves provably near-optimal regret. Numerical experiments validate the approach and demonstrate that our adaptive RL-based defense policies significantly reduce successful exploits and mitigate heavy-tail queue events. Using trace-driven experiments on the ARVO dataset, we show that the proposed RL-based defense policy reduces the average number of active vulnerabilities in a software supply chain by over 90% compared to existing defense practices, without increasing the overall maintenance budget. Our results allow defenders to quantify cumulative exposure risk under long-range dependent attack dynamics and to design adaptive defense strategies with provable efficiency.

URL PDF HTML ☆

赞 0 踩 0

2603.26723 2026-04-17 cond-mat.soft cs.LG

Interpretable liquid crystal phase classification via two-by-two ordinal patterns

Leonardo G. J. M. Voltarelli, Natalia Osiecka-Drewniak, Marcin Piwowarczyk, Ewa Juszynska-Galazka, Rafael S. Zola, Matjaz Perc, Haroldo V. Ribeiro

Comments 16 two-column pages, 8 figures, supplementary information; accepted for publication in Physical Review E

2603.24448 2026-04-17 cs.HC cs.AI

Integrating Causal Machine Learning into Clinical Decision Support Systems: Insights from Literature and Practice

Domenique Zipperling, Lukas Schmidt, Benedikt Hahn, Niklas Kühl, Steven Kimbrough

Comments Accepted at the Thirty-Fourth European Conference on Information Systems (ECIS 2026), Milan, Italy

2603.06431 2026-04-17 math.NA cs.LG cs.NA stat.ML

Certified and accurate computation of function space norms of deep neural networks

Johannes Gründler, Moritz Maibaum, Philipp Petersen

2601.10120 2026-04-17 cs.MA cs.AI cs.CL

TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems

Rui Sun, Jie Ding, Chenghua Gong, Tianjun Gu, Yihang Jiang, Juyuan Zhang, Liming Pan, Linyuan Lü

Comments ACL Findings Camera Ready

2601.07449 2026-04-17 cs.IR cs.AI

RLPO: Residual Listwise Preference Optimization for Long-Context Review Ranking

Hao Jiang, Zhi Yang, Annan Wang, Yichi Zhang, Weisi Lin

2511.01838 2026-04-17 cs.IT cs.AI cs.NE math.IT

Efficient Vector Symbolic Architectures from Histogram Recovery

Zirui Deng, Netanel Raviv

Comments To appear at ISIT 2026

2510.14509 2026-04-17 cs.SE cs.AI cs.CL

E2Edev: Benchmarking Large Language Models in End-to-End Software Development Task

Jingyao Liu, Chen Huang, Zhizhao Guan, Wenqiang Lei, Yang Deng

Comments Accepted to ACL 2026 main

2508.19588 2026-04-17 cs.CY cs.AI

Hallucinating with AI: AI Psychosis as Distributed Delusions

Lucy Osler

2505.02979 2026-04-17 physics.ao-ph cs.LG

Parameter estimation for land-surface models using Neural Physics

Ruiyue Huang, Claire E. Heaney, Maarten van Reeuwijk

Comments 18 pages, 5 figures, 3 tables

2410.16593 2026-04-17 eess.SP cs.AI cs.LG

Sampling Transferable Graph Neural Networks with Limited Graph Information

Haoyu Wang, Renyuan Ma, Gonzalo Mateos, Luana Ruiz

Comments Submitted to IEEE TSP

2311.11841 2026-04-17 math.OC cs.LG

High Probability Guarantees for Random Reshuffling

Hengxu Yu, Xiao Li

Comments In this new version, we have removed the saddle-point avoidance part and improved the stopping criterion part by using a horizon-free step size rule

2311.01956 2026-04-17 cs.CR cs.AI

Towards Adaptive, Learning-Based Security in Decentralized Applications

Stefan Kambiz Behfar, Jon Crowcroft