arXivDaily arXiv每日学术速递 周一至周五更新
重置
cs.OS操作系统3
2606.11886 2026-06-11 cs.SD cs.OS 新提交

Real-Time Language Model Jamming: A Case Study for Live Music Accompaniment Generation

实时语言模型阻塞:现场音乐伴奏生成的案例研究

Bowen Zheng, Andrew H. Yang, Jiaqi Ruan, Jia He, Xinyue Li, Yuan-Hsin Chen, Ziyu Wang, Xiaosong Ma

发表机构 * MBZUAI(穆罕默德·本·扎耶德人工智能大学)

AI总结 提出StreamMUSE系统,在客户端-服务器架构中实现帧同步流式推理,通过现场音乐伴奏任务验证了不同延迟环境下实时同步的有效性。

详情
Comments
Accepted to RTAS 2026. 14 pages, 5 figures, 3 tables
AI中文摘要

语言模型(LMs)已成为现代生成建模中最突出的范式之一。虽然提高速度是实时部署的主要焦点,但仅靠速度是不够的。许多实际应用,如同步翻译和语音合成,还需要生成内容与外部信号在生成内容和时序上精确对齐。我们将此问题称为\textit{帧同步流式推理}。为了解决这个问题,我们提出了StreamMUSE,一个在客户端-服务器架构中响应外部信号流执行LM生成的推理系统。客户端基于最新输入持续发送高频推理请求,并接收与外部时钟同步的输出,而服务器执行模型推理。我们通过现场音乐伴奏任务演示了该框架,展示了在不同往返延迟的部署环境中如何实现实时同步。我们进一步建模了系统超参数与往返延迟之间的关系,并评估了不同环境如何影响实现实时性能的最佳配置。实验结果表明,系统实时性能与音乐质量之间存在一致对应关系,证明了所提出框架的有效性。该项目是开源的。相关代码和最新更新可在此https URL获取。

英文摘要

Language models (LMs) have become one of the most prominent paradigms in modern generative modeling. While making them faster has been the main focus of real-time deployment, speed alone is not enough. Many real-world applications, such as synchronized translation and voice synthesis, also require precise alignment between generation and external signals, both in terms of generation content and timing. We refer to this problem as \textit{frame-synchronous streaming inference}. To address it, we present StreamMUSE, an inference system that performs LM generation in response to an external signal stream within a client-server architecture. The client continuously sends high-frequency inference requests based on the most recent inputs and receives outputs synchronized to the external clock, while the server executes model inference. We demonstrate the framework through a live music accompaniment task, showing how real-time synchronization can be achieved across different deployment environments with varying round-trip latencies. We further model the relationship between system hyperparameters and round-trip latency, and evaluate how different environments affect optimal configurations to achieve real-time performance. Experimental results show a consistent correspondence between system real-time performance and music quality, demonstrating the effectiveness of the proposed framework. The project is open source. Relevant code and the latest updates are available at this https URL.

2606.00455 2026-06-11 cs.CR cs.OS 版本更新

Beyond Edge Coverage: Per-Task Data-Flow Extraction at Kernel Function Boundaries via LLVM

超越边覆盖:通过LLVM在核函数边界进行每任务数据流提取

Yunseong Kim

AI总结 提出BOUNDARY FLOW框架,基于LLVM扩展KCOV,在函数边界提取参数和返回值数据流,为内核模糊测试提供状态感知反馈,并支持Rust内核模块,开销低于3%。

详情
Comments
Source code and Linux kernel RFC patches for the Boundary Context Extraction framework are publicly available
AI中文摘要

基于覆盖引导的内核模糊测试工具(如syzkaller)依赖边覆盖(trace-pc)作为唯一的反馈信号。这种上下文无关的方法无法区分仅在参数值上有所不同的执行路径。例如,两次调用copy_from_user(),即使参数size不同,也会命中相同的基本块,但安全含义却大相径庭。我们提出了BOUNDARY FLOW,一个基于LLVM的插桩框架,它扩展了Linux KCOV,实现了函数参数和返回值的数据流提取。一个编译器pass(-fsanitize-coverage=dataflow-args, dataflow-ret)在函数入口处发出轻量级回调,捕获结构化元组<PC, arg_idx, arg_size, ptr, offsets[]>,在返回时捕获<PC, ret_size, ptr, offsets[]>。复合类型通过DWARF DICompositeType元数据自动分解,无需源代码注释。一个独立的内核设备(/sys/kernel/debug/kcov_dataflow)提供无锁的每任务环形缓冲区,不会干扰现有的KCOV或syzkaller基础设施。我们展示了双重用途:模糊测试工具获得状态感知反馈,用于指导进入值依赖状态转换的变异;安全分析师获得确定性的参数记录,用于根本原因分析,而无需printk或kprobe开销。一个编译后流水线(rustc, opt, llc)使得Rust内核模块插桩成为可能,无需修改rustc,这是在drgn/vmcore因-O2 DWARF省略而失败的情况下捕获Rust函数参数的唯一运行时方法。在五类漏洞(越界、释放后使用、双重释放、10层深度链传播、Rust FFI、Rust for Linux模块)上评估,插桩路径的开销低于3%。

英文摘要

Coverage-guided kernel fuzzers such as syzkaller rely on edge coverage (trace-pc) as their sole feedback signal. This context-blind approach cannot distinguish execution paths that differ only in argument values -- for example, two invocations of copy_from_user() with different size parameters hit identical basic blocks yet have vastly different security implications. I present TOOLNAME, an LLVM-based instrumentation framework that extends Linux KCOV with data-flow extraction of function arguments and return values. A compiler pass emits lightweight callbacks capturing structured tuples of program counter, argument metadata, and field values at function entry and return. Composite types are automatically decomposed via DWARF DICompositeType metadata with zero source annotation. A lock-free per-task ring buffer delivers records to user space with no interference to existing KCOV or syzkaller infrastructure. I demonstrate dual utility: (1) fuzzers gain state-aware feedback for mutation guidance into value-dependent state transitions, and (2) security analysts obtain deterministic argument records for root-cause analysis without printk or kprobe overhead. Two Rust instrumentation paths are provided: a post-compilation pipeline requiring no rustc modification, and native instrumentation via rustc built against the custom LLVM -- both the only runtime methods for capturing Rust function arguments given that drgn/vmcore fails under -O2 DWARF elision.

2603.09738 2026-06-11 cs.OS cs.DC 版本更新

Ensuring Data Freshness in Multi-Rate Task Chains Scheduling

确保多速率任务链调度中的数据新鲜度

José Luis Conradi Hoffmann, Antônio Augusto Fröhlich

AI总结 针对安全关键系统中数据新鲜度与确定性执行之间的权衡,提出基于任务的数据新鲜度约束调度框架,通过分解数据依赖图并设计偏移搜索算法同步多速率任务链,在不引入LET额外延迟的前提下保证端到端数据新鲜度。

详情
AI中文摘要

在安全关键自主系统中,数据新鲜度是一个基本的设计挑战。虽然逻辑执行时间(LET)范式保证了组合确定性,但通常以注入延迟为代价,可能降低高频控制回路上的数据年龄。此外,异构、多速率的任务依赖通常通过过采样来低效地保证。本文提出了一种扩展了数据新鲜度约束的基于任务的调度框架。与传统模型不同,调度决策由数据的生命周期驱动。我们引入了一种形式化方法,通过从执行器向后追踪最严格的数据新鲜度约束,将数据依赖图分解为主路径。基于这种分解,我们提出了一种偏移搜索算法,用于同步多速率、多依赖的任务链。该方法在不引入LET缓冲的人工延迟的情况下强制实现端到端数据新鲜度,这是数据新鲜度与执行确定性之间的权衡。我们正式证明,这种基于偏移的对齐在保证数据新鲜度的同时,保留了全局EDF的100%可调度性能力。

英文摘要

In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at the cost of injected latency, possibly degrading the age of data on high-frequency control loops. Furthermore, heterogeneous, multi-rate, task dependencies is typically guaranteed inefficiently through oversampling. This paper proposes a Task-based scheduling framework extended with data freshness constraints. Unlike traditional models, scheduling decisions are driven by the lifespan of data. We introduce a formal methodology to decompose Data Dependency Graphs into dominant paths by tracing the strictest data freshness constraints backward from the actuators. Based on this decomposition, we propose an offset search algorithm that synchronizes multi-rate, multi-dependencies, task chains. This approach enforces end-to-end data freshness without the artificial latency of LET buffering, a trade-off between data freshness and execution determinism. We formally prove that this offset-based alignment preserves the 100% schedulability capacity of Global EDF while addressing data freshness guarantees.