arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1926
专题追踪
2605.22391 2026-05-22 cs.AI cs.CL cs.CY

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

Epicure:探索食品成分嵌入的涌现几何

Jakub Radzikowski, Josef Chen

AI总结 本文提出Epicure,一种基于三兄弟skip-gram模型重新训练的食品成分嵌入方法,通过多语言食谱语料库构建了包含1790个标准成分的嵌入模型,并通过三种不同的随机游走方案生成了不同侧重的模型。

详情
AI中文摘要

我们提出了Epicure,一种由三个兄弟skip-gram成分嵌入模型组成的家族,这些模型是从多语言食谱语料库中从头开始重新训练的。我们汇总了来自11个来源的414万条食谱,涵盖七种语言:英语、中文、俄语、越南语、西班牙语、土耳其语、印度尼西亚语、德语和印度英语,并通过一个增强语言模型的流程将原始成分字符串标准化为1790个标准条目。一个包含203,508条边的成分-成分NPMI图和一个包含80,019条边的带类型FlavorDB成分-化合物图,以及2,247个带类型化合物节点跨越15个类别,为三种共享架构和超参数但仅在随机游走方案上不同的Metapath2Vec变体提供了基础:Cooc仅在共现图上行走,Chem仅在带类型化合物元路径上行走,Core则通过注入的成分-成分行走进行混合,在可控混合下,将每个模型置于化学与食谱上下文的谱线上不同的位置。

英文摘要

We present Epicure, a family of three sibling skip-gram ingredient embeddings retrained from scratch on a multilingual recipe corpus. We aggregate 4.14M recipes from 11 sources spanning seven languages, English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English, and normalise the raw ingredient strings to 1,790 canonical entries via an LLM-augmented pipeline. A 203,508-edge ingredient-ingredient NPMI graph and an 80,019-edge typed FlavorDB ingredient-compound graph, 2,247 typed compound nodes across 15 categories, seed three Metapath2Vec variants that share architecture and hyperparameters and differ only in the random-walk schema: Cooc walks the co-occurrence graph only, Chem walks the typed compound metapaths only, and Core blends both via injected ingredient-ingredient walks at controlled mixing, placing each model at a distinct point on the chemistry-vs-recipe-context spectrum.

2605.22372 2026-05-22 cs.LG

ASAP: Attention Sink Anchored Pruning

ASAP: 以注意力汇点为中心的剪枝

Jaehyuk Lee, Hanyoung Kim, Yanggee Kim, Donghun Lee

AI总结 本文提出ASAP方法,通过将注意力汇点作为特征,利用懒惰随机游走建模视觉Transformer的信息流,实现单次剪枝过程中的token分区和背景冗余压缩,从而在保持或超越基线精度的同时,提升吞吐量达48%。

详情
AI中文摘要

视觉Transformer(ViTs)在高分辨率下由于自注意力的二次复杂度面临严重的计算瓶颈。现有token减少方法依赖局部指标-如单层注意力分数-这些指标本质上容易受到注意力汇点现象的影响,即无信息token paradoxically被保留下来。我们提出ASAP(Attention Sink Anchored Pruning),一种无需训练的框架,将此汇点作为特征。通过将ViT信息流建模为懒惰随机游走,ASAP将汇点识别为概率质量的主要累积器。通过计算累积转移矩阵中到汇点的扩散距离,ASAP利用径向扩散聚类对token进行分区,并通过转移权重池化压缩背景冗余。在图像、视频和视觉-语言任务中的广泛实验表明,ASAP在保持或超越基线精度的同时,加速吞吐量高达48%。

英文摘要

Vision Transformers (ViTs) face severe computational bottlenecks due to the quadratic complexity of self-attention at high resolutions. Existing token reduction methods rely on local metrics - such as single-layer attention scores - that are inherently vulnerable to the attention sink phenomenon, where uninformative tokens are paradoxically preserved over salient foreground objects. We propose ASAP (Attention Sink Anchored Pruning), a training-free framework that recasts this sink as a feature. Modeling ViT information flow as a Lazy Random Walk, ASAP identifies the sink as a dominant accumulator of probability mass. By computing the diffusion distance to the sink within the cumulative transition matrix, ASAP partitions tokens via Radial Diffusion Clustering and compresses background redundancy through Transition Weight Pooling in a single shot. Extensive experiments across image, video, and vision-language tasks demonstrate ASAP outperforms state-of-the-art methods, accelerating throughput by up to 48% while maintaining - or even exceeding - baseline accuracy.

2605.22290 2026-05-22 cs.CV

Detection of Virus and Small Cell Patches in Foci Images Using Switchable Convolution and Feature Pyramid Networks

利用可切换卷积和特征金字塔网络在焦点图像中检测病毒和小细胞斑块

Amrita Singh, Snehasis Mukherjee

AI总结 本文提出了一种改进的YOLOv2检测器,结合特征金字塔网络和可切换空洞卷积机制,以提高在生物医学焦点图像中检测病毒斑块和小细胞斑块的性能,实验结果显示在不同IoU阈值下的mAP值显著提升。

详情
AI中文摘要

准确检测和计数焦点形成单位(FFU)图像中的病毒斑块对于量化病毒感染和分析细胞结构至关重要。这项任务具有挑战性,因为生物医学目标在大小、密度、对比度和形状上往往差异显著。本文提出了一种增强的YOLOv2检测器,集成了特征金字塔网络(FPN)以提高多尺度特征表示。我们还引入了可切换空洞卷积机制,以适应密集显微图像中细粒度目标的接收域。所提出的方法在生物医学焦点图像数据集上进行评估,用于病毒斑块和小细胞斑块的检测。对于小细胞斑块检测,模型在25%的交并比(IoU)阈值下达到40.5%的平均精度均值(mAP)。对于FFU病毒斑块检测,模型达到68%的mAP。这些结果表明,结合FPN特征融合与可切换卷积能够提高YOLOv2在专门生物医学目标检测任务中的适用性。

英文摘要

Accurate detection and counting of virus patches in focus-forming unit (FFU) images, also known as foci images, are important for quantifying viral infection and analyzing cellular structures. This task is challenging because biomedical targets often vary substantially in size, density, contrast, and shape. In this paper, we propose an enhanced YOLOv2-based detector that integrates a Feature Pyramid Network (FPN) to improve multi-scale feature representation. We also incorporate a switchable atrous convolution mechanism to adapt the receptive field for fine-grained targets in dense microscopy images. The proposed method is evaluated on biomedical foci image datasets for virus patch and small cell patch detection. For small cell patch detection, the model achieves a mean average precision (mAP) of 40.5% at a 25% Intersection over Union (IoU) threshold. For FFU virus patch detection, the model achieves an mAP of 68%. These results indicate that combining FPN-based feature fusion with switchable convolution improves the suitability of YOLOv2 for specialized biomedical object detection tasks

2605.22273 2026-05-22 cs.CV

Exposing Vulnerabilities in Visible-Infrared VLMs: A Unified Geometric Adversarial Framework with Cross-Task Transferability

揭示可见-红外VLMs中的漏洞:一种具有跨任务迁移性的统一几何对抗框架

Xiang Chen, Yuxian Dong, Chao Li, Chengyin Hu, Jiaju Han, Fengyu Zhang, Yiwei Wei, Jiahuan Long, Jiujiang Guo

AI总结 本文针对可见-红外视觉语言模型在多模态任务中的对抗鲁棒性不足问题,提出了一种基于分形几何的对抗框架CFGPatch,通过引入曲边分形元素和Fraser螺旋渲染机制,有效攻击VLMs并展示出跨任务迁移能力。

详情
AI中文摘要

视觉语言模型(VLMs)在多样化的多模态任务中实现了强大的性能,但其在可见-红外(VIS-IR)场景中的对抗鲁棒性仍处于探索阶段。为了解决这种跨模态威胁设置,我们提出了CFGPatch,一种基于三角分形几何的曲边分形对抗补丁框架,用于攻击VIS-IR VLMs。CFGPatch基于三角分形几何,用贝塞尔曲线元素替代刚性的直边元素,在保持多尺度分形自相似性的同时引入更平滑的轮廓、更丰富的方向变化和更灵活的形状变形。此外,我们设计了模态特定的Fraser螺旋渲染机制,以在可见和红外图像中注入细粒度纹理扭曲和误导性感知线索。通过将全局曲边分形几何与局部螺旋基外观干扰相结合,CFGPatch破坏了形状感知和纹理解释。我们进一步采用期望超越变换(EOT)以提高对常见图像级变换的鲁棒性。大量实验表明,CFGPatch能够有效欺骗VIS-IR VLMs,并在攻击效果和鲁棒性上均优于标准补丁基线。此外,针对零样本分类优化的对抗样本在图像描述和视觉问答任务中表现出良好的迁移能力,展示了在下游任务中的强大跨任务迁移性和泛化能力。

英文摘要

Vision-language models (VLMs) have achieved strong performance across diverse multimodal tasks, but their adversarial robustness in visible-infrared (VIS-IR) scenarios remains underexplored. This gap is critical because VIS-IR sensing is widely used in real-world perception systems to support reliable understanding under challenging imaging conditions. To address this cross-modal threat setting, we propose CFGPatch, a curved-edge fractal geometric adversarial patch framework for attacking VIS-IR VLMs. CFGPatch builds on triangular fractal geometry and replaces rigid straight-edged primitives with Bezier-curved elements, preserving multi-scale fractal self-similarity while introducing smoother contours, richer directional variation, and more flexible shape deformation. In addition, we design a modality-specific Fraser-spiral rendering mechanism to inject fine-grained texture distortions and misleading perceptual cues into visible and infrared images. By coupling global curved-fractal geometry with local spiral-based appearance interference, CFGPatch disrupts both shape perception and texture interpretation. We further adopt expectation over transformation (EOT) to improve robustness against common image-level transformations. Extensive experiments show that CFGPatch effectively fools VIS-IR VLMs and consistently outperforms standard patch baselines in attack effectiveness and robustness. Moreover, adversarial samples optimized for zero-shot classification transfer well to image captioning and visual question answering, demonstrating strong cross-task transferability and generalizability across downstream tasks.

2605.22235 2026-05-22 cs.LG math.DS

Holomorphic Neural ODEs with Kolmogorov-Arnold Networks for Interpretable Discovery of Complex Dynamics

具有Kolmogorov-Arnold网络的全纯神经ODEs用于复杂动力学的可解释发现

Bhaskar Ranjan Karn, Dinesh Kumar

AI总结 本文提出了一种基于Kolmogorov-Arnold网络的全纯神经ODE框架,用于在复杂动力学系统中发现可解释的 governing equations,通过可微的正则化保持全纯结构,并在多个复杂动力学系统上验证了其有效性。

Comments 16 pages. Comments are welcome

详情
AI中文摘要

由全纯映射(如z² + c)支配的复杂动力系统表现出具有极端初始条件敏感性的分形边界。从数据准确建模这些结构需要尊重底层复解析几何的方法,但神经普通微分方程(Neural ODEs)中的多层感知机(MLP)缺乏复解析先验,违反柯西-黎曼条件,并作为不透明的近似器无法提供 governing equations。我们引入了全纯KAN-ODE框架,用Kolmogorov-Arnold网络(KAN)取代MLP,其可学习的B样条激活函数位于网络边,并将柯西-黎曼方程作为可微正则化以保持全纯结构。我们在六个复杂动力系统家族上进行了评估,涵盖多项式和超越类。仅使用280个参数(比MLP基线少16倍),网络在所有六个系统上实现了速度场R² > 0.95,正确识别了所有六个 governing symbolic families 通过自动样条到公式拟合,并重建了Julia集分形边界,与98.0%一致。关键的是,模型在10%观测噪声下仅表现出4%的MSE退化,而MLP则退化了15.2倍,且在从二次到三次动力学的迁移学习中实现了90.4%的改进。虽然MLP在点重建误差上更低,因为其容量更大,但KAN唯一提供了可解释的符号方程,强制了全纯结构,并具有优越的噪声鲁棒性,这些能力在黑盒架构中完全缺失。这些结果确立了KANs作为MLP的参数高效、可解释的替代方案,用于具有全纯动力学的物理信息发现。

英文摘要

Complex dynamical systems governed by holomorphic maps such as $z^2 + c$ exhibit fractal boundaries with extreme sensitivity to initial conditions. Accurately modelling these structures from data requires methods that respect the underlying complex-analytic geometry, yet Multi-Layer Perceptrons (MLPs) within Neural Ordinary Differential Equations (Neural ODEs) lack complex-analytic priors, violate the Cauchy--Riemann conditions, and function as opaque approximators incapable of yielding governing equations. We introduce Holomorphic KAN-ODE, a framework that replaces the MLP with a Kolmogorov-Arnold Network (KAN) whose learnable B-spline activations reside on network edges, and incorporates Cauchy--Riemann equations as a differentiable regularization to preserve holomorphic structure. We evaluate on six families of complex dynamical systems spanning polynomial and transcendental classes. With only 280 parameters ($16\times$ fewer than the MLP baseline), the network achieves velocity-field $R^2 > 0.95$ on all six systems, correctly identifies all six governing symbolic families through automatic spline-to-formula fitting, and reconstructs Julia set fractal boundaries with up to 98.0\% agreement. Crucially, the model exhibits only 4\% MSE degradation under 10\% observation noise versus $15.2\times$ for MLPs, and achieves 90.4\% improvement in transfer learning from quadratic to cubic dynamics. While the MLP attains lower pointwise reconstruction error due to its larger capacity, the KAN uniquely provides interpretable symbolic equations, enforced holomorphic structure, and superior noise resilience, capabilities that are entirely absent in black-box architectures. These results establish KANs as a parameter-efficient, interpretable alternative to MLPs for physics-informed discovery of holomorphic dynamics.

2605.21273 2026-05-22 cs.CV

DriveMA: Rethinking Language Interfaces in Driving VLAs with One-Step Meta-Actions

DriveMA: 重新思考驾驶VLAs中的语言接口以单步元动作

Weicheng Zheng, Yixin Huang, Qiao Sun, Derun Li, Hang zhao

AI总结 本文提出DriveMA,通过单步元动作替代传统的自然语言推理,解决了驾驶VLAs中语言接口的三个瓶颈问题,实现了更高效的端到端规划。

Comments We withdraw this submission because the current version contains a mismatch between the paper title, conceptual framing, and the intended contribution of the work. To avoid potential misunderstanding by readers, the authors have decided to withdraw this version and substantially revise the title, organization, and presentation before any future submission

详情
AI中文摘要

驾驶视觉-语言-动作模型(Driving VLAs)通常将自然语言推理作为端到端规划的中间接口,但以推理为中心的接口面临三个实际瓶颈:获得高质量的推理注释困难,生成和理解长推理链对紧凑模型具有挑战性,且推理延迟显著增加。本文重新思考了驾驶VLAs中的语言接口设计,表明简洁的单步元动作是替代冗长推理的有效替代方案。元动作提供语义决策基础,同时保持低熵,并能自动从专家轨迹推导出来,从而实现可扩展的监督和可靠的轨迹条件化。基于此接口,我们提出了DriveMA,结合以动作为中心的监督训练和基于转弯级别的信用分配强化学习框架,共同优化元动作的正确性、轨迹质量和轨迹-元动作一致性。实验表明,DriveMA在Waymo端到端驾驶挑战中已使用2B模型达到新的状态,Rater Feedback Score(RFS)为8.060,其4B版本进一步将状态提升至8.079;DriveMA在NAVSIM上也取得了具有竞争力的性能。消融实验显示,单步元动作在表达性、可预测性和推理效率之间提供了更好的实际权衡,优于自然语言推理或更细粒度的动作序列。代码、数据和模型将被发布以促进未来研究。

英文摘要

Driving Vision-Language-Action Models (Driving VLAs) commonly introduce natural-language reasoning as an intermediate interface for end-to-end planning, but reasoning-centric interfaces face three practical bottlenecks: obtaining high-quality reasoning annotations is difficult, generating and understanding long reasoning chains is challenging for compact models, and inference latency is substantially increased. In this paper, we rethink the design of language interfaces in Driving VLAs and show that concise one-step meta-actions are a simple yet effective alternative to verbose reasoning. Meta-actions provide semantic decision grounding while remaining low-entropy, and being automatically derivable from expert trajectories, enabling scalable supervision and reliable trajectory conditioning. Building on this interface, we propose DriveMA, which combines action-centric supervised training with a turn-level credit-assignment reinforcement learning framework that jointly optimizes meta-action correctness, trajectory quality, and trajectory--meta-action consistency. Experiments show that DriveMA already achieves a new state of the art on the Waymo End-to-End Driving Challenge with a 2B model, reaching a Rater Feedback Score (RFS) of 8.060, while its 4B version further improves the state of the art to 8.079; DriveMA also obtains competitive performance on NAVSIM. Ablations demonstrate that one-step meta-actions offer a better practical trade-off between expressiveness, predictability, and inference efficiency than natural-language reasoning or finer-grained action sequences. Code, data, and models will be released to facilitate future research.

2605.14926 2026-05-22 cs.CV

SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation

SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation

Hanxu Zhang, Chen Jia, Hui Liu, Xu Cheng, Fan Shi, Shengyong Chen

AI总结 本文提出了一种高效的SCRWKV网络,通过新颖的结构场编码器和轻量级解码器,实现结构裂缝拓扑分割的高精度建模,其在多个复杂纹理和严重干扰的基准测试中表现出色,参数量仅为1.22M,达到了84.28%的F1分数和85.12%的mIoU。

Comments Accept by ICML2026

详情
AI中文摘要

实现跨多样场景的结构裂缝像素级准确分割仍然是一个严峻的挑战。现有方法在平衡裂缝拓扑建模与计算效率之间面临显著瓶颈,往往无法在高分割质量与低资源需求之间取得平衡。为了解决这些限制,我们提出了Ultra-Compact Structure-Calibrated Vision RWKV (SCRWKV),一种通过新颖的Structure-Field Encoder (SFE) backbone实现高精度建模的网络,同时保持线性复杂度。SFE集成了Adaptive Multi-scale Cascaded Modulator (AMCM)以增强纹理表示,并利用Structure-Calibrated Insight Unit (SCIU)作为其核心引擎。具体而言,SCIU采用Geometry-guided Bidirectional Structure Transformation (GBST)来捕捉拓扑相关性,并将Dynamic Self-Calibrating Decay (DSCD)整合到Dy-WKV中以抑制噪声传播。此外,我们引入了一种轻量级的Cross-Scale Harmonic Fusion (CSHF)解码器以实现精确的特征聚合。系统评估表明,在多个具有复杂纹理和严重干扰的基准测试中,仅拥有1.22M参数的SCRWKV显著优于现有最佳方法。在TUT数据集上,该模型达到了84.28%的F1分数和85.12%的mIoU,证明了其在高效现实部署中的鲁棒潜力。代码可在https://github.com/zhxhzy/SCRWKV上获取。

英文摘要

Achieving pixel-level accurate segmentation of structural cracks across diverse scenarios remains a formidable challenge. Existing methods face significant bottlenecks in balancing crack topology modeling with computational efficiency, often failing to reconcile high segmentation quality with low resource demands. To address these limitations, we propose the Ultra-Compact Structure-Calibrated Vision RWKV (SCRWKV), a network that achieves high-precision modeling via a novel Structure-Field Encoder (SFE) backbone while maintaining linear complexity. The SFE integrates the Adaptive Multi-scale Cascaded Modulator (AMCM) to enhance texture representation and utilizes the Structure-Calibrated Insight Unit (SCIU) as its core engine. Specifically, the SCIU employs the Geometry-guided Bidirectional Structure Transformation (GBST) to capture topological correlations and integrates the Dynamic Self-Calibrating Decay (DSCD) into Dy-WKV to suppress noise propagation. Furthermore, we introduce a lightweight Cross-Scale Harmonic Fusion (CSHF) decoder to achieve precise feature aggregation. Systematic evaluations on multiple benchmarks characterized by complex textures and severe interference demonstrate that SCRWKV, with only 1.22M parameters, significantly outperforms SOTA methods. Achieving an F1 score of 0.8428 and mIoU of 0.8512 on the TUT dataset, the model confirms its robust potential for efficient real-world deployment. The code is available at https://github.com/zhxhzy/SCRWKV.

2510.23090 2026-05-22 cs.CL

MAP4TS: A Multi-Aspect Prompting Framework for Time-Series Forecasting with Large Language Models

MAP4TS: 一个用于基于大语言模型的时间序列预测的多方面提示框架

Suchan Lee, Jihoon Choi, Sohyeon Lee, Minseok Song, Bong-Gyu Jang, Hwanjo Yu, Soyeon Caren Han

AI总结 本文提出MAP4TS框架,通过将经典时间序列分析融入提示设计,提升大语言模型在时间序列预测中的性能,实验表明其在多个数据集上均优于现有方法。

Comments There is a error in modeling. Thereafter, paper will be revised and re-uploaded

详情
AI中文摘要

最近的研究探讨了使用预训练的大语言模型(LLMs)进行时间序列预测,通过将数值输入对齐到LLM嵌入空间。然而,现有的多模态方法往往忽视了时间序列数据中独特的统计特性和时间依赖性。为弥合这一差距,我们提出了MAP4TS,一种新颖的多方面提示框架,该框架明确将经典时间序列分析纳入提示设计。我们的框架引入了四个专门的提示组件:一个全局领域提示传达数据集级别的上下文,一个局部领域提示编码近期趋势和系列特定行为,以及一对统计和时间提示,嵌入了从自相关(ACF)、偏自相关(PACF)和傅里叶分析中提取的手工洞察。多方面提示与原始时间序列嵌入结合,并通过跨模态对齐模块生成统一的表示,然后通过LLM处理并投影以进行最终预测。在八个多样化的数据集上进行的广泛实验表明,MAP4TS在多个数据集上均优于现有方法。我们的消融研究进一步揭示,提示意识设计显著提升了性能稳定性,并且当与结构化提示结合时,GPT-2模型在长期预测任务中优于较大的模型如LLaMA。

英文摘要

Recent advances have investigated the use of pretrained large language models (LLMs) for time-series forecasting by aligning numerical inputs with LLM embedding spaces. However, existing multimodal approaches often overlook the distinct statistical properties and temporal dependencies that are fundamental to time-series data. To bridge this gap, we propose MAP4TS, a novel Multi-Aspect Prompting Framework that explicitly incorporates classical time-series analysis into the prompt design. Our framework introduces four specialized prompt components: a Global Domain Prompt that conveys dataset-level context, a Local Domain Prompt that encodes recent trends and series-specific behaviors, and a pair of Statistical and Temporal Prompts that embed handcrafted insights derived from autocorrelation (ACF), partial autocorrelation (PACF), and Fourier analysis. Multi-Aspect Prompts are combined with raw time-series embeddings and passed through a cross-modality alignment module to produce unified representations, which are then processed by an LLM and projected for final forecasting. Extensive experiments across eight diverse datasets show that MAP4TS consistently outperforms state-of-the-art LLM-based methods. Our ablation studies further reveal that prompt-aware designs significantly enhance performance stability and that GPT-2 backbones, when paired with structured prompts, outperform larger models like LLaMA in long-term forecasting tasks.

2509.15151 2026-05-22 cs.SD cs.AI

Exploring How Audio Effects Alter Emotion with Foundation Models

探索音频效果如何通过基础模型改变情感

Stelios Katsis, Vassilis Lyberatos, Spyridon Kantarelis, Edmund Dervakos, Giorgos Stamou

AI总结 本文研究音频效果如何通过基础模型影响情感,探讨了基础模型在分析音频效果与情绪关系中的作用,揭示了声音设计技术对感知影响的模式。

Comments https://github.com/stelioskt/audioFX

详情
AI中文摘要

音频效果(如混响、失真、调制和动态范围处理)在音乐聆听过程中塑造情感反应中起着关键作用。尽管先前研究已探讨了低级音频特征与情感感知之间的联系,但音频效果对情绪的系统性影响仍被忽视。本文研究如何利用基础模型——大规模预训练于多模态数据的神经架构——来分析这些效果。此类模型编码了音乐结构、音色和情感意义之间的丰富关联,提供了一个强大的框架来探测声音设计技术的情感后果。通过应用各种探测方法到深度学习模型的嵌入中,我们考察了音频效果与估计情绪之间的复杂、非线性关系,揭示了与特定效果相关的模式,并评估了基础音频模型的鲁棒性。我们的发现旨在推进对音频制作实践感知影响的理解,对音乐认知、表演和情感计算具有启示意义。

英文摘要

Audio effects (FX) such as reverberation, distortion, modulation, and dynamic range processing play a pivotal role in shaping emotional responses during music listening. While prior studies have examined links between low-level audio features and affective perception, the systematic impact of audio FX on emotion remains underexplored. This work investigates how foundation models - large-scale neural architectures pretrained on multimodal data - can be leveraged to analyze these effects. Such models encode rich associations between musical structure, timbre, and affective meaning, offering a powerful framework for probing the emotional consequences of sound design techniques. By applying various probing methods to embeddings from deep learning models, we examine the complex, nonlinear relationships between audio FX and estimated emotion, uncovering patterns tied to specific effects and evaluating the robustness of foundation audio models. Our findings aim to advance understanding of the perceptual impact of audio production practices, with implications for music cognition, performance, and affective computing.

2605.21997 2026-05-22 cs.AI cs.MA

The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems

日志即代理:用于可审计、可分支代理系统的事件源反应图

Yohei Nakajima

AI总结 本文提出了一种基于事件源的反应图结构,通过将日志作为事实来源,实现了可审计、可分支的代理系统,提供了确定性回放、低成本分支和端到端溯源能力。

Comments 11 pages, 1 figure. Open-source Apache-2.0 implementation with reproducible quickstart demo, deterministic replay, fork-and-diff, and lineage tracing

详情
AI中文摘要

大多数代理框架围绕语言模型构建:先有对话循环,然后是工具,接着是规则,最后是用于可观测性的日志层,状态被保存为可检索的

英文摘要

Most agent frameworks are built around the language model: a conversation loop comes first, then tools, then rules, and finally a logging layer bolted on for observability, with state persisted as retrievable "memory." We describe ActiveGraph, a runtime that inverts this arrangement. The append-only event log is the source of truth; the working graph is a deterministic projection of that log; and behaviors--ordinary functions, classes, LLM-backed routines, or logic attached to typed edges--react to changes in the graph and emit new events. No component instructs another; coordination happens entirely through the shared graph. This single design decision yields three properties that retrieval-and-summarization memory systems do not provide: deterministic replay of any run from its log, cheap forking that branches a run at any event without re-executing the shared prefix, and end-to-end lineage from a high-level goal down to the individual model call that produced each artifact. We present the architecture, a determinism contract that makes replay sound, and a worked diligence example whose full causal structure is reconstructable from the log alone. We discuss--without claiming to demonstrate--why this substrate is unusually well suited to self-improving agents, and how it extends the BabyAGI lineage and prior graph-memory research.

2605.21902 2026-05-22 cs.AI cs.CL

Planning in the LLM Era: Building for Reliability and Efficiency

在大语言模型时代进行规划:构建可靠性与效率

Michael Katz, Harsha Kokel, Kavitha Srinivas, Shirin Sohrabi

AI总结 本文探讨了在大语言模型时代规划领域的发展,重点介绍了通过生成可验证的符号求解器来提高规划的可靠性和效率的方法。

Comments Published at ICAPS 2026

详情
AI中文摘要

随着智能代理受到越来越多的关注,规划能力成为其核心能力之一。早期尝试利用大语言模型(LLMs)进行规划的方法主要依赖于单次计划生成,随后发展出结合LLMs与有限外部搜索的混合方法。这些方法本质上不严谨且不完整,往往需要大量资源,但并未在未见问题上产生更好的解决方案。随着对LLMs局限性的认识加深,近期的研究转向在解决方案构建时使用LLMs,生成可用于验证并高效用于推理时间的一类问题的符号求解器。这一趋势反映了对既可靠又资源高效的代理日益增长的需求。它还提供了一条生成可维护规划器的路径,从而在推理时对语言模型的依赖最小化。在本文中,我们论证这种转变反映了在大语言模型时代规划领域更广泛的真实调整。我们检查了三种主要的规划器生成方法类别,讨论了它们当前的局限性,并概述了朝着更可靠和高效的大语言模型驱动的规划器生成的研究步骤。

英文摘要

Growing attention to intelligent agents has put a spotlight on one of their central capabilities: planning. Early attempts to leverage large language models (LLMs) for planning relied on single-shot plan generation, followed by hybrid approaches that coupled LLMs with limited external search. These methods, unsound and incomplete by their very nature, often require substantial resources without yielding better solutions on unseen problems. As the limitations of LLMs become clearer, recent work has shifted toward using them at solution construction time -- generating symbolic solvers for a family of problems that can be verified and then used efficiently at inference time. This trend reflects the growing need for agents that are both reliable and resource-efficient. It also offers a path towards generating maintainable planners with minimal dependence on language models at inference time. In this paper, we argue that this shift reflects a broader realignment of the planning field in the LLM era. We examine three major categories of planner-generation methods, discuss their current limitations, and outline research steps towards a more reliable and efficient LLM-based generation of planners.

2605.21868 2026-05-22 cs.LG

When to Switch, Not Just What: Transition Quality Prediction in Clash Royale

何时切换,而不仅仅是选择:Clash Royale中的切换质量预测

Heeyun Heo, Huy Kang Kim

AI总结 该研究探讨了竞技游戏中玩家在连续失利后切换策略的频率与胜率之间的反向关联,提出了一种基于切换质量预测(TQP)的三阶段方法,通过PersonaGate、TimingGate和ScoreFusion来优化策略推荐,并引入SwitchGap作为评估指标,以衡量策略的判别质量。

Comments 11 pages, 2 figures, 4 tables; Accepted at IEEE Conference on Games (CoG) 2026

详情
AI中文摘要

在竞技游戏中,玩家经常在连续失利后切换策略,但通过对34,619名Clash Royale玩家的926,334场比赛记录分析,发现切换频率与胜率之间存在反直觉的关联:切换频率与胜率成反比,且这种影响在不同玩家和情境中差异显著。我们归因于许多先前推荐系统的一个局限性,即仅通过预期质量评估策略,而忽略了切换行为的成本和个体在切换倾向上的差异。我们将这一隐含前提称为零切换成本假设。为了解决这一问题,我们将策略推荐重新表述为一个过渡层面的决策问题,并将其实例化为TQP(Transition Quality Predictor),一个三阶段的流程,结构为Who -> When -> What。PersonaGate抑制了那些在经验上与更优结果相关联的玩家的推荐。TimingGate识别出切换可能比保持更有净收益的时刻,使用子类型和状态匹配的基线来控制自然胜率恢复。ScoreFusion通过结合采用性信号和预测的过渡质量(delta WR)来对候选策略进行排名。我们进一步引入了SwitchGap,一种衡量策略判别质量的评估指标,不将观察到的玩家选择视为最优地面真实。这一属性尤为重要,因为最频繁切换的玩家记录了最低的胜率。完整的流程在推荐率为5.4%时实现了SwitchGap的+10.4个百分点,尽管在表现最差的群体中,触发损失的切换者从子类型条件指导中受益最大。

英文摘要

In competitive games, players frequently switch strategies after losing streaks, yet our analysis of 926,334 match records from 34,619 Clash Royale players reveals a counterintuitive pattern: switching frequency is inversely associated with the win rate, with effects that vary substantially across players and situational contexts. We attribute this to a limitation common in many prior recommendation systems, which evaluate strategies by expected quality while overlooking the behavioral cost of switching and individual differences in switching propensity. We refer to this implicit premise as the Zero Switching Cost Assumption. To address this, we reformulate strategy recommendation as a transition-level decision problem and instantiate it as TQP (Transition Quality Predictor), a three-stage pipeline structured as Who -> When -> What. PersonaGate suppresses recommendations for players whose strategic consistency is empirically associated with superior outcomes. TimingGate identifies moments when switching is likely to yield a net benefit over staying, using a subtype- and state-matched baseline to control for natural win-rate recovery. ScoreFusion ranks candidate strategies by combining an adoptability signal with predicted transition quality (delta WR). We further introduce SwitchGap, an evaluation metric that measures a policy's discriminative quality without treating observed player choices as optimal ground truth. This property is particularly important because the most frequent switchers record the lowest win rates. The full pipeline achieves a SwitchGap of +10.4 percentage points at a recommendation rate of 5.4%, and loss-triggered switchers, despite being the lowest-performing group, benefit the most from subtype-conditioned guidance.

2605.21836 2026-05-22 cs.RO

Analytical and Experimental Force Analysis of a Soft Linear Pneumatic Actuator

软线性气动执行器的分析与实验力分析

Mohammed Abboodi

AI总结 本文通过分析和实验研究了一种线性软套筒执行器(LSSA)的力特性,探讨了压力、几何形状、位移、负载和轴向刚度之间的耦合效应,揭示了其力生成机制。

详情
AI中文摘要

软套筒执行器(SSAs)最近被开发为可穿戴和辅助机器人系统中的气动驱动方法。通过将驱动结构集成到套筒状几何形状中,这些执行器可以减少对外部附件层和传动机制的依赖,同时保持与肢体形状表面的顺应性。然而,SSAs的力生成行为仍然解释不足,特别是在伸展过程中输出力的变化、外部负载的影响以及轴向刚度的机械作用方面。本文提出了线性软套筒执行器(LSSA)的分析和实验力分析。通过将净轴向力表示为由帽和折叠壁产生的压力生成贡献,并减去与轴向刚度相关的力,开发了一个准静态分析模型。该模型结合了内部压力、投影压力面积、折叠壁几何形状、轴向位移以及实验拟合的轴向刚度关系。进行了预设伸展和静态负载实验以评估执行器响应。在125kPa时,生成的力从零伸展时的约112N减少到40mm时几乎为零。静态负载延迟了可测量的力生成并减少了力输出,特别是在低和中等压力下。结果表明,LSSA的力生成由压力、几何形状、位移、负载和轴向刚度的耦合效应所支配。

英文摘要

Soft sleeve actuators (SSAs) have recently been developed as a pneumatic actuation approach for wearable and assistive robotic systems. By integrating the actuation structure into a sleeve-like geometry, these actuators can reduce reliance on external attachment layers and transmission mechanisms while maintaining compliance with limb-shaped surfaces. However, the force-generation behavior of SSAs remains insufficiently explained, particularly with respect to the variation of output force during extension, the influence of external loading, and the mechanical role of axial stiffness. This paper presents an analytical and experimental force analysis of a linear soft sleeve actuator (LSSA). A quasi-static analytical model was developed by expressing the net axial force as the pressure-generated contribution from the cap and folded walls, reduced by the force associated with axial stiffness. The model incorporates internal pressure, projected pressure areas, folded wall geometry, axial displacement, and an experimentally fitted axial stiffness relation. Prescribed-extension and static-load experiments were conducted to evaluate the actuator response. At 125 kPa, the generated force decreased from approximately 112 N at zero extension to nearly zero at 40 mm. Static loading delayed measurable force generation and reduced force output, particularly at low and intermediate pressures. The results show that LSSA force generation is governed by coupled effects of pressure, geometry, displacement, loading, and axial stiffness.

2605.21724 2026-05-22 cs.LG cs.AI

TBP-mHC: full expressivity for manifold-constrained hyper connections through transportation polytopes

TBP-mHC: 通过运输多面体实现 manifold-constrained 超连接的全表达性

Anton Lyubinin

AI总结 本文提出 TBP-mHC,通过运输多面体参数化实现 manifold-constrained 超连接的全表达性,解决了超连接中无约束混合导致的训练不稳定性问题,并在语言模型预训练中展示了竞争性性能和改进的稳定性与可扩展性。

详情
AI中文摘要

超连接(HC)通过在多个残差流之间引入可学习的混合来改进残差网络,但无约束的混合导致训练不稳定。Manifold-Constrained Hyper-Connections(mHC)通过Sinkhorn归一化强制近似双随机性,而mHC-lite则通过置换矩阵的凸组合确保精确约束,但以阶乘复杂度为代价。KromHC通过Kronecker积参数化减少此成本,但限制混合矩阵为Birkhoff多面体的结构子流形。我们提出运输Birkhoff多面体(TBP)参数化及其递归变体(RTBP),通过(n-1)^2自由度构造精确的双随机混合矩阵。我们的方法避免了迭代归一化和组合爆炸,同时保持Birkhoff多面体的完整表达性。在语言模型预训练中的实验证明了竞争性性能,同时具有改进的稳定性和可扩展性。

英文摘要

Hyper-Connections (HC) improve residual networks by introducing learnable mixing across multiple residual streams, but unconstrained mixing leads to training instability. Manifold-Constrained Hyper-Connections (mHC) address this by enforcing approximate double stochasticity via Sinkhorn normalization, while mHC-lite ensures exact constraints through convex combinations of permutation matrices at the cost of factorial complexity. KromHC reduces this cost using Kronecker-product parameterizations, but restricts the mixing matrices to a structured submanifold of the Birkhoff polytope . We propose Transportation Birkhoff Polytope (TBP) parameterizations and their Recursive variants (RTBP), which construct exactly doubly stochastic mixing matrices with $(n-1)^2$ degrees of freedom. Our approach avoids iterative normalization and combinatorial explosion while preserving full expressivity of the Birkhoff polytope. Empirical results on language model pre-training' demonstrate competitive performance with improved stability and scalability.

2605.12836 2026-05-22 cs.LG

Discrete Stochastic Localization for Non-autoregressive Generation

非自回归生成的离散随机定位

Yunshu Wu, Jiayi Cheng, Longxuan Yu, Partha Thakuria, Rob Brekelmans, Evangelos E. Papalexakis, Greg Ver Steeg

AI总结 本文提出了一种连续状态框架,通过单位球体令牌嵌入实现离散随机定位,以提高离散序列生成的分布忠实度,并展示了在OpenWebText上改进MAUVE指标的效果。

Comments This work was intended as a replacement of arXiv:2602.16169 and any subsequent updates will appear there

详情
AI中文摘要

连续扩散是一种非自回归生成的自然框架,但在离散序列生成上通常落后于掩码离散扩散模型(MDMs)。我们争论瓶颈不是连续性本身,而是在于表示中去噪依赖于时间步索引的噪声制度。我们引入了离散随机定位(DSL),一种具有单位球体令牌嵌入的连续状态框架,其贝叶斯最优去噪器在定位信道下对名义信号噪声比(SNR)具有不变性。一个训练好的网络可以支持整个SNR路径家族,端点掩码扩散路径是特殊情况。对预训练MDLM检查点进行微调可显著提高OpenWebText上的分布忠实度(MAUVE)在所有步骤预算从T=128到T=1024,且同一检查点支持随机顺序自回归采样,以及使用最少T=48总步骤的混合连续-然后-离散采样器,无需蒸馏或重新训练。

英文摘要

Continuous diffusion is a natural framework for non-autoregressive generation but has generally lagged behind masked discrete diffusion models (MDMs) on discrete sequence generation. We argue that the bottleneck is not continuity itself, but a representation in which denoising depends on timestep-indexed noise regimes. We introduce \emph{Discrete Stochastic Localization} (DSL), a continuous-state framework with unit-sphere token embeddings whose Bayes-optimal denoiser is invariant to the nominal signal-to-noise ratio (SNR) under the localization channel. One trained network then supports an entire family of per-token SNR paths, with endpoint masked-diffusion paths as a special case. Fine-tuning a pretrained MDLM checkpoint with DSL substantially improves distributional faithfulness (MAUVE) on OpenWebText across all step budgets from $T{=}128$ to $T{=}1024$, and the same checkpoint supports random-order autoregressive sampling, as well as a hybrid continuous-then-discrete sampler using as few as T=48 total steps -- without distillation or retraining.

2604.14084 2026-05-22 cs.LG cs.AI

TIP: Token Importance in On-Policy Distillation

TIP: on-policy distillation 中的 token 重要性

Yuanda Xu, Hejian Sang, Zhengze Zhou, Ran He, Zhipeng Wang, Alborz Geramifard

AI总结 本研究探讨了在 on-policy 知识蒸馏中哪些 token 对学习信号最有用,提出了一种基于学生熵和教师-学生分歧的双轴分类方法,并通过实验验证了在有限内存条件下使用少量 token 进行蒸馏的有效性。

详情
AI中文摘要

On-policy knowledge distillation (OPD) trains a student on its own rollouts under token-level supervision from a teacher. Not all token positions matter equally, but existing views of token importance are incomplete. We ask a direct question: which tokens carry the most useful learning signal in OPD? Our answer is that informative tokens come from two regions: positions with high student entropy, and positions with low student entropy plus high teacher--student divergence, where the student is overconfident and wrong. Empirically, student entropy is a strong first-order proxy: retaining 50% of tokens with entropy-based sampling matches or exceeds all-token training while reducing peak memory by up to 47%. But entropy alone misses a second important region. When we isolate low-entropy, high-divergence tokens, training on fewer than 10% of all tokens nearly matches full-token baselines, showing that overconfident tokens carry dense corrective signal despite being nearly invisible to entropy-only rules. We organize these findings with TIP (Token Importance in on-Policy distillation), a two-axis taxonomy over student entropy and teacher--student divergence, and give a theoretical explanation for why entropy is useful yet structurally incomplete. This view motivates type-aware token selection rules that combine uncertainty and disagreement. We validate this picture across three teacher--student pairs spanning Qwen3, Llama, and Qwen2.5 on MATH-500 and AIME 2024/2025, and on the DeepPlanning benchmark for long-horizon agentic planning, where Q3-only training on <20% of tokens surpasses full-token OPD. Our experiments are implemented by extending the OPD repository https://github.com/HJSang/OPSD_OnPolicyDistillation, which supports memory-efficient distillation of larger models under limited GPU budgets.

英文摘要

On-policy knowledge distillation (OPD) trains a student on its own rollouts under token-level supervision from a teacher. Not all token positions matter equally, but existing views of token importance are incomplete. We ask a direct question: which tokens carry the most useful learning signal in OPD? Our answer is that informative tokens come from two regions: positions with high student entropy, and positions with low student entropy plus high teacher--student divergence, where the student is overconfident and wrong. Empirically, student entropy is a strong first-order proxy: retaining $50\%$ of tokens with entropy-based sampling matches or exceeds all-token training while reducing peak memory by up to $47\%$. But entropy alone misses a second important region. When we isolate low-entropy, high-divergence tokens, training on fewer than $10\%$ of all tokens nearly matches full-token baselines, showing that overconfident tokens carry dense corrective signal despite being nearly invisible to entropy-only rules. We organize these findings with TIP (Token Importance in on-Policy distillation), a two-axis taxonomy over student entropy and teacher--student divergence, and give a theoretical explanation for why entropy is useful yet structurally incomplete. This view motivates type-aware token selection rules that combine uncertainty and disagreement. We validate this picture across three teacher--student pairs spanning Qwen3, Llama, and Qwen2.5 on MATH-500 and AIME 2024/2025, and on the DeepPlanning benchmark for long-horizon agentic planning, where Q3-only training on $<$$20\%$ of tokens surpasses full-token OPD. Our experiments are implemented by extending the OPD repository https://github.com/HJSang/OPSD_OnPolicyDistillation, which supports memory-efficient distillation of larger models under limited GPU budgets.

2411.02813 2026-05-22 cs.LG

Sparse Orthogonal Parameters Tuning for Continual Learning

稀疏正交参数调优用于持续学习

Kun-Peng Ning, Hai-Jian Ke, Yu-Yang Liu, Jia-Yu Yao, Yong-Hong Tian, Li Yuan

AI总结 本文提出了一种名为SoTU的新型方法,通过稀疏正交参数调优来解决持续学习中的灾难性遗忘问题,实现了对流数据的最优特征表示。

详情
AI中文摘要

基于预训练模型(PTM)的持续学习方法近年来引起了广泛关注,这些方法能够适应连续的下游任务而无需灾难性遗忘。这些方法通常不更新预训练参数,而是使用额外的适配器、提示和分类器。在本文中,我们从新的角度研究了稀疏正交参数对持续学习的益处。我们发现,合并来自多个流任务的模型所学习的稀疏正交性在解决灾难性遗忘方面具有巨大潜力。利用这一见解,我们提出了一种新颖且有效的称为SoTU(稀疏正交参数调优)的方法。我们假设SoTU的有效性在于将多个领域学到的知识转换为正交delta参数的融合。在多样化的CL基准测试中评估了所提出的方法的有效性。值得注意的是,SoTU在不需要复杂分类器设计的情况下实现了流数据的最优特征表示,使其成为一种即插即用的解决方案。

英文摘要

Continual learning methods based on pre-trained models (PTM) have recently gained attention which adapt to successive downstream tasks without catastrophic forgetting. These methods typically refrain from updating the pre-trained parameters and instead employ additional adapters, prompts, and classifiers. In this paper, we from a novel perspective investigate the benefit of sparse orthogonal parameters for continual learning. We found that merging sparse orthogonality of models learned from multiple streaming tasks has great potential in addressing catastrophic forgetting. Leveraging this insight, we propose a novel yet effective method called SoTU (Sparse Orthogonal Parameters TUning). We hypothesize that the effectiveness of SoTU lies in the transformation of knowledge learned from multiple domains into the fusion of orthogonal delta parameters. Experimental evaluations on diverse CL benchmarks demonstrate the effectiveness of the proposed approach. Notably, SoTU achieves optimal feature representation for streaming data without necessitating complex classifier designs, making it a Plug-and-Play solution.

2402.11621 2026-05-22 cs.CL

Decoding News Narratives: A Critical Analysis of Large Language Models in Framing Detection

解码新闻叙述:对大型语言模型在框架检测中的关键分析

Valeria Pastorino, Jasivan A. Sivakumar, Nafise Sadat Moosavi

AI总结 本文研究了大型语言模型在框架检测中的应用,分析了不同模型在零样本、少样本和解释性提示设置下的表现,指出模型性能对提示设计敏感且易在模糊案例中出现系统性错误,并提出了一种新的跨领域新闻标题数据集以提高评估的现实性。

Journal ref Proceedings of the 3rd Workshop on Natural Language Processing for Political Sciences (PoliticalNLP 2026) @ LREC 2026, pages 17-28

详情
AI中文摘要

随着新闻报道的复杂性和多样性增加,框架分析已成为计算社会科学中的关键但具有挑战性的任务。传统方法,包括手动标注和微调模型,仍然受到高标注成本、领域特定性和不一致泛化能力的限制。基于指令的大型语言模型(LLMs)提供了一个有前景的替代方案,但它们在框架分析中的可靠性尚不充分。本文系统评估了几个LLMs,包括GPT-3.5/4、FLAN-T5和Llama 3,在零样本、少样本和基于解释的提示设置下的表现。聚焦于领域转移和固有的标注模糊性,我们显示模型性能高度敏感于提示设计,并且在模糊案例中容易出现系统性错误。尽管LLMs,特别是GPT-4,表现出更强的跨领域泛化能力,但它们也显示出系统性偏见,最值得注意的是倾向于将情感语言与框架混淆。为了在现实世界的话题多样性下实现原则性评估,我们引入了一种新的跨领域新闻标题数据集。最后,通过分析多个模型在现有框架数据集上的一致性模式,我们证明了跨模型共识为识别争议标注提供了一个有用的信号,为低资源环境下的数据集审计提供了一种实用方法。

英文摘要

The growing complexity and diversity of news coverage have made framing analysis a crucial yet challenging task in computational social science. Traditional approaches, including manual annotation and fine-tuned models, remain limited by high annotation costs, domain specificity, and inconsistent generalisation. Instruction-based large language models (LLMs) offer a promising alternative, yet their reliability for framing analysis remains insufficiently understood. In this paper, we conduct a systematic evaluation of several LLMs, including GPT-3.5/4, FLAN-T5, and Llama 3, across zero-shot, few-shot, and explanation-based prompting settings. Focusing on domain shift and inherent annotation ambiguity, we show that model performance is highly sensitive to prompt design and prone to systematic errors on ambiguous cases. Although LLMs, particularly GPT-4, exhibit stronger cross-domain generalisation, they also display systematic biases, most notably a tendency to conflate emotional language with framing. To enable principled evaluation under real-world topic diversity, we introduce a new dataset of out-of-domain news headlines covering diverse subjects. Finally, by analysing agreement patterns across multiple models on existing framing datasets, we demonstrate that cross-model consensus provides a useful signal for identifying contested annotations, offering a practical approach to dataset auditing in low-resource settings.

1607.06330 2026-05-22 cs.CL

La representación de la variación contextual mediante definiciones terminológicas flexibles

通过灵活的术语定义实现语境变化的表示

Antonio San Martín

AI总结 本文研究了如何通过灵活的术语定义来反映环境领域中专业概念在不同语境下的变化,提出了灵活的术语定义方法,并通过实证研究分析了语境变化对术语定义的影响。

Comments PhD Thesis. in Spanish. University of Granada. 2016

详情
AI中文摘要

在本博士论文中,我们应用认知语言学的原理到术语定义,并提出了一种称为灵活术语定义的提案。这包括由一组同一概念的定义组成,包括一个一般定义(在此情况下,涵盖整个环境领域)以及额外的定义,从相关子领域的角度描述该概念。由于语境是构建词汇单位(包括术语)意义的关键因素,我们假设术语定义可以并且应该反映语境的影响,尽管定义传统上被视为与任何语境因素无关的意义表达。本论文的主要目标是分析语境变化对专业环境概念的影响,以它们在术语定义中的表示为视角。具体而言,我们专注于基于主题限制的语境变化。为了完成本博士论文的目标,我们进行了实证研究,包括对一组具有语境变化的概念进行分析,并为其中两个概念创建灵活的定义。在我们实证研究的第一部分,我们将我们的领域依赖性语境变化概念划分为三种不同的现象:调制、视角化和子概念化。这些现象是叠加的,即所有概念都经历调制,一些概念还经历视角化,最后,少数概念还经历子概念化。在第二部分,我们应用这些概念到术语定义,并提出了如何构建灵活定义的指南,从知识提取到定义的实际写作。

英文摘要

In this doctoral thesis, we apply premises of cognitive linguistics to terminological definitions and present a proposal called the flexible terminological definition. This consists of a set of definitions of the same concept made up of a general definition (in this case, one encompassing the entire environmental domain) along with additional definitions describing the concept from the perspective of the subdomains in which it is relevant. Since context is a determining factor in the construction of the meaning of lexical units (including terms), we assume that terminological definitions can, and should, reflect the effects of context, even though definitions have traditionally been treated as the expression of meaning void of any contextual effect. The main objective of this thesis is to analyze the effects of contextual variation on specialized environmental concepts with a view to their representation in terminological definitions. Specifically, we focused on contextual variation based on thematic restrictions. To accomplish the objectives of this doctoral thesis, we conducted an empirical study consisting of the analysis of a set of contextually variable concepts and the creation of a flexible definition for two of them. As a result of the first part of our empirical study, we divided our notion of domain-dependent contextual variation into three different phenomena: modulation, perspectivization and subconceptualization. These phenomena are additive in that all concepts experience modulation, some concepts also undergo perspectivization, and finally, a small number of concepts are additionally subjected to subconceptualization. In the second part, we applied these notions to terminological definitions and we presented we presented guidelines on how to build flexible definitions, from the extraction of knowledge to the actual writing of the definition.

2605.22124 2026-05-22 stat.ML cs.LG math.PR

From Betting to Empirical Bernstein LIL

从赌局到经验伯恩斯坦LIL

Francesco Orabona

AI总结 本文通过在线投注策略的财富保证,推导出迭代对数定律,并提出经验伯恩斯坦LIL方法。

详情
AI中文摘要

This is a verbatim copy of a technical report I wrote in 2017-2018 to obtain the law of the iterated logarithm using the guarantee on the wealth of an online betting strategy.

英文摘要

This is a verbatim copy of a technical report I wrote in 2017-2018 to obtain the law of the iterated logarithm using the guarantee on the wealth of an online betting strategy.

2605.16299 2026-05-22 cs.SE cs.AI

ACE: Self-Evolving LLM Coding Framework via Adversarial Unit Test Generation and Preference Optimization

ACE:通过对抗性单元测试生成和偏好优化的自进化LLM编码框架

Yixu Huang, Xinglei Yu, Zhongyu Wei

AI总结 本文提出ACE框架,通过基于求解器-对抗架构的执行中心监督,实现自进化代码生成,无需真实代码或外部奖励模型,实验表明其在CodeContests、MBPP和LiveCodeBench上均优于现有求解器-验证器基线。

详情
AI中文摘要

大型语言模型(LLMs)在代码生成方面表现出色,但仍然严重依赖大规模标注解决方案和基于验证的监督,这限制了可扩展性和持续自我改进的能力。最近的求解器-验证器框架利用程序执行作为自动监督信号,但其有效性在求解器变得中等强大时会下降:验证器生成的测试越来越多地确认语义正确性,而不是暴露剩余的失败模式。我们提出了ACE,一种基于求解器-对抗架构的自进化代码生成框架,优先通过以执行为中心的监督进行主动失败发现。一个单一的LLM在生成候选程序和生成优化以诱导执行级失败(如运行时错误、异常或非终止)的对抗性单元测试输入之间交替进行。监督仅来源于执行结果:稳健的程序被选为监督微调,而对抗性测试通过Kahneman-Tversky优化使用执行衍生的偏好进行优化。值得注意的是,整个训练循环不需要真实代码或外部奖励模型。在CodeContests、MBPP和LiveCodeBench上的实验表明,ACE在pass@1上持续优于强大的求解器-验证器基线,实现了3-7%的绝对提升,在分布外基准上改进更大,同时保持竞争性或改进的推理效率。

英文摘要

Large Language Models (LLMs) excel at code generation but remain heavily reliant on large-scale annotated solutions and verification-based supervision, which constrains scalability and hinders sustained self-improvement. Recent solver--verifier frameworks exploit program execution as an automatic supervision signal, but their effectiveness degrades as solvers become moderately strong: verifier-generated tests increasingly confirm semantic correctness rather than exposing the remaining failure modes. We propose \textbf{ACE}, a self-evolving code generation framework based on a solver--adversary architecture that prioritizes active failure discovery through execution-centric supervision. A single LLM alternates between generating candidate programs and producing adversarial unit test inputs optimized to induce execution-level failures, such as runtime errors, exceptions, or non-termination. Supervision is derived solely from execution outcomes: robust programs are selected for supervised fine-tuning, while adversarial tests are optimized via Kahneman--Tversky Optimization using execution-derived preferences. Notably, the entire training loop requires no ground-truth code or external reward models. Experiments on CodeContests, MBPP, and LiveCodeBench demonstrate that ACE consistently outperforms strong solver--verifier baselines, achieving 3--7\% absolute gains in pass@1, with larger improvements on out-of-distribution benchmarks, while maintaining competitive or improved inference efficiency.

2604.03501 2026-05-22 cs.HC cs.AI

The Augmentation Trap: AI Productivity and the Cost of Cognitive Offloading

增强陷阱:人工智能生产力与认知卸载的成本

Michael Caosun, Sinan Aral

AI总结 本文研究了人工智能工具对工人生产力的影响,发现尽管短期生产力提升,但持续使用会侵蚀工人技能。通过动态模型分析,发现即使预期技能损耗,理性决策者仍可能在短期收益大于长期成本时采用AI,导致稳态损失。此外,当管理者短视或工人技能具有外部价值时,决策者可能陷入增强陷阱,使工人状况恶化。最后,当AI生产力与工人技能关联较弱时,工人技能可能永久分化,经验丰富的工人实现潜力,而经验不足的工人技能降至零。

详情
AI中文摘要

实验证据证实,AI工具提高了工人的生产力,但持续使用也会侵蚀支撑这些收益的专业技能。我们开发了一个动态模型,其中决策者在时间上选择工人使用AI的强度,权衡即时生产力与工人技能的损耗。我们将工具的生产力效应分解为两个渠道,一个与工人技能无关,另一个随技能变化。模型产生了三个主要结果。第一,即使决策者完全预见技能损耗,理性决策者在短期生产力收益超过长期技能成本时仍会采用AI,产生稳态损失:工人最终比采用AI前更不 productive。第二,当管理者短视或工人技能具有外部价值时,决策者的最优政策将稳态损失转化为增强陷阱,使工人状况比未采用AI时更差。第三,当AI生产力较少依赖工人技能时,工人技能可以永久分化:经验丰富的工人实现全部潜力,而经验较少的工人技能降至零。小的管理激励差异决定了工人的路径。生产力分解将部署分为五个制度,区分有益和有害的采用,并识别哪些部署容易陷入陷阱。

英文摘要

Experimental evidence confirms that AI tools raise worker productivity, but also that sustained use can erode the expertise on which those gains depend. We develop a dynamic model in which a decision-maker chooses AI usage intensity for a worker over time, trading immediate productivity against the erosion of worker skill. We decompose the tool's productivity effect into two channels, one independent of worker expertise and one that scales with it. The model produces three main results. First, even a decision-maker who fully anticipates skill erosion rationally adopts AI when front-loaded productivity gains outweigh long-run skill costs, producing steady-state loss: the worker ends up less productive than before adoption. Second, when managers are short-termist or worker skill has external value, the decision-maker's optimal policy turns steady-state loss into the augmentation trap, leaving the worker worse off than if AI had never been adopted. Third, when AI productivity depends less on worker expertise, workers can permanently diverge in skill: experienced workers realize their full potential while less experienced workers deskill to zero. Small differences in managerial incentives can determine which path a worker takes. The productivity decomposition classifies deployments into five regimes that separate beneficial adoption from harmful adoption and identifies which deployments are vulnerable to the trap.

2605.21915 2026-05-22 cs.CR cs.LG

CCLab: Adversarial Testing of Learning- and Non-Learning-Based Congestion Controllers

CCLab: 学习型和非学习型拥塞控制器的对抗测试

Zhi Chen, Shehab Sarar Ahmed, Chenkai Wang, Brighten Godfrey, Gang Wang

AI总结 本文提出CCLab框架,用于系统评估学习型和非学习型拥塞控制器在对抗性条件下的鲁棒性,发现学习型控制器在对抗测试中比传统算法更鲁棒,并展示了对抗性追踪可用于训练更鲁棒的拥塞控制器。

Comments 13 pages for main paper, 16 pages in total

详情
AI中文摘要

拥塞控制器(CCs)对网络性能至关重要,但其在恶劣条件下的鲁棒性仍不够了解。尽管最近的学习型CCs在受控环境中表现出色,但当控制器的输入信号被破坏或环境条件变得系统性挑战时,其与传统CCs的性能对比尚不清楚。本文介绍CCLab,一种对抗测试框架,用于系统评估学习型和非学习型CCs的鲁棒性。CCLab包含一个基于强化学习(RL)的对抗代理,在闭环中与拥塞控制策略协同工作,生成受约束的扰动,无论是对输入信号(特征级)还是外部网络条件(环境级),同时通过显式约束保持现实性。利用此框架,我们在特征级和环境级对抗性条件下比较学习型和非学习型CCs。尽管两种类型的CCs在对抗测试中均出现性能下降,但学习型CCs总体上比传统人工设计算法更鲁棒。最后,我们展示对抗性追踪可用于训练更鲁棒的CCs,其在挑战性和正常条件下均优于现有学习型CCs。

英文摘要

Congestion controllers (CCs) are critical to network performance, and yet their robustness under adverse conditions remains insufficiently understood. While recent learning-based CCs have demonstrated strong performance in controlled environments, it is unclear how they compare to traditional CCs when controllers' input signals are corrupted or when environmental conditions become systematically challenging. In this paper, we introduce CCLab, an adversarial testing framework for systematically evaluating the robustness of both learning-based and non-learning-based CCs. CCLab includes a reinforcement learning (RL)-based adversarial agent that operates in a closed loop with the congestion control policy, generating bounded perturbations either on input signals (feature-level) or on external network conditions (environment-level), while preserving realism through explicit constraints. Using this framework, we compare learning-based CCs with non-learning-based CCs under both feature-level and environment-level adversarial conditions. While both types of CCs suffer from performance degradation under adversarial testing, we find that learning-based CCs, in general, are more robust than traditional human-designed algorithms. Finally, we show that our adversarial traces can be used to train more robust CCs that outperform existing learning-based CCs under both challenging and normal conditions.

2605.22822 2026-05-22 hep-th hep-ph quant-ph

Bottom-up open EFT for non-Abelian gauge theory with dynamical color environment

自底向上的开放有效场论用于非阿贝尔规范理论中的动态颜色环境

Yoshihiko Abe, Kanji Nishii

AI总结 本文提出了一种自底向上的开放有效场论,用于描述非阿贝尔规范理论中动态颜色环境的色响应,通过保留慢响应变量构建局部系统-环境有效场论,并展示了硬热环响应作为保留环境响应的一种实现。

Comments 51 pages, 1 figure

详情
AI中文摘要

我们发展了一种基于Schwinger-Keldysh形式化方法的自底向上的开放有效场论(EFT),用于非阿贝尔规范理论。与完全积分出环境并从非局域影响功能开始不同,我们显式保留慢环境响应变量,并构建了局部的系统-环境EFT。环境部分由一个动态的颜色框架变量、类似Stückelberg的场以及相关的颜色电流部分描述,这给出了系统与环境之间非平凡的相互作用和耗散。所得到的构造提供了非局域和非马尔可夫色响应的协变马尔可夫嵌入。在积分出保留的环境变量并采用延迟边界条件后,简化的系统理论获得了非局域耗散核和随机源。我们展示了硬热环响应自然地作为保留环境响应的一种实现。本文框架提供了非阿贝尔等离子体中色输运、记忆效应和涨落-耗散结构的局部开放EFT描述,并为具有动态环境的耗散杨-米尔斯EFTs提供了系统性的起点。

英文摘要

We develop a bottom-up open effective field theory (EFT) for non-Abelian gauge theories within the Schwinger--Keldysh formalism. Instead of integrating out the environment completely and starting from a nonlocal influence functional, we retain the slow environmental response variables explicitly and construct a local system-environment EFT. The environmental sector is described by a dynamical color-frame variable, Stückelberg-like field, and an associated color-current sector, which gives the nontrivial interactions and dissipation between the system and the environment. The resulting construction provides a gauge-covariant Markov embedding of nonlocal and non-Markovian color response. After integrating out the retained environmental variables with retarded boundary conditions, the reduced system theory acquires nonlocal dissipative kernels and stochastic sources. We show that the hard thermal loop response arises naturally as a particular realization of the retained environmental response. Our framework provides a local open-EFT description of color transport, memory effects, and fluctuation-dissipation structure in non-Abelian plasmas, and offers a systematic starting point for dissipative Yang--Mills EFTs with dynamical environments.

2605.22815 2026-05-22 hep-ex hep-ph

New constraints on physics within and beyond the standard model from the latest CONUS datasets

从最新CONUS数据集对标准模型内和外的物理的新约束

N. Ackermann, H. Bonet, A. Bonhomme, C. Buck, 1 K. Fülber, J. Hakenmüller, J. Hempfling, G. Heusser, T. Hugle, M. Lindner, W. Maneschg, S. Mertens, K. Ni, D. Piani, M. Rank, T. Rink, E. Sanchez Garcia, I. Stalder, H. Strecker, R. Wink, J. Woenckhaus

AI总结 利用中性子衰变静止、太阳和最近的反应堆反中性子检测,CONUS合作组将相干弹性中性子-核散射(CEνNS)确立为研究标准模型内和外物理的工具,通过最新数据集进一步改进了中性子磁矩和中性子毫电荷的限制,并降低了与NSI相关的新的物理尺度和轻新媒介耦合的限制。

Comments 35 pages, 16 figures, 6 tables; comments welcome

详情
AI中文摘要

通过中性子衰变静止、太阳和最近的反应堆反中性子检测,CONUS合作组将相干弹性中性子-核散射(CEνNS)确立为研究标准模型内和外物理的工具。CONUS实验位于德国布罗克多夫和瑞士莱比锡特核电站,使用锗半导体探测器在紧凑屏蔽中靠近反应堆核心运行。在莱比锡特站点报告了3.7σ显著性观测结果,与标准模型预测良好一致。总结了在布罗克多夫反应堆和莱比锡特站点收集的最新数据集上进行的物理研究。通过实验分析框架,所呈现的结果包含实验背后的完整系统。之前确定的中性子-电子散射中中性子磁矩和中性子毫电荷的限制被改进为μ_ν <5.18·10^{-11}μ_B和q_ν<1.76·10^{-12}e_0(90%置信水平)。此外,与NSI相关的新的物理尺度被改进到Λ_{NSI}=145 GeV,轻新媒介耦合的限制被降低到4·10^{-7}(90%置信水平)。最后,利用CEνNS和反应堆反中性子确定的Weinberg角为sin^{2}θ_W=0.28^{+0.03}_{-0.04},在动量转移约为10 MeV时。

英文摘要

Its detections with pion-decay-at-rest, solar and recently with reactor antineutrinos by the CONUS collaboration render coherent elastic neutrino-nucleus scattering (CE$ν$NS) an established tool for investigations within and beyond the Standard Model (SM). The CONUS experiment located at the nuclear power plants in Brokdorf (Germany) and Leibstadt (Switzerland) operates Germanium semiconductor detectors in a compact shield at close distance to the reactor core. An observation with $3.7 σ$ significance is reported at the Leibstadt site, showing good agreement with its SM prediction. Physics investigations performed with the last datasets collected at the Brokdorf reactor and with the first data obtained at the Leibstadt site are summarized. By using the experimental analysis framework, the presented results contain the full systematics that underlie the experiment. Previously determined limits with neutrino-electron scattering on the neutrino magnetic moment and a neutrino millicharge are improved to $μ_ν <5.18\cdot 10^{-11}μ_\mathrm{B}$ and $q_ν<1.76\cdot 10^{-12} e_0$ (90% C.L). Further, the scale of new physics related to NSIs is improved to $Λ_{\rm NSI}$=145 GeV and limits on the coupling of light new mediators are lowered down to $4 \cdot 10^{-7}$ (90% C.L.) with the new data. Finally, the determination of the Weinberg angle with CE$ν$NS and reactor antineutrinos yields $\sin^{2}θ_W= 0.28^{+0.03}_{-0.04}$ at a momentum transfer of $\sim 10 \ \mathrm{MeV}$.

2605.22813 2026-05-22 cs.DS

Optimal Testing of Reed-Muller Codes with an Online Adversary

Reed-Muller码的最优测试与在线对抗

Esty Kelman, Uri Meir, Kai Zhe Zheng

AI总结 本文提出了一种半采样测试器,用于在在线擦除模型中对Reed-Muller码进行最优测试,改进了Minzer和Zheng的工作,并首次为提升的仿射不变码提供了在线擦除模型下的测试方法。

详情
AI中文摘要

受Kalemaj、Raskhodnikova和Varma(ITCS 2022和Theory of Computing 2023)在线擦除模型中属性测试应用的启发,我们定义并分析了Reed-Muller码的半采样测试器。Reed-Muller测试的任务是通过尽可能少的点查询来确定输入函数$f: \F^n o \F$是否属于Reed-Muller码或与其远距离。Reed-Muller测试在属性测试和概率可验证证明文献中均有深入研究。在线擦除模型引入了新的挑战:每次查询后,对手可能擦除输入函数的最多$t$个点,这可能阻止任何查询遵循可预测模式的测试。半采样测试器是样本测试器和标准测试器之间的混合体:样本测试器只能对输入函数进行均匀随机查询,而标准测试器可以自由选择查询。它们是为在线擦除模型设计的,操作方式是首先选择域的一个子集$S$,然后在$S$内均匀随机地进行查询。我们描述了Reed-Muller码的半采样测试器,并给出了其正确性的最优分析。因此,我们证明半采样测试器确实在存在在线擦除的情况下有效,从而在在线擦除模型中实现了Reed-Muller码测试的最优查询复杂度。这一结果改进了Minzer和Zheng(SODA 2024)的工作。作为额外的奖励,我们还证明半采样测试器也适用于Guo、Kopparty和Sudan(ITCS 2013)提出的提升的仿射不变码,从而为这些码在在线擦除模型下提供了已知的首次测试方法。

英文摘要

Motivated by applications to property testing in the online-erasure model of Kalemaj, Raskhodnikova, and Varma (ITCS 2022 and Theory of Computing 2023), we define and analyze {\em semi-sample-based testers} for Reed-Muller codes. The task in Reed-Muller testing is to determine whether an input function $f: \F^n \to \F$ belongs to the Reed-Muller code or is far from it, using as few point queries to $f$ as possible. Reed-Muller testing is a well-studied task with its roots in both the Property Testing and Probabilistically Checkable Proofs literature. The online-erasure model introduces a twist: after each query made, an adversary may erase up to $t$ points of the input function, potentially thwarting any test in which the queries follow a predictable pattern. Semi-sample-based testers are a hybrid between sample-based testers -- which can only make uniformly random queries to the input function -- and standard testers, which can choose their queries freely. They are designed with the online-erasure model in mind and operate by first choosing some subset $S$ of the domain and then making their queries uniformly at random inside of $S$. We describe semi-sample-based testers for the Reed-Muller code and give an optimal analysis of their soundness. Consequently, we show that semi-sample-based testers are indeed effective in the presence of online erasures, and thereby achieve optimal query complexity for testing the Reed-Muller code in the online-erasure model. This result improves upon prior work of Minzer and Zheng (SODA 2024). As an added bonus, we show that semi-sample-based testers also exist for the lifted affine-invariant codes of Guo, Kopparty, and Sudan (ITCS 2013), thereby providing the first known testers for these codes in the online-erasure model.

2605.22811 2026-05-22 cs.DB

GS-QA: A Benchmark for Geospatial Question Answering

GS-QA:一个地理空间问答的基准测试

Majid Saeedan, Muhammad Shihab Rashid, Ahmed Eldawy, Vagelis Hristidis

AI总结 本文提出GS-QA基准测试,用于评估地理空间问答系统,通过结合多个来源的信息,涵盖广泛的空间对象、谓词和答案类型,展示了现有方法在处理复杂空间谓词和多源推理时的局限性。

详情
AI中文摘要

近年来,大型语言模型(LLMs)的进步显著提升了问答(QA)性能。为解决评估QA系统的问题,标准化基准测试已被引入。本研究聚焦于地理空间问答问题,其中大量的地理空间数据以空间数据库或其他形式存在。现有地理空间问答基准测试存在诸多限制,包括问题数量少、空间谓词有限、输出类型狭窄以及缺乏多源推理。我们提出了GS-QA,一个可扩展的地理空间问答基准测试,基于OpenStreetMap和维基百科数据,包含28种模板下的2800个问题-答案对,覆盖广泛的空间对象、谓词(包括方向和朝向过滤)以及答案类型(实体名称、位置、距离、方向、计数和聚合面积/长度)。GS-QA的一个关键特点是某些问题需要结合多个来源的信息,例如OSM的地理空间信息和维基百科的事实信息。GS-QA包含综合评估方法,结合基于文本的QA度量和地理空间特定度量,如距离误差和角度误差。我们实现了九个基于LLM的地理空间QA基线,使用三种LLM(GPT-4o、Claude Sonnet 4.6和Minstral-3)结合直接提示、检索增强生成和文本到SQL。我们的结果表明,现有方法在处理简单的空间谓词和实体名称输出时表现良好,但在涉及复杂空间谓词、数值输出类型和多源推理的问题上准确性显著下降,表明地理空间问答仍是一个具有挑战性的问题,需要进一步研究。

英文摘要

Recent advances in Large Language Models (LLMs) have led to dramatic improvements in question answering (QA). To address the challenge of evaluating QA systems, standardized benchmarks have been introduced. This work focuses on the problem of geospatial QA, where a large collection of geospatial data is available in the form of a spatial database or other forms. Existing work on geospatial QA benchmarks has various limitations, including a small number of questions, limited spatial predicates, narrow output types, and no multi-source reasoning. We present GS-QA, an extensible geospatial QA benchmark with 2,800 question-answer pairs across 28 templates on top of OpenStreetMap and Wikipedia data, covering a wide range of spatial objects, predicates (including directional and towards filtering), and answer types (entity names, locations, distances, directions, counts, and aggregated areas/lengths). A key feature of GS-QA is that some questions require combining information from multiple sources, e.g., geospatial information from OSM and factual information from Wikipedia. GS-QA includes a comprehensive evaluation methodology that combines text-based QA measures with geospatial-specific measures such as distance error and angular error. We implemented nine LLM-based geospatial QA baselines using three LLMs (GPT-4o, Claude Sonnet 4.6, and Ministral-3) with combinations of direct prompting, retrieval-augmented generation, and text-to-SQL. Our results show that existing solutions perform reasonably well on simple spatial predicates with entity name outputs, but accuracy degrades significantly for questions involving complex spatial predicates, numeric output types, and multi-source reasoning, demonstrating that geospatial QA remains a challenging open problem warranting further research.

2605.22810 2026-05-22 cond-mat.mes-hall

Signatures of the Quantum Geometric Dipole of Interlayer Excitons in Counterflow Conductivity

反向电流导电性中层间激子的量子几何偶极子特征

Fanuel I. Mendez, Luis Brey, H. A. Fertig

AI总结 本文研究了双层系统中层间激子的量子几何偶极子结构,通过反向电流电导作为探测手段,揭示了强垂直磁场下磁激子能带中量子几何偶极子的特征,展示了非平衡动量分布与层不对称驱动场的关系。

详情
AI中文摘要

许多体电子系统的集体激发可以携带内部结构,支持新的量子几何和拓扑性质。其中,量子几何偶极子(QGD)对于激子具有直接的内部极化意义。在双层系统中,层间激子的QGD表示平面偶极矩,可以用于驱动平面电场。在本工作中,我们考虑双层系统中由驱动激子关联的反向电流电导作为探测其QGD结构的探针。作为一个简单但非平凡的例子,我们分析了在强垂直磁场下的一个维周期势结构。所得到的磁激子能带中包含区分于均匀系统激子QGD的QGD结构。为了建模激子输运,我们采用玻尔兹曼方法,包括带间隧穿,使我们能够考虑强层不对称驱动场所产生的非平衡动量分布。我们展示了如何通过驱动场的层对称成分的线性响应来获取QGD信息,并且通过改变层不对称场可以探测激子能带的宽QGD结构。我们的结果表明,反向电流电导作为层间激子所携带的内部量子几何结构的可调探测手段,连接了输运与许多体激发的量子几何。

英文摘要

Collective excitations of many-body electron systems can carry internal structure, supporting novel quantum geometric and topological properties. Among these are a quantum geometric dipole (QGD), which for excitons have direct significance as an internal polarization. For interlayer excitons of a bilayer system, this represents an in-plane dipole moment, which can be used to drive them with in-plane electric fields. In this work, we consider counterflow electric currents associated with driven excitons in such a bilayer system as a probe of their QGD structure. As a simple but non-trivial example, we analyze a structure with a one-dimensional periodic potential in a strong perpendicular magnetic field. The resulting magnetoexciton bands host QGD structure that distinguishes it from the exciton QGD of a uniform system. To model exciton transport we adopt a Boltzmann approach that includes inter-band tunneling, allowing us to consider non-equilibrium momentum distributions that result from strong layer-antisymmetric driving fields. We show how linear response to a layer-symmetric component of the driving fields provide information about the QGD, and that the broad QGD structure of the exciton bands can be probed by the varying the layer-antisymmetric field. Our results demonstrate that counterflow conductivity serves as a tunable probe of the internal quantum geometric structure carried by the interlayer excitons, connecting transport to the quantum geometry of many-body excitations.

2605.22807 2026-05-22 quant-ph

How many systems can be dephased before the quantum switch becomes causally definite?

在量子交换成为因果确定之前,可以有多少系统被去相位?

Yassine Benhaj, Kuntal Sengupta, Cyril Branciard

AI总结 该研究探讨了在量子过程的因果顺序不定时,非经典性对因果非分离性的影响,通过分析去相位系统数量来确定因果非分离性的维持条件。

Comments Format: 2-page extended abstract + 10-page technical material

详情
AI中文摘要

具有不定因果顺序的量子过程——所谓的因果非分离过程——在固定或明确因果结构的量子电路中表现出各种优势。一个自然的问题是,过程要显示因果非分离性需要多少非经典性。在这里,我们通过研究在该属性消失之前可以有多少系统被去相位(或退相干)来解决这个问题。首先,对于具有开放过去和未来的双分量过程,我们显示如果所有系统都被去相位,或者只保留未来系统未被去相位,则过程变为因果可分离。然而,如果任何单个系统(除了未来系统)保持未被去相位,则存在过程仍能保持因果非分离性。接下来,我们展示了在多分量情况下,当受限于物理动机的量子电路类(量子控制量子电路QC-QCs)时,类似的行为。即,去相位所有系统或只保留未来系统未被去相位会使任何QC-QC因果可分离;而如果任何非未来系统保持未被去相位,则因果非分离性仍可持续。

英文摘要

Quantum processes with indefinite causal order -- so-called causally nonseparable processes -- can exhibit various advantages over quantum circuits with a fixed or a well-defined causal structure. A natural question is how much nonclassicality is required for a process to display causal nonseparability. Here we address this by investigating how many systems can be dephased (or decohered) before this property vanishes. First, for bipartite processes with open past and future we show that if all systems are dephased, or if only the future system is kept undephased, then the process becomes causally separable. However, if any single system other than the future system remains undephased, then there exist processes that retain causal nonseparability. Next, we demonstrate a similar behaviour in the multipartite case, when restricted to the physically motivated class of quantum circuits with quantum control (QC-QCs). Namely, dephasing all systems or keeping only the future system undephased renders any QC-QC causally separable; while causal nonseparability can persist if any non-future system is left undephased.

2605.22806 2026-05-22 astro-ph.GA astro-ph.CO

From protogalaxy through thick and thin: Why did the Milky Way evolve in three kinematic phases?

从原星系到厚盘和薄盘:为什么银河系经历了三个运动学阶段?

Olti Myrtaj, James S. Bullock, Michael Boylan-Kolchin, Vedant Chandra, Claude-André Faucher-Giguère, Robert Feldmann, Francisco J. Mercado, Jorge Moreno, Jonathan Stern, Andrew Wetzel, Pratik J. Gandhi

AI总结 研究通过模拟揭示银河系的结构演变经历了三个运动学阶段,核心方法是利用FIRE-2模拟,主要贡献是阐明了这三个阶段的物理起源。

Comments 27 pages, 21 figures

详情
AI中文摘要

APOGEE和Gaia数据揭示银河系的结构似乎经历了三个不同的运动学阶段。首先,在早期宇宙时期,银河系是一个无序的原星系,随后“加速”到第二个运动学阶段,该阶段以在旋转的厚星盘中形成恒星为标志。厚盘阶段后来过渡到第三个(也是最终的)阶段,此时恒星形成发生在冷的薄星盘中。本文利用FIRE-2模拟的银河系质量星系,展示了在我们的宇宙放大模拟中,同样出现了这三个阶段,并研究了它们的物理起源。在所有我们的星系中,早期无序阶段发生在冷却气体(温度≤10^4 K)转化为恒星的速率较低时,恒星形成速率是爆发性的,且重子质量相对于中心质量运动在宿主势能中“翻滚”。气体在翻滚阶段结束后开始协同旋转,随后是年轻恒星的自转加速。星系的中心势能在气体自转加速之前最不集中。这个第二阶段的厚盘阶段与冷却气体转化为恒星的速率最高的时期重合,尽管该阶段的恒星形成速率仍然保持爆发性。最终过渡到薄盘阶段发生在内 circumgalactic 媒介virialize时。薄盘阶段与恒星形成稳定且冷却气体转化为恒星的速率处于中间水平的时期相关。合并似乎没有在三个阶段之间的转变中起决定性作用。厚盘形成的条件似乎相当简单:中心质量运动的稳定。薄盘的形成需要更多:气体必须缓慢地积累,以便其角动量在加入星系之前混合并变得一致。

英文摘要

APOGEE and Gaia data have revealed that the Milky Way's structure appears to have evolved through three distinct kinematic phases. First, at early cosmic times, the Milky Way was a disordered protogalaxy, which subsequently "spun up" to a second kinematic phase marked by star formation occurring in a rotating, thick stellar disk. The thick disk phase later transitioned to a third (and final) phase with star formation occurring in a cold, thin stellar disk. In this paper, we use a suite of FIRE-2 simulations of Milky Way-mass galaxies to demonstrate that the same three phases arise in our cosmological zoom-in simulations, and study their physical origin. In all of our galaxies, the early disordered phase occurs when the rate of cool gas ($T \leq 10^4$ K) converting into stars is low, the star formation rate is bursty, and the baryonic mass "sloshes" within the host potential with respect to the center of mass motion. The gas in the galaxy begins to spin coherently after the sloshing phase ends, followed by the spin-up of young stars. The central potential of the galaxy is least concentrated just prior to gas spin-up. This second, thick disk phase coincides with a period when the rate of cool gas converting into stars is highest, even though the star formation rate remains bursty in this phase. The final transition to the thin disk phase occurs when the inner circumgalactic medium virializes. The thin disk phase is associated with a time of steady star formation and intermediate rates of cool gas converting into stars. Mergers do not appear to play a defining role in driving transitions between the three phases. The condition for the formation of a thick disk appears to be fairly minimal: a stable center of mass motion. The formation of a thin disk requires more: gas must accrete slowly enough for its angular momentum to mix and become coherent prior to joining the galaxy.