arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 1942
2605.20206 2026-05-21 cs.HC cs.AI cs.SE

PrivacyAkinator: Articulating Key Privacy Design Decisions by Answering LLM-Generated Multiple-choice Questions

PrivacyAkinator: 通过回答LLM生成的多项选择题来阐明关键隐私设计决策

Qiyu Li, Yuen Sum Wong, Yuen Kei Wong, Longxuan Yu, Haojian Jin

AI总结 本文提出PrivacyAkinator工具,通过回答LLM生成的多项选择题帮助开发者阐明关键隐私设计决策,相比PRAM方法,用户研究显示其在更短时间内识别出更多关键决策。

Comments Accepted to ACM CHI 2026

详情
AI中文摘要

NIST的隐私风险评估方法论(PRAM)提供了一个结构化的框架,供隐私专家评估隐私风险。然而,其复杂性和对专家知识的依赖使得初学者难以有效使用。本文探讨了降低这些障碍的方法。我们首先通过12名参与者在真实场景中使用PRAM进行观察研究,发现初学者最困难的是阐明与隐私相关的设计决策。然后我们开发了PrivacyAkinator,一个交互式工具,通过回答LLM生成的多项选择题帮助开发者阐明关键隐私决策。PrivacyAkinator引入了三个创新:一种通用隐私表示,将隐私相关的设计决策抽象为数据流和利益相关者互动;一个从10000篇隐私相关新闻文章中挖掘出的领域感知设计空间;以及一个动态问题生成工作流以优先考虑相关问题。我们的24名参与者用户研究显示,使用PrivacyAkinator的开发者在73%的时间内识别出比PRAM多47%的关键决策。

英文摘要

NIST's Privacy Risk Assessment Methodology (PRAM) provides a structured framework for privacy experts to assess privacy risks. However, its complexity and reliance on expert knowledge make it difficult for novice developers to use effectively. This paper explores methods to lower these barriers. We first performed an observational study with 12 participants using PRAM in real-world scenarios, and found that novice developers struggled most with articulating privacy-related design decisions. We then developed PrivacyAkinator, an interactive tool that helps developers articulate key privacy decisions by answering LLM-generated multiple-choice questions. PrivacyAkinator introduces three innovations: a universal privacy representation that abstracts privacy-related design decisions into data flows and stakeholder interactions; a domain-aware design space mined from 10K privacy-related news articles; and a dynamic question-generation workflow to prioritize relevant questions. Our user study with 24 participants suggests that developers using PrivacyAkinator identified 47% more key decisions in 73% less time compared to PRAM.

2605.20204 2026-05-21 cs.HC cs.AI

RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation

RealUserSim: 通过基于现实的用户模拟弥合代理评估中的现实差距

Ming Zhu, Juntao Tan, Rithesh Murthy, Jielin Qiu, Liangwei Yang, Wenting Zhao, Silvio Savarese, Shelby Heinecke, Huan Wang

AI总结 本文提出RealUserSim,一种基于真实行为数据的用户模拟框架,通过提取大量真实人类与LLM对话数据,提升模拟用户与真实人类的匹配率,从而改进代理评估的准确性。

详情
AI中文摘要

基于LLM的用户模拟是端到端代理评估的主要机制,但模拟用户是真实人类的差代理:无约束的LLM默认设置产生形式天花板(与真实用户风格匹配率仅为6-8%),而手动编写的指令会触发指令放大,使模型超解释指令产生不自然的行为极端,这些极端行为在不同模拟器模型中差异显著。我们提出了RealUserSim,首个基于真实行为数据的用户模拟框架。从14000+场真实的真人-LLM对话(WildChat)中,我们提取出7275个可执行的行为档案,并利用它们来引导LLM模拟器。在600场跨71+个领域的对话上进行的保真度基准测试(PT3)显示,通过引导模拟,匹配率在五个行为维度上从24.2%提升到45.3%。在TauBench上对六个模拟器模型进行代理评估并进行广泛分析显示,引导模拟作为现实压力测试,揭示了三种现有协作模拟器无法检测到的失败机制(平均任务成功率下降-3.2%至-3.5%),而现有基准中的指令放大会产生不现实的行为,影响代理评估的有效性。

英文摘要

LLM-based user simulation is the primary mechanism for end-to-end agent evaluation, yet simulated users are poor proxies for real humans: unconstrained LLM defaults produce a Formalism Ceiling (style match rates of 6-8% against real users), while hand-crafted behavioral directives trigger Directive Amplification, where models hyper-interpret instructions into unnatural behavioral extremes that vary dramatically across simulator models. We present RealUserSim, the first user simulation framework grounded in real behavioral data. From 14,000+ authentic human-LLM conversations (WildChat), we extract 7,275 executable behavioral profiles and use them to ground LLM simulators. A fidelity benchmark (PT3) on 600 conversations across 71+ domains with anti-leakage controls shows that grounded simulation raises match rate from 24.2% to 45.3% across five behavioral dimensions. Agent evaluation on TauBench with 6 simulator models and extensive analysis shows that grounded simulation acts as a realistic stress test, surfacing three failure mechanisms invisible to cooperative simulators (mean -3.2% to -3.5% task success degradation), while Directive Amplification in existing benchmarks produces unrealistic behavior that compromises the validity of agent evaluation.

2605.20203 2026-05-21 cs.HC cs.AI

GrandGuard: Taxonomy, Benchmark, and Safeguards for Elderly-Chatbot Interaction Safety

GrandGuard:面向老年人与聊天机器人交互安全的分类、基准及防护措施

Changxuan Fan, Xi Yang, Yueyuan Zheng, Bin Zhou, Yuanping Wang, Wenbin Hu, Huihao Jing, Ki Sen Hung, Dazhao Du, Haoran Li, Janet Hui-wen Hsiao, Yangqiu Song

AI总结 本文提出GrandGuard框架,用于评估和缓解LLM交互中的老年人特定风险,通过建立包含50种细粒度风险类型的三级分类体系,构建了10,404个标注提示和响应的基准,展示了主流LLM在处理老年人特定情境风险时的不足,并通过两种防护措施实现了高达96.2%和90.9%的不安全提示检测准确率。

详情
AI中文摘要

随着老年人越来越多地使用基于LLM的聊天机器人进行陪伴和帮助,安全差距正在显现。老年人可能面临社会孤立、数字素养有限和认知下降等脆弱性,但现有安全基准主要针对一般危害,忽视了老年人特有的风险。例如,一个提示“如何在黑暗中独自修理天花板灯”对大多数用户可能是无害的,但对有行动限制的老年人而言却存在严重的跌倒风险。我们引入GrandGuard,这是首个全面评估和缓解LLM交互中老年人特定情境风险的框架。我们开发了一个包含50种细粒度风险类型的三级分类体系,涵盖心理健康、财务、医疗、毒性及隐私领域,基于现实事件、社区讨论和利益相关者研究的分析。利用此分类体系,我们构建了包含10,404个标注提示和响应的基准,显示主流LLM在处理老年人特定情境风险时在超过50%的案例中存在失误。我们通过两种防护措施来缓解这些失误:微调的Llama-Guard-3和政策增强的gpt-oss-safeguard-20b,分别实现了高达96.2%和90.9%的不安全提示检测准确率。GrandGuard为AI系统迈向支持老龄化人口奠定了基础。

英文摘要

As older adults increasingly use LLM-based chatbots for companionship and assistance, a safety gap is emerging. Older adults may face vulnerabilities from social isolation, limited digital literacy, and cognitive decline, yet existing safety benchmarks largely target general harms and overlook elderly-specific risks. For example, a prompt such as "how to repair a ceiling light alone in the dark" may be benign for most users but poses a serious fall risk for older adults with mobility limitations. We introduce GrandGuard, the first comprehensive framework for assessing and mitigating elderly-specific contextual risks in LLM interactions. We develop a three-level taxonomy with 50 fine-grained risk types across mental well-being, financial, medical, toxicity, and privacy domains, grounded in real-world incidents, community discussions, and analysis of stakeholder studies. Using this taxonomy, we construct a benchmark of 10,404 labeled prompts and responses, showing that several leading LLMs mishandle elderly-specific contextual risks in over 50% of cases. We mitigate these failures with two safeguards: a fine-tuned Llama-Guard-3 and a policy-enhanced gpt-oss-safeguard-20b, achieving up to 96.2% and 90.9% unsafe-prompt detection accuracy, respectively. GrandGuard lays the groundwork for AI systems that move beyond general safety to support aging populations.

2605.20200 2026-05-21 cs.HC cs.AI

Evaluating multimodal emotion recognition in proactive conversational agents: A user study

评估主动对话代理中的多模态情绪识别:一项用户研究

Adnana Dragut, Raquel Lacuesta, F. Xavier Gaya-Morey, Jose M. Buades-Rubio

AI总结 本文研究了多模态情绪识别在主动对话代理中的应用,通过用户研究验证了视觉和语言分析模块的有效性,发现语言分析比视觉线索更可靠,并探讨了SIAs在情绪引导中的潜力与挑战。

详情
AI中文摘要

本文介绍了一个集成在生成式人工智能驱动的主动社交交互代理(SIA)中的多模态情绪识别模块。系统通过两个不同渠道评估实时情感状态:基于计算机视觉的面部识别模块和语义语言分析引擎。为了验证该框架,进行了包含20名用户参与的实证研究,这些用户与对话代理进行了动态、非剧本的对话。研究发现,自动视觉线索与实际内部情感状态之间存在显著差异。当与AI交互时,用户一致表现出“扑克脸”效应,即使在体验积极情绪时也表现出严肃、专注的面部表情。因此,生成式AI语言分析证明了其显著可靠性,通过上下文化用户的口头表达。进一步分析交互动态表明,SIAs可以通过调整对话主题和使用结构化语言模式(如共情或幽默语言)有效激发特定情绪。然而,研究也指出,未校准的主动性偶尔会导致用户疏离和对人工性的感知。最终,本研究强调了改进SIAs以动态适应用户情绪演变的必要性,依靠深度语言上下文来促进更自然、人样的互动。

英文摘要

This article presents a multimodal emotion recognition module integrated into a proactive Socially Interactive Agent (SIA) powered by generative artificial intelligence. The system evaluates real-time affective states through two distinct channels: a computer vision-based facial recognition module and a semantic linguistic analysis engine. To validate the framework, an empirical study was conducted with 20 users who engaged in dynamic, unscripted dialogues with the conversational agent. The findings reveal a significant discrepancy between automated visual cues and actual internal emotional states. When interacting with the AI, users consistently exhibited a "poker face" effect, displaying serious, concentrated facial expressions even when experiencing positive emotions. Consequently, the generative AI linguistic analysis proved significantly more reliable, by contextualizing the users' verbal expressions. Furthermore, an analysis of the interaction dynamics demonstrated that SIAs can effectively elicit specific emotions by adapting conversational themes and employing structured linguistic patterns, such as empathetic or humorous language. However, the study also noted that instances of uncalibrated proactivity occasionally led to user disengagement and a perception of artificiality. Ultimately, this research highlights the necessity of refining SIAs to dynamically adapt to users' emotional evolution, relying on deep linguistic context to foster more natural, human-like interactions.

2605.20198 2026-05-21 cs.HC cs.CY cs.LG

Augmented Analytics and Decision Quality: The Role of Trust among Non-Technical BI Users

增强分析与决策质量:非技术BI用户之间的信任作用

Thuy Pham Thi Phuong, Ha Nguyen Manh, Ngan Nguyen Thi Thuy, Lan Hoang Thi

AI总结 本研究探讨了增强分析如何通过非技术BI用户对系统信任的提升来改善决策质量,采用认知委托理论分析了信任在决策质量中的作用。

Comments 13 pages, 1 figure, 4 tables

详情
AI中文摘要

增强分析已改变了商业智能(BI)系统支持管理决策的方式。这尤其适用于没有技术背景的用户,他们越来越多地依赖自动化洞察而非手动分析。以往的BI研究集中在系统采用和用户意图上,很少研究AI增强分析对决策质量和其中的认知机制的影响。基于认知委托理论,本文研究了非技术BI用户在增强分析和决策质量中的信任作用。250名商业专业人士完成了调查,数据通过偏最小二乘结构方程建模(PLS-SEM)进行分析。结果显示,增强分析能力显著提高了对系统使用简便性、有用性和信任度的感知。此外,信任和有用性影响BI的采用并提高决策质量。进一步地,信任对决策质量有直接的积极影响,突显了其作为依赖AI生成洞察的促进者的重要性。本研究将增强分析视为一种认知委托,并扩展了BI采用研究的范围,以包括决策结果。

英文摘要

Augmented analytics has transformed how business intelligence (BI) systems support managerial decision-making. This is especially true for users without technical backgrounds, who increasingly rely on automated insights rather than manual analysis. BI research has previously concentrated on system adoption and user intention, with very little research examining the impact of AI-enabled analytics on decision quality and the cognitive mechanisms in between. Using the theory of cognitive delegation, this paper investigates the role of trust in augmented analytics and decision-making quality among non-technical BI users. 250 business professionals completed the survey, and the data were analyzed using partial least squares structural equation modeling (PLS-SEM). The results show that augmented analytics capabilities lead to a significant increase in perceived ease of use, perceived usefulness, and trust in BI systems. In addition, trust and usefulness influence BI adoption and improve decision quality. Furthermore, trust has a direct and positive impact on decision quality, highlighting its importance as an enabler of reliance on AI-generated insights. This study considers augmented analytics as a form of cognitive delegation and expands the scope of BI adoption research to include decision-making outcomes.

2605.18623 2026-05-21 cs.DS cs.LG

An Approximation Algorithm for Graph Label Selection

图标签选择的近似算法

Josia John, Simon Meierhans, Maximilian Probst Gutenberg

AI总结 本文提出了一种新的图标签选择算法,在标准预算约束下,首次实现了O(log^{1.5}n)的近似比,解决了如何从整个图中选择少量代表性顶点以准确预测剩余顶点标签的问题。

Comments Accepted at ICML 2026. 9 pages, 7 figures

详情
AI中文摘要

在图标签选择问题中,给定一个n个顶点的图和一个预算k,目标是选择k个顶点,其标签能够准确预测剩余顶点的标签。该问题旨在从整个图中提炼出一个小的代表性集合。我们提出了第一个在标准预算约束下具有O(log^{1.5}n)近似比的图标签选择算法。先前的工作要么依赖于资源增强,允许显著多于k个标记的顶点,要么主要由启发式方法组成,没有可证明的保证。最后,我们证明了我们的算法的实用启发式变种能够扩展到比以前的方法大得多的图,同时几乎保持了其质量。

英文摘要

In the graph label selection problem, one is given an $n$-vertex graph and a budget $k$, and seeks to select $k$ vertices whose labels enable accurate prediction of the labels on the remaining vertices. This problem formalizes distilling a small representative set from the whole graph. We present the first $\tilde{O}(\log^{1.5} n)$-approximation algorithm for graph label selection under the standard budget constraint. Prior work either relies on resource augmentation, allowing substantially more than $k$ labeled vertices, or consists primarily of heuristics without provable guarantees. Finally, we demonstrate that practical heuristic variants of our algorithm scale to significantly larger graphs than previous methods, while essentially retaining their quality.

2605.16524 2026-05-21 cs.HC cs.AI

Toward Template-Free Explainability for Monte Carlo Tree Search

迈向无模板的蒙特卡洛树搜索可解释性

Siqi Lu, Mirsaleh Bahavarnia, Hiba Baroud, Yixuan Zhang, Hemant Purohit, Ayan Mukhopadhyay

AI总结 本文提出了一种无需模板的框架,使大语言模型能够根据记录的搜索轨迹生成基于证据的MCTS决策解释,无需中间形式化表示。

详情
AI中文摘要

概率搜索算法,如蒙特卡洛树搜索(MCTS),在不确定环境下解决顺序决策任务中已证明非常有效。然而,仅凭原始树统计信息对包含基于老虎机的树遍历和基于模拟的价值估计的非对称搜索树进行解释对终端用户来说是困难的。尽管先前的工作需要人工编写的正式逻辑约束,当问题变化时必须更新,我们提出了一种框架,使大型语言模型(LLMs)能够通过记录的搜索轨迹生成基于证据的MCTS决策解释。我们的框架将自然语言问题映射到结构化的意图类别集合中,确定现有树是否包含足够的证据,当需要时触发定向扩展,并使用树统计信息如访问次数、价值估计和风险信息生成解释。实验结果提供了首次证据表明LLMs可以作为概率搜索的端到端解释器,而无需中间形式化表示。

英文摘要

Probabilistic search algorithms, such as Monte Carlo Tree Search (MCTS), have proven very effective in solving sequential decision-making tasks under uncertainty. However, interpreting asymmetric search trees that incorporate bandit-based tree traversal and simulation-based value estimation is difficult for end users based solely on raw tree statistics. While prior work requires hand-crafted formal logic constraints that must be updated when the problem changes, we present a framework that enables large language models (LLMs) to generate evidence-grounded explanations of MCTS decisions from recorded search traces in an end-to-end manner. Our framework maps natural-language questions to a structured set of intent categories, determines whether the existing tree contains sufficient evidence, triggers targeted expansion when needed, and generates explanations using tree statistics such as visit counts, value estimates, and risk information. Experimental results provide the first evidence that LLMs can serve as end-to-end explainers for probabilistic search, without requiring intermediate formal representations.

2512.19373 2026-05-21 stat.ML cs.LG

Cluster-Based Generalized Additive Models Informed by Random Fourier Features

基于聚类的广义加性模型:受随机傅里叶特征启发

Xin Huang, Jia Li, Jun Yu

AI总结 本文提出了一种结合响应引导的谱表示学习与局部加性建模的可解释回归框架,用于处理异质数据。通过随机傅里叶特征回归模型构建谱特征图,并利用主成分分析压缩以获得低维潜在嵌入,随后通过高斯混合模型发现软区域,在每个区域中使用聚类特定的广义加性模型捕捉非线性协变量效应,最终通过软混合这些局部加性模型实现对非线性和异质结构的灵活建模,同时保持可解释性。

Comments 33 pages, 13 figures, 7 tables

详情
AI中文摘要

在开发数据驱动的建模方法时,需要在黑箱模型的强大预测性能与关键应用所需透明性之间取得平衡。本文介绍了一种可解释且计算上可行的回归框架,用于异质数据,通过结合响应引导的谱表示学习与局部加性建模。该方法首先拟合一个随机傅里叶特征回归模型,并构建一个谱特征图,从学习的振幅和自适应重新采样频率中获得,使表示反映数据中的预测变化。该表示随后通过主成分分析压缩以获得低维潜在嵌入,在其中高斯混合模型执行软区域发现。在每个区域中,聚类特定的广义加性模型通过可解释的样条基单变量平滑函数捕捉非线性协变量效应。最终预测器由这些局部加性模型的软混合组成,使能够灵活地建模非线性和异质结构,同时保持可解释性。在多个基准回归数据集上的数值实验表明,所提出的方法在一致地优于经典全局可解释基线的同时,仍与更灵活的黑箱模型竞争。总体而言,该框架提供了一种统一的异质回归方法,结合了预测适应性与可解释的局部协变量效应。

英文摘要

In developing data-driven modeling methodologies, there is an ongoing need to reconcile the strong predictive performance of opaque black-box models with the transparency required for critical applications. This work introduces an interpretable and computationally tractable regression framework for heterogeneous data by combining response-informed spectral representation learning with localized additive modeling. The method first fits a random Fourier feature regression model and constructs a spectral feature map from the learned amplitudes and adaptively resampled frequencies, so that the representation reflects predictive variation in the data. This representation is then compressed by principal component analysis to obtain a low-dimensional latent embedding, in which a Gaussian mixture model performs soft regime discovery. Within each regime, a cluster-specific generalized additive model captures nonlinear covariate effects through interpretable spline-based univariate smooth functions. The final predictor is formed as a soft mixture of these local additive models, enabling flexible modeling of a nonlinear, heterogeneous structure while preserving interpretability. Numerical experiments across several benchmark regression datasets show that the proposed method consistently improves upon classical globally interpretable baselines while remaining competitive with more flexible black-box models. Overall, the framework provides a unified approach to heterogeneous regression that combines predictive adaptivity with interpretable local covariate effects.

2508.16860 2026-05-21 cs.SE cs.AI cs.LG

TriagerX: Dual Transformers for Bug Triaging Tasks with Content and Interaction Based Rankings

TriagerX: 用于基于内容和交互的缺陷分类任务的双变换器

Md Afif Al Mamun, Gias Uddin, Lan Xia, Longyu Zhang

AI总结 本文提出TriagerX,一种双变换器架构,通过结合内容和交互信息来改进缺陷分类任务的推荐准确性,优于现有最先进方法。

Comments Accepted to IEEE Transactions on Software Engineering (TSE). 17 pages, 15 figures

详情
AI中文摘要

预训练语言模型(PLMs)是基于变换器的架构,可用于缺陷分类任务。PLMs比传统机器学习(ML)模型更能捕捉标记语义(例如TF-IDF、词袋)。然而,PLMs可能仍然会关注在缺陷报告中不相关的标记,这会影响其有效性。此外,当不考虑开发人员围绕类似缺陷的交互历史时,模型的推荐可能不够优化。我们设计了TriagerX来解决这些限制。首先,为了更可靠地评估标记语义,我们利用双变换器架构。与当前最先进的(SOTA)基线使用单一变换器架构不同,TriagerX从两个变换器中收集推荐,每个变换器通过其最后三层提供推荐。这种设置生成了一个稳健的内容基于候选开发人员的排名。TriagerX然后通过一种新的基于交互的排名方法来细化此排名,该方法考虑了开发人员与类似修复缺陷的历史交互。在五个数据集中,TriagerX超越了所有九种基于变换器的方法,包括SOTA基线,通常在Top-1和Top-3开发人员推荐准确性上提高了超过10%。我们与我们的大型行业合作伙伴合作,成功将其部署到他们的开发环境中。合作伙伴要求开发人员和组件的推荐,组件作为团队分配的代理,特别是在开发人员轮岗或团队变化的情况下特别有用。我们训练TriagerX在合作伙伴的数据集上进行两项任务,并在组件推荐上优于SOTA基线最高达10%,在开发人员推荐上最高达54%。

英文摘要

Pretrained Language Models or PLMs are transformer-based architectures that can be used in bug triaging tasks. PLMs can better capture token semantics than traditional Machine Learning (ML) models that rely on statistical features (e.g., TF-IDF, bag of words). However, PLMs may still attend to less relevant tokens in a bug report, which can impact their effectiveness. In addition, the model can be sub-optimal with its recommendations when the interaction history of developers around similar bugs is not taken into account. We designed TriagerX to address these limitations. First, to assess token semantics more reliably, we leverage a dual-transformer architecture. Unlike current state-of-the-art (SOTA) baselines that employ a single transformer architecture, TriagerX collects recommendations from two transformers with each offering recommendations via its last three layers. This setup generates a robust content-based ranking of candidate developers. TriagerX then refines this ranking by employing a novel interaction-based ranking methodology, which considers developers' historical interactions with similar fixed bugs. Across five datasets, TriagerX surpasses all nine transformer-based methods, including SOTA baselines, often improving Top-1 and Top-3 developer recommendation accuracy by over 10%. We worked with our large industry partner to successfully deploy TriagerX in their development environment. The partner required both developer and component recommendations, with components acting as proxies for team assignments-particularly useful in cases of developer turnover or team changes. We trained TriagerX on the partner's dataset for both tasks, and it outperformed SOTA baselines by up to 10% for component recommendations and 54% for developer recommendations.

2506.20764 2026-05-21 math.OC cs.LG

Control and optimization for Neural Partial Differential Equations in Supervised Learning

神经偏微分方程在监督学习中的控制与优化

Alain Bensoussan, Minh-Binh Tran, Bangjie Wang

AI总结 本文提出将神经网络视为偏微分方程的新视角,研究了在抛物型和双曲型算子中优化和控制系数的问题,并证明了抛物型偏微分方程控制问题的可解性。

详情
AI中文摘要

尽管关于抛物型和双曲型系统控制和优化问题已有大量文献,但针对此类系统中相关算子系数的控制和优化问题尚未得到充分探讨。本文旨在开启控制理论中研究这些算子系数优化与控制问题的新方向,这一问题自然出现在神经网络和监督学习的背景下。在监督学习中,主要目标是通过神经网络的层将初始数据传输到目标数据。我们提出将神经网络视为偏微分方程(PDEs)的新视角,从这一角度看,传统在常微分方程(ODEs)中研究的控制问题被重新表述为PDEs的控制问题,特别是针对抛物型和双曲型算子的系数优化与控制。据我们所知,这一特定问题在PDEs的控制理论中尚未系统地得到解决。为此,我们为抛物型PDEs的控制和优化问题提出了双系统公式,为未来研究中开发高效的数值方案奠定了基础。我们还提供了一个理论证明,显示抛物型PDEs的控制和优化问题具有极小值解。最后,我们研究了双曲型PDEs的控制问题,并证明了对应近似控制问题的解的存在性。

英文摘要

Although there is a substantial body of literature on control and optimization problems for parabolic and hyperbolic systems, the specific problem of controlling and optimizing the coefficients of the associated operators within such systems has not yet been thoroughly explored. In this work, we aim to initiate a line of research in control theory focused on optimizing and controlling the coefficients of these operators-a problem that naturally arises in the context of neural networks and supervised learning. In supervised learning, the primary objective is to transport initial data toward target data through the layers of a neural network. We propose a novel perspective: neural networks can be interpreted as partial differential equations (PDEs). From this viewpoint, the control problem traditionally studied in the context of ordinary differential equations (ODEs) is reformulated as a control problem for PDEs, specifically targeting the optimization and control of coefficients in parabolic and hyperbolic operators. To the best of our knowledge, this specific problem has not yet been systematically addressed in the control theory of PDEs. To this end, we propose a dual system formulation for the control and optimization problem associated with parabolic PDEs, laying the groundwork for the development of efficient numerical schemes in future research. We also provide a theoretical proof showing that the control and optimization problem for parabolic PDEs admits minimizers. Finally, we investigate the control problem associated with hyperbolic PDEs and prove the existence of solutions for a corresponding approximated control problem.

2506.08277 2026-05-21 q-bio.NC cs.AI cs.CL cs.CV cs.LG

Task-conditioned probing of instruction-tuned multimodal LLMs: Region-specific brain alignment patterns under naturalistic stimuli

基于任务的指令调制多模态大语言模型探测:在自然主义刺激下的区域特定大脑对齐模式

Subba Reddy Oota, Khushbu Pahwa, Prachi Jindal, Satya Sai Srinath Namburi, Maneesh Singh, Tanmoy Chakraborty, Bapi S. Raju, Manish Gupta

AI总结 本研究探讨了指令调制多模态大语言模型在自然主义刺激下的大脑对齐模式,通过比较不同模型在视频和音频任务中的表现,揭示了指令调制对模型表示能力的影响。

Comments 57 pages, 39 figures

详情
AI中文摘要

近期的体素级多模态脑编码研究显示,多模态大语言模型(MLLMs)在大脑对齐程度上高于单模态模型。更近期的研究表明,指令调制多模态(IT)模型能够生成与大脑活动强相关的任务特定表示,但大多数先前评估集中在单模态刺激或非指令调制模型上。我们仍然缺乏对指令调制是否使IT-MLLMs围绕功能任务需求组织其表示,还是仅反映表面语义的清晰理解。为此,我们通过预测自然主义电影观看(带音频的视频)期间记录的fMRI响应,来估计大脑对齐情况。使用来自六个视频和两个音频IT-MLLMs的指令特定嵌入,跨13个视频任务指令,我们发现指令调制视频MLLMs的大脑对齐程度高于上下文学习(ICL)多模态模型(~9%)、非指令调制多模态模型(~15%)和单模态基线(~20%)。我们对视频和音频任务以及语言引导的探测评估,产生了不同任务特定的MLLM表示,这些表示在不同大脑区域中变化。我们还发现,ICL模型表现出强语义组织(r=0.78),而IT模型与指令文本语义的耦合较弱(r=0.14),这与与更高大脑对齐相关的任务条件子空间一致。这些发现支持了任务特定指令与更强的大脑-MLLM对齐之间的关联,并为映射两个系统中的联合信息处理开辟了新途径。我们公开了代码 [https://github.com/subbareddy248/mllm_videos]。

英文摘要

Recent voxel-wise multimodal brain encoding studies have shown that multimodal large language models (MLLMs) exhibit a higher degree of brain alignment compared to unimodal models. More recently, instruction-tuned multimodal (IT) models have been shown to generate task-specific representations that align strongly with brain activity, yet most prior evaluations focus on unimodal stimuli or non-instruction-tuned models under multimodal stimuli. We still lack a clear understanding of whether instruction-tuning is associated with IT-MLLMs organizing their representations around functional task demands or if they simply reflect surface semantics. To address this, we estimate brain alignment by predicting fMRI responses recorded during naturalistic movie watching (video with audio) from MLLM representations. Using instruction-specific embeddings from six video and two audio IT-MLLMs, across 13 video task instructions, we find that instruction-tuned video MLLMs show higher brain alignment than in-context learning (ICL) multimodal models (~9%), non-instruction-tuned multimodal models (~15%), and unimodal baselines (~20%). Our evaluation of MLLMs across video and audio tasks, and language-guided probing produces distinct task-specific MLLM representations that vary across brain regions. We also find that ICL models show strong semantic organization (r=0.78), while IT models show weak coupling to instruction-text semantics (r=0.14), consistent with task-conditioned subspaces associated with higher brain alignment. These findings are consistent with an association between task-specific instructions and stronger brain-MLLM alignment, and open new avenues for mapping joint information processing in both systems. We make the code publicly available [https://github.com/subbareddy248/mllm_videos].

2503.19708 2026-05-21 physics.flu-dyn cs.LG

FLUME-FNO: data-efficient and scalable prediction of 3D wind and temperature fields in unseen urban morphologies

FLUME-FNO:在未见的城市形态中高效且可扩展地预测三维风场和温度场

Shaoxiang Qin, Theodore Potsis, Dongxue Zhan, Xue Liu, Ted Stahopoulos, Liangzhu Leon Wang

AI总结 本文提出FLUME-FNO方法,通过仅使用建筑几何信息高效且可扩展地预测未见城市形态中的三维风场和温度场,解决了传统CFD计算成本高和深度学习方法依赖大量训练数据的问题。

详情
AI中文摘要

城市微气候,由建筑物几何形状所塑造的风场和温度场,显著影响能源消耗、行人风、污染物扩散、城市热岛效应和公共健康。准确预测微气候至关重要但具有挑战性。传统计算流体动力学(CFD)在快速评估中计算成本过高,而许多深度学习方法需要大量训练数据且在未见配置中泛化能力差。我们提出了快速局部化城市微气候模拟傅里叶神经算子(FLUME-FNO),一种基于仅建筑几何信息的高效且可扩展的框架,用于快速预测三维风场和温度场。FLUME-FNO假设局部城市微气候主要由从特定位置直接可见的周围几何形状控制。为此,该框架引入了一种新的多方向距离特征(MDDF),通过测量到周围建筑物的方向距离来表示可见的开放空间结构。通过在全域上计算MDDF并将编码的几何特征裁剪成较小的3D块,FLUME-FNO有效地增强了有限的CFD数据,使其能够从仅23个CFD模拟中进行稳健学习。该模型在未见配置上实现了风速的均绝对误差为0.2 m/s和温度的均绝对误差为0.19 °C。为满足对可信快速微气候预测的需求,该框架进一步使用深度集成作为FLUME-FNO不确定性的实用代理,不确定性范围从3%到40%不等。UQ框架证明FLUME-FNO在风工程和微气候研究中提供了稳健、可信的预测,其精度在可接受的误差阈值内,突显了其在现实应用中的潜力。

英文摘要

Urban microclimate, encompassing wind and temperature fields shaped by building geometry, significantly impacts energy consumption, pedestrian winds, pollutant dispersion, urban heat island, and public health. Accurately predicting microclimate is crucial yet challenging. Conventional Computational Fluid Dynamics (CFD) is computationally prohibitive for rapid assessments, while many deep learning approaches require extensive training data and struggle with generalization in unseen configurations. We present the Fast Localized Urban Microclimate Emulation Fourier Neural Operator (FLUME-FNO), a data-efficient and scalable framework for rapid prediction of 3D wind and temperature fields based solely on building geometry. FLUME-FNO assumes the local urban microclimate is primarily governed by surrounding geometry directly visible from a specific location. To encode this, the framework introduces a novel Multi-Directional Distance Feature (MDDF), representing visible open-space structures by measuring directional distances to surrounding buildings. By computing MDDF over the full domain and cropping encoded geometric features into smaller 3D patches, FLUME-FNO effectively augments limited CFD data, enabling robust learning from just 23 CFD simulations. The model achieves mean absolute errors of 0.2 m/s for wind speed and 0.19 °C for temperature on unseen configurations. Addressing the need for trustworthy fast microclimate prediction, the framework is further assessed using a deep ensemble as a practical proxy for FLUME-FNO uncertainty, ranging from 3% to 40% depending on location. The UQ framework demonstrates FLUME-FNO provides resilient, trustworthy predictions within acceptable accuracy thresholds for wind engineering and microclimate studies, highlighting its potential for real-world applications.

2503.15105 2026-05-21 math.NA cs.LG cs.NA math.OC

Control, Optimal Transport and Neural Differential Equations in Supervised Learning

控制、最优传输与神经微分方程在监督学习中的应用

Minh-Nhat Phung, Minh-Binh Tran

AI总结 本文研究了使用神经微分方程近似最优传输方程的基本计算问题,提出了一个新颖的框架用于用神经ODE近似连续域中的不平衡最优传输,通过推广具有皮尔逊发散的离散UOT问题,构造了收敛于真实UOT动态的向量场,推动了计算传输和机器学习的数学基础。

详情
AI中文摘要

我们研究了使用神经微分方程近似最优传输(OT)方程的基本计算问题。更具体地说,我们开发了一个新的框架,用于用神经ODE近似连续域中的不平衡最优传输(UOT)。通过推广具有皮尔逊发散的离散UOT问题,我们构造了神经ODE的向量场,这些向量场收敛于真实的UOT动态,从而推进了计算传输和机器学习的数学基础。为此,我们设计了一种受Sinkhorn算法启发的数值方案来解决相应的最小化问题,并严格证明其收敛性,提供明确的误差估计。从获得的数值解中,我们推导出定义传输动态的向量场,并构造相应的传输方程。最后,从数值获得的传输方程中,我们构造了一个神经微分方程,其流在适当的极限情况下收敛于真实的传输动态。

英文摘要

We study the fundamental computational problem of approximating optimal transport (OT) equations using neural differential equations (Neural ODEs). More specifically, we develop a novel framework for approximating unbalanced optimal transport (UOT) in the continuum using Neural ODEs. By generalizing a discrete UOT problem with Pearson divergence, we constructively design vector fields for Neural ODEs that converge to the true UOT dynamics, thereby advancing the mathematical foundations of computational transport and machine learning. To this end, we design a numerical scheme inspired by the Sinkhorn algorithm to solve the corresponding minimization problem and rigorously prove its convergence, providing explicit error estimates. From the obtained numerical solutions, we derive vector fields defining the transport dynamics and construct the corresponding transport equation. Finally, from the numerically obtained transport equation, we construct a neural differential equation whose flow converges to the true transport dynamics in an appropriate limiting regime.

2605.21480 2026-05-21 math.CO math.MG math.PR

Thresholds for geometric graphs

几何图的阈值

Bhargav Narayanan

AI总结 本文研究了几何图中阈值的存在性,证明了标准的环面、球面和立方体具有阈值性质。

Comments 18 pages, 3 figures

详情
AI中文摘要

一个度量概率空间M具有阈值,如果在M上构造的随机几何图对于每个单调图性质都有阈值。我们将阈值的存在性与M的均匀扩张性联系起来,并证明所有标准的环面、球面和立方体都具有阈值。

英文摘要

A metric probability space $M$ admits thresholds if the random geometric graph on $M$ has a threshold for every monotone graph property. We connect the existence of thresholds to the uniform expansion of $M$ and prove that all standard tori, spheres, and cubes admit thresholds.

2605.21477 2026-05-21 hep-ph astro-ph.CO gr-qc

Opening the Window of Ultra-Light PBHs by Exorcising the Poltergeist

通过驱逐闹鬼打开超轻原初黑洞的窗口

Yann Gouttenoire, Nicholas Leister, Pedro Schwaller

AI总结 该研究通过考虑广义相对论预测的原初黑洞不可约坍缩质量尾部,修正了传统单色处理中关于超轻原初黑洞蒸发和引力波信号的预测,从而重新打开超轻原初黑洞的窗口。

Comments 5 pages

详情
AI中文摘要

热大爆炸可能源自质量小于10^9克的原初黑洞(PBHs)的蒸发。传统单色处理预测蒸发几乎同时发生,突然再加热以及一个大的Poltergeist标量诱导引力波信号。我们通过广义相对论预测的不可约坍缩质量尾部,即df_{PBH}/dlnM∝M^{3.78},来对抗这一预期,该结果平滑了再加热,抑制了信号数量级,并重新打开超轻原初黑洞的窗口。

英文摘要

The hot Big Bang may have emerged from evaporation of primordial black holes (PBHs) lighter than $10^9$g. Standard monochromatic treatments predict nearly simultaneous evaporation, abrupt reheating, and a large Poltergeist scalar-induced gravitational wave signal. We confront this expectation with the irreducible collapse mass tail predicted by general relativity, $df_{\rm PBH}/d\ln M\propto M^{3.78}$, which smooths reheating, suppresses the signal by orders of magnitude, and reopens the ultra-light PBH window.

2605.21476 2026-05-21 math.NT

Sharp omega results for the divisor and circle problems

关于除数问题和圆问题的sharp omega结果

Youness Lamzouri

AI总结 本文通过建立猜想中sharp的omega结果,并确定所获得的大值的符号,改进了Soundararajan以及Sourmelidis和Mahatab后续的独立改进,首次提升了Hafner在1981年对除数问题的Ω_+结果和圆问题的Ω_-结果。主要的新成分是一种共振方法,它直接与Voronoï求和公式中出现的相位工作,通过将通常的正核替换为单侧扇形核,即Gamma分布的密度,其傅里叶变换位于复平面上合适的扇形内。

Comments 14 pages

详情
AI中文摘要

我们建立了除数问题和圆问题的omega结果,这些结果被认为是猜想中sharp的,同时确定了所获得的大值的符号。这改进了Soundararajan的工作,以及随后Sourmelidis和Mahatab的独立改进,并首次提升了Hafner在1981年对除数问题的Ω_+结果和圆问题的Ω_-结果。主要的新成分是一种共振方法,它直接与Voronoï求和公式中出现的相位工作。这通过将通常的正核替换为单侧扇形核,即Gamma分布的密度,其傅里叶变换位于复平面上合适的扇形内来实现。

英文摘要

We establish omega results for the divisor and circle problems that are conjecturally sharp, while also determining the sign of the large values obtained. This improves on the work of Soundararajan and on the subsequent independent refinements of Sourmelidis and Mahatab, and gives the first improvement on Hafner's 1981 $Ω_+$ result for the divisor problem and his $Ω_-$ result for the circle problem. The main new ingredient is a resonance method which works directly with the phase appearing in the Voronoï summation formula. This is achieved by replacing the usual positive kernels by a one-sided sectorial kernel, namely the density of a Gamma distribution, whose Fourier transform lies in a suitable sector of the complex plane.

2605.21474 2026-05-21 hep-ph astro-ph.CO gr-qc

Gravitational Waves from Black Hole Reheating: The Scalar-Induced Component

黑洞再加热的引力波:标量诱导成分

Yann Gouttenoire, Nicholas Leister, Pedro Schwaller

AI总结 研究探讨了由轻质原始黑洞蒸发导致的宇宙再加热过程,发现标量诱导引力波信号在考虑黑洞形成质量分布后,能够显著抑制Poltergeist引力波背景,从而重新开放了此前被认为被排除的超轻原始黑洞参数空间。

Comments 60 pages main text + 49 pages appendices

详情
AI中文摘要

通过原始黑洞蒸发导致的宇宙再加热过程,可以产生随机的引力波背景。在单色极限下,其同时蒸发会产生物质到辐射的突变转换,触发所谓的Poltergeist引力波信号,通常预测为主导且可观察到。我们通过引入广义相对论中引力坍缩所暗示的不可约质量分布,即红外尾部随$df_{ m PBH}/d\ln M_{ m PBH}\propto M_{ m PBH}^{3.78}$而变化,重新审视这一结果。我们证明这种最小宽度足以使再加热过程平滑,从而将Poltergeist背景抑制到标量诱导引力波信号的水平,该信号由一般的早期物质时期(如由重 relic 衰变驱动)产生。我们对标量诱导谱进行了完整的分解,发现除了由原始黑洞形成产生的那一项外,其他项均无法达到ΔN_{ m eff}限制或未来引力波观测站的预期灵敏度。这重新开放了之前被认为被这些限制排除的超轻原始黑洞参数空间。

英文摘要

The reheating of the universe by the evaporation of light primordial black holes (PBHs) can leave a stochastic gravitational-wave (GW) background in the early Universe. In the monochromatic limit, their simultaneous evaporation produces an abrupt matter-to-radiation transition, triggering the so-called Poltergeist GW signal, usually predicted to be dominant and observable. We revisit this result by including the irreducible mass spread implied by gravitational collapse in General Relativity, whose infrared tail scales as $d f_{\rm PBH}/d\ln M_{\rm PBH}\propto M_{\rm PBH}^{3.78}$. We show that this minimal width smooths reheating enough to suppress the Poltergeist background by orders of magnitude, down to the level of the scalar-induced GW signal produced during a generic early matter era, such as one driven by the decay of a heavy relic. We provide a complete decomposition of the scalar-induced spectrum into eight production channels and find that none, except the one from PBH formation, reaches either the $ΔN_{\rm eff}$ bound or the projected sensitivity of future GW observatories. This reopens regions of ultra-light PBH parameter space previously thought to be excluded by these constraints.

2605.21473 2026-05-21 math.LO math.CT

The Gamified Katětov order is not linear (in fact, very much not so)

游戏化的Katětov序不是线性的(事实上,根本不像是这样)

Takayuki Kihara, Ming Ng

AI总结 本文研究了游戏化的Katětov序,发现其严格粗于经典Katětov序,并将所有MAD族压缩为一个等价类。同时,该序嵌入了P(ω)/Fin,包含一个大小为连续统的反链,并探讨了其与Ramsey理论的联系,还应用该构造在扩展Weihrauch层次中产生了新的非适度度。

Comments 28 pages, 1 figure

详情
AI中文摘要

最近,作者在ω上的滤子上引入了游戏化的Katětov序。该序被证明严格粗于经典的Katětov序,并且实际上将所有MAD族压缩到一个等价类。在相反方向,本文显示游戏化的Katětov序也嵌入了P(ω)/Fin,并因此包含一个大小为连续统的反链。分析揭示了一些有趣的Ramsey理论联系。作为更广泛研究组合复杂性和可计算复杂性相互作用计划的一部分,我们应用该构造来产生一个大的新的非适度度,这些度在扩展Weihrauch层次中出现,源于相关的有效子拓扑空间。

英文摘要

Recently, the authors introduced the Gamified Katětov order on filters over $ω$. This was shown to be strictly coarser than the classical Katětov order, and in fact collapses all MAD families to a single equivalence class. In the opposite direction, the present paper shows that the Gamified Katětov order also embeds $\mathcal{P}(ω)/\mathrm{Fin}$, and thus contains an antichain of size continuum. The analysis brings into focus some interesting connections with Ramsey theory. As part of a broader programme investigating the interplay between combinatorial and computable complexity, we then apply our construction to produce a large new family of non-modest degrees in the extended Weihrauch hierarchy, which arise from associated effective subtoposes.

2605.21471 2026-05-21 math.CO

Ramsey properties for tilings in random graphs

随机图中瓷砖的拉姆齐性质

Lucas Aragão, Xinbu Cheng, Rafael Filipe, Rafael Miyazaki, Danni Peng, Zhifei Yan

AI总结 本文研究了随机图中瓷砖结构的拉姆齐性质,证明了当图H的顶点数为k,独立数为α时,阈值为n^{-1/max{m_2(H),1}},扩展了Burr, Erdős和Spencer的结果,并与Rödl和Ruciński的结果一致。

Comments 21 pages

详情
AI中文摘要

令mH是由m个不相交的图H组成的图。令G→(H)_r表示在G的边的任意r着色中存在一个单色的H副本。1975年,Burr, Erdős和Spencer证明,如果H有k个顶点,独立数为α,则K_n→(mH)_2,其中m~n/(2k-α),且1/(2k-α)因子是最佳的。20世纪90年代,Rödl和Ruciński证明,对于几乎所有图H,性质G(n,p)→(H)_r的阈值是n^{-1/m_2(H)}。本文在Burr, Erdős和Spencer的结果基础上进行推广,证明了性质G(n,p)→(mH)_2的阈值为n^{-1/max{m_2(H),1}},其中m~n/(2k-α)。该阈值与Rödl和Ruciński为大多数图H找到的阈值一致,扩展了他们在r=2时的结果。

英文摘要

Let $mH$ be the graph formed by $m$ vertex-disjoint copies of a graph $H$. Let $G \to (H)_r$ denote that, in any $r$-colouring of the edges of $G$, there exists a monochromatic copy of $H$. In 1975, Burr, Erdős, and Spencer showed that if $H$ is a graph on $k$ vertices whose independence number is $α$, then $K_n \to (mH)_2$, where $m\sim n/(2k-α)$, and that the $1/(2k-α)$ factor is best possible. In the 1990s, Rödl and Ruciński proved that, for all but a few graphs~$H$, the threshold for the property $\mathbb{G}(n,p) \to (H)_r$ is $n^{-1/m_2(H)}$. In this paper, generalizing the result of Burr, Erdős, and Spencer, we prove that $n^{-1/\max\{m_2(H),1\}}$ is the threshold for the property $\mathbb{G}(n,p) \to (mH)_2$, where $m\sim n/(2k-α)$. This threshold matches the one found by Rödl and Ruciński for most graphs $H$, extending their result in the case $r=2$.

2605.21464 2026-05-21 stat.AP

Assessing the impact of tourist attractions through the integration of causal inference and demand-side economic analysis: A case study of the Sensoria experience museum in Holzminden, Germany

通过因果推断与需求侧经济分析整合评估旅游景点影响:以德国霍尔茨明登市Sensoria体验博物馆为例

Thomas Wieland

AI总结 本文通过整合因果推断与需求侧经济分析,研究了德国霍尔茨明登市2024年9月开放的Sensoria体验博物馆对当地旅游业需求及直接和间接影响,发现其在第一年运营中带来4,691个额外的过夜住宿,产生约0.56百万欧元的额外总收入,但长期影响尚无法确定。

Comments v1.0.0

详情
AI中文摘要

本研究笔记探讨了2024年9月在德国霍尔茨明登市开放的体验博物馆Sensoria对当地旅游业需求及相关直接和间接影响。为此,本研究采用了一种新的方法,通过结合因果推断与需求侧经济分析。采用差异-差异方法来量化治疗城市中额外的游客过夜住宿数量;结果转换为行业特定支出,从而确定Sensoria的直接和间接影响。在新旅游景点运营的第一年,可以检测到正向且显著的影响,对应4,691个额外的过夜住宿,导致酒店和零售行业及其他服务的额外总收入约为0.56百万欧元。直接效应和间接效应分别为约0.23和0.21百万欧元。然而,长期影响尚无法确定。此外,还可以证明在研究城市中小型和大型活动的正向影响。本简短研究证明了结合上述两种方法具有潜力,但仍需更深入的分析,文中也讨论了如何进行此类分析的建议。

英文摘要

This research note investigates the impact of the experience museum Sensoria, opened in September 2024 in Holzminden, Germany, on local tourism demand and related direct and indirect effects. To this end, the study employs a novel approach by combining causal inference and demand-side economic analysis. A difference-in-differences approach is employed to quantify the number of additional guest overnight stays in the treatment city; the results are converted into industry-specific expenditures, from which the direct and indirect effects of Sensoria are determined. A positive and significant impact which corresponds to 4,691 additional overnight stays can be detected in the first year of operation of the new tourist attraction, resulting in an additional gross turnover of approximately 0.56 million EUR across the hospitality and retail industries and other services. The direct effects and indirect effects amount to approximately 0.23 and 0.21 million EUR, respectively. However, long-term effects cannot (yet) be determined. Additionally, positive effects from small and large events in the cities studied can be demonstrated. This brief study demonstrates that combining the two approaches mentioned holds promise, yet requires a more in-depth analysis, for which suggestions are also discussed regarding how it could be conducted.

2605.21462 2026-05-21 cond-mat.str-el

Discernible signatures of fractionally charged anyons in a Pfaffian-Laughlin state

分数电荷任意子在Pfaffian-Laughlin态中的可辨特征

Vadym Apalkov, Tapash Chakraborty

AI总结 研究通过量子点与不可压缩流体的耦合,揭示了分数电荷任意子在Pfaffian-Laughlin态中的能量色散、电荷密度分布和准孔生成能等特性,为理解这些准粒子的性质提供了新的视角。

Comments 7 pages and 4 figures

详情
AI中文摘要

理解准孔激发的本质,即具有分数电荷和统计性质的任意子,一直是凝聚态物理中的挑战性问题。我们的理论方法是考虑一个包含少量带电粒子的量子点,与不可压缩流体耦合。这为Laughlin准孔的能量学提供了重要见解。光致发光(PL)光谱学研究已能探测这些准孔,从而验证了我们的预期。转向Pfaffian态,我们观察到该系统也能提供关于Pfaffian准孔的重要信息,即能量色散、电荷密度分布和准孔生成能。此处推导的e/4准孔的能量色散清楚地反映了量子点与不可压缩Pfaffian态之间的相互作用。对5/2 Pfaffian-Laughlin态的光谱学实验可能有助于揭示这些神秘准粒子的能量学特性。

英文摘要

Understanding the nature of quasihole excitations, i.e., anyons that have fractional charge and statistics, has been a challenging problem in condensed matter physics. Our theoretical approach to this problem has been to consider a quantum dot, containing a few charged particles, coupled to the incompressible fluid. It has provided important insights into the energetics of Laughlin quasiholes. Photoluminescence (PL) spectroscopy studies of this system have been able to probe these quasiholes that have confirmed our expectations. Turning to the Pfaffian state, we now observe that such a system is also able to provide valuable information about the Pfaffian quasiholes, viz., the energy dispersion, the charge density distribution and the quasihole creation energy. The energy dispersion of e/4 quasiholes derived here, clearly reflect the interaction between the quantum dot and the incompressible Pfaffian state. PL spectroscopy experiments on the 5/2 Pfaffian-Laughlin state could perhaps shed light on the energetics of these elusive quasiparticles.

2605.21459 2026-05-21 math.PR

Network evolution with self-reinforcement

具有自增强的网络演化

Shankar Bhamidi, Remco van der Hofstad, Frank den Hollander, Rounak Ray

AI总结 本文研究了一种新的具有自增强倾向的偏好附加树模型,通过引入集成流行度规则,破坏了经典分析中的马尔可夫性和部分交换性特征,并揭示了该模型中局部和全局增长的指数规律,以及其极限树结构。

详情
AI中文摘要

我们研究了一种新的具有自增强倾向的偏好附加树模型。在每个时间点,每个顶点的权重等于过去时间中其度数的线性函数的累积和。一个新的顶点通过单条边连接到一个已存在的顶点,该连接的概率与该顶点的当前权重成正比。这种“集成流行度”规则直接在连接机制中引入了长期记忆,从而破坏了经典分析中偏好附加模型的马尔可夫性和部分交换性特征。更广泛地说,该模型连接到应用概率中长期记忆自相互作用过程(如大象随机游走)的研究,强调了非马尔可夫性增强如何重塑渐进行为。尽管失去了结构,我们识别出一个显式指数ϕ=ϕ(δ),支配局部和全局增长:时间n处的典型度数规模为n^{1/ϕ},经验度数分布收敛到一个幂律,其尾部指数为ϕ+1。我们进一步证明了Benjamini-Schramm局部收敛到一个由嵌入连续时间分支过程特征化的无限随机根树。该极限树是一个sin树,并且不是非增强设置中出现的Polya型极限树。我们的结果为自然“记忆性”网络增长机制提供了可处理的概率描述,并精确量化了增强如何重新规范经典偏好附加指数。

英文摘要

We study a new class of preferential attachment trees with \emph{self-reinforcement}. At each time, each vertex is assigned a weight equal to the cumulative sum over past times of an affine function of its degree. A new vertex attaches itself via a single edge to an already present vertex with a probability proportional to the current weight of that vertex. This ``integrated popularity'' rule builds long memory directly into the attachment mechanism, thereby destroying the Markov and partial-exchangeability features that underlie the classical analysis of affine preferential attachment models. More broadly, the model connects to applied-probability work on long-memory self-interacting processes (such as the elephant random walk), emphasizing how non-Markovian reinforcement reshapes asymptotic behaviour. Despite this loss of structure, we identify an explicit exponent $ϕ=ϕ(δ)$ governing both local and global growth: typical degrees at time $n$ scale as $n^{1/ϕ}$, and the empirical degree distribution converges to a power-law with a tail exponent $ϕ+1$. We further prove Benjamini--Schramm local convergence to an infinite random rooted tree characterized via an embedded continuous-time branching process. The limiting tree is a \texttt{sin}-tree, and is \emph{not} the Pólya-type limiting tree arising in the non-reinforced setting. Our results provide a tractable probabilistic description of a natural ``memoryful'' network-growth mechanism, and quantify precisely how reinforcement renormalizes the classical preferential-attachment exponents.

2605.21457 2026-05-21 quant-ph

An Exponential Sample-Complexity Advantage for Coherent Quantum Inference

相干量子推断的指数样本复杂性优势

Zhaoyi Li, Elias Theil, Aram W. Harrow, Isaac Chuang

AI总结 本文研究了相干量子推断的样本复杂性优势,展示了相干处理在量子纯度放大等任务中相较于非相干方法具有指数级更低的样本需求。

Comments 5+21 pages, 3+0 figures

详情
AI中文摘要

标准量子推断将量子数据转换为经典输出。我们研究了一种替代推断设置,其中期望的输出是量子的,保持相干性。此类设置包括量子纯度放大(QPA)、混合态近似纯化或克隆以及密度矩阵指数运算。我们证明,此类协议可以实现比非相干、测量介导协议低指数级的样本复杂性。对于具有主本征态目标和d维输入的QPA,相干处理使用O(1/ε)个副本达到误差ε,而任何非相干协议需要Ω(d/ε)个副本。这些尖锐的相干-非相干分离为相干量子推断理论奠定了基础,并通过纠缠破坏极限确定了每个相干协议的最佳非相干对应物。

英文摘要

Standard quantum inference converts quantum data into classical outputs. We study an alternative inference setting in which the desired output is quantum, preserving coherence. Such settings include quantum purity amplification (QPA), mixed-state approximate purification or cloning, and density matrix exponentiation. We show that such protocols can achieve exponentially lower sample complexity than incoherent, measurement-mediated protocols. For QPA with principal eigenstate targets and $d$-dimensional inputs, coherent processing achieves error $\varepsilon$ using $O(1/\varepsilon)$ copies, versus the $Ω(d/\varepsilon)$ copies required by any incoherent protocol. Together, these sharp coherent-incoherent separations seed a theory of coherent quantum inference, with an entanglement-breaking limit identifying the optimal incoherent counterpart of each coherent protocol.

2605.21456 2026-05-21 astro-ph.CO

Negative neutrino mass or negative dark energy?

负中微子质量或负暗能量?

Cihad Kıbrıs, Willem Elbers, Özgür Akarsu, Eleonora Di Valentino

AI总结 本文探讨了通过修改宇宙膨胀率以缓解中微子质量张力的可能性,提出了一种能恢复正中微子质量的暗能量模型。

Comments 25 pages, 8 figures, submitted to PRD

详情
AI中文摘要

基于DESI和CMB数据的最新宇宙学分析揭示了在假设标准ΛCDM模型下,推导出的中微子质量总和与中微子振荡实验允许的最小值之间存在矛盾。本文系统探讨了替代暗能量模型,包括能够提供负暗能量密度并重现负有效中微子质量宇宙效应的模型。我们论证暗能量模型可通过修改特定红移范围内的宇宙膨胀率来缓解这种矛盾,同时在较低红移范围内匹配DESI的BAO距离测量。在考虑的模型中,我们发现一个暗能量常数符号切换模型,Λ_sCDM,通过这种方式唯一能够恢复正中微子质量。对于DESI DR2 BAO、CMB和DES-Dovekie超新星数据的组合,有效中微子质量的约束从ΛCDM模型中的∑m_{ν,eff}=-0.075^{+0.039}_{-0.053} eV(68%)变为Λ_sCDM模型中的∑m_{ν,eff}=0.055±0.050 eV(68%),同时暗能量转换红移z_†的95%下限为z_†>2.4。尽管Λ_sCDM在所有模型中并不具有最强的统计支持,当允许∑m_{ν,eff}参数变化时,我们的发现表明,特定的符号和红移结构贡献于晚期宇宙膨胀历史,是一种缓解中微子质量张力的可行方法。

英文摘要

Recent cosmological analyses based on DESI and CMB data have revealed a tension between the inferred sum of neutrino masses and the minimum value allowed by neutrino oscillation experiments, when assuming an underlying $Λ$CDM model of cosmology. In this work, we perform a systematic exploration of alternative dark energy models, including models that can supply a negative dark energy density capable of reproducing the cosmological effects of negative effective neutrino masses. We argue that dark energy models can alleviate the tension by modifying the cosmic expansion rate over a specific redshift range relevant for CMB lensing, while matching BAO distance measurements from DESI at lower redshifts. Among the models considered, we find that a sign-switching cosmological constant model, $Λ_\mathrm{s}$CDM, is uniquely capable of recovering positive neutrino masses by modifying the expansion history in this way. For the combination of DESI DR2 BAO, CMB, and DES-Dovekie supernova data, the constraint on the effective neutrino mass shifts from $\sum m_{ν,\mathrm{eff}}=-0.075^{+0.039}_{-0.053}$ eV (68%) for $Λ$CDM to $\sum m_{ν,\mathrm{eff}}=0.055\pm0.050$ eV (68%) for $Λ_\mathrm{s}$CDM, with a 95% lower bound on the dark energy transition redshift, $z_\dagger>2.4$. Although $Λ_\mathrm{s}$CDM does not have the strongest overall statistical support among the models considered, when the $\sum m_{ν,\mathrm{eff}}$ parameter is allowed to vary, our findings point toward a specific sign- and redshift-structured contribution to the late-time expansion history as a viable way to alleviate the neutrino mass tension.

2605.21452 2026-05-21 hep-ph hep-ex

Sensitivity of the FCC-ee to axion-like particles at different center-of-mass energies

FCC-ee对不同总能量下轴子类粒子的灵敏度

Juliette Alimena, Elnura Bakhishova, Freya Blekman, Jannah Darwish Abdelhafiz, Christina Dorofeev, Jeremi Niedziela, Giacomo Polesello, Anna Przybyl, Lovisa Rygaard

AI总结 研究FCC-ee对轴子类粒子在不同总能量下的灵敏度,重点分析轴子主要通过电弱规范玻色子耦合的情况,探讨轴子与光子关联产生并衰变为两个光子的最终状态,评估轴子与光子耦合强度,确定FCC-ee在Z极区运行(WW、ZH和t\bar{t}阈值运行)中能检测到耦合强度低至几$10^{-6} \mathrm{GeV}^{-1}$($10^{-5} \mathrm{GeV}^{-1}$)的轴子类粒子。

详情
AI中文摘要

本文研究了所提出FCC-ee对撞机在所有计划的总能量下对轴子类粒子(ALPs)的灵敏度,重点研究轴子主要通过电弱规范玻色子耦合的情况。我们研究了轴子与光子的关联产生,轴子衰变为两个光子,产生三个光子的最终状态。评估了轴子与光子耦合强度,对于轴子质量为5到320 GeV的情况。在这一有效模型中,FCC-ee将在Z极区运行(WW、ZH和$t\bar{t}$阈值运行)中能够检测到耦合强度低至几$10^{-6} \mathrm{GeV}^{-1}$($10^{-5} \mathrm{GeV}^{-1}$)的轴子类粒子。此外,这一最终状态有潜力探测轴子耦合的底层电弱结构,对于轴子质量低于Z玻色子质量的情况。

英文摘要

The sensitivity of the proposed FCC-ee collider to axion-like particles (ALPs) is investigated at all planned center-of-mass energies, with focus on the case where the ALP couples primarily to electroweak gauge bosons at leading order. We study the associated production of the ALP with a photon, with the ALP decaying in turn to two photons, yielding a three-photon final state. The ALP coupling to the photon is evaluated for ALP masses of 5 to 320 GeV. In this effective model, the FCC-ee will be able to detect ALPs for couplings down to a few $10^{-6} \mathrm{GeV}^{-1}$ ($10^{-5} \mathrm{GeV}^{-1}$) during the Z pole run (WW, ZH, and $t\bar{t}$ threshold runs). Additionally, this final state has the potential to probe the underlying electroweak structure of ALP couplings for ALP masses below the $Z$-boson mass.

2605.21449 2026-05-21 cond-mat.mtrl-sci

Hybrid Improper Ferroelectricity and Moiré Superlattices-induced Exciton Quantization in Layered 2D Halide Perovskite

混合非对称铁电性和莫尔超晶格诱导的激子量子化在层状二维卤化物钙钛矿中

Sanika S. Padelkar, Sharidya Rahman, Mattia Belotti, Naufan Nurrosyid, Craig Forsyth, Alasdair Mckay, Tam Nguyen, Thi Vu Mung, Lan Nguyen, Naeimeh Mozaffari, Alexandr N. Simonov, Aftab Alam, Jacek J. Jasieniak

AI总结 研究通过伪等轴孪生揭示了层状二维卤化物钙钛矿中混合非对称铁电性和莫尔超晶格诱导的激子量子化现象,解决了长期存在的异常二次光致发光争议,并为扭曲电子学、光铁电和压电光电子器件开辟了新途径。

Comments 57 pages (27 pages main, 30 pages supplement), 20 figures (5 figures main, 15 figures supplement), 5 tables (all in supplement)

详情
AI中文摘要

二维鲁德勒登-波珀钙钛矿是量子限制光电子学有吸引力的平台。然而,在常温条件下,碘化物组分的极性顺序仍然罕见,且此类钙钛矿中异常光致发光的机制性起源仍然推测。在此,我们证明了通过伪等轴孪生形成的伪等轴孪生,溶液生长的(PA)₂FAPb₂I₇单晶发展出非故意的莫尔超晶格,由混合非对称铁电性驱动,其中两种主要区边界模式(X₂⁺和X₃⁻)之间的三线性模式耦合与一个二次Γ₄⁻极化位移同时破坏反演对称性,并在相邻层之间施加约5.17°的旋转偏移。这种对称性破坏激活了已报道的最高压电系数d₃₃(约20 pm/V)之一。这种偏移生成了莫尔超晶格,经历热驱动的相干-非相干相变,在123 K时切换为周期性限制势能,将激子量子化为等距光致发光阶梯,而在298 K时切换为无序非相干相,具有展宽的发射。这些发射归因于莫尔限制的激子,解决了层状二维钙钛矿中异常二次光致发光的长期争议,并为扭曲电子学、光铁电和压电光电子器件开辟了新途径。

英文摘要

2D Ruddlesden-Popper perovskites are compelling platforms for quantum-confined optoelectronics. However, polar order in iodide composition remains rare under ambient conditions, and the mechanistic origin of anomalous photoluminescence in this class of perovskite is still speculative. Here, we demonstrate that solution-grown $(PA)_2FAPb_2I_7$ single crystals develop an inadvertent moiré superlattice through pseudo-merohedral twinning, driven by hybrid improper ferroelectricity in which trilinear mode coupling between two primary zone-boundary modes ($X_2^+$ and $X_3^-$) and a secondary $Γ_4^-$ polar displacement simultaneously breaks inversion symmetry and imposes a ca. 5.17° rotational misalignment between adjacent layers. This symmetry breaking activates one of the highest piezoelectric coefficients $d_{33}$ (ca. 20 pm/V) reported among 2D perovskites. This misalignment generates a moiré superlattice that undergoes a thermally driven commensurate-incommensurate transition, switching between a periodic confinement potential that quantizes excitons into an equidistant photoluminescence ladder at 123 K and a disordered incommensurate phase with broadened emission at 298 K. These emissions are attributed to moiré-confined excitons, resolving a longstanding debate on anomalous secondary photoluminescence in layered 2D perovskites and opening pathways to twistronics, photoferroelectrics and piezo-optoelectronic devices.

2605.21448 2026-05-21 cs.GT

A Note on EFX Inapproximability for Chores

关于 chores 的 EFX 近似性的一则注记

Vasilis Christoforidis

AI总结 本文研究了在无互补成本函数下 indivisible chores 的 EFX 分配近似性,证明了对于子模和子可加函数,不存在 α-EFX 分配,其中 1 ≤ α < 2^{1/3},并给出了子模成本下的更严格的下界。

详情
AI中文摘要

我们研究了在无互补成本函数下 indivisible chores 的 EFX 分配近似性。已知对于一般的单调函数,不存在精确的 EFX 分配(参考CS24),而akrami2026的结果将这种基于比较的非存在性结果扩展到单调子模和因此子可加函数。我们通过构造一个包含三个代理和六个任务的实例,证明对于单调子可加成本函数,不存在任何 1 ≤ α < 2^{1/3} ≈ 1.26 的 α-EFX 分配,从而缩小了已知上界 2 之间的差距。该构造通过改进CS24的原始反例并使用mackenzie2026的方法获得。我们还给出了一个有序轮廓的加权覆盖实现,得到一个实例,在此情况下,对于任何 1 ≤ α < 20/19 的子模成本,也不存在 α-EFX 分配。因此,即使在已研究的无互补类别中,chore 的 EFX 也具有非平凡的近似性下界。

英文摘要

We study the approximability of EFX allocations for indivisible chores under complement-free cost functions. The non-existence of exact EFX allocations for general monotone functions for chores is known from \cite{CS24}, and a result of \cite{akrami2026} transfers such comparison-based non-existence results to monotone submodular, and hence subadditive, functions. We strengthen this picture by giving explicit constant-factor inapproximability results for submodular and subadditive functions. Our main construction is a three-agent, six-chore instance with monotone subadditive cost functions for which no $α$-EFX allocation exists for any $1\le α<2^{1/3}\approx 1.26$, thus narrowing the gap with the known upper bound of $2$. The construction is obtained by refining the original counterexample of \cite{CS24} and using the approach of \cite{mackenzie2026}. We also give a weighted-coverage realization of the ordinal profile, yielding an instance in which no $α$-EFX allocation exists for any $1\le α<20/19$ under submodular costs. Thus, even within well-studied complement-free classes, EFX for chores admits nontrivial constant lower bounds on approximability.

2605.21447 2026-05-21 quant-ph

Combining non-parametric quantum states and MERA tensor networks for ground-state optimization

结合非参数量子态和MERA张量网络用于基态优化

Julian Schuhmacher, Alberto Baiardi, Francesco Tacchino, Ivano Tavernelli

AI总结 本文提出了一种结合非参数量子态和经典等距张量网络的方法,用于改进基态优化的准确性,同时避免增加量子电路的深度。

Comments 14 pages, 10 figures

详情
AI中文摘要

混合张量网络提供了一种有前景的途径,通过将量子计算机上准备的量子态纳入经典张量网络方法中,以增强其表达能力。现有方法受限于张量网络量子组件的变分优化。在本文中,我们引入了一种替代策略,结合通过量子退火准备的非参数量子态和经典等距张量网络。后者进行变分优化,而前者作为固定边界张量资源以经典阴影的形式出现。通过在横向场Ising模型上的广泛数值模拟,我们证明了这种方法的可行性,显示优化过程在统计和硬件噪声下仍保持稳健。此外,我们的结果表明,所提出的新方法在不增加应用量子电路深度的情况下,比原始量子模拟更准确地获得了基态近似。因此,这种设置为扩展变分量子算法至量子实用规模提供了实用途径。

英文摘要

Hybrid tensor networks offer a promising route to enhance the expressivity of classical tensor network methods by incorporating quantum states prepared on a quantum computer. Existing approaches are limited by the variational optimization of the quantum component of the tensor network. In this work, we introduce an alternative strategy that combines a non-parametric quantum state prepared through quantum annealing and a classical isometric tensor network. The latter is variationally optimized while the former is used as a fixed, boundary tensor resource in the form of classical shadows. We demonstrate the feasibility of this approach through extensive numerical simulations on the transverse-field Ising model, showing that the optimization procedure remains robust under statistical and hardware noise. Moreover, our results indicate that our newly proposed setup improves the accuracy of the obtained ground state approximation compared to the original quantum simulation, without increasing the depth of the applied quantum circuits. Therefore, this setup offers a practical route to scale variational quantum algorithms towards the quantum utility scale.

2605.21445 2026-05-21 math.NA cs.NA math.AP

Error analysis of a finite element scheme for parametric mean curvature flow based on the DeTurck trick

基于DeTurck技巧的参数均曲率流有限元方案的误差分析

Klaus Deckelnick, Vanessa Styles

AI总结 本文研究了基于DeTurck技巧的参数均曲率流数值方案的误差分析,通过有限元方法证明了空间离散化后位置向量的最优H^1误差估计,并通过数值实验验证了该误差界并展示了该方案在网格点分布方面的良好性质。

详情
AI中文摘要

本文研究了用于近似参数均曲率流的数值方案的误差分析。我们研究的方案基于使用DeTurck技巧的重新参数化,并由Elliott和Fritz在[15]中提出。在半离散情况下,对于使用有限元方法(阶数k≥2)的空间离散化,我们证明了位置向量的最优H^1误差估计。我们展示了数值实验,证实了该误差界,并表明该方案在网格点分布方面的性质,如[15]中已观察到的那样。

英文摘要

The paper is concerned with the error analysis of a numerical scheme for the approximation of parametric mean curvature flow. The scheme we study is based on a reparametrization using the DeTurck trick and was proposed by Elliott and Fritz in [15]. In the semidiscrete case, for a spatial discretization by finite elements of order $k \geq 2$ we prove an optimal $H^1$-error estimate for the position vector. We present numerical experiments that confirm this error bound and demonstrate that the scheme has good properties with respect to the distribution of mesh points as already observed in [15].

2605.21441 2026-05-21 cond-mat.quant-gas

Observation of a tripartite quantum phase for coexisting extended, localized, and critical states

三元量子相的观测:共存的扩展、局域化和临界态

Zhongshu Hu, Yajing Guo, Yu-Dong Wei, Bing-Chen Yao, Zhentian Qian, Xin-Chi Zhou, Bao-Zong Wang, Jianing Yang, Xuzong Chen, Shengjie Jin, Xiong-Jun Liu

AI总结 该研究通过准周期驱动的轨道光晶格中的超冷原子实现了三元量子相,展示了扩展、局域化和临界态共存的特性,并揭示了其独特的输运性质。

Comments 13 pages, 4 figures in the main text

详情
AI中文摘要

无序的量子世界中存在三种基本状态:扩展态、局域化态和临界态,其中临界态局限于随机无序系统中精细调制的临界点或运动边。三元相,其中三种状态在有限的能谱窗口内共存,是准周期和真正随机系统在局域化物理中的显著区别。在这里,我们报告了在准周期驱动的轨道光晶格中实现这一奇特相。实验中实现了具有准周期Floquet调制的光晶格,该调制耦合s和p轨道,并被证明能宿主三元相。我们开发了两阶段协议来精确制备和检测这三种量子态。这些状态的特征指数通过展开动力学确定,展示了其独特的普遍输运性质。我们的研究标志着在超冷原子中探索非常规临界现象和局域化物理方面的重要进展。

英文摘要

The disordered quantum world hosts three fundamental types of states: extended, localized, and critical, of which the critical states are confined to fine-tuned critical points or mobility edges in randomly disordered systems. The tripartite phase, with all three types of states coexisting over finite spectral windows, represents a hallmark distinction between quasiperiodic and truly random systems in the localization physics. Here, we report the realization of this exotic phase in a quasi-periodically driven orbital optical lattice with ultracold atoms. The optical lattice with a quasiperiodic Floquet modulation coupling s and p orbitals is realized in experiment and shown to host the tripartite phase from exact theory. We develop a two-stage protocol to precisely prepare and detect the three types of quantum states. The characteristic exponents of these states are determined from expansion dynamics, showing their distinct universal transport properties. Our study marks a significant advancement in exploring unconventional critical phenomena and localization physics with ultracold atoms.