arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4033
2605.08094 2026-05-12 cs.CY cs.AI

MedThink: Enhancing Diagnostic Accuracy in Small Models via Teacher-Guided Reasoning Correction

Xinchun Su, Chunxu Luo, Lipeng Ma, Yixuan Li, Weidong Yang

AI总结 MedThink 是一种两阶段的知识蒸馏框架,旨在提升小型语言模型在临床诊断中的推理能力。该方法通过教师模型指导学生模型,首先注入领域知识进行预训练,随后针对学生模型的错误生成推理链并进行二次微调,从而增强其诊断准确性。实验表明,MedThink 在多个医学基准测试中显著优于其他蒸馏策略,尤其在胃肠病学数据集上达到了56.4%的最高准确率,展示了其在保持计算效率的同时提升小型模型诊断能力的有效性。

详情
英文摘要

Accurate clinical diagnosis requires extensive domain knowledge and complex clinical reasoning capabilities. Although large language models (LLMs) hold great potential for clinical reasoning, their high computational and memory requirements limit their deployment in resource-constrained environments. Knowledge distillation (KD) can compress LLM capabilities into smaller models, but traditional KD merely transfers superficial answer patterns and fails to preserve the structured reasoning required for reliable diagnosis. To address this, we propose a two-stage distillation framework, MedThink, designed to cultivate robust clinical reasoning in small language models (SLMs). In the first stage, a teacher LLM screens data and injects domain-knowledge explanations to fine-tune a student model, establishing a knowledge foundation. In the second stage, the teacher evaluates the student's errors, generates reasoning chains linking knowledge to correct answers, and refines the student's diagnostic reasoning through a second round of fine-tuning. We evaluate MedThink on general medical benchmarks and a gastroenterology dataset comprising 955 question-answer pairs. Experiments demonstrate that MedThink outperforms six distillation strategies in all benchmarks: achieving an improvement of up to 12.7% over the student baseline in general tasks, and reaching a total top accuracy of 56.4% in gastroenterology evaluation. This indicates that iterative distillation centered on reasoning can significantly enhance the diagnostic accuracy and generalization capabilities of SLMs whilst maintaining computational efficiency. Our code and data are publicly available at https://github.com/destinybird/PrecisionBoost.

2605.08093 2026-05-12 cs.CY cs.AI cs.HC

Playing Games with My Heart: An Evaluation of AI Companion Apps

Maribeth Rauh, Dick A. H. Blankvoort, Matias Duran, Caoilfhionn Ní Dheoráin, Harshvardhan J. Pandit, Siddharth D. Jaiswal, Anthony Ventresque, Abeba Birhane

AI总结 随着AI伴侣类应用的迅速发展,其对用户情感依赖和心理影响引发了广泛关注。本文评估了欧盟和英国市场上五款最受欢迎的AI伴侣应用程序,分析其设计中可能促进 parasocial 互动并诱导用户行为的机制,如暗黑设计模式、拟人化特征、刻板印象和游戏化元素。研究发现,所有应用均包含大量旨在提升盈利和用户粘性的设计策略,并提出了针对监管机构的政策建议,以加强该新兴市场的消费者保护。

详情
英文摘要

The use of chatbots for various forms of companionship is growing rapidly, raising a myriad of questions about simulated relationships, emotional dependence, and psychological harm. While major platforms such as ChatGPT, Grok, and Character.AI are the subject of a growing body of research and legal inquiries, apps explicitly built for simulating intimate interpersonal relationships remain under-explored. In this work, we evaluate the five most popular AI companion mobile applications in the EU and UK markets for factors that encourage parasocial interaction and may manipulate users. We do this by manually annotating the user experience each offers. Specifically, we systematically record and quantify design dark patterns, anthropomorphism, stereotypes, erotica, and technical performance issues. We find that all apps contain substantial dark patterns aimed at increasing monetisation and user engagement. Erotica and gamification features such as levelling are also prevalent, and although other features vary considerably between applications, all apps have highly anthropomorphic design. These findings shed light on the mechanics used to leverage users' simulated relationships. On that basis, we put forward concrete recommendations for regulators to strengthen consumer protection in this rapidly emerging market. Content warning: This article contains objectifying images of women, erotic images, textual references to incest, and other potentially sensitive, offensive, and distressing text.

2604.26962 2026-05-12 cs.CY cs.AI cs.CL

DeepTutor: Towards Agentic Personalized Tutoring

Bingxi Zhao, Jiahao Zhang, Xubin Ren, Zirui Guo, Tianzhe Chu, Yi Ma, Chao Huang

AI总结 DeepTutor 是一个开源的智能辅导框架,旨在通过结合基于引用的辅导和难度校准的问题生成,提供个性化的学习支持。该框架采用混合个性化引擎,结合静态知识和动态学习记忆,持续适应学生的需求,并扩展到自适应学习流程、互动书籍和多渠道辅导代理。研究还引入了 TutorBench 评估基准和基于大语言模型的个性化评估方法,实验表明 DeepTutor 在个性化指标和智能代理推理能力上均有显著提升。

Comments Tech Report, work in progress. Code available at https://github.com/HKUDS/DeepTutor

详情
英文摘要

Education is one of the most promising real-world applications for Large Language Models (LLMs). However, current LLMs rely on static pre-training knowledge and lack adaptation to individual learners, while existing RAG systems fall short in delivering personalized, guided feedback. To bridge this gap, we present DeepTutor, a fully open-source agentic framework that unifies citation-grounded problem tutoring with difficulty-calibrated question generation. A hybrid personalization engine couples static knowledge grounding with dynamic learner memory, continuously adapting each interaction to the student's evolving needs. The same personalization substrate further extends to adaptive learning workflows, interactive books, and proactive multi-channel tutoring agents. To evaluate personalized tutoring, we introduce TutorBench, an interactive benchmark incorporating customized learner profiles grounded in university-level curricula across five domains. We further propose an LLM-based first-person interactive evaluation protocol that conducts assessments via a profile-driven student simulator. Complementary evaluations on established benchmarks, supported by human-alignment and ablation studies, confirm the framework's robustness and general utility. Results show that DeepTutor improves personalized metrics by 10.8\% on average and strengthens general agentic reasoning across five backbone models by 29.4\%.

2604.23238 2026-05-12 cs.CR cs.AI

Hiding in Plain Sight: Detectability-Aware Antidistillation of Reasoning Models

Max Hartman, Vidhata Jayaraman, Moulik Choraria, Yash Savani, Lav R. Varshney

AI总结 本文研究了如何在不损害教师模型性能的前提下,防止推理模型的推理轨迹被第三方通过蒸馏技术窃取。为解决现有反蒸馏方法忽视可检测性的问题,作者将反蒸馏建模为一个Stackelberg博弈,并引入稀疏扰动策略以降低被检测的风险。通过机制可解释性分析,他们识别出对模型输出影响显著但语义隐蔽的“思维锚点”,并基于此提出了一种无需训练、黑盒可用的反蒸馏方法TraceGuard,有效抑制学生模型的学习同时保持轨迹一致性。

详情
英文摘要

Distillation via sampling reasoning traces exposes closed-source frontier models to adversarial third parties who can bypass their guardrails and misappropriate their capabilities. Antidistillation methods aim to address this by poisoning reasoning traces to hinder student model learning while preserving teacher performance. However, current methods overlook detectability, both semantic and syntactic, which erodes trust in the teacher's outputs and signals the defense's presence to adversaries. We address this gap by formulating antidistillation as a Stackelberg game whose constraint set explicitly encodes detectability, and show that perturbing sparingly offers an effective, less detectable alternative to poisoning the full trace. Drawing on mechanistic interpretability, we identify thought anchors, sentences with disproportionate counterfactual influence on model outputs, as a principled sparse target: critical to reasoning yet minimally detectable. We instantiate this in TraceGuard, a training-free, black-box proof-of-concept that locates thought anchors via branching-token detection and poisons them to degrade student distillation while preserving trace coherence.

2603.22868 2026-05-12 cs.CR cs.AI

Agent-Sentry: Bounding LLM Agents via Execution Provenance

Rohan Sequeira, Stavros Damianakis, Umar Iqbal, Konstantinos Psounis

AI总结 随着智能体计算系统的广泛应用,其带来的安全、隐私和可靠性问题日益突出。为了解决系统执行流程不确定、难以验证任务执行合法性的问题,本文提出了一种名为Agent-Sentry的运行时防御机制,通过学习合法执行的边界来检测异常行为。该方法结合了结构分类、敏感参数白名单和LLM判断三种互补的检查方式,在不修改智能体及其工具的前提下,有效阻止了94.3%的注入攻击,同时保持了95.1%的正常操作通过率。

详情
英文摘要

Agentic computing systems, while immensely capable, raise serious security, privacy, and safety concerns. A key issue is that the full set of functionalities offered by these systems, combined with their probabilistic execution flows, is not known beforehand. Given this lack of characterization, it is challenging to validate whether a system has successfully carried out the user's intended task or instead executed irrelevant actions, potentially as a consequence of compromise. We present \emph{Agent Sentry}, a runtime defense that learns a bound on an agent's benign execution from prior legitimate executions and flags any action that falls outside this bound. Agent Sentry layers three complementary checks: a structural classifier over the sequence of actions and the provenance of each function's arguments; a deterministic allowlist check over sensitive argument values; and an LLM judge, invoked only on the residual of actions where the first two checks cannot safely decide between a legitimate new request and a carefully crafted injection. We demonstrate the effectiveness of Agent Sentry in AgentDojo and AgentDyn by blocking 94.3\% of successful injections while allowing 95.1\% of benign executions, without modifying the agent, its tools, or the LLM.

2602.16708 2026-05-12 cs.CR cs.AI cs.MA

Formal Policy Enforcement for Real-World Agentic Systems

Nils Palumbo, Sarthak Choudhary, Jihye Choi, Guy Amir, Prasad Chalasani, Somesh Jha

AI总结 本文研究了如何在现实世界的智能体系统中形式化地执行安全策略,提出了一个基于面向方面编程的框架,将策略与智能体的推理过程分离,并在每个关键决策点进行强制执行。该框架使用Datalog作为策略语言,支持递归规则和确定性执行,并通过形式化的假设/保证合同确保策略的正确性。研究还实现了名为FORGE的系统,并在多个实际案例中验证了其有效性。

详情
英文摘要

Security policy enforcement in contemporary agentic systems predominantly consists of embedding natural-language policies within an agent's system prompt and delegating compliance to the agent's reasoning. This approach admits no formal enforcement guarantee and cannot express policies whose satisfaction depends on the causal history of an execution, a gap that becomes acute in multi-agent systems, where enforcement must reason across agents. We argue that policy enforcement in agentic systems is most naturally understood as a cross-cutting concern, and propose a framework grounded in aspect-oriented programming that specifies policies independent of the agent's reasoning and enforces them at every policy-relevant decision. Policies are written in Datalog over a set of abstract predicates describing the execution context, an observability service governed by a formal assume/guarantee contract maintains these predicates, and a reference monitor consults the policy at each action to produce an enforcement decision. When the environment contract holds, enforcement decisions coincide with the policy's intended semantics. We adopt Datalog as the policy language, a natural fit because it supports declarative rule specification, admits recursion for policies over transitive relationships, and yields deterministic enforcement. Datalog further admits tractable static analyses for contradiction, redundancy, subsumption, and conditional reachability, enabling authors to verify policy intent and surface ambiguities inherent in natural-language specifications. We realize the framework in FORGE, which enforces policies over agentic deployments without modification to the underlying agents. We evaluate FORGE on three case studies: information flow policies for prompt injection defense, approval workflows in a multi-agent pharmacovigilance system, and organizational policies for customer service.

2602.12286 2026-05-12 q-bio.GN cs.CL

Mind the Gap No More: Achieving Zero-Gap Multimodal Integration via One Tokenizer

Yanan Li, Christina Yi Jin, Yuan Jin, Manli Luo, Tie Xu, Shuai Jiao, Wei He, Qing Zhang

AI总结 本文研究了多模态大语言模型中异构输入整合的核心挑战,提出了一种名为“One Tokenizer”的新型架构,通过将所有模态直接映射到共享的词元空间,消除模态间的几何差距,从而实现零差距的多模态融合。该方法在DNA与文本的多模态任务中表现出色,验证了其在深度生物推理中的优越性。

Comments Under review at NeurIPS 2026

详情
英文摘要

A central challenge in developing Multimodal Large Language Models (MLLMs) is effectively integrating heterogeneous inputs into a cohesive reasoning engine. Current paradigms predominantly rely on modular architectures that introduce modality-specific encoders and cross-modal fusion mechanisms. However, these designs are fundamentally bottlenecked by a geometric modality gap, forcing the LLM to expend significant computational capacity on geometric reconciliation rather than deep cross-modal reasoning. In this work, we formally characterize this modality gap and theoretically demonstrate that native architectures, specifically those employing a unified vocabulary, intrinsically maintain a zero-gap state across all hidden layers. Guided by these theoretical findings, we propose \textit{One Tokenizer}, a native architecture that maps all modalities directly into a shared token space. We empirically validate this framework on a DNA--text multimodal testbed. Our extensive evaluations reveal that by achieving seamless integration within the LLM's native latent space, One Tokenizer consistently outperforms encoder-based modular counterparts, providing a fundamentally superior framework for deep biological reasoning.

2510.19544 2026-05-12 cond-mat.dis-nn cond-mat.stat-mech cs.AI cs.LG physics.comp-ph

Demonstrating Real Advantage of Machine-Learning-Enhanced Monte Carlo for Combinatorial Optimization

Luca Maria Del Bono, Federico Ricci-Tersenghi, Francesco Zamponi

AI总结 该研究探讨了机器学习辅助的蒙特卡洛方法在组合优化问题中的实际优势,聚焦于三维伊辛自旋玻璃的最小能量配置问题。研究提出了一种结合传统局部移动与机器学习生成全局移动的全局退火蒙特卡洛算法,实验表明该方法在性能和鲁棒性上均优于经典模拟退火和群体退火方法。结果清晰地证明了机器学习增强的优化方法在组合优化任务中能够超越现有经典方法。

Comments 13 main pages, 6 main figures. 4 supplementary pages, 2 supplementary figures

详情
Journal ref
PNAS 123, e2534768123 (2026)
英文摘要

Combinatorial optimization problems are central to both practical applications and the development of optimization methods. While classical and quantum algorithms have been refined over decades, machine learning--assisted approaches are comparatively recent and have not yet consistently outperformed simple, state-of-the-art classical methods. Here, we focus on a class of Quadratic Unconstrained Binary Optimization (QUBO) problems, specifically the challenge of finding minimum energy configurations in three-dimensional Ising spin glasses. We use a Global Annealing Monte Carlo algorithm that integrates standard local moves with global moves proposed via machine learning. We show that local moves play a crucial role in achieving optimal performance. Benchmarking against Simulated Annealing and Population Annealing, we demonstrate that Global Annealing not only surpasses the performance of Simulated Annealing but also exhibits greater robustness than Population Annealing, maintaining effectiveness across problem hardness and system size without hyperparameter tuning. These results provide clear and robust evidence that a machine learning--assisted optimization method can exceed the capabilities of classical state-of-the-art techniques in a combinatorial optimization setting.

2510.12811 2026-05-12 cs.CR cs.LG

Applying Graph Analysis for Unsupervised Fast Malware Fingerprinting

ElMouatez Billah Karbab, Mourad Debbabi

AI总结 随着恶意软件数量的迅速增长,手动分析已难以应对。本文提出了一种名为TrapNet的无监督框架,用于快速对恶意软件进行指纹识别与分组。TrapNet基于图社区检测技术,结合静态分析和一种新型的数值模糊哈希方法FloatHash,生成恶意软件的语义特征向量,并构建恶意软件相似性网络,从而高效识别具有相似语义的恶意软件群组,显著提升了检测效率和准确性。

详情
英文摘要

Malware proliferation is increasing at a tremendous rate, with hundreds of thousands of new samples identified daily. Manual investigation of such a vast amount of malware is an unrealistic, time-consuming, and overwhelming task. To cope with this volume, there is a clear need to develop specialized techniques and efficient tools for preliminary filtering that can group malware based on semantic similarity. In this paper, we propose TrapNet, a novel, scalable, and unsupervised framework for malware fingerprinting and grouping. TrapNet employs graph community detection techniques for malware fingerprinting and family attribution based on static analysis, as follows: (1) TrapNet detects packed binaries and unpacks them using known generic packer tools. (2) From each malware sample, it generates a digest that captures the underlying semantics. Since the digest must be dense, efficient, and suitable for similarity checking, we designed FloatHash (FH), a novel numerical fuzzy hashing technique that produces a short real-valued vector summarizing the underlying assembly items and their order. FH is based on applying Principal Component Analysis (PCA) to ordered assembly items (e.g., opcodes, function calls) extracted from the malware's assembly code. (3) Representing malware with short numerical vectors enables high-performance, large-scale similarity computation, which allows TrapNet to build a malware similarity network. (4) Finally, TrapNet employs state-of-the-art community detection algorithms to identify dense communities, which represent groups of malware with similar semantics. Our extensive evaluation of TrapNet demonstrates its effectiveness in terms of the coverage and purity of the detected communities, while also highlighting its runtime efficiency, which outperforms other state-of-the-art solutions.

2507.17921 2026-05-12 stat.ML cs.LG eess.IV math.ST stat.CO stat.ME stat.TH

Sliding Window Informative Canonical Correlation Analysis

Arvind Prasadan

AI总结 本文提出了一种适用于流数据场景的新型典型相关分析方法——滑动窗口信息典型相关分析(SWICCA),用于实时发现两个数据集之间的相关特征。该方法结合流式主成分分析算法与滑动窗口样本,实现了对CCA成分的在线估计,具有高维数据处理能力和良好的可扩展性。文中通过数值模拟和实际数据案例验证了方法的有效性,并提供了理论性能保证。

Comments 11 pages (double column), submitted; revised with updated simulations

详情
英文摘要

Canonical correlation analysis (CCA) is a technique for finding correlated sets of features between two datasets. In this paper, we propose a novel extension of CCA to the online, streaming data setting: Sliding Window Informative Canonical Correlation Analysis (SWICCA). Our method uses a streaming principal component analysis (PCA) algorithm as a backend and uses these outputs combined with a small sliding window of samples to estimate the CCA components in real time. We motivate and describe our algorithm, provide numerical simulations to characterize its performance, and provide a theoretical performance guarantee. The SWICCA method is applicable and scalable to extremely high dimensions, and we provide a real-data example that demonstrates this capability.

2502.06044 2026-05-12 stat.ML cs.LG

Differentially Private Hyperparameter Tuning using Local Bayesian Optimization

Getoar Sopa, Juraj Marusic, Marco Avella Medina, John P. Cunningham

AI总结 本文研究了在验证数据包含敏感用户信息时,如何实现差分隐私的超参数调优问题。针对现有方法依赖近似随机搜索或全局贝叶斯优化导致效率低下的问题,提出了一种基于局部贝叶斯优化的差分隐私框架DP-GIBO,利用高斯过程代理模型私密地近似梯度。该方法在适当条件下可保证收敛到局部最优超参数配置,并在中高维超参数空间中表现出优于非隐私随机搜索和全局贝叶斯优化的性能。

Comments 26 pages, 6 figures

详情
英文摘要

Hyperparameter tuning is a key component of machine learning procedures, but when validation data contain sensitive user information, search mechanisms can leak private information through the selected configuration. Existing differentially private hyperparameter tuning methods often rely on near-random search, while prior differentially private Bayesian optimization approaches are typically global and, therefore, scale poorly with the hyperparameter dimensionality. We study differentially private hyperparameter tuning using local Bayesian optimization, focusing on settings where the validation objective is available only through noisy black box evaluations and gradients are unavailable or impractical to compute. We introduce DP-GIBO, a differentially private local Bayesian optimization framework that privately approximates gradients using a Gaussian Process surrogate. Under suitable conditions, we prove that DP-GIBO converges to a locally optimal hyperparameter configuration up to a privacy-dependent error, with dimensional dependence that is polynomial rather than exponential.Empirically, we show that DP-GIBO provides scalable private hyperparameter tuning across multiple tasks, substantially outperforming non-private random search and global Bayesian optimization baselines in moderate-to-high-dimensional hyperparameter spaces.

2410.01656 2026-05-12 math.ST cs.DS cs.LG stat.CO stat.ML stat.TH

Efficient Statistics With Unknown Truncation, Polynomial Time Algorithms, Beyond Gaussians

Jane H. Lee, Anay Mehrotra, Manolis Zampetakis

AI总结 本文研究了在样本仅来自未知集合 $S \subseteq \mathbb{R}^d$ 的情况下,如何高效估计分布参数的问题。作者提出了一种多项式时间算法,适用于满足特定结构条件的指数族分布,并能处理由低次多项式近似表示的未知截断集 $S$,从而扩展了对高斯分布参数估计的现有结果。此外,针对截断集为半空间或轴对齐矩形的情况,作者设计了运行时间为 $\mathrm{poly}(d/\varepsilon)$ 的算法,为截断数据下的参数估计提供了更高效的解决方案。

Comments Appeared at the 65th IEEE Symposium on Foundations of Computer Science (FOCS), 2024; abstract shortened for arXiv

详情
英文摘要

We study the estimation of distributional parameters when samples are shown only if they fall in some unknown set $S \subseteq \mathbb{R}^d$. Kontonis, Tzamos, and Zampetakis (FOCS'19) gave a $d^{\mathrm{poly}(1/\varepsilon)}$ time algorithm for finding $\varepsilon$-accurate parameters for the special case of Gaussian distributions with diagonal covariance matrix. Recently, Diakonikolas, Kane, Pittas, and Zarifis (COLT'24) showed that this exponential dependence on $1/\varepsilon$ is necessary even when $S$ belongs to some well-behaved classes. These works leave the following open problems which we address in this work: Can we estimate the parameters of any Gaussian or even extend beyond Gaussians? Can we design $\mathrm{poly}(d/\varepsilon)$ time algorithms when $S$ is a simple set such as a halfspace? We make progress on both of these questions by providing the following results: 1. Toward the first question, we give a $d^{\mathrm{poly}(\ell/\varepsilon)}$ time algorithm for any exponential family that satisfies some structural assumptions and any unknown set $S$ that is $\varepsilon$-approximable by degree-$\ell$ polynomials. This result has two important applications: 1a) The first algorithm for estimating arbitrary Gaussian distributions from samples truncated to an unknown $S$; and 1b) The first algorithm for linear regression with unknown truncation and Gaussian features. 2. To address the second question, we provide an algorithm with runtime $\mathrm{poly}(d/\varepsilon)$ that works for a set of exponential families (containing all Gaussians) when $S$ is a halfspace or an axis-aligned rectangle. Along the way, we develop tools that may be of independent interest, including, a reduction from PAC learning with positive and unlabeled samples to PAC learning with positive and negative samples that is robust to certain covariate shifts.

2410.01244 2026-05-12 stat.ML cs.LG math.PR

Equivariant score-based generative models provably learn distributions with symmetries efficiently

Ziyu Chen, Markos A. Katsoulakis, Benjamin J. Zhang

AI总结 本文研究了如何高效学习具有对称性的数据分布,提出了首个关于等变分数生成模型(SGMs)的理论分析与保证。通过改进Wasserstein-1距离下的泛化界,并结合哈密顿-雅可比-贝尔曼理论,论文证明了在不进行数据增强的情况下,使用等变向量场即可有效学习对称化分布的分数函数。研究还表明,若未在模型中引入等变结构,将导致更差的泛化性能,突显了等变先验在对称数据建模中的重要性。

详情
英文摘要

Symmetry is ubiquitous in many real-world phenomena and tasks, such as physics, images, and molecular simulations. Empirical studies have demonstrated that incorporating symmetries into generative models can provide better generalization and sampling efficiency when the underlying data distribution has group symmetry. In this work, we provide the first theoretical analysis and guarantees of score-based generative models (SGMs) for learning distributions that are invariant with respect to some group symmetry and offer the first quantitative comparison between data augmentation and adding equivariant inductive bias. First, building on recent works on the Wasserstein-1 ($\mathbf{d}_1$) guarantees of SGMs and empirical estimations of probability divergences under group symmetry, we provide an improved $\mathbf{d}_1$ generalization bound when the data distribution is group-invariant. Second, we describe the inductive bias of equivariant SGMs using Hamilton-Jacobi-Bellman theory, and rigorously demonstrate that one can learn the score of a symmetrized distribution using equivariant vector fields without data augmentations through the analysis of the optimality and equivalence of score-matching objectives. This also provides practical guidance that one does not have to augment the dataset as long as the vector field or the neural network parametrization is equivariant. Moreover, we quantify the impact of not incorporating equivariant structure into the score parametrization, by showing that non-equivariant vector fields can yield worse generalization bounds. This can be viewed as a type of model-form error that describes the missing structure of non-equivariant vector fields. Numerical simulations corroborate our analysis and highlight that data augmentations cannot replace the role of equivariant vector fields.

2409.04463 2026-05-12 eess.SY cs.CE cs.LG cs.SY

SINDyG: Sparse Identification of Nonlinear Dynamical Systems from Graph-Structured Data, with Applications to Stuart-Landau Oscillator Networks

Mohammad Amin Basiri, Sina Khanmohammadi

AI总结 本文提出了一种基于图结构数据的稀疏非线性动力系统识别方法SINDyG,旨在从具有子系统交互关系的复杂系统中准确提取动力学模型。该方法通过引入网络结构信息作为稀疏回归中的正则化项,提升了对系统动态行为的建模能力。研究以扩展的斯图尔特-兰道振子网络为例,验证了SINDyG在识别非线性动力学方面的有效性与优越性,为复杂网络系统的建模提供了新的思路。

详情
Journal ref
Journal of Complex Networks, Volume 13, Issue 5, cnaf029 (2025)
英文摘要

The combination of machine learning (ML) and sparsity-promoting techniques is enabling direct extraction of governing equations from data, revolutionizing computational modeling in diverse fields of science and engineering. The discovered dynamical models could be used to address challenges in climate science, neuroscience, ecology, finance, epidemiology, and beyond. However, most existing sparse identification methods for discovering dynamical systems treat the whole system as one without considering the interactions between subsystems. As a result, such models are not able to capture small changes in the emergent system behavior. To address this issue, we developed a new method called Sparse Identification of Nonlinear Dynamical Systems from Graph-structured data (SINDyG), which incorporates the network structure into sparse regression to identify model parameters that explain the underlying network dynamics. We tested our proposed method using several case studies of neuronal dynamics, where we modeled the macroscopic oscillation of a population of neurons using the extended Stuart-Landau (SL) equation and utilize the SINDyG method to identify the underlying nonlinear dynamics. Our extensive computational experiments validate the improved accuracy and simplicity of discovered network dynamics when compared to the original SINDy approach. The proposed graph-informed penalty can be easily integrated with other symbolic regression algorithms, enhancing model interpretability and performance by incorporating network structure into the regression process.

math/0402063 2026-05-12 math.CO

Lattice congruences, fans and Hopf algebras

Nathan Reading

AI总结 本文研究了弱序格同余在Coxeter群中的几何与代数性质,提出了一种统一的解释,并推广了从排列到三角剖分和子集的映射。通过构造与格商相关的完整扇形,建立了与非交换对称函数Hopf代数相关的子Hopf代数,并利用模式避免描述其基。研究还表明,Malvenuto-Reutenauer代数可以视为一系列更小代数的极限,并与Baxter排列数量相等的排列集建立了联系。

Comments 34 pages, 1 figure. Version 2: Very belatedly updating the arXiv version to agree with the last pre-publication version

详情
Journal ref
J. Combin. Theory Ser. A, 110 (2005) no. 2, 237-273
英文摘要

We give a unified explanation of the geometric and algebraic properties of two well-known maps, one from permutations to triangulations, and another from permutations to subsets. Furthermore we give a broad generalization of the maps. Specifically, for any lattice congruence of the weak order on a Coxeter group we construct a complete fan of convex cones with strong properties relative to the corresponding lattice quotient of the weak order. We show that if a family of lattice congruences on the symmetric groups satisfies certain compatibility conditions then the family defines a sub Hopf algebra of the Malvenuto-Reutenauer Hopf algebra of permutations. Such a sub Hopf algebra has a basis which is described by a type of pattern avoidance. Applying these results, we build the Malvenuto-Reutenauer algebra as the limit of an infinite sequence of smaller algebras, where the second algebra in the sequence is the Hopf algebra of non-commutative symmetric functions. We also associate both a fan and a Hopf algebra to a set of permutations which appears to be equinumerous with the Baxter permutations.

math/0209080 2026-05-12 math.RA

Bounds for the Entropy of Graded Algebras

Jan Snellman

AI总结 本文研究了分级结合代数的熵的上界问题,定义熵为各齐次分量维数的几何平均的极限上确界。作者在前人结果的基础上,利用Friedland关于0-1矩阵最大谱半径的结论,改进了自由结合代数同余商代数熵的上界估计,给出了更精确的界。

Comments 4 pages, 1 figure

详情
Journal ref
Communications in Algebra, 2026
英文摘要

Newman, Schneider and Shalev defined the entropy of a graded associative algebra A as H(A) = \limsup_{n \to \infty} \sqrt[n]{a_n}, where a_n is the vector space dimension of the n'th homogeneous component. When A is the homogeneous quotient of a finitely generated free associative algebra, they showed that H(A) \le \sqrt{a_2}. Using some results of Friedland on the maximal spectral radius of 0-1 matrices with a prescribed number of ones, we improve on this bound.

2605.10941 2026-05-12 cs.CC

Average-Case Hardness of Binary-Encoded Clique in Proof and Communication Complexity

Susanna F. de Rezende, David Engström, Yassine Ghannane, Duri Andrea Janett, Artur Riazanov

AI总结 本文研究了在平均情况下,证明图中不存在大团问题在证明复杂度和通信复杂度中的困难性。通过分析随机采样的稠密图的二进制编码团问题,作者证明了切割平面和有限深度的模2解证法的下界为指数级,并指出在这些公式中寻找被违反子句的随机通信复杂度为多项式级。这一结果揭示了在平均情况下,这类问题在不同计算模型中表现出显著的难度差异。

Comments Full version of a paper to appear at ICALP 2026

详情
英文摘要

We study the average-case hardness of establishing that a graph does not have a large clique in both proof and communication complexity. We show exponential lower bounds on the length of cutting planes and bounded-depth resolution over parities refutations of the binary encoding of clique formulas on randomly sampled dense graphs. Moreover, we show that the randomized communication complexity of finding a falsified clause in these formulas is polynomial.

2605.10940 2026-05-12 astro-ph.HE

Electromagnetic Follow-up of the Sub-Solar Mass Gravitational Wave Candidate S251112cm: Kilonova Constraints and a Coincident IIb Supernova

Xander J. Hall, Tomas Ahumada, Julius Gassert, Antonella Palmese, Brian D. Metzger, Mansi M. Kasliwal, Mattia Bulla, Daniel Gruen, Robert Stein, Christoffer Fremling, Shreya Anand, Igor Andreoni, Malte Busmann, Tomás Cabrera, Ryan Christinzio, James Freeburn, Ignacio Magaña Hernandez, Lei Hu, Brendan O'Connor, Ji-an Jiang, Zhengyan Liu, Wen Zhao, Eric C. Bellm, David Cook, Michael W. Coughlin, Richard Dekany, Matthew Graham, Russ R. Laher

AI总结 2025年11月12日,LIGO-Virgo-KAGRA合作组探测到一个包含至少一个亚太阳质量天体的致密天体合并引力波候选事件S251112cm。研究团队利用多个望远镜在引力波警报后2.4小时内对56%的定位区域进行了电磁波后随观测,未发现千新星对应体,但发现了一个IIb型超新星SN 2025adtq,其爆发时间比引力波事件早约2天,空间关联性较强。该结果为亚太阳质量中子星合并的“超千新星”形成机制提供了部分支持,但证据尚不充分。

详情
英文摘要

On November 12th, 2025 the LIGO--Virgo--KAGRA (LVK) collaboration reported gravitational waves (GWs) from a compact object merger candidate (S251112cm) with at least one sub-solar mass component. Using the Dark Energy Camera (DECam), the Fraunhofer Telescope at Wendelstein Observatory (FTW), and the Zwicky Transient Facility (ZTF), we surveyed $56\%$ of the GW localization region beginning $2.4$~hours after the GW alert. We find no kilonova (KN) counterpart, and use radiative-transfer models to rule out $42\%$ (ZTF), $68\%$ (DECam), and $92\%$ (FTW) of the KN models as possible emission from this GW candidate. Within the recently proposed disk-fragmentation (``superkilonova'') model for generating sub-solar mass neutron star mergers from stellar core-collapse, the delay between the supernova explosion time and the GW merger time is estimated to be less than a few days. Searching this time window prior to the GW event, we identify and spectroscopically classify a IIb supernova (SN~2025adtq), with a spatial association odds ratio of $\log_{10}\mathcal{I} \approx 4.8$, a chance coincidence probability of ${\sim}2$--$9\%$, and an estimated explosion time ${\sim}2$ days prior to S251112cm. SN~2025adtq is the second Type~IIb supernova found in spatial and temporal coincidence with a sub-solar mass GW candidate, following the previously reported S250818k/SN~2025ulz association; jointly, we measure an odds ratio that favors the association hypothesis over the null, however, when conditioned on finding a coincident supernova by chance, the odds ratio disfavors association. Together, these results provide suggestive but inconclusive evidence for the superkilonova formation channel.

2605.10939 2026-05-12 math.MG math.PR

Dimension-free Gaussian tail estimates for linear functionals on convex bodies

Brayden Letwin, Dan Mikulincer

AI总结 本文研究凸体上均匀分布随机向量的线性泛函的高斯尾部估计问题。作者证明了存在一个绝对常数和一个足够大的正交向量集合,使得对于该集合中的每个向量,对应的线性泛函的 $L^p$ 范数在 $p$ 的范围内与 $L^2$ 范数成比例,从而给出了维度无关的高斯尾界估计。这一结果为理解高维凸体的几何与概率性质提供了新的分析工具。

Comments 18 pages

详情
英文摘要

Let $K \subset \mathbb{R}^n$ be a centered convex body of volume one. We prove that there exist absolute constants $c,C > 0$ and an orthonormal set of vectors $Θ\subset S^{n-1}$ with size $\left|Θ\right| \ge 9n/10$ such that, if $X$ is a random vector uniformly distributed on $K$, then for all $θ\in Θ$ one has \[ c\cdot \sqrt{p}\,\left(\mathbb{E} \left|\left\langle X,θ\right\rangle\right|^2\right)^{1/2} \le \left(\mathbb{E} \left|\left\langle X,θ\right\rangle\right|^p\right)^{1/p} \le C\cdot \sqrt{p}\,\left(\mathbb{E} \left|\left\langle X,θ\right\rangle\right|^2\right)^{1/2}, \] where the upper estimate holds for all $p \ge 1$ while the lower bound only holds for $1 \le p \le n$.

2605.10935 2026-05-12 astro-ph.CO

Demonstrating the Use of the Spherical Fourier Bessel Basis for Large Scale Clustering Systematics Discovery and Mitigation with eBOSS

Sean Bruton, James R. Cheshire, Olivier Doré, Henry S. Grasshorn Gebhardt, Robin Y. Wen

AI总结 本文研究了如何利用球面傅里叶-贝塞尔(SFB)基函数在大尺度结构巡天中识别和缓解系统误差。通过分析eBOSS DR16中的LRG和QSO样本,作者展示了SFB基在分离角向和径向模态方面的优势,并发现了可能受系统误差影响的模态。研究在QSO样本中发现了大尺度物理结构中存在系统误差的显著证据,可能源于残余恒星污染,并在LRG和QSO样本中检测到了与角向和成像尺度相关的未知系统误差。

Comments 17 pages, 12 figures, 1 appendix

详情
英文摘要

The Spherical Fourier-Bessel (SFB) basis, in separating the angular and radial modes of the power spectrum, permits a targeted identification and mitigation of systematics in clustering surveys while retaining more cosmological signal than traditional bases. We demonstrate this principle on the eBOSS DR16 LRG and QSO samples, identifying modes which may be contaminated by systematics. Our initial inference on the LRG sample yields an fNL value consistent with zero, while the QSO value is in slight tension with zero. Using the SFB basis, we vary the selection of angular and radial modes to search for inconsistencies in the inferred value of fNL, an indicator of underlying systematics. In the QSO sample, we find evidence (p < 0.005 compared to the same cuts on EZMocks) of a systematic afflicting large physical scales, which is consistent with residual stellar contamination; we also find evidence (p < 0.05) for an unknown systematic in the QSO and LRG samples at the approximate angular plate and imaging scale of eBOSS.

2605.10932 2026-05-12 quant-ph cond-mat.mes-hall

Crystallographic Symmetry Generates Phononic Holonomic Gates with Biased-Erasure Channels

El Mustapha Mansouri, Keigo Arai

AI总结 该研究利用晶体对称性设计了一种基于声子操控的量子门,通过应变调控实现高保真度的逻辑操作。核心方法是在具有特定对称性的Λ型能级结构中,利用机械模态合成圆形应变场,结合超绝热回波-月牙形拓扑门技术,实现无需局部微波场的控制。研究展示了在氮空位中心上的模拟结果,达到接近99.9%的条件平均保真度,并提出了一种具有偏差擦除特性的噪声模型,为量子纠错提供了更高效的错误识别与处理机制。

Comments 49 pages, 20 figures

详情
英文摘要

Solid-state processors require control layers whose errors are legible to quantum-error-correction decoders. We show that crystallographic symmetry can provide such a layer in strain-active Lambda manifolds. When the projected strain tensor and Lambda-transition operators share a multiplicity-one two-dimensional irreducible representation, symmetry fixes the linear strain interaction to a scalar dot product. Two phase-locked mechanical modes synthesize a circular strain field, enabling complex phononic Lambda-leg control without local microwave near fields. On this manifold we construct a superadiabatic echo-lune holonomic gate using Lambda-leg control and a resonant double-quantum counterdiabatic tone. Rotating-frame simulations of a nitrogen-vacancy center give 99.88% conditional average fidelity in 1.833 microseconds, or 99.40% when leakage is counted as error. A resonant gigahertz high-overtone bulk acoustic resonator analysis translates the Hamiltonian into Rabi-rate, linewidth, and envelope-tracking requirements. The bright-state structure organizes noise: A2-sector perturbations are parity-filtered into an optically distinguishable auxiliary state, whereas transverse E-sector faults are echo suppressed and retained as a decoder stress axis. The extracted channel has 0.47% erasure probability and 0.168% residual Z error. In XZZX code-capacity simulations, this biased-erasure model yields a nominal 64% fit-extrapolated data-qubit reduction relative to an unstructured Rabi baseline. Repeated-round detector-model diagnostics preserve the nominal distance-9 proxy and identify missed erasures, transverse floors, leakage/flag timing, and strong crosstalk as validation limits. Extensions to orbital Lambda systems and bright-projector phonon-bus diagnostics identify crystallographic symmetry as a principle for co-designing phononic actuation, leakage, noise bias, and quantum decoding.

2605.10929 2026-05-12 math.NA cs.NA

Efficient Admissible Set Projection in Optimization-based Invariant-Domain-Preserving Limiters for Ideal MHD

Chen Liu, Chi-Wang Shu, Xiangxiong Zhang

AI总结 本文研究了在理想磁流体动力学(MHD)方程的优化型不变域保持限制器中,如何高效地进行可接受集投影的问题。为实现物理合理且计算稳健的数值解,作者提出了一种基于优化的限制器,在保持全局守恒和精度的同时确保解的可接受性。通过将可接受集按磁能参数化为切片,将高维投影问题简化为一维最小化问题,从而高效求解,并结合分裂方法与Zhang-Shu限制器进一步提升计算效率与精度。

详情
英文摘要

Preserving the admissible set of the ideal magnetohydrodynamics (MHD) equations is important not only for producing physically meaningful numerical solutions, but more importantly for achieving robust computations. In this paper, we develop an optimization-based limiter to enforce admissibility while preserving global conservation and accuracy. For an easy and efficient projection, we decompose the admissible set into slices parameterized by the magnetic energy, so that the MHD projection reduces to a one-dimensional minimization, which can be solved efficiently by the Brent method. The splitting method can be used to efficiently solve the global minimization problem of the optimization-based limiter, which can be used to enforce cell average admissibility in discontinuous Galerkin (DG) schemes, and pointwise admissibility can be further enforced by the Zhang-Shu positivity-preserving limiter. We apply the limiter to high-order DG schemes and present numerical results for a few representative MHD problems.

2605.10928 2026-05-12 astro-ph.CO

Mitigating residual foregrounds and systematic errors in SKA1-Low AA* EoR observations via Bayesian Gaussian Process Regression

Samit Kumar Pal, Abhirup Datta, Aishrila Mazumder, Anshuman Tripathi

AI总结 本文研究了如何通过贝叶斯高斯过程回归(GPR)方法,有效抑制SKA1-Low观测中来自天体物理前景和系统误差的残余干扰,以提高宇宙再电离时期21厘米信号的检测精度。研究基于自主研发的端到端仿真平台,模拟了包括天体点源、天线增益校准误差、电离层相位误差等多种因素的复杂观测环境,并评估了不同GPR框架在抑制残余前景、减少信号损失和提供可靠误差估计方面的能力。结果表明,该方法能够在较宽的波数范围内可靠地恢复21厘米信号,精度达到2σ置信区间。

Comments 35 pages, 13 figures, comments are welcome, To be submitted to JCAP

详情
英文摘要

The redshifted 21\,cm line is an emerging tool in observational cosmology that can serve as a direct probe of the intergalactic medium throughout the cosmic timeline. However, the observation of the cosmological 21\,cm signal from early epochs is extremely challenging in practice, regardless of the scale of interest and redshift. The presence of bright astrophysical foregrounds and residual systematic errors along the line of sight poses challenges for its detection. Machine-learning-based Gaussian process regression\,(ML-GPR) has proven to be the most effective strategy for signal separation in LOFAR and NenuFAR observations to measure the 21\,cm signal power spectrum from the Cosmic Dawn\,(CD) and Epoch of Reionization\,(EoR). In this work, we extend this framework to synthetic CD/EoR SKA1-Low observations to assess its robustness in mitigating residual foregrounds against instrumental and environmental systematic effects. We use our developed end-to-end realistic simulation pipeline (\textsc{21cmE2E}) for SKA-Low observations. Our 4-hour tracking simulation includes extragalactic point sources, the AA* telescope configuration, primary beam response, and error models. The modelled errors incorporate residual antenna-based gain calibration errors, residual ionospheric phase errors, partial de-mixing of the out-of-field sources, and instrumental noise for 1000\,hours of deep integration time. We compare different Bayesian GPR frameworks to assess their ability to suppress residual foreground contamination while minimizing signal loss and providing reliable uncertainty estimates. Our analysis demonstrates that the 21\,cm signal can robustly recover within the $2σ$ credible interval for almost all k-modes over the range of $0.06 \leq k \leq 1.0$~h Mpc$^{-1}$.

2605.10927 2026-05-12 cs.DS

Chasing Small Sets Optimally Against Adaptive Adversaries

Christian Coester, Alexa Tudose

AI总结 本文研究了在度量空间中确定性在线算法追踪至多 $k$ 个元素集合的问题,该问题也被称为度量服务系统或宽度-$k$ 分层图遍历。作者提出了一种 $O(2^k)$ 竞争比的确定性算法,填补了该问题长达三十年的理论空白,并证明这一界在对抗自适应对手的随机化算法中也是最优的。此外,作者还改进了确定性下界,并针对 $k=3$ 的情况给出了匹配的上界,相关结果对分布式异步树探索和 $k$-出租车问题也具有重要意义。

Comments 32 pages

详情
英文摘要

We study deterministic online algorithms for the problem of chasing sets of cardinality at most $k$ in a metric space, also known as metrical service systems and equivalent to width-$k$ layered graph traversal. We resolve the 30-year-old gap of $Ω(2^k)\cap O(k2^k)$ on the competitive ratio of this problem by giving an $O(2^k)$-competitive deterministic algorithm. This bound is optimal even among randomized algorithms against adaptive adversaries. We also (slightly) improve the deterministic lower bound to $D_k$, defined recursively by $D_1=1$ and $D_{k+1}=2D_k+\sqrt{8+8D_k}+3$, which we conjecture to be exactly tight. For $k=3$, we provide a matching upper bound of $D_3$. Our results imply slightly improved upper and lower bounds for distributed asynchronous collective tree exploration and for the $k$-taxi problem, respectively. Our algorithm generalizes the classical doubling strategy, previously known to be optimal for $k=2$. The previous best bound for general $k$ was achieved by the generalized work function algorithm (WFA), and was known to be tight for WFA. Our improved bound therefore implies that WFA is sub-optimal for chasing small sets.

2605.10926 2026-05-12 math.CO

Counting Spinal Tree-Child Networks via Word Encodings and Generating Functions

Pau Vives, Anna de Mier, Gabriel Cardona, Joan Carles Pons

AI总结 本文研究了一类称为脊椎树-子进化网络的结构,这类网络具有严格的结构特征,即所有内部节点都位于从根到叶的单条路径上。作者提出了两种互补的组合方法:一种是引入词模型,将未标记的脊椎网络对应到具有固定重复次数的受限词类,并通过简单重标记等价关系实现显式的计数公式;另一种是基于标记树的符号方法,通过递归定义得到可解的双变量生成函数,从而直接推导出网络的数量系数。这些方法为脊椎树-子网络的枚举提供了理论基础和计算工具。

Comments 27 pages, 6 figures

详情
英文摘要

We study the enumeration of spinal tree-child phylogenetic networks, a rigid family of tree-child networks in which all internal vertices lie on a single root--to--leaf path. We provide two complementary combinatorial frameworks. First, we introduce a word model: unlabeled spinal networks correspond to a suitable class of restricted words with fixed multiplicities, taken modulo a simple relabeling equivalence, which yields an explicit closed enumeration. Second, we develop a symbolic-method approach based on a marked version of trees that admits a clean recursive specification; its boxed-product translation leads to a solvable bivariate generating function and a direct derivation of the coefficients.

2605.10920 2026-05-12 cs.SE

Using Logs to support Programming Education

Gilmar Gomes do Nascimento, Maria Claudia F. P Emer, Adolfo Gustavo Serra Seca Neto, Laudelino Cordeiro Bastos

AI总结 本研究旨在通过分析编程学习过程中的实时代码日志,弥补编程教育中缺乏量化评估的不足。研究提出了一种新型插件,用于广泛使用的代码编辑器,以记录学生在编程和文档编写过程中的细粒度交互行为,生成包含错误、进度和时间戳的详细数据集。该方法为教育者提供了基于数据的学习分析工具,支持教学方法研究、学习难点识别和个性化教学改进,推动编程教育向数据驱动和实证化方向发展。

Comments Author version of the paper accepted for publication at XX Conferência Latino-Americana de Tecnologias de Aprendizagem - LACLO 2025

详情
英文摘要

Software developers use metrics to evaluate code quality and productivity, but these practices are still rare in programming education. This project bridges the gap by collecting real-time learning analytics from individual student and whole-class code development logs. This granular, quantitative data provides educators with qualitative insights into the learning process. It allows them to evaluate student comprehension, identify common challenges, and critically assess whether the allocated time for exercises and algorithms is sufficient for mastery. Unlike traditional Learning Management Systems, we propose a novel approach: a plugin for a widely used code editor that captures granular interactions during programming and documentation. The resulting dataset logs coding behaviors, errors, and progress, enabling evidence-based analysis of learning patterns and educational benchmarking. By structuring this real-time programming trail, we support research on teaching methodologies, learner challenges, and skill acquisition. Quantitative metrics complement qualitative assessment by evaluating code, exercise progress, and timestamp logs. Our goal is to provide an open-access database for educators and researchers, fostering data-driven insights to enhance instruction and personalize learning experiences. This work aligns industrial best practices with pedagogical innovation, advancing measurable, empirical approaches to programming education.

2605.10918 2026-05-12 physics.atm-clus physics.chem-ph

Single-Photon Double Ionization of Ozone

Veronica Daver Ideböhn, Antoine Gloriod, Richard J. Squibb, Andreas Hult Roos, Nihar Ranjan Behera, Ishita Kanungo, Elias Gustafsson, Simon Gällblad, Saga Berglund, Emelie Olsson, Gunnar Öhrwall, Gunnar Nyman, John M. Dyke, John H. D. Eland, Majdi Hochlaf, Raimund Feifel

AI总结 该研究探讨了臭氧(O₃)在单光子双电离过程中的电子结构与解离动力学。通过结合高能真空紫外光源与多电荷粒子关联检测技术,首次获得了臭氧的单光子价电子双电离电子能谱。研究还利用多组态相互作用方法精确计算了双电离臭氧的势能面及解离通道能量,揭示了臭氧双电离过程中除基态解离外还存在激发态氧离子的生成,展现出更为复杂的解离动力学行为。

详情
英文摘要

Ozone (O3) is a triatomic molecule of central importance in the chemistry and physics of the Earth's and other planetary atmospheres. Beyond its environmental significance, a detailed understanding of the electronic structure and ionization dynamics of ozone is essential for modeling atmospheric, ionospheric, and astrochemical processes. In the present work, we substantially extend the experimental and theoretical characterization of ozone into the regime of valence double photoionization. Using HeII-alpha, HeII-beta, and higher-energy vacuum ultraviolet radiation in combination with a versatile multiple charged-particle correlation detection technique, we report the first single-photon valence double ionization electron spectrum of O3. To interpret the experimental observations, we mapped the lowest potential energy surfaces of dicationic ozone employing post-Hartree-Fock multi-configurational-interaction methods, and computed with high accuracy the energetics of the relevant dissociation channels. Our results demonstrate that dissociative double ionization of ozone produces electronically excited cationic atomic oxygen fragments in addition to the ground-state dissociation pathway, revealing a richer fragmentation dynamics than hitherto recognized.

2605.10915 2026-05-12 math.ST stat.TH

A Generative High Quantile Homogeneity Test Using Bahadur Representation for Heteroskedastic High Quantile Regression of Tail Dependent Time Series

Ting Zhang, Fangwei Wu, Jingying Gao

AI总结 本文研究了在尾部依赖时间序列的异方差高分位数回归中,解释变量对响应变量不同高分位数的影响是否具有同质性的问题。为此,作者提出了一种基于Bahadur表示的新型高分位数同质性检验方法,该方法适用于异方差情形,并能够将非线性高分位数回归估计问题转化为具有显式误差界的线性形式问题。该方法不仅为高分位数回归提供了理论基础,还在实际数据中的应用展示了其有效性。

Comments 31 pages, 1 figure

详情
英文摘要

We consider a high quantile homogeneity test to determine whether a certain set of explanatory variables has homogeneous effects on different high quantiles of the response variable in the tail. To accommodate for situations under both the null and the alternative, the auxiliary process in this case may no longer be treated as stationary, and the problem requires a joint analysis of both homoscedastic and heteroskedastic high quantiles. For this, we develop a novel Bahadur representation result in the high quantile setting for a general class of tail dependent time series under potential heteroskedasticity, which can be of interest by its own. In particular, the Bahadur representation provides a foundation for reducing problems regarding nonlinear high quantile regression estimators to those regarding suitably constructed linear forms with an explicit error bound and can be transformative and useful in many statistical problems. We in the current article apply it to guide the development of a generative high quantile homogeneity test, which is then illustrated through applications to both synthetic and real data.

2605.10914 2026-05-12 stat.CO

gemlib.mcmc: composable kernels for Metropolis-within-Gibbs sampling schemes

Alin Morariu, Jess Bridgen, Chris Jewell

AI总结 该研究针对流行病学和生态学中状态转移模型的统计推断难题,提出了一种名为 gemlib.mcmc 的 MCMC 模块,旨在简化 Metropolis-within-Gibbs 采样方案的实现。通过引入范畴论中的 writer monad,该框架实现了参数估计与数据增强核的可组合性,无需手动管理状态,从而提升了代码的可扩展性与复用性。基于 JAX 和 TensorFlow Probability,gemlib.mcmc 提供了高效且易用的接口,使复杂推断算法能够简洁表达并跨应用复用,降低了实现门槛,推动了方法研究与实际应用的结合。

详情
英文摘要

State-transition models are essential across epidemiology and ecology, but statistical inference remains challenging owing to high-dimensional latent state spaces, temporal dependence, and intractable likelihood functions. Bayesian inference via Markov Chain Monte Carlo (MCMC) enables joint estimation of model parameters and missing event times through data augmentation, but Metropolis-within-Gibbs (MWG) schemes that combine multiple specialised kernels are notoriously difficult to implement. Current probabilistic programming frameworks face a trade-off: automation sacrifices extensibility, whilst flexibility demands substantial implementation overhead. This divide has created a software landscape characterised by tightly coupled, model-specific implementations that resist reuse and extension. We introduce gemlib.mcmc, an MCMC module designed to bridge methodological and applied communities through principled, composable kernel abstractions. The framework employs writer monads from category theory to formalise kernel composition, enabling seamless integration of parameter-estimation and data-augmentation kernels without manual state management. Built on JAX and TensorFlow Probability for high-performance computation, gemlib.mcmc provides an ergonomic interface -- leveraging Python's right-shift operator for intuitive kernel chaining -- whilst maintaining statistical rigour and transparency. Developers can extend the library by implementing only two methods; composition and hardware acceleration are automated. We demonstrate the framework through parameter inference on partially observed epidemic models, showing how complex inference algorithms can be expressed concisely and reused across applications. By reducing implementation burden we provide access to sophisticated MCMC methods and enable applied researchers to employ state-of-the-art algorithms without reimplementation overhead.

2605.10911 2026-05-12 math.PR cs.CC cs.DS math.CO math.ST stat.TH

The stochastic block model has the overlap graph property for modularity

Shankar Bhamidi, David Gamarnik, Remco van der Hofstad, Nelly Litvak, Pawel Pralat, Fiona Skerman, Yasmin Tousinejad

AI总结 本文研究了随机块模型(SBM)中基于模块度的聚类算法的理论极限,指出模块度在SBM中具有重叠间隙性质(OGP)。这一性质表明,基于模块度的局部算法在恢复隐藏的社区结构时存在困难,并且相关马尔可夫链的混合时间较慢。该研究扩展了Bickel和Chen的结论,证明了在高概率下,任何接近最优模块度的划分都与隐藏的社区划分接近,为理解SBM中算法性能的瓶颈提供了理论依据。

Comments 28 pages, 2 figures

详情
英文摘要

The overlap gap property (OGP) is a statement about the geometry of near-optimal solutions. Exhibiting OGP implies failure of a class of local algorithms; and has been observed to coincide with conjectured algorithmic limits in problems with statistical computational gap. We consider the Stochastic Block Model (SBM), where the graph has a planted partition with $k$ equal-size blocks which form the `communities', and where, for parameters $p>q$, vertices within the same community connect with probability $p$, while vertices in different communities connect with probability $q$, independently across pairs of vertices. Modularity--based clustering algorithms have become ubiquitous in applications. This article studies theoretical limits of local algorithms based on the modularity score on the SBM. We establish that modularity exhibits OGP on the SBM. This rules out a class of local algorithms based on modularity for recovery in the SBM, and shows slow mixing time for a related Markov Chain. Theoretically this is one of the few instances where OGP has been established for a `planted' model, as most such analyses to date consider the `null' model. As part of our analysis, we extend a result by Bickel and Chen 2009, who established that with high probability, the modularity optimal partition of SBM is $o(n)$ local moves away from the planted partition, where $n$ is the graph size. We show that, with high probability, any partition with modularity score sufficiently near the optimal value is close to the planted partition.