ShrinkageTrees: An R Package for Bayesian Tree Ensembles for Survival Analysis and Causal Inference
ShrinkageTrees: 用于生存分析和因果推断的贝叶斯树集成R包
AI总结 ShrinkageTrees是一个R包,通过贝叶斯加性回归树模型处理右删失和区间删失生存数据,支持因果推断中的预后和治疗效应分解,并引入深度惩罚、Dirichlet分裂和马蹄铁先验等正则化策略,适用于高维场景。
ShrinkageTrees: 用于生存分析和因果推断的贝叶斯树集成R包
Tijn Jacobs
AI总结 ShrinkageTrees是一个R包,通过贝叶斯加性回归树模型处理右删失和区间删失生存数据,支持因果推断中的预后和治疗效应分解,并引入深度惩罚、Dirichlet分裂和马蹄铁先验等正则化策略,适用于高维场景。
ShrinkageTrees是一个用于生存分析和因果推断的贝叶斯树集成R包。该包在加速失效时间(AFT)框架下实现了针对右删失和区间删失生存结果的贝叶斯加性回归树模型,并可选择分解为预后和治疗效应成分以进行因果推断。提供两种互补的正则化形式:通过深度惩罚先验和Dirichlet分裂先验对树结构进行正则化,以及通过全局-局部收缩先验对步高进行正则化。ShrinkageTrees首次实现了马蹄铁森林,即对步高施加马蹄铁先验。这些正则化策略将贝叶斯树集成扩展到高维设置。高效的Rcpp后端、多链MCMC和S3方法支持完整的流程:拟合、预测、因果效应估计和收敛诊断。
ShrinkageTrees is an R package for Bayesian tree ensembles in survival analysis and causal inference. The package implements Bayesian additive regression tree models for right- and interval-censored survival outcomes within an accelerated failure time (AFT) framework, with optional decomposition into prognostic and treatment-effect components for causal inference. Two complementary forms of regularisation are available: regularisation of the tree structure, via depth-penalising priors and Dirichlet splitting priors, and regularisation of the step heights, via global-local shrinkage priors. ShrinkageTrees provides the first implementation of the Horseshoe Forest, which places a horseshoe prior on the step heights. These regularisation strategies extend Bayesian tree ensembles to high-dimensional settings. An efficient Rcpp backend, multi-chain MCMC, and S3 methods support the full workflow: fitting, prediction, causal effect estimation, and convergence diagnostics.
具有顺序最优子集选择的分数高斯过程的复合似然推断
Mathis Fourreau, Matthieu Garcin
AI总结 针对分数高斯过程,提出通过顺序最大化Godambe信息来选择子集,以平衡估计精度与计算成本,并推导了Fisher信息和Godambe信息的理论表达式。
复合似然方法通过考虑观测的几个子集而非全部来降低时间序列参数估计的计算成本。该方法的渐近性质与Godambe信息相关,Godambe信息是Fisher信息的扩展,考虑了观测子集之间的依赖性。我们旨在将该方法应用于线性高斯模型,特别是分数布朗运动和分数高斯噪声。我们推导了其Fisher信息和Godambe信息的理论表达式,并推导出一种顺序最大化Godambe信息的子集选择设计。子集的大小使我们能够控制估计精度与计算成本之间的权衡。通过模拟,我们将该方法与矩方法和最大似然估计进行比较,并将其应用于真实数据,即股票指数的波动率序列和风速时间序列。
The composite likelihood method reduces the computational cost of parameter estimation in time series by considering several subsets of observations instead of all observations at once. The asymptotic properties of this method are related to the Godambe information, an extension of the Fisher information that accounts for the dependence between subsets of observations. We aim to apply this method to linear Gaussian models, in particular fractional Brownian motion and fractional Gaussian noise. We derive theoretical expressions for their Fisher information and their Godambe information and deduce a subset selection design that sequentially maximizes the Godambe information. The size of the subsets then allows us to control the trade-off between estimation accuracy and computational cost. Through simulations, we compare this method with the method of moments and maximum likelihood estimation, and we apply it to real data, namely volatility series of a stock index and a wind speed time series.
二阶最小二乘法作为多项式最大化方法的特例
Serhii Zabolotnii
AI总结 证明在条件同方差非高斯误差下,最优加权二阶最小二乘法与二次广义多项式最大化方法等价,并揭示高阶效率储备。
我们证明,对于具有条件同方差非高斯误差的线性回归,最优加权二阶最小二乘法(SLS)与二次广义多项式最大化方法(PMM)是相同的总体估计方程:它们选择前两个中心残差矩的最优线性组合,求解同一个总体正规方程组,共享同一个影响函数,并达到相同的渐近方差 $c_2g_2/N$——普通最小二乘斜率方差因子 $c_2$ 乘以 PMM 方差缩减系数 $g_2=1-\gamma_3^2/(2+\gamma_4)$(其中 $\gamma_3,\gamma_4$ 为误差偏度和超额峰度)。因此,可行的插件实现是一阶等价的,仅存在高阶有限样本差异。这一等价性是尖锐的:在异方差下,无条件 PMM 主体与条件 SLS 加权分离,导致对称误差的效率损失和不对称误差的一致性损失。在二次以上,PMM 拥有 SLS 在其二阶矩范围内无法达到的效率储备。对于对称的尖峰误差,SLS 退化为普通最小二乘法估计斜率,而三次 PMM 通过闭式系数 $g_3$ 利用 SLS 矩范围之外的峰度信息;对于典型非对称分布,在三次多项式矩类中,这一储备为 $30$--$50\\%$。Lean 4 开发环境机器检验了特定次数的代数核心——$g_2$ 和 $g_3$ 的闭式、$g_2\le1$ 结果、设计抵消和对称退化——而一般单调性 $g_{S+1}\le g_S\le1$ 通过嵌套分析证明。蒙特卡洛研究说明了等价性、储备和异方差边界在有限样本中的表现。
We prove that optimally weighted second-order least squares (SLS) and the degree-two generalized polynomial maximization method (PMM) are the same population estimating equation for linear regression with conditionally homoskedastic non-Gaussian errors: they choose the same optimal linear combination of the first two centered residual moments, solve one population normal system, share one influence function, and attain the common asymptotic variance $c_2g_2/N$ -- the ordinary-least-squares slope-variance factor $c_2$ scaled by the PMM variance-reduction coefficient $g_2=1-\gamma_3^2/(2+\gamma_4)$ (with $\gamma_3,\gamma_4$ the error skewness and excess kurtosis). Feasible plug-in implementations are therefore first-order equivalent, with only higher-order finite-sample differences. The identity is sharp: under heteroskedasticity the unconditional PMM body and the conditional SLS weighting separate, costing efficiency for symmetric errors and consistency for asymmetric errors. Beyond degree two, PMM holds an efficiency reserve that SLS cannot reach within its second-moment span. For symmetric platykurtic errors SLS collapses to ordinary least squares for the slope, while degree-three PMM exploits kurtosis information outside the SLS moment span through a closed-form coefficient $g_3$; for canonical asymmetric laws this reserve is $30$--$50\%$ within the degree-three polynomial moment class. The Lean 4 development machine-checks the degree-specific algebraic core -- the closed forms for $g_2$ and $g_3$, the $g_2\le1$ result, the design cancellations, and the symmetric collapse -- while the general monotonicity $g_{S+1}\le g_S\le1$ is proved analytically by nesting. A Monte Carlo study illustrates the equivalence, the reserve, and the heteroskedastic boundary at finite samples.
GraphGP: 基于Vecchia近似的可扩展高斯过程
Benjamin Dodge, Philipp Frank, Susan E. Clark
AI总结 提出GraphGP算法,利用Vecchia近似和GPU加速,将高斯过程扩展到近十亿参数,实现线性时间和内存复杂度,适用于大动态范围任意点分布。
高斯过程是建模连续场的强大工具,但其朴素的$\mathcal{O}(N^3)$计算成本和$\mathcal{O}(N^2)$内存需求常常限制其实际应用。Vecchia近似是一种针对平稳、衰减核的稀疏精度矩阵近似,它将每个点仅条件于其$k$个最近邻。我们提出GraphGP,一种用于Vecchia近似的GPU算法,可扩展到近十亿参数,具有线性时间和内存需求,并能处理大动态范围内的任意点分布。我们的关键贡献是:(1) 一种比特反转k-d树排序,允许高效邻居搜索同时最大化批处理并行性;(2) 一种可微的CUDA实现,比纯JAX基线显著更快且内存效率更高。GraphGP提供了推理所需的构建块,包括前向生成、逆应用、对数行列式和核参数导数。
Gaussian processes are a powerful tool for modeling continuous fields, but their naive $\mathcal{O}(N^3)$ computational cost and $\mathcal{O}(N^2)$ memory requirement often limit their practical use. Vecchia's approximation is a sparse precision matrix approximation for stationary, decaying kernels that conditions each point only on its $k$ nearest neighbors. We present GraphGP, a GPU algorithm for Vecchia's approximation that scales to nearly a billion parameters with linear time and memory requirements, handling arbitrary point distributions over a large dynamic range. Our key contributions are (1) a bit-reversed k-d tree ordering that allows efficient neighbor searches while also maximizing batch parallelism, and (2) a differentiable CUDA implementation, which is substantially faster and more memory efficient than our pure JAX baseline. GraphGP provides the building blocks for inference, including forward generation, inverse application, log-determinant, and kernel parameter derivatives.
使用动态更新边界的P样条实现GPLSIAMs的稳定直接估计
Danilo V. Silva, Gilberto A. Paula
AI总结 本文提出了一种稳定直接估计GPLSIAMs的方法,通过使用模型矩阵和惩罚完全鱼尔信息矩阵动态更新单指数协变量的边界,在统一的迭代框架中实现快速计算有效自由度和点wise置信区间。
广义部分线性单指数加法模型(GPLSIAMs)因其在功能灵活性与参数维度缩减之间的平衡而被广泛应用于不同领域。然而,估计过程面临严重的计算挑战。本文介绍了一种新的稳定方法,利用每个单指数效应的模型矩阵,定义为其单指数系数,并通过惩罚完全鱼尔信息矩阵动态更新单指数协变量的边界,以统一的迭代框架实现。推导出的模型矩阵使得能够快速计算估计的有效自由度和单指数效应的点wise置信区间。通过广义Fellner-Schall方法将平滑参数更新整合到迭代过程中,从而提供对全局惩罚优化问题的高效近似。在中等样本量和非高斯分布下的模拟研究证实了估计在多个场景下的经验一致性。值得注意的是,所提出的方法在最先进竞争方法无法恢复真实单指数系数和非线性函数的稳定情况下仍保持稳定,并且在计算最密集的场景中比常规两步方法快80.13倍。通过应用于Capital Bike Sharing数据集,展示了该方法的建模优势,其中处理每年的单指数交互效应,具有不同的单指数系数和复杂的结构,使得竞争方法不适用。所提出的方法在R中实现,提供了可重复和透明的比较功能。
Generalized partially linear single-index additive models (GPLSIAMs) have been increasingly applied across diverse areas due to their versatility in integrating functional flexibility with parametric dimension reduction while maintaining interpretability. However, the estimation presents severe computational challenges. This paper introduces a novel stable method that uses the model matrix for each single-index effect, defined by its single-index coefficients, and the penalized complete Fisher information matrix to dynamically update the boundaries of the single-index covariates within a unified iterative framework. The derived model matrices enable the fast computation of the estimated effective degrees of freedom and pointwise confidence bands for the single-index effects. The smoothing parameter updates are integrated into the iterative process via the generalized Fellner-Schall method, which recycles the derived matrix decompositions, thereby providing an efficient approximation to the global penalized optimization problem. Simulation studies with moderate sample sizes under non-Gaussian distributions confirm the empirical consistency of the estimation across multiple scenarios. Notably, the proposed approach remains stable where state-of-the-art competitive methods fail to recover true single-index coefficients and nonlinear functions, and is 80.13 times faster than the usual two-step method in the most computationally intensive scenario. The modeling advantage is illustrated through an application to Capital Bike Sharing data, where we deal with a single-index interaction effect for each year, with distinct single-index coefficients, a complex structure that makes competitive methods inapplicable. The proposed method is implemented in R, with functions available for reproducibility and transparency in comparisons.
ProjGuard:通过低维投影实现计算机使用代理的安全监控
Kebin Contreras, Carlos Hinojosa, Jorge Bacca, Bernard Ghanem
AI总结 ProjGuard通过行为轨迹监控实现计算机使用代理的安全防护,利用轻量级风险信号提前预警潜在危险,结合辅助视觉语言模型进行针对性修正,提升任务完成率并降低安全风险。
计算机使用代理越来越多地在真实操作系统上运行,但这也增加了提示注入、间接指令和视觉攻击的风险。现有防御通常依赖于在推理时分析提示或每个潜在恶意输入,使用第二个大模型,这可能限制覆盖范围或增加部署成本。我们提出了ProjGuard,一种基于行为轨迹监控的替代方案。在每一步,我们从代理的累积交互历史中推导出一个轻量级的标量风险信号,并在线评估执行是否开始向不安全区域偏移。这使在轨迹达到潜在有害操作之前就能发出预警。当触发警报时,我们选择性地激活辅助的视觉语言模型,提出修正的下一步,并将执行引导回任务完成。在OS-Harm实验中,使用按需修正的监控将不安全率从16%降低到3%,同时提高任务完成率从59%到65%。我们进一步评估了在RiosWorld上的迁移效果,方法保持竞争力,达到4%的不安全率和64%的任务完成率。总体而言,这些结果支持了一种分层的安全策略,即持续监控可提前预警偏差,并仅在需要时激活修正。
Computer-use agents are increasingly capable of operating on real operating systems, but this capability has also increased the risks posed by prompt injection, indirect instructions, and visual attacks. Existing defenses typically rely on analyzing the prompt or each potentially malicious input with a second large model at inference time, which can limit coverage or increase deployment cost. We propose ProjGuard, an alternative based on behavioral trajectory monitoring. At each step, we derive a lightweight scalar risk signal from the agent's accumulated interaction history and evaluate, online, whether execution is beginning to drift toward an unsafe region. This enables early warnings before the trajectory reaches a potentially harmful action. When an alert is raised, we selectively activate an auxiliary vision-language model to propose a corrected next step and steer execution back toward task completion. Experiments on OS-Harm show that monitoring with on-demand correction reduces the unsafe rate from 16 percent to 3 percent while improving task completion from 59 percent to 65 percent. We further evaluate transfer to RiosWorld, where the method remains competitive, reaching 4 percent unsafe and 64 percent completion. Overall, these results support a hierarchical safety strategy in which always-on monitoring anticipates deviations and activates correction only when needed.
压缩贝叶斯张量回归
Roberto Casarin, Radu Craiu, Qing Wang
AI总结 针对张量回归中的高维问题,提出广义张量随机投影方法将高维协变量嵌入低维子空间,结合贝叶斯推理框架和低秩参数表示,实现高效预测与计算成本降低。
为了解决张量回归中常见的高维问题,我们引入了一种广义张量随机投影方法,该方法将高维张量值协变量嵌入低维子空间,同时最小化响应信息的损失。该方法灵活,允许张量-wise、模式-wise 或组合随机投影作为特例。我们提供了一个贝叶斯推理框架,其特点是使用分层先验分布和参数的低秩表示。为随机投影的集中性质和贝叶斯推理的后验一致性提供了强有力的理论支持。开发了一个高效的吉布斯采样器来对压缩数据进行推理。为了减轻随机投影引入的敏感性,采用了贝叶斯模型平均,并使用逆逻辑回归估计归一化常数。进行了广泛的模拟研究,以检查不同调谐参数的影响。模拟表明,并且实际数据应用证实,与标准贝叶斯张量回归相比,压缩贝叶斯张量回归可以在显著降低计算成本的同时实现更好的样本外预测。
To address the common problem of high dimensionality in tensor regressions, we introduce a generalized tensor random projection method that embeds high-dimensional tensor-valued covariates into low-dimensional subspaces with minimal loss of information about the responses. The method is flexible, allowing for tensor-wise, mode-wise, or combined random projections as special cases. A Bayesian inference framework is provided featuring the use of a hierarchical prior distribution and a low-rank representation of the parameter. Strong theoretical support is provided for the concentration properties of the random projection and posterior consistency of the Bayesian inference. An efficient Gibbs sampler is developed to perform inference on the compressed data. To mitigate the sensitivity introduced by random projections, Bayesian model averaging is employed, with normalising constants estimated using reverse logistic regression. An extensive simulation study is conducted to examine the effects of different tuning parameters. Simulations indicate, and the real data application confirms, that compressed Bayesian tensor regression can achieve better out-of-sample prediction while significantly reducing computational cost compared to standard Bayesian tensor regression.
从扩散MRI实验估计空间平滑的纤维取向分布
Jilei Yang, Seungyong Hwang, Mengjie Shi, Jie Peng
AI总结 提出最近邻自适应回归模型(NARM),通过加权局部似然估计和空间邻域嵌套实现纤维取向分布(FOD)的空间自适应估计,引入体素级重缩放和数据驱动停止规则防止过平滑,并基于配置感知策略选择相似性平滑参数,在模拟和人类连接组项目数据中提高了估计准确性和可重复性。
扩散加权磁共振成像(D-MRI)是一种非侵入性体内技术,用于探测生物组织的微观结构架构。在每个体素处,纤维取向分布(FOD)表征局部纤维构型和方向,因此是D-MRI分析中的核心估计对象。我们提出了最近邻自适应回归模型(NARM),这是一种用于FOD估计的空间自适应框架,它在嵌套的空间邻域上执行加权局部似然估计,其中权重联合编码相邻FOD之间的空间邻近性和相似性,通过最优传输或Hellinger距离测量。为了防止过平滑同时保留结构异质性,我们引入了体素级重缩放方案和基于最小最近邻相异性的数据驱动停止规则。我们进一步开发了一种配置感知策略来选择相似性平滑参数,使平滑强度能够适应局部纤维复杂性。模拟研究表明,相对于体素级方法和现有的空间平滑方法PMARM,NARM提高了FOD估计精度。对人类连接组项目的重测数据的应用还表明,NARM产生了更可重复的FOD估计。实现细节以及模拟和真实数据分析的脚本可在以下网址获得:https://github.com/DMRIdotL/NARM
Diffusion-weighted magnetic resonance imaging (D-MRI) is a noninvasive in vivo technique for probing the microstructural architecture of biological tissues. At each voxel, the fiber orientation distribution (FOD) characterizes local fiber configurations and orientations and is therefore a central object of estimation in D-MRI analysis. We propose the Nearest-Neighbor Adaptive Regression Model (NARM), a spatially adaptive framework for FOD estimation that performs weighted local likelihood estimation over nested spatial neighborhoods, where the weights jointly encode spatial proximity and similarity among neighboring FODs, measured by either the optimal transport or Hellinger distance. To prevent over-smoothing while preserving structural heterogeneity, we introduce a voxel-wise rescaling scheme and a data-driven stopping rule based on minimum nearest-neighbor dissimilarity. We further develop a configuration-aware strategy for selecting the similarity-smoothing parameter, allowing the smoothing strength to adapt to local fiber complexity. Simulation studies demonstrate that NARM improves FOD estimation accuracy relative to voxel-wise methods and the existing spatial smoothing approach PMARM. Application to test-retest data from the Human Connectome Project additionally shows that NARM yields more reproducible FOD estimates. Implementation details and scripts for the simulation and real data analyses are available at this https URL