arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.19720 2026-06-19 eess.SP 新提交

An Optimization Framework for Certain Separable Problems using Neural Networks

基于神经网络的特定可分离问题优化框架

Rohit Negi, Soummya Kar

AI总结针对参数可分离的约束优化问题，提出离线学习与在线处理两阶段策略，利用ADMM和神经网络降低在线计算复杂度。

Comments 15 pages, 5 figures

详情

AI中文摘要

本文研究一类由实时应用驱动的参数约束优化问题。在参数可分离问题结构下，提出基于离线学习和在线处理的两阶段策略，以在资源受限设备上解决这些优化问题。具体地，利用可分离结构，开发了基于交替方向乘子法（ADMM）的迭代求解过程，该过程允许使用基于学习的函数表示（离线学习但在线可快速计算）来降低整体在线设备实现复杂度。通过精心设计ADMM过程，表明即使参数变化，参数优化问题的相应实例也可通过设备上的轻量级在线计算，借助神经网络协处理器求解。

英文摘要

This paper studies a class of parametric constrained optimization problems that are motivated by applications in real time applications. Under a parameter-separable problem structure that naturally arises in these applications, the paper proposes a two phase strategy, based on offline learning and online processing, to address these optimization problems on resource limited devices. Specifically, by exploiting the separable structure, an iterative Alternating Direction Method of Multipliers (ADMM) based solution procedure is developed that enables the use of certain learning based function representations (learned offline but readily computable online) to reduce the overall online on-device implementation complexity. By carefully crafting the ADMM procedure, it is shown that even as the parameters vary, the corresponding instances of the parametric optimization problem may be solved by lightweight online computations in the device with the assistance of a neural network co-processor.

URL PDF HTML ☆

赞 0 踩 0

2606.19666 2026-06-19 eess.SP 新提交

Degrees of Freedom and Beamforming for Large Intelligent Surfaces

大规模智能表面的自由度与波束赋形

Jiawang Li, Alireza Saberkari, Buon Kiong Lau, Mats Gustafsson

AI总结通过互阴影面积闭式表达式估计大规模智能表面（LIS）的空间自由度（DoF），并验证其与数值奇异值谱的吻合；基于DoF分析设计采样方案和波束赋形，证明可形成约DoF数量的独立波束，超过此限会导致干扰增加；极化研究表明电场分量对DoF贡献不均，总场DoF为单极化分量的两倍。

详情

AI中文摘要

空间自由度（DoF）、采样和波束赋形是多用户大规模智能表面（LIS）的基础，其中电磁场必须在多个近场位置进行成形、分辨和聚焦。本文利用互阴影面积的闭式表达式，针对代表性LIS配置估计了DoF数量。通过数值奇异值谱验证了所得DoF预测，其谱膝点与理论估计紧密吻合。对于线源配置，通过将源或观测线划分为单位DoF区间，开发了一种解析采样方案，从而能够选择空间样本。使用最大比传输和迫零的波束赋形结果表明，可以形成大约DoF数量的独立波束。试图超过此限制会导致干扰增加和性能下降。对于基于表面的LIS配置，采样点则通过离散经验插值方法数值确定。相应的波束赋形结果进一步证实，目标区域可以支持大约与DoF分析预测数量相同的独立波束。最后，一项极化感知研究表明，电场分量对DoF的贡献不相等，且总场DoF是单极化分量DoF的两倍。

英文摘要

Spatial degrees of freedom (DoF), sampling, and beamforming are fundamental to multi-user large intelligent surfaces (LISs), where electromagnetic fields must be shaped, resolved, and focused at multiple near-field locations. This work estimates the number of DoF using closed-form expressions derived from the mutual shadow area for representative LIS configurations. The resulting DoF predictions are validated through numerical singular-value spectra, whose spectral knee points closely match the theoretical estimates. For line-source configurations, an analytic sampling scheme is developed by partitioning the source or observation line into unit-DoF intervals, enabling the selection of spatial samples. Beamforming results using maximum-ratio transmission and zero-forcing demonstrate that approximately the number of DoF independent beams can be formed. Attempting to exceed this limit results in increased interference and degraded performance. For surface-based LIS configurations, sampling points are instead determined numerically using the discrete empirical interpolation method. The corresponding beamforming results further confirm that the target region can support approximately as many independent beams as predicted by the DoF analysis. Finally, a polarization-aware study reveals that the electric-field components contribute unequally to the DoF and that the total-field DoF is twice that of a single polarization component.

URL PDF HTML ☆

赞 0 踩 0

2606.19536 2026-06-19 eess.SP 新提交

Multistatic J-Band Radar TX/RX Chipset in SiGe BiCMOS with Integrated x16 Frequency Multiplier Chain and High EIRP

采用SiGe BiCMOS工艺的集成x16倍频链和高EIRP的多基地J波段雷达收发芯片组

Stephan Hauptmeier, Kennet Braasch, Till Ziegler-Bellenberg, Diana P. Cortes N., Tobias T. Braun, Michael Höft, Nils Pohl

AI总结本文设计并测量了一种多基地J波段雷达芯片组，包含集成x16倍频链的发射和接收MMIC，实现了高EIRP和远距离探测。

详情

AI中文摘要

本文介绍了一种多基地J波段雷达芯片组的设计与测量，该芯片组包括一个发射机和一个接收机MMIC，两者均集成了$\ imes$16倍频链，用于低频本振分配和可扩展雷达配置。多基地雷达架构可以同时维持高发射功率和高接收灵敏度，这一优势在本芯片组中得到了充分利用。为此，发射机MMIC上集成的四路功率合成放大器链提供了11.2 dBm的输出功率。在292 GHz下，使用准直PTFE透镜时测得的EIRP为41 dBm，无透镜时为8.8 dBm。尽管倍频因子较高，但片上谐波抑制优于24 dBc，而通过多个滤波器级实现了约50 dBc的辐射带内谐波抑制。接收机MMIC包含三级低噪声放大器，在292 GHz下整体转换增益为43.3 dB。集成的片上贴片天线便于系统集成，并可使用高方向性介质透镜，使该芯片组适用于长达150米的远距离雷达测量。MMIC采用130 nm SiGe BiCMOS工艺实现，其f_T和f_max分别为500 GHz和610 GHz。

英文摘要

This work presents the design and measurement of a multistatic J-band radar chipset comprising a transmitter and a receiver MMIC both featuring an integrated $times$16 frequency multiplier chain for low-frequency local-oscillator distribution and scalable radar configurations. Multistatic radar architectures can sustain high transmission power and high receiver sensitivity simultaneously an advantage that is fully leveraged in the present chipset. To this end a four-way power-combining amplifier chain integrated on the transmitter MMIC delivers an output power of 11.2 dBm. The resulting measured EIRP is 41 dBm at 292 GHz with a collimating PTFE lens and 8.8 dBm without a lens. Despite the high frequency-multiplication factor an on-chip harmonic rejection better than 24 dBc was measured while a radiated in-band harmonic rejection of approximately 50 dBc was achieved through multiple filter stages. The receiver MMIC incorporates a three-stage low-noise amplifier and exhibits an overall conversion gain of 43.3 dB at 292 GHz. Integrated on-chip patch antennas facilitate system integration and the use of highly directive dielectric lenses making the chipset suitable for long-range radar measurements which are demonstrated up to 150 m. The MMICs are realized in a 130 nm SiGe BiCMOS technology with an f_T and f_max of 500 GHz and 610 GHz respectively.

URL PDF HTML ☆

赞 0 踩 0

2606.19453 2026-06-19 eess.AS 新提交

A Survey of Full-Duplex Spoken Dialogue Systems: Architectural Hierarchy, Interaction Ontology, and Decision State Machine

全双工口语对话系统综述：架构层次、交互本体与决策状态机

Jingyu Lu, Yuhan Wang, Jianming Luo, Yifu Chen, Tianle Liang, Shengpeng Ji, Ziyue Jiang, Xiaoda Yang, Yu Zhang, Xize Cheng, Chenyuhao Wen, Changhao Pan, Haoxiao Wang, Chen Ye, Jian Wu, Xiaoxi Jiang, Guanjun Jiang, Zhou Zhao

AI总结针对全双工术语歧义，提出L0-L3架构层次、T×I×R交互本体和IDLE/LISTEN/SPEAK/WAIT/DUAL决策状态机三个框架，揭示现有系统在训练与评估中的实现差距。

Comments 34 pages, 5 figures, 7 tables. Project page and interactive demo: https://github.com/DuplexLM/DuplexSurvey

详情

AI中文摘要

近期有十余个口语对话系统声称实现了“全双工”，但该术语被用于描述本质上不同的能力。现有综述将它们归入单一轴（级联/端到端，或工程化/学习型），忽略了构建者最关心的区别。我们认为这种歧义很大程度上源于分类学问题：当前术语未明确双工决策在何处做出、支持哪些交互类型、以及系统如何逐时刻行为。本文引入三个互补框架：(i) L0-L3架构层次，定位双工决策位置；(ii) T×I×R交互本体，指定每次交互的时间关系、用户意图和所需系统响应；(iii) 决策状态机（IDLE/LISTEN/SPEAK/WAIT/DUAL），描述系统如何在状态间转换。通过对已发表系统和基准的审计，我们记录了一个实现差距：尽管许多架构原则上能在全双工状态下运行，但其观察到的行为仍受训练和评估中表示的交互模式约束。我们指出，相对于（大多未公开的）工业语料库，有限的公开训练数据覆盖范围，以及尚未实现的L3表示级建模目标，是全双工对话未来研究的关键前沿。相关材料见https://this https URL。

英文摘要

More than a dozen spoken dialogue systems have recently claimed to be "full-duplex," yet the term has been used to describe substantially different capabilities. Existing surveys collapse them onto a single axis (cascaded/end-to-end, or engineered/learned) and miss the distinctions that matter most for builders. We argue that much of this ambiguity is taxonomical: current terminology does not specify where duplex decisions are made, which interaction types are supported, or how a system behaves moment by moment. This paper introduces three complementary frameworks: (i) an L0-L3 Architectural Hierarchy that locates where duplex decisions are made; (ii) a $T\times I\times R$ Interaction Ontology that specifies the temporal relation, user intent, and required system response for each interaction; and (iii) a Decision State Machine (IDLE/LISTEN/SPEAK/WAIT/DUAL) that describes how systems move between states. Across published systems and benchmarks, our audit documents a realization gap: although many architectures can in principle operate in full-duplex states, their observed behavior remains constrained by the interaction patterns represented in training and evaluation. We point to the limited public training-data coverage relative to the (largely undisclosed) industrial corpora, together with the still-unrealized goal of L3 representation-level modeling, as the key frontiers for future research on full-duplex dialogue. The related material is available at https://github.com/DuplexLM/DuplexSurvey.

URL PDF HTML ☆

赞 0 踩 0

2606.20240 2026-06-19 econ.EM stat.AP 新提交

Two-Sample IV: Efficient Two-Step Estimation and Tests for Overidentification and Weak-Instruments

两样本IV：高效两步估计及过度识别与弱工具变量检验

Fatima Kasenally, Ruoxi Guan, Frank Windmeijer

AI总结针对两样本IV估计，提出异方差和样本异质性下稳健的两步高效估计方法及过度识别检验，仅需线性回归的汇总统计量，并扩展弱工具变量检验。

详情

AI中文摘要

两样本IV是一种流行的估计方法，当结果变量和处理变量在不同样本中可用，而工具变量在两个样本中都可用时。标准估计量是两样本两阶段最小二乘估计量，在同方差和样本同质性下是有效的。我们开发了一个稳健的两步程序，用于在一般异方差和样本异质性下进行有效估计，并提出了相关的两样本Hansen过度识别检验。我们方法的一个关键特征是只需要两个样本中简化形式和第一阶段的线性回归的汇总统计量。这些是估计系数向量的六个对象，以及同方差和异方差稳健的估计方差矩阵。我们进一步表明，在同方差和同质性下，处理样本中的第一阶段F统计量可以按标准方式用作弱工具变量检验，这里的相对偏差是比例偏差。我们提出了Montiel-Olea和Pflueger (2013)的有效F统计量的扩展，用于异方差情况，遵循Windmeijer (2025)的推广。我们在Marshall (2019)研究教育对投票行为影响的应用中说明了估计量和检验，并进行了聚类稳健推断。

英文摘要

Two-sample IV is a popular estimation method when the outcome and treatment variables are available in different samples, whereas instruments are available in both samples. The standard estimator is two-sample two-stage least squares estimator, which is efficient under homoskedasticity and homogeneity of the samples. We develop a robust two-step procedure for efficient estimation under general heteroskedasticity and heterogeneity of the samples, and propose a related two-sample Hansen overidentification test. A key feature of our approach is that only summary statistics from the linear regressions of the reduced form and first-stage in the two samples are needed. These are the six objects of the estimated coefficient vectors, and the homoskedastic and heteroskedasticity robust estimated variance matrices. We further show that the first-stage F-statistic in the treatment sample can be used as a test for weak instruments in the standard way under homoskedasticity and homogeneity, with the relative bias here a proportional bias. We propose an extension of the effective F-statistic of Montiel-Olea and Pflueger (2013) for the heteroskedastic case, following the generalization in Windmeijer (2025). We illustrate the estimators and tests in an application studying the effect of education on voting behavior from Marshall (2019), with cluster robust inference.

URL PDF HTML ☆

赞 0 踩 0

2606.20514 2026-06-19 stat.ME 新提交

Hypergraph Variable Selection with False Discovery Rate Control

具有错误发现率控制的超图变量选择

Sarah Organ, Toby Kenney, Hong Gu

AI总结针对预测变量复杂依赖结构导致变量选择方法功效降低的问题，提出基于超图的选择方法，在控制错误发现率的同时提高选择功效。

Comments 28 pages, 4 figures

2606.20406 2026-06-19 stat.ME stat.CO 新提交

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

双峰分布的灵活建模：基于偏斜-t分布的混合模型

Marco Bee, Flavio Santi

AI总结提出基于Fernández和Steel (1998)偏斜-t分布的混合模型，通过EM算法进行极大似然估计，并开发似然比检验，用于拟合双峰、偏斜和厚尾数据，在标准普尔500指数中验证了双峰性。

详情

AI中文摘要

我们提出了一种位置-尺度偏斜-t分布的混合模型，用于拟合双峰、偏斜和厚尾数据。特别地，该混合模型基于Fernández和Steel (1998)的偏斜-t分布，因此模型构建过程可以轻松扩展到其他对称分布的混合。在研究了混合模型的性质后，我们通过EM算法开发了极大似然估计方法，并提出了一个似然比检验，用于检验任何给定成分中无偏斜的原假设。与最近提出的g-and-h分布混合的基于模拟的比较表明，所提出模型在良好指定设置下的估计精度和错误指定框架下的建模能力方面均表现出色。将该模型拟合到标准普尔500指数失真数据，证实了其分布的双峰性，这意味着美国股市历史上处于熊市或牛市状态，而非接近其基本面价值。

英文摘要

We propose a mixture of location-scale skewed-$t$ distributions to fit bimodal, skewed and heavy-tailed data. In particular, the mixture is based on the skewed-$t$ distribution by Fernández and Steel (1998), so that the model-building procedure can be easily extended to mixtures of other symmetric distributions. After studying the properties of the mixture, we develop a maximum likelihood estimation approach via the EM algorithm and a likelihood ratio test of the null hypothesis of no skewness in any given component. A simulation-based comparison to a recently proposed mixture of g-and-h distributions suggests that the performance of the proposed model is excellent, in terms of both estimation precision in well-specified setups and modeling capability in mis-specified frameworks. Fitting the model to the Standard & Poor's 500 distortion allows us to confirm the bimodality of its distribution, with the implication that the US stock market has historically been in bearish or bullish conditions, rather than near its fundamental value.

URL PDF HTML ☆

赞 0 踩 0

2606.20341 2026-06-19 stat.ME stat.AP 新提交

Anchors Away: Navigating Unanchored Indirect Comparisons with Multilevel Unanchored Meta-Regression (ML-UMR)

锚定之外：使用多层次非锚定元回归（ML-UMR）导航非锚定间接比较

Conor Chandler, Jack Ishak

AI总结针对随机证据缺失时的非锚定治疗比较，提出多层次非锚定元回归（ML-UMR），通过贝叶斯框架联合建模个体与汇总数据，估计多治疗、多研究及目标人群的边际和条件效应，并明确识别假设与可转移性假设。

Comments 20 pages (excluding supplementary material), 5 figures

详情

AI中文摘要

当随机证据不可用时，使用单臂研究或断开证据的非锚定间接治疗比较越来越多地用于卫生技术评估（HTA）。现有方法，包括匹配调整间接比较（MAIC）和模拟治疗比较（STC），通常局限于成对设置，并且通常估计比较研究人群中的边际效应，这可能与决策相关人群不同。我们提出多层次非锚定元回归（ML-UMR），一种用于综合来自完全断开证据的个体患者数据和汇总数据的贝叶斯回归框架。ML-UMR通过在一个统一似然中联合建模个体水平和汇总水平数据，将多层次网络元回归（ML-NMR）扩展到非锚定设置，从而能够估计跨多个治疗、研究和目标人群的治疗特异性结果以及边际和条件效应。ML-UMR区分了识别治疗效应所需的假设与将结果转移到目标人群所需的假设。与所有非锚定比较一样，有效推断依赖于强且通常不可验证的假设，包括条件可交换性、结果模型的正确设定以及跨治疗假设（例如，共享预后因素假设（SPFA））。ML-UMR并未减轻这些要求，而是在统一框架内使其明确，并促进敏感性分析。在模拟研究中，ML-UMR对比较人群效应产生了低偏差和名义覆盖。向其他人群的可转移性关键取决于识别假设：在强效应修饰下，违反SPFA导致偏差，而纳入亚组信息则恢复了近乎无偏的估计和名义覆盖。

英文摘要

Unanchored indirect treatment comparisons using single-arm studies or disconnected evidence are increasingly used in health technology assessment (HTA) when randomized evidence is unavailable. Existing methods, including matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC), are generally limited to pairwise settings and typically estimate marginal effects in the comparator study population, which may differ from the decision-relevant population. We propose multilevel unanchored meta-regression (ML-UMR), a Bayesian regression framework for synthesizing individual patient data and aggregate data from fully disconnected evidence. ML-UMR extends multilevel network meta-regression (ML-NMR) to unanchored settings by jointly modeling individual- and aggregate-level data within a unified likelihood, enabling estimation of treatment-specific outcomes and both marginal and conditional effects across multiple treatments, studies, and target populations. ML-UMR distinguishes assumptions required to identify treatment effects from those required to transport results to target populations. As with all unanchored comparisons, valid inference relies on strong and often unverifiable assumptions, including conditional exchangeability, correct specification of the outcome model, and cross-treatment assumptions (e.g., shared prognostic factor assumption (SPFA)). ML-UMR does not lessen these requirements but makes them explicit within a unified framework and facilitates sensitivity analyses. In simulation studies, ML-UMR produced low bias and nominal coverage for comparator-population effects. Transportability to alternative populations depended critically on identifying assumptions: violations of SPFA led to bias under strong effect modification, whereas incorporating subgroup information restored near-unbiased estimation and nominal coverage.

URL PDF HTML ☆

赞 0 踩 0

2606.20226 2026-06-19 stat.ME stat.CO 新提交

Analysis of uncertain fixed-effects model for Latin square designs

拉丁方设计的不确定固定效应模型分析

Yaru Cheng, Zhiming Li

AI总结针对无频率稳定性的不确定实验数据，建立拉丁方设计的不确定固定效应模型，提出三种估计方法并构建置信区间，进行不确定齐性检验和常见检验，通过数值模拟和实例验证模型有效性。

2606.20191 2026-06-19 stat.ML stat.ME 新提交

AK-MCS-C2 : Active Kriging Monte Carlo Simulation method with conformal certification for failure probability estimation

AK-MCS-C2: 具有共形认证的主动克里金蒙特卡洛模拟方法用于失效概率估计

Edgar Jaber, Vincent Chabridon, Mathilde Mougeot

AI总结提出一种结合主动克里金蒙特卡洛模拟与共形预测的主动学习框架，通过自适应交叉共形策略和J+GP共形估计器，在少量样本下提供无分布假设的预测误差保证，提高极限状态面附近样本分类可靠性，从而提升失效概率估计的准确性和鲁棒性。

2606.20148 2026-06-19 stat.ME 新提交

A case study of causal mediation using Bayesian nonparametrics and semiparametric corrections

使用贝叶斯非参数和半参数修正的因果中介分析案例研究

Yuhua Zhang, Michael J. Daniels

AI总结提出截断富集狄利克雷过程混合模型估计自然直接和间接效应，结合高效MCMC算法和基于有效影响函数的一步后验修正，解决贝叶斯非参数中因果估计量的可靠推断问题。

2606.20141 2026-06-19 stat.CO 新提交

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

DASH: 一种用于大规模凸MIQP的降维方法及其在子集投资组合选择中的应用

Pinzhang Cheng

AI总结提出DASH降维方法，通过减少变量层次改善大规模凸MIQP求解器性能，在子集投资组合选择中显著提升Gurobi难以求解问题的初始解质量。

详情

AI中文摘要

作为MIP（混合整数规划）的子集选择问题是NP难的。对于大规模问题，在合理时间内找到全局最优解是不可行的，实践中常通过MIP求解器寻找高质量的初始解。本文提出DASH（递减活动集层次）——一种降维方法，针对可表述为MIQP（混合整数二次规划）的一类最佳子集选择问题，提高MIP求解器的性能。我们在子集投资组合选择问题中开发并评估了DASH的性能，并与商业MIP求解器Gurobi进行了比较。除了问题规模外，问题的难度还与协方差矩阵的条件数以及投资组合权重的箱约束有关。大量不同问题配置的数值实验表明，当Gurobi难以求解问题时，DASH能持续显著改进初始解。特别是，DASH改进的幅度和持续时间随问题难度增加而扩大。

英文摘要

Subset selection problems as MIPs (Mixed Integer Programs) are NP-hard. For large scale problems, it is infeasible to find global optimal solutions in a reasonable time and good-quality incumbent solutions are sought after with MIP solvers in practice. This paper proposes DASH (Decreasing Active Set Hierarchy) -- a dimensionality reduction method that improves the MIP solver performance for a subclass of best subset selection problems that can be formulated as MIQPs (Mixed Integer Quadratic Programs). We develop and evaluate the performance of DASH in the subset portfolio selection problem with comparison to Gurobi, a commercial MIP solver. In addition to the problem size, the difficulty of a problem is related to the condition number of the covariance matrix and the box constraint on portfolio weights. An extensive set of numerical experiments with varying problem configurations shows that DASH offers consistent and significant improvement of incumbent solutions when the problem is difficult to solve by Gurobi. In particular, the magnitude and duration of improvement by DASH scale with the difficulty of the problem.

URL PDF HTML ☆

赞 0 踩 0

2606.20114 2026-06-19 stat.ME stat.AP 新提交

Community detection in small-sample ordinal regimes: A benchmarking framework for Delphi data

小样本有序情境下的社区检测：德尔菲数据的基准测试框架

Yuri Calleo, Simone Di Zio, Fabrizio Maturo

AI总结针对德尔菲数据高维小样本导致的秩亏问题，提出从变量中心协方差模型转向网络中心连接模型，利用社区检测算法识别潜在主题结构，实现结构稳定的降维。

详情

AI中文摘要

德尔菲数据共识的统计建模面临一个关键瓶颈：问卷项目的高维性与专家小组有限样本量之间的矛盾。这种秩亏导致传统潜变量模型（如主成分分析）结构不稳定且易过拟合。为弥补这一方法论空白，本研究提出从变量中心协方差模型转向网络中心连接模型。通过将项目相关性映射到加权图拓扑，我们提出了一个基于模拟的基准测试，利用社区检测算法识别潜在主题结构，有效解决了高维小样本情境下典型的谱不稳定性和秩亏问题。该研究系统评估了基于结构密度、信息流和谱划分的拓扑方法在合成数据集上的鲁棒性，这些数据集旨在复制共识数据的病理条件，包括有序量表和系统噪声。核心方法论贡献在于证明专家判断间的共线性——传统上被视为需要正则化的统计冗余——可以有效地重新解释为凝聚的拓扑信号。该框架为研究人员提供了一种结构化的自动降维程序，确保即使在标准因子分析失效的小样本情境下也能保持结构稳定性和心理测量一致性。

英文摘要

The statistical modeling of consensus in Delphi data faces a critical bottleneck: the high dimensionality of questionnaire items relative to the limited sample size of expert panels. This rank deficiency leads traditional latent variable models, such as Principal Component Analysis, to be structurally unstable and prone to overfitting. Addressing this methodological gap, this study proposes a transition from variable-centric covariance models to network-centric connectivity models. By mapping item correlations onto a weighted graph topology, we present a simulation-based benchmark that utilizes community detection algorithms to identify latent thematic structures, effectively addressing the spectral instability and rank deficiency typical of high-dimensional, low-sample-size regimes. The research systematically evaluates the robustness of topological approaches based on structural density, information flow, and spectral partitioning against synthetic datasets designed to replicate the pathological conditions of consensus data, including ordinal scales and systemic noise. The central methodological contribution lies in demonstrating that collinearity among expert judgments - traditionally treated as statistical redundancy to be regularized - can be effectively reinterpreted as a topological signal of cohesion. This framework provides researchers with a structured and automated procedure for dimensionality reduction, ensuring structural stability and psychometric consistency even in small-sample regimes where standard factor analysis breaks down.

URL PDF HTML ☆

赞 0 踩 0

2606.20078 2026-06-19 stat.OT 新提交

A Law of Iterated Expectation Primer for Causal Inference

因果推断中的迭代期望定律入门

Ashley I. Naimi, Razieh Nabi, Lindsay J. Collin, Paul N. Zivich, Stephen R. Cole

AI总结本文介绍迭代期望定律及其在因果效应识别中的应用，通过g公式的两种非参数等价形式（NICE和ICE）和三个数值示例阐明其数学直觉。

详情

AI中文摘要

g公式是识别观察数据中因果效应的基础工具，它基于迭代期望定律——统计学中的一个关键数学恒等式。然而，表达迭代期望定律和g公式的符号对于统计背景不足的人来说可能难以理解。我们提供了一篇入门文章，介绍迭代期望定律、用于表达它的积分符号，以及它通过g公式在因果效应识别中的作用。在因果一致性、正性和条件可交换性假设下，迭代期望定律可以重写为因果标准化公式（g公式），有两种非参数等价形式：非迭代条件期望（NICE）形式，涉及条件结果均值的单一加权平均；以及迭代条件期望（ICE）形式，涉及嵌套期望。我们通过三个逐步复杂的数值示例说明这两种形式：一个时间固定示例，包含单个二元混杂因子；一个时间固定示例，包含离散和连续混杂因子；以及一个时间变化示例，包含两个时间点。我们阐明了迭代期望定律是什么，它与g公式的关系，以及如何在实际数据示例中理解其数学公式的直觉，这些示例可以推广到各种场景。

英文摘要

The g-formula is a foundational tool for identifying causal effects in observational data. This tool is based on the law of iterated expectation, a key mathematical identity in statistics. However, the notation with which the law of iterated expectation and the g-formula is expressed can be opaque to those with little background in statistics. We provide a primer introducing the law of iterated expectation, the integration notation used to express it, and its role for causal effect identification via the g-formula. Under the assumptions of causal consistency, positivity, and conditional exchangeability, the law of iterated expectation can be rewritten as a causal standardization formula (the g-formula) in two nonparametrically equivalent forms: a non-iterative conditional expectation (NICE) form involving a single weighted average of conditional outcome means, and an iterative conditional expectation (ICE) form involving nested expectations. We illustrate both forms using three progressively complex numerical examples: a time-fixed example with a single binary confounder, a time-fixed example with discrete and continuous confounders, and a time-varying example with two timepoints. We provide clarity on what the law of iterated expectation is, how it is related to the g-formula, and how to gain intuition of its mathematical formulations in actual data examples that can be generalized to a range of settings.

URL PDF HTML ☆

赞 0 踩 0

2606.20069 2026-06-19 stat.ME 新提交

A minimum-risk and cost-efficient two-sample sequential testing framework for the shifted exponential models with application to precipitation data

移位指数模型的最小风险与成本高效双序贯检验框架及其在降水数据中的应用

Ashwani Rajput, Neeraj Joshi

AI总结提出一种双序贯抽样框架，通过控制第一类错误概率并最小化包含第二类错误和抽样成本的损失函数，检验两个移位指数模型的位置参数差异，具有一阶、二阶效率和风险效率。

详情

AI中文摘要

本文通过一种新颖的双序贯抽样框架，研究了比较两个移位指数模型位置参数的问题。所提出的假设检验过程通过将第一类错误概率控制在预设水平，同时最小化包含第二类错误概率和相应抽样成本的损失函数来开发。相应的最优固定样本量表达式依赖于未知的尺度参数，这使得在固定样本设计下，期望的检验精度在实践中无法实现。为克服这一困难，提出了一种双序贯抽样程序，用于在尺度参数未知且不等时检验位置参数之间的差异。所提出的方法具有理想的新近性质，包括一阶效率、二阶效率和二阶风险效率。广泛的模拟研究和涉及气象站强降水事件的实际数据应用证明了所提出程序的实际有效性和适用性。

英文摘要

This paper investigates the problem of comparing the location parameters of two shifted exponential models through a novel double sequential sampling framework. The proposed hypothesis testing procedure is developed by controlling the type I error probability at a preassigned level while minimizing a loss function that incorporates both the type II error probability and the associated sampling cost. The corresponding optimal fixed-sample-size expressions are shown to depend on unknown scale parameters, rendering the desired testing accuracies unattainable in practice under fixed-sample designs. To overcome this difficulty, a double sequential sampling procedure is proposed to test the difference between location parameters when the scale parameters are unknown and unequal. The proposed methodology is shown to possess desirable asymptotic properties, including first-order efficiency, second-order efficiency, and second-order risk efficiency. Extensive simulation studies and a real-data application that involves heavy precipitation episodes at meteorological stations demonstrate the practical effectiveness and applicability of the proposed procedure.

URL PDF HTML ☆

赞 0 踩 0

2606.19982 2026-06-19 stat.ME 新提交

Built-in Selection Bias in Proportional Hazards Models with Omitted Covariates: Simulation Evidence and Alternative Approaches

省略协变量的比例风险模型中的内置选择偏倚：模拟证据与替代方法

Ayoub Bifenzi, Helene Jacqmin-Gadda

AI总结本文通过模拟和实际数据，证明在随机试验中，即使省略的协变量与处理独立，仍会导致Cox比例风险模型估计的处理风险比存在偏倚，并比较了脆弱模型、加速失效时间模型和Kaplan-Meier曲线等替代方法的稳健性。

详情

AI中文摘要

在时间-事件分析中，来自Cox比例风险（PH）模型的风险比（HR）是评估治疗效果最常用且广泛报告的指标。然而，由于风险比固有地依赖于每个时间点的生存条件，它们具有非可压缩性。因此，当存在因省略重要协变量导致的未测量异质性时，即使这些协变量在基线时与主要暴露独立（如随机对照试验中），风险比也会受到内置选择偏倚的影响。本文旨在概述文献中关于未观测异质性（由影响结局的省略协变量引起）如何在标准比例风险模型中偏倚治疗风险比估计的关键发现，即使在处理分配独立于这些协变量的随机试验中也是如此。通过模拟，我们评估了半参数Cox PH模型和参数PH模型在各种未测量异质性场景下的偏倚程度。然后，我们将这些标准模型与替代方法进行比较，这些方法要么解决了这一问题，要么被认为对此具有稳健性。这些替代方法包括来自脆弱模型的风险比、来自加速失效时间（AFT）模型的回归参数，以及使用Kaplan-Meier曲线非参数估计或基于具有时变暴露效应的Cox模型估计的治疗组间生存差异。我们通过一个来自放射治疗肿瘤学组（RTOG 9202）的随机对照试验的实际数据应用，说明了所探索替代方法的实际相关性。

英文摘要

In time-to-event analysis, the hazard ratio (HR) derived from the Cox proportional hazards (PH) model is the most commonly used and widely reported measure for assessing treatment effects. However, hazard ratios are non-collapsible due to their inherent conditioning on survival up to each time point. As a result, they are subject to built-in selection bias in the presence of unmeasured heterogeneity arising from omitted important covariates, even when these covariates are independent of the main exposure at baseline, as is the case in randomized controlled trials. This article aims to provide an overview of key findings from the literature on how unobserved heterogeneity, due to omitted covariates that affect the outcome, can bias the estimation of the treatment hazard ratio in standard proportional hazards models, even in randomized trials where treatment is assigned independently of such covariates. Through simulations, we evaluate the extent of bias in the semi-parametric Cox PH model and parametric PH model under various scenarios of unmeasured heterogeneity. We then compare these standard models to alternative approaches that either account for this issue or are considered robust to it. These alternatives include the hazard ratio estimated from frailty models, regression parameters from an Accelerated Failure Time (AFT) model, and survival differences between treatment groups estimated nonparametrically using Kaplan-Meier curves or based on a Cox model with time-dependent effect of the exposure. We illustrate the practical relevance of the explored alternatives through a real data application to a randomized controlled trial from the Radiation Therapy Oncology Group (RTOG 9202).

URL PDF HTML ☆

赞 0 踩 0

2606.19892 2026-06-19 stat.ME 新提交

The Ghosh-Lin and Fine-Gray models for a mix of administrative and random censoring

混合行政删失与随机删失下的Ghosh-Lin和Fine-Gray模型

Thomas H. Scheike, Christian Mirian, Isao Yokota, Giuliana Cortese

AI总结针对同时存在行政删失和随机删失的数据，提出结合风险集调整和逆概率删失加权的方法，使Ghosh-Lin和Fine-Gray模型得到一致估计。

详情

AI中文摘要

复发事件或竞争风险回归模型通常应用于生物医学领域，两者都可视为边际模型。在存在右删失的情况下，需要调整这些模型以获得一致估计量。当删失是行政性时，边际回归模型特别容易估计。然而，当删失是随机作用时，通常考虑逆概率删失加权（IPCW）调整来获得参数估计。该技术通过正确的删失模型进行删失权重调整，但对于行政删失，只需修改风险集即可正确调整。在实践中，对于大型中央登记处或某些临床试验，所有受试者的行政删失时间已知，但通常也会有一定比例的受试者被随机删失。在这项工作中，我们考虑两种常用的回归方法：用于带有终止事件的复发事件的Ghosh-Lin模型和用于竞争事件的Fine-Gray模型。对于这两种情况，当同时存在行政删失和随机删失时，我们展示了如何通过处理这两种不同类型删失的组合，在最小化建模假设的基础上获得正确估计。

英文摘要

Recurrent events or competing risks regression models are often applied in the bio-medical setting and both can be considered as marginal models. In presence of right-censoring, such models need to be adjusted to give consistent estimators. When censoring is administrative, marginal regression models are particularly easy to estimate. However, when censoring is instead acting randomly, inverse probability of censoring weighting (IPCW) adjustments are typically considered to obtain parameter estimates. This technique relies on a censoring-weights adjustment via a correct censoring model, but for administrative censoring the adjustment is done correctly simply by modifying the risk-set. In practice for large central registries or some clinical trials, the administrative censoring time will be known for all subjects, but there will typically also be a proportion of subjects that are censored at random. In this work, we consider two frequently used regression approaches, the Ghosh-Lin model for recurrent events with terminal events and the Fine-Gray model for competing events. For these two settings, when both administrative and random censoring are present, we demonstrate how to obtain correct estimation by dealing with the combination of the two different types of censoring relying on a minimum of modeling assumptions.

URL PDF HTML ☆

赞 0 踩 0

2606.19760 2026-06-19 stat.AP 新提交

Covariate-Adjusted Functional Principal Components Analysis for Modeling Hazard Rates of Physical Activity in the US Population

协变量调整的功能主成分分析用于建模美国人口体力活动的风险率

Md Rokibul Hasan, Pratim Guha Niyogi

AI总结提出基于风险函数的分布分析方法，利用功能主成分分析（FPCA）从腕部加速度计数据中刻画个体活动强度分布变异，优于均值摘要。

详情

AI中文摘要

体力活动在人类健康中起着至关重要的作用。其整体分布因人而异。常用的汇总指标无法描述这种分布模式。我们提出了一种基于分布的分析方法，通过从腕部加速度计数据中导出的风险函数来建模个体活动强度模式，从而描述体力活动。我们分析了2011-2012年国家健康与营养调查（NHANES）中4297名连续佩戴设备7天的成年人的分钟级独立于监测器的运动摘要（MIMS）数据。我们使用基于生存的方法为每个个体在共同强度网格上导出了非参数活动强度风险，将MIMS的风险曲线及其对数变换后的MIMS都视为功能对象。我们在MIMS的两个尺度上使用功能主成分分析（FPCA）来表征活动强度分布的主要变异模式。组均值风险函数在低强度水平上差异很小，而在高强度水平上我们观察到显著差异。我们的结果表明，基于风险的功能表示方法能够捕捉个体间体力活动强度分布的差异，提供了一种灵活且可解释的方式来表征异质性。该方法优于基于均值的摘要，并支持对人口亚组之间体力活动模式进行有原则的比较。

英文摘要

Physical activity plays a vital role in human health. Its entire distribution differs among people. Commonly used summary measures cannot describe this distributional pattern. We present a distribution-based analytical approach to describe physical activity by modeling individual-level activity-intensity patterns through hazard functions derived from wrist-worn accelerometer data. We analyzed minute-level Monitor-Independent Movement Summary (MIMS) data of 4297 adults with seven continuous days of device wear from the 2011- 2012 National Health and Nutrition Examination Survey (NHANES). We derived a nonparametric activity-intensity hazard using a survival-based approach for each individual on a common intensity grid, treating both the hazard curves from MIMS and their log-transformed MIMS as functional objects. We used functional principal component analysis (FPCA) on both scales of MIMS to characterize dominant modes of variation in activity-intensity distributions. Group-wise mean hazard functions showed little difference at lower intensity levels, while we observed a substantial difference at higher intensity levels. Our results demonstrate that hazard-based functional representations for capturing differences in physical activity intensity distributions across individuals offer a flexible and interpretable way to characterize heterogeneity. This approach works better than mean-based summaries and supports principled comparisons of physical activity patterns across population subgroups.

URL PDF HTML ☆

赞 0 踩 0

2606.19743 2026-06-19 stat.ME stat.AP 新提交

A Bayesian spatio-temporal nearest neighbor Gaussian process model for pooled genetic data

一种用于汇总遗传数据的贝叶斯时空最近邻高斯过程模型

Imke Botha, Tianxiao Hao, Lucinda E. Harrison, Nick Golding, Daniel J. Weiss, Jennifer A. Flegg

AI总结提出最近邻高斯过程模型，结合序贯蒙特卡洛平方算法，高效推断汇总遗传数据中的单倍型频率，并应用于非洲抗疟药物耐药性遗传数据分析。

详情

AI中文摘要

大规模遗传数据集通常汇总不同遗传标记的总等位基因计数。从这些汇总数据中推断单倍型频率（即多标记等位基因的频率）是一个挑战。由于计算成本，先前在此背景下的时空建模仅限于3个标记。在这项工作中，我们提出了一种最近邻高斯过程（NNGP）模型，以改善随标记和观测数量扩展的规模。为了推断模型参数，我们开发了一种新颖的序贯蒙特卡洛平方算法，该算法使用带有祖先抽样的粒子吉布斯来变异NNGP函数值。后者在观测数量和NNGP数量上具有线性成本，并可应用于广泛的NNGP模型。作为案例研究，我们分析了与非洲抗疟药物耐药性相关的遗传数据，并在3和6个遗传标记数据集上实证展示了我们的扩展结果。

英文摘要

Large scale genetic datasets often aggregate the total allele counts of distinct genetic markers. Inferring haplotype frequencies (i.e.\ the frequency of multimarker alleles) from these pooled data is a challenge. Previous spatio-temporal modelling in this context has been limited to 3 markers due to the computational cost. In this work, we propose a nearest neighbor Gaussian process (NNGP) model to improve scaling with the number of markers and observations. To infer the parameters of our model, we develop a novel sequential Monte Carlo squared algorithm, which uses particle Gibbs with ancestor sampling to mutate the NNGP function values. The latter has a linear cost in the number of observations and the number of NNGPs, and can be applied to a broad range of NNGP models. As a case study, we analyse genetic data relating to antimalarial drug resistance in Africa, and show our scaling results empirically on a 3 and 6 genetic marker dataset.

URL PDF HTML ☆

赞 0 踩 0

2606.19737 2026-06-19 stat.ME stat.ML 新提交

Calibration without labels in multiple testing

多重检验中的无标签校准

Adway S. Wadekar, Jake A. Soloff

AI总结针对多重检验中无法观测真实标签的难题，利用有序p值间距构造伪标签，实现局部错误发现率的校准，并揭示q值在心理学和神经科学文献中可能严重失准。

详情

AI中文摘要

大规模假设检验支持对单个假设的概率性声明，如经验贝叶斯方法估计局部错误发现率。我们研究如何将这些声明解释为原假设的近似校准预测，即使在模型误设定下也能产生可解释的错误概率。我们的方法从概率预测中汲取概念灵感，但面临不同的挑战：与预测不同（标签最终可观测），在多重检验中真实情况从未揭示，因此校准必须随机评估并间接建立。我们通过构造一组伪标签来应对这一挑战，这些伪标签源自有序$p$值的间距，并以局部错误发现率作为回归目标。我们的构造解锁了现有工具，用于评估和执行多重检验中的事后校准。值得注意的是，我们在对已发表的心理学和神经科学文献的大规模实证调查中发现，基于错误发现率的流行误差度量$q$值可能严重失准。

英文摘要

Large-scale hypothesis testing supports probability claims about individual hypotheses, as in empirical Bayes methods for estimating local false discovery rates. We study how such claims can be interpreted as approximately calibrated forecasts of the null hypothesis, yielding interpretable error probabilities even under model misspecification. Our approach draws conceptual inspiration from probabilistic forecasting but addresses a different challenge: unlike forecasting, where labels are eventually observed, in multiple testing the ground truth is never revealed, so calibration must be assessed stochastically and established indirectly. We address this challenge by constructing a set of pseudo-labels, derived from the spacings of ordered $p$-values, which have the local false discovery rate as their regression target. Our construction unlocks existing tools for assessing and performing post-hoc calibration in multiple testing. Notably, we find on a large-scale empirical survey of published psychology and neuroscience literature that the $q$-value, a popular error measure based on the false discovery rate, can be severely miscalibrated.

URL PDF HTML ☆

赞 0 踩 0

2606.19580 2026-06-19 stat.ME stat.ML 新提交

Machine Learning Integrated in Wavelet Shrinkage (MLShrink)

机器学习集成小波收缩 (MLShrink)

Dixon Vimalajeewa, Vijini Lakmini, Brani Vidakovic

AI总结提出MLShrink，结合小波收缩与机器学习，通过双阈值对中间带系数进行数据自适应分类，保留经典阈值简单性，理论证明其非扩张性和oracle一致性，在非平滑信号上表现优异。

详情

AI中文摘要

实践中遇到的数据经常被加性噪声污染，小波收缩仍是非参数估计中恢复潜在信号的基本工具。经典方法如硬阈值和软阈值几乎完全根据系数的大小决定是否保留。尽管在许多情况下有效，这些规则对于幅度落在信号与噪声区分不确定的中间区域的系数可能过于僵化。我们提出MLShrink，一种将小波收缩与机器学习相结合的双阈值小波去噪过程。低于下阈值的系数被丢弃，高于上阈值的系数被保留，中间带的系数使用局部小波域特征进行分类。这样，MLShrink在远离决策边界处保留了经典阈值的简单性，同时允许对模糊系数进行数据自适应决策。本文还为此架构开发了一个理论框架。我们证明MLShrink是一个非扩张的支持选择规则，推导出一个基于oracle的风险分解，表明多余的去噪风险由未决策带上的分类误差决定，并在分类器性能的适当假设下建立了oracle一致性结果。在标准基准信号上的模拟实验表明，MLShrink与几种已建立的小波收缩方法具有竞争力，尤其适用于具有不规则、边缘丰富或非平滑结构的信号。这些发现表明，中间阈值带上的学习决策为经典小波去噪与现代统计学习之间提供了有用且可解释的联系。

英文摘要

Data encountered in practice are frequently contaminated by additive noise, and wavelet shrinkage remains a fundamental tool for recovering underlying signals in nonparametric estimation. Classical procedures such as hard and soft thresholding decide whether to retain a wavelet coefficient almost entirely from its magnitude. Although effective in many settings, these rules can be too rigid for coefficients whose magnitudes fall in an intermediate region where the distinction between signal and noise is uncertain. We propose MLShrink, a two-threshold wavelet denoising procedure that combines wavelet shrinkage with machine learning. Coefficients below a lower threshold are discarded, coefficients above an upper threshold are retained, and coefficients in the intermediate band are classified using local wavelet-domain features. In this way, MLShrink preserves the simplicity of classical thresholding away from the decision boundary while allowing data-adaptive decisions for ambiguous coefficients. The paper also develops a theoretical framework tailored to this architecture. We show that MLShrink is a nonexpansive support-selection rule, derive an oracle-based risk decomposition showing that excess denoising risk is determined by classification errors on the undecided band, and establish an oracle-consistency result under suitable assumptions on classifier performance. Simulation experiments on standard benchmark signals indicate that MLShrink is competitive with several established wavelet shrinkage methods and is especially effective for signals with irregular, edge-rich, or non-smooth structure. These findings suggest that learned decisions on the intermediate threshold band provide a useful and interpretable connection between classical wavelet denoising and modern statistical learning.

URL PDF HTML ☆

赞 0 踩 0

2606.19572 2026-06-19 stat.ME 新提交

SCOPE Shrinkage: A Unified Framework for Wavelet Denoising

SCOPE 收缩：小波去噪的统一框架

Dixon Vimalajeewa, Vijini Lakmini, Malith Premarathna, Fabrizio Ruggeri, Brani Vidakovic

AI总结提出基于对称单峰分布累积分布函数的SCOPE收缩族，通过两个可解释参数分离尺度与形状效应，实现局部强收缩与渐近无偏的平衡，在小波去噪中性能与可解释性兼具。

详情

AI中文摘要

我们引入了对称CDF导向概率增强（SCOPE）收缩，这是一个由对称单峰分布的中心累积分布函数构造的保号收缩规则统一族。所提出的框架生成了一类广泛的衰减轮廓，在零点附近强局部收缩与尾部渐近无偏行为之间插值。我们开发了一个通用公式，通过两个可解释参数分离尺度与形状效应，从而能够独立控制有效的阈值位置和过渡锐度。在明确的规律性假设下，建立了SCOPE收缩的结构性质，包括奇性、单调性、连续性、收缩性以及将规则与软化阈值算子联系起来的混合表示。还发展了贝叶斯和惩罚似然解释：SCOPE规则允许偶惩罚表示，该表示在系数幅度上非递减，并且合适的子类在适当的对称单峰先验下作为精确的最大后验估计出现。基于逻辑分布、均匀分布和柯西分布的代表性例子说明了概率形状如何控制收缩行为。通过Stein型无偏风险估计讨论了光滑子类的数据驱动参数选择。在标准Donoho-Johnstone测试函数上的Oracle校准模拟研究表明，SCOPE收缩与几种已建立的小波去噪方法相比具有竞争力，同时保持了高度的可解释性和结构灵活性。结果突出了中心分布函数作为小波去噪及相关估计问题中收缩的自然且通用的设计原则。

英文摘要

We introduce Symmetric CDF Oriented Probability Enhanced (SCOPE) shrinkage, a unified family of sign-preserving shrinkage rules constructed from centered cumulative distribution functions of symmetric unimodal distributions. The proposed framework generates a broad class of attenuation profiles that interpolate between strong local shrinkage near zero and asymptotically unbiased behavior in the tails. A general formulation is developed that separates scale and shape effects through two interpretable parameters, allowing effective threshold location and transition sharpness to be controlled independently. Under explicit regularity assumptions, structural properties of SCOPE shrinkage are established, including oddness, monotonicity, continuity, contractivity, and a mixture representation that connects the rules to softened thresholding operators. A Bayesian and penalized likelihood interpretation is also developed: SCOPE rules admit even penalty representations that are nondecreasing in coefficient magnitude, and suitable subclasses arise as exact maximum a posteriori estimators under proper symmetric unimodal priors. Representative examples based on logistic, uniform, and Cauchy distributions illustrate how probabilistic shape governs shrinkage behavior. Data driven parameter selection for smooth subclasses is discussed via Stein-type unbiased risk estimation. Oracle calibrated simulation studies on standard Donoho-Johnstone test functions show that SCOPE shrinkage performs competitively with several established wavelet denoising methods, while retaining a high degree of interpretability and structural flexibility. The results highlight centered distribution functions as a natural and versatile design principle for shrinkage in wavelet denoising and related estimation problems.

URL PDF HTML ☆

赞 0 踩 0

2606.19540 2026-06-19 stat.ME stat.CO stat.ML 新提交

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

通过自适应谱收缩的过拟合高维矩阵分解

Lorenzo Mauri, David B. Dunson

AI总结提出EigenBayes方法，通过谱估计和自适应经验贝叶斯校准超参数，实现快速且具有不确定性量化的过拟合因子模型，在数值实验和基因组学应用中优于现有方法。

详情

AI中文摘要

因子模型是分析高维数据以提取低秩信号和估计协方差的常用方法。它们将协方差矩阵分解为低秩分量和对角分量之和。一个关键问题是如何选择潜在维度$k$，当因子模型仅近似成立且信噪比较低时，这尤其具有挑战性。贝叶斯过拟合因子模型指定$k$的上界，并依赖结构化收缩先验有效去除多余分量。这类方法流行且有效，但计算成本高。我们提出了一种更快的\texttt{EigenBayes}方法，基于潜在因子的谱估计和关键超参数的自适应经验贝叶斯校准，提供有效的不确定性量化。得到的后验分布可跨结果分解且解析可处理，绕过了马尔可夫链蒙特卡洛。我们证明\texttt{EigenBayes}能适应每个结果和潜在维度的信噪比，同时将多余的潜在分量收缩至零。我们建立了良好的渐近性质，并在数值实验和基因组学应用中展示了强大的实证性能，其中EigenBayes优于最先进的替代方法。

英文摘要

Factor models are popular approaches for analyzing high-dimensional data to extract low-rank signals and estimate covariances. They decompose the covariance matrix as the sum of low-rank and diagonal components. A key issue is how to choose the latent dimension $k$, which is particularly challenging when the factor model only holds approximately and in low signal-to-noise scenarios. Bayesian overfitted factor models specify an upper bound on $k$ and rely on structured shrinkage priors to effectively remove extra components. Such approaches are popular and effective, but computationally expensive. We propose a much faster \texttt{EigenBayes} approach that provides valid uncertainty quantification, based on spectral estimation of latent factors and adaptive empirical Bayes calibration of key hyperparameters. The resulting posterior distribution factorizes across outcomes and is analytically tractable, bypassing Markov chain Monte Carlo. We show that \texttt{EigenBayes} adapts to the signal-to-noise ratio of each outcome and latent dimension, while shrinking superfluous latent components to zero. We establish favorable asymptotic properties and demonstrate strong empirical performance in numerical experiments and a genomics application, where EigenBayes outperforms state-of-the-art alternatives.

URL PDF HTML ☆

赞 0 踩 0

2606.20420 2026-06-19 q-fin.CP stat.AP 新提交

Advanced Calibration Analysis and Tools: Identifying Influential Observations in Stochastic Interest Rate Model Calibration

高级校准分析与工具：识别随机利率模型校准中的有影响观测值

Philipp Mahler, Peter Ruckdeschel

AI总结将校准问题嵌入非线性回归理论，证明最小化RMSRE等价于加权最小二乘，开发诊断框架（加权帽子矩阵、影响函数、泛函Delta方法），实证发现杠杆边界主导、有效维度损失及2022年后参数稳定性转变，指出低RMSRE不足以验证校准。

Comments 47 pages, 9 figures, 1 table

详情

AI中文摘要

利率模型的准确校准对于市场一致性估值和经济情景生成器（ESGs）至关重要。多因子模型（如G2++模型）的传统校准方法通常依赖于点估计，忽略了特定市场数据的影响和估计不确定性的量化。本文开发了一个诊断框架，将校准问题嵌入非线性回归理论。研究表明，行业常见的均方根相对误差（RMSRE）最小化等价于加权最小二乘（WLS）问题。这一等价关系导出了诊断工具的相应公式，包括用于杠杆分析的加权帽子矩阵、用于局部敏感性诊断的影响函数，以及用于局部、边界置信区间的泛函Delta方法。实现中采用了高效的雅可比矩阵分解，利用了平价（ATM）上限的解析可处理性。该框架应用于2016-2025年期间的欧元ATM上限数据集。我们的实证分析揭示了边界主导的杠杆分布、由于参数约束活跃导致的重复有效维度损失，以及2022年后市场转型中局部参数稳定性的诊断机制转变。对精算模型治理的启示是：低RMSRE不足以验证校准。最后，我们讨论了该框架对一般最小二乘问题的适用性，同时指出了对于缺乏闭式梯度的工具（如互换期权）的计算挑战。

英文摘要

The accurate calibration of interest rate models is central to market-consistent valuation and Economic Scenario Generators (ESGs). Traditional calibration methods for multi-factor models such as the G2++ model often rely on point estimates, neglecting the influence of specific market data and the quantification of estimation uncertainty. This paper develops a diagnostic framework embedding the calibration problem into non-linear regression theory. It shows that the common industry practice of minimizing the Root Mean Squared Relative Error (RMSRE) is equivalent to a Weighted Least Squares (WLS) problem. This equivalence yields the corresponding formulations for diagnostic tools, including the Weighted Hat Matrix for leverage analysis, Influence Functions for local sensitivity diagnostics, and the Functional Delta Method for local, boundary-respecting confidence intervals. The implementation uses an efficient Jacobian factorization that exploits the analytical tractability of At-The-Money (ATM) caps. The framework is applied to a dataset of Euro ATM caps covering the period 2016--2025. Our empirical analysis reveals a boundary-dominated leverage profile, repeated losses of effective dimensionality due to active parameter constraints, and a diagnostic regime shift in local parameter stability around the post-2022 market transition. The resulting message for actuarial model governance is that low RMSRE is not sufficient for calibration validation. We conclude by discussing the framework's applicability to general least-squares problems while highlighting the computational challenges for instruments lacking closed-form gradients, such as swaptions.

URL PDF HTML ☆

赞 0 踩 0

2606.20079 2026-06-19 q-fin.RM 新提交

How to spot outliers: an Ensemble Anomaly Detection Framework

如何发现异常值：一种集成异常检测框架

Daniil Peysakhovich, Rafał Sieradzki

AI总结针对风险估值输出中的异常问题，提出集成质量评估框架（EQAF），结合多种无监督异常检测方法，在信用衍生品数据上实现F1分数61-79%，优于最佳单一方法（6-66%），并揭示纯统计方法无法检测冻结馈送异常。

详情

AI中文摘要

由数据馈送失败、模型配置错误或系统故障引起的风险估值输出错误可能通过投资银行的风险基础设施未被检测地传播，并产生重大操作损失。利用一家全球大型投资银行涵盖129个交易日183笔交易的专有每日信用衍生品数据，我们设计、实施并实证评估了集成质量评估框架（EQAF），这是一种分层无监督架构，结合互补的异常检测方法，实时监控风险计算完整性。通过使用八种操作现实场景的受控异常注入协议，我们表明校准后的集成在四个不同风险度量数据集上实现了61-79%的F1分数，显著优于最佳单一方法（6-66%）。AUC-ROC提高4-6个百分点证实了这种优势对阈值选择具有鲁棒性。我们进一步证明，纯统计检测方法系统地无法识别冻结值异常，这是一类冻结馈送错误，其中估值输出与先前观测相同，因此与正常数据无法区分，并且领域特定的确定性规则在架构上是不可或缺的。这些发现对巴塞尔III和交易账簿基本审查（FRTB）下的模型风险管理具有直接影响，其中对内部风险模型的自动化和可审计质量控制要求日益增加。

英文摘要

Errors in risk valuation outputs arising from data-feed failures, model misconfiguration, or system malfunctions can propagate undetected through an investment bank's risk infrastructure and generate material operational losses. Using proprietary daily credit-derivatives data from a major global investment bank covering 183 trades across 129 trading days, we design, implement, and empirically evaluate the Ensemble Quality Assessment Framework (EQAF), a layered unsupervised architecture that combines complementary outlier-detection methods to monitor risk calculation integrity in real time. Using a controlled anomaly-injection protocol with eight operationally realistic scenarios, we show that the calibrated ensemble achieves F1 scores of 61-79%, substantially outperforming the best individual method (6-66%) across four distinct risk-measure datasets. Improvements of 4-6 percentage points in AUC-ROC confirm that this advantage is robust to threshold selection. We further demonstrate that purely statistical detection methods systematically fail to identify stale-value anomalies, a class of frozen-feed errors in which valuation outputs are identical to prior observations and therefore indistinguishable from normal data, and that domain-specific deterministic rules are architecturally indispensable. These findings have direct implications for model risk management under Basel III and the Fundamental Review of the Trading Book (FRTB), where automated and auditable quality controls for internal risk models are increasingly required.

URL PDF HTML ☆

赞 0 踩 0

2606.19846 2026-06-19 econ.GN q-fin.EC 新提交

What Capital After Labor? Forecasting the Talent ROI Transition in the Human-AI Era

劳动力之后是什么资本？预测人机时代的人才ROI转型

Kwan Soo Shin, In Seok Kang

AI总结针对AI增强打破劳动时间与贡献的会计关联，本文构建从时间到产出的人才ROI预测框架，核心定理为ROI反转，并利用韩国52小时工作制案例验证了前期压力信号，预测产出型企业在2032年TFP增长领先1.5-2.0个百分点。

Comments 90 pages, 6 figures

详情

AI中文摘要

AI增强打破了劳动时间与生产贡献之间的会计联系，但企业仍通过基于时间的间接费用包来评估人才。本文开发了一个预测框架，用于在人机时代从基于时间的人才会计向基于产出的人才ROI转型。该框架以定理3（在τ*处的ROI反转）为实证主轴，包含四个机制定理：间接费用非加性、增强节省时间路径、创新溢价放大以及人机二元归因不确定性。韩国分阶段实施的52小时工作制规定提供了一个实证预警案例。在一个包含365家上市公司的DART面板数据（2281个公司-年观测值）中，SG&A与收入比率从2018年的18.26%上升至2020年的20.06%，在2021-2022年略有修正，并于2024年达到20.10%的峰值。在收入百分位队列代理下，双向固定效应（+1.56个百分点，p=0.049）、合并事件研究估计（t=+3时为+4.21个百分点，p=0.001）以及Callaway-Sant'Anna双重稳健交错DID估计（t=+4时为+4.51个百分点）收敛于一个正向间接费用压力特征。2015-2017年的向后扩展（224家公司，601个观测值）提供了预处理数据，提供了反对预先存在的上升趋势混杂因素的证据。我们将韩国证据解读为，据我们所知，第一个经验记录的τ*前间接费用压力制度特征，其中基于时间的会计仍占主导地位，而AI增强和劳动时间压缩共同推高了间接费用。预计到2032年，基于产出的公司在公司层面TFP增长上比基于时间的同行高出1.5-2.0个百分点。贡献在于为向AI增强的人才ROI会计转型提供了一个预测模型和管理规划工具。

英文摘要

AI augmentation breaks the accounting link between labor time and productive contribution, yet firms continue to evaluate talent through time-based overhead bundles. This paper develops a forecasting framework for the transition from time-based talent accounting to output-based talent ROI in the human-AI era. The framework centres on Theorem 3 (ROI Inversion at τ*) as the empirical spine, with four mechanism theorems: overhead non-additivity, augmentation-saved-time pathways, innovation-premium amplification, and human-AI dyad attribution uncertainty. Korea's staged 52-hour workweek mandate provides an empirical early-warning case. In a DART panel of 365 listed firms (2,281 firm-year observations), the SG&A-to-revenue ratio rose from 18.26 percent in 2018 to 20.06 percent in 2020, corrected mildly in 2021-2022, and peaked at 20.10 percent in 2024. Under the revenue-percentile cohort proxy, two-way fixed effects (+1.56 pp, p = 0.049), pooled event-study estimates (+4.21 pp at t = +3, p = 0.001), and Callaway-Sant'Anna doubly-robust staggered DiD estimates (+4.51 pp at t = +4) converge on a positive overhead-pressure signature. A 2015-2017 backward extension (224 firms, 601 observations) supplies pre-treatment data, providing evidence against pre-existing upward-trend confounds. We read the Korean evidence not as a direct τ* estimate or a point causal magnitude, but as, to our knowledge, the first empirically documented signature of the pre-τ overhead-pressure regime, where time-based accounting still dominates while AI augmentation and labor-time compression jointly raise overhead. Output-based firms are forecast to outperform time-based peers by 1.5-2.0 percentage points in firm-level TFP growth by 2032. The contribution is a forecasting model and managerial planning tool for the shift to AI-augmented talent ROI accounting.

URL PDF HTML ☆

赞 0 踩 0

2606.19550 2026-06-19 q-fin.GN q-fin.PR 新提交

Which Portfolios? The Construction Dependence of Factor Model Performance

哪些投资组合？因子模型表现的构建依赖性

Useong Shin

AI总结研究发现因子模型表现高度依赖于测试资产的构建方式，如选股、初始加权、持有期和再平衡，其中买入持有策略偏好FF5和FF6，而每日恒定加权偏好FF3，且q5在因子跨度测试中夏普比率最高但定价误差较大。

2606.19517 2026-06-19 q-fin.TR 新提交

Do Prediction Markets Match Option Prices? Bitcoin Threshold Evidence from Binance and Polymarket

预测市场是否与期权价格匹配？来自币安和Polymarket的比特币阈值证据

Victoria Portnaya

AI总结本文通过比较Polymarket预测市场与币安期权隐含的比特币阈值合约价格，发现两者之间存在显著且持久的定价差距，平均约6.3个百分点，表明数字金融市场碎片化导致经济上相同的收益产生系统性定价偏差。

Comments 22 pages, 6 figures, 7 tables; JEL: G13, G14, G19

详情

AI中文摘要

金融市场的数字化产生了两类平台，它们原则上对相同的状态依存收益进行定价：中心化加密期权交易所和基于区块链的预测市场。本文首次提供了加密货币阈值合约的预测市场定价的期权隐含基准测试。在匹配样本的每个小时，我们将Polymarket的Yes价格与同一标的、行权价和到期日的上市币安看涨期权所隐含的贴现风险中性二元值进行比较，并研究两者之间的差距。在2023年9月的主要比特币合约中，平均定价差距为5.6个百分点（基于214个每小时观测值，t=6.46，p<10^{-9}）。合并三个与币安兼容的比特币阈值市场，在287个观测值上得到平均差距为6.3个百分点，对HAC和块自举推断稳健。该差距是持久的——AR(1)半衰期约为四小时——但均值回归，这与分割场所之间缓慢的信息传递而非机械噪声一致。横截面回归显示，价差在期权隐含概率低和到期时间长时最大，这与预测市场合约的投机需求而非测量误差一致。在对冲套利代理中，在保守交易成本后仍保持盈利，但统计精度边际。在相同三个比特币合约上扩展至Deribit，合并差距更大，为11个百分点，而较小的以太坊练习则产生混合证据。结果表明，数字金融市场碎片化导致经济上相同的收益产生系统性、持久的定价偏差。

英文摘要

The digitization of financial markets has produced two classes of platforms that price, in principle, the same state - contingent payoffs: centralized crypto-option exchanges and blockchain-based prediction markets. This paper provides the first option-implied benchmark test of prediction-market pricing for cryptocurrency threshold contracts. For each hour in a matched sample, we compare the Polymarket Yes price with the discounted risk-neutral binary value implied by a listed Binance call option on the same underlying, strike, and maturity, and study the gap between them. In the main September 2023 Bitcoin contract, the mean pricing gap equals 5.6 percentage points across 214 hourly observations (t = 6.46, p < 10^{-9}). Pooling three Binance-compatible Bitcoin threshold markets yields a mean gap of 6.3 percentage points across 287 observations, robust to HAC and block-bootstrap inference. The gap is persistent - with an AR(1) half-life of roughly four hours - yet mean-reverting, consistent with slow information transmission between segmented venues rather than mechanical noise. Cross-sectional regressions reveal that the wedge is largest at low option-implied probabilities and long maturities, a pattern consistent with speculative demand for prediction-market contracts rather than measurement error. A delta-hedged arbitrage proxy remains profitable after conservative transaction costs, though with marginal statistical precision. A Deribit extension on the same three Bitcoin contracts produces a larger pooled gap of 11 percentage points, while a smaller Ethereum exercise yields mixed evidence. The results demonstrate that digital fragmentation of financial markets generates systematic, persistent pricing wedges even for economically identical payoffs.

URL PDF HTML ☆

赞 0 踩 0

2606.19762 2026-06-19 q-bio.MN 新提交

Oscillations and Spatial Patterns in Large-Scale Stochastic Gene Regulatory Networks

大规模随机基因调控网络中的振荡与空间模式

Manuel Eduardo Hernández-García, Jorge Velázquez-Castro

AI总结研究负反馈与扩散的循环基因调控网络，通过确定性和随机方法分析其稳定性，发现随机波动可诱导图灵失稳，为理解发育中的模式形成提供新视角。

Comments 16 pages, 10 figures

详情

AI中文摘要

基因调控网络（GRNs）是细胞生长和组织形成的基础，在发育过程中协调基因表达的时空调控。这些网络固有地受到分子噪声引起的内在波动的影响，因此分析其稳定性对于理解生物体稳健的模式形成和发育动力学至关重要。在本研究中，我们分析了具有负反馈和扩散的循环GRNs的稳定性和动力学，考虑了确定性和随机方法。在确定性情况下，系统表现出稳定性与不稳定性之间的分岔，导致无扩散时的Hopf失稳和包含扩散时的Turing-Hopf失稳。观察到空间域的离散化引入了额外的不稳定模式，从而允许更广泛的模式。基于二阶矩方法的随机框架包含了内在波动，揭示了对于小系统尺寸，即使系统在无扩散时是稳定的，波动也可以主导动力学并诱导随机Turing失稳。值得注意的是，即使所有变量具有相同的扩散速率，Turing失稳也可能出现。所开发的框架提供了一种系统的方法来分析具有扩散的高维随机系统的稳定性，从而简化了Turing和Turing-Hopf失稳的预测。这些发现有助于更深入地理解GRNs中的复杂动力学和模式形成，对细胞分化和发育等生物过程具有潜在意义。

英文摘要

Gene regulatory networks (GRNs) are fundamental to cellular growth and tissue formation, orchestrating spatially and temporally regulated gene expression during development. These networks are inherently subject to intrinsic fluctuations arising from molecular noise, making the analysis of their stability essential for understanding robust pattern formation and developmental dynamics of the organism. In this study, we analyze the stability and dynamics of cyclic GRNs with negative feedback and diffusion, considering both deterministic and stochastic approaches. In the deterministic case, the system exhibits a bifurcation between stability and instability, leading to Hopf instability in the absence of diffusion and to Turing-Hopf instability when diffusion is included. It was observed that the discretization of the spatial domain introduces additional unstable modes, enabling a wider range of patterns. The stochastic framework based on the second-moment approach, which incorporates intrinsic fluctuations, reveals that for small system sizes, fluctuations can dominate the dynamics and induce stochastic Turing instability, even when the system is stable in the absence of diffusion. Notably, Turing instabilities can emerge even when all variables have the same diffusion rate. The developed framework provides a systematic method for analyzing the stability of high-dimensional stochastic systems with diffusion, thereby simplifying the prediction of Turing and Turing-Hopf instabilities. These findings contribute to a deeper understanding of the complex dynamics and pattern formation in GRNs, with potential implications for biological processes, such as cellular differentiation and development.

URL PDF HTML ☆

赞 0 踩 0

2606.19739 2026-06-19 q-bio.NC 新提交

Robust probabilistic measurement of structural-functional module consistency in infant brain development

婴儿大脑发育中结构-功能模块一致性的鲁棒概率测量

Lingbin Bian, Feihong Liu, Qian Wang, Han Zhang, Dinggang Shen, the UNC/UMN Baby Connectome Project Consortium

AI总结提出基于随机模块的概率方法，鲁棒测量婴儿大脑结构-功能模块一致性，发现0-5岁间一致性下降，初级脑区一致性更高。

详情

DOI: 10.1007/s00429-026-03143-3

AI中文摘要

脑网络通常被划分为模块，用于分析其在神经影像学研究的群体分析中功能分离的角色。这里，我们引入脑网络中的随机模块，用于在受试者群体中对结构-功能模块一致性（SFMC）进行鲁棒的概率测量。具体而言，随机模块可被视为一个脑区在受试者间可能被分配到群体级子网络的机会，其特征为该脑区的分配概率。这种新方法在评估脑网络中的非均匀模块方面有两个优势。首先，它可以鲁棒地评估脑结构模块与功能模块之间的一致性，而两者的群体规模不必相同；其次，它能够考虑群体中模块的个体间变异性。此外，与传统的结构-功能耦合方法相比，我们的基于随机模块的方法揭示了结构与功能之间耦合的更显著下降，表明更强的发育重组。我们使用婴儿连接组项目（BCP）数据集的结果显示，SFMC在0至5岁期间下降，并且在初级脑区（如视觉区域）较高，而在更高级的认知区域（包括与注意力、控制和默认模式网络相关的区域）较低。

英文摘要

Brain network is commonly divided into modules for analyzing their functionally segregated roles for group-level analysis in neuroimaging studies. Here, we introduce stochastic modules within brain networks for a robust probabilistic measurement of structural-functional module consistency (SFMC) in a group of subjects. Specifically, a stochastic module can be regarded as the chance of a brain region across subjects potentially being assigned to a group-level sub-network, characterized as an assignment probability for this brain region. This novel method has two advantages for evaluating inhomogeneous modules in brain networks. The first is that it can robustly evaluate the consistency between brain structural and functional modules whose population sizes are not necessary the same, and the second is that it is able to take into account the inter-individual variability of the modules for the groups. Moreover, compared with the conventional structural-functional coupling approach, our stochastic module-based method reveals a more pronounced decline in the coupling between structure and function, indicating stronger developmental reorganization. Our results using the dataset from Baby Connectome Project (BCP) show that the SFMC decreases from 0 to 5 years old, and is greater in primary brain regions, such as visual areas, while lower in more advanced cognitive regions, including those related to attention, control, and default mode network.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

An Optimization Framework for Certain Separable Problems using Neural Networks

Degrees of Freedom and Beamforming for Large Intelligent Surfaces

Multistatic J-Band Radar TX/RX Chipset in SiGe BiCMOS with Integrated x16 Frequency Multiplier Chain and High EIRP

A Survey of Full-Duplex Spoken Dialogue Systems: Architectural Hierarchy, Interaction Ontology, and Decision State Machine

Two-Sample IV: Efficient Two-Step Estimation and Tests for Overidentification and Weak-Instruments

Hypergraph Variable Selection with False Discovery Rate Control

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

Anchors Away: Navigating Unanchored Indirect Comparisons with Multilevel Unanchored Meta-Regression (ML-UMR)

Analysis of uncertain fixed-effects model for Latin square designs

AK-MCS-C2 : Active Kriging Monte Carlo Simulation method with conformal certification for failure probability estimation

A case study of causal mediation using Bayesian nonparametrics and semiparametric corrections

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

Community detection in small-sample ordinal regimes: A benchmarking framework for Delphi data

A Law of Iterated Expectation Primer for Causal Inference

A minimum-risk and cost-efficient two-sample sequential testing framework for the shifted exponential models with application to precipitation data

Built-in Selection Bias in Proportional Hazards Models with Omitted Covariates: Simulation Evidence and Alternative Approaches

The Ghosh-Lin and Fine-Gray models for a mix of administrative and random censoring

Covariate-Adjusted Functional Principal Components Analysis for Modeling Hazard Rates of Physical Activity in the US Population

A Bayesian spatio-temporal nearest neighbor Gaussian process model for pooled genetic data

Calibration without labels in multiple testing

Machine Learning Integrated in Wavelet Shrinkage (MLShrink)

SCOPE Shrinkage: A Unified Framework for Wavelet Denoising

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

Advanced Calibration Analysis and Tools: Identifying Influential Observations in Stochastic Interest Rate Model Calibration

How to spot outliers: an Ensemble Anomaly Detection Framework

What Capital After Labor? Forecasting the Talent ROI Transition in the Human-AI Era

Which Portfolios? The Construction Dependence of Factor Model Performance

Do Prediction Markets Match Option Prices? Bitcoin Threshold Evidence from Binance and Polymarket

Oscillations and Spatial Patterns in Large-Scale Stochastic Gene Regulatory Networks

Robust probabilistic measurement of structural-functional module consistency in infant brain development