arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.11118 2026-02-12 stat.ME stat.ML

A Doubly Robust Machine Learning Approach for Disentangling Treatment Effect Heterogeneity with Functional Outcomes

Filippo Salmaso, Lorenzo Testa, Francesca Chiaromonte

Comments 20 pages, 4 figures

2602.11108 2026-02-12 stat.CO cs.NA math.NA

Large Scale High-Dimensional Reduced-Rank Linear Discriminant Analysis

Jocelyn T. Chi

2602.11107 2026-02-12 stat.ME cs.LG stat.ML

Renet: Principled and Efficient Relaxation for the Elastic Net via Dynamic Objective Selection

Albert Dorador

详情

英文摘要

We introduce Renet, a principled generalization of the Relaxed Lasso to the Elastic Net family of estimators. While, on the one hand, $\ell_1$-regularization is a standard tool for variable selection in high-dimensional regimes and, on the other hand, the $\ell_2$ penalty provides stability and solution uniqueness through strict convexity, the standard Elastic Net nevertheless suffers from shrinkage bias that frequently yields suboptimal prediction accuracy. We propose to address this limitation through a framework called \textit{relaxation}. Existing relaxation implementations rely on naive linear interpolations of penalized and unpenalized solutions, which ignore the non-linear geometry that characterizes the entire regularization path and risk violating the Karush-Kuhn-Tucker conditions. Renet addresses these limitations by enforcing sign consistency through an adaptive relaxation procedure that dynamically dispatches between convex blending and efficient sub-path refitting. Furthermore, we identify and formalize a unique synergy between relaxation and the ``One-Standard-Error'' rule: relaxation serves as a robust debiasing mechanism, allowing practitioners to leverage the parsimony of the 1-SE rule without the traditional loss in predictive fidelity. Our theoretical framework incorporates automated stability safeguards for ultra-high dimensional regimes and is supported by a comprehensive benchmarking suite across 20 synthetic and real-world datasets, demonstrating that Renet consistently outperforms the standard Elastic Net and provides a more robust alternative to the Adaptive Elastic Net in high-dimensional, low signal-to-noise ratio and high-multicollinearity regimes. By leveraging an adaptive solver backend, Renet delivers these statistical gains while offering a computational profile that remains competitive with state-of-the-art coordinate descent implementations.

URL PDF HTML ☆

赞 0 踩 0

2602.11090 2026-02-12 cs.LG cs.AI cs.CE stat.CO

Direct Learning of Calibration-Aware Uncertainty for Neural PDE Surrogates

Carlos Stein Brito

Comments 13 pages, 11 figures

2602.11059 2026-02-12 stat.ML cs.LG stat.AP

A Gibbs posterior sampler for inverse problem based on prior diffusion model

Jean-François Giovannelli

2602.11018 2026-02-12 cs.LG cs.AI stat.ML

OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories

Returaj Burnwal, Nirav Pravinbhai Bhatt, Balaraman Ravindran

Comments 21 pages, Accepted at AAMAS 2026

2602.10971 2026-02-12 cs.LG stat.ML

A Jointly Efficient and Optimal Algorithm for Heteroskedastic Generalized Linear Bandits with Adversarial Corruptions

Sanghwa Kim, Junghyun Lee, Se-Young Yun

Comments 49 pages, 1 table

2602.10969 2026-02-12 stat.ME stat.ML

Weighting-Based Identification and Estimation in Graphical Models of Missing Data

Anna Guo, Razieh Nabi

2602.10960 2026-02-12 q-fin.ST cs.CE econ.EM q-fin.RM stat.CO

Integrating granular data into a multilayer network: an interbank model of the euro area for systemic risk assessment

Ilias Aarab, Thomas Gottron, Andrea Colombo, Jörg Reddig, Annalauro Ianiro

Journal ref Adv Data Anal Classif (2026)

详情

DOI: 10.1007/s11634-026-00668-7

英文摘要

Micro-structural models of contagion and systemic risk emphasize that shock propagation is inherently multi-channel, spanning counterparty exposures, short-term funding and roll-over risk, securities cross-holdings, and common-asset (fire-sale) spillovers. Empirical implementations, however, often rely on stylized or simulated networks, or focus on a single exposure dimension, reflecting the practical difficulty of reconciling heterogeneous granular collections into a coherent representation with consistent identifiers and consolidation rules. We close part of this gap by constructing an empirically grounded multilayer network for euro area significant banking groups that integrates several supervisory and statistical datasets into layer-consistent exposure matrices defined on a common node set. Each layer corresponds to a distinct transmission channel, long- and short-term credit, securities cross-holdings, short-term secured funding, and overlapping external portfolios, and nodes are enriched with balance-sheet information to support model calibration. We document pronounced cross-layer heterogeneity in connectivity and centrality, and show that an aggregated (flattened) representation can mask economically relevant structure and misidentify the institutions that are systemically important in specific markets. We then illustrate how the resulting network disciplines standard systemic-risk analytics by implementing a centrality-based propagation measure and a micro-structural agent-based framework on real exposures. The approach provides a data-grounded basis for layer-aware systemic-risk assessment and stress testing across multiple dimensions of the banking network.

URL PDF HTML ☆

赞 0 踩 0

2602.10924 2026-02-12 stat.ME

Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

James Neill, Lloyd A. C. Chapman, Chris Jewell

Comments 18 pages, 7 figures (plus supplementary material with an additional 9 pages, 8 figures)

2602.10867 2026-02-12 stat.ML cs.LG

Deep Learning of Compositional Targets with Hierarchical Spectral Methods

Hugo Tabanelli, Yatin Dandi, Luca Pesce, Florent Krzakala

2602.03609 2026-02-12 stat.AP

Scalable non-separable spatio-temporal Gaussian process models for large-scale short-term weather prediction

Tim Gyger, Reinhard Furrer, Fabio Sigrist

2512.00181 2026-02-12 cs.LG cs.AI stat.ML

Orion-Bix: Bi-Axial Attention for Tabular In-Context Learning

Mohamed Bouadi, Pratinav Seth, Aditya Tanna, Vinay Kumar Sankarapu

2511.18141 2026-02-12 stat.ML cs.LG

Conformal Prediction for Compositional Data

Lucas P. Amaral, Luben M. C. Cabezas, Thiago R. Ramos, Gustavo H. G. A. Pereira

Comments 32 pages, 11 figures

2510.06025 2026-02-12 cs.LG stat.ML

Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

Kevin Raina, Tanya Schmah

Comments British Machine Vision Conference (BMVC) 2025; 18 pages, 6 figures, 3 tables

Journal ref https://bmvc2025.bmva.org/proceedings/1187/

2509.25170 2026-02-12 cs.LG cs.AI stat.ML

GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models

Peter Holderrieth, Uriel Singer, Tommi Jaakkola, Ricky T. Q. Chen, Yaron Lipman, Brian Karrer

2508.07465 2026-02-12 cs.LG q-bio.GN stat.ML

MOTGNN: Interpretable Graph Neural Networks for Multi-Omics Disease Classification

Tiantian Yang, Zhiqian Chen

Comments 11 pages, 6 figures, 7 tables

2506.23870 2026-02-12 stat.ME math.ST stat.TH

Upgrading survival models with CARE

William G. Underwood, Henry W. J. Reeve, Oliver Y. Feng, Samuel A. Lambert, Bhramar Mukherjee, Richard J. Samworth

Comments 80 pages, 12 figures

2506.16202 2026-02-12 cs.CY cs.HC stat.AP

AI labeling reduces the perceived accuracy of online content but has limited broader effects

Chuyao Wang, Patrick Sturgis, Daniel de Kadt

Comments 31 pages, 5 figures, 10 tables

2504.08513 2026-02-12 math.PR math.ST stat.TH

Measure Theory of Conditionally Independent Random Function Evaluation

Felix Benning

2408.12175 2026-02-12 cs.LG stat.ML

Measuring Orthogonality as the Blind-Spot of Uncertainty Disentanglement

Ivo Pascal de Jong, Andreea Ioana Sburlea, Matthia Sabatelli, Matias Valdenegro-Toro

Comments 25 pages, 17 figures, 6 tables

2407.12149 2026-02-12 math.NA cs.NA math.ST stat.TH

Multigrid Monte Carlo Revisited: Theory and Bayesian Inference

Yoshihito Kazashi, Eike H. Müller, Robert Scheichl

Comments 64 pages, 4 figures, 2 tables; to appear in "Foundations of Computational Mathematics"

详情

英文摘要

Gaussian random fields play an important role in many areas of science and engineering. In practice, they are often simulated by sampling from a high-dimensional multivariate normal distribution, which arises from the discretisation of a suitable precision operator. Existing methods such as Cholesky factorization and Gibbs sampling become prohibitively expensive on fine meshes due to their high computational cost. In this work, we revisit the Multigrid Monte Carlo (MGMC) algorithm developed by Goodman & Sokal (Physical Review D 40.6, 1989) in the quantum physics context. While the authors of this paper conclude that MGMC does not overcome critical slowing down in simulations of field theories near phase transitions, we demonstrate here that it has the potential to significantly accelerate sampling in spatial statistics. The class of Gaussian Random Fields we consider includes those with Matérn covariance, but is more general in that it also allows for non-stationary covariance functions. To show that MGMC can overcome the limitation of existing methods, we establish a grid-size-independent convergence theory based on the link between linear solvers and samplers for multivariate normal distributions, drawing on standard multigrid convergence arguments. We then apply this theory to linear Bayesian inverse problems. This application is achieved by extending the standard multigrid theory to operators with a low-rank perturbation. Moreover, we develop a novel bespoke random smoother which takes care of the low-rank updates that arise in constructing posterior moments. In particular, we prove that Multigrid Monte Carlo is algorithmically optimal in the limit of the grid-size going to zero. Numerical results support our theory, demonstrating that Multigrid Monte Carlo can be significantly more efficient than alternative methods when applied in a Bayesian setting.

URL PDF HTML ☆

赞 0 踩 0

2407.07559 2026-02-12 math.ST stat.TH

Granulometric Smoothing on Manifolds

Diego Bolón, Rosa M. Crujeiras, Alberto Rodríguez-Casal

Comments 65 pages (a main paper of 28 pages and several appendices)

2405.16828 2026-02-12 cs.LG math.ST stat.ML stat.TH

Kernel-based Optimally Weighted Conformal Time-Series Prediction

Jonghyeok Lee, Chen Xu, Yao Xie

Journal ref In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR), 2025

2402.03991 2026-02-12 cs.LG cs.NA math.NA stat.ML

Provable Emergence of Deep Neural Collapse and Low-Rank Bias in $L^2$-Regularized Nonlinear Networks

Emanuele Zangrando, Piero Deidda, Simone Brugiapaglia, Nicola Guglielmi, Francesco Tudisco

2304.04724 2026-02-12 stat.CO cs.CC stat.ML

When does Metropolized Hamiltonian Monte Carlo provably outperform Metropolis-adjusted Langevin algorithm?

Yuansi Chen, Khashayar Gatmiry, Minhui Jiang

Comments 46 pages, fixed typos and minor issues

2203.00554 2026-02-12 stat.ML cs.LG

Neural Score Matching for High-Dimensional Causal Inference

Oscar Clivio, Fabian Falck, Brieuc Lehmann, George Deligiannidis, Chris Holmes

Comments Fixed erroneous Propositions 5-6-7 and Appendix B from the previous version

2602.10784 2026-02-12 stat.AP

Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

Rouven Michels, Robert Bajons, Jan-Ole Fischer

2602.10774 2026-02-12 math.ST stat.TH

Nonparametric two sample test of spectral densities

Ilaria Nadin, Tatyana Krivobokova, Farida Enikeeva

2602.10754 2026-02-12 cs.LG cs.AI cs.SY eess.SY stat.ML

Exploring the impact of adaptive rewiring in Graph Neural Networks

Charlotte Cambier van Nooten, Christos Aronis, Yuliya Shapovalova, Lucia Cavallaro

Comments This work has been submitted to the IEEE for possible publication

2602.10730 2026-02-12 stat.ME math.ST stat.TH

A closed form solution for Bayesian analysis of a simple linear mixed model

Hilde Vinje, Lars Erik Gangsei

2602.10714 2026-02-12 stat.CO stat.ML

A Non-asymptotic Analysis for Learning and Applying a Preconditioner in MCMC

Max Hird, Florian Maire, Jeffrey Negrea

2602.10697 2026-02-12 math.OC stat.ML

Fast and Large-Scale Unbalanced Optimal Transport via its Semi-Dual and Adaptive Gradient Methods

Ferdinand Genans

2602.10691 2026-02-12 stat.ML cs.LG

Convergence Rates for Distribution Matching with Sliced Optimal Transport

Gauthier Thurin, Claire Boyer, Kimia Nadjahi

2602.10673 2026-02-12 stat.ME

Inferring the presence and abundance of rare waterbirds species from scarce data

Barbara Bricout, Laura Dami, Pierre Defos du Rau, Sophie Donnet, Thomas Galewski, Stephane Robin

Comments 31 pages, 9 figures

2602.10640 2026-02-12 stat.ML cs.LG

Beyond Kemeny Medians: Consensus Ranking Distributions Definition, Properties and Statistical Learning

Stephan Clémençon, Ekhine Irurozki

2602.10613 2026-02-12 stat.ML cs.LG

Highly Adaptive Principal Component Regression

Mingxun Wang, Alejandro Schuler, Mark van der Laan, Carlos García Meixide

2602.10611 2026-02-12 cs.LG physics.comp-ph stat.ML

On the Role of Consistency Between Physics and Data in Physics-Informed Neural Networks

Nicolás Becerra-Zuniga, Lucas Lacasa, Eusebio Valero, Gonzalo Rubio

Comments 24 pages, 7 Figures, 3 Tables

详情

英文摘要

Physics-informed neural networks (PINNs) have gained significant attention as a surrogate modeling strategy for partial differential equations (PDEs), particularly in regimes where labeled data are scarce and physical constraints can be leveraged to regularize the learning process. In practice, however, PINNs are frequently trained using experimental or numerical data that are not fully consistent with the governing equations due to measurement noise, discretization errors, or modeling assumptions. The implications of such data-to-PDE inconsistencies on the accuracy and convergence of PINNs remain insufficiently understood. In this work, we systematically analyze how data inconsistency fundamentally limits the attainable accuracy of PINNs. We introduce the concept of a consistency barrier, defined as an intrinsic lower bound on the error that arises from mismatches between the fidelity of the data and the exact enforcement of the PDE residual. To isolate and quantify this effect, we consider the 1D viscous Burgers equation with a manufactured analytical solution, which enables full control over data fidelity and residual errors. PINNs are trained using datasets of progressively increasing numerical accuracy, as well as perfectly consistent analytical data. Results show that while the inclusion of the PDE residual allows PINNs to partially mitigate low-fidelity data and recover the dominant physical structure, the training process ultimately saturates at an error level dictated by the data inconsistency. When high-fidelity numerical data are employed, PINN solutions become indistinguishable from those trained on analytical data, indicating that the consistency barrier is effectively removed. These findings clarify the interplay between data quality and physics enforcement in PINNs providing practical guidance for the construction and interpretation of physics-informed surrogate models.

URL PDF HTML ☆

赞 0 踩 0

2602.10608 2026-02-12 stat.ML cs.LG

Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood

Jiangrong Ouyang, Mingming Gong, Howard Bondell

Comments Accepted for publication in JMLR

2602.10588 2026-02-12 cs.LG stat.ML

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Hosein Anjidani, S. Yahya S. R. Tehrani, Mohammad Mahdi Mojahedian, Mohammad Hossein Yassaee

2602.10587 2026-02-12 stat.ML cs.LG

Deep Bootstrap

Jinyuan Chang, Yuling Jiao, Lican Kang, Junjie Shi

2602.10566 2026-02-12 math.ST math.CO math.RA math.SP stat.TH

Finite-sample confidence regions for spectral clustering and graph centrality

Chandrasekhar Gokavarapu, Sekhar Babu Gosala, Vamis Pasalapudi, Tarakarama Kapakayala

2602.10545 2026-02-12 cs.LG cs.AI stat.ML

$μ$pscaling small models: Principled warm starts and hyperparameter transfer

Yuxin Ma, Nan Chen, Mateo Díaz, Soufiane Hayou, Dmitriy Kunisky, Soledad Villar

Comments 61 pages, 6 figures

2602.10532 2026-02-12 stat.ML cs.LG math.ST stat.TH

Statistical Inference and Learning for Shapley Additive Explanations (SHAP)

Justin Whitehouse, Ayush Sawarni, Vasilis Syrgkanis

Comments 48 pages, 1 figure

2602.10530 2026-02-12 stat.ML cs.LG math.ST stat.TH

Generalized Robust Adaptive-Bandwidth Multi-View Manifold Learning in High Dimensions with Noise

Xiucai Ding, Chao Shen, Hau-Tieng Wu

Comments 4 figures

2602.10484 2026-02-12 stat.ME

CoVaR under Asymptotic Independence

Zhaowen Wang, Yutao Liu, Deyuan Li

2602.10464 2026-02-12 math.ST stat.ML stat.TH

Do More Predictions Improve Statistical Inference? Filtered Prediction-Powered Inference

Shirong Xu, Will Wei Sun

2602.10348 2026-02-12 stat.ME

Optimizing precision in stepped-wedge designs via machine learning and quadratic inference functions

Liangbo Lyu, Bingkai Wang

2602.10332 2026-02-12 stat.ME stat.ML

Generalized Prediction-Powered Inference, with Application to Binary Classifier Evaluation

Runjia Zou, Daniela Witten, Brian Williamson

2602.10303 2026-02-12 cs.LG q-bio.QM stat.ML

ICODEN: Ordinary Differential Equation Neural Networks for Interval-Censored Data

Haoling Wang, Lang Zeng, Tao Sun, Youngjoo Cho, Ying Ding

2602.10274 2026-02-12 math.ST stat.TH

Asymptotic equivalence for nonparametric additive regression

Moritz Jirak, Alexander Meister, Angelika Rohde

2602.10261 2026-02-12 cs.LG stat.AP stat.ML

Kernel-Based Learning of Chest X-ray Images for Predicting ICU Escalation among COVID-19 Patients

Qiyuan Shi, Jian Kang, Yi Li

2602.10256 2026-02-12 math.ST stat.TH

Bernstein-von Mises theorem for log-concave posteriors

Victor-Emmanuel Brunel

2602.10241 2026-02-12 stat.ME cs.CY

Geographically Weighted Canonical Correlation Analysis: Local Spatial Associations Between Two Sets of Variables

Zhenzhi Jiao, Angela Yao, Ran Tao, Jean-Claude Thill

2602.10182 2026-02-12 cs.LG stat.ML

Signature-Kernel Based Evaluation Metrics for Robust Probabilistic and Tail-Event Forecasting

Benjamin R. Redhead, Thomas L. Lee, Peng Gu, Víctor Elvira, Amos Storkey

Comments Main Paper: 8 pages 3 figures Including Appendix and References: 19 pages 7 figures

2602.10176 2026-02-12 stat.ML cs.LG

Dissecting Performative Prediction: A Comprehensive Survey

Thomas Kehrenberg, Javier Sanguino, Jose A. Lozano, Novi Quadrianto

2602.10144 2026-02-12 stat.ML cs.AI cs.LG

When LLMs get significantly worse: A statistical approach to detect model degradations

Jonas Kübler, Kailash Budhathoki, Matthäus Kleindessner, Xiong Zhou, Junming Yin, Ashish Khetan, George Karypis

Comments https://openreview.net/forum?id=cM3gsqEI4K

Journal ref ICLR 2026

2602.09208 2026-02-12 stat.ME

Some Bayesian Perspectives on Clinical Trials

Alexandra Sokolova, Vadim Sokolov, Nick Polson

2602.08215 2026-02-12 cs.LG stat.ME

Distribution-Free Robust Predict-Then-Optimize in Function Spaces

Yash Patel, Ambuj Tewari

2602.07402 2026-02-12 quant-ph physics.hist-ph stat.AP

The ABL Rule and the Perils of Post-Selection

Jacob A. Barandes

Comments 28 pages, no figures

2602.03514 2026-02-12 cs.LG math.OC stat.ML

A Function-Space Stability Boundary for Generalization in Interpolating Learning Systems

Ronald Katende

2601.21014 2026-02-12 stat.ML cs.LG stat.AP

Efficient Causal Structure Learning via Modular Subgraph Integration

Haixiang Sun, Pengchao Tian, Zihan Zhou, Jielei Zhang, Peiyi Li, Andrew L. Liu

2601.18626 2026-02-12 cs.LG cs.AI stat.ML

Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning

Yingxiao Huo, Satya Prakash Dash, Radu Stoican, Samuel Kaski, Mingfei Sun

2601.16865 2026-02-12 econ.EM math.ST stat.AP stat.TH

Distributional Instruments: Identification and Estimation with Quantile Least Squares

Rowan Cherodian, Guy Tchuente

2512.25025 2026-02-12 stat.ME econ.EM math.ST stat.TH

Modewise Additive Factor Model for Matrix Time Series

Elynn Chen, Yuefeng Han, Jiayu Li, Ke Xu

2512.19338 2026-02-12 math.ST stat.AP stat.ME stat.TH

A hybrid-Hill estimator enabled by heavy-tailed block maxima

Claudia Neves, Chang Xu

Comments 32 pages, 5 figures

2511.21516 2026-02-12 math.ST stat.TH

Causal Inference: A Tale of Three Frameworks

Linbo Wang, Thomas Richardson, James Robins

2511.01103 2026-02-12 math.ST stat.TH

Nonparametric Least Squares Estimators for Interval Censoring

Piet Groeneboom

Comments 26 pages, 8 figures

2510.08554 2026-02-12 cs.LG stat.ML

Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization

Kevin Rojas, Jiahe Lin, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Molei Tao, Wei Deng

2510.00309 2026-02-12 cs.LG stat.ML

Lipschitz Bandits with Stochastic Delayed Feedback

Zhongxuan Liu, Yue Kang, Thomas C. M. Lee

Comments The Fourteenth International Conference on Learning Representations (ICLR 2026)

2506.11214 2026-02-12 math.OC cs.AI cs.CC cs.LG stat.ML

Complexity of normalized stochastic first-order methods with momentum under heavy-tailed noise

Chuan He, Zhaosong Lu, Defeng Sun, Zhanwang Deng

2506.10569 2026-02-12 stat.AP

A composition of simplified physics-based model with neural operator for trajectory-level seismic response predictions of structural systems

Jungho Kim, Sang-ri Yi, Ziqi Wang

Journal ref Structural Safety, Vol(119), 102668, 2026

详情

DOI: 10.1016/j.strusafe.2025.102668

英文摘要

Accurate prediction of nonlinear structural responses is essential for earthquake risk assessment and management. While high-fidelity nonlinear time history analysis provides the most comprehensive and accurate representation of the responses, it becomes computationally prohibitive for complex structural system models and repeated simulations under varying ground motions. To address this challenge, we propose a composite learning framework that integrates simplified physics-based models with a Fourier neural operator to enable efficient and accurate trajectory-level seismic response prediction. In the proposed architecture, a simplified physics-based model, obtained from techniques such as linearization, modal reduction, or solver relaxation, serves as a preprocessing operator to generate structural response trajectories that capture coarse dynamic characteristics. A neural operator is then trained to correct the discrepancy between these initial approximations and the true nonlinear responses, allowing the composite model to capture hysteretic and path-dependent behaviors. Additionally, a linear regression-based postprocessing scheme is introduced to further refine predictions and quantify associated uncertainty with negligible additional computational effort. The proposed approach is validated on three representative structural systems subjected to synthetic or recorded ground motions. Results show that the proposed approach consistently improves prediction accuracy over baseline models, particularly in data-scarce regimes. These findings demonstrate the potential of physics-guided operator learning for reliable and data-efficient modeling of nonlinear structural seismic responses.

URL PDF HTML ☆

赞 0 踩 0

2505.23599 2026-02-12 cs.LG math.RT math.ST stat.ML stat.TH

On Transferring Transferability: Towards a Theory for Size Generalization

Eitan Levin, Yuxin Ma, Mateo Díaz, Soledad Villar

Comments 75 pages, 10 figures, closest to version to be published in NeurIPS

2505.16204 2026-02-12 cs.LG math.ST stat.ML stat.TH

Directional Convergence, Benign Overfitting of Gradient Descent in leaky ReLU two-layer Neural Networks

Ichiro Hashimoto

Comments 41 pages, Accepted at International Conference on Learning Representations 2026 (ICLR 2026)

2504.08263 2026-02-12 stat.ME stat.OT

A roadmap for systematic identification and analysis of multiple biases in causal inference

Rushani Wijesuriya, Rachael A. Hughes, John B. Carlin, Rachel L. Peters, Jennifer J. Koplin, Margarita Moreno-Betancur

Comments 12 Pages, 4 Figures

2504.05661 2026-02-12 math.ST stat.TH

Online Bernstein-von Mises theorem

Jeyong Lee, Junhyeok Choi, Minwoo Chae

Comments 124 pages, 1 figure (Accepted to the Journal of Machine Learning Research)

2503.02437 2026-02-12 stat.ML cs.LG

Decentralized Reinforcement Learning for Multi-Agent Multi-Resource Allocation via Dynamic Cluster Agreements

Antonio Marino, Esteban Restrepo, Claudio Pacchierotti, Paolo Robuffo Giordano

Journal ref IEEE Robotics and Automation Letters, 2025, 10 (8), pp.8123-8130

2503.01882 2026-02-12 cs.LG physics.geo-ph stat.AP stat.ML

Constructing balanced datasets for predicting failure modes in structural systems under seismic hazards

Jungho Kim, Taeyong Kim

Journal ref Engineering Structures, Vol(346), 121637, 2026

2502.14121 2026-02-12 stat.ML cs.AI cs.LG

Multi-Objective Bayesian Optimization for Networked Black-Box Systems: A Path to Greener Profits and Smarter Designs

Akshay Kudva, Wei-Ting Tang, Joel A. Paulson

详情

英文摘要

Designing modern industrial systems requires balancing several competing objectives, such as profitability, resilience, and sustainability, while accounting for complex interactions between technological, economic, and environmental factors. Multi-objective optimization (MOO) methods are commonly used to navigate these tradeoffs, but selecting the appropriate algorithm to tackle these problems is often unclear, particularly when system representations vary from fully equation-based (white-box) to entirely data-driven (black-box) models. While grey-box MOO methods attempt to bridge this gap, they typically impose rigid assumptions on system structure, requiring models to conform to the underlying structural assumptions of the solver rather than the solver adapting to the natural representation of the system of interest. In this chapter, we introduce a unifying approach to grey-box MOO by leveraging network representations, which provide a general and flexible framework for modeling interconnected systems as a series of function nodes that share various inputs and outputs. Specifically, we propose MOBONS, a novel Bayesian optimization-inspired algorithm that can efficiently optimize general function networks, including those with cyclic dependencies, enabling the modeling of feedback loops, recycle streams, and multi-scale simulations - features that existing methods fail to capture. Furthermore, MOBONS incorporates constraints, supports parallel evaluations, and preserves the sample efficiency of Bayesian optimization while leveraging network structure for improved scalability. We demonstrate the effectiveness of MOBONS through two case studies, including one related to sustainable process design. By enabling efficient MOO under general graph representations, MOBONS has the potential to significantly enhance the design of more profitable, resilient, and sustainable engineering systems.

URL PDF HTML ☆

赞 0 踩 0

2502.03366 2026-02-12 cs.LG stat.ML

Rethinking Approximate Gaussian Inference in Classification

Bálint Mucsányi, Nathaël Da Costa, Philipp Hennig

Comments 46 pages

2502.03174 2026-02-12 math.ST stat.ML stat.TH

Robust Label Shift Quantification

Alexandre Lecestre

Comments Revision were made, including a change of title. Also, this version contains new results in the calibration section

2501.01783 2026-02-12 math.ST stat.ML stat.TH

Nonparametric estimation of a factorizable density using diffusion models

Hyeok Kyu Kwon, Dongha Kim, Ilsang Ohn, Minwoo Chae

Comments Accepted for publication in the Journal of Machine Learning Research (JMLR)

2412.20481 2026-02-12 math.OC stat.CO

EM algorithms for optimization problems with polynomial objectives

Kensuke Asai, Jun-ya Gotoh

2412.17070 2026-02-12 math.PR math.OC stat.ML

Decoupled Functional Central Limit Theorems for Two-Time-Scale Stochastic Approximation

Yuze Han, Xiang Li, Jiadong Liang, Zhihua Zhang

2412.00228 2026-02-12 stat.ME

A Doubly Robust Framework for Addressing Outcome-Dependent Selection Bias in Multi-Cohort EHR Studies

Ritoban Kundu, Xu Shi, Michael Kleinsasser, Lars G. Fritsche, Maxwell Salvatore, Bhramar Mukherjee

2410.06125 2026-02-12 stat.ME stat.AP

Simultaneous Graphical Dynamic Modeling

Mike West, Luke Vrotsos

Comments 34 pages and 13 figures

2406.01552 2026-02-12 stat.ML cs.AI cs.LG

Tensor learning with orthogonal, Lorentz, and symplectic symmetries

Wilson G. Gregory, Josué Tonelli-Cueto, Nicholas F. Marshall, Andrew S. Lee, Soledad Villar

Comments 40 pages, 1 figure. To appear at ICLR 2026

2404.07593 2026-02-12 stat.ML cs.LG stat.ME

Diffusion posterior sampling for simulation-based inference in tall data settings

Julia Linhart, Gabriel Victorino Cardoso, Alexandre Gramfort, Sylvain Le Corff, Pedro L. C. Rodrigues

Comments 49 pages, 24 figures, 3 tables, 2 algorithms, 12 appendices, TMLR acceptance

2402.04582 2026-02-12 stat.AP stat.ML

Dimensionality reduction can be used as a surrogate model for high-dimensional forward uncertainty quantification

Jungho Kim, Sang-ri Yi, Ziqi Wang

Journal ref Reliability Engineering & System Safety, Vol(265), 111474, 2026

2305.19640 2026-02-12 stat.ML cs.LG

Fine-grained Analysis of Non-parametric Estimation for Pairwise Learning

Junyu Zhou, Shuo Huang, Han Feng, Puyu Wang, Ding-Xuan Zhou

2110.01950 2026-02-12 stat.ML cs.LG

Classification of high-dimensional data with spiked covariance matrix structure

Yin-Jen Chen, Minh Tang

Comments 40 pages, 2 figures

Journal ref Transactions on Machine Learning Research (01/2026)