arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.10055 2026-02-11 math.ST stat.TH

The weak law of large numbers for the friendship paradox index

Mingao Yuan

2602.10045 2026-02-11 cs.CV cs.LG stat.ME stat.ML

Conformal Prediction Sets for Instance Segmentation

Kerri Lu, Dan M. Kluger, Stephen Bates, Sherrie Wang

2602.10026 2026-02-11 stat.ME

Degrees-of-Freedom Approximations for Conditional-Mean Inference in Random-Lot Stability Analysis

Andrew T. Karl, Heath Rushing, Richard K. Burdick, Jeff Hofer

2602.10018 2026-02-11 stat.ME math.ST stat.ML stat.TH

Online Selective Conformal Prediction with Asymmetric Rules: A Permutation Test Approach

Mingyi Zheng, Ying Jin

2602.10012 2026-02-11 stat.ME

Doubly Robust Estimation of Desirability of Outcome Ranking (DOOR) Probability with Application to MDRO Studies

Shiyu Shu, Toshimitsu Hamasaki, Scott Evans, Lauren Komarow, David van Duin, Guoqing Diao

2602.09982 2026-02-11 stat.ME

Kelly Betting as Bayesian Model Evaluation: A Framework for Time-Updating Probabilistic Forecasts

Michael Beuoy

Comments 31 pages, 10 figures

2602.09959 2026-02-11 math.ST cs.LG stat.ML stat.TH

Statistical-Computational Trade-offs in Learning Multi-Index Models via Harmonic Analysis

Hugo Latourelle-Vigeant, Theodor Misiakiewicz

Comments 91 pages

2602.09936 2026-02-11 stat.ML cs.LG math.ST stat.TH

The Catastrophic Failure of The k-Means Algorithm in High Dimensions, and How Hartigan's Algorithm Avoids It

Roy R. Lederman, David Silva-Sánchez, Ziling Chen, Gilles Mordant, Amnon Balanov, Tamir Bendory

2511.20605 2026-02-11 cs.LG stat.ML

How to Purchase Labels? A Cost-Effective Approach Using Active Learning Markets

Xiwen Huang, Pierre Pinson

Comments Accepted for publication in INFORMS Journal on Data Science (IJDS). This is the authors' preprint

2506.22499 2026-02-11 cs.CV cs.AI stat.AP

Scalable Dynamic Origin-Destination Demand Estimation Enhanced by High-Resolution Satellite Imagery Data

Jiachao Liu, Pablo Guarda, Koichiro Niinuma, Sean Qian

2504.03560 2026-02-11 math.OC cs.LG math.ST stat.ML stat.TH

Stochastic Optimization with Optimal Importance Sampling

Liviu Aolaritei, Bart P. G. Van Parys, Henry Lam, Michael I. Jordan

2402.15004 2026-02-11 stat.ME math.ST stat.TH

Repro Samples Method for a Performance Guaranteed Inference in General and Irregular Inference Problems

Minge Xie, Peng Wang

2310.01153 2026-02-11 math.ST stat.ME stat.TH

Measuring Evidence against Exchangeability and Group Invariance with E-values

Nick W. Koning

2602.09911 2026-02-11 stat.ME

Doubly Robust Machine Learning for Population Size Estimation with Missing Covariates: Application to Gaza Conflict Mortality

Mateo Dulce Rubio, Edward H. Kennedy, Nicholas P. Jewell

2602.09847 2026-02-11 stat.ML cs.LG

Stabilized Maximum-Likelihood Iterative Quantum Amplitude Estimation for Structural CVaR under Correlated Random Fields

Alireza Tabarraei

2602.09845 2026-02-11 stat.CO

Estimating Individual Customer Lifetime Values with R: The CLVTools Package

Markus Meierer, Patrick Bachmann, Jeffrey Näf, Patrik Schilter, René Algesheimer

2602.09833 2026-02-11 math.ST stat.TH

Density estimation from batched broken random samples

Hancheng Bi, Bernhard Schmitzer, Thilo D. Stier

Comments 18 pages, 4 figures

2602.09762 2026-02-11 math.ST cs.NA math.NA stat.TH

Asymptotic analysis of the Gaussian kernel matrix for partially noisy data in high dimensions

Kensuke Aishima

2602.09731 2026-02-11 stat.ME

Bayesian identification of early warning signals for long-range dependent climatic time series

Sigrunn H. Sørbye, Eirik Myrvoll-Nilsen, Håvard Rue

Comments 27 pages, 9 figures

2602.09720 2026-02-11 stat.ML cs.LG

Continual Learning for non-stationary regression via Memory-Efficient Replay

Pablo García-Santaclara, Bruno Fernández-Castro, RebecaP. Díaz-Redondo, Martín Alonso-Gamarra

2602.09704 2026-02-11 stat.ME stat.ML

Extended Isolation Forest with feature sensitivities

Illia Donhauzer

Comments The automated classifier suggested cs.LG. We believe the paper is primarily machine learning theory, and we would appreciate cross-listing to cs.LG or stat.ML if deemed appropriate

2602.09643 2026-02-11 math.PR math.ST stat.TH

A simple proof of the discreteness of Dirichlet processes

Nils Lid Hjort

Comments Based on pages 18-19 in N.L. Hjort's graduate thesis, 1976

2602.09632 2026-02-11 stat.AP

Bayesian network approach to building an affective module for a driver behavioural model

Dorota Młynarczyk, Gabriel Calvo, Francisco Palmi-Perales, Carmen Armero, Virgilio Gómez-Rubio, Ana de la Torre-García, Ricardo Bayona Salvador

2602.09619 2026-02-11 math.ST math.AG stat.TH

Discrete-time, discrete-state multistate Markov models from the perspective of algebraic statistics

Dario Gasbarra, Kaie Kubjas, Sangita Kulathinal, Nataliia Kushnerchuk, Fatemeh Mohammadi, Etienne Sebag

详情

英文摘要

We study discrete-time, discrete-state multistate Markov models from the perspective of algebraic statistics. These models are widely studied in event history analysis, and are characterized by the state space, the initial distribution and the transition probabilities. A finite path under the multistate Markov model is a particular set of states occupied at finite time instances $\{1, \dots, n\}$. The main goal of this paper is to establish a bridge between event history analysis and algebraic statistics. The joint probabilities of finite paths in these models have a natural monomial parametrization in terms of the initial distribution and the transition probabilities. We study the polynomial relations among joint path probabilities. When the statistical constraints on the parameters are disregarded, nonhomogeneous multistate Markov models of arbitrary order can be viewed as slices of decomposable hierarchical models. This yields a complete description of their vanishing ideals as toric ideals generated by explicit families of binomials. Moreover, the variety of this vanishing ideal equals the nonhomogeneous multistate Markov model on the probability simplex. In contrast, homogeneous multistate Markov models exhibit different algebraic behavior, as time homogeneity imposes additional polynomial relations, leading to vanishing ideals that are strictly larger than in the nonhomogeneous case. We also derive families of binomial relations that vanish on homogeneous multistate Markov models. We investigate maximum likelihood estimation from statistical and algebraic perspectives. For nonhomogeneous models, classical and algebraic formulas agree; in the homogeneous case, the algebraic approach is more complex. Lastly, we provide data applications where we demonstrate the statistical theory to obtain the maximum likelihood estimates of the parameters under specific multistate Markov models.

URL PDF HTML ☆

赞 0 踩 0

2602.09566 2026-02-11 cs.LG cs.AI cs.CV stat.ME

ECG-IMN: Interpretable Mesomorphic Neural Networks for 12-Lead Electrocardiogram Interpretation

Vajira Thambawita, Jonas L. Isaksen, Jørgen K. Kanters, Hugo L. Hammer, Pål Halvorsen

2602.09542 2026-02-11 stat.ME

High Dimensional Mean Test for Shrinking Random Variables with Applications to Backtesting

Liujun Chen, Chen Zhou

2602.09537 2026-02-11 stat.ME

A joint QoL-Survival framework with debiased estimation under truncation by death

Torben Martinussen, Klaus K. Holst, Christian Bressen Pipper, Per Kragh Andersen

2602.09512 2026-02-11 stat.ME stat.CO

Continuous mixtures of Gaussian processes as models for spatial extremes

Lorenzo Dell'Oro, Carlo Gaetan, Thomas Opitz

2602.09456 2026-02-11 cs.LG stat.ML

Taming the Monster Every Context: Complexity Measure and Unified Framework for Offline-Oracle Efficient Contextual Bandits

Hao Qin, Chicheng Zhang

Comments 40 pages (13 pages main body, 24 pages supplementary materials)

2602.09356 2026-02-11 math.ST stat.TH

Regularized geometric quantiles and universal linear distribution functionals

Dimitri Konen, Gilles Stupfler

2602.09351 2026-02-11 stat.ME

Supervised Learning of Functional Outcomes with Predictors at Different Scales: A Functional Gaussian Process Approach

R. Jacob Andros, Rajarshi Guhaniyogi, Devin Francom, Donatella Pasqualini

2602.09314 2026-02-11 cs.LG cs.AI stat.ML

Clarifying Shampoo: Adapting Spectral Descent to Stochasticity and the Parameter Trajectory

Runa Eschenhagen, Anna Cai, Tsung-Hsien Lee, Hao-Jun Michael Shi

2602.09279 2026-02-11 stat.ME math.ST stat.TH

Stochastic EM Estimation and Inference for Zero-Inflated Beta-Binomial Mixed Models for Longitudinal Count Data

John Barrera, Ana Arribas-Gil, Dae-Jin Lee, Cristian Meza

Comments 21 pages, 4 figures

2602.09277 2026-02-11 stat.ML cs.LG

Mutual Information Collapse Explains Disentanglement Failure in $β$-VAEs

Minh Vu, Xiaoliang Wan, Shuangqing Wei

2602.09247 2026-02-11 stat.CO

Motivating REML via Prediction-Error Covariances in EM Updates for Linear Mixed Models

Andrew T. Karl

2602.09240 2026-02-11 math.ST cs.IT cs.LG math.IT math.PR stat.ML stat.TH

Optimal Estimation in Orthogonally Invariant Generalized Linear Models: Spectral Initialization and Approximate Message Passing

Yihan Zhang, Hong Chang Ji, Ramji Venkataramanan, Marco Mondelli

2602.09235 2026-02-11 cs.LG stat.AP stat.ME

RAPID: Risk of Attribute Prediction-Induced Disclosure in Synthetic Microdata

Matthias Templ, Oscar Thees, Roman Müller

Comments 29 pages, 5 figures

2602.09219 2026-02-11 math.ST stat.TH

Goodness-of-fit testing for nonlinear inverse problems with random observations

Remo Kretschmann, Han Cheng Lie

Comments 44 pages

2602.09196 2026-02-11 cs.LG stat.ML

Fair Feature Importance Scores via Feature Occlusion and Permutation

Camille Little, Madeline Navarro, Santiago Segarra, Genevera Allen

2602.09167 2026-02-11 stat.ME

Mean regression for (0,1) responses via beta scale mixtures

Arno Otto, Andriëtte Bekker, Johan Ferreira, Lebogang Rathebe

Comments 21 pages, 11 figures

2602.09145 2026-02-11 stat.ME

Estimating causal effects of functional treatments with modified functional treatment policies

Ziren Jiang, Erjia Cui, Jared D. Huling

2602.09058 2026-02-11 stat.ML cs.AI cs.IT cs.LG math.IT

Persistent Entropy as a Detector of Phase Transitions

Matteo Rucco

2602.08681 2026-02-11 cs.LG stat.ML

The Theory and Practice of MAP Inference over Non-Convex Constraints

Leander Kurscheidt, Gabriele Masina, Roberto Sebastiani, Antonio Vergari

2602.07707 2026-02-11 stat.ME

Generation of Multivariate Discrete Data with Generalized Poisson, Negative Binomial and Binomial Marginal Distributions

Chak Kwong, Cheng, Hakan Demirtas

2602.07681 2026-02-11 stat.ME cs.AI

Mapping Drivers of Greenness: Spatial Variable Selection for MODIS Vegetation Indices

Qishi Zhan, Cheng-Han Yu, Yuchi Chen, Zhikang Dong, Rajarshi Guhaniyogi

2602.07632 2026-02-11 stat.ML cs.LG

Scalable Mean-Field Variational Inference via Preconditioned Primal-Dual Optimization

Jinhua Lyu, Tianmin Yu, Ying Ma, Naichen Shi

2601.20152 2026-02-11 math.ST math.PR stat.ML stat.TH

Concentration Inequalities for Exchangeable Tensors and Matrix-valued Data

Chen Cheng, Rina Foygel Barber

Comments 45 pages, 3 figures

2601.19186 2026-02-11 stat.ML cs.LG

Double Fairness Policy Learning: Integrating Action Fairness and Outcome Fairness in Decision-making

Zeyu Bian, Lan Wang, Chengchun Shi, Zhengling Qi

2601.17400 2026-02-11 stat.ME

Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations

Zhe Li, Mélanie Prague, Rodolphe Thiébaut, Quentin Clairon

2601.14049 2026-02-11 stat.ME

Tail-Aware Density Forecasting of Locally Explosive Time Series: A Neural Network Approach

Elena Dumitrescu, Julien Peignon, Arthur Thomas

2601.07752 2026-02-11 econ.EM cs.LG math.ST stat.ME stat.ML stat.TH

A Unified Framework for Debiased Machine Learning: Riesz Representer Fitting under Bregman Divergence

Masahiro Kato

2512.23190 2026-02-11 cs.LG math.OC stat.ML

A Simple, Optimal and Efficient Algorithm for Online Exp-Concave Optimization

Yi-Han Wang, Peng Zhao, Zhi-Hua Zhou

2512.15771 2026-02-11 cs.LG cs.AI cs.NA math.NA stat.ML

Solving PDEs With Deep Neural Nets under General Boundary Conditions

Chenggong Zhang

Comments 7 pages, 2 figures

2512.14609 2026-02-11 stat.ME econ.EM

Asymptotic Inference for Rank Correlations

Marc-Oliver Pohle, Jan-Lukas Wermuth, Christian H. Weiß

2512.02266 2026-02-11 stat.AP

Estimating excess mortality during the Covid-19 pandemic in Aotearoa New Zealand: Addendum

Michael J. Plank, Pubudu Senanayake, Richard Lyon

Journal ref International Journal of Epidemiology (2026) 55(1): dyag008

2512.01965 2026-02-11 stat.AP

Predicting Onsets and Dry Spells of the West African Monsoon Season Using Machine Learning Methods

Colin Bobocea, Yves Atchadé

2510.23631 2026-02-11 cs.LG cs.AI stat.ME stat.ML

Beyond Pairwise: Empowering LLM Alignment With Ranked Choice Modeling

Yuxuan Tang, Yifan Feng

Comments Accepted by The Fourteenth International Conference on Learning Representations (ICLR 2026)

2510.15632 2026-02-11 stat.ME math.ST stat.AP stat.TH

Robust estimation of polyserial correlation coefficients: A density power divergence approach

Max Welz

Comments 69 pages (32 main text), 19 figures and 5 tables in total

Journal ref Forthcoming in Psychometrika (2026+)

2510.08174 2026-02-11 math.ST stat.TH

Dimension-free Bounds for Covariance Estimation with Tensor-Train Structure

Artsiom Patarusau, Nikita Puchkin, Maxim Rakhuba, Fedor Noskov

2509.21996 2026-02-11 stat.ML cs.LG

A Nonparametric Discrete Hawkes Model with a Collapsed Gaussian-Process Prior

Trinnhallen Brisley, Gordon Ross, Daniel Paulin

2509.09569 2026-02-11 stat.AP

Measuring football fever through wearable technology: A case study on the German cup final

Timo Adam, Jonas Bauer, Christian Deutscher, Christiane Fuchs, Tamara Schamberger, David Winkelmann

2508.21536 2026-02-11 stat.ME econ.EM

Triply Robust Panel Estimators

Susan Athey, Guido Imbens, Zhaonan Qu, Davide Viviano

2508.13366 2026-02-11 stat.AP econ.GN q-fin.EC stat.ME

Monotonic Path-Specific Effects: Application to Estimating Educational Returns

Aleksei Opacic

2507.15529 2026-02-11 stat.CO

Algorithms for Approximating Conditionally Optimal Bounds

George Bissias

2507.09093 2026-02-11 stat.ML cs.LG math.OC

Sharp High-Probability Rates for Nonlinear SGD under Heavy-Tailed Noise via Symmetrization

Aleksandar Armacki, Dragana Bajovic, Dusan Jakovetic, Soummya Kar

Comments 43 pages, 1 figure

详情

英文摘要

We study convergence in high-probability of SGD-type methods in non-convex optimization and the presence of heavy-tailed noise. To combat the heavy-tailed noise, a general black-box nonlinear framework is considered, subsuming nonlinearities like sign, clipping, normalization and their smooth counterparts. Our first result shows that nonlinear SGD (N-SGD) achieves the rate $\widetilde{\mathcal{O}}(t^{-1/2})$, for any noise with unbounded moments and a symmetric probability density function (PDF). Crucially, N-SGD has exponentially decaying tails, matching the performance of linear SGD under light-tailed noise. To handle non-symmetric noise, we propose two novel estimators, based on the idea of noise symmetrization. The first, dubbed Symmetrized Gradient Estimator (SGE), assumes a noiseless gradient at any reference point is available at the start of training, while the second, dubbed Mini-batch SGE (MSGE), uses mini-batches to estimate the noiseless gradient. Combined with the nonlinear framework, we get N-SGE and N-MSGE methods, respectively, both achieving the same convergence rate and exponentially decaying tails as N-SGD, while allowing for non-symmetric noise with unbounded moments and PDF satisfying a mild technical condition, with N-MSGE additionally requiring bounded noise moment of order $p \in (1,2]$. Compared to works assuming noise with bounded $p$-th moment, our results: 1) are based on a novel symmetrization approach; 2) provide a unified framework and relaxed moment conditions; 3) imply optimal oracle complexity of N-SGD and N-SGE, strictly better than existing works when $p < 2$, while the complexity of N-MSGE is close to existing works. Compared to works assuming symmetric noise with unbounded moments, we: 1) provide a sharper analysis and improved rates; 2) facilitate state-dependent symmetric noise; 3) extend the strong guarantees to non-symmetric noise.

URL PDF HTML ☆

赞 0 踩 0

2507.06556 2026-02-11 math.PR math.CO math.ST stat.TH

Spectra of high-dimensional sparse random geometric graphs

Yifan Cao, Yizhe Zhu

Comments 26 pages, 4 figures

2507.05526 2026-02-11 cs.LG stat.ME stat.ML

Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning

Anish Dhir, Cristiana Diaconu, Valentinian Mihai Lungu, James Requeima, Richard E. Turner, Mark van der Wilk

2506.13865 2026-02-11 quant-ph cond-mat.dis-nn cs.LG cs.NE stat.ML

Connecting phases of matter to the flatness of the loss landscape in analog variational quantum algorithms

Kasidit Srimahajariyapong, Supanut Thanasilp, Thiparat Chotibut

Comments 17+9 pages, 9+7 figures

2506.05905 2026-02-11 stat.ME cs.NA math.NA stat.CO stat.ML

Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows

Francesca R. Crucinio, Sahani Pathiraja

Comments Changes from v1: the study of tempered dynamics was removed in favour of a larger experimental section

2506.05776 2026-02-11 stat.AP stat.OT

Analyzing the retraining frequency of global forecasting models: towards more stable forecasting systems

Marco Zanotti

2505.21208 2026-02-11 stat.ML cs.LG math.OC

Input Convex Kolmogorov Arnold Networks

Thomas Deschatre, Xavier Warin

2505.19013 2026-02-11 cs.LG cs.AI econ.GN q-fin.EC stat.ML

Faithful Group Shapley Value

Kiljae Lee, Ziqi Liu, Weijing Tang, Yuan Zhang

Comments Accepted to NeurIPS 2025

2505.17133 2026-02-11 stat.ML cs.AI cs.LG

Learning Probabilities of Causation with Mask-Augmented Data

Shuai Wang, Yizhou Sun, Judea Pearl, Ang Li

Comments arXiv admin note: text overlap with arXiv:2502.08858

2505.10919 2026-02-11 physics.flu-dyn cs.LG stat.ML

A Physics-Informed Spatiotemporal Deep Learning Framework for Turbulent Systems

Luca Menicali, Andrew Grace, David H. Richter, Stefano Castruccio

2505.08654 2026-02-11 stat.ME econ.EM q-fin.ST

Holistic Multi-Scale Inference of the Leverage Effect: Efficiency under Dependent Microstructure Noise

Ziyang Xiong, Zhao Chen, Christina Dan Wang

2504.03784 2026-02-11 stat.ML cs.AI cs.LG

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Kai Ye, Hongyi Zhou, Jin Zhu, Francesco Quinzan, Chengchun Shi

2412.08794 2026-02-11 cs.LG stat.ML

Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

Prajwal Koirala, Zhanhong Jiang, Soumik Sarkar, Cody Fleming

Journal ref International Conference on Learning Representations (ICLR), 2025

2412.06582 2026-02-11 math.ST stat.ME stat.TH

Optimal estimation in private distributed functional data analysis

Gengyu Xue, Zhenhua Lin, Yi Yu

2410.04165 2026-02-11 stat.ME econ.EM

How to Compare Copula Forecasts?

Tobias Fissler, Yannick Hoga

Journal ref Journal of Business & Economic Statistics (2026)

2408.00955 2026-02-11 stat.ML cs.LG stat.ME

Aggregation Models with Optimal Weights for Distributed Gaussian Processes

Haoyuan Chen, Rui Tuo

Comments 34 pages, 8 figures, 2 tables

2406.05637 2026-02-11 math.OC cs.LG math.PR stat.ML

A Generalized Version of Chung's Lemma and its Applications

Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

Comments 38 pages

2312.05319 2026-02-11 stat.ME math.ST stat.TH

Hyperbolic Network Latent Space Model with Learnable Curvature

Jinming Li, Gongjun Xu, Ji Zhu

Journal ref Journal of the American Statistical Association 2026

2311.17407 2026-02-11 math.ST cs.NA math.NA stat.TH

Strong consistency of an estimator by the truncated singular value decomposition for an errors-in-variables regression model with collinearity

Kensuke Aishima

Comments arXiv admin note: text overlap with arXiv:2302.06824

Journal ref Linear Algebra and its Applications, Volume 721, 15 September 2025, Pages 520-541

2308.14240 2026-02-11 stat.AP

Bayesian Multivariate Track Geometry Degradation Modelling and its use in Condition-Based Inspection

Huy Truong-Ba, Sinda Rebello, Michael E. Cholette, Venkat Reddy, Pietro Borghesani

Journal ref Railway Engineering Science, 2025

2212.00133 2026-02-11 cs.LG math.OC stat.ML

Universal Neural Optimal Transport

Jonathan Geuter, Gregor Kornhardt, Ingimar Tomasson, Vaios Laschos

Comments 37 pages, 19 figures, accepted to ICML 2025

Journal ref Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:19196-19232, 2025

2101.00245 2026-02-11 stat.ML cs.CV cs.LG cs.NE

The Bayesian Method of Tensor Networks

Erdong Guo, David Draper

Comments 13 pages, 4 figures

Journal ref Neurocomputing 675 (2026) 132961

详情

DOI: 10.1016/j.neucom.2026.132961

英文摘要

Bayesian learning is a powerful learning framework which combines the external information of the data (background information) with the internal information (training data) in a logically consistent way in inference and prediction. By Bayes rule, the external information (prior distribution) and the internal information (training data likelihood) are combined coherently, and the posterior distribution and the posterior predictive (marginal) distribution obtained by Bayes rule summarize the total information needed in the inference and prediction, respectively. In this paper, we study the Bayesian framework of the Tensor Network from two perspective. First, we introduce the prior distribution to the weights in the Tensor Network and predict the labels of the new observations by the posterior predictive (marginal) distribution. Since the intractability of the parameter integral in the normalization constant computation, we approximate the posterior predictive distribution by Laplace approximation and obtain the out-product approximation of the hessian matrix of the posterior distribution of the Tensor Network model. Second, to estimate the parameters of the stationary mode, we propose a stable initialization trick to accelerate the inference process by which the Tensor Network can converge to the stationary path more efficiently and stably with gradient descent method. We verify our work on the MNIST, Phishing Website and Breast Cancer data set. We study the Bayesian properties of the Bayesian Tensor Network by visualizing the parameters of the model and the decision boundaries in the two dimensional synthetic data set. For a application purpose, our work can reduce the overfitting and improve the performance of normal Tensor Network model.

URL PDF HTML ☆

赞 0 踩 0