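arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.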

2602.15809 2026-02-18 stat.AP cs.AI

Decision Quality Evaluation Framework at Pinterest

Yuqi Tian, Robert Paine, Attila Dobi, Kevin O'Sullivan, Aravindh Manickavasagam, Faisal Farooq

Details

Abstract

Online platforms require robust systems to enforce content safety policies at scale. A critical component of these systems is the ability to evaluate the quality of moderation decisions made by both human agents and Large Language Models (LLMs). However, this evaluation is challenging due to the inherent trade-offs between cost, scale, and trustworthiness, along with the complexity of evolving policies. To address this, we present a comprehensive Decision Quality Evaluation Framework developed and deployed at Pinterest. The framework is centered on a high-trust Golden Set (GDS) curated by subject matter experts (SMEs), which serves as a ground truth benchmark. We introduce an automated intelligent sampling pipeline that uses propensity scores to efficiently expand dataset coverage. We demonstrate the framework's practical application in several key areas: benchmarking the cost-performance trade-offs of various LLM agents, establishing a rigorous methodology for data-driven prompt optimization, managing complex policy evolution, and ensuring the integrity of policy content prevalence metrics via continuous validation. The framework enables a shift from subjective assessments to a data-driven and quantitative practice for managing content safety systems.

2602.15731 2026-02-18 stat.ME

Generalised Exponential Kernels for Nonparametric Density Estimation

Laura M. Craig, Wagner Barreto-Souza

Comments Paper submitted for publication

2602.15697 2026-02-18 stat.AP

Reproducibility and Statistical Methodology

Anthony Almudevar, Jacob Almudevar

Comments 34 pages; 4 tables; 7 figures

2602.15679 2026-02-18 stat.ME

Safe hypotheses testing with application to order restricted inference

Ori Davidov

2602.15673 2026-02-18 stat.ME

Leicester's Tale: Another Perspective on the EPL 2015/16 Through Expected Goals (xG) Modelling

Sheikh Badar Ud Din Tahir, Leonardo Egidi, Nicola Torelli

2602.15600 2026-02-18 cs.SI cs.AI econ.EM stat.AP

The geometry of online conversations and the causal antecedents of conflictual discourse

Carlo Santagiustina, Caterina Cruciani

Details

DOI: 10.13140/RG.2.2.34796.22409

Abstract

This article investigates the causal antecedents of conflictual language and the geometry of interaction in online threaded conversations related to climate change. We employ three annotation dimensions, inferred through LLM prompting and averaging, to capture complementary aspects of discursive conflict (stance: agreement vs disagreement; tone: attacking vs respectful; and emotional versus factual framing) and use data from a threaded online forum to examine how these dimensions respond to temporal, conversational, and arborescent structural features of discussions. First, we show that, as suggested by the literature, longer delays between successive posts in a thread are associated with replies that are, on average, more respectful, whereas longer delays relative to the parent post are associated with slightly less disagreement but more emotional (less factual) language. Second, we characterize alignment with the local conversational environment and find strong convergence both toward the average stance, tone and emotional framing of older sibling posts replying to the same parent and toward those of the parent post itself, with parent post effects generally stronger than sibling effects. We further show that early branch-level responses condition these alignment dynamics, such that parent-child stance alignment is amplified or attenuated depending on whether a branch is initiated in agreement or disagreement with the discussion's root message. These influences are largely additive for civility-related dimensions (attacking vs respectful, disagree vs agree), whereas for emotional versus factual framing there is a significant interaction: alignment with the parent's emotionality is amplified when older siblings are similarly aligned.

2602.15587 2026-02-18 math.ST stat.TH

Adjusted Scores for Discrete Langevin Algorithms

Armand Gissler, Saeed Saremi, Francis Bach

2602.15586 2026-02-18 cs.LG stat.ML

Uniform error bounds for quantized dynamical models

Abdelkader Metakalard, Fabien Lauer, Kevin Colin, Marion Gilson

2602.15585 2026-02-18 math.ST cs.IT math.IT math.PR stat.TH

Optimal detection of planted stars via a random energy model

Ijay Narang, Will Perkins, Timothy L. H. Wee

Comments 34 pages

2602.15568 2026-02-18 stat.ME cs.LG cs.SY eess.SY stat.ML

Scenario Approach with Post-Design Certification of User-Specified Properties

Algo Carè, Marco C. Campi, Simone Garatti

2602.15559 2026-02-18 stat.ME econ.EM math.ST stat.ML stat.TH

Fixed-Horizon Self-Normalized Inference for Adaptive Experiments via Martingale AIPW/DML with Logged Propensities

Gabriel Saco

Comments 32 pages. Comments welcome

2602.15538 2026-02-18 stat.ML cs.LG math.OC

Functional Central Limit Theorem for Stochastic Gradient Descent

Kessang Flamand, Victor-Emmanuel Brunel

2602.13380 2026-02-18 stat.ME

Robust Design in the Presence of Aleatoric and Epistemic Uncertainty

Luis G. Crespo

2602.09170 2026-02-18 stat.ML cs.AI cs.LG

Quantifying Epistemic Uncertainty in Diffusion Models

Aditi Gupta, Raphael A. Meyer, Yotam Yaniv, Elynn Chen, N. Benjamin Erichson

Comments Will appear in the Proceedings of the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026

2601.07961 2026-02-18 stat.AP

Language Markers of Emotion Flexibility Predict Depression and Anxiety Treatment Outcomes

Benjamin Brindle, George A. Bonanno, Thomas Derrick Hull, Nicolas Charon, Matteo Malgaroli

2512.21176 2026-02-18 econ.EM stat.ME

Difference-in-Differences in the Presence of Unknown Interference

Fabrizia Mealli, Javier Viviens

2510.11923 2026-02-18 physics.chem-ph cond-mat.mtrl-sci cs.LG stat.ML

Enhancing Diffusion-Based Sampling with Molecular Collective Variables

Juno Nam, Bálint Máté, Artur P. Toshev, Manasa Kaniselvan, Rafael Gómez-Bombarelli, Ricky T. Q. Chen, Brandon Wood, Guan-Horng Liu, Benjamin Kurt Miller

2510.08749 2026-02-18 math.ST stat.ME stat.ML stat.TH

Theoretical guarantees for change localization using conformal p-values

Swapnaneel Bhattacharyya, Aaditya Ramdas

Comments 45 pages, 8 figures

2508.20883 2026-02-18 math.NA cs.ET cs.NA stat.CO

Lattice Random Walk Discretisations of Stochastic Differential Equations

Samuel Duffield, Maxwell Aifer, Denis Melanson, Zach Belateche, Patrick J. Coles

Comments 19 pages, 7 figures

2508.11460 2026-02-18 cs.LG stat.ML

Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

Aurora Grefsrud, Nello Blaser, Trygve Buanes

Comments Accepted Manuscript for publication in Open Access journal Machine Learning: Science and Technology

2507.01761 2026-02-18 cs.LG cs.AI stat.ML

Enhanced Generative Model Evaluation with Clipped Density and Coverage

Nicolas Salvy, Hugues Talbot, Bertrand Thirion

2506.15723 2026-02-18 q-fin.ST cs.LG econ.GN q-fin.EC stat.AP

Modern approaches to building interpretable models of the property market using machine learning on the base of mass cadastral valuation

Alexey S. Tanashkin, Irina G. Tanashkina, Alexander S. Maksimchuik

Comments 62 pages, 21 figures, 11 tables; after the major revision, accepted in journal Land Use Policy; changes: literature review is added to introduction section, new conclusion, comparison of the models with the random forest is added, the feature selection section is reconsidered, many minor corrections, language sufficiently improved

Details

DOI: 10.1016/j.landusepol.2026.107970
Journal ref: Land Use Policy, Volume 165, 2026, 107970

Abstract

In this paper, we review modern approaches to building interpretable models of property markets using machine learning on the basis of mass valuation of property in the Primorye region, Russia. There are numerous potential difficulties one could encounter in the effort to build a good model. Their main source is the huge difference between noisy real market data and the ideal data usually used in tutorials on machine learning. This paper covers all stages of modeling: collection of initial data, identification of outliers, search and analysis of patterns in the data, formation and final choice of price factors, building of the model, and evaluation of its efficiency. For each stage, we highlight potential issues and describe sound methods for overcoming emerging difficulties using actual examples. We show that the combination of classical linear regression with kriging (an interpolation method from geostatistics) makes it possible to build an effective model for land parcels. For flats, when many objects are attributed to one spatial point, the application of geostatistical methods becomes problematic. Instead, we suggest linear regression with automatic generation and selection of additional rules on the basis of decision trees, the so-called RuleFit method. We compare the performance of our inherently interpretable models with the well-proven "black-box" Random Forest method and demonstrate similar results. Thus we show that, despite such a strong restriction as the requirement of interpretability, which is important in practice (for example, in legal matters), it is still possible to build effective models of real property markets.

2503.00509 2026-02-18 cs.LG cs.AI math.OC stat.ML

Functional multi-armed bandit and the best function identification problems

Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Anastasiia Soboleva

2502.05161 2026-02-18 stat.AP

Comprehensive and Spatially Detailed Passenger Vehicle and Truck Traffic Volume Data for the United States Estimated by Machine Learning

Brittany Antonczak, Meg Fay, Aviral Chawla, Gregory Rowangould

Comments 18 pages including references, 5 figures

Details

DOI: 10.1016/j.dib.2026.112451
Journal ref: Antonczak, B., Fay, M., Chawla, A., & Rowangould, G. (2026). Comprehensive and spatially detailed passenger vehicle and truck traffic volume data for the United States estimated by machine learning. Data in Brief, 64, 112451

Abstract

The Highway Performance Monitoring System, managed by the Federal Highway Administration, provides data on average annual daily traffic volume across roadways in the United States, but it has limited representation of medium- and heavy-duty vehicle traffic on lower-volume roadways that are not part of the national highway system. This gap limits research and policy analysis on the community impacts of truck traffic, especially concerning air quality and public health. To address this, we use random forest regression to estimate medium- and heavy-duty vehicle traffic volumes on network links where these data are missing. The result is a comprehensive vehicle traffic dataset that covers 85.2% of public roadways in the United States. From these data, we also calculate traffic density values for each census block and vehicle class that can serve as a high-resolution surrogate for traffic-related air pollution exposure in public health studies and policy analysis. Our high-resolution spatial data products are rigorously validated and provide a more complete representation of truck traffic than any existing publicly available dataset. These datasets are valuable for transportation planning, public health research, and policy decisions aimed at understanding and mitigating the effects of truck traffic on communities that are disproportionately exposed to air pollution from vehicle traffic.

2409.19400 2026-02-18 stat.ME stat.ML

The co-varying ties between networks and item responses via latent variables

Selena Wang, Plamena Powla, Tracy Sweet, Subhadeep Paul

2408.14073 2026-02-18 cs.LG stat.ME stat.ML

Score-based change point detection via tracking the best of infinitely many experts

Anna Markovich, Nikita Puchkin

Comments 61 pages, 4 figures

2408.04854 2026-02-18 stat.ME

Transportability of aggregate trial results to an external environment in causally interpretable meta-analysis

Tran Trong Khoi Le, Marie-Felicia Béclin, Sivem Afach, Tat-Thang Vo

2405.21012 2026-02-18 cs.LG stat.ME

IGC-Net for conditional average potential outcome estimation over time

Konstantin Hess, Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel

2401.07111 2026-02-18 stat.AP stat.CO

Bayesian Signal Matching for Transfer Learning in ERP-Based Brain Computer Interface

Tianwen Ma, Jane E. Huggins, Jian Kang

Comments 35 pages, 6 figures, 2 tables

2107.03633 2026-02-18 cs.LG stat.ML

Generalization Error of GAN from the Discriminator's Perspective

Hongkang Yang, Weinan E

1902.10708 2026-02-18 math.ST stat.ME stat.TH

Quasi-Bayes properties of a recursive procedure for mixtures

Sandra Fortini, Sonia Petrone

1902.10288 2026-02-18 math.OC math.ST stat.TH

Clustering, factor discovery and optimal transport

Hongkang Yang, Esteban G. Tabak

Comments Improved clarity of presentation

2602.15503 2026-02-18 cs.LG stat.ML

Approximation Theory for Lipschitz Continuous Transformers

Takashi Furuya, Davide Murari, Carola-Bibiane Schönlieb

2602.15496 2026-02-18 stat.ME

Confidence Distributions for FIC scores

Céline Cunen, Nils Lid Hjort

Comments 26 pages, 9 figures, 2020 version, later published in essentially this form, Econometrics 2020, volume 8, number 27, www.mdpi.com/2225-1146/8/3/27

2602.15429 2026-02-18 stat.AP

Deep description of static and dynamic network ties in Honduran villages

Marios Papamichalis, Nikolaos Nakis, Nicholas A. Christakis

Comments This is the first draft of the paper. It is under review at a statistics journal

2602.15390 2026-02-18 stat.ME cs.NA math.NA math.NT

Space-filling lattice designs for computer experiments

Naoki Sakai, Takashi Goda

Comments 24 pages, 5 figures

2602.15387 2026-02-18 stat.ME stat.AP

Bayesian Nonparametrics for Gene-Gene and Gene-Environment Interactions in Case-Control Studies: A Synthesis and Extension

Durba Bhattacharya, Sourabh Bhattacharya

Comments Feedback welcome

2602.15374 2026-02-18 stat.ME stat.AP

Joint Modeling of Longitudinal EHR Data with Shared Random Effects for Informative Visiting and Observation Processes

Cheng-Han Yang, Xu Shi, Bhramar Mukherjee

Comments 37 pages, 8 figures, 6 tables; with 30-page supplementary material (total 67 pages)

2602.15315 2026-02-18 cs.CV stat.ML

Training-Free Zero-Shot Anomaly Detection in 3D Brain MRI with 2D Foundation Models

Tai Le-Gia, Jaehyun Ahn

Comments Accepted for MIDL 2026

2602.15306 2026-02-18 stat.ML cs.LG

Sparse Additive Model Pruning for Order-Based Causal Structure Learning

Kentaro Kanamori, Hirofumi Suzuki, Takuya Takagi

Comments 15 pages, 12 figures, to appear in the 40th AAAI Conference on Artificial Intelligence (AAAI 2026)

2602.15303 2026-02-18 cs.IT math.IT math.ST stat.TH

On the Entropy of General Mixture Distributions

Namyoon Lee

Comments 20 pages, 5 figures

2602.15297 2026-02-18 math.ST stat.TH

Bayes Risk for Goodness of Fit Tests

Nicholas G. Polson, Vadim Sokolov, Daniel Zantedeschi

2602.15291 2026-02-18 stat.ME

Structural grouping of extreme value models via graph fused lasso

Takuma Yoshida, Koki Momoki, Shuichi Kawano

Comments 40 pages, 14 figures

2602.15247 2026-02-18 stat.ME stat.AP

Sample size and power determination for assessing overall SNP effects in joint modeling of longitudinal and time-to-event data

Yuan Bian, Shelley B. Bull

2602.15218 2026-02-18 eess.SP cs.NA math.NA stat.CO

Multiplierless DFT Approximation Based on the Prime Factor Algorithm

L. Portella, F. M. Bayer, R. J. Cintra

Comments 24 pages, 4 figures

2602.15191 2026-02-18 math.PR math.ST stat.TH

Derivation of the AMP equations from belief propagation for the $\ell_2$ minimisation problem

Giuseppe Genovese, Arianna Piana

Comments 52 pages, 1 figure

2602.15184 2026-02-18 cs.LG stat.ML

Learning Data-Efficient and Generalizable Neural Operators via Fundamental Physics Knowledge

Siying Ma, Mehrdad M. Zadeh, Mauricio Soroco, Wuyang Chen, Jiguo Cao, Vijay Ganesh

2602.15167 2026-02-18 cs.CV stat.AP stat.ML

Distributional Deep Learning for Super-Resolution of 4D Flow MRI under Domain Shift

Xiaoyi Wen, Fei Jiang

2602.15150 2026-02-18 stat.ME stat.CO

bayesics: Core Statistical Methods via Bayesian Inference in R

Daniel K. Sewell, Alan T. Arakkal

2602.15136 2026-02-18 stat.ML cs.LG

Universal priors: solving empirical Bayes via Bayesian inference and pretraining

Nick Cannella, Anzo Teh, Yanjun Han, Yury Polyanskiy

Comments 40 pages, 5 figures

2602.15095 2026-02-18 stat.ME stat.AP

Natural direct effects of vaccines and post-vaccination behaviour

Bronner P. Gonçalves, Piero L. Olliaro, Sheena G. Sullivan, Benjamin J. Cowling

2602.15094 2026-02-18 math.OC math.PR stat.ML

On propagation of chaos for the Fisher-Rao gradient flow in entropic mean-field optimization

Petra Lazić, Linshan Liu, Mateusz B. Majka

Comments 38 pages, to appear in AISTATS 2026

2602.15076 2026-02-18 cs.LG stat.ML

Near-Optimal Sample Complexity for Online Constrained MDPs

Chang Liu, Yunfan Li, Lin F. Yang

2602.15041 2026-02-18 physics.comp-ph physics.plasm-ph stat.CO

VR-PIC: An entropic variance-reduction method for particle-in-cell solutions of the Vlasov-Poisson equation

Victor Windhab, Andreas Adelmann, Mohsen Sadr

Comments Preprint

2602.14813 2026-02-18 stat.ME

The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models

Gian Pietro Bellocca, Ignacio Garrón, Vladimir Rodríguez-Caballero, Esther Ruiz

2602.11325 2026-02-18 stat.ML cs.LG stat.CO stat.ME

Amortised and provably-robust simulation-based inference

Ayush Bharti, Charita Dellaporta, Yuga Hikida, François-Xavier Briol

2602.07418 2026-02-18 cs.LG stat.ML

Achieving Optimal Static and Dynamic Regret Simultaneously in Bandits with Deterministic Losses

Jian Qian, Chen-Yu Wei

2601.16427 2026-02-18 stat.ML cs.LG stat.AP stat.ME

Perfect Clustering for Sparse Directed Stochastic Block Models

Behzad Aalipur, Yichen Qin

2601.11099 2026-02-18 stat.ME math.ST stat.CO stat.TH

Robust $M$-Estimation of Scatter Matrices via Precision Structure Shrinkage

Soma Nikai, Yuichi Goto, Koji Tsukuda

Comments 30 pages

2601.09747 2026-02-18 q-bio.PE math.GT stat.AP

Topological Percolation in Urban Dengue Transmission: A Multi-Scale Analysis of Spatial Connectivity

Marcílio Ferreira dos Santos, Cleiton de Lima Ricardo

Comments 12 pages, 4 figures

2511.19797 2026-02-18 cs.LG cs.AI cs.CV stat.ML

Terminal Velocity Matching

Linqi Zhou, Mathias Parger, Ayaan Haque, Jiaming Song

Comments Blog post: https://lumalabs.ai/blog/engineering/tvm Code available at: https://github.com/lumalabs/tvm

2511.19026 2026-02-18 cs.NI stat.ME

Energy-Efficient Routing Protocol in Vehicular Opportunistic Networks: A Dynamic Cluster-based Routing Using Deep Reinforcement Learning

Meisam Sharifi Sani, Saeid Iranmanesh, Raad Raad, Faisel Tubbal

Comments Published in IEEE Transactions on Intelligent Transportation Systems (2026)

2511.17117 2026-02-18 stat.CO econ.EM

Modified Delayed Acceptance MCMC for Quasi-Bayesian Inference with Linear Moment Conditions

Masahiro Tanaka

2508.11060 2026-02-18 stat.ML cs.LG stat.ME

Counterfactual Survival Q-learning via Buckley-James Boosting, with Applications to ACTG 175 and CALGB 8923

Jeongjin Lee, Jong-Min Kim

Comments Accepted at JRSS C

2507.18240 2026-02-18 q-fin.RM stat.AP

Index insurance under demand and solvency constraints

Olivier Lopez, Daniel Nkameni

2507.10679 2026-02-18 stat.CO econ.EM stat.ME

FARS: Factor Augmented Regression Scenarios in R

Gian Pietro Bellocca, Ignacio Garrón, Vladimir Rodríguez-Caballero, Esther Ruiz

2505.11985 2026-02-18 cs.LG stat.ML

Variance-Optimal Arm Selection: Misallocation Minimization and Best Arm Identification

Sabrina Khurshid, Gourab Ghatak, Mohammad Shahid Abdulla

Details

Abstract

This paper focuses on selecting the arm with the highest variance from a set of $K$ independent arms. Specifically, we focus on two settings: (i) the misallocation minimization setting, which penalizes the number of pulls of suboptimal arms in terms of variance, and (ii) the fixed-budget best arm identification setting, which evaluates the ability of an algorithm to determine the arm with the highest variance after a fixed number of pulls. We develop a novel online algorithm called UCB-VV for the misallocation minimization (MM) setting and show that its upper bound on misallocation for bounded rewards evolves as $\mathcal{O}\left(\log{n}\right)$, where $n$ is the horizon. By deriving the lower bound on the misallocation, we show that UCB-VV is order optimal. For the fixed-budget best arm identification (BAI) setting, we propose the SHVV algorithm. We show that the upper bound of the error probability of SHVV evolves as $\exp\left(-\frac{n}{\log(K) H}\right)$, where $H$ represents the complexity of the problem, and this rate matches the corresponding lower bound. We extend the framework from bounded distributions to sub-Gaussian distributions using a novel concentration inequality on the sample variance and standard deviation. Leveraging the same, we derive a concentration inequality for the empirical Sharpe ratio (SR) for sub-Gaussian distributions, which was previously unknown in the literature. Empirical simulations show that UCB-VV consistently outperforms $\epsilon$-greedy across different sub-optimality gaps, though it is surpassed by VTS, which exhibits the lowest misallocation, albeit lacking in theoretical guarantees. We also illustrate the superior performance of SHVV against uniform sampling in a fixed-budget setting under 6 different setups. Finally, we conduct a case study to empirically evaluate the performance of UCB-VV and SHVV in call option trading on $100$ stocks generated using GBM.
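
To make the fixed-budget setting in the abstract concrete, here is a minimal Python sketch of a generic sequential-halving loop that keeps the arms with the largest sample variance. It is an illustration only and is not the paper's SHVV algorithm: the function names, the even budget split, the Gaussian arms, and the budget of 4000 pulls are all assumptions made for this example.

```python
import math
import random
import statistics


def sequential_halving_max_variance(arms, budget, rng):
    """Generic fixed-budget sequential halving on sample variance.

    `arms` is a list of callables taking an rng and returning one reward.
    Hypothetical helper for illustration; not the SHVV algorithm itself.
    """
    survivors = list(range(len(arms)))
    rounds = max(1, math.ceil(math.log2(len(arms))))
    for _ in range(rounds):
        if len(survivors) == 1:
            break
        # Split the budget evenly over rounds and surviving arms.
        # At least 2 pulls are needed for a sample variance, so very
        # small budgets may be slightly exceeded.
        pulls = max(2, budget // (rounds * len(survivors)))
        scores = {}
        for i in survivors:
            samples = [arms[i](rng) for _ in range(pulls)]
            scores[i] = statistics.variance(samples)
        # Keep the half of the arms with the largest sample variance.
        survivors.sort(key=lambda i: scores[i], reverse=True)
        survivors = survivors[: max(1, math.ceil(len(survivors) / 2))]
    return survivors[0]


def make_gaussian_arm(sigma):
    """Zero-mean Gaussian arm with standard deviation `sigma` (assumed)."""
    return lambda rng: rng.gauss(0.0, sigma)


rng = random.Random(0)
sigmas = [0.5, 1.0, 1.5, 3.0]  # the last arm has the highest variance
arms = [make_gaussian_arm(s) for s in sigmas]
best = sequential_halving_max_variance(arms, budget=4000, rng=rng)
print(best)
```

Each round spends an equal share of the budget and discards the lower-variance half, so arms that are hard to tell apart receive more pulls in later rounds.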

2504.11194 2026-02-18 stat.AP

Two-Part Forecasting for Time-Shifted Metrics

Harrison Katz, Erica Savage, Kai Thomas Brusch

2502.15146 2026-02-18 stat.ME

On the Validity of Isotropic Covariance Functions for Set-indexed Random Fields

Lucas da Cunha Godoy, Marcos Oliveira Prates, Fernando Andrés Quintana, Jun Yan

Comments 27 pages, 3 figures

2502.07397 2026-02-18 stat.ML cs.LG

Linear Bandits beyond Inner Product Spaces, the case of Bandit Optimal Transport

Lorenzo Croissant

2410.05225 2026-02-18 cs.LG cs.RO stat.ML

ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

Ehsan Futuhi, Shayan Karimi, Chao Gao, Martin Müller

Comments We have expanded the related work section with more detailed discussions and enhanced our experiments by incorporating additional data and analysis

2305.03571 2026-02-18 eess.SP cs.IT cs.LG math.IT stat.ML

Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient

Edgar Beck, Carsten Bockelmann, Armin Dekorsy

Comments Accepted for publication in IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN 2024), Source Code: https://github.com/ant-uni-bremen/SINFONY

2208.12113 2026-02-18 stat.ME stat.CO stat.ML

Generative Bayesian Inference with GANs

Yuexi Wang, Veronika Ročková

2006.02397 2026-02-18 math.ST cs.CR stat.CO stat.TH

One Step to Efficient Synthetic Data

Jordan Awan, Zhanrui Cai

Comments 30 pages before references and appendices