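arXivDaily: a daily academic digest that syncs the full arXiv dataset and provides AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.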

2602.15730 2026-02-18 cs.CL econ.EM

Causal Effect Estimation with Latent Textual Treatments

Omri Feldman, Amar Venugopal, Jann Spiess, Amir Feder

Details

English abstract

Understanding the causal effects of text on downstream outcomes is a central task in many applications. Estimating such effects requires researchers to run controlled experiments that systematically vary textual features. While large language models (LLMs) hold promise for generating text, producing and evaluating controlled variation requires more careful attention. In this paper, we present an end-to-end pipeline for the generation and causal estimation of latent textual interventions. Our work first performs hypothesis generation and steering via sparse autoencoders (SAEs), followed by robust causal estimation. Our pipeline addresses both computational and statistical challenges in text-as-treatment experiments. We demonstrate that naive estimation of causal effects suffers from significant bias as text inherently conflates treatment and covariate information. We describe the estimation bias induced in this setting and propose a solution based on covariate residualization. Our empirical results show that our pipeline effectively induces variation in target features and mitigates estimation error, providing a robust foundation for causal effect estimation in text-as-treatment settings.
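
The abstract above names covariate residualization as the remedy for bias when text mixes treatment and covariate information. The listing does not give the authors' estimator, so the following is only a minimal sketch of generic linear partialling-out on simulated data. The data-generating process, the coefficients 0.8 and 2.0, and all function names are hypothetical assumptions made for illustration.

```python
import random


def ols_slope(x, y):
    """Slope of the least-squares line of y on x (with intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def residualize(v, c):
    """Residuals of v after a linear fit (with intercept) on covariate c."""
    n = len(v)
    mean_v = sum(v) / n
    mean_c = sum(c) / n
    slope = ols_slope(c, v)
    return [vi - mean_v - slope * (ci - mean_c) for vi, ci in zip(v, c)]


def simulate(n=5000, true_effect=1.0, seed=0):
    """Hypothetical design: the covariate drives both treatment and outcome."""
    rng = random.Random(seed)
    c = [rng.gauss(0, 1) for _ in range(n)]        # covariate carried by the text
    t = [0.8 * ci + rng.gauss(0, 1) for ci in c]   # treatment feature, correlated with c
    y = [true_effect * ti + 2.0 * ci + rng.gauss(0, 1) for ti, ci in zip(t, c)]
    return c, t, y


c, t, y = simulate()

# Naive estimate: regress the outcome on the treatment feature alone.
naive = ols_slope(t, y)

# Residualized estimate: remove the covariate's linear contribution from both
# the treatment and the outcome, then regress residual on residual.
adjusted = ols_slope(residualize(t, c), residualize(y, c))

print(f"naive slope:        {naive:.3f}")
print(f"residualized slope: {adjusted:.3f}")
```

Under this simulated design the naive slope converges to 1 + 2(0.8)/1.64, roughly 1.98, while the residualized slope targets the true effect of 1.0. A linear adjustment is used here only because it is the simplest case; text covariates in practice would likely need a richer representation.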

2602.15722 2026-02-18 math.OC econ.GN q-fin.EC

Pricing Discrete and Nonlinear Markets With Semidefinite Relaxations

Cheng Guo, Lauren Henderson, Ryan Cory-Wright, Boshi Yang

2602.15686 2026-02-18 econ.TH

Minimizing Volatility: Optimal Adjustment with Evolving Feasibility Constraints

Simon Jantschgi, Heinrich H. Nax, Bary S. R. Pradelski, Marek Pycia

2602.15600 2026-02-18 cs.SI cs.AI econ.EM stat.AP

The geometry of online conversations and the causal antecedents of conflictual discourse

Carlo Santagiustina, Caterina Cruciani

Details

DOI: 10.13140/RG.2.2.34796.22409

English abstract

This article investigates the causal antecedents of conflictual language and the geometry of interaction in online threaded conversations related to climate change. We employ three annotation dimensions, inferred through LLM prompting and averaging, to capture complementary aspects of discursive conflict (such as stance: agreement vs disagreement; tone: attacking vs respectful; and emotional versus factual framing) and use data from a threaded online forum to examine how these dimensions respond to temporal, conversational, and arborescent structural features of discussions. We show that, as suggested by the literature, longer delays between successive posts in a thread are associated with replies that are, on average, more respectful, whereas longer delays relative to the parent post are associated with slightly less disagreement but more emotional (less factual) language. Second, we characterize alignment with the local conversational environment and find strong convergence both toward the average stance, tone and emotional framing of older sibling posts replying to the same parent and toward those of the parent post itself, with parent post effects generally stronger than sibling effects. We further show that early branch-level responses condition these alignment dynamics, such that parent-child stance alignment is amplified or attenuated depending on whether a branch is initiated in agreement or disagreement with the discussion's root message. These influences are largely additive for civility-related dimensions (attacking vs respectful, disagree vs agree), whereas for emotional versus factual framing there is a significant interaction: alignment with the parent's emotionality is amplified when older siblings are similarly aligned.

2602.15559 2026-02-18 stat.ME econ.EM math.ST stat.ML stat.TH

Fixed-Horizon Self-Normalized Inference for Adaptive Experiments via Martingale AIPW/DML with Logged Propensities

Gabriel Saco

Comments: 32 pages. Comments welcome

2512.21176 2026-02-18 econ.EM stat.ME

Difference-in-Differences in the Presence of Unknown Interference

Fabrizia Mealli, Javier Viviens

2511.09424 2026-02-18 econ.TH

Posterior-Separable Costs and Menu Preferences

Henrique de Oliveira, Jeffrey Mensch

2511.04299 2026-02-18 econ.GN q-fin.EC

Measuring economic outlook in the news

Elliot Beck, Franziska Eckert, Linus Kühne, Helge Liebert, Rina Rosenblatt-Wisch

2511.02764 2026-02-18 econ.EM

Peer effect analysis with latent processes

Vincent Starck

2506.15723 2026-02-18 q-fin.ST cs.LG econ.GN q-fin.EC stat.AP

Modern approaches to building interpretable models of the property market using machine learning on the base of mass cadastral valuation

Alexey S. Tanashkin, Irina G. Tanashkina, Alexander S. Maksimchuik

Comments: 62 pages, 21 figures, 11 tables; after the major revision, accepted in journal Land Use Policy; changes: literature review is added to introduction section, new conclusion, comparison of the models with the random forest is added, the feature selection section is reconsidered, many minor corrections, language sufficiently improved

Details

DOI: 10.1016/j.landusepol.2026.107970
Journal ref: Land Use Policy, Volume 165, 2026, 107970

English abstract

In this paper, we review modern approaches to building interpretable models of property markets using machine learning, based on the mass valuation of property in the Primorye region, Russia. There are numerous potential difficulties one can encounter in the effort to build a good model. Their main source is the huge difference between noisy real market data and the idealized data usually used in machine learning tutorials. This paper covers all stages of modeling: collection of initial data, identification of outliers, search for and analysis of patterns in the data, formation and final choice of price factors, building of the model, and evaluation of its efficiency. For each stage, we highlight potential issues and describe sound methods for overcoming emerging difficulties using actual examples. We show that combining classical linear regression with kriging (a geostatistical interpolation method) makes it possible to build an effective model for land parcels. For flats, where many objects are attributed to one spatial point, the application of geostatistical methods becomes problematic. Instead, we suggest linear regression with automatic generation and selection of additional rules based on decision trees, the so-called RuleFit method. We compare the performance of our inherently interpretable models with the well-proven "black-box" Random Forest method and demonstrate similar results. Thus we show that, despite such a strong restriction as the requirement of interpretability, which is important in practical settings such as legal matters, it is still possible to build effective models of real property markets.

2602.15312 2026-02-18 cs.CL econ.EM

Extracting Consumer Insight from Text: A Large Language Model Approach to Emotion and Evaluation Measurement

Stephan Ludwig, Peter J. Danaher, Xiaohao Yang, Yu-Ting Lin, Ehsan Abedin, Dhruv Grewal, Lan Du

2602.15289 2026-02-18 econ.EM

A Projection Approach to Nonparametric Significance and Conditional Independence Testing

Xiaojun Song, Jichao Yuan

2602.15246 2026-02-18 econ.TH

Learning Against Nature: Minimax Regret and the Price of Robustness

Yeon-Koo Che, Longjian Li, Tianling Luo

2602.15069 2026-02-18 physics.soc-ph cs.CY econ.GN q-fin.EC

Travel Time Prediction from Sparse Open Data

Geoff Boeing, Yuquan Zhou

2602.13537 2026-02-18 econ.EM

Cluster-Robust Inference for Quadratic Forms

Michal Kolesár, Pengjin Min, Wenjie Wang, Yichong Zhang

2602.12958 2026-02-18 econ.GN q-fin.EC

The Directions of Technical Change

Miklos Koren, Zsofia Barany, Ulrich Wohak

Comments: We have revised the introduction and the discussion section to emphasize the economics rather than the mathematical results. We have fixed a typo in Section 3.2 equation. Otherwise same content

2511.17117 2026-02-18 stat.CO econ.EM

Modified Delayed Acceptance MCMC for Quasi-Bayesian Inference with Linear Moment Conditions

Masahiro Tanaka

2507.10679 2026-02-18 stat.CO econ.EM stat.ME

FARS: Factor Augmented Regression Scenarios in R

Gian Pietro Bellocca, Ignacio Garrón, Vladimir Rodríguez-Caballero, Esther Ruiz

2505.22862 2026-02-18 econ.TH

Optimal Auction Design for Dynamic Stochastic Environments: Myerson Meets Naor

Yeon-Koo Che, Andrew B. Choi

2303.15483 2026-02-18 math.CO econ.TH

On Smithson's fixed point theorem for order preserving multifunctions

Haruki Kono, Mark Voorneveld

2205.13186 2026-02-18 econ.GN q-fin.EC

Sovereign Hold-Up and Technology Adoption: Evidence from the North Sea

Michele Fioretti, Alessandro Iaria, Aljoscha Janssen, Clément Mazet-Sonilhac, Robert K. Perrons