arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.08956 2026-03-25 econ.GN cs.LG q-fin.EC

A Survey of Reinforcement Learning For Economics

Pranjal Rawat

详情

英文摘要

This survey (re)introduces reinforcement learning methods to economists. The curse of dimensionality limits how far exact dynamic programming can be effectively applied, forcing us to rely on suitably "small" problems or our ability to convert "big" problems into smaller ones. While this reduction has been sufficient for many classical applications, a growing class of economic models resists such reduction. Reinforcement learning algorithms offer a natural, sample-based extension of dynamic programming, extending tractability to problems with high-dimensional states, continuous actions, and strategic interactions. I review the theory connecting classical planning to modern learning algorithms and demonstrate their mechanics through simulated examples in pricing, inventory control, strategic games, and preference elicitation. I also examine the practical vulnerabilities of these algorithms, noting their brittleness, sample inefficiency, sensitivity to hyperparameters, and the absence of global convergence guarantees outside of tabular settings. The successes of reinforcement learning remain strictly bounded by these constraints, as well as a reliance on accurate simulators. When guided by economic structure, reinforcement learning provides a remarkably flexible framework. It stands as an imperfect, but promising, addition to the computational economist's toolkit. A companion survey (Rust and Rawat, 2026b) covers the inverse problem of inferring preferences from observed behavior. All simulation code is publicly available.

URL PDF HTML ☆

赞 0 踩 0

2510.23421 2026-03-25 econ.GN cs.AI q-fin.EC

Quantifying Systemic Vulnerability in the Foundation Model Industry

Claudio Pirrone, Stefano Fricano, Gioacchino Fazio

Comments Conference Paper - SIEPI (29-30 January 2026) - Bari

2305.11523 2026-03-25 econ.GN q-fin.EC

AI Regulation in the European Union: Examining Non-State Actor Preferences

Jonas Tallberg, Magnus Lundgren, Johannes Geith

2603.23300 2026-03-25 q-fin.PM cs.AI cs.MA q-fin.ST

Designing Agentic AI-Based Screening for Portfolio Investment

Mehmet Caner, Agostino Capponi, Nathan Sun, Jonathan Y. Tan

2603.23024 2026-03-25 econ.GN q-fin.EC

Heart Failure's First Shock and Nurse-Led Chronic Care

Moslem Rashidi, Luke B. Connelly, Gianluca Fiorentini

2603.22886 2026-03-25 cs.LG q-fin.GN q-fin.ST

Conditionally Identifiable Latent Representation for Multivariate Time Series with Structural Dynamics

Minkey Chang, Jae-Young Kim

Comments Accepted paper for 2026 ICLR FINAI workshop

2603.22880 2026-03-25 q-fin.GN cs.CE q-fin.PM

Portfolio Optimization under Recursive Utility via Reinforcement Learning

Minkey Chang

2603.22831 2026-03-25 cs.CE q-fin.MF

Option pricing model under the G-expectation framework

Ziting Pei, Xingye Yue, Xiaotao Zheng

2603.22569 2026-03-25 q-fin.RM stat.ME

Proxy-Reliance Control in Conformal Recalibration of One-Sided Value-at-Risk

Tenghan Zhong

Comments 44 pages, 4 figures, 9 tables, appendix included

2602.07023 2026-03-25 q-fin.TR cs.AI

Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation

Zeping Li, Guancheng Wan, Keyang Chen, Yu Chen, Yiwen Zhao, Philip Torr, Guangnan Ye, Zhenfei Yin, Hongfeng Chai

2512.06033 2026-03-25 cs.CR econ.GN q-fin.EC

Sell Data to AI Algorithms Without Revealing It: Secure Data Valuation and Sharing via Homomorphic Encryption

Michael Yang, Ruijiang Gao, Zhiqiang Zheng

2511.19186 2026-03-25 q-fin.PM

Carbon-Penalised Portfolio Insurance Strategies in a Stochastic Factor Model with Partial Information

Katia Colaneri, Federico D'Amario, Daniele Mancinelli