arXivDaily arXiv每日学术速递 周一至周五更新
重置
2604.19627 2026-04-22 econ.GN q-fin.EC

Comment on "The Forsaken Road: Reassessing Living Standards Following the Cuban Revolution and the American Embargo"

Francisco Rodríguez

Comments Comment on http://dx.doi.org/10.2139/ssrn.5235912

详情
英文摘要

Bastos, Geloso, and Bologna Pavlik (2026) argue that the US embargo explains less than one tenth of the difference in per capita income between Cuba and a counterfactual scenario in which the country did not follow socialist economic policies. We show that their results are driven by the use of an elasticity of income to trade openness that is neither representative nor a reasonable upper bound of the values found in the literature and by their choice to attribute the effect of the interaction between the embargo and other determinants of growth solely to those other determinants. We show that, once these problems are corrected, the embargo can account for a substantial fraction, and in some cases all, of Cuba's post 1959 economic underperformance.

2604.19580 2026-04-22 q-fin.ST econ.EM q-fin.PM stat.AP

Probabilistic Forecasting for Day-ahead Electricity Prices, Battery Trading Strategies and the Economic Evaluation of Predictive Accuracy

Simon Hirsch, Florian Ziel

Comments 30 pages, 15 figures, 5 pages supplementary materials

详情
英文摘要

Electricity price forecasting supports decision-making in energy markets and asset operation. Probabilistic forecasts are increasingly adopted to explicitly quantify uncertainty, typically issued as quantile predictions or ensembles of the full predictive distribution. However, how improvements in statistical forecast quality translate into economic value remains unclear. Battery storage arbitrage in day-ahead markets is a popular application-based benchmark for this purpose. We analyze quantile-based trading strategies (QBTS) and identify two critical flaws: they do not incentivize honest probabilistic forecasting and they ignore the intertemporal dependence structure of electricity prices. We therefore frame battery optimization as a stochastic program based on fully probabilistic forecasts and examine decision quality measurement for risk-neutral and risk-averse settings under different uncertainty models. Our discussion touches both sides of the coin: How reliable is the economic evaluation of forecasting models though (simplified) application studies - and how do improvements in statistical forecast quality for stochastic programs relate to the decision-quality and economic performance? We provide theoretical justification and empirical evidence from a case study on the German electricity market. Our results highlight the pitfalls of ranking forecasting models through battery trading strategies. We conclude with implications for evaluation practice and directions for future research in application-based forecast assessment.

2603.17463 2026-04-22 stat.AP econ.EM q-fin.RM q-fin.ST

Multivariate GARCH and portfolio variance prediction: A forecast reconciliation perspective

Massimiliano Caporin, Daniele Girolimetto, Emanuele Lopetuso

详情
英文摘要

We assess the advantage of combining univariate and multivariate portfolio risk forecasts with the aid of forecast reconciliation techniques. In our analyzes, we assume knowledge of portfolio weights, a standard for portfolio risk management applications. With an extensive simulation experiment, we show that, if the true covariance is known, forecast reconciliation improves over a standard multivariate approach, in particular when the adopted multivariate model is misspecified. However, if noisy proxies are used, correctly specified models and the misspecified ones (for instance, neglecting spillovers) turn out to be, in several cases, indistinguishable, with forecast reconciliation still providing improvements. The noise in the covariance proxy plays a crucial role in determining the improvement of both the forecast reconciliation and the correct model specification. An empirical analysis shows how forecast reconciliation can be adopted with real data to improve traditional GARCH-based portfolio variance forecasts.

2505.07913 2026-04-22 econ.GN q-fin.EC

Continental-scale assessment of spatial food market accessibility in Africa using open geospatial data

Robert Benassai-Dalmau, Vasiliki Voukelatou, Rossano Schifanella, Stefania Fiandrino, Daniela Paolotti, Kyriaki Kalimeri

Comments 23 pages, 5 figures

详情
英文摘要

Food market accessibility is a critical yet underexplored dimension of food systems, particularly in low- and middle-income countries. In this paper, we present a continent-wide assessment of spatial food market accessibility in Africa, integrating open geospatial data from OpenStreetMap and the World Food Programme. We compare three complementary metrics: travel time to the nearest market, market availability within a 30-minute threshold, and an entropy-based measure of spatial distribution, to quantify accessibility across diverse settings. Our analysis reveals pronounced disparities: rural and economically disadvantaged populations face substantially higher travel times, limited market reach, and less spatial redundancy. These accessibility patterns align with socioeconomic stratification, as measured by the Relative Wealth Index, and moderately correlate with food insecurity levels, assessed using the Integrated Food Security Phase Classification. We find pronounced disparities in accessibility: rural and economically disadvantaged populations face substantially longer travel times and reduced market availability, with some areas requiring several hours of travel. Overall, results suggest that access to food markets reflects broader geographic and economic inequalities and plays a relevant role in shaping food security outcomes. This framework provides a scalable, data-driven approach for identifying underserved regions and supporting equitable infrastructure planning and policy design across diverse African contexts.

2604.19290 2026-04-22 q-fin.CP q-fin.MF q-fin.PR q-fin.RM stat.ME

Orthogonal reparametrization of the Nelson-Siegel-Svensson interest rate curve model: conditioning, diagnostics, and identifiability

Robert Flassig, Emrah Gülay, Daniel Guterding

Comments 28 pages, 10 figures

详情
英文摘要

The Nelson-Siegel-Svensson (NSS) interest rate curve model yields a separable nonlinear least-squares problem whose inner linear block is often ill-conditioned because the basis functions become nearly collinear. We analyze this instability via an exact orthogonal reparametrization of the design matrix. A thin QR decomposition produces orthogonal linear parameters for which, conditional on the nonlinear parameters, the Fisher information matrix is diagonal. We also derive a finite-horizon analytical orthogonalization: on $[0,T]$, the $4\times 4$ continuous Gram matrix has closed-form entries involving exponentials, logarithms, and the exponential integral $E_1$, yielding an explicit horizon-dependent orthogonal NSS basis. Together with Jacobian-rank and profile-likelihood arguments, this representation clarifies the degenerate manifold $λ_1=λ_2$, where the Svensson extension loses two degrees of freedom. Orthogonalization leaves the least-squares fit and uncertainty of the original linear parameters unchanged, but isolates the conditioning structure. When the decay parameters are estimated jointly, the full first-order covariance in orthogonal coordinates admits an explicit Schur-complement form. The approach also yields a scalar identifiability diagnostic through the QR element $R_{44}$ and separates model reduction from numerical instability. Synthetic experiments confirm that orthogonal parametrization eliminates correlations among the linear parameters and keeps their conditional uncertainty uniform. A daily U.S. Treasury study on a reduced fixed 9-tenor grid from 1981 to 2026 shows smoother orthogonal parameter series than classical NSS parameters while the moving QR basis remains nearly constant.

2604.19260 2026-04-22 econ.GN q-fin.EC

Understanding the Mechanism of Altruism in Large Language Models

Shuhuai Zhang, Shu Wang, Zijun Yao, Chuanhao Li, Xiaozhi Wang, Songfa Zhong, Tracy Xiao Liu

详情
英文摘要

Altruism is fundamental to human societies, fostering cooperation and social cohesion. Recent studies suggest that large language models (LLMs) can display human-like prosocial behavior, but the internal computations that produce such behavior remain poorly understood. We investigate the mechanisms underlying LLM altruism using sparse autoencoders (SAEs). In a standard Dictator Game, minimal-pair prompts that differ only in social stance (generous versus selfish) induce large, economically meaningful shifts in allocations. Leveraging this contrast, we identify a set of SAE features (0.024% of all features across the model's layers) whose activations are strongly associated with the behavioral shift. To interpret these features, we use benchmark tasks motivated by dual-process theories to classify a subset as primarily heuristic (System 1) or primarily deliberative (System 2). Causal interventions validate their functional role: activation patching and continuous steering of this feature direction reliably shift allocation distributions, with System 2 features exerting a more proximal influence on the model's final output than System 1 features. The same steering direction generalizes across multiple social-preference games. Together, these results enhance our understanding of artificial cognition by translating altruistic behaviors into identifiable network states and provide a framework for aligning LLM behavior with human values, thereby informing more transparent and value-aligned deployment.

2604.19178 2026-04-22 econ.GN q-fin.EC

A rapid evaluation of Australia's COVID-era apprentice wage subsidy programs

Peter Bowers, Patrick Rehill, Ethan Slaven

Comments 26 pages, 9 figures

详情
英文摘要

In the midst of the COVID-19 pandemic in 2020, the Australian Government launched two programs to incentivise new apprentices to start and complete apprenticeships -- the Boosting Apprenticeship Commencements (BAC) and Completing Apprenticeship Commencements (CAC) programs. These programs were wage subsidies to encourage employers to take on or retain apprentices. This paper evaluates the impact of these programs on apprenticeship commencements and completions taking a mixed-methods approach combining econometric modelling and interviews with stakeholders including employers and peak bodies. The programs led to a 70\% increase in commencement of apprenticeships but do not seem to have boosted retention rates. There appears to be a small increase in cancellation rates suggesting lower eventual completion rates compared to previous cohorts. Cancellation rates were higher for non-trade commencements (7\% increase) during BAC, but slightly lower for trade commencements (0.7\% decrease). We find this effect in non-trade apprenticeships was likely driven by `sharp practice' where some employers took advantage of the BAC by converting existing employees over to apprenticeships to attract the wage subsidy with no intention of having these employees stay as apprentices beyond the period of the BAC's generous subsidy. While the BAC / CAC were successful in many of their goals, there are several lessons that can be learnt from its design. In particular, the need to implement the program quickly meant early design choices inadvertently encouraged `sharp practice' and a rush for places that placed strain on the training sector. However, employers appreciated the front-loading of payments which provided the most financial support when apprentices were new and at their least productive.

2604.19107 2026-04-22 q-fin.ST

Structural Dynamics of G5 Stock Markets During Exogenous Shocks: A Random Matrix Theory-Based Complexity Gap Approach

Kundan Mukhia, Imran Ansari, Md. Nurujjaman

详情
英文摘要

We identify a robust structural signature of stock markets during exogenous shock events by analyzing collective return dynamics across G5 countries. Using Random Matrix Theory, we introduce the complexity gap, defined as the difference between the normalized largest eigenvalue and the average pairwise correlation, to quantify changes in market structure. This measure reveals a consistent three-phase pattern across multiple shocks, including the 2025 U.S. tariff event, the COVID-19 crisis, and country-specific shocks in Japan and China during 2024. Before a shock, markets show a positive complexity gap, reflecting a rich structure with multiple interacting factors. During shocks, the gap collapses to near zero, signaling strong synchronization under a single dominant mode. Post-shock recovery follows a nonmonotonic path: an initial widening (a false recovery), a temporary recollapse, and final sustained restoration. This pattern holds at both market and sector levels and across global and local shocks. Ordinal entropy analysis confirms the same sequence of collapse and false recovery in directional diversity. We further demonstrate that lower complexity gap values predict higher future portfolio volatility, especially after shocks, establishing its value as a state-dependent risk indicator. For investors, initial gap widening may mislead, while sustained widening signals genuine structural stabilization. These findings reveal a robust structural signature governing financial market dynamics during crisis and recovery periods.

2604.18821 2026-04-22 q-fin.PM

Evaluating Structured Strategy Backtests: Peer Benchmarks, Regime Timing, and Live Performance

Chang Liu

详情
英文摘要

Institutional allocators often evaluate structured strategies on the basis of marketed backtests -- hypothetical track records constructed by applying a strategy's rules to historical data prior to any live trading, also referred to as pro-forma performance. It is unclear how much of that signal survives once the strategy is actually traded. Using 1,726 commercially distributed structured strategies from ten global institutions, this paper shows that raw pro-forma performance has only limited portability into the live period and weakens sharply once live outcomes are measured relative to peer and external benchmarks. The evidence indicates that marketed backtests predominantly reflect the common factor regime present before launch rather than strategy-specific skill. Strategies launched after unusually strong bucket-factor conditions experience materially worse subsequent deterioration. For allocators, the implication is practical: backtests should be judged relative to appropriate peer benchmarks, and the discount applied to them should increase when launch occurs after an extreme factor run.

2604.18605 2026-04-22 q-fin.GN math.PR stat.AP

Exploring Drivers of Extreme Housing Prices in Australia

Grace Burtenshaw, Ashley Burtenshaw, Meagan Carney

详情
英文摘要

In recent years Australia has observed a growing, unexplained resilience of increasing house price trends. Here, we seek to understand what is driving Australia's indestructible asset using insights from market experts. We construct a differential equation model of house price to develop intuition for its historical behaviour and responsiveness to changes in mortgage rates. Using this model, we identify a point of 'decoupling' between house price and mortgage rate in the system with supply limitations found to be the main driver for this change. From there, modern extreme value techniques are implemented on real-world data to investigate how the effectiveness of mortgage rate in moderating extreme house price has changed before and after this historical decoupling. We find that without an increase in the housing supply chain, through either deregulation or reduced competition with government building, an 11\% increase in mortgage rate will be needed to slow extreme housing costs.

2604.17579 2026-04-22 q-fin.RM q-fin.MF

Vault as a credit instrument

Anastasiia Zbandut, Carolina Goldstein

详情
英文摘要

We derive five tractable credit risk metrics for DeFi lending vault depositors, grounded in a formal three level decomposition of vault risk into mechanical loss channels (Level 1), governance quality (Level 2) and smart contract code integrity (Level 3). For Level 1, we show that six structural features of onchain execution (oracle execution divergence, endogenous recovery, full information run dynamics, timelock constrained governance, oracle manipulation and congestion driven liquidation failure) break canonical TradFi analogies and generate depositor loss channels absent from standard credit frameworks. Vault credit risk metrics translate these channels into measurable risk components which are aggregated into a vault credit score. The empirical contribution is an implementable estimation architecture for credit risk metrics, including required onchain data, identification strategies for core parameters, partial identification bounds and a coherent stress scenario methodology. The results have direct implications for vault risk management and for minimum transparency standards necessary for depositor risk assessment.

2601.21272 2026-04-22 econ.EM q-fin.PR q-fin.ST

Finite-Sample Properties of Model Specification Tests for Multivariate Dynamic Regression Models

Koichiro Moriya, Akihiko Noda

Comments 57 pages; 4 figures; 9 tables

详情
英文摘要

We propose a new model specification test for multiple-equation systems with cross-equation error and dynamic regressor--error dependences. Conventional tests often rely on exogeneity conditions strong enough to ensure consistency of the OLS estimator. These exogeneity conditions are violated when regressors and errors are dynamically dependent, rendering conventional model specification tests invalid. To address these limitations, we clarify the relationship among alternative exogeneity conditions, characterize the consistency of competing multiple-equation estimators, and propose a generalized Durbin estimator for multiple-equation systems with an intercept, cross-equation error and regressor--error dependences. We show that our estimator remains consistent under the weakest exogeneity condition. We then derive its asymptotic distribution and construct Wald tests. Our Monte Carlo experiments confirm that the bootstrap-based Wald test substantially improves finite-sample size control. An application of the bootstrap-based Wald test to the Fama--French multifactor models leaves the null hypothesis unrejected in cases where competing FGLS-based tests reject it.

2508.20467 2026-04-22 q-fin.PM cs.LG q-fin.CP

QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

Jingfeng Pan, Jiahao Chen

详情
英文摘要

In the highly volatile and uncertain global financial markets, traditional quantitative trading models relying on statistical modeling or empirical rules often fail to adapt to dynamic market changes and black swan events due to rigid assumptions and limited generalization. To address these issues, this paper proposes QTMRL (Quantitative Trading Multi-Indicator Reinforcement Learning), an intelligent trading agent combining multi-dimensional technical indicators with reinforcement learning (RL) for adaptive and stable portfolio management. We first construct a comprehensive multi-indicator dataset using 23 years of S&P 500 daily OHLCV data (2000-2022) for 16 representative stocks across 5 sectors, enriching raw data with trend, volatility, and momentum indicators to capture holistic market dynamics. Then we design a lightweight RL framework based on the Advantage Actor-Critic (A2C) algorithm, including data processing, A2C algorithm, and trading agent modules to support policy learning and actionable trading decisions. Extensive experiments compare QTMRL with 9 baselines (e.g., ARIMA, LSTM, moving average strategies) across diverse market regimes, verifying its superiority in profitability, risk adjustment, and downside risk control. The code of QTMRL is publicly available at https://github.com/ChenJiahaoJNU/QTMRL.git

2507.01918 2026-04-22 q-fin.PM cs.AI math.OC physics.data-an stat.ML

End-to-End Large Portfolio Optimization for Variance Minimization with Neural Networks through Covariance Cleaning

Christian Bongiorno, Efstratios Manolakis, Rosario Nunzio Mantegna

详情
Journal ref
The Journal of Finance and Data Science, 12, (2026) 100179
英文摘要

We develop a rotation-invariant neural network that provides the global minimum-variance portfolio by jointly learning how to lag-transform historical returns and marginal volatilities and how to regularise the eigenvalues of large equity covariance matrices. This explicit mathematical mapping offers clear interpretability of each module's role, so the model cannot be regarded as a pure black box. The architecture mirrors the analytical form of the global minimum-variance solution yet remains agnostic to dimension, so a single model can be calibrated on panels of a few hundred stocks and applied, without retraining, to one thousand US equities, a cross-sectional jump that indicates robust generalization capability. The loss function is the future short-term realized minimum variance and is optimized end-to-end on real returns. In out-of-sample tests from January 2000 to December 2024, the estimator delivers systematically lower realized volatility, smaller maximum drawdowns, and higher Sharpe ratios than the best competitors, including state-of-the-art non-linear shrinkage, and these advantages persist across both short and long evaluation horizons despite the model's training focus is short-term. Furthermore, although the model is trained end-to-end to produce an unconstrained minimum-variance portfolio, we show that its learned covariance representation can be used in general optimizers under long-only constraints with virtually no loss in its performance advantage over competing estimators. These advantages persist when the strategy is executed under a highly realistic implementation framework that models market orders at the auctions, empirical slippage, exchange fees, and financing charges for leverage, and they remain stable during episodes of acute market stress.

2502.14479 2026-04-22 q-fin.RM q-fin.ST stat.AP

Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework

Arno Botha, Tanja Verster, Roland Breedt

Comments 37 pages, 10013 words, 10 figures

详情
英文摘要

The lifetime behaviour of loans is notoriously difficult to model, which can compromise a bank's financial reserves against future losses, if modelled poorly. Therefore, we present a data-driven comparative study amongst three techniques in modelling a series of default risk estimates over the lifetime of each loan, i.e., its term-structure. The behaviour of loans can be described using a nonstationary and time-dependent semi-Markov model, though we model its elements using a multistate regression-based approach. As such, the transition probabilities are explicitly modelled as a function of a rich set of input variables, including macroeconomic and loan-level inputs. Our modelling techniques are deliberately chosen in ascending order of complexity: 1) a Markov chain; 2) beta regression; and 3) multinomial logistic regression. Using residential mortgage data, our results show that each successive model outperforms the previous, likely as a result of greater sophistication. This finding required devising a novel suite of simple model diagnostics, which can itself be reused in assessing sampling representativeness and the performance of other modelling techniques. These contributions surely advance the current practice within banking when conducting multistate modelling. Consequently, we believe that the estimation of loss reserves will be more timeous and accurate under IFRS 9.

2502.12141 2026-04-22 econ.GN q-fin.EC stat.ME

Potato Potahto in the FAO-GAEZ Productivity Measures? Nonclassical Measurement Error with Multiple Proxies

Rafael Araujo, Vitor Possebom

详情
英文摘要

The FAO-GAEZ productivity data are widely used in Economics. However, the empirical literature rarely discusses measurement error. We use two proxies to derive analytical bounds around the effect of agricultural productivity in a setting with nonclassical measurement error. These bounds rely on assumptions weaker than those imposed in empirical studies and exhaust the information contained in the first two data moments. We reevaluate three influential studies, finding wide intervals around the effects of agricultural productivity. These results call for caution, highlighting the limits of our knowledge about these effects. Our methodology has broad applications in empirical research involving mismeasured variables.

2409.18660 2026-04-22 econ.GN cs.AI cs.HC q-fin.EC

Who Benefits from AI? Self-Selection, Skill Gap, and the Hidden Costs of AI Feedback

Christoph Riedl, Eric Bogert

详情
英文摘要

Feedback from artificial intelligence (AI) is increasingly easy to access and research has already established that people learn from it. But individuals choose when and how to seek such feedback, and more engaged and motivated individuals may seek it more, creating an illusion of effectiveness that masks self-selection. We investigate how the endogenous choice to seek AI feedback shapes both individual learning and collective outcomes. Using data from over five years and 52,000 individuals on an online chess platform, we show that motivated and higher-skilled individuals self-select into AI feedback use-and use it more productively. This self-selection creates an illusion of AI effectiveness: apparent learning gains disappear once endogenous motivation is accounted for. This same selection mechanism drives two population-level consequences. Because motivated, higher-skilled individuals benefit disproportionately, AI access widens the skill gap. And because individuals exposed to centralized AI feedback converge on common input from a centralized AI source, intellectual diversity declines. Leveraging 42 platform-level natural experiments, we show this diversity reduction is causal. Self-selection into AI use thus connects individual-level learning dynamics to collective-level consequences-a micro-macro linkage with implications for organizational learning, human capital development, and the design of AI-augmented work.

2405.10498 2026-04-22 econ.GN q-fin.EC

A Deep Learning Approach to Heterogeneous Consumer Aesthetics in Fast Fashion

Pranjal Rawat

详情
英文摘要

Aesthetics drives product differentiation in industries such as fashion, interior decor, luxury goods, real estate and hospitality. However, visual differentiation is hard to encode in formal economic analysis. This paper analyses millions of purchase records from H\&M in the Netherlands, including product images, text descriptions, prices, and consumer demographics. I fine-tune Fashion CLIP embeddings with a three-tower approach that builds separate channels for product visuals and text, consumer history, and price, which makes downstream analysis tractable and scalable. The embeddings feed a latent-class deep demand system that captures price and taste sensitivities through deep nets, recovers rich substitution patterns, reveals meaningful heterogeneity, and performs much better than competing alternatives. Then, a supply-side inversion recovers sensible markups and costs and supports conduct tests and counterfactuals on sustainability practices. I also estimate machine learning hedonic pricing models that perform much better than competing alternatives. This model allows us to construct quality-adjusted price indices, make it possible to price completely new designs, and with an Oaxaca-Blinder decomposition reveal the underlying sources of price changes. Finally, a Poisson event study around the COVID-19 lockdown shows that the range of demand responses across embedding-based product and user clusters exceeds anything recoverable from simple text-based attributes or demographic labels alone. The methodology is portable to any market where products are differentiated along sensory dimensions that are hard to encode but meaningfully important for consumer choices.