Dave Rapach - Working Papers

Working Papers

The Anatomy of Machine Learning-Based Portfolio Performance (with Philippe Goulet Coulombe, Erik Christian Montes Schütte, and Sander Schwenk-Nebbe)

Abstract: Asset return predictability is routinely assessed by economic value: based on a set of predictors, out-of-sample return forecasts are generated—increasingly via “black box” machine learning models—which serve as inputs for portfolio construction, and performance metrics are computed over an evaluation period. We develop a flexible methodology based on Shapley values—the Shapley-based portfolio performance contribution (SPPC)—to exactly allocate the contributions of individual or groups of predictors to a performance metric. We illustrate the SPPC in an empirical application measuring the economic value of cross-sectional stock return predictability using a large number of firm characteristics and machine learning.

[Presented as part of the EDHEC Speaker Series—The Future of Finance—Season 5 (slides) Presented at the 2025 Wolfe Research 2nd Annual Canadian Quantitative and Macro Investment Conference | Presented at the 2024 CEMFI Workshop on Big Data in Asset Management | Presented at the 2024 European Economic Association and Econometric Society European Meeting (EEA-ESEM) | Presented at the 2024 International Symposium on Forecasting | Presented at the 6th Future of Financial Information Conference | Presented in the Applied Machine Learning, Economics, and Data Science (AMLEDS) Webinar Series | Subject of Machine Learning & Quant Finance blog post]

Economic Fundamentals and Short-Run Exchange Rate Prediction: A Machine-Learning Perspective (with Ilias Filippou, Mark P. Taylor, and Guofu Zhou)

Abstract: This paper establishes the out-of-sample predictability of monthly exchange rates based on economic fundamentals using country characteristics, global variables, and their interactions. Previous work does not find consistent evidence of short-horizon predictability, likely due to using a small set of fundamentals and inadequately capturing time variation and nonlinearities in predictive relations. By employing a large set of economic fundamentals and global variables in conjunction with machine-learning techniques, we are able to consistently and significantly outperform the stringent no-change benchmark forecast. We find stronger predictability during periods of crisis and recession. The exchange rate forecasts are also economically valuable, as they generate sizable utility gains for an investor in the context of foreign currency portfolios. To enhance our understanding of the economic drivers of exchange rate predictability, we identify the most relevant predictors for forecasting exchange rates in the fitted machine-learning models.

[Internet Appendix | Revise and resubmit to the Journal of Financial and Quantitative Analysis | Presented at the 2021 Vienna Symposium on Foreign Exchange Markets]

The Anatomy of Out-of-Sample Forecasting Accuracy: A Shapley-Based Approach (with Daniel Borup, Philippe Goulet Coulombe, Erik Christian Montes Schütte, and Sander Schwenk-Nebbe)

Abstract: We propose the global performance-based Shapley value (GPBSV) to measure the contributions made by each of the individual predictors in fitted time-series forecasting models to the loss over the out-of-sample period. The GPBSVs for the individual predictors sum to the out-of-sample loss, so our new metric produces an exact decomposition of out-of-sample performance—in essence, we anatomize out-of-sample forecasting performance. The GPBSV is model agnostic and can be used for any loss function. We illustrate our new metric in an application forecasting US inflation using a large number of predictors and variety of machine learning models.

[Online Appendix | Revise and resubmit to the Journal of Applied Econometrics | Python package anatomy | Presented at the 12th European Central Bank Conference on Forecasting Techniques | Earlier version: Federal Reserve Bank of Atlanta Working Paper 2022-16]

Improving Hedge Fund Return Prediction: Dealing with Missing Data via Deep Learning (with Ilias Filippou, Ioannis Psaradellis, and Lazaros Zagrafopoulos)

Abstract: We study the important issue of handling missing values in hedge fund data. We employ a deep learning approach, BRITS, for filling in missing values for both hedge fund returns and fund characteristics used to predict returns. BRITS harnesses information from past and future observations in the time-series dimension as well as contemporaneous observations in the cross-sectional dimension. Compared to alternative data imputation methods, BRITS provides high fidelity for data imputation in experiments. Moreover, BRITS substantially improves out-of-sample hedge fund return prediction with neural networks in terms of filling in missing values for the training samples. BRITS appears especially valuable for generating hedge fund return predictions to guide the dynamic selection of top-performing hedge funds.

[Internet Appendix (in progress) | Slides | To be presented at the EUROFIDAI-ESSEC Paris December 2025 Finance Meeting | Presented at the 2025 FMA Annual Meeting | Presented at the 2025 Alpine Finance Summit | Presented at the 2025 EUROFIDAI-ESSEC Paris December Finance Meeting (slides)]

Going Supranational: Anomaly-Market Links and New Dimensions of Market Efficiency (with Xi Dong, Yan Li, Yanran Li, and Guofu Zhou)

Abstract: We connect cross-sectional anomalies to time-series market return predictability using data from 44 non-US countries. While a large set of representative anomaly returns show limited predictive power for market returns at the country level, they exhibit strong predictive ability when aggregated to the supranational level. We develop an international analytical framework to explain this difference: cross-sectional mispricing corrections in one country can propagate into market-wide corrections in another, enhancing supranational predictability precisely when mispricing is more country-specific than global. We further decompose anomaly-market links into three analytically grounded market (in)efficiency measures of broad relevance: systematic mispricing, overpricing dominance, and price randomness. Supported by data, they govern the strength and nature of anomaly-market links across global markets.

[Presented at the 2025 Midwest Finance Association Meeting]

Sparse Macro-Finance Factors (with Guofu Zhou)

Abstract: We estimate sparse principal components from a large set of macro-finance variables. Each component is a sparse linear combination of the underlying variables, enhancing economic interpretability and yielding sharper asset pricing signals. Innovations to the components constitute a set of sparse macro-finance factors. Robust tests show that sparse factors tied to housing, yields, and credit spreads earn significant risk premia. Among the top 20 factor models formed from prominent characteristic-based factors and mimicking portfolios for the housing, yield, and credit spread factors, the latter three play leading roles, highlighting the importance of sparse macro-finance factors for capturing systematic risks.

[Internet Appendix | Sparse Macro-Finance Factors website | Best Paper Prize, INQUIRE UK & Europe Spring 2019 Residential Joint Conference]

Cryptocurrency Return Predictability: A Machine-Learning Analysis (with Ilias Filippou and Christoffer Thimsen; older version—currently being revised)

Abstract: We investigate the out-of-sample predictability of daily cryptocurrency returns using modern machine-learning methods. We consider a large number of cryptocurrencies (41) and a rich set of predictors relating to network value and activity, momentum, technical signals, and online activity. We find that return predictability is an important feature of the cryptocurrency market: machine-learning methods significantly improve the statistical accuracy of cryptocurrency return forecasts and provide substantial economic value to investors. Predictors relating to momentum, size, and value stand out as important determinants of future cryptocurrency returns. Nonlinearities also play a significant role in improving cryptocurrency return predictability.

[Online Appendix | Presented at the 2024 Econometric Society Interdisciplinary Frontiers (ESIF) Economics and AI+ML Meeting (slides) | Presented at the 2023 Northern Finance Association Annual Conference | Presented at the INQUIRE UK 2023 Autumn Residential Seminar]

WORK IN PROGRESS

Beyond Stars: A Bayesian Shapley Framework for Understanding Model Comparison with an Application to the Phillips Curve (with Tao Zha)

Forecasting Inflation: The Role of Financial Market Information (with Nikolay Gospodinov, Indrajit Mitra, and Bin Wei)

OTHER ITEMS

Forecasting Asset Returns: The State of the Art

Invited Lectures at CEMA/CUFE on July 12–13, 2016

[Slides for Lecture 1 | Slides for Lecture 2]

Forecasting Asset Returns in Realistic Environments

Invited Presentation at the CFA Montreal Asset Management Forum on October 8, 2015 (other presenters: Andrew Ang, Mark Carhart, Craig Bodenstab, Philip Tetlock)

[Slides | CFA Research Foundation Brief: Portfolio Structuring and the Value of Forecasting]

Page updated

Google Sites

Report abuse