Registration for Cambridge Corporate Finance Theory Symposium is open!
By Thies Lindenthall, part of the Department of Land Economy and former CERF Fellow.
Attempts to predict stock market returns are as old as the stock markets themselves. However, for today’s efficient markets, one would expect predictability to be nearly non-existent. But the reality is more dynamic, driven by an arms race of data availability, empirical innovation and market efficiency.
Enter the brave new world of machine learning (ML), which has ignited another wave of academic research on asset return predictability. ML algorithms are especially adept at handling the complexities of financial data, notably non-linearities and interactions between predictors. The seminal paper by Gu et al. (2020) showcased this strength. They found that ML algorithms, such as random forests and neural networks, have an edge over traditional linear models in predicting next month’s return for U.S. stocks. They achieved a modest, yet positive, out-of-sample R2 of 0.3–0.5%.
For bonds, the predictability is an order of magnitude higher. Bianchi et al. (2021) report out-ofsample R2 values as large as 5% for bonds. Additionally, Leippold et al. (2022) indicated that in less-efficient Chinese stock market, the predictive power of ML models reaches out-of-sample R2 values of up to 3%. Notably, these predictions remain economically significant even after accounting for transaction costs.
Kahshin Leow and I contribute to this growing literature with an analysis of Real Estate Investment Trusts (REITs). REITs are interesting to study since their returns depend, ultimately, on the performance of the properties they own. If these assets produce predictable returns then one might also see predictability at the fund level. Using stock market information (CRSP) and a selection of macroeconomic variables, we conduct a horse race in which various empirical models compete to predict the returns of all REITs traded on U.S. exchanges from 1990 to 2022. We find that the predictability of REIT returns sits between that of general stocks and bonds, as our models achieve out-of-sample R2 ranging between 0.5–3%, depending on the selected time frame (Table 1).
Table 1: Monthly out-of-sample REIT-level prediction performance (percentage R2 oos)
Notes: This figure shows the cumulative returns of the best performing machine learning portfolios. The portfolios are based on a long-only strategy of holding REITs in the top 30% quantile, and the benchmark portfolio is the weighted index of all REITs in the sample period.
This predictability also enhances the optimization of mean-variance portfolios, in line with Markowitz's (1953) demand that “we must have procedures for finding reasonable μi and σij. These procedures [...] should combine statistical techniques and the judgment of practical men”. Our results showed that portfolios where we update stock mean expectations based on ML predictions outperformed naive 1/n portfolios (Table 2).
Are these results large enough to trade on it? We are not sure yet. What we are confident of, however, is that REITs returns are more predictable than previously thought, especially in times of high heterogeneity of REIT returns as, for instance, seen in 2020/21.
Table 2: Performance of machine learning portfolios using mean-variance optimization
Notes: This table reports the out-of-sample performance measures for the best performing machine learning models using mean-variance optimization. The naive strategy involves holding a portfolio weight of 1/N in each of the N REITs. In Panel A, the mean-variance portfolios are constrained to long-only positions to allow for an apples-to-apples comparison to the naive 1/N portfolio. In Panel B, the mean-variance portfolios are permitted to take long-short positions. “Avg” : average realized monthly return(%). “Std”: the standard deviation of realized monthly returns(%). “S.R.”: annualized Sharpe ratio. “T-stat": t-statistic of realized monthly returns. “Skew”: skewness. “Kurt”: kurtosis. “MaxDD”: the portfolio maximum drawdown (%). “Max 1M Loss”: the most extreme negative realized monthly return(%). “Corr": correlation of realized monthly returns against the naive 1/N portfolio returns.
Bibliography:
Allen, D., Lizieri, C., Satchell, S. 2019. In defense of portfolio optimization: What if we can forecast? Financial Analysts Journal 75:3,20-38.
Bianchi, D., Buchner, M., Tamoni, A. 2021. Bond Risk Premiums with Machine Learning. The Review of Financial Studies 34(2):1046-1089.
Gu, S., Kelly, B., Xiu, D. 2020. Empirical asset pricing via machine learning. The Review of Financial Studies 33(5),2223–2273.
Leippold, M., Wang, Q., Zhou, W. 2022. Machine learning in the Chinese stock market. Journal of Financial Economics 145(2),64–82.