Abstract: We develop a new method to globally solve and estimate search and matching models with aggregate shocks and heterogeneous agents. We characterize general equilibrium as a high-dimensional partial differential equation with the distribution as a state variable. We then use deep learning to solve the model and the simulated method of moments to estimate economic parameters. This allows us to study a wide class of search markets in which the distribution affects agent decisions, and to compute equilibrium variables (e.g., wages and prices) that were previously out of reach. In applications to labor search models, we show that distribution feedback plays an important role in amplification and that positive assortative matching weakens in prolonged expansions, disproportionately benefiting low-wage workers.
Presentations: NBER Summer Institute, ASU, Atlanta Fed, Chicago, CKGSB, Copenhagen, EIEF, HEC Lausanne, HKU, Norges Bank, NYU, PKU, PSE, Rice, St. Gallen, Tsinghua, Yale, Zurich; AMLEDS, ASSA, Econometric Society DSE 2023 Conference on "Deep Learning for Solving and Estimating Dynamic Models", EEA-ESEM Invited Session on "Machine Learning and Macroeconomic Analysis", Minnesota Finance Junior Conference, Swiss Winter Conference on Macroeconomics and Finance, T2M, SFI, Zurich Workshop on the Frontier of Quantitative Macro, CEF, CICM, SED, Philly Fed-Chicago Booth Conference on Frontiers in Machine Learning and Economics, Frankfurt Workshop on Numerical Methods in Macroeconomics, DC Search & Matching Workshop.
Abstract: We develop a new method to efficiently solve for optimal lotteries in models with non-convexities. In order to employ a Lagrangian framework, we prove that the value of the saddle point that characterizes the optimal lottery is the same as the value of the dual of the deterministic problem. Our algorithm solves the dual of the deterministic problem via sub-gradient descent. We prove that the optimal lottery can be directly computed from the deterministic optima that occur along the iterations. We analyze the computational complexity of our algorithm and show that its worst-case complexity is orders of magnitude better than that of a linear programming approach. We apply the method to two canonical problems with private information. First, we solve a principal-agent moral hazard problem, demonstrating that our approach delivers substantial improvements in speed and scalability over traditional linear programming methods. Second, we study an optimal taxation problem with hidden types, which was previously considered computationally infeasible, and show when the optimal contract involves lotteries.
Presentations: BSE Summer Forum, ES World Congress, SED, SFI Research Day, Warwick.
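The dual approach described in the abstract above can be sketched on a made-up three-action problem (the action grid, cost schedule, and constraint below are purely illustrative, not the paper's application): sub-gradient ascent on the Lagrangian dual of the deterministic problem, with the optimal lottery recovered as a mixture of the deterministic optima visited near convergence.

```python
# Hypothetical non-convex problem: choose action x with cost f(x) and
# output g(x) = x, subject to expected output E[x] >= b. The cost
# schedule is non-convex, so the optimum may randomize (a lottery).
x_grid = [0.0, 1.0, 2.0]
f = [0.0, 3.0, 4.0]   # non-convex costs
b = 1.5               # required expected output

lam = 0.0
visited = []          # deterministic minimizers seen along the iterations
for t in range(200):
    # Inner deterministic problem: min_x f(x) + lam * (b - x)
    vals = [f[i] + lam * (b - x_grid[i]) for i in range(len(x_grid))]
    i_star = vals.index(min(vals))
    visited.append(i_star)
    # (b - x*) is a subgradient of the concave dual; ascend with a
    # diminishing step and project onto lam >= 0.
    lam = max(0.0, lam + 0.5 / (1 + t) * (b - x_grid[i_star]))

# The optimal lottery mixes the deterministic optima visited near
# convergence; the weight makes the constraint hold with equality.
lo, hi = min(visited[-20:]), max(visited[-20:])
if lo == hi:
    lottery = {x_grid[lo]: 1.0}
else:
    p = (b - x_grid[lo]) / (x_grid[hi] - x_grid[lo])
    lottery = {x_grid[lo]: 1.0 - p, x_grid[hi]: p}

expected_cost = sum(w * f[x_grid.index(x)] for x, w in lottery.items())
```

In this toy instance the only feasible deterministic action costs 4, while the lottery mixing the two deterministic optima achieves an expected cost of 3, illustrating why randomization can strictly improve on deterministic contracts under non-convexity.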
Abstract: We propose an efficient, reliable, and interpretable global solution method, the Deep learning-based algorithm for Heterogeneous Agent Models (DeepHAM), for solving high-dimensional heterogeneous agent models with aggregate shocks. The state distribution is approximately represented by a set of optimal generalized moments. Deep neural networks are used to approximate the value and policy functions, and the objective is optimized over directly simulated paths. Beyond being an accurate global solver, this method has three additional features. First, it is computationally efficient in solving complex heterogeneous agent models, and it does not suffer from the curse of dimensionality. Second, it provides a general and interpretable representation of the distribution over individual states, which is crucial in addressing the classical question of whether and how heterogeneity matters in macroeconomics. Third, it solves the constrained efficiency problem as easily as it solves the competitive equilibrium, which opens up new possibilities for normative studies. As a new application, we study constrained efficiency in heterogeneous agent models with aggregate shocks. We find that in the presence of aggregate risk, a utilitarian planner would raise aggregate capital for redistribution by less than in its absence, because poor households engage in more precautionary saving and thus rely less on labor income.
Presentations: Stanford SITE, Yale, Federal Reserve Bank of Philadelphia, Princeton, UPenn, Zurich, ETH Zurich, Rutgers, HKU, CUHK, PKU, Tsinghua, Guelph, SDU, T2M (King's College London), 2022 CEA (Carleton), 2022 ES {North America, Asia} Meetings, 2022 CICM, CES Rising star, HKUST-Jinan Workshop, EEA-ESEM, CESifo, FRB Conference on "Nontraditional Data, Machine Learning, and Natural Language Processing in Macroeconomics", BSE Summer Institute.
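A minimal sketch of the core idea above, in a toy consumption-saving environment (the income process, return, and functional forms are invented for illustration, not the paper's model): the cross-sectional distribution enters each agent's policy only through a small set of moments, and the policy parameters are optimized directly on simulated paths. A random search stands in for the paper's gradient-based deep learning so the sketch stays dependency-free.

```python
import math
import random

random.seed(0)

# The cross-sectional distribution enters each agent's policy only
# through a small set of moments, here the mean and variance of wealth.
def moments(wealth):
    m1 = sum(wealth) / len(wealth)
    m2 = sum((a - m1) ** 2 for a in wealth) / len(wealth)
    return m1, m2

def saving_rate(a, m1, m2, z, theta):
    # Logistic policy in (own wealth, moments, aggregate shock); a deep
    # neural network would replace this single-layer form in the method.
    s = theta[0] + theta[1] * a + theta[2] * m1 + theta[3] * m2 + theta[4] * z
    s = max(-50.0, min(50.0, s))          # guard against overflow in exp
    return 1.0 / (1.0 + math.exp(-s))

def simulated_welfare(theta, N=50, T=40, beta=0.95):
    """Average discounted log utility along a directly simulated path."""
    wealth = [1.0 + random.random() for _ in range(N)]
    z, total = 0.0, 0.0
    for t in range(T):
        z = 0.9 * z + 0.1 * random.gauss(0.0, 1.0)   # aggregate shock
        m1, m2 = moments(wealth)
        new_wealth = []
        for a in wealth:
            save = saving_rate(a, m1, m2, z, theta)
            c = max((1.0 - save) * a, 1e-6)          # consume the rest
            total += beta ** t * math.log(c)
            income = math.exp(0.1 * z) * (0.5 + random.random())
            new_wealth.append(save * a * 1.02 + income)
        wealth = new_wealth
    return total / N

# Optimize the policy parameters directly on simulated paths.
best_theta, best_val = None, -float("inf")
for _ in range(60):
    theta = [random.uniform(-1.0, 1.0) for _ in range(5)]
    val = simulated_welfare(theta)
    if val > best_val:
        best_theta, best_val = theta, val
```

The design choice mirrored here is that the policy never sees the full distribution, only its moment summary, which is what keeps the state space low-dimensional regardless of the number of agents.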
Abstract: Inflation has heterogeneous impacts on households, which in turn affect optimal monetary policy design. I study optimal monetary policy rules in a quantitative heterogeneous agent New Keynesian (HANK) model where inflation has redistributive effects on households through their different (1) consumption baskets, (2) nominal wealth positions, and (3) earnings elasticities to business cycles. I parameterize the model based on an empirical analysis of these channels using the most recent data. Unlike in representative agent models, a utilitarian central bank should adopt an asymmetric monetary policy rule that is accommodative towards inflation and aggressive towards deflation. Specifically, by accommodating stronger demand and higher inflation, the central bank benefits low-income and low-wealth households through nominal debt devaluation and higher earnings growth.
Presentations: Princeton, Zurich, UNC Chapel Hill, Bank of Canada, Rice, St Gallen, U Houston, Baruch, NUS, HKU, CUHK {Econ, Finance}, PKU {Guanghua, HSBC, INSE, Econ}, Tsinghua, Sveriges Riksbank, Macro Finance Society Workshop, CEBRA, CICF, CES, AEA 2024.
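The asymmetry described above can be illustrated schematically (the coefficients below are hypothetical, not the paper's estimates): the policy rate responds with a larger coefficient to deflation than to inflation.

```python
# Stylized asymmetric policy rule: accommodative toward inflation,
# aggressive toward deflation. All numbers are illustrative.
def asymmetric_rule(pi, phi_infl=1.2, phi_defl=2.5, r_star=2.0):
    """Policy rate as a function of inflation pi (in percent)."""
    phi = phi_defl if pi < 0.0 else phi_infl
    return r_star + phi * pi

rate_inflation = asymmetric_rule(1.0)    # mild response to +1% inflation
rate_deflation = asymmetric_rule(-1.0)   # sharp cut under -1% deflation
```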
Abstract: The current knowledge system of macroeconomics is built on interactions among a small number of variables, since traditional macroeconomic models can handle only a handful of inputs. Recent work using big data suggests that a much larger number of variables are active in driving the dynamics of the aggregate economy. However, most such work is model-free and purely data-driven. To integrate human knowledge with high-dimensional statistical modeling, we build a knowledge graph (KG) that consists of not only linkages between traditional economic variables but also new alternative big data variables. We propose an active learning natural language processing (NLP) algorithm to extract these variables and linkages from the massive textual data of academic literature and research reports. The KG provides a systematic approach to incorporating human knowledge when dealing with a large number of variables in macroeconomic models. For example, in macroeconomic forecasting, we use the KG as prior knowledge to select variables as model inputs. When applied to inflation and investment forecasts, the KG-based method achieves significantly higher accuracy, especially for long-run forecasts, compared to statistical variable selection methods.
Presentations: Banca d'Italia and Federal Reserve Board Conference, Federal Reserve Bank of Philadelphia, Her Majesty's Treasury, Monash-Warwick-Zurich Text-as-Data Workshop, 2021 SoFiE Machine Learning Workshop, 21st IWH-CIREQ-GW Macroeconometric Workshop, RES 2021, 2021 ESCoE, 2021 AMES.
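To make the KG-based variable selection concrete, here is a minimal sketch on a made-up miniature graph (the variable names and edges are invented for illustration; the paper extracts them from text with NLP): candidate predictors for a forecast target are the variables reachable from it within a fixed number of hops.

```python
from collections import deque

# Hypothetical miniature knowledge graph: nodes are economic variables,
# directed edges are linkages extracted from the literature.
kg = {
    "inflation": ["oil_price", "wage_growth", "money_supply"],
    "wage_growth": ["unemployment", "productivity"],
    "oil_price": ["global_demand"],
    "money_supply": [],
    "unemployment": ["job_postings"],
    "productivity": [],
    "global_demand": [],
    "job_postings": [],
}

def kg_select(graph, target, max_hops):
    """Select predictors: variables within max_hops of the target (BFS)."""
    selected, seen = set(), {target}
    frontier = deque([(target, 0)])
    while frontier:
        node, d = frontier.popleft()
        if d > 0:
            selected.add(node)
        if d < max_hops:
            for nbr in graph.get(node, []):
                if nbr not in seen:
                    seen.add(nbr)
                    frontier.append((nbr, d + 1))
    return sorted(selected)

one_hop = kg_select(kg, "inflation", 1)
two_hop = kg_select(kg, "inflation", 2)
```

The selected set then serves as the input list for a downstream forecasting model, in place of purely statistical screens such as lasso-style selection.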
Abstract: The speed at which the US economy has recovered from recessions ranges from months to years. We propose a model incorporating the innovation network, the production network, and cross-sectional shocks, and show that their interactions jointly explain the large variation in recovery speed across US recessions. In the model, besides production linkages, firms learn insights on production from each other through the innovation network. We show that when the innovation network takes a low-rank structure, there exists one key direction: the impact of a shock becomes persistent only if the shock is parallel to this key direction; in contrast, the impact declines quickly along other directions. Empirically, we estimate the model in state-space form and document a set of new stylized facts about the US economy. First, the innovation network among sectors takes a low-rank structure. Second, the innovation network overlaps non-negligibly with the production network. Third, recessions with slow recoveries are those in which sectors at the center of the innovation network experience sizable negative shocks. These network structures and the time-varying sectoral distribution of shocks explain much of the variation in recovery speed across US recessions. Finally, to emphasize the prevalence of this channel, we explore an application of the theory to asset pricing.
Presentations: Princeton Macro Workshop, 2021 AMES.
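The low-rank persistence mechanism in the abstract above can be illustrated with a stylized two-sector example (the matrix is invented, not the estimated model): when the propagation matrix is dominated by a rank-one component, a shock parallel to the key direction decays slowly, while an orthogonal shock of the same size vanishes almost immediately.

```python
# Stylized dynamics x_{t+1} = A x_t with A = 0.75 * v v' + 0.1 * I for
# v = (1, 1)/sqrt(2): a dominant rank-one component along the "key
# direction" v plus a small full-rank remainder.
A = [[0.475, 0.375],
     [0.375, 0.475]]

def mat_vec(M, x):
    return [sum(M[i][j] * x[j] for j in range(len(x))) for i in range(len(M))]

def norm(x):
    return sum(v * v for v in x) ** 0.5

def iterate(M, x, T):
    for _ in range(T):
        x = mat_vec(M, x)
    return x

key = [2 ** -0.5, 2 ** -0.5]        # key direction: decay factor 0.85
orth = [2 ** -0.5, -(2 ** -0.5)]    # orthogonal direction: decay factor 0.1

persistent = norm(iterate(A, key, 10))    # ~ 0.85**10, still sizable
transitory = norm(iterate(A, orth, 10))   # ~ 0.1**10, essentially gone
```

After ten periods the key-direction shock retains roughly 20% of its initial size, while the orthogonal shock is smaller by about nine orders of magnitude, which is the sense in which the shock's direction, not just its size, determines recovery speed.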