Welcome to my research page! My work focuses on econometrics and applied economics, with an emphasis on developing robust statistical tools for real-world data challenges. I am particularly interested in methods that remain reliable in the tails of distributions, where traditional approximations often break down.
Here you’ll find my publications, working papers, ongoing projects, and collaborations. I am always excited to exchange ideas — please feel free to reach out if you’d like to discuss research or explore opportunities to work together.
Published and Forthcoming Papers
Non-Existent Moments of Earnings Growth. (Joint with Silvia Sarpietro and Yuya Sasaki.) Journal of Applied Econometrics, forthcoming. Supplementary Material. arXiv.
Extreme Quantile Treatment Effects under Endogeneity. (Joint with Yuya Sasaki.) Journal of Business & Economic Statistics, forthcoming. Supplementary Material. arXiv.
On Uniform Confidence Intervals for the Tail Index and the Extreme Quantile. (Joint with Yuya Sasaki.) Journal of Econometrics 244 (2024), 105865. arXiv.
Getting the Right Tail Right: Modeling Tails of Health Expenditure Distributions. (Joint with Martin Karlsson and Nicolas R. Ziebarth.) Journal of Health Economics 97 (2024), 102912.
Testing Limited Overlap. (Joint with Xinwei Ma and Yuya Sasaki.) Econometric Theory (2024), 1-34. Supplementary Material.
Extreme Changes in Changes. (Joint with Yuya Sasaki.) Journal of Business & Economic Statistics 42 (2023), 812–824. arXiv.
Tuning Parameter-Free Nonparametric Density Estimation from Tabulated Summary Data. (Joint with Ji Hyung Lee, Yuya Sasaki, and Alexis A. Toda.) Journal of Econometrics 238 (2024), 105568. arXiv.
Testing for Homogeneous Thresholds in Threshold Regression Models. (Joint with Yoonseok Lee.) Econometric Theory 40 (2024), 608–651.
Threshold Regression with Nonparametric Sample Splitting. (Joint with Yoonseok Lee.) Journal of Econometrics 235 (2023), 816-842. arXiv.
Diagnostic Testing of Finite Moment Conditions for the Consistency and Root-N Asymptotic Normality of the GMM and M Estimators. (Joint with Yuya Sasaki.) Journal of Business & Economic Statistics 41 (2023), 339-348. Supplementary Material. arXiv.
Nonparametric Tests of Tail Behavior in Stochastic Frontier Models. (Joint with William C. Horrace.) Journal of Applied Econometrics 37 (2022), 537-562. arXiv.
Estimation and Inference about Tail Features with Tail Censored Data. (Joint with Zhijie Xiao.) Journal of Econometrics 230 (2022), 363-387. Supplementary Material. arXiv.
Fixed-k Inference for Conditional Extremal Quantiles. (Joint with Yuya Sasaki.) Journal of Business & Economic Statistics 40 (2022), 829-837. Supplementary Material. arXiv.
Efficient Minimum Distance Estimation of Pareto Exponent from Top Income Shares. (Joint with Alexis A. Toda.) Journal of Applied Econometrics 36 (2021), 228-243. arXiv.
Nearly Weighted Risk Minimal Unbiased Estimation. (Joint with Ulrich K. Müller.) Journal of Econometrics 209 (2019), 18-34. Supplementary Material.
Fixed-k Asymptotic Inference about Tail Properties. (Joint with Ulrich K. Müller.) Journal of the American Statistical Association 112 (2017), 1134-1143. Supplementary Material.
Working Papers
Robust Econometrics for Growth at Risk. (Joint with Tobias Adrian and Yuya Sasaki.) arXiv.
The Growth-at-Risk (GaR) framework has garnered attention in recent econometric literature, yet current approaches implicitly assume a constant Pareto exponent. We introduce novel and robust econometrics to estimate the tails of GaR based on a rigorous theoretical framework and establish validity and effectiveness. Simulations demonstrate consistent outperformance relative to existing alternatives in terms of predictive accuracy. We perform a long-term GaR analysis that provides accurate and insightful predictions, effectively capturing financial anomalies better than current methods.
Genuinely Robust Inference for Clustered Data. (Joint with Harold D. Chiang and Yuya Sasaki.) arXiv.
Conventional cluster-robust inference can be invalid when data contain clusters of unignorably large size. We formalize this issue by deriving a necessary and sufficient condition for its validity, and show that this condition is frequently violated in practice: specifications from 77% of empirical research articles in American Economic Review and Econometrica during 2020–2021 appear not to meet it. To address this limitation, we propose a genuinely robust inference procedure based on a new cluster score bootstrap. We establish its validity and size control across broad classes of data-generating processes where conventional methods break down. Simulation studies corroborate our theoretical findings, and empirical applications illustrate that employing the proposed method can substantially alter conventional statistical conclusions.
Binary Outcome Models with Extreme Covariates: Estimation and Prediction. (Joint with Laura Liu.) Supplementary Material. arXiv.
This paper presents a novel semiparametric method to study the effects of extreme events on binary outcomes and subsequently forecast future outcomes. Our approach, based on Bayes’ theorem and regularly varying (RV) functions, facilitates a Pareto approximation in the tail without imposing parametric assumptions beyond the tail. We analyze cross-sectional as well as static and dynamic panel data models, incorporate additional covariates, and accommodate the unobserved unit-specific tail thickness and RV functions in panel data. We establish consistency and asymptotic normality of our tail estimator, and show that our objective function converges to that of a panel Logit regression on tail observations with the log extreme covariate as a regressor, thereby simplifying implementation. The empirical application assesses whether small banks become riskier when local housing prices sharply decline, a crucial channel in the 2007–2008 financial crisis.
Estimating Export-productivity Cutoff Contours with Profit Data: A Novel Threshold Estimation Approach. (Joint with Peter H. Egger.) arXiv.
This paper develops a novel method to estimate firm-specific market-entry thresholds in international economics, allowing fixed costs to vary across firms alongside productivity. Our framework models market entry as an interaction between productivity and observable fixed-cost measures, extending traditional single-threshold models to ones with set-valued thresholds. Applying this approach to Chinese firm data, we estimate export-market entry thresholds as functions of domestic sales and surrogate variables for fixed costs. The results reveal substantial heterogeneity and threshold contours, challenging conventional single-threshold-point assumptions. These findings offer new insights into firm behavior and provide a foundation for further theoretical and empirical advancements in trade research.
High-Dimensional Tail Index Regression: with An Application to Text Analyses of Viral Posts in Social Media. (Joint with Yuya Sasaki and Jing Tao.) arXiv.
Motivated by the empirical observation of power-law distributions in the credits (e.g., “likes”) of viral social media posts, we introduce a high-dimensional tail index regression model and propose methods for estimation and inference of its parameters. First, we present a regularized estimator, establish its consistency, and derive its convergence rate. Second, we introduce a debiasing technique for the regularized estimator to facilitate inference and prove its asymptotic normality. Third, we extend our approach to handle large-scale online streaming data using stochastic gradient descent. Simulation studies corroborate our theoretical findings. We apply these methods to the text analysis of viral posts on X (formerly Twitter) related to LGBTQ+ topics.
Inference in Auctions with Many Bidders Using Transaction Prices. (Joint with Federico A. Bugni.) Supplementary Material. arXiv. Slides.
This paper studies inference in first- and second-price sealed-bid auctions with many bidders, using an asymptotic framework where the number of bidders increases while the number of auctions remains fixed. Relevant applications include online, treasury, spectrum, and art auctions. Our approach enables asymptotically exact inference on key features such as the winner’s expected utility, seller’s expected revenue, and the tail of the valuation distribution using only transaction price data. Our simulations demonstrate the accuracy of the methods in finite samples. We apply our methods to Hong Kong vehicle license auctions, focusing on high-priced, single-letter plates.
Fixed-k Tail Regression: New Evidence on Tax and Wealth Inequality from Forbes 400. (Joint with Ji Hyung Lee, Yuya Sasaki, and Alexis A. Toda.) arXiv.
We develop a new tail regression method to estimate the tail index (reciprocal of the Pareto exponent) of a size distribution as a function of macroeconomic state variables. Our method is motivated by the unique feature of the Forbes 400 data, which is a repeated cross-section of wealth truncated from below at the 400th largest order statistic. Applying this method, we find that higher capital income tax rates are associated with higher wealth Pareto exponents (lower top tail inequality). We present a simple economic model that explains these findings and discuss the welfare implication of capital taxation.
Capital and Labor Income Pareto Exponents in the United States, 1916-2019. (Joint with Ji Hyung Lee, Yuya Sasaki, and Alexis A. Toda.) arXiv.
Accurately estimating income Pareto exponents is challenging due to limitations in data availability and the applicability of statistical methods. Using tabulated summaries of incomes from tax authorities and a recent estimation method, we estimate income Pareto exponents in U.S. for 1916-2019. We find that during the past three decades, the capital and labor income Pareto exponents have been stable at around 1.2 and 2. Our findings suggest that the top tail income and wealth inequality is higher and wealthy agents have twice as large an impact on the aggregate economy than previously thought but there is no clear trend post-1985.