Research
Books published
Kim, J.K. and Shao, J. (2021). Statistical Methods for Handling Incomplete Data (2nd Edn) Chapman & Hall / CRC.
Kim, J.K. (2017). Survey Sampling (표본 조사론), 2nd Edition, 자유 아카데미사 (Written in Korean)
Published Papers
Peiris, H., Jeong, H., Kim, J.K., and Lee, H. (2024). Integration of traditional and telematics data for efficient insurance claims prediction, ASTIN bulletin, accepted.
M. Uehara, D. Lee, and J.K. Kim (2023), "Semiparametric response model with nonignorable nonresponse", Scandinavian Journal of Statistics, https://doi.org/10.1111/sjos.12652.
Y. Yang, Y. Kwon, J.K. Kim, and I. Cho. (2023) "Ultra Data-Oriented Parallel Fractional Hot-Deck Imputation with Efficient Linearized Variance Estimation" IEEE transactions on Knowledge and Data Engineering, Accepted.
Gao, C., Yang, S., and Kim, J.K. (2023). ``Soft calibration for correcting selection bias under mixed-effects models,'' Biometrika, Accepted.
Wang, H., Kim, J.K. Statistical inference using regularized M-estimation in the reproducing kernel Hilbert space for handling missing data. Ann Inst Stat Math 75, 911–929 (2023). https://doi.org/10.1007/s10463-023-00872-8
Kim, J. K., & Morikawa, K. (2023). An Empirical Likelihood Approach to Reduce Selection Bias in Voluntary Samples. Calcutta Statistical Association Bulletin, 75(1), 8-27. https://doi.org/10.1177/00080683231186488
J.K. Kim, Z. Wang, and J.N.K. Rao. (2023). "Hypotheses Testing from Complex Survey Data Using Bootstrap Weights: A Unified Approach," Journal of the American Statistical Association, Accepted for publication.
Kim, J.K., and Wang, H. (2023). Comments on “Statistical disclosure control and developments in formal privacy: In memoriam to Chris Skinner”: A note on weight smoothing in survey sampling. Survey Methodology, Statistics Canada, Catalogue No. 12-001-X, Vol. 49, No. 1. Paper available at http://www.statcan.gc.ca/pub/12-001-x/2023001/article/00005-eng.htm.
Wang, Z., Kim, H.J. and Kim, J.K. (2023). "Survey data integration for regression analysis using model calibration", Survey Methodology, Accepted for publication.
Gao, C., Thompson, K.J., Yang, S., and Kim, J.K. (2023). "Nearest neighbor ratio imputation with incomplete multinomial outcome in survey sampling", Journal of the Royal Statistical Society: Series A, Accepted for publication.
Wang, Z., and Kim, J.K. (2022). Comments on “Statistical inference with non-probability survey samples”. Survey Methodology, Statistics Canada, Catalogue No. 12-001-X, Vol. 48, No. 2.
Wang, H. and Kim,J.K. (2022). "Maximum sampled conditional likelihood for informative subsampling," Journal of Machine Learning Research, 23, 1-50.
Wang, Z., Peng, L., and Kim, J.K. (2022). "Bootstrap inference for the finite population mean under complex sampling designs," Journal of the Royal Statistical Society: Series B, 84, 1150-1174.
Kim, J.K. (2022). A gentle introduction to data integration in survey sampling, The survey statistician, 85, 19—29.
Kim, S., Ah. K., and Kim, J.K. (2022). "A calibrated Bayesian method for the stratified proportional hazards model with missing covariates", Lifetime Data Analysis, Accepted for publication.
Kim, J.K., Rao, J.N.K., and Kwon, Y. (2022). "Analysis of clustered survey data based on two-stage informative sampling and associated two-level models", Journal of the Royal Statistical Society: Series A, accepted for publication.
Lee, D., Zhang, L-C., and Kim, J.K. (2022). "Maximum entropy classification for record linkage," Survey Methodology, 48, 1-23.
H. Sang , J.K. Kim, and D. Lee (2022). "Semiparametric fractional imputation using Gaussian mixture models for handling multivariate missing data", Journal of the American Statistical Association, 117, 654--663.
Lee, D and Kim, J.K. (2022). "Semiparametric imputation using Conditional Gaussian mixture models under item nonresponse", Biometrics, 78, 227-232.
J.K. Kim, S. Park, Y. Chen and C. Wu (2021). "Combining Non-probability and Probability Survey Samples Through Mass Imputation," Journal of the Royal Statistical Society: Series A, 184, 941-963.
K. Morikawa and J.K. Kim (2021). "Semiparametric Optimal Estimation With Nonignorable Nonresponse Data", Annals of Statistics, 49, 2991-3014 (Available here).
S. Yang, J.K. Kim, and Y. Hwang (2021). "Integration of survey data and big observational data for finite population inference using mass imputation", Survey Methodology, 47, 29--58.
H. Sang and J.K. Kim (2021). "An approximate Bayesian inference using propensity score estimation under unit nonresponse", Canadian Journal of Statistics, 49, 793-807.
J.K. Kim and S. Tam (2021). "Data integration by combining big data and survey sample data for finite population inference", International Statistical Review, 89, 382-401.
Berg, E. and Kim, J.K. (2021). "An approximate best prediction approach to small area estimation for sheet and rill erosion under informative sampling," Annals of Applied Statistics, 15, 102--125.
Y. Yang, I.H. Cho, and J.K. Kim (2020). "Parallel Fractional Hot Deck Imputation and Variance Estimation for Big Incomplete Data Curing", IEEE transactions on Knowledge and Data Engineering,34, 3912-3926.
S. Yang and J.K. Kim (2020). "Statistical Data Integration in Survey Sampling: A review ", Japanese Journal of Statistics and Data Science, 3, 625--650.
S. Sugasawa and J.K. Kim (2020). "An approximate Bayesian approach to regression estimation with many auxiliary variables", Statistica Sinica, Accepted.
S. Chen, S. Yang, and J.K. Kim (2020). "Nonparametric Mass Imputation for Data Integration", Journal of Survey Statistics and Methodology, Accepted.
Eunyong Ahn, Cyprian Ouma, Mesfin Loha, Asrat Dibaba, Wendy Dyment, Jaekwang Kim, Nam Seon Beck and Taesung Park (2020). Do we need to reconsider the CMAM admission and discharge criteria?; an analysis of CMAM data in South Sudan, BMC Public Health, 20: 511.
G. Goh and J.K. Kim (2020). "Accounting for model uncertainty in multiple imputation under informative sampling," Scandinavian Journal of Statistics, 48, 930--949.
S. Yang, J.K. Kim, and R. Song (2020). "Doubly Robust Inference when Combining Probability and Non-probability Samples with High-dimensional Data", Journal of the Royal Statistical Society: Series B, 82, 445-465.
S. Yang and J.K. Kim (2020). "Asymptotic theory and inference of predictive mean matching imputation using a superpopulation model framework", Scandinavian Journal of Statistics, 47, 839-861.
J.K. Kim, S. Park, and K. Kim (2019). ``A note on propensity score weighting method using paradata in survey sampling,'' Survey Methodology, 45, 451-463
S. Park and J.K. Kim (2019). "Mass imputation for two-phase sampling", Journal of the Korean Statistical Society, 48, 578-592.
D. Lee, J.K. Kim, and C. Skinner (2019). "Within-cluster resampling for multilevel models under informative cluster size", Biometrika 106, 965-972.
J.K. Kim and Z. Wang (2019). "Sampling techniques for big data analysis in finite population inference", International Statistical Review, 87, S177-S191.
S. Yang and J.K. Kim (2019). “Nearest neighbor imputation for general parameter estimation in survey sampling,†Advances in Econometrics (Volume 39) — The Econometrics of Complex Survey Data: Theory and Applications, 209-234.
Morikawa, K. and Kim, J. K. (2018). "A discussion of ‘statistical inference for nonignorable missing data problems: a selective review’ by Niansheng Tang and Yuanyuan Ju". Statistical Theory and Related Fields, 2, 140.
S. Tam and J.K. Kim (2018). "Big data, selection bias, and Ethics - An official statistician's perspective", Statistical Journal of the IAOS, 34, 577-588.
E. Gwak, J.K. Kim, and Y. Kim (2018). "A random effects model approach to survey data integration", Statistics and Applications, 16, p 227-243.
J. Im, I.H. Cho, and J.K. Kim (2018). "FHDI: An R Package for Fractional Hot Deck Imputation", The R journal, 10, 140-154.
K. Morikawa and J.K. Kim (2018). "A note on the equivalence of two semiparametric estimation methods for nonignorable nonresponse ", Statistics and Probability Letters, 140, 1-6.
J.K. Kim, Z. Wang, Z. Zhu, and N. Cruze (2018). ``Combining survey and non-survey big data for improved sub-area prediction using a multi-level model'', Journal of Agricultural, Biological, and Environmental Statistics, 23, 175-189.
Y. Hwang, S. Lu, and J.K. Kim (2018). ``Bottom-up estimation and top-down prediction in multilevel models: Solar Energy Prediction combining information from multiple sources'', Annals of Applied Statistics, 12, 2096-2120.
W. Yu, J.K. Kim, and T. Park (2018). ``Estimation of Area Under the Curve (AUC) under nonignorable verification bias'', Statistica Sinica, 28, 2149-2166 .
Z. Wang, J.K. Kim, and S. Yang. (2018). ``An approximate Bayesian inference under informative sampling,'' Biometrika, 105, 91-102.
Y. Kwon, J.K. Kim, M.C. Paik, and H. Kim (2018). A robust calibration-assisted method for linear mixed effects model under cluster-specific nonignorable missingness. Statistica Sinica, 28, 1907-1928.
S. Park and J.K. Kim (2018). "Analysis of inaccurate data using mixture measurement error models", Journal of the Korean Statistical Society, 47, 1-12.
S. Yang and J.K. Kim (2017). Discussion of "Dissecting multiple imputation from a multi-phase inference perspective: what happens when god's, imputer's and analyst's models are uncongenial?" by Xie and Meng, Statistica Sinica, 27, 1485-1594.
S. Park, J.K. Kim, and D. Stukel (2017). "A measurement error model for survey data integration: combining information from two surveys", Metron, 75, 345-357.
K. Morikawa, J.K. Kim, and Y. Kano. (2017). ``Semiparametric maximum likelihood estimation under nonignorable nonresponse,'' Canadian Journal of Statistics, 45, 393-409.
J.K. Kim, S. Park, and Y. Lee (2017). ``Statistical inference using generalized linear mixed models under informative cluster sampling,'' Canadian Journal of Statistics, 45, 479-497.
S. Chen and J.K. Kim (2017). Semiparametric fractional imputation using empirical likelihood in survey sampling, Statistical Theory and Related Fields, 1, 69-81.
J. Im, E. Ahn, N. Beck, J.K. Kim, and T. Park (2017). Correlation estimation with singly truncated bivariate data. Statistics in Medicine, 36, 1977–1988
Y. Xu, J.K. Kim, and Y. Li. (2017). ``Semiparametric estimation for measurement error models with validation data'', Canadian Journal of Statistics 45, 185–201.
J.K. Kim and S. Yang. (2017). ``A note on multiple imputation under informative sampling'', Biometrika, 104, 221-228.
Yang, S. and J.K. Kim (2017). ``A semiparametric inference to regression analysis with missing covariates in survey data'', Statistica Sinica, 27, 261--285.
E. Berg, J.K. Kim, and C. J. Skinner. (2016). ``Imputation under informative sampling'', Journal of the Survey Statistics and Methodology, 4, 436--462.
D. Da Silva, C. Skinner, and J.K. Kim. (2016). ``Using Binary Paradata to Correct for Measurement Error in Survey Data Analysis.'' Journal of the American Statistical Association, 111, 526--537.
S. Yang and J.K. Kim. (2016). ``Fractional imputation in survey sampling: A comparative review'', Statistical Science, 31, 415--432.
S. Park, J.K. Kim, and S. Park. (2016). "An imputation approach for handling mixed mode surveys'', Annals of Applied Statistics, 10, 1063-1085
M.A. Hidiroglou, J.K. Kim, and C.O. Nambeu. (2016). "Regression using estimated totals''. Survey Methodology, 42, 121-135.
J.K. Kim, Y. Kwon, and M.H.C. Paik. (2016). "Calibrated propensity score method for survey nonresponse in cluster sampling", Biometrika, 103, 461-473.
J.K. Kim, E, Berg, and T. Park. (2016). "Statistical matching using fractional imputation''. Survey Methodology, 40, 19--40.
S. Yang and J.K. Kim (2016). "A Note on Multiple Imputation for General-Purpose Estimation'', Biometrika, 103, 244 -- 251.
M. Riddles, J.K. Kim, and J. Im (2016) "Propensity score adjustment for nonignorable nonresponse.'' Journal of Survey Statistics and Methodology, 4, 215-245.
S. Yang and J.K. Kim (2016). "Likelihood-based inference with missing data under missing-at-random'', Scandinavian Journal of Statistics, 43, 436--454.
K.L. Peyer, G. Welk, L. Bailey-Davis, S. Yang, J.K. Kim (2015). ``Factors associated with parent concern for child weight and parenting behaviors'', Childhood Obesity, 11, 269-274.
Kim, J.K., Park, S. and Kim, S. (2015). ``Small area estimation combining information from several sources'', Survey Methodology, 41, 21-36.
Kim, J.K. and Yang, S. (2014). ``Fractional hot deck imputation for robust inference under item nonresponse in survey sampling'', Survey Methodology 40, 211-230.
Chen, S. and Kim, J.K. (2014). ``Two-phase sampling experiment for propensity score estimation in self-selected samples'', Annals of Applied Statistics 8, 1492-1515.
Kim, J.K. and Im, J. (2014). ``Propensity score weighting adjustment with several follow-ups'', Biometrika 101, 439-448.
Wang, S., Shao, J. and Kim, J.K. (2014). ``An instrument variable approach for identification and estimation with Nonignorable Nonresponse,'' Statistica Sinica 24, 1097-1116
Chen, S. and Kim, J.K. (2014). ``Semi-parametric inference with a functional-form empirical likelihood,'' Journal of the Korean Statistical Society 43, 201-214.
S. Park and J.K. Kim. (2014). ``Instrumental-variable calibration estimation in survey sampling'', Statistica Sinica 24, 1001-1015.
Kim, J.K. and Haziza, D. (2014). ``Doubly robust inference with missing data in survey sampling,'' Statistica Sinica 24, 375--394.
Chen, S. and Kim, J.K. (2014). ``Population empirical likelihood for nonparametric inference in survey sampling,'' Statistica Sinica 24, 335--355.
S. Yang, J.K. Kim, and D.W. Shin. (2013). ``Imputation methods for quantile estimation under missing at random'', Statistics and Its Interface 6, 369--377.
S. Yang, J.K. Kim, and Z. Zhu. (2013). ``Parametric fractional imputation for mixed models with nonignorable missing data'', Statistics and Its Interface 6, 339--347.
Kim, J.K. and Skinner, C.J. (2013). ``Weighting in survey analysis under informative sampling,'' Biometrika 100, 385-398.
Kim, J.K. and Wu, C. (2013). ``Sparse and efficient replication variance estimation for complex surveys,'' Survey Methodology 39, 91-120.
Kim, J.K. and Riddles, M. (2012). ``Some theory for propensity scoring adjustment estimator,'' Survey Methodology 38, 157-165.
Kim, J.K. and Hong, M. (2012). ``An imputation approach to statistical inference with coarse data,'' Canadian Journal of Statistics 40, 604-618.
Zhou, M. and Kim, J.K. (2012). ``An efficient method of estimation for longitudinal surveys with monotone missing data,'' Biometrika 99, 631-648.
Kim, J.K. and Shin, D.W. (2012). ``The factoring likelihood method for non-monotone missing data,'' Journal of the Korean Statistical Society 41, 375--386.
Kim, J.Y. and Kim, J.K. (2012). ``Fractional imputation for nonignorable missing data,'' Journal of Korean Statistical Society 41, 291--303.
Kim, J.K. and Rao, J.N.K. (2012). ``Combining data from two independent surveys: a model-assisted approach,'' Biometrika 99, 85--100.
Kim, J.K., Fuller, W.A., and Bell, W.R. (2011). ``Variance Estimation for Nearest Neighbor Imputation for U.S. Census Long Form Data,'' Annals of Applied Statistics 5, 824--842.
Kim, J.K. and Yu, C.L. (2011). ``Replication variance estimation under two-phase sampling,'' Survey Methodology 37, 67--74.
Kim, J.K. and Yu, C.Y. (2011). ``A semi-parametric estimation of mean functionals with non-ignorable missing data,'' Journal of the American Statistical Association 106, 157--165.
Kim, J.K. (2011). ``Parametric fractional imputation for missing data analysis,'' Biometrika 98, 119--132.
Kim, J.K. (2010). ``Calibration estimation using exponential tilting in sample surveys,'' Survey Methodology 36, 145--155.
Kim, J.K. and Park, M. (2010). ``Calibration estimation in survey sampling,'' International Statistical Review 78, 21--39.
Kim, J.K. and Rao, J.N.K. (2009). ``Unified approach to linearization variance estimation from survey data after imputation for item nonresponse,'' Biometrika 96, 917-932.
Kim, J.K. (2009). ``Calibration estimation using empirical likelihood in survey sampling,'' Statistica Sinica 19, 145-158.
Kim, J.K. (2007). ``Regression fractional hot deck imputation,'' Journal of the Korean Statistical Society 36, 423-434.
Kim, J.K. and Kim, J.J. (2007). ``Nonresponse weighting adjustment using estimated response probability,'' Canadian Journal of Statistics 35, 501-514.
Kim, J.K., Navarro, A., and Fuller, W.A. (2006). ``Replicate variance estimation after multi-phase stratified sampling,'' Journal of the American Statistical Association 101, 312-320.
Kim, J.K. and Park, H.A. (2006). ``Imputation using response probability,'' Canadian Journal of Statistics 34, 171-182.
Kim, J.K., Brick, M.J., Fuller, W.A., and Kalton, G. (2006). ``On the bias of the multiple imputation variance estimator in survey sampling,'' Journal of the Royal Statistical Society: Series B 68, 509-521.
Fuller, W.A. and Kim, J.K. (2005). ``Hot deck imputation for the response model,'' Survey Methodology 31, 139-149.
Kim, J.K. (2004). ``Extension of factoring likelihood approach to non-monotone missing data'', Journal of the Korean Statistical Society 33, 401-410.
Kim, J.K. (2004). ``Finite sample properties of multiple imputation estimators,'' The Annals of Statistics 32, 766-783.
Brick, J.M., Kalton, G., and Kim, J.K. (2004). ``Variance estimation with hot deck imputation using a model,'' Survey Methodology 30, 57-66.
Kim, J.K. and Fuller, W.A. (2004). ``Fractional hot deck imputation,'' Biometrika 91, 559-578.
Kim, J.K. and Sitter, R.R. (2003). ``Efficient variance estimation for two-phase sampling,'' Statistica Sinica 13, 641-653.
Kim, J.K. and Kim, Y. (2003). ``Inference after stochastic regression imputation under response model,'' Journal of the Korean Statistical Society 32, 103-119.
Kim, J.K. (2002). ``A note on approximate Bayesian bootstrap imputation,'' Biometrika 89, 470-477.
Kim, J.K. (2001). ``Variance estimation after imputation,'' Survey Methodology
Submitted papers
H. Wang, J.K. Kim, J. Han, and Y. Lee. "Robust propensity score weighting estimation under missing at random", submitted.
Z. Wang, X., Mao, X., Wang, H. and Kim, J.K. "Functional Calibration under Non-Probability Survey Sampling," submitted.
Morikawa, K. and Kim, J.K. "Semiparametric adaptive estimation under informative sampling," submitted (available here).
H. Wang and J.K. Kim. "Information projection approach to propensity score function estimation under missing at random," submitted.
Cho, S., Qiu, Y., and Kim, J.K. "Multiple Bias Calibration for Valid Statistical Inference under Nonignorable Nonresponse," submitted.
Kwon, Y., Kim, J.K., and Qiu, Y. "Debiased calibration estimation using generalized entropy in survey sampling," submitted (available here)
Conference (Refereed)
Masatoshi Uehara, Takeru Matsuda, Jae Kwang Kim Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:831-841, 2020.
Wang, H. and Kim, J.K. (2020). "Variance estimation after Kernel Ridge regression imputation under item nonresponse", ICML workshop.
Kwon, Y. and Kim, J.K. (2024). "Ensemble Fractional Imputation for Incomplete Categorical Data with a Graphical Model", ICML workshop.