Publications (Google scholar)
1. Lee KH, Chakraborty S, and Sun J (2011) Bayesian variable selection in semiparametric proportional hazards model for high dimensional survival data. The International Journal of Biostatistics, Volume 7, Issue 1, 1-32.
2. Lee KH, Haneuse S, Schrag D, and Dominici F.(2014) Bayesian semi-parametric analysis of semi-competing risks data: investigating hospital readmission after a pancreatic cancer diagnosis. Journal of the Royal Statistical Society: Series C, Volume 64, Issue 2, 253-273 (This paper won 2013 David P. Byar Young Investigator Award )
3. Lee KH, Chakraborty S, and Sun J (2015) Survival prediction and variable selection with simultaneous shrinkage and grouping priors. Statistical Analysis and Data Mining, Volume 8, Issue 2, pages 114-127.
4. Tanuma J, Lee KH, Haneuse S, Matsumoto S, Nguyen TD, Nguyen THD, Do DC, Pham TTT, Nguyen VK, and Oka S (2016) Incidence of AIDS-defining opportunistic infections and mortality during antiretroviral therapy in a cohort of adult HIV-infected individuals in Hanoi 2007-2014. PLoS ONE 11(3): e0150781.
5. Lee KH, Dominici F, Schrag D, Haneuse S (2016) Hierarchical models for semi-competing risks data with application to quality of end-of-life care for pancreatic cancer. Journal of the American Statistical Association, Volume 111, Issue 515, 1075-1095.
6. Haneuse S and Lee KH (2016) Semi-competing risks data analysis: accounting for death as a competing risk when the outcome of interest is non-terminal event. Circulation: Cardiovascular Quality and Outcomes, Volume 9, Issue 3, 322-331.
7. Abreu MH, Lee, KH, Luquetti D, Starr JR (2016) Temporal trend in the birth prevalence of cleft lip and/or cleft palate in Brazil, 2000-2013. Birth Defects Research (Part A), Volume 106, Issue 9, 789-792.
8. Lee KH, Tadesse MG, Baccarelli AA, Schwartz J, and Coull BA (2017) Multivariate Bayesian variable selection exploiting dependence structure among outcomes: application to air pollution effects on DNA methylation. Biometrics, Volume 73, Issue 1, 232-241.
9. Abreu MH, Resende VLS, Lee KH, Matta-Machado ATG, Starr JR (2017) Regional differences in infection control conditions in a sample of primary health care services in Brazil, Cadernos de Saúde Pública (Reports in Public Health), 33(11): e00072416.
10. Lee KH, Chakraborty S, and Sun J (2017) Variable selection for high-dimensional genomic data with censored outcomes using group lasso prior. Computational Statistics and Data Analysis, Volume 112, 1-13.
11. Lee KH, Rondeau V, Haneuse S (2017) Accelerated failure time models for semi-competing risks data in the presence of complex censoring. Biometrics, Volume 73, Issue 4, 1401-1412.
12. Atia L, Bi D, Sharma Y, Mitchel JA, Gweon B, Koehler S, DeCamp S, Lan B, Kim JH, Hirsch R, Pegoraro A, Lee KH, Starr JR, Weitz DA, Martin A, Park J-A, Butler JP, Fredberg JJ (2018) Geometrical constraints during epithelial jamming. Nature Physics, Volume 14, Issue 6, 613-620.
13. Starr JR, Huang Y, Lee KH, Murphy CM, Moscicki A, Shiboski CH, Ryder MI, Yao TJ, Faller L, Van Dyke RB, Paster BJ (2018) Oral microbiota in youth with perinatally acquired HIV infection. Microbiome, 6(1):100.
14. Liu S, Bobb J, Lee KH, Gennings C, Claus Henn B, Bellinger D, Austin C, Schnaas L,Tellez-Rojo M, Hu H, Wright RO, Arora M, and Coull BA, (2018) Lagged kernel machine regression foridentifying time windows of susceptibility to exposures of complex metal mixtures. Biostatistics, Volume 19, Issue 3, 325-341.
15. Bassir SH, Kholy KE, Chen C-Y, Lee KH, Intini G, (2019) Outcome of early dental implantplacement versus other dental implant placement protocols: a systematic review and meta-analysis. Journal of Periodontology, Volume 90, Issue 5, 493-506.
16. Koch G, Hamilton A, Wang K, Herschdorfer L, Lee KH, Gallucci G, Friedland B (2019) Dimensional accuracy of cone beam computed tomography with varying angulation of the jaw to the X-ray beam. Dentomaxillofacial Radiology, Volume 48(4).
17. Alvares D, Haneuse S, Lee C, Lee KH (2019) SemiCompRisks: An R package for the analysis of independent and cluster-correlated semi-competing risks data. R Journal, Volume 11(1), 376-400.
18. Green DR, Schulte F, Lee KH, Pugach MK, Hardt M, Bidlack FB (2019) Mapping the tooth enamel proteome and amelogenin phosphorylation onto mineralizing porcine tooth crowns. Frontiers in Physiology, Volume 10, 925.
19. Goldstein JM, Valido A, Lewandowski J, Walker RG, Mills MJ, Messemer KA, Besseling P, Lee KH, Lee RT, Wagers AJ (2019) Variation in zygotic CRISPR/Cas9 gene editing outcomes generates novel reporter and deletion alleles at the Gdf11 locus. Scientific Reports, Volume 9, Issue 1, 18613.
20. Lee KH, Coull BA, Moscicki AB, Paster BJ, Starr JR (2020) Bayesian variable selection for multivariate zero-inflated models: application to microbiome count data. Biostatistics, Volume 21, Issue 3, 499-517.
21. Kim JH, Pegoraro AF, Das A, Koehler S, Ujwary SA, Lan B, Mitchel JA, Atia L, He S, Wang K, Bi D, Zaman M, Park J-A, Butler JP, Lee KH, Starr JR, Fredberg JJ (2020) Unjamming and collective migration in MCF10A breast cancer cell lines. Biochemical and Biophysical Research Communications, Volume 521, Issue 3, 706-715.
23. Li Y, Seo S, Lee KH (2021) Bayesian analysis of grouped survival data with adaptive time partition. Journal of Statistical Computation and Simulation, Volume 91, Issue 14, 2937-2952.
22. Li J, Li Y, Ivey KL, Wang D, Wilkinson JE, Franke A, Lee KH, Chan AT, HuttenhowerC, Hu FB, Rimm EB, Sun Q (2022) Interplay between diet and gut microbiome, and circulating concentrations of Trimethylamine N-oxide: findings from a longitudinal cohort of U.S. men. Gut, Volume 71, Issue 4, 724-733.
24. Haneuse S, Schrag D, Dominici, F, Normand S-L, Lee KH (2022) Measuring performance for end-of-life care: A Bayesian decision-theoretic approach. Annals of Applied Statistics, Volume 16, Issue 3, 1586-1607.
25. Wang F, Tessier A-J, Liang L, Wittenbecher C, Haslam D, Eliassen AH, Rexrode KM, Tobias DK, Li J, Zeleznik O, Stampfer MJ, Grodstein F, Martnez-Gonzlez MA, Salas-Salvad J, Clish C, Lee KH, Sun Q, Hu FB, Guasch-Ferr M. (2023) Plasma metabolomic profiles associated with mortality and longevity in a prospective analysis of 13,512 individuals. Nature Communications, Volume 14, Issue 1, 5744.
26. Yang H, Li J, Zhu L, Wang B, Li Y, Ivey KL, Lee KH, Eliassen H, Qi Q, Chan AT, Huttenhower C, Rimm EB, Hu FB, Sun Q. (2023) The interplay between diet, circulating indolepropionate level, and cardiometabolic health in US populations. Gut, Volume 72, Issue 12, 2260-2271.
27. Reeder H, Lee KH, Haneuse S (2024) Characterizing quantile-varying covariate effects under the accelerated failure time model. Biostatistics, Volume 25, Issue 2, 449-467.
28. Bui LP, Pham TT, Wang F, Chai B, Sun Q, Hu FB, Lee KH, Guasch-Ferre M, Willett WC. (2024) Planetary health diet index and risk of total and cause-specific mortality in three prospective cohorts. The American Journal of Clinical Nutrition, Volume 120, Issue 1, 80-91.
29. Reeder H, Haneuse S, Lee KH. (2024) Group lasso priors for Bayesian accelerated failure time models with left-truncated and interval-censored data. Statistical Methods in Medical Research, Volume 33, Issue 8,1412-1423.
30. Majumder S, Coull BA, Mark Welch J, La Riviere PJ, Dewhirst FE, *Starr JR, *Lee KH. (2024) Multivariate cluster point process to quantify and explore multi-entity configurations: Application to biofilm image data. Statistics in Medicine, Volume 43, Issue 28, 5446-5460. (*co-senior authors )
31. Wang F, Glenn AJ, Tessier A-J, Mei Z, Haslam DE, Guasch-Ferré M, Tobias DK, Eliassen AH, Manson JE, Clish C, Lee KH, Rimm EB, Wang DD, Sun Q, Liang L, Willett WC, Hu FB. (2024) Integration of epidemiological and blood biomarker analysis links heme iron intake to increased type 2 diabetes risk. Nature Metabolism, Volume 6, Issue 1, 1807-1818.
32. Reeder H, Lee KH, Papatheodorou SI, Haneuse S. (2024) An augmented illness-death model for semi-competing risks with clinically immediate terminal events. Statistics in Medicine, Volume 43, Issue 21, 4194-4211.
33. Nair NK, Bui LP, Sawicki CM, Anand S, Kandula NR, Kanaya AM, Lee KH, Stampfer MJ, Willett WC, Bhupathiraju SN. (2025) Adherence to the EAT-Lancet planetary health diet and cardiometabolic risk markers in the Mediators of Atherosclerosis in South Asians Living in America (MASALA) Study. Current Developments in Nutrition, Volume 9, Issue 6, Article 107468.
34. Clark-Boucher D, Coull BA, Reeder H, Wang F, Sun Q, *Starr JR, *Lee KH. (2025) Group-wise normalization in differential abundance analysis of microbiome samples. BMC Bioinformatics, Volume 26, Article 196. (*co-senior authors)
35. Nair NK, Bui LP, Sawicki CM, Anand S, Kandula N, Kanaya A, Lee KH, Stampfer M, Willett W, Bhupathiraju SN. (2025) Development and evaluation of a planetary health diet index: The Mediators of Atherosclerosis in South Asians Living in America (MASALA) Study. Journal of the Academy of Nutrition and Dietetics, in press.
Bae Y, Kim C, Wang F, Sun Q, Lee KH. Bayesian variable selection for high-dimensional mediation analysis: application to metabolomics data from epidemiological studies. arXiv
Lee KH, Coull BA, Majumder S, La Riviere P, Mark Welch JL, Starr JR. A Bayesian multivariate spatial point pattern model: application to oral microbiome FISH image data. arXiv
Wang S, Wikle C, Micheas AC, Mark Welch JL, Starr JR, Lee KH. Inference for log-Gaussian Cox point processes using Bayesian deep learning: application to human oral microbiome image data. arXiv
Clark-Boucher D, Coull BA, Reeder H, Wang F, Sun Q, *Starr JR, *Lee KH. A nutritionally informed model for Bayesian variable selection with metabolite response variables. arXiv (*co-senior authors)
*Bater J, *Majumder S, Huang Y, Moscicki A, Pater B, Yao TJ, **Lee KH, **Starr JR. Oral pathogen Filifactor alocis’s relative abundance is lower among HIV-positive youth with less antiretroviral therapy exposure. [manuscript available upon request]
(*co-first authors, **co-senior authors)