My research mainly focuses on the development of Bayesian statistical methods and computational tools motivated by problems in biomedical data analysis. The methods that I have developed find successful applications in characterization of tumor heterogeneity, clinical trial design, and inference with missing data. In addition to my primary research area, I am also interested in and have worked on a wide range of topics such as posterior contraction, spatial statistics, and model combination.
The following figure summarizes some of my works.
During tumor growth, tumor cells acquire and accumulate somatic mutations that lead to genetically different cell subpopulations. This phenomenon is known as intra-tumor heterogeneity. Each cell subpopulation, referred to as a subclone, consists of cells that have the same genetic architecture, such as point mutations and copy number aberrations. I have developed a series of methods for inference on intra-tumor heterogeneity, which can shed light on tumor progression and can further suggest personalized treatment strategy.
(Working) Papers:
Our work was reported by the University of Chicago Beagle December 2017 Newsletter.
Clinical trials play a key role in drug development. Innovative trial designs may improve the efficiency of clinical trials by means of, for example, shorter duration, fewer participants, and increased power of detecting a treatment effect if it exists. I have developed a series of innovative trial designs for various types and phases of clinical trials, some of which are currently considered by pharmaceutical companies for implementations in real-world trials.
(Working) Papers:
In longitudinal clinical studies, the research objective is often to make inference on a subject's full data response; for example, to calculate the treatment effect of a test drug at the end of a study. However, the vector of responses for a research subject is often incomplete due to dropout, and the dropout is typically non-ignorable. To make inference on the full data estimands in the presence of missing data, I developed a flexible semiparametric Bayesian approach based on a joint model for the full data response, missingness and baseline covariates.
Paper:
Paper:
(Working) Paper:
(Working) Paper:
I am interested in a much larger range of topics, including variable selection, causal inference, dynamic models, machine learning, data mining, big data, scalable algorithms, and much more.