Research

Please see the linked pages or my CV for lists of publications and software packages. If you're interested in collaboration, please send me an email at amp9@williams.edu -- I enjoy working on a variety of interesting problems! 

Microbiome profiles from Halfvarson et al (2017), showing differences in the microbiome of individuals with Crohn's disease vs. healthy controls. DOI: https://doi.org/10.1038/nmicrobiol.2017.4

Statistics for the Human Microbiome

I'm broadly interested in statistics related to the human microbiome (the community of bacteria that live in and on us). 

The microbiome is related to just about every aspect of human health, from cognitive function and emotion regulation to metabolism to how well your immune system fights off diseases. 

Microbiome Volatility

Much of my recent work has focused on how rapidly the microbiome changes over time. There's increasing evidence that a healthy microbiome tends to be fairly stable, whereas in a variety of diseases, the microbiome may be less stable. It turns out that "stability" is not a straightforward thing to quantify and compare between high-dimensional bacterial communities, so I'm working on ways to quantify it and to formally test whether volatility differs based on disease state. 

Data: Moving Pictures study, Caporaso et al (2011), Genome Biology. 

Kernel Regression Methods

A second major area of research is kernel machine regression models, which test whether an entire microbial community is associated with the outcome. For example, we could answer a question like: are the throat microbiomes of smokers more similar to other smokers than to non-smokers? On the graph above, points that are closer together represent more similar microbiomes -- so it seems that yes, there is some association between smoking status and throat microbiome. Kernel machine regression models would help us formally test this association. I've worked specifically on time-to-event outcomes (MiRKAT-S) and incorporating more omnibus tests into the MiRKAT R package. 

Data: Charlson et al (2010) throat microbiome dataset. 

A few other microbiome-related topics I'm interested in are longitudinal analysis, multi-omic analysis (how can we incorporate both the microbes that are present and, e.g., markers of what they're doing?), and high-dimensional regression models with constraints to accommodate compositional data. I also collaborate on applied projects related to irritable bowel syndrome, bacterial vaginosis, and other diseases or conditions. 

Human Population Genetics

Another area of research I've worked on is human population genetics, in particular using chunks of shared DNA ("identity by descent") to learn about historical relationships among populations and major migration events for which good records may not exist. 

The image above shows where sampled individuals lived on the island of Crete. The image to the right shows how we can see a difficult mountain pass in the genetic data: people east of the mountain pass (from Vianos to Kasteli) are not very strongly related to people west of the mountain pass. 

Figure: Heat map showing log proportion of the genome identical by descent. Colors higher on the color scale indicate more chunks of shared genetic material, on average.
DOI: https://doi.org/10.1111/ahg.12328 (Figure 1; Figure 2c)


Identity-by-descent analysis shows that people whose ancestors lived on the peninsula beyond a mountain (nearer Sitia, Ierapetra, etc.) were related to each other, but not to the folks on the other side of the mountain (Kisamos in the West through Harekas in the center). 

For Students

If you are a Williams student interested in a summer research position or an honors thesis, please feel free to set up a meeting to discuss possibilities, or see the Math/Stat website for possible thesis and colloquium topics (for me and other faculty members).