Bayesian Nonparametric Mixture Modeling for NBA Shot Data


Post Date: 11/18/2024

In the last quarter of my undergraduate education, I took a course on Bayesian Nonparametrics, and to demonstrate what I learned, I implemented a project making inference on NBA shot chart data. This data is publicly available, and the goal is to estimate a player's shooting tendencies, in terms of where they are likely to shoot, and how likely they are to make a shot given its location. These two items are highly correlated: If a player makes a lot of shots from one location on the court, they are more likely to take shots from that area in the future. Ideally, any model making inference on this data can account for this dependence. 

The model I used came from the paper Mixture Modeling for Marked Poisson Processes, by Athanasios Kottas and Matthew A. Taddy. The paper describes Bayesian Nonparametric models that are fitting to model marked point processes, and can incorporate the above-described dependence. We assume that shot charts follow a Non-homogeneous Marked Poisson Point Process (Parametric assumption here):