Rigorous Validation with Proof-By-Simulation

Abstract:
Modern experiment designs can make clinical trials faster, smaller, and more likely to select the right treatment. But, existing mathematical tools may not be strong enough to show how key performance metrics, such as the false positive probability, vary across different inputs, such as a range of different treatment effects. One common approach is to use simulations - but then, how many simulations are enough, and what if you've "missed a spot"? This is especially important for optimized experiment designs, which are incentivized to give false positives in places that are not checked. We take a comprehensive approach: building on methods from Sklar's PhD thesis, we extend simulations taken over a dense grid of input parameters into bounds that cover continuous volumes of parameter space.* With these tools, we aim to "automate the Type I Error proof" and other criteria for new trial designs, making regulation faster, easier, and more consistent. Speculatively, we also hope our approach will lay a foundation for new "black box" statistical methods. We have published our open-source prototype software: https://github.com/Confirm-Solutions/imprint.

Speaker email: michaelbsklar@gmail.com.


*Our bounds are non-asymptotic. We require that simulation outcomes depend on unknown parameters through an exponential family distribution [e.g. Gaussian, binomial, gamma, or Poisson outcomes]. For good performance, the computational cost of the simulation grid suffers from a curse of dimensionality; our examples in 3 - 4 parameter dimensions can be done on an at-home computer.


Bio:
Michael Sklar is CEO of Confirm Solutions, a public-benefit corporation creating open-source statistical software with a mission to speed up medical regulation. He holds a PhD in Statistics from Stanford, where he was later a Stein Fellow. https://www.linkedin.com/in/michael-sklar/ 

Summary