This one-day workshop will feature recent advances in causal inference at the intersection of statistics, operations research, and engineering. The workshop aims to combine theory and applications and to bring together researchers from academia and industry. The event is organized to promote meaningful interactions and discussions and to foster interdisciplinary collaboration through a mix of talks, poster presentations, and networking breaks.
As part of the workshop, we will accept short papers broadly at the intersection of causal inference, engineering, and operations research. The track is non-archival, and accepted papers will be presented as posters along with short spotlight talks. Additionally, the top three papers will be put on a fast-track submission to a top journal [authors will be provided options]. Papers should be submitted via the following Google form. Authors are asked to submit extended abstracts that are at most 8 pages in length. Please use the standard PER format.
If there are any questions, please contact dennis [dot] shen [at] marshall [dot] usc [dot] edu.
9:00-9:15am: Opening remarks
9:15-10:00am: Spotlight talks
10:00-10:30am: Coffee break (Idea Hub)
10:30am-12:00pm: Speaker session
Angela Zhou: Structured Offline RL via Reward Filtering and Orthogonal Q-Contrasts
Kyra Gan: Causal Deep RL: Activating Minimal Markovian Representation via Multi-Order State Exposure
12:00pm-1:30pm: Lunch break (Rogel Ballroom)
1:30-3:00pm: Speaker session
George Chen: Measuring the Impact of Medication Non-adherence on When Adverse Outcomes Happen in Schizophrenia
Sarah Cen: Large-Scale, Longitudinal Study of Large Language Models During the 2024 US Election Season
3:00-3:30pm: Coffee break (Idea Hub)
3:30-5:00pm: Speaker session
Dogyoon Song: Regression Adjustment in High-Dimensions: A Design-Based Finite-Sample View
Colin Fogarty: Sample Splitting and Two-player Games in Observational Studies with Hidden Bias
Sarah Cen (CMU)
The 2024 US presidential election is the first major contest to occur in the US since the popularization of large language models (LLMs). Building on lessons from earlier shifts in media (most notably social media's well-studied role in targeted messaging and political polarization), this moment raises urgent questions about how LLMs may shape the information ecosystem and influence political discourse. While platforms have announced some election safeguards, how well they work in practice remains unclear. Against this backdrop, we conduct a large-scale, longitudinal study of 12 models, queried using a structured survey with over 12,000 questions on a near-daily cadence from July through November 2024. Our design systematically varies content and format, resulting in a rich dataset that enables analyses of the models' behavior over time (e.g., across model updates), sensitivity to steering, responsiveness to instructions, and election-related knowledge and "beliefs." In the latter half of our work, we perform four analyses of the dataset that (i) study the longitudinal variation of model behavior during election season, (ii) illustrate the sensitivity of election-related responses to demographic steering, (iii) interrogate the models' beliefs about candidates' attributes, and (iv) reveal the models' implicit predictions of the election outcome. To facilitate future evaluations of LLMs in electoral contexts, we detail our methodology, from question generation to the querying pipeline and third-party tooling.
George Chen (CMU)
This study quantifies the association between non-adherence to medications and adverse outcomes in schizophrenia patients. We frame the problem using survival analysis, focusing on the time to the earliest of several adverse events (early death, involuntary hospitalization, jail stay). We combine standard causal inference methods (T-learner, S-learner, nearest neighbor matching) with various survival models to estimate individual and average treatment effects, where treatment corresponds to medication non-adherence. Analyses are repeated using different amounts of longitudinal information (3-12 months). Using data from Allegheny County, PA, we find strong evidence that non-adherence advances adverse outcomes by approximately 1 to 4 months. Ablation studies confirm that county-provided risk scores adjust for key confounders, as their removal amplifies the estimated effects. Subgroup analyses by medication formulation (injectable vs. oral) and medication type consistently show that non-adherence is associated with earlier adverse events. We caution that although we apply causal inference, we only make associative claims and discuss assumptions needed for causal interpretation.
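The T-learner mentioned in the abstract fits a separate outcome model on each treatment arm and contrasts the two models' predictions. A minimal sketch on synthetic data (linear outcome models stand in for the talk's survival models; the covariates, coefficients, and the true effect of -2 months are all made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=(n, 3))                  # hypothetical covariates
t = rng.binomial(1, 0.5, size=n)             # "treatment" = non-adherence (synthetic)
# synthetic time-to-adverse-event (months): non-adherence shortens it by 2 months
y = 12 + x @ np.array([1.0, -0.5, 0.3]) - 2.0 * t + rng.normal(scale=0.5, size=n)

def fit_linear(X, y):
    """Least-squares fit with an intercept; returns the coefficient vector."""
    Xb = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(Xb, y, rcond=None)
    return beta

def predict(beta, X):
    return np.column_stack([np.ones(len(X)), X]) @ beta

# T-learner: one outcome model per arm, then contrast predictions on everyone.
beta1 = fit_linear(x[t == 1], y[t == 1])     # treated-arm model
beta0 = fit_linear(x[t == 0], y[t == 0])     # control-arm model
cate = predict(beta1, x) - predict(beta0, x) # individual treatment effects
ate = cate.mean()                            # average treatment effect
print(round(ate, 1))                         # close to the true effect of -2.0
```

The S-learner differs only in fitting a single model with treatment as a feature; matching replaces the regressions with nearest-neighbor comparisons.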
Colin Fogarty (UMich Ann Arbor)
In observational studies, design decisions are known to strongly impact the reported robustness of a study’s findings to unmeasured confounding. As the optimal choices depend on properties of the data themselves, splitting one’s data into planning and analysis samples appears particularly appealing: one can use a planning sample to inform the subsequent analysis of the observational study, targeting choices which improve performance in a sensitivity analysis. When viewed through the lens of a two-player game, however, sample splitting may put the practitioner at a disadvantage relative to approaches which use the whole data to inform design choices: the practitioner plays first, making decisions using the planning sample, and then imagines nature’s worst-case response to that decision in the analysis sample, whereas in reality the hidden bias has been realized before the practitioner analyzes the data. We characterize decision sets under which sample splitting is innocuous in terms of the limiting power of a sensitivity analysis. We provide a novel minimax theorem, while highlighting the potential breakdown when our theorem’s conditions are violated. We apply our method to investigate the effects of poverty on the emergence of cardiovascular disease risk factors in children and adolescents. We discover adverse consequences on outcomes related to body composition, physical activity, and tobacco exposure.
Kyra Gan (Cornell Tech)
Reinforcement learning (RL) relies on the Markov property for guaranteed performance, but real-world applications often lack well-defined states given raw observed variables. While causal RL has attracted growing interest, existing work typically assumes Markovian states are already given and focuses on using causality to accelerate learning, leaving a fundamental gap: given a longitudinal causal graph over observed variables, how does one construct MDP states that provably satisfy the Markov property? We address this by providing a procedure that constructs a minimal state representation and proving its correctness. The significance of this construction, however, depends on the learning setting. In deep RL, we observe that the minimal representation alone empirically fails to improve performance, indicating that neural networks cannot directly exploit Markovian minimality. To address this, we propose MOSE (Multi-Order State Exposure), which feeds multi-order historical state constructions (orders 1 through $W$) into the same Q-function. MOSE consistently outperforms both the minimal state construction and single-window policies across common benchmarks and synthetic datasets. Adding the minimal representation in MOSE can further improve performance. Our results establish a core principle for causal deep RL: minimal sufficiency is not enough; controlled redundancy is necessary to unlock the benefit of causal state information.
Dogyoon Song (UC Davis)
Regression adjustment is a classical way to improve precision in randomized experiments, but its finite-sample behavior is poorly understood when covariates are high-dimensional and p may exceed n. This talk presents a design-based, non-asymptotic analysis of regression-adjusted average treatment effect estimation under complete randomization. The framework yields oracle confidence intervals with finite-sample validity and explicit, instance-adaptive widths, without requiring a correctly specified outcome model and while allowing p>n. The key idea is a swap sensitivity analysis that separates stochastic fluctuation from design bias: the former is controlled by a variance-adaptive martingale argument and Freedman’s inequality, while the latter is bounded using Stein’s method of exchangeable pairs. The resulting bounds make explicit how covariate geometry affects concentration, bias, and the usefulness of adjustment. Time permitting, I will also discuss ongoing work to derive data-driven confidence envelopes and broader prospects for design-based concentration methods in causal inference.
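The regression-adjusted estimator discussed in the abstract can be illustrated in the classical low-dimensional case (a Lin-style fully interacted regression with a centered covariate; the data-generating numbers below, including a true average treatment effect of 1.0, are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1000
x = rng.normal(size=n)                       # a single hypothetical covariate
y0 = 2.0 + 1.5 * x + rng.normal(scale=0.3, size=n)  # control potential outcomes
y1 = y0 + 1.0                                # constant treatment effect: true ATE = 1.0
t = np.zeros(n, dtype=int)
t[rng.permutation(n)[: n // 2]] = 1          # complete randomization: exactly n/2 treated
y = np.where(t == 1, y1, y0)                 # observed outcomes

# Unadjusted difference-in-means estimator
dim = y[t == 1].mean() - y[t == 0].mean()

# Regression adjustment: regress y on treatment, the centered covariate, and
# their interaction; with a centered covariate, the coefficient on treatment
# is the covariate-adjusted ATE estimate.
xc = x - x.mean()
X = np.column_stack([np.ones(n), t, xc, t * xc])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
adj = beta[1]

print(round(dim, 2), round(adj, 2))          # both near 1.0; adj is more precise
```

Here the covariate explains most of the outcome variance, so adjustment shrinks the estimator's noise substantially; the talk's design-based analysis asks what can still be guaranteed when the number of covariates grows and may exceed n.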
Angela Zhou (USC)
We study offline reinforcement learning under structural conditions where the dynamics may depend on many state variables, but optimal decisions depend only on a sparse, reward-relevant subset of the state. This “decision-theoretic sparsity” means that optimal policy and value functions admit lower-dimensional structure, although full-state transition estimation can be difficult. First, we develop a reward-relevance-filtered approach for linear function approximation that modifies thresholded Lasso within least-squares policy evaluation and fitted Q-iteration to focus estimation on reward-relevant components. Second, to improve robustness, we propose a structured difference-of-Q framework via orthogonal learning: a dynamic generalization of R-learning that targets Q-function contrasts sufficient for policy optimization, accommodates black-box nuisance estimators of Q and the behavior policy, and yields robust policy optimization guarantees under a margin condition. Together, these methods formalize and exploit reward-relevant structure to improve statistical efficiency and robustness in offline RL.
Paper and poster submissions: May 8, 2026
Author notification: May 15, 2026
Workshop date: June 12, 2026 (full day)