Modeling the Invisible: Competition on Forecasting Viral Spread with Limited Data
July 29th - Aug 1st 2025
Georgia State University, Atlanta, Georgia
July 29th - Aug 1st 2025
Georgia State University, Atlanta, Georgia
In this competition workshop, participants will address the challenge of selecting and parameterizing models for complex biological systems. Biological systems present unique hurdles due to their complexity and variability. Even in the face of this complexity, methods to predict the future state of biological systems (and populations) from limited time series data is an ongoing challenge in medical and public heath areas.
The workshop will emphasize real-time forecasting as new data becomes available. For example, forecasting disease progression requires incorporating new data as it becomes available. Understanding disease spread within a population necessitates real-time tracking of case numbers and severity. As public health interventions are implemented, forecasts must be adjusted for real-time changes.
Data collection poses a significant challenge, as continuous monitoring is rarely feasible. Some variables cannot be measured directly, and biological data are often noisy, complicating model precision and reliability.
Prior to the competition, participants will develop any model they wish that is capable of modeling a viral pandemic. During the competition, time series data on infection occurrence will released sequentially. The participants' challenge is to predict the future course of the infection in the population.
We use yearly influenza A epidemic as a case study to model the spread of an acute infection within a population. Influenza is a recurring infection that results in tens of thousands of hospitalizations and between 5,000 and 50,000 deaths yearly in the USA [Wu 2011, Kandula 2019, Brauer 2019, Kalchev 2024]. The influenza virus mutates rapidly, and epidemiologic data suggests that infection with one strain may only temporarily, on a scale of a few months, protect from the second strain [Ferguson 2003]. The yearly strains of influenza vary widely in both their infectivity (e.g., their rate of spread) and the severity of the disease (e.g., the number of cases severe enough to require hospitalization or lead to death). Health organizations update influenza vaccines yearly attempting to predict the upcoming year’s most infectious and most common strains. The effectiveness of the vaccines, as well as the percentage of the population vaccinated, varies widely from year to year but is usually below 50% [CDCb 2024]. This combination of factors makes each influenza season unique. There are worldwide efforts to predict the severity of the yearly influenza epidemic. A key challenge arises once the influenza season begins: using early, limited data to forecast its future course. These predictions are crucial for helping public health officials prepare and respond effectively to the evolving yearly disease course.
Monthly deaths from Influenza in the US. From: Saloni Dattani and Fiona Spooner (2022) - “How many people die from the flu?” Published online at OurWorldinData.org. Retrieved from: https://ourworldindata.org/influenza-deaths
The registration deadline has been extended to Monday, June 23, 2025. However, we only have a few travel and housing slots remaining.
The goals of this workshop are to enhance current modeling practices and to establish a versatile blueprint for applying competitive modeling strategies to other biomedical challenges. By combining competition with collaboration, this workshop aims to push the boundaries of predictive biology. We will:
(1) Develop an infrastructure for small-scale, real-time biological modeling competitions.
(2) Use viral pandemic data as the test data along with an SIR model to produce realistic pandemic data that the workshop attendees attempt to model in real time.
(3) Explore best practices to identify optimal modeling strategies against uncertainty, data scarcity, and time constraints.
(4) Grow a community of trained individuals as a “Modelers taskforce” prepared to tackle pandemic-like challenges including influenza, COVID-19 and other infectious diseases.
(5) Provide an educational opportunity to undergraduate and graduate students (and other interested students of any educational level) to learn real time forecasting using real world, noisy data.
Students will be grouped into teams that will work together to develop models and modeling techniques to make predictions given a particular data set. These data sets will be provided by a concealed reference model, simulating real-world conditions where data is incomplete or noisy. Participants will navigate scenarios featuring pristine data, and noise-infused datasets, evaluating the robustness and adaptability of their models and approaches.
We encourage participants local to the Atlanta Georgia area to attend on a commuter basis.
We have funds available to help cover travel costs for participants outside the Atlanta area. Participants can apply for up to $500 to help cover the cost of travel.
In addition, participants can stay in the GSU dorms, within easy walking distance of the meeting venue, at no charge.
See the Registration page to apply and for more information.