"Many analyst" projects give teams of researchers the same data and the same research questions, then see how they use the methods of their choice to arrive at different answers to those questions. Prior to the workshop, there will be a “many analyst” collaboration (see a previous example here). Teams will be asked to analyse the mortality of the 1918–20 influenza pandemic in the US populations of Kentucky and rural Maryland. They will be asked to estimate between 3 and 6 quantities. These quantities will relate to 1) the absolute level of pandemic mortality across space, 2) the relative level of pandemic mortality between social groups (defined by gender, race, and geography), and 3) the timing of the pandemic.
Cleaned, highly granular death count and denominator data will be provided to teams. Death count data will be available by date of death, county of death, (imputed) cause of death, and single year of age for both states. In Kentucky, death counts will also be available by the deceased’s registered gender, and in Maryland, they will be available by the registered race. 20 years of death count data will be available from Kentucky, and 10 years of data will be available from Maryland. For those who want to incorporate it into their model, granular weather data and shapefiles will also be available.
Teams can comprise up to 5 researchers. Participants will be asked to submit their estimates, reproducible code that generates those estimates (in any programming language), and a written description of their estimation technique (between 250 and 1,000 words).
Data will be available to teams starting on 15 September 2025 and all submissions are due by 15 March 2026. You can already sign up to participate in the collaboration using the link below!