
Imputation of the disease date for incomplete data in the context of the COVID-19 epidemic in Austria

When cases of notifiable diseases are reported, the disease date is often missing or becomes available only late, so a substitute date, such as the laboratory reporting date, must be used for temporal analyses.

We present a statistical model to impute missing disease dates, based on the difference between the laboratory reporting date and the disease date observed in case reports with complete data. The model is applied to the COVID-19 surveillance dataset for Austria. The difference between disease date and laboratory reporting date averaged 5.4 days and varied by calendar week of the epidemic: it increased with the number of cases per calendar week. Based on the laboratory reporting date, case numbers peaked on 26.03.2020; based on the disease date, including cases with an imputed disease date, the peak was already reached on 16.03.2020.
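The general idea of imputing from complete cases can be illustrated with a minimal Python sketch. It is not the authors' model, whose details are not given here. It assumes a simple single imputation: the mean delay between disease date and laboratory reporting date is computed per calendar week from complete case reports and subtracted from the laboratory reporting date of incomplete ones. The function name `impute_disease_dates`, the dictionary keys, and the fallback to the overall mean delay are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import timedelta
from statistics import mean


def impute_disease_dates(cases):
    """Return one disease date per case, imputing the missing ones.

    `cases` is a list of dicts with the hypothetical keys
    'lab_report_date' (datetime.date) and 'disease_date'
    (datetime.date or None).
    """
    # Collect delays in days from complete cases, grouped by the
    # ISO (year, week) of the laboratory reporting date.
    delays_by_week = defaultdict(list)
    for case in cases:
        if case["disease_date"] is not None:
            week = tuple(case["lab_report_date"].isocalendar())[:2]
            delay = (case["lab_report_date"] - case["disease_date"]).days
            delays_by_week[week].append(delay)

    all_delays = [d for delays in delays_by_week.values() for d in delays]
    if not all_delays:
        raise ValueError("no complete cases to estimate the delay from")
    overall_mean = mean(all_delays)

    imputed = []
    for case in cases:
        if case["disease_date"] is not None:
            # Complete case: keep the reported disease date.
            imputed.append(case["disease_date"])
            continue
        week = tuple(case["lab_report_date"].isocalendar())[:2]
        delays = delays_by_week.get(week)
        # Assumption: fall back to the overall mean delay when the
        # week has no complete cases.
        delay = mean(delays) if delays else overall_mean
        imputed.append(case["lab_report_date"] - timedelta(days=round(delay)))
    return imputed
```

Grouping by calendar week reflects the observation above that the delay varied over the course of the epidemic. A mean-based imputation like this ignores the spread of the delay distribution, so it would understate uncertainty compared with a fully specified statistical model.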

Lukas Richter, Daniela Schmid, Department of Infection Epidemiology & Surveillance, AGES; Ernst Stadlober, Institute of Statistics, Graz University of Technology

Last updated: 14.09.2022

