Distribution Pattern Analysis of Daily Confirmed COVID-19 Cases Across Selected Countries and Indian Epidemic Waves

Authors

  • Anita Sahu Department of Statistics, School of Applied Sciences, Amity University, Jaipur, Rajasthan 303002, India
  • Jagdish Prasad Department of Statistics, School of Applied Sciences, Amity University, Jaipur, Rajasthan 303002, India

DOI:

https://doi.org/10.6000/1929-6029.2026.15.25

Keywords:

COVID-19, Distribution pattern analysis, Daily confirmed cases, Parametric distribution fitting, Skewness, Kurtosis, India epidemic waves, Public health statistics

Abstract

Daily confirmed COVID-19 cases have shown highly heterogeneous epidemic trajectories across different countries. The timing of the outbreak, the intensity of spread, the peak size, the duration of waves, and the reporting pattern varied substantially from one country to another. Due to this heterogeneity, the statistical analysis of daily confirmed cases is important not only to describe the epidemic burden but also to understand how cases are distributed over time.

In this study, the distribution pattern of COVID-19 daily cases is analysed for six countries: India, the United States, Brazil, the United Kingdom, South Africa, and Japan. The daily confirmed COVID-19 case data are taken from the Our World in Data COVID-19 dataset. The study period is restricted to a common analytical window from 30 January 2020 to 20 February 2022. After preprocessing, 753 daily observations are obtained for each country. For cross-country comparison, daily cases per million population are used, while India-specific wave analysis is performed using raw daily confirmed case counts.

The analysis evaluates descriptive distributional behaviour and parametric distribution fitting using selected candidate distributions, including Gamma, Generalised Gamma, Weibull, Log-Normal, Log-Logistic, Inverse Gaussian, Exponential, and Normal distributions. The results show that daily COVID-19 case distributions are right-skewed in all selected countries. The mean value is greater than the median value in all countries, showing that high-incidence days strongly affect the average case count. Country-wise distribution fitting shows that no single distribution is best for all countries. Log-Logistic distribution is selected for India, the United States and Japan; Inverse Gaussian distribution for Brazil; Generalised Gamma distribution for South Africa; and Log-Normal distribution for the United Kingdom.

For India, the wave-wise analysis shows that the three epidemic waves have different distributional behaviour. Wave 1 is best approximated by the Weibull distribution, Wave 2 by the Log-Normal distribution, and Wave 3 by the Generalised Gamma distribution. These findings indicate that COVID-19 daily case patterns are asymmetric, heterogeneous, and wave-specific. Hence, flexible positive-support and right-skewed distributions are more useful for describing the empirical distribution of positive daily confirmed COVID-19 cases than symmetric assumptions. However, the selected distributions are interpreted only as relative empirical approximations among the evaluated candidate distributions, and not as exact distributional laws of COVID-19 daily cases.

References

Giménez V, Thieme C, Prior D, Tortosa-Ausina E. International comparisons of COVID-19 pandemic management: What can be learned from activity analysis techniques? Omega 2024; 122: 102954.

Sun MW, Troxell D, Tibshirani R. Public health factors help explain cross-country heterogeneity in excess death during the COVID-19 pandemic. Scientific Reports 2023; 13: 16196.

Guharay S, et al. A data-driven approach to study temporal characteristics of COVID-19 infection and death time series for twelve countries across six continents. BMC Medical Research Methodology 2025.

Mathieu E, Ritchie H, Rodés-Guirao L, Appel C, Gavrilov D, Giattino C, Hasell J, Macdonald B, Dattani S, Beltekian D, Ortiz-Ospina E, Roser M. Coronavirus pandemic (COVID-19). Our World in Data 2020. https://ourworldindata.org/coronavirus

Kumar G, Bhalla A, Mukherjee A, Turuk A, Talukdar A, Mukherjee S, et al. Post COVID sequelae among COVID-19 survivors: Insights from the Indian National Clinical Registry for COVID-19. BMJ Global Health 2023; 8: e012245.

Alvarez E, Padgett D, Garnier R, Mattioli M, Engebretsen E. Limitations of COVID-19 testing and case data for evidence-informed health policy and practice. Health Research Policy and Systems 2023; 21: 11.

Cavanaugh JE, Neath AA. Akaike Information Criterion, The: Background, derivation, properties, and refinements. In: Lovric M, ed. International Encyclopedia of Statistical Science 2025; 35-41.

Charniga K, Park SW, Akhmetzhanov AR, Cori A, Dushoff J, Funk S, Gostic KM, Linton NM, Lison A, Overton CE, Pulliam JRC, Ward T, Cauchemez S, Abbott S. Best practices for estimating and reporting epidemiological delay distributions of infectious diseases. PLOS Computational Biology 2024; 20(10): e1012520.

Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Chapman & Hall/CRC 1993.

Liang CW, Lv QY, Chen ZG, Xu B, Lai YS, Zhang Z. Model-inferred timing and infectious period of the chickenpox outbreak source. BMC Infectious Diseases 2024; 24(1): 1257.

Massey FJ Jr. The Kolmogorov-Smirnov test for goodness of fit. Journal of the American Statistical Association 1951; 46(253): 68-78.

McAloon C, Collins Á, Hunt K, Barber A, Byrne AW, Butler F, Casey M, Griffin J, Lane E, McEvoy D, Wall P, Green M, O’Grady L, More SJ. Incubation period of COVID-19: A rapid systematic review and meta-analysis of observational research. BMJ Open 2020; 10(8): e039652.

National Institute of Standards and Technology. Kolmogorov-Smirnov goodness-of-fit test. NIST/SEMATECH e-Handbook of Statistical Methods 2012. https://www.itl.nist.gov/div898/handbook/eda/section3/eda35g.htm

Negi S, et al. Trend of viral load during the first, second, and third wave of COVID-19 in the Indian Himalayan region: An observational study of Uttarakhand state. Frontiers in Microbiology 2024; 14: 1279632.

Niveditha D, Khan S, Khilari A, Nadkarni S, Bhalerao U, Kadam P, et al. A tale of two waves: Delineating diverse genomic and transmission landscapes driving the COVID-19 pandemic in Pune, India. Journal of Infection and Public Health 2023; 16(9): 1417-1426.

Padget M, Adam P, Rebolledo J, Riccardo F, Riess M, Rusu LC, Che D, Coignard B. A comparison of COVID-19 incidence rates across six European countries in 2021. Eurosurveillance 2023; 28(40): 2300088.

Portet S. A primer on model selection using the Akaike Information Criterion. Infectious Disease Modelling 2020; 5: 111-128.

Steinskog DJ, Tjøstheim DB, Kvamstø NG. A cautionary note on the use of the Kolmogorov-Smirnov test for normality. Monthly Weather Review 2007; 135(3): 1151-1157.

Tancredi S, Cullati S, Chiolero A. Surveillance bias in the assessment of the size of COVID-19 epidemic waves: A case study. Public Health 2024; 234: 98-104.

Pawitan Y. In All Likelihood: Statistical Modelling and Inference Using Likelihood. Oxford University Press 2001.

Schwarz G. Estimating the dimension of a model. The Annals of Statistics 1978; 6(2): 461-464.

Downloads

Published

2026-07-01

How to Cite

Sahu, A. ., & Prasad, J. . (2026). Distribution Pattern Analysis of Daily Confirmed COVID-19 Cases Across Selected Countries and Indian Epidemic Waves. International Journal of Statistics in Medical Research, 15, 274–291. https://doi.org/10.6000/1929-6029.2026.15.25

Issue

Section

General Articles