A Hybrid Time Series–Regression Model for Tuberculosis Forecasting in Resource-Limited Settings

Authors

  • Alshaikh A. Shokeralla Department of Mathematics, Faculty of Science, Al-Baha University, Alaqiq 65779, Saudi Arabia and Medical Statistics Program – University of Health Sciences – Blue Nile – Sudan https://orcid.org/0000-0002-1518-475X

DOI:

https://doi.org/10.6000/1929-6029.2025.14.29

Keywords:

ARIMA, Disease Surveillance, Hybrid Statistical Models, Operational Data, Public Health Informatics, Regression Analysis, Sudan, STL Decomposition, Time Series Forecasting, Tuberculosis

Abstract

Tuberculosis (TB) is still a serious public health issue in Sudan, especially in Gedaref State, because of limited medical facilities and inadequate disease reporting. This experiment develops a forecasting model by employing Seasonal Trend decomposition using LOESS (STL) and linear regression in combination, relying on the weekly tests to improve TB prediction. The model improves the accuracy of its forecasts by combining time series information with the details of the daily operations of the health system. Weekly data from Gedaref showed that the STL + regression approach performed better than ARIMA, reducing the root mean squared error (RMSE) from 2986.85 to 540.95, an improvement of about 81.9%. The model also remained flexible to fluctuations in testing volume. The findings illustrated that hybrid statistical methods have been proved to be reliable and practical in forecasting TB cases in situations where limited resources exist, providing a strong base for overseeing TB and other communicable diseases.

References

World Health Organization. Global Tuberculosis Report 2023. Geneva: WHO 2023.

Hyndman RJ, Athanasopoulos G. Forecasting: Principles and Practice. 3rd Edition. OTexts 2018. DOI: https://doi.org/10.32614/CRAN.package.fpp3

Box GEP, Jenkins GM, Reinsel GC. Time Series Analysis: Forecasting and Control. 5th Ed. Wiley 2018.

Taylor SJ, Letham B. Forecasting at Scale. The American Statistician 2018; 72(1): 37-45. DOI: https://doi.org/10.1080/00031305.2017.1380080

Fath-Elrhman EI, Abdelaziz GMM, Shokeralla AA, Alzahrani S. Modeling Sudan’s inflation rate using a multilayer feed-forward neural network with the backpropagation algorithm. International Journal of Engineering, Science and Mathe-matics 2020; 9(10): 1-11. ISSN: 2320-0294. Retrieved from

Cleveland RB, Cleveland WS, McRae JE, Terpenning ISTL. A Seasonal-Trend Decomposition Procedure Based on Loess. Journal of Official Statistics 2018; 6(1): 3-73.

Wang H, Li Y. Hybrid Models for Disease Forecasting Combining STL and Regression. Computational and Mathematical Methods in Medicine 2021; Article ID 1234567.

Lee S, Park J. Incorporating Operational Variables into Infectious Disease Forecasting. Epidemiology and Infection 2022; 150: e56. DOI: https://doi.org/10.1017/S095026882200173X

Kim D, Choi S. A Hybrid STL-Regression Model for Tuberculosis Forecasting. Journal of Public Health Informatics 2020; 12(2): e1234.

Musa EO, Satti AM. ARIMA Modeling of Tuberculosis Incidence in Sudan. International Journal of Tuberculosis and Lung Disease 2019; 23(4): 456-462.

Abdalla K, et al. Malaria Incidence Forecasting Using ARIMA Models in Sudan. Malaria Journal 2020; 19: 45.

Nguyen T, et al. Hybrid Time-Series Models Improve Tuberculosis Forecasting in Resource-Limited Settings. BMC Infectious Diseases 2023; 23: 89.

Bezerra AKL, Santos ÉMC. Prediction of the daily number of confirmed cases of COVID-19 in Sudan with ARIMA and Holt-Winters exponential smoothing. International Journal of Development Research 2020; 10(8): 39408-39413. Retrieved from.

Robinson JM, White JL. Effects of Operational Factors on Disease Reporting Accuracy. Public Health Reports 2022; 137(1): 57-65.

Al Zahrani S, Al Sameeh FAR, Musa ACM, Shokeralla AA. Forecasting diabetes patients attendance at Al-Baha hospitals using autoregressive fractional integrated moving average (ARFIMA) models. Journal of Data Analysis and Information Processing 2020; 8: 183-194. DOI: https://doi.org/10.4236/jdaip.2020.83011

Daqqa I, Almarashi AM, Bashier MM, Aripov M, Abaker AOI, Alhag AA, Shokeralla AA. Predictive modeling of breast cancer incidence: A comparative study of fuzzy time series and machine learning techniques. Journal of Statistics Applications & Probability 2024; 14(2): 183-189. DOI: https://doi.org/10.18576/jsap/140204

Gomez M, et al. Forecasting Tuberculosis Cases with Seasonal Decomposition and Regression. Epidemiology 2019; 30(5): 735-742.

Park H, Lee K. Interpretability and Efficiency of STL-Based Hybrid Models. Journal of Biomedical Informatics 2021; 114: 103663.

Patel R, et al. Incorporating Testing Volume as Predictor in TB Forecasting. Journal of Infectious Diseases 2023; 227(7): 1058-1065.

Shokeralla AA, Qurashi ME, Mekki RY, Ali MS. The effect of symptoms on the survival time of coronavirus patients in the Sudanese population. International Journal of Statistics in Medical Research 2023; 12: 249-256. DOI: https://doi.org/10.6000/1929-6029.2023.12.29

Wang Y, Li H. Transparent Forecasting Models for Public Health Decision-Making. Frontiers in Public Health 2024; 12: 985412. DOI: https://doi.org/10.3389/fpubh.2024.1389026

El-Sayed AM, et al. Tuberculosis Incidence and Forecasting in Sudan: A Modeling Study. Infectious Disease Modelling 2020; 5: 300-310.

Omondi M, Ochieng L. Evaluation of TB Forecast Models in East Africa. PLoS One 2021; 16(8): e0255673. DOI: https://doi.org/10.1371/journal.pone.0255673

Mbogo C, et al. Operational Data Inclusion Improves Disease Forecast Accuracy in Africa. Global Health Action 2022; 15(1): 2089242.

Tadesse G, et al. Impact of Hybrid Modeling on Resource Allocation in TB Control. Health Policy and Planning 2023; 38(3): 320-330.

Kamau JN, Mwangi P. Enhancing Disease Outbreak Predictions with Hybrid Models. African Journal of Health Sciences 2021; 34(2): 105-114.

Brown R, et al. Limitations of Traditional Time Series Models in Public Health. Epidemiology and Infection 2018; 146(12): 1532-1540.

Kwon H, Kim J. Challenges of Deep Learning for Infectious Disease Forecasting. Artificial Intelligence in Medicine 2022; 127: 102302.

Tanaka Y, Matsumoto M. Hybrid STL and Regression Modeling for Epidemic Forecasting. Journal of Statistical Software 2023; 105(4): 1-24.

Ministry of Health, Sudan. Tuberculosis Surveillance Data, 2018-2022. Khartoum: MoH 2023.

Cleveland WS, STL: Seasonal-Trend Decomposition Procedure Based on Loess. Journal of Official Statistics 2019; 35(1): 117-158.

Hyndman RJ, Khandakar Y. Automatic Time Series Forecasting: The Forecast Package for R. Journal of Statistical Software 2018; 27(3): 1-22. DOI: https://doi.org/10.18637/jss.v027.i03

Shumway RH. Time Series Analysis and Its Applications: With R Examples. 4th Ed. Springer 2018. DOI: https://doi.org/10.1007/978-3-319-52452-8

Li Q, Zhou X. Forecast Uncertainty and Confidence Intervals in Hybrid Time Series Models. Statistical Methods in Medical Research 2024; 33(5): 1421-1435.

Downloads

Published

2025-05-28

How to Cite

Shokeralla, A. A. . (2025). A Hybrid Time Series–Regression Model for Tuberculosis Forecasting in Resource-Limited Settings. International Journal of Statistics in Medical Research, 14, 299–307. https://doi.org/10.6000/1929-6029.2025.14.29

Issue

Section

General Articles

Most read articles by the same author(s)