- Research article
- Open Access
Using autoregressive integrated moving average (ARIMA) models to predict and monitor the number of beds occupied during a SARS outbreak in a tertiary hospital in Singapore
BMC Health Services Research volume 5, Article number: 36 (2005)
The main objective of this study is to apply autoregressive integrated moving average (ARIMA) models to make real-time predictions on the number of beds occupied in Tan Tock Seng Hospital, during the recent SARS outbreak.
This is a retrospective study design. Hospital admission and occupancy data for isolation beds was collected from Tan Tock Seng hospital for the period 14th March 2003 to 31st May 2003. The main outcome measure was daily number of isolation beds occupied by SARS patients. Among the covariates considered were daily number of people screened, daily number of people admitted (including observation, suspect and probable cases) and days from the most recent significant event discovery. We utilized the following strategy for the analysis. Firstly, we split the outbreak data into two. Data from 14th March to 21st April 2003 was used for model development. We used structural ARIMA models in an attempt to model the number of beds occupied. Estimation is via the maximum likelihood method using the Kalman filter. For the ARIMA model parameters, we considered the simplest parsimonious lowest order model.
We found that the ARIMA (1,0,3) model was able to describe and predict the number of beds occupied during the SARS outbreak well. The mean absolute percentage error (MAPE) for the training set and validation set were 5.7% and 8.6% respectively, which we found was reasonable for use in the hospital setting. Furthermore, the model also provided three-day forecasts of the number of beds required. Total number of admissions and probable cases admitted on the previous day were also found to be independent prognostic factors of bed occupancy.
ARIMA models provide useful tools for administrators and clinicians in planning for real-time bed capacity during an outbreak of an infectious disease such as SARS. The model could well be used in planning for bed-capacity during outbreaks of other infectious diseases as well.
Early isolation of infectious cases has been shown to be a key component for the successful management of SARS outbreaks. Due to the potential for nosocomial transmission[2–6], the imperfect ability of clinical criteria to distinguish cases of SARS at presentation, the possibility of atypical presentations[8, 9], and the lack of a sensitive diagnostic test in early disease, front-line clinicians need to err on the side of caution when admitting cases, and isolate patients until SARS can be clinically and virologically ruled out. As a result, the number of admissions and isolation beds required during management of SARS outbreaks can be expected to significantly exceed that used for actual SARS cases.
While hospitals are generally built with a fixed ratio of isolation to general ward beds, surge capacity for isolation beds can be met by conversion of single room wards, decanting of existing patients with lesser indications for isolation, and activation of isolation facilities at alternative sites and institutions. However, these processes require time, and the ability to forecast requirements is hence a critical component of efficient outbreak management.
During SARS outbreak in Singapore from 1 Mar to 31 May 2003, the Communicable Disease Centre (CDC) was the initial designated facility for the screening management of all SARS cases, beginning on 14 Mar 2003, two days after the WHO alert was sounded on 12 Mar. To accommodate the surge in cases, the parent hospital of the CDC, Tan Tock Seng Hospital (TTSH) became designated as the central facility for management of all SARS cases in Singapore from 22 Mar 2003. As a result of the above policy, 231 of 238 SARS cases diagnosed during the Singapore outbreak were admitted to TTSH. TTSH bed utilization patterns here hence reflect national level requirements for outbreak management. Various papers have described qualitative aspects of hospital management during SARS[13–15], but none have provided quantitative tools for predicting requirements for isolation beds. Yet such quantitative models would be of great utility to both hospital administrators and national level planners during outbreak management.
The main objective of this study is to apply autoregressive integrated moving average (ARIMA) models to make real-time predictions on the number of beds occupied in TTSH during the SARS outbreak, starting from 14 Mar 2003, when the CDC was activated, to 31 May 2003 when Singapore was declared SARS free.
This was a retrospective study design. Hospital admission and occupancy data for isolation beds was collected from Tan Tock Seng hospital for the period 14th March 2003 to 31st May 2003. The main outcome measure was daily number of isolation beds occupied by SARS patients, including those fulfilling WHO criteria for suspect and probable SARS, as well as those admitted not fulfilling WHO case definitions but admitted to isolation rooms for observation. Among the covariates considered were daily number of people screened, daily number of people admitted (including observation, suspect and probable cases) and days from the most recent significant event discovery. Key events considered were as follows:
14th Mar: discovery of the TTSH outbreak
22nd Mar: press release that TTSH would dedicated to SARS management
4th Apr: discovery of an outbreak at Singapore General Hospital (SGH)
11th Apr: discovery of an outbreak at National University Hospital (NUH)
20th Apr: discovery and press release on an outbreak at Pasir Panjang Wholesale Market (PPWM)
13th May: discovery of a cluster of febrile staff and patients at the Institute of Mental Health (IMH)
Details on the above can be found in the chronology of press releases on SARS events in Singapore. Events 1–5 all involved probable SARS cases, whereas event 6 proved to be a false alarm.
We utilized the following strategy for the analysis. Firstly, we split the outbreak data into two. Data from 14th March to 21st April 2003 was used for model development. We used structural ARIMA models in an attempt to model the number of beds occupied. Estimation is via the maximum likelihood method using the Kalman filter. For the ARIMA model parameters, we considered the simplest parsimonious lowest order model.
We computed various permutations of the order of correlation (AR), order of integration (I) and order of moving average (MA), and chose the optimal combination of parameters using the mean square error. The correlogram and partial correlogram graphs were also used to help in deciding the order of moving average (MA) and auto-regressive (AR) terms to include in the model. To ensure the model was robust to symmetric nonnormality in the disturbances, including heteroskedasticity, we computed Huber/White/sandwich estimator of variance for the coefficient estimates. Before modeling the bed occupancy, we examined whether the series was stationary. In the event of non-stationarity, we opted to set an a-prior value of 1 for starting the Kalman recursions.
We used the likelihood ratio test to determine if inclusion of other covariates helped improve the fit of the model. Based on the final model selected, we assessed the out-sample validity of the model, by applying the model to predict the number of beds occupied for the remaining period of the outbreak (i.e. 22nd April 2003 to 31st May 2003). In addition, we also made three-day forecasts for selected periods during the outbreak, starting from day 4 of the outbreak. We used the mean absolute percentage error (MAPE) to measure and quantify the quality of fit. A lower MAPE value will indicate a better fit of the data. All tests were conducted at the 5% level of significance, and data analysis was performed in Stata V7.0 (Stata Corporation, College Station, TX, USA).
From 14th March 2003 to 31st May 2003, the median daily number of beds occupied was 134 (IQR: 105–193). The range was 15 to 238 beds. The number of beds occupied reached it's peak on the 24th and 28th of April 2003, with a total of 238 beds. For the final ARIMA model, we found that the ARIMA (1,0,3) model was the most suitable, with an auto-regression term of 1 and a moving average term of 3. The correlogram indicated that there was a significant autocorrelation out to about 3 lags, and this autocorrelation decayed slowly over time (figure 1). The partial auto correlation function (PACF) plot suggested that the only highly significant partial autocorrelation occurred at one lag (figure 2). We found that the AR(1) coefficient of 1.02 and MA(3) coefficient of -0.95 were significant (p < 0.05). The likelihood ratio test indicated that the total number of admissions on the previous day and number of probable cases admitted on the previous day were significant predictors, and these variables were thus included in the final model. Furthermore, the estimated variance of the white-noise disturbance was found to be 4.47 (see table 1). Days from most recent significant event discovery and number of patients screened were not found to be significant predictors of daily number of isolation beds occupied.
As we can see from figure 3, the predictions from the ARIMA model performed reasonably well, both for the training and validation data. The MAPE for the training set and validation set were 5.7% and 8.6% respectively. This translated to an error rate of ± 7 beds and ± 13 beds respectively. We have also provided the model parameters and their corresponding MAPE values for some of the alternative ARIMA models that we had considered (table 2).
For three-day predictions, we found that the model fared reasonably well (see table 3). For day 4 to day 6 of the outbreak, the error rate was 6%. For day 7 to day 9, the rate was 10%, day 10 to day 12, 7% and finally, for day 13 to day 15, it was 9%. Although the MAPE values were within reasonable levels, we note that generally, the model under-predicts in the early stage of the outbreak, and over-predicts in the later stage of the outbreak (tables 3 and 4).
To the best of our knowledge, this is the first study to suggest the application of a known statistical method such as the ARIMA model, to predict and monitor the utilization of hospital isolation beds during the recent SARS outbreak in Singapore, for which Tan Tock Seng Hospital was the designated hospital for all patients presenting with SARS-like symptoms and exposures.
ARIMA models have traditionally found application in the financial sector. There has been limited literature on their use in healthcare; recent examples include their use in assessment of seasonal variation in selected medical conditions, and as a surveillance tool for outbreak detection. There has been some research indicating that time series modeling may be more appropriate than the simple trend fitting approach, which suffers from model specification error. ARIMA models have been used to forecast attendance at accident and emergency departments in the United Kingdom. Particularly, researchers have shown that the forecasting methodology can be improved by incorporating the ARIMA method. Here, we show that the ARIMA model can be used over the much shorter time-frame of a single outbreak to forecast bed-utilization. The three-day forecasts from the model are fairly reasonable. The MAPE is low, allowing planners to confidently decide, with sufficient lead-time, on the need to open new isolation wards, each of which, in our setting, holds between 10 to 20 patients.
However, the model has its limitations. Firstly, unmeasured confounders could have affected the results of this study, although we have accounted for measured confounders by incorporating significant covariates into the final model. Secondly, the most significant covariate was the number of probable SARS cases admitted. This is not surprising, as probable SARS cases stayed longer in isolation facilities compared to cases which turned out not to be SARS (unpublished data). In this analysis, we used the final classification for each case after a variable period of observation and investigation, and not the admission classification.
This approach was chosen, as time required to confirm cases will be likely be shortened in any future outbreaks, in view of various advancements for SARS diagnostics[27, 28]. However, it is still uncertain what proportion of SARS cases can be accurately classified on admission, and this may affect model performance. Another point to note is that the model generally under-predicts the number of beds occupied in the early stage of the outbreak and over-predicts at the later stage of the outbreak for three-day forecasts. One possible explanation could be that the model was derived from the first half of the data and applied to both the first and second halves of the outbreak. It is important for administrators to take this into account during bed planning, perhaps by allowing for appropriate buffer beds. Lastly, the ARIMA model parameters may differ under different practice protocols within different outbreak settings as well as between different SARS afflicted countries. It would be useful to calibrate the model using individual country level data. We would recommend that, in an actual outbreak, real-time calibration be performed, with additional data available on each day fed-back into the model to improve its predictive ability.
The application of ARIMA models in bed utilization is not only useful for an outbreak of SARS and emerging infectious diseases, but also for projecting resource requirements in bioterrorism events. It has been recognized by others that resource requirements will not include just isolation beds, but also outpatient resources, pharmaceuticals, as well as intensive care facilities; few articles, however, have proposed any models for forecasting such requirements. The challenge, therefore, lies in the collection of timely surveillance and resource utilization data for this specific purpose, peace-time exploration of the most appropriate methods of analysis, and real-time validation and application in the event of an outbreak.
The ARIMA model that we developed for modeling the number of beds occupied during the SARS outbreak performed reasonably well, with a MAPE of 5.7% for the training set, and 8.6% for the validation set. In addition, we found that three-day forecasts provided a reasonable prediction of the number of beds required during the outbreak
ARIMA models provide useful tools for administrators and clinicians in planning the use of isolation beds during an outbreak of an infectious disease such as SARS. The model could be used in planning for bed-capacity during outbreaks of other infectious diseases, as well as predicting requirements for other critical resources.
Lipsitch M, Cohen T, Cooper B, Robins JM, Ma S, James L, Gopalakrishna G, Chew SK, Tan CC, Samore MH, Fisman D, Murray M: Transmission dynamics and control of severe acute respiratory syndrome. Science. 2003, 300: 1966-70. 10.1126/science.1086616.
Lee N, Hui D, Wu A, Chan P, Cameron P, Joynt GM, Ahuja A, Yung MY, Leung CB, To KF, Lui SF, Szeto CC, Chung S, Sung JJ: A Major Outbreak of Severe Acute Respiratory Syndrome in Hong Kong. N Engl J Med. 2003, 348 (20): 1986-94. 10.1056/NEJMoa030685.
Spurgeon D: Canada reports more than 300 suspected cases of SARS. BMJ. 2003, 326 (7395): 897-10.1136/bmj.326.7395.897/a.
Poutanen SM, Low DE, Henry B, Finkelstein S, Rose D, Green K, Tellier R, Draker R, Adachi D, Ayers M, Chan AK, Skowronski DM, Salit I, Simor AE, Slutsky AS, Doyle PW, Krajden M, Petric M, Brunham RC, McGeer AJ, National Microbiology Laboratory, Canada; Canadian Severe Acute Respiratory Syndrome Study Team: Identification of Severe Acute Respiratory Syndrome in Canada. N Engl J Med. 2003, 348: 1995-2005. 10.1056/NEJMoa030634.
Dwosh HA, Hong HH, Austgarden D, Herman S, Schabas R: Identification and containment of an outbreak of SARS in a community hospital. CMAJ. 2003, 168 (11): 1415-20.
Varia M, Wilson S, Sarwal S, McGeer A, Gournis E, Galanis E, Henry B: Investigation of a nosocomial outbreak of severe acute respiratory syndrome (SARS) in Toronto, Canada. CMAJ. 2003, 169 (4): 285-92.
Rainer TH, Cameron PA, Smit D, Ong KL, Hung AN, Nin DC, Ahuja AT, Si LC, Sung JJ: Evaluation of WHO criteria for identifying patients with severe acute respiratory syndrome out of hospital: prospective observational study. BMJ. 2003, 326: 1354-8. 10.1136/bmj.326.7403.1354.
Tan TT, Tan BH, Kurup A, Oon LL, Heng D, Thoe SY, Bai XL, Chan KP, Ling AE: Atypical SARS and Escherichia coli bacteremia. Emerg Infect Dis. 2004, 10 (2): 349-52.
Tee AK, Oh HM, Lien CT, Narendran K, Heng BH, Ling AE: Atypical SARS in geriatric patient. Emerg Infect Dis. [serial online] 2004 Feb [cited 28 Feb 2004]., [http://www.cdc.gov/ncidod/EID/vol10no2/03-0322.htm]
Cheng PK, Wong DA, Tong LK, Ip SM, Lo AC, Lau CS, Yeung EY, Lim WW: Viral shedding patterns of coronavirus in patients with probable severe acute respiratory syndrome. Lancet. 2004, 363 (9422): 1699-700. 10.1016/S0140-6736(04)16255-7.
World Health Organization, Communicable Disease Surveillance and Response: Update 95 – SARS: Chronology of a serial killer. 4 Jul 2003. Accessed 22 Oct 2003, [http://www.who.int/csr/don/2003_07_04/en/]
Ministry of Health, Singapore: MOH SARS Press Releases 22 Mar 2003: Enhanced Precautionary Measures To Break SARS Transmission. Accessed on 11 Sep 2003., [http://www.moh.gov.sg/corp/sars/news/update_details.jsp?id=10092827]
McDonald LC, Simor AE, Su IJ, Maloney S, Ofner M, Chen KT, Lando JF, McGeer A, Lee ML, Jernigan DB: SARS in healthcare facilities, Toronto and Taiwan. Emerg Infect Dis. 2004, 10 (5): 777-81.
Loutfy MR, Wallington T, Rutledge T, Mederski B, Rose K, Kwolek S, McRitchie D, Ali A, Wolff B, White D, Glassman E, Ofner M, Low DE, Berger L, McGeer A, Wong T, Baron D, Berall G: Hospital preparedness and SARS. Emerg Infect Dis. 2004, 10 (5): 771-6.
Gopalakrishna G, Choo P, Leo YS, Tay BK, Lim YT, Khan AS, Tan CC: SARS transmission and hospital containment. Emerg Infect Dis. 2004, 10 (3): 395-400.
World Health Organization, Communicable Disease Surveillance and Response: Case definitions for surveillance of Severe Acute Respiratory Syndrome (SARS). Revised 1 May 2003. Accessed on 22 Oct 2003., [http://www.who.int/csr/sars/casedefinition/en/]
Ministry of Health, Singapore: MOH SARS Press Releases. Accessed on 25th Nov 2003., [http://www.moh.gov.sg/corp/sars/news/updates.jsp?year=2003]
Chong SA, Subramaniam M, Chua HC, Lee CE: SARS or not SARS: outbreak of fever in a state mental institute in Singapore. Can J Psychiatry. 2004, 49 (3): 216-7.
Box GEP, Jenkins GM, Reinsel GC: Time Series Analysis: Forecasting and Control. 1994, Englewood Cliffs, NJ: Prentice-Hall, 3
Harvey AC: Forecasting, structural time series models and the Kalman filter. 1989, Cambridge: Cambridge University Press
Hamilton JD: Time Series Analysis. 1994, Princeton: Princeton University Press
Kalman RE: A new approach to linear filtering and prediction problems. Journal of Basic Engineering, Transactions of the ASME Series D. 1960, 82: 35-45.
Moineddin R, Upshur RE, Crighton E, Mamdani M: Autoregression as a means of assessing the strength of seasonality in a time series. Popul Health Metr. 2003, 1 (1): 10-10.1186/1478-7954-1-10.
Reis BY, Mandl KD: Time series modeling for syndromic surveillance. BMC Med Inform Decis Mak. 2003, 3 (1): 2-10.1186/1472-6947-3-2.
Farmer RD, Emami J: Models for forecasting hospital bed requirements in the acute sector. J Epidemiol Community Health. 1990, 44 (4): 307-12.
Milner PC: Ten-year follow-up of ARIMA forecasts of attendances at accident and emergency departments in the Trent region. Stat Med. 1997, 16 (18): 2117-25. 10.1002/(SICI)1097-0258(19970930)16:18<2117::AID-SIM649>3.0.CO;2-E.
Mahony JB, Petrich A, Louie L, Song X, Chong S, Smieja M, Chernesky M, Loeb M, Richardson S, Ontario Laboratory Working Group for the Rapid Diagnosis of Emerging Infections: Performance and Cost evaluation of one commercial and six in-house conventional and real-time reverse transcription-pcr assays for detection of severe acute respiratory syndrome coronavirus. J Clin Microbiol. 2004, 42 (4): 1471-6. 10.1128/JCM.42.4.1471-1476.2004.
Guan M, Chan KH, Peiris JS, Kwan SW, Lam SY, Pang CM, Chu KW, Chan KM, Chen HY, Phuah EB, Wong CJ: Evaluation and validation of an enzyme-linked immunosorbent assay and an immunochromatographic test for serological diagnosis of severe acute respiratory syndrome. Clin Diagn Lab Immunol. 2004, 11 (4): 699-703. 10.1128/CDLI.11.4.699-703.2004.
Hupert N, Mushlin AI, Callahan MA: Modeling the public health response to bioterrorism: using discrete event simulation to design antibiotic distribution centers. Med Decis Making. 2002, 22 (5 Suppl): S17-25. 10.1177/027298902237709.
Cohen V: Organization of a health-system pharmacy team to respond to episodes of terrorism. Am J Health Syst Pharm. 60 (12): 1257-63. 2003 Jun 15
White SM: Chemical and biological weapons. Implications for anaesthesia and intensive care. Br J Anaesth. 2002, 89 (2): 306-24. 10.1093/bja/aef168.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6963/5/36/prepub
We would like to thank all the doctors, nurses, respiratory therapists, physiotherapists, radiographers, medical social workers, healthcare attendants and clerks who worked in Tan Tock Seng Hospital, Singapore during the SARS outbreak in 2003.
The author(s) declare that they have no competing interests.
DN conceived the study and contributed to the study design, analysis and interpretation. AE contributed to the statistical analysis, interpretation and writing of the manuscript. MIC contributed to the statistical analysis, interpretation and writing of the manuscript. LYS contributed to the interpretation and writing of the manuscript.
Arul Earnest, Mark I Chen, Donald Ng and Leo Yee Sin contributed equally to this work.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
About this article
Cite this article
Earnest, A., Chen, M.I., Ng, D. et al. Using autoregressive integrated moving average (ARIMA) models to predict and monitor the number of beds occupied during a SARS outbreak in a tertiary hospital in Singapore. BMC Health Serv Res 5, 36 (2005). https://doi.org/10.1186/1472-6963-5-36
- Mean Absolute Percentage Error
- ARIMA Model
- Daily Number
- Occupancy Data
- Autoregressive Integrate Moving Average