Influenza Forecasting with Google Flu Trends

Fred Torcaso; Scott Levin

Influenza Forecasting with Google Flu Trends

Fred Torcaso

Scott Levin

2013, Plos One

visibility

…

description

7 pages

link

1 file

Background: We developed a practical influenza forecast model based on real-time, geographically focused, and easy to access data, designed to provide individual medical centers with advanced warning of the expected number of influenza cases, thus allowing for sufficient time to implement interventions. Secondly, we evaluated the effects of incorporating a real-time influenza surveillance system, Google Flu Trends, and meteorological and temporal information on forecast accuracy.

Influenza Forecasting with Google Flu Trends Andrea Freyer Dugas1*, Mehdi Jalalpour2, Yulia Gel2,3, Scott Levin1,2, Fred Torcaso2, Takeru Igusa2, Richard E. Rothman1 1 Department of Emergency Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America, 2 Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America, 3 University of Waterloo, Waterloo, Ontario, Canada Abstract Background: We developed a practical influenza forecast model based on real-time, geographically focused, and easy to access data, designed to provide individual medical centers with advanced warning of the expected number of influenza cases, thus allowing for sufficient time to implement interventions. Secondly, we evaluated the effects of incorporating a real-time influenza surveillance system, Google Flu Trends, and meteorological and temporal information on forecast accuracy. Methods: Forecast models designed to predict one week in advance were developed from weekly counts of confirmed influenza cases over seven seasons (2004–2011) divided into seven training and out-of-sample verification sets. Forecasting procedures using classical Box-Jenkins, generalized linear models (GLM), and generalized linear autoregressive moving average (GARMA) methods were employed to develop the final model and assess the relative contribution of external variables such as, Google Flu Trends, meteorological data, and temporal information. Results: A GARMA(3,0) forecast model with Negative Binomial distribution integrating Google Flu Trends information provided the most accurate influenza case predictions. The model, on the average, predicts weekly influenza cases during 7 out-of-sample outbreaks within 7 cases for 83% of estimates. Google Flu Trend data was the only source of external information to provide statistically significant forecast improvements over the base model in four of the seven out-ofsample verification sets. Overall, the p-value of adding this external information to the model is 0.0005. The other exogenous variables did not yield a statistically significant improvement in any of the verification sets. Conclusions: Integer-valued autoregression of influenza cases provides a strong base forecast model, which is enhanced by the addition of Google Flu Trends confirming the predictive capabilities of search query based syndromic surveillance. This accessible and flexible forecast model can be used by individual medical centers to provide advanced warning of future influenza cases. Citation: Dugas AF, Jalalpour M, Gel Y, Levin S, Torcaso F, et al. (2013) Influenza Forecasting with Google Flu Trends. PLoS ONE 8(2): e56176. doi:10.1371/ journal.pone.0056176 Editor: Cécile Viboud, National Institutes of Health, United States of America Received October 2, 2012; Accepted January 7, 2013; Published February 14, 2013 Copyright: ß 2013 Dugas et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This work was supported by the Department of Homeland Security (PACER: National Center for Study of Preparedness and Response [grant number: 2010-ST-061-PA0001]); the National Science Foundation Systems Engineering and Design Program [grant number: NSF CMMI 0927207]; and the Natural Sciences and Engineering Research Council of Canada. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: The authors have declared that no competing interests exist. * E-mail: [email protected] Numerous potential surveillance systems exist to identify influenza outbreaks. Traditional surveillance such as The Centers for Disease Control and Prevention’s (CDC) US Influenza Sentinel Provider Surveillance Network relies on the collection of numerous indicators including clinical symptoms, virology laboratory results, hospital admissions and mortality statistics resulting in a several week lag in data reporting [11]. New digital surveillance sources, such as Google Flu Trends (GFT), offer the potential to identify influenza surges in real-time, optimizing timely outbreak detection and response. GFT utilizes internet search queries to detect the presence of influenza like illness (ILI) on a national, regional, state and city level 7–10 days prior to the U.S. Influenza Sentinel Provider Surveillance Network and was recently validated to show a strong correlation with ED influenza cases at a local level [12,13,14]. However, the forecasting capabilities of GFT remain unknown. Given the real-time nature Introduction Influenza is a substantial cause of morbidity and mortality with up to five million cases of severe illness and 500,000 deaths worldwide each year [1]. In the United States, seasonal influenza results in increased emergency department (ED) visits and hospitalizations, straining an already stressed medical system [2,3,4,5,6]. Increased patient volume caused by seasonal influenza is a contributor to ED crowding, which has been linked to delays in critical treatments and increased mortality [7,8,9,10]. An influenza pandemic presents a well recognized and serious threat to the United States healthcare infrastructure [3,6]. Effective management of both seasonal and pandemic influenza requires early detection of the outbreak through timely and accurate surveillance linked with a rapid response to mitigate crowding. PLOS ONE | www.plosone.org 1 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends of GFT surveillance, and the demonstrated strong correlation of GFT with ED influenza cases, GFT has the potential to go beyond early detection and forecast future influenza outbreaks. Previous forecast models have lacked flexibility, due to restrictive or inappropriate assumptions, technically demanding computational requirements, or inclusion of data elements which are not universally available in real time, reducing practical utility. Initial influenza prediction models followed the classic compartmental Susceptible-Infected-Recovered (SIR) or Susceptible-Exposed-Infected-Recovered (SEIR) framework [15,16,17]. Model parameters representing flow between compartments require frequent parameter refitting in order to track and update. Others have relied on the nonparametric method of analogues, which is rooted in meteorology, and based on selecting historical patterns of influenza dynamics that most closely match current influenza observations for forecasting future influenza outbreaks [18,19,20]. As mentioned by Ackerman and Knox, the method of analogues approach is not suitable for practical implementation, due to a virtual impossibility of selecting a perfectly matching analog, as well as the sensitivity of the forecasts to minor mismatches in selected patterns [21]. Recent suggestions to forecast influenza outbreaks employ either the particle learning approach coupled with Bayes Factors, or a chain binomial model [22,23]. Though both may provide accurate predictions, they involve computationally intensive routines of parameter estimation limiting their practical applicability in the clinical setting. Several recent influenza forecasting studies have used a BoxJenkins methodology, in particular, an autoregressive integrated moving average (ARIMA) model [24,25,26]. These models assume Gaussian errors for residuals and can be applied to count data using a logarithmic transformation. In order to capture the most recent outcomes in influenza counts and to assess nonstationarity of influenza dynamics, we chose to employ the Generalized Autoregressive Moving Average (GARMA) model with discrete-valued distributions and integrated external variables [27]. External variables include any data used for prediction that is independent of the outcome predicted (i.e. weekly counts of influenza cases). Published forecast models integrate the main outcome variable, typically previous influenza cases or ILI visits, with several external variables related to influenza surveillance (type of influenza virus, number of emergency medical service [EMS] calls, sick leaves, and over the counter drug sales) or climate (temperature, humidity, rainfall, and atmospheric pressure) to produce forecasts [19,25,26]. Several of these surveillance data elements are often not available in real-time which severely limits their usefulness for real-time prediction. However, meteological factors, which have previously shown significant correlation with influenza cases, are often available in real time in targeted geographic areas, and can be easily accessed via the internet [28,29,30]. Optimizing management of influenza outbreaks relies on early detection tied to a timely and effective response. This requirement is amplified in the ED where influenza-related crowding can impact quality of patient care. Our primary objective was thus to develop and validate a forecast model which would be practically useful and have broad applicability for providing advanced warning of an influenza outbreak. Accordingly, we chose to include only easily accessible, real-time data, available at the city or medical center level. Our secondary objective was to evaluate the added predictive capability of novel search query-based surveillance, such as GFT, in comparison and integrated with more commonly considered meteorological information. PLOS ONE | www.plosone.org Methods Study Population and Setting This is a retrospective evaluation of data from an urban tertiary care ED with an annual volume of 60,000 adult and 24,000 pediatric visits. Data Collection and Methods of Measurement The primary outcome was number of influenza-related ED patient visits over seven influenza seasons from 2004 to 2011. Weekly influenza-related ED visits were calculated by summing the number of patients with a positive influenza test sent from the ED during each week of the study period. This study was approved by the institutional review board with a waiver of consent as this study used anonymous, aggregated data. Additional external sources of information including GFT, local temperature (degrees Fahrenheit), local relative humidity, and Julian weeks (week of the year from January 1, listed as week 1, to December 31, listed as week 52, and considering the leap years with 53 weeks) were examined for predictive capability. External data is publically available for download on a daily (i.e. real-time) basis. GFT data for the city of Baltimore was downloaded directly from https:// www.google.org/flutrends in April 2012 [12]. Daily temperature and relative humidity measures were downloaded directly from Weather Underground for the city of Baltimore and then averaged over each week to correspond with Google Flu Trend and influenza case data [30]. Statistical Analysis Forecast models were developed from training sets and evaluated against out-of-sample verification sets using a leaveone-out approach. We partitioned the data set into 7 years, where each year begins near September 1, and includes 52 or 53 weeks depending on the leap year status. This left us with 5 typical influenza seasons (2004–2008 and 2010–2011) and 2 atypical influenza seasons (2008–2009 and 2009–2010). We then trained our model for 6 seasons and validated the model on the remaining season, and continued this approach until each season has been used in the validation set exactly once. Models were compared using a summation of the global forecast deviance of each of the verification sets (further referred to as the global forecast deviance), as recommended for GARMA and GLM models [27,31], and forecast confidence. Global deviance is a statistical measure of accuracy, which is defined as twice the negative of the log-likelihood function magnitude of the fitted model on the verification data set. Therefore, global forecast deviance measures the lack of fit between the fitted model and the actual forecasted data, and thus a model with lower global forecast deviance is preferred. Forecast confidence is the percentage of forecast values that are within a predefined difference of the actual data during an influenza peak (here chosen as 20% of the mean of the maximal point of the influenza peak, or seven influenza cases). For this evaluation, an influenza peak is defined as three or more weeks with three or greater confirmed influenza cases. Although forecast confidence provides an easily interpretable evaluation of the model’s performance, all model selection was based upon the more statistically rigorous global forecast deviance. Time series models that showed autocorrelation of residuals were discarded from the analysis. The GARMA(p,q) model with Poisson or Negative Binomial distribution used for this analysis can be expressed as: 2 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends level of 0.05 in 4 of the verification sets (seasons 2005–2006, 2006– 2007, 2009–2010, and 2010–2011). No other variable was significant in any of the out-of-sample verification sets. Assuming the inclusion of GFT using forward selection, addition of any of the remaining external variables did not significantly reduce the global forecast deviance of the final model, or sometimes increased it due to over-fitting. Thus, the final model selected was GARMA(3,0) with Negative Binomial distribution and dispersion parameter of 2 with Google Flu Trend data as the external variable lagged back one week with respect to the responses. This model has a forecast confidence of 83% indicating that 83% of the forecasted values, during all influenza peaks, were within seven influenza cases of the actual data. As displayed in Figure 3, the forecast of this model indicates that the yielded GARMA out-ofsample forecasts closely follow the observed influenza counts during both an atypical (2008–2009) and typical (2010–2011) influenza season. 0 logðmt Þ~Xt{1 b ( !) p q n o X X yt{j 0 z qi logðyt{i Þ{Xt{1{i b z hj log mt{j i~1 j~1 ð1Þ In Equation (1), primes stand for transpose, b, W and h are model parameters, which were estimated based on data from the training set and a maximum likelihood approach, m is expected value of response (y), and X is the vector of external variables. A logarithmic link function is used here as proposed by Benjamin et al for the case of Poisson or Negative Binomial distribution. The external variables are lagged by one-week with respect to forecasted values to ensure that the model uses only available data to for a prediction one week in the future [27]. Based on global forecast deviance, we chose to employ a Negative Binomial GARMA (3,0) model as inclusion of moving average terms or using a Poisson distribution yielded a higher global deviance. The Negative Binomial distribution outperforms the Poisson distribution because it adjusts for over dispersion with the dispersion parameter (k). Using global deviance, we found that for our dataset, the best dispersion parameter for the Negative Binomial distribution is 2. Primarily, forecast models with the outcome of a point estimate of counts of weekly influenza-related ED patient visits were designed using a Negative Binomial Generalized Autoregressive Moving Average model (GARMA) [27]. Secondarily, external variables such as GFT, meteorological data (temperature, change in temperature, and relative humidity) and temporal variables (Julian weeks, and seasonality) were modeled individually with Negative Binomial generalized linear models (GLM), and were subsequently also added to the baseline model as external variables using a forward selection method. This univariate analysis was used to assess the predictive set of variables for the final forecast model. The developed MATLAB routine for modeling and forecasting influenza counts with GARMA models along with the user manual is publicly available [32]. Discussion Seasonal and pandemic influenza leads to ED crowding, which results in reduced quality of patient care. Early detection or forecasting of an impending influenza outbreak, coupled with an effective intervention designed to mitigate ED crowding, allows for improved management of the anticipated increase in patient volumes. Though several surveillance systems have been designed to provide advance warning, few provide reliable data in near-real time, and fewer still have demonstrated the capability to provide advanced forecasting of impending influenza cases, which would provide the additional critical time necessary for activating a robust response. Although hospital and ED planners often rely on information from their individual facilities, integration of broader surveillance information from the city, state, or national level can provide increased awareness and earlier detection of an impending threat [33,34,35,36]. Previously, hospital and ED planners have used surveillance information to understand disease prevalence for testing and treatment decisions, institute infection control precautions to contain outbreaks, and anticipate ED surge [33,34,35,36]. Particularly for anticipation or detection of ED surge, surveillance has lead to increased ED capacity and staffing, purchase of additional supplies, and reallocation of hospital resources such as staffing and beds [33,36]. In a qualitative analysis by Buehler, one hospital planner noted ‘‘[The syndromic surveillance report] really helped me in continuity of business planning in reference to what we can anticipate in the next 24 hours, in reference to staffing our ERs and what our capacities were going to be’’ [33]. Earlier warning of an impending outbreak though a focused influenza forecast model could increase planning capabilities beyond simply the next 24 hours, giving hospitals the crucial time needed to prepare for increased patient volumes whether through distribution or purchase of supplies, increased staffing, or opening additional annex areas to increase bed capacity. An easily accessible, flexible, forecast model, such as the one developed here, could easily be distributed and geographically focused to provide individual medical centers with their own influenza prediction model, allowing for advanced influenza planning. We sought to develop a practical forecast model, which could ultimately be used in a clinical setting to guide implementation of an intervention to mitigate crowding resulting from influenza. This end goal of clinical application limited the potential data sources which could be used in the forecasting model. Further, in order to Results The derived forecast models are based upon seven influenza seasons of weekly data including number of influenza-related ED patient visits, GFT, mean temperature, and mean relative humidity as shown in Figure 1. The eight included influenza peaks (the 2008–2009 season had 2 peaks) spiked at a mean value of 33.1 (95% confidence interval: 20.5–45.5) weekly influenza cases. Several GLM and GARMA forecasting models were examined systematically to determine the optimal (most accurate) set of input data characterized by the final model, as shown in Table 1. As expected, the GARMA model displayed the lowest global forecast deviance (i.e., highest accuracy). GFT alone significantly outperformed temperature, relative humidity, and Julian weeks when using GLM models. Forecasts from this final model are shown in Figure 2 during both an atypical (2008–2009) and a typical (2010–2011) influenza season, where the quality of this model can be observed. Using the selected GARMA(3,0) base model, several external variables were added using a forward selection method and were evaluated for improved forecasting capabilities which are presented in Table 2. Adding GFT significantly improved the model as demonstrated by the highly statistically significant p-value of 0.0005. On all 7 out-of-sample verification sets, GFT was the only exogenous variable to be statistically significant at a significance PLOS ONE | www.plosone.org 3 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends Figure 1. Input variables over study timeframe. doi:10.1371/journal.pone.0056176.g001 be effective, the final model must not only be accurate and timely, but also easy to implement and applicable to both an individual city and clinical center. Thus, we selected data with these parameters in mind. Figure 2. Base Autoregressive Forecast Model. Number of confirmed Emergency Department (ED) influenza cases (dots) compared to the base 3rd order Negative Binomial Generalized Autoregressive Poisson (GARMA) model (line) over (a) the 2008–2009 atypical influenza season and (b) the 2010–2011 typical influenza season. doi:10.1371/journal.pone.0056176.g002 PLOS ONE | www.plosone.org 4 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends Table 1. Forecast of Emergency Department Influenza Cases. Covariate Forecast Global Deviance Forecast Confidence Autoregression First order autoregression 1049 81% Second order autoregression 1039 79% Third order autoregression 1016 81% Google Flu Trends 1609 77% D Google Flu Trends 2758 56% Google Flu Trends Climate Temperature 2470 66% D Temperature 3070 60% Humidity 2181 69% Julian weeks 2890 63% Sin(2p/52) + Cos(2p/52) 3551 62% Time Capability of Generalized Autoregressive Negative Binomial (GARMA) and univariate generalized linear models (GLM) to forecast the number of confirmed Emergency Department (ED) influenza cases. D indicates the change of the indicated variable between the prior and current week. Forecast Global Deviance indicates the sum of each forecast global deviance for all 7 leave-one-out validation models. Forecast Confidence indicates the average of confidences from all 7 leave-one-out validation models. Forecast confidence is the percentage of forecast values, during an influenza peak, that are within seven influenza cases of the actual data. doi:10.1371/journal.pone.0056176.t001 GFT can be narrowed down to city level data, is available in real time, and has been previously validated to strongly correlate with the number of influenza and ILI cases at the medical center level [14]. This particular surveillance system was chosen over others due to the near real-time nature of the data, which is available 7–10 days before traditional systems [13]. Additionally, GFT is a free data source, which is easily downloaded and available for all to access, permitting convenient input for potential end-users into a forecast model. These same principles of city level detail, real-time availability, and ease of use also applied to the Figure 3. Final Autoregressive Forecast Model. Number of confirmed Emergency Department (ED) influenza cases (dots) compared to the final 3rd order Negative Binomial Generalized Autoregressive Poisson (GARMA) model with Google Flu Trends as an added external variable (line) over (a) the 2008–2009 atypical influenza season and (b) the 2010–2011 typical influenza season. doi:10.1371/journal.pone.0056176.g003 PLOS ONE | www.plosone.org 5 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends Table 2. Capability of adding an exogenous covariate to forecast. P-value Covariate Forecast Global Deviance Forecast Confidence none (Baseline model) 1016 81% Google Flu Trends 1004 83% 0.0005 D Google Flu Trends 1017 80% .0.05 Temperature 1039 80% .0.05 D Temperature 1017 83% .0.05 Humidity 1030 82% .0.05 Julian weeks 1014 82% .0.05 Seasonality2Sin(2p/52)+Cos(2p/52) 1040 83% .0.05 Google Flu Trends Climate Time The number of confirmed Emergency Department (ED) influenza cases compared to the base 3rd order Negative Binomial Generalized Autoregressive Poisson (GARMA) model. D indicates the change of the indicated variable between the prior and current week. Forecast Global Deviance indicates the sum of each forecast global deviance for all 7 leave-one-out validation models. Forecast Confidence indicates the average of confidences from all 7 leave-one-out validation models. Forecast confidence is the percentage of forecast values, during an influenza peak, that are within seven influenza cases of the actual data. doi:10.1371/journal.pone.0056176.t002 other parameters we evaluated including meteorological variables and basic time variables. Notably, our model differs from many previous forecast models used for influenza surveillance which have incorporated parameters that are either difficult to access or not universally available in real-time, such as over the counter drug sales and school absenteeism [19,25]. Inclusion of such variables in those models has limited their applicability with regard to data availability or feasibility of data collection and synthesis. The one element considered in this model which is not readily available from an online source is the number of influenza cases at the medical center where the model may be used; however, this data is almost always tracked and easily available to the medical institution itself. As shown by the numerous models examined here, autoregression of actual influenza cases is critical to a strong influenza forecast model. Without the autoregressive component, models solely relying on surveillance, meteorological, or temporal components had much greater error in the resulting forecasts. Although GFT does statistically improve the baseline autoregressive forecast model, the practical significance of this improvement is marginal, as addition of GFT to the model improves the forecast confidence from 81% to 83%. However, given the ease of use and integration of GFT into the model, there is little downside to its inclusion in the final model. The significance of including confirmed influenza cases in the model underscores the importance of medical centers maintaining influenza testing programs, as well as efficient laboratory information systems which can process and result this data in a timely, accessible fashion. Findings here build on prior work by our group where we reported a strong correlation between GFT and local influenza cases, and suggested an early rise in GFT 1–2 weeks prior to actual increases in confirmed influenza cases [14]. Our derived forecast model provides the first-ever demonstration of GFT’s forecasting capabilities by showing significant improvements in the forecasting estimates of a baseline GARMA model with the addition of GFT data. This validation of the predictive capabilities of GFT was performed using the current GFT algorithm. Previous GFT algorithms (prior to 2009) had lower correlation with influenza PLOS ONE | www.plosone.org cases during the 2009 pandemic, likely due to a change in internet search patterns [37,38]. Though the current algorithm is shown to have predictive capacities, there remains a future possibility of additional alterations to search patterns, and poor correlation between GFT and influenza cases requiring additional updates. This study is limited by the lack of demonstration of regional generalizability, as the models were developed using influenza data from one medical center. Further, though the models used citylevel GFT and meteorological data, it is unclear whether these data elements are uniform throughout the city or vary by smaller geographic regions. The ultimate forecast model should be flexible based on geographic area and local data; thus additional study is necessary to fully evaluate the geographic generalizability of our model. Temporal generalizability is also of concern; however, these models were validated using both atypical and typical influenza seasons, thus accounting for various potential influenza patterns. Overall, we have developed a practical, city-based forecast model based upon generalized autoregression of laboratory confirmed influenza cases and GFT. The addition of real-time, easily accessible GFT improves the forecasting capabilities and demonstrates the predictive potential of GFT. This practical forecast model could provide advanced warning for medical centers, thus allowing time to implement an appropriate response to control infection or mitigate influenza-related increases in patient volumes. Ultimately designed for clinical application, integrating this potentially powerful forecasting tool into medical center use requires additional clinical feasibility and effectiveness studies to link this forecast with a clinical intervention designed to mitigate crowding. Author Contributions Conceived and designed the experiments: AFD MJ YG SL FT TI RR. Performed the experiments: AFD MJ YG SL FT TI RR. Analyzed the data: MJ YG SL. Contributed reagents/materials/analysis tools: MJ YG SL TI. Wrote the paper: AFD MJ YG SL FT TI RR. 6 February 2013 | Volume 8 | Issue 2 | e56176 Influenza Forecasting with Google Flu Trends References 1. World Health Organization (2009) Influenza (Seasonal); Available: https://www. who.int/mediacentre/factsheets/fs211/en/.Accessed April 5th, 2012. 2. McDonnell WM, Nelson DS, Schunk JE (2012) Should we fear ‘‘flu fear’’ itself? Effects of H1N1 influenza fear on ED use. Am J Emerg Med 30: 275–282. 3. Institute of Medicine (2006) The future of emergency care in the United States health system. Ann Emerg Med 48: 115–120. 4. Glaser CA, Gilliam S, Thompson WW, et al. (2002) Medical care capacity for influenza outbreaks, Los Angeles. Emerg Infect Dis 8: 569–574. 5. Schull MJ, Mamdani MM, Fang J (2004) Community influenza outbreaks and emergency department ambulance diversion. Ann Emerg Med 44: 61–67. 6. Osterholm MT (2005) Preparing for the next pandemic. N Engl J Med 352: 1839–1842. 7. Bernstein SL, Aronsky D, Duseja R, et al. (2009) The effect of emergency department crowding on clinically oriented outcomes. Acad Emerg Med 16: 1– 10. 8. Pines JM, Hollander JE, Localio AR, Metlay JP (2006) The association between emergency department crowding and hospital performance on antibiotic timing for pneumonia and percutaneous intervention for myocardial infarction. Acad Emerg Med 13: 873–878. 9. Pines JM, Pollack CV, Dierks DB, Chang AM, Shofer FS, et al. (2009) The association between emergency department crowding and adverse cardiovascular outcomes in patients with chest pain. Acad Emerg Med 16: 617–625. 10. Pines JM, Shofer FS, Isserman JA, Abbuhl SB, Mills AM (2010) The effect of emergency department crowding on analgesia in patients with back pain in two hospitals. Acad Emerg Med 17: 276–283. 11. Centers for Disease Control and Prevention (Overview of Influenza Surveillance in the United States). Available: https://www.cdc.gov/flu/weekly/overview.htm. Accessed April 5th, 2012. 12. Google (2012) Google Flu Trends. Available: https://www.google.org/flutrends. Accessed April 5th, 2012. 13. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, et al. (2009) Detecting influenza epidemics using search engine query data. Nature 2009: 7232. 14. Dugas AF, Hsieh YH, Levin SR, Pines JM, Mareiniss DP, et al. (2012) Google Flu Trends: correlation with emergency department influenza rates and crowding metrics. Clin Infect Dis 54: 463–469. 15. Longini IM, Fine PE, Thacker SB (1986) Predicting the global spread of new infectious agents. Am J Epidemiol 123: 383–391. 16. Hall IM, Gani R, Hughes HE, Leach S (2007) Real-time epidemic forecasting for pandemic influenza. Epidemiol Infect 135: 372–385. 17. Birrell PJ, Ketsetzis G, Gay NJ, Cooper BS, Presanis AM, et al. (2011) Bayesian modeling to unmask and predict influenza A/H1N1pdm dynamics in London. Proc Natl Acad Sci U S A 108: 18238–18243. 18. Viboud C, Boëlle PY, Carrat F, Valleron AJ, Flahault A (2003) Prediction of the spread of influenza epidemics by the method of analogues. Am J Epidemiol 158: 996–1006. 19. Kim MJ, Nembhard HB, Lambert B, Turbelin C, Flahault A, et al. (2011) A syndromic surveillance system for clinical and non-clinical health data. IIE Transactions on Healthcare Systems Engineering 1: 37–48. PLOS ONE | www.plosone.org 20. Boyle JR, Sparks RS, Keijzers GB, Crilly JL, Lind JF, et al. (2011) Prediction and surveillance of influenza epidemics. Med J Aust 194: S28–33. 21. Ackerman SA, Knox JA (2002) Meteorology: Understanding the Atmosphere: Brooks/Cole Publishing Company. 22. Dukic VM, Lopes HF, Polson N (2012) Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model. J Am Stat Assoc 107: 1410–1426. 23. Nishiura H (2011) Real-time forecasting of an epidemic using a discrete time stochastic model: a case study of pandemic influenza (H1N1–2009). Biomed Eng Online 10: 15. 24. Stroup DF, Thacker SB, Herndon JL (1988) Application of multiple time series analysis to the estimation of pneumonia and influenza mortality by age 1962– 1983. Stat Med 7: 1045–1059. 25. Quénel P, Dab W (1998) Influenza A and B epidemic criteria based on timeseries analysis of health services surveillance data. Eur J Epidemiol 14: 275–285. 26. Soebiyanto RP, Adimi F, Kiang RK (2010) Modeling and predicting seasonal influenza transmission in warm regions using climatological parameters. PLoS One 5: e9450. 27. Benjamin M, Rigby R, Stasinopoulos D (2003) Generalized Autoregressive Moving Average Models. J Am Stat Assoc 98: 214–223. 28. Lowen AC, Mubareka S, Steel J, Palese P (2007) Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog 3: 1470–1476. 29. Sloan C, Moore ML, Hartert T (2011) Impact of pollution, climate, and sociodemographic factors on spatiotemporal dynamics of seasonal respiratory viruses. Clin Transl Sci 4: 48–54. 30. Weather Underground (2012) Weather Underground. Available: https://www. wunderground.com/history/. Accessed April 5th, 2012. 31. Kedem B, Fokianos K (2005) Regression Models for Time Series Analysis: Wiley. 32. Jalalpour M (2012) GARMA ToolKit. Available: https://jshare.johnshopkins. edu/mjalalp1/public_html/MehdiJalalpour.htm. Accessed October 30, 2012. 33. Buehler JW, Whitney EA, Smith D, Prietula MJ, Stanton SH, et al. (2009) Situational uses of syndromic surveillance. Biosecur Bioterror 7: 165–177. 34. Reeder B, Revere D, Olson DR, Lober WB (2011) Perceived usefulness of a distributed community-based syndromic surveillance system: a pilot qualitative evaluation study. BMC Res Notes 14: 187. 35. Rebmann T, 2008 APIC Emergency Preparedness Committee (2009) APIC State-of-the-Art Report: the role of the infection preventionist in emergency management. Am J Infect Control 37: 271–281. 36. Farias DR, Raffo L, Bacigalupo S, Cremaschi M, Vence L, et al. (2010) Data for decision making: strategic information tools for hospital management during a pandemic. Disaster Med Public Health Prep 4: 207–212. 37. Cook S, Conrad C, Fowlkes AL, Mohebbi MH (2011) Assessing Google flu trends performance in the United States during the 2009 influenza virus A (H1N1) pandemic. PLoS One 6: e23610. 38. Ortiz JR, Zhou H, Shay DK, Neuzil KM, Fowlkes AL, et al. (2011) Monitoring influenza activity in the United States: a comparison of traditional surveillance systems with Google Flu Trends. PLoS One 6: e18687. 7 February 2013 | Volume 8 | Issue 2 | e56176

Log In

Influenza Forecasting with Google Flu Trends

Related papers

Related papers

Related topics