- Research article
- Open Access
The effect of seasonal and extreme floods on hospitalizations for Legionnaires’ disease in the United States, 2000–2011
BMC Infectious Diseases volume 22, Article number: 550 (2022)
An increasing severity of extreme storms and more intense seasonal flooding are projected consequences of climate change in the United States. In addition to the immediate destruction caused by storm surges and catastrophic flooding, these events may also increase the risk of infectious disease transmission. We aimed to determine the association between extreme and seasonal floods and hospitalizations for Legionnaires’ disease in 25 US states during 2000–2011.
We used a nonparametric bootstrap approach to examine the association between Legionnaires’ disease hospitalizations and extreme floods, defined by multiple hydrometeorological variables. We also assessed the effect of extreme flooding associated with named cyclonic storms on hospitalizations in a generalized linear mixed model (GLMM) framework. To quantify the effect of seasonal floods, we used multi-model inference to identify the most highly weighted flood-indicator variables and evaluated their effects on hospitalizations in a GLMM.
We found a 32% increase in monthly hospitalizations at sites that experienced cyclonic storms, compared to sites in months without storms. Hospitalizations in months with extreme precipitation were in the 89th percentile of the bootstrapped distribution of monthly hospitalizations. Soil moisture and precipitation were the most highly weighted variables identified by multi-model inference and were included in the final model. A 1-standard deviation (SD) increase in average monthly soil moisture was associated with a 49% increase in hospitalizations; in the same model, a 1-SD increase in precipitation was associated with a 26% increase in hospitalizations.
This analysis is the first to examine the effects of flooding on hospitalizations for Legionnaires’ disease in the United States using a range of flood-indicator variables and flood definitions. We found evidence that extreme and seasonal flooding is associated with increased hospitalizations; further research is required to mechanistically establish whether floodwaters contaminated with Legionella bacteria drive transmission.
Legionnaires’ disease is among the most severe and costly waterborne illnesses in the United States, where it is responsible for an estimated 15% of all deaths related to waterborne infectious disease  and between 3 and 9% of all cases of community-acquired pneumonia [2, 3]. Legionnaires’ disease was so named in 1977, when a cooling tower contaminated with the bacteria was found to be the cause of a pneumonia outbreak among guests at a hotel , and its incidence has substantially increased since 2000 [5, 6]. Outbreaks of Legionnaires’ disease have decreased over the last 40 years, however, with only 4% of reported cases since 2000 linked to a cluster . Over 80% of cases are sporadic , and the source of infection is never identified for the majority of these cases. Legionnaires’ disease cases typically peak in late summer or early fall, and this consistent seasonality suggests that environmental factors affect transmission [9, 10] and may help explain the origin of these sporadic infections.
Environmental conditions affect the proliferation of Legionella bacteria in lakes, streams, and estuaries [11, 12], and the contamination events that may lead to disease transmission [13, 14]. The bacteria are abundant in aqueous environments  and survive by parasitizing amoebae, including many that persist in environmental biofilms [16, 17]. The bacteria optimally grow in wet, warm conditions (between 25 and 42 °C) and flourish in sessile biofilm communities [18, 19] in the natural and built environment [20, 21]. Environmental events that mobilize biofilms may be an important driver of infection by increasing the bacterial load in plumbed water [22,23,24], water used for industrial processes [25, 26], and surface water where direct exposure can occur . Susceptible individuals can become infected by inhaling aerosolized bacteria from these contaminated water sources.
Previous studies have found positive associations between cases of Legionnaires’ disease and rainfall [28,29,30,31], relative humidity [10, 32], and streamflow , and inconsistent associations with proximity to rivers or river height [11, 33]. Temperature has been positively associated with cases in several studies, though its effect is often attenuated when adjusting for other seasonal factors [10, 30]. While many of these hydrometeorological variables are associated with flood events, the relationship between Legionnaires’ disease and flooding has not been formally evaluated. Flooding is known to mobilize bacteria-rich biofilms in water bodies [34, 35], which may lead to increased bacterial colonization of the built environment. Churning flood waters may also lead to the direct aerosolization of bacteria and increased risk of exposure for individuals close to flood waters.
Flooding during extreme storms may be of particular concern because high winds and storm surges can damage or overwhelm the water treatment infrastructure necessary to address contamination events [36, 37]. The effect of extreme floods on waterborne infectious diseases has not been systematically examined in the US; rather, it has only been assessed after specific storm events (e.g. Superstorm Sandy [38, 39], Hurricane Katrina ). Increased incidence of intestinal illness has been reported after major storms, however, and post-storm microbiological analyses have found high concentrations of pathogenic bacteria in floodwater [37, 41].
Floods can be measured with a range of hydrometeorological variables and those that best describe extreme or seasonal events often vary by region to reflect local hydroclimatology, geography, and the built environment [42, 43]. These factors determine the conditions under which a flood occurs and help explain, for example, how a single heavy precipitation event can lead to a devastating flash flood in an urban area with a small watershed, whereas the same amount of precipitation has no effect in a rural area with a large drainage basin . Precipitation has traditionally been the primary variable used to determine flood magnitude; however, recent research has demonstrated that soil moisture, snowmelt, and precipitation excess might better characterize flooding in many regions . Most studies that have examined the association between floods and health outcomes have used a single hydrologic indicator [46, 47] or observed storms records . Given that floods cannot be defined by the same set of hydrometeorological variables across all locations, this approach does not allow for the identification of all major flood types in the US (i.e. river, coastal, and flash floods as well as flooding after cyclonic storms).
Understanding the association between Legionnaires’ disease infections and flood events is particularly important given that the severity of flooding is predicted to increase in conjunction with rising temperatures [49, 50]. The severity and timing of river floods is projected to increase due to earlier snowmelt and more intense precipitation [51, 52]. The number of major, billion-dollar floods has increased by 5% each year in the US since 1980 ; this is a trend that is likely to continue under future global warming, as more severe cyclonic storms and coastal flood events are projected to occur in the coming decades .
In this study, we used nonparametric and generalized linear mixed models to determine the effect of extreme and seasonal floods on hospitalizations for legionnaires’ disease across the US. Previous research has examined the association between single hydrometeorological variables and cases, but a thorough examination of the effect of flooding on legionnaires’ disease has not been conducted. Earlier studies have also been limited to small geographic regions, primarily in the northeastern US, whereas this study includes hospitalizations from 25 states throughout the US. Using this national dataset, we have quantified the effects of extreme and seasonal floods, measured using multiple flood-indicator variables, on hospitalizations for Legionnaires’ disease across the US.
Legionnaires’ disease infections occur primarily among older or immunocompromised individuals, and an estimated 97% of identified cases are hospitalized [1, 18]. We used the National Inpatient Sample (NIS) from the Healthcare Cost and Utilization Project (HCUP) to identify Legionnaires’ disease hospitalizations between 2000 and 2011 throughout the US. The NIS is the largest publicly available all-payer inpatient database in the US; it captures 20% of hospitalizations per year and is designed to be representative of all hospitalizations nationwide. We identified infections by ICD-9 code (482.84) and found the monthly Legionnaires’ disease hospitalization count for each hospital. We restricted our analysis to hospitals that contributed at least 4 years of data to the NIS dataset, provided monthly counts of hospitalizations, and reported their geographic location.
Hospitals that reported no Legionnaires’ disease cases were excluded from the analysis because the absence of cases could indicate that Legionella were not present in environmental or household water sources in that region, or because a hospital was not testing for Legionnaires’ disease among patients with pneumonia. Many hospitals reported only one case during the study period; as a sensitivity analysis, we repeated the analyses using several case count thresholds to further restrict the included hospitals. We created subsets of our hospitalization data containing hospitals with at least 1, 5, 10, 15, and 20 Legionnaires’ disease cases during the study period; all of the analyses were repeated with these case count threshold datasets.
The NIS includes the location of the reporting hospital, but not the cases’ residential locations. To address the possibility of misclassification bias, given that the flood data are associated with the location of the hospital, we matched the hospitals to Hospital Service Areas (HSA) provided by the Dartmouth Atlas of Healthcare . The HSA is the catchment area for each hospital and includes the zip codes where most Medicare patients receive care from the given hospital. We repeated the analyses using flood data associated with catchment area, instead of the hospital location, as a sensitivity analysis to assess the consistency of our findings.
Flooding can be characterized by several hydrometeorological variables, and we used multiple flood-indicator variables to account for the range of flood-types found across the study sites (e.g. river floods, coastal floods, flash floods), and to distinguish between extreme and seasonal events. Precipitation, soil moisture, and surface runoff data were obtained from the NASA/ NOAA North American Land Data Assimilation System 2 (NLDAS-2) forcing dataset and were aggregated from an hourly temporal resolution to mean monthly values for each hospital location . We used the United States Geological Survey (USGS) National Water Information System to find the stream gages closest to each hospital, for those that had a stream gage in the same zip code, and obtained daily median and maximum stream discharge measurements, which were aggregated to monthly means .
Data on flooding associated with tropical cyclones were obtained from the NOAA Storm Event Database, which tracks the location, type, and severity of named storms in the Atlantic Storm Basin [58, 59]. For each named storm that occurred during the study period, we extracted county-level data on: 1) storm-related precipitation, 2) reported flooding, and 3) distance from the storm track. Exposure to each of these extreme flood-related indicators was assessed for each hospital and month in the study period.
Extreme floods associated with cyclonic storms
Two methods for identifying extreme floods were used to account for the range of flood types that occur in the US. In the first approach, we defined extreme floods as those associated with named cyclonic storms, and restricted the dataset to the hospitals that experienced these storms and to the months of the Atlantic Basin storm season (June – November).
We modeled the association between Legionnaires’ disease hospitalizations and extreme storm-related floods using a negative binomial generalized linear mixed model (GLMM) framework to account for the over-dispersed hospitalization data. The counties with HCUP-contributing hospitals were categorized as exposed or unexposed to storms for each month during the storm season between 2000 and 2011. A county was considered exposed if it was within 150 km of the storm track and unexposed if it was outside of that range. In addition to the binary exposure variable, we assessed storm-related precipitation and proximity to the storm track as continuous variables and as categorical variables grouped by quartile.
The model included a binary location variable to assess differences between rural and urban hospitals and hospital-specific monthly discharges as an offset to obtain the rate of Legionnaires’ disease hospitalizations. We also included hospital-specific random intercepts nested within state-specific random intercepts to account for underlying differences in hospitalization policies (e.g. testing, reporting, and admitting practices) as well as state-level responses to extreme events (Additional file 1: Model S1). The storm-related variables were modeled separately and jointly, and model fit was assessed using the Akaike Information Criterion (AIC). To assess the consistency of our findings, this analysis was repeated for each Legionnaires’ disease case threshold to determine whether a storm in the preceding month was associated with hospitalizations.
Extreme floods associated with anomalous hydrometeorology
In the second analysis, we classified months with anomalously high precipitation, soil moisture, surface runoff, or streamflow discharge as those with extreme flooding. For each hospital, we found the months with mean hydrometeorological variables above the 95th percentile and averaged the number of Legionnaires’ disease hospitalizations in this “extreme group”. We compared the hospitalizations in the extreme group to a bootstrapped distribution of monthly Legionnaires’ disease hospitalizations.
The bootstrap generated a sampling distribution by randomly selecting 5% of months in the time series, with replacement, averaging the number of hospitalizations in those months, and then repeating the process 10,000 times. To control for seasonality, the sample was selected from the same range of months as those included in the extreme group for each hospital (i.e. if the extreme group for a given hospital did not include hospitalizations for November, then other November months in that hospital’s time series were not selected during the bootstrapping process). The probability of the Legionnaires’ disease hospitalizations in the extreme group was determined by comparison to the empirical cumulative distribution generated by the bootstrap. The bootstrap process was repeated for the all of the hydrometeorological flood indicators and for each case threshold.
In a third analysis, we used a multimodel inference approach to determine the effect of seasonal flood indicators on Legionnaires’ disease hospitalizations for the whole time series, not restricted to months with extreme floods or during the Atlantic Basin hurricane season. Multimodel inference was conducted on candidate models that varied only in the explanatory hydrometeorological variables, but that otherwise had the same structure. All combinations of standardized precipitation, soil moisture, surface runoff, and observed flood count were included in the candidate models; temperature was also included, given that the growth of Legionella has been associated with temperature seasonality. The models also included terms to control for seasonal and secular trends and a random intercept for each hospital (Additional file 1: Model S2). The streamflow variables were excluded from this analysis due to missing data for hospitals that were not near USGS stream gages.
We used the log likelihood and number of parameters to calculate the Akaike weight for each model. The models were ranked by weight, and the top models, the smallest number of models whose weights added to 0.90, were selected as the best-fitting models. Among the top models, variable weight importance for the hydrometeorological and temperature variables was determined. Cross-validation was performed by removing 20% of the data and conducting multimodel inference on the remainder; this process was iterated 1,000 times to evaluate the consistency of the weights and effect estimates, and to compare them to the top full models. These analyses were repeated for each Legionnaires’ disease case threshold.
There were 1,376 Legionnaires’ disease hospitalizations between 2000 and 2011 at the 75 hospitals that met our inclusion criteria for the primary analysis (Fig. 1a). Most of these hospitals were large facilities (65.4%) and located in urban areas in the Northeast (66.2%) or Midwest (16.8%). The number, size, and geographic breakdown of the hospitals was relatively consistent across years in the study period, with the exception of 2008 when there were no rural hospitals in the dataset (Table 1). The rural/urban location and hospital bed-size variables were not included in the 2011 HCUP dataset, but the mean annual discharge and geographic region breakdown for this year are consistent with previous years.
Seasonality and secular trends in hospitalizations varied by geographic region (Fig. 1). In the Northeast and Midwest, hospitalizations increased between July and October, peaking in August, and also increased over time (Fig. 1b). In the Southwest, hospitalizations exhibited an attenuated seasonality, with increased hospitalizations between March and October, and fluctuated over time. There was no clear seasonal or secular trend in hospitalizations among the hospitals in western states.
The hospital characteristics varied considerably across the datasets with different case count thresholds used in the secondary analysis. The subset of hospitals with at least one Legionnaires’ disease case included 378 hospitals, with many located on the West Coast (17.6%) and in rural areas (23.1%) (Additional file 1: Table S1). At higher case count thresholds, the included hospitals on average had larger bed capacity and were concentrated in urban areas in the Northeast; among the 15-case and 20-case threshold hospitals, none were from rural areas or located on the West Coast (Additional file 1: Table S1). Seasonal and secular trends were consistent across the different case count thresholds.
Fifteen named hurricanes or tropical cyclones affected counties with hospitals included in the dataset (Additional file 1: Table S2). Among the hospitals that experienced these storms, there was a significant increase in Legionnaires’ disease hospitalizations during months with a storm compared to months during the Atlantic storm season when a storm did not occur (Fig. 2). There was a 32% increase in monthly Legionnaires’ disease hospitalizations among hospitals that experienced a cyclonic storm compared to those that did not. This association was consistent across the case count thresholds, though it was insignificant in the 1-case and 5-case subsets and stronger in the 15-case and 20-case subsets, where there was a 46% and 54% increase in hospitalizations, respectively, in months with a cyclonic storm compared to those without storms (Fig. 2).
The intensity of storm-related precipitation and proximity to the storm track were not significantly associated with Legionnaires’ disease hospitalizations (Additional file 1: Fig. S1). Hospitals that experienced the most intense storm-related precipitation (quartile 4 of maximum rainfall) had an increase in hospitalizations compared to hospitals that did not experience storm-related precipitation, but the difference was insignificant (Additional file 1: Fig. S1a). Among the hospitals in the 15-case and 20-case subsets, however, this association was significant; hospitalizations increased by 81% and 90%, respectively, with moderate storm-related precipitation (quartile 3 of maximum rainfall) (Additional file 1: Fig. S1a). Proximity to the storm track was not associated with hospitalizations for any of the case count thresholds (Additional file 1: Figure S1b). The sensitivity analysis with storm data aggregated to each hospital’s catchment area yielded results consistent with the primary analysis. There was a 50% increase in monthly Legionnaires’ disease hospitalizations among hospitals in HSAs that experienced a cyclonic storm compared to those that did not (Additional file 1: Fig. S2a), and no significant association with precipitation intensity (Additional file 1: Fig. S2b) or proximity to the storm track (Additional file 1: Fig. S2c).
The average number of Legionnaires’ disease hospitalizations in months with extreme precipitation was in the 89th percentile of the bootstrapped distribution (Fig. 3), which was substantially higher than the average number of hospitalizations for all other causes in the same months. The strength of this association increased among the 15-case and 20-case threshold subsets to the 92nd and 94th percentiles, respectively (Table 2). Across all case-count thresholds, Legionnaires’ disease hospitalizations in months with extreme runoff, soil moisture, or temperature did not significantly vary from the bootstrapped averages (Table 2). These findings are supported by the sensitivity analysis using meteorological data aggregated to the hospitals’ catchment areas; the average number of hospitalizations in months with extreme precipitation was in the 84th percentile and increased to the 91st and 93rd percentiles among the higher case thresholds (Additional file 1: Table S3).
The hydrometeorological flood-indicator variables exhibited seasonal patterns that varied by geographic region (Additional file 1: Fig. S3). Precipitation typically peaked between June and September in the Northeast, Midwest, and Southwest, whereas along the West Coast it was driest during the summer and peaked in December or January (Additional file 1: Fig. S3a). Soil moisture seasonality was consistent across the US, with maxima occurring in January or February and minima mid-summer, but the range varied by region (Additional file 1: Fig. S3b). In the Northeast and Midwest, monthly soil moisture was relatively stable, whereas in the Southwest and on the West Coast there was a steep decline in soil moisture during the summer. Surface runoff exhibited the most distinct seasonality by region; on the West Coast it peaked during the winter, coinciding with the precipitation peaks, whereas in the Northeast surface runoff peaked in late spring, prior to the precipitation peak. Many areas in the Midwest experienced two peaks, one in the early spring and one in the later summer (Additional file 1: Fig. S3c).
Soil moisture and precipitation were the most highly weighted variables identified by the importance weighting and multimodel inference (Fig. 4). Both variables were positively associated with a significant increase in monthly Legionnaires’ disease hospitalizations and were included in all of the top models (Table 3). A 1-standard deviation increase in average soil moisture was associated with a 49% increase in hospitalizations in the most highly weighted model (Table 3). In the same model, a 1-standard deviation increase in average precipitation was associated with a 26% increase in hospitalizations. Temperature and the other hydrometeorological variables were not significantly associated with Legionnaires’ disease hospitalizations in any of the top models. The importance weights, top models, and effect estimates were consistent across all hospitalization thresholds (Additional file 1: Table S4) and in the cross-validation sensitivity analysis. Similarly, multimodel inference using flood-indicator data aggregated to the hospitals’ catchment areas identified the same top models and comparable effect estimates. In the most highly weighted model, a 1-standard deviation increase in soil moisture and precipitation at the catchment level was associated with a 53% and 26% increase in hospitalizations, respectively (Additional file 1: Table S5).
The incidence of sporadic Legionnaires’ disease has increased for over 20 years, but the association between cases and flooding as a potential driver of disease has not been thoroughly examined in the US. Previous studies have found positive associations among cases, rainfall, and relative humidity, but most have focused on specific cities or small geographic regions [10, 11, 60]. Many have also relied on weather data from a single source [28, 30] or a single source per state , which may obscure local variation in meteorological conditions. In this study, the association between flooding, measured by several hydrometeorological variables, and Legionnaires’ disease hospitalizations was analyzed across 75 hospitals in 25 states in the US, a geographic scope that encompasses a range of climatological regimes and demographics.
This work suggests that flooding, which can lead to the contamination of household and recreational water sources [37, 41], may be associated with hospitalizations for Legionnaires’ disease. While previous work has implied that rainfall influences the spread of disease via contamination [61, 62], none have focused on identifying or quantifying flood events as the driver of transmission. To address this gap, we used several methods to characterize extreme and seasonal floods; we found that hospitalizations increased during months with flooding due to extreme storms and were positively associated with monthly precipitation and soil moisture, which are common flood-indicator variables.
The seasonality and intensity of flooding varies considerably throughout the US, and these events cannot be measured with a single flood-indicator variable. We used two definitions of extreme events to account for the variety of flood types that occur, including those associated with hurricanes or tropical storms as well as those unrelated to cyclonic storms (e.g. due to intense precipitation and snowmelt). In the first approach, hospitalizations increased 32% in months with named storms during the Atlantic storm basin season among hospitals in the mid-Atlantic and Northeast. The second approach reinforced this finding and determined that hospitalizations throughout the US increased in months with anomalously high precipitation, not just those affected by cyclonic storms. The extreme event analysis supported a 2005 study that found legionellosis was positively associated with high atmospheric pressure more 10 days before occurrence and low atmospheric pressure within 5 days of occurrence, consistent with the transition that occurs when a storm front moves through an area .
In addition to extreme floods, many parts of the US experience seasonal flooding, including floods associated with snowmelt, frequent thunderstorms, and flash floods after droughts. We used multiple flood-indicator variables to characterize these seasonal floods and found that monthly soil moisture and precipitation are associated with increased Legionnaires’ disease hospitalizations. The association between rainfall and cases is well-supported, but this is the first to assess soil moisture, which functions as an integrator of rainfall and is an important flood indicator. Legionella bacteria thrive in extremely warm environments but in our analysis, extreme or seasonal temperature was not significantly associated with hospitalizations. This effect of temperature on Legionnaires’ disease is inconsistent with previous studies; temperature lagged from 1 to 9 weeks was predictive of cases in some studies [28, 31] and associated with a decrease in disease rates in others. Given laboratory studies demonstrating that Legionella bacteria preferentially grow at high temperatures, it is likely that environmental temperature influences transmission. Our findings suggest, however, that extreme or seasonal flood events are more strongly associated with increased hospitalizations whereas temperature alone is not.
While the effect of flooding on Legionnaires’ disease has not been examined in the US, our findings are supported by earlier research on the relationship between flood-indicator variables and outbreaks of other waterborne diseases [35, 46, 47]. An analysis of 42 years of outbreaks in the US found that 51% were preceded by extreme rainfall and that 60% were attributed to drinking water contamination . This study did not examine the mechanisms by which flooding affects Legionnaires’ disease hospitalizations, but previous research has identified Legionella in environments that are vulnerable to flooding. Legionella have been detected in surface runoff [27, 61, 62], which can directly contaminate drinking water sources or overwhelm water treatment systems during floods. The bacteria have also been found in wastewater and sewage treatment plants [63, 64], which are prone to overflows and contamination events associated with floods . Our findings indicate an association between flooding and Legionnaires’ disease, and future research should focus on examining the mechanisms by which flooding could lead to contamination and drive transmission.
Sources of contamination typical in Legionnaires’ disease outbreaks, namely cooling towers, plumbing systems, and recreational or decorative pools , are often not the source for sporadic cases . Transmission of sporadic, community-acquired cases may instead be driven by household water and environmental exposures. Previous studies have attributed up to 40% of sporadic cases to potable water , and an elevated risk of infection has been associated with water from private wells  and from surface water (compared to potable water from groundwater sources) . Water quality data from a range of sources could help determine the primary modes of exposure to Legionella for sporadic cases not associated with point source contamination. Detailed exposure analyses would also lead to an improved understanding of how infection occurs; while Legionnaires’ disease transmission is thought to occur primarily via the inhalation of aerosolized bacteria, some studies suggest that infection also occurs via aspiration [69, 70]. Contaminated drinking water may be a crucial source of exposure if infection occurs via aspiration, as aerosolization by a household item (e.g. a showerhead, faucet, or hose) would not be required for transmission.
Our findings are constrained by a number of limitations related to the availability and resolution of the hospitalization data. The analysis does not include any data from the Southeastern US because these states either do not contribute to the HCUP dataset or do not provide monthly data; this is a major limitation, as states in this region are most prone to cyclonic storms. However, regions with the highest incidence of Legionnaires’ disease were included in the analysis, and states that did not contribute to the HCUP dataset generally had lower incidence compared to the national average . A recent analysis of Legionnaires’ disease epidemiological trends in the United States between 1992 and 2018 found that age-standardized average incidence was higher in the Northeast and Midwest compared to the South and West, and highest in New England and the Mid-Atlantic states ; these geographic differences in incidence were more pronounced later in the time series (after 2002), which overlaps with most of the study period in this analysis. Future studies should examine the associations among hydrometeorological conditions and Legionnaires’ disease throughout the US, particularly in the Southeast, but the regions included in the study capture the states with the highest Legionnaires’ disease burden.
Despite rising incidence, hospitalizations for Legionnaires’ disease are relatively uncommon and as such our study relies on a small number of cases. During the study period, the total number of annual cases, not just hospitalizations, in the US reported to the Centers for Disease Control and Prevention (CDC) ranged from 969 to 3,676 . To address this limitation, we repeated the analysis using several case-count thresholds in order to examine the consistency of our findings when different hospitals were included in the dataset. The stability of the associations, even when hospitals with a single case were included in the dataset, indicate that the findings are robust.
The National Inpatient Sample only provides monthly hospitalization data, which prohibits a more temporally resolved analysis, and the geographic location of the hospital, not the residential locations of the cases. The absence of more temporally or geographically resolved data introduces the possibility of misclassification bias, given that the flood data associated with the hospital’s zip code may not accurately reflect the conditions at the cases’ residential zip codes. We aimed to address these limitations by including a large number of hospitals in the study from rural, urban, and suburban areas and evaluating the consistency of our findings across different study sites. Our findings are also consistent with small-scale studies that used daily case data  or had residential location data .
Both seasonal and extreme flooding is projected to increase in conjunction with warming atmospheric temperatures, and our ability to mitigate the effect of these floods is contingent upon a thorough understanding of flood-disease dynamics and how they geographically vary. Our findings suggest that the increase of Legionnaires’ disease across the US may be explained by flooding and that mitigating the effects of these events in the future is key to reducing the spread of disease. These results also suggest that current flood or contamination control measures are insufficient with respect to Legionella and may indicate that more rigorous water and wastewater treatment policies are required. The findings may also be of use to clinicians treating patients with respiratory symptoms in the wake of extreme events or during seasonal flood periods. While awareness of and testing for legionnaires’ disease has increased, it remains substantially underdiagnosed and underreported among younger and immune-competent individuals. Future analysis should incorporate detailed water quality data from natural and built environments to better understand the routes of exposure, and how hydrological events affect transmission.
Availability of data and materials
The datasets supporting the conclusions of this article are available from HCUP (hospitalization data), NLDAS (meteorological data), NOAA (storm data), and USGS (stream flow data). HCUP data can be requested and purchased through the HCUP Central Distributor. The NLDAS, NOAA, and USGS datasets are publicly available and the sources are referenced in the paper.
Akaike information criterion
Generalized linear mixed model
Healthcare cost and utilization project
Hospital service area
National inpatient sample
NASA/NOAA North American land data assimilation system-2
National oceanic and atmospheric administration
United States Geological Survey
Collier SA, Deng L, Adam EA, Benedict KM, Beshearse EM, Blackstock AJ, et al. Estimate of burden and direct healthcare cost of infectious waterborne disease in the United States. Emerg Infect Dis. 2021;27(1):140–9.
Marston BJ, Plouffe JF, File TM Jr, Hackman BA, Salstrom S-J, Lipman HB, et al. Incidence of community-acquired pneumonia requiring hospitalization: results of a population-based active surveillance study in Ohio. Arch Intern Med. 1997;157(15):1709–18.
Stout JE, Yu VL. Legionellosis. N Engl J Med. 1997;337(10):682–7.
Fraser DW, Tsai TR, Orenstein W, Parkin WE, Beecham HJ, Sharrar RG, et al. Legionnaires’ disease: description of an epidemic of pneumonia. N Engl J Med. 1977;297(22):1189–97.
CDC. Legionnaires’ Disease Surveillance Summary Report, United States 2016–2017. 2020. https://www.cdc.gov/legionella/health-depts/surv-reporting/2016-17-surv-report-508.pdf.
Neil K, Berkelman R. Increasing incidence of legionellosis in the United States, 1990–2005: changing epidemiologic trends. Clin Infect Dis. 2008;47(5):591–9.
Cunha BA, Burillo A, Bouza E. Legionnaires’ disease. Lancet. 2016;387(10016):376–85.
Marston BJ, Lipman HB, Breiman RF. Surveillance for legionnaires’ disease: risk factors for morbidity and mortality. Arch Intern Med. 1994;154(21):2417–22.
Phin N, Parry-Ford F, Harrison T, Stagg HR, Zhang N, Kumar K, et al. Epidemiology and clinical management of Legionnaires’ disease. Lancet Infect Dis. 2014;14(10):1011–21.
Fisman DN, Lim S, Wellenius GA, Johnson C, Britz P, Gaskins M, et al. It’s not the heat, it’s the humidity: wet weather increases legionellosis risk in the greater Philadelphia metropolitan area. J Infect Dis. 2005;192(12):2066–73.
Ng V, Tang P, Jamieson F, Drews SJ, Brown S, Low DE, et al. Going with the flow: legionellosis risk in Toronto, Canada is strongly associated with local watershed hydrology. EcoHealth. 2008;5(4):482–90.
States SJ, Conley LF, Kuchta JM, Oleck BM, Lipovich MJ, Wolford RS, et al. Survival and multiplication of Legionella pneumophila in municipal drinking water systems. Appl Environ Microbiol. 1987;53(5):979–86.
Rose JB, Daeschner S, Easterling DR, Curriero FC, Lele S, Patz JA. Climate and waterborne disease outbreaks. Journal AWWA. 2000;92(9):77–87.
Curriero FC, Patz JA, Rose JB, Lele S. The association between extreme precipitation and waterborne disease outbreaks in the United States, 1948–1994. Am J Public Health. 2001;91(8):1194–9.
Fliermans CB, Cherry WB, Orrison LH, Smith SJ, Tison DL, Pope DH. Ecological distribution of Legionella pneumophila. Appl Environ Microbiol. 1981;41(1):9–16.
Declerck P. Biofilms: the environmental playground of Legionella pneumophila. Environ Microbiol. 2010;12(3):557–66.
Fields BS, Benson RF, Besser RE. Legionella and Legionnaires’ disease: 25 years of investigation. Clin Microbiol Rev. 2002;15(3):506–26.
Edelstein PH, Cianciotto NP. Legionella Species and Legionnaires’ Disease. In: Dworkin M, Falkow S, Rosenberg E, Schleifer K-H, Stackebrandt E, editors. The prokaryotes: a handbook on the biology of bacteria volume 6: proteobacteria: gamma subclass. New York: Springer; 2006. p. 988–1033.
Berendt RF. Survival of Legionella pneumophila in aerosols: effect of relative humidity. J Infect Dis. 1980;141(5):689.
Declerck P, Behets J, van Hoef V, Ollevier F. Detection of Legionella spp. and some of their amoeba hosts in floating biofilms from anthropogenic and natural aquatic environments. Water Res. 2007;41(14):3159–67.
Rogers J, Dowsett AB, Dennis PJ, Lee JV, Keevil CW. Influence of temperature and plumbing material selection on biofilm formation and growth of Legionella pneumophila in a model potable water system containing complex microbial flora. Appl Environ Microbiol. 1994;60(5):1585–92.
Stout JE, Yu VL, Yee YC, Vaccarello S, Diven W, Lee TC. Legionella pneumophila in residential water supplies: environmental surveillance with clinical assessment for Legionnaires’ disease. Epidemiol Infect. 1992;109(1):49–57.
Best M, Yu VL, Stout J, Goetz A, Muder RR, Taylor F. Legionellaceae in the hospital water-supply. Epidemiological link with disease and evaluation of a method for control of nosocomial legionnaires’ disease and Pittsburgh pneumonia. Lancet. 1983;2(8345):307–10.
Garrison LE, Kunz JM, Cooley LA, Moore MR, Lucas C, Schrag S, et al. Vital signs: deficiencies in environmental control identified in outbreaks of legionnaires’ disease—North America, 2000–2014. MMWR Morb Mortal Wkly Rep. 2016;65(22):576–84.
Walczak M, Kletkiewicz H, Burkowska A. Occurrence of Legionella pneumophila in lakes serving as a cooling system of a power plant. Environ Sci Process Impacts. 2013;15(12):2273–8.
Morton S, Bartlett CL, Bibby LF, Hutchinson DN, Dyer JV, Dennis PJ. Outbreak of legionnaires’ disease from a cooling water system in a power station. Br J Ind Med. 1986;43(9):630–5.
Sakamoto R, Ohno A, Nakahara T, Satomura K, Iwanaga S, Kouyama Y, et al. Legionella pneumophila in rainwater on roads. Emerg Infect Dis. 2009;15(8):1295–7.
Brandsema PS, Euser SM, Karagiannis I, Den Boer JW, Van Der Hoek W. Summer increase of Legionnaires’ disease 2010 in The Netherlands associated with weather conditions and implications for source finding. Epidemiol Infect. 2014;142(11):2360–71.
Hicks LA, Rose CE Jr, Fields BS, Drees ML, Engel JP, Jenkins PR, et al. Increased rainfall is associated with increased risk for legionellosis. Epidemiol Infect. 2007;135(5):811–7.
Karagiannis I, Brandsema P. Warm, wet weather associated with increased Legionnaires’ disease incidence in The Netherlands. Epidemiol Infect. 2009;137(2):181–7.
Halsby KD, Joseph CA, Lee JV, Wilkinson P. The relationship between meteorological variables and sporadic cases of Legionnaires’ disease in residents of England and Wales. Epidemiol Infect. 2014;142(11):2352–9.
Ricketts KD, Charlett A, Gelb D, Lane C, Lee JV, Joseph CA. Weather patterns and Legionnaires’ disease: a meteorological study. Epidemiol Infect. 2009;137(7):1003–12.
Cassell K, Gacek P, Warren JL, Raymond PA, Cartter M, Weinberger DM. Association between sporadic legionellosis and river systems in connecticut. J Infect Dis. 2018;217(2):179–87.
Lane K, Charles-Guzman K, Wheeler K, Abid Z, Graber N, Matte T. Health effects of coastal storms and flooding in urban areas: a review and vulnerability assessment. J Environ Public Health. 2013;2013: 913064.
Wade TJ, Sandhu SK, Levy D, Lee S, LeChevallier MW, Katz L, et al. Did a severe flood in the Midwest cause an increase in the incidence of gastrointestinal symptoms? Am J Epidemiol. 2004;159(4):398–405.
Gibson CJ, Stadterman KL, States S, Sykora J. Combined sewer overflows: a source of Cryptosporidium and Giardia? Water Sci Technol. 1998;38(12):67–72.
Amaral-Zettler LA, Rocca JD, Lamontagne MG, Dennett MR, Gast RJ. Changes in microbial community structure in the wake of Hurricanes Katrina and Rita. Environ Sci Technol. 2008;42(24):9072–8.
Greene SK, Wilson EL, Konty KJ, Fine AD. Assessment of reportable disease incidence after Hurricane Sandy, New York City, 2012. Disaster Med Public Health Prep. 2013;7(5):513–21.
Ridpath AD, Bregman B, Jones L, Reddy V, Waechter H, Balter S. Challenges to implementing communicable disease surveillance in New York City evacuation shelters after Hurricane Sandy, November 2012. Public Health Rep. 2015;130(1):48–53.
Presley SM, Rainwater TR, Austin GP, Platt SG, Zak JC, Cobb GP, et al. Assessment of pathogens and toxicants in New Orleans, LA following Hurricane Katrina. Environ Sci Technol. 2006;40(2):468–74.
Sinigalliano CD, Gidley ML, Shibata T, Whitman D, Dixon TH, Laws E, et al. Impacts of Hurricanes Katrina and Rita on the microbial landscape of the New Orleans area. Proc Natl Acad Sci U S A. 2007;104(21):9029–34.
Villarini G, Smith JA. Flood peak distributions for the eastern United States. Water Resour Res. 2010. https://doi.org/10.1029/2009WR008395.
Berghuijs WR, Woods RA, Hutton CJ, Sivapalan M. Dominant flood generating mechanisms across the United States. Geophys Res Lett. 2016;43(9):4382–90.
Villarini G. On the seasonality of flooding across the continental United States. Adv Water Resour. 2016;87:80–91.
Parinussa RM, Lakshmi V, Johnson FM, Sharma A. A new framework for monitoring flood inundation using readily available satellite data. Geophys Res Lett. 2016;43(6):2599–605.
Guzman Herrador BR, de Blasio BF, MacDonald E, Nichols G, Sudre B, Vold L, et al. Analytical studies assessing the association between extreme precipitation or temperature and drinking water-related waterborne infections: a review. Environ Health. 2015;14:29.
Soneja S, Jiang C, Romeo Upperman C, Murtugudde R, Blythe D, et al. Extreme precipitation events and increased risk of campylobacteriosis in Maryland, USA. Environ Res. 2016;149:216–21.
Cann KF, Thomas DR, Salmon RL, Wyn-Jones AP, Kay D. Extreme water-related weather events and waterborne disease. Epidemiol Infect. 2013;141(4):671–86.
Arnell NW, Gosling SN. The impacts of climate change on river flood risk at the global scale. Clim Change. 2014;134(3):387–401.
Heim RR, Peterson TC, Hirsch R, Kaiser DP, Brooks H, Diffenbaugh NS, et al. Monitoring and understanding changes in heat waves, cold waves, floods, and droughts in the United States: state of knowledge. Bull Am Meteor Soc. 2013;94(6):821–34.
Hirsch RM, Ryberg KR. Has the magnitude of floods across the USA changed with global CO2 levels? Hydrol Sci J. 2012;57(1):1–9.
Georgakakos A, Fleming P, Dettinger M, Peters-Lidard C, Richmond T, Reckhow K, et al. Ch. 3: Water Resources. Climate change impacts in the United States: The Third National Climate Assessment, J. M. Melillo, Terse (T.C.) Richmond, and G.W. Yohe, Eds. U.S. Global Change Research Program. 2014:69–112.
Smith AB, Katz RW. US billion-dollar weather and climate disasters: data sources, trends, accuracy and biases. Nat Hazards. 2013;67(2):387–410.
Melillo JM, Richmond, T.C., Yohe, G.W. 2014: Highlights of climate change impacts in the United States: the third national climate assessment. US Global Change Research Program. 2014; https://nca2014.globalchange.gov/downloads/low/NCA3_Highlights_LowRes.pdf.
TDI. Research Methods. The Dartmouth Atlast of health care the Dartmouth institute for health policy and clinical practice. 2020; https://www.dartmouthatlas.org/downloads/methods/research_methods.pdf. Accessed 15 Nov 2020.
Mocko DNGH. NLDAS Primary Forcing Data L4 Monthly 0.125 x 0.125 degree V002. 2012;USA, Goddard Earth Sciences Data and Information Services Center (GES DISC).
De Cicco LA, Lorenz D, Hirsch RM, Watkins W, Johnson M. dataRetrieval: R packages for discovering and retrieving water data available from U.S. federal hydrologic web services. 2022. https://doi.org/10.5066/P9X4L3GE, https://code.usgs.gov/water/dataRetrieval.
Anderson B YM, Ferreri J, Crosson W, Al-Hamdan M, Schumacher A and Eddelbuettel D. hurricaneexposure: explore and map county-level hurricane exposure in the United States. R package version 0.1.1. 2020. http://CRAN.R-project.org/package=hurri.
Anderson B, Yan M, Ferreri J, Crosson W, Al-Hamdan M, Schumacher A, Eddelbuettel D. hurricaneexposure: Explore and Map County-Level Hurricane Exposure in the United States. R package version 0.1.1. 2020. https://cran.r-project.org/package=hurricaneexposure.
Garcia-Vidal C, Labori M, Viasus D, Simonetti A, Garcia-Somoza D, Dorca J, et al. Rainfall is a risk factor for sporadic cases of Legionella pneumophila pneumonia. PLoS ONE. 2013;8(4): e61036.
Schalk JA, Docters van Leeuwen AE, Lodder WJ, de Man H, Euser S, den Boer JW, et al. Isolation of Legionella pneumophila from pluvial floods by amoebal coculture. Appl Environ Microbiol. 2012;78(12):4519–21.
Den Boer JW, Coutinho RA, Yzerman EP, van der Sande MA. Use of surface water in drinking water production associated with municipal Legionnaires’ disease incidence. J Epidemiol Community Health. 2008;62(4): e1.
Caicedo C, Beutel S, Scheper T, Rosenwinkel KH, Nogueira R. Occurrence of Legionella in wastewater treatment plants linked to wastewater characteristics. Environ Sci Pollut Res. 2016;23(16):16873–81.
Loenenbach AD, Beulens C, Euser SM, van Leuken JPG, Bom B, van der Hoek W, et al. Two community clusters of legionnaires’ disease directly linked to a biologic wastewater treatment plant, the Netherlands. Emerg Infect Dis. 2018;24(10):1914–8.
Hummel M, Hummel M, Berry M, Berry M, Stacey M, Stacey M. Sea level rise impacts on wastewater treatment systems along the U.S. Coasts. Earth’s Future. 2018. https://doi.org/10.1002/2017EF000805.
van Heijnsbergen E, Schalk JA, Euser SM, Brandsema PS, den Boer JW, de Roda Husman AM. Confirmed and potential sources of legionella reviewed. Environ Sci Technol. 2015;49(8):4797–815.
Stout JE, Yu VL, Muraca P, Joly J, Troup N, Tompkins LS. Potable water as a cause of sporadic cases of community-acquired legionnaires’ disease. N Engl J Med. 1992;326(3):151–5.
Straus WL, Plouffe JF, File TM Jr, Lipman HB, Hackman BH, Salstrom SJ, et al. Risk factors for domestic acquisition of legionnaires disease. Ohio legionnaires Disease Group. Arch Intern Med. 1996;156(15):1685–92.
Yu VL. Could aspiration be the major mode of transmission for Legionella? Am J Med. 1993;95(1):13–5.
Breiman RF, Butler JC. Legionnaires’ disease: clinical, epidemiological, and public health perspectives. Semin Respir Infect. 1998;13(2):84–9.
Barskey AE, Derado G, Edens C. Rising incidence of legionnaires’ disease and associated epidemiologic patterns, United States, 1992–2018. Emerg Infect Dis. 2022;28(3):527–38.
Division of Health Informatics and Surveillance. National notifiable diseases surveillance system, weekly tables of infectious disease data. Atlanta: Centers for Disease Control and Prevention; 2011.
This study had no funding for its design, data analysis or interpretation, or writing. VL was supported by a training grant from the National Institutes of Health (T32ES023770); both authors were supported by NIH grant R01AI163023.
Ethics approval and consent to participate
This study uses the National Inpatient Sample (NIS) hospitalization dataset compiled by the Healthcare Cost and Utilization Project (HCUP), an administrative dataset maintained by the Agency for Healthcare Research and Quality (AHRQ) within the Department of Health and Human Services (HHS). Since the NIS data are de-identified and in a publicly available dataset, Columbia University’s Human Research Protection Office does not consider this to be research with human subjects and thus not require an IRB review. No individual subjects were involved in the work, we do not have access to information about any individuals associated with hospital discharge records, and the data contain no direct patient identifiers. The raw HCUP datasets were accessed after successful completion of the HCUP Data Use Agreement training course and certification, administered by the HCUP Central Distributor. Permission from Columbia University was not required.
Consent for publication
JS and Columbia University disclose partial ownership of SK Analytics. JS discloses consulting for BNI.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Legionnaires_Flooding_Supplement. Table S1. Description of HCUP hospitals grouped by Legionnaires’ disease case count thresholds. Table S2. Cyclonic storms that affected counties with HCUP hospitals between 2000 and 2011. Table S3. Percentile of average monthly hospitalizations in months with extreme meteorological conditions averaged across Hospital Service Areas (HSAs) compared to bootstrapped distribution of average monthly hospitalizations. Table S4. Association between Legionnaires’ disease hospitalizations and meteorological variables in the most highly weighted model for each hospitalization threshold. Table S5. Association between Legionnaires’ disease hospitalizations and meteorological variables averaged across Hospital Service Areas (HSAs) in the most highly weighted models. Figure S1. a–b. a Precipitation associated with cyclonic storms and b proximity to the storm track were not associated with a consistent significant change in monthly Legionnaires’ disease hospitalizations among hospitals that experienced the storms, compared to hospitals that were unexposed to the storms. Moderately intense precipitation (quartile 3) was associated with a significant increase in hospitalizations among the hospitals with a minimum of 15 and 20 total cases, but this association was insignificant at different precipitation levels and case thresholds. The analysis was restricted hospitals in regions that experience cyclonic storms from the Atlantic storm basin and to the months of the Atlantic storm season (June–November). Figure S2. a–c The association between exposure to cyclonic storms at the Hospital Service Area (HSA) level of analysis and monthly Legionnaires’ disease hospitalizations did not substantially differ from the associations identified using the county-level storm data. a Among hospitals in the 10-, 15-, and 20-case thresholds, hospitals in HSAs exposed to storms had a significant increase in hospitalizations compared to those in HSAs unexposed to storms. b Cyclonic-storm related precipitation and c proximity to storm tracks at the HSA level were not associated with significant changes in monthly hospitalizations; these findings are consistent with the analyses using county-level storm data (Fig. 2, Additional file 1: Fig. S1). Figure S3. a–d Monthly hydrometeorological flood-indicator variables averaged across the 75 hospitals in the primary analysis between 2000 and 2011, grouped by state (lines) and geographic region (color). The seasonality of a precipitation and c runoff differs in the Northeast and Midwest compared to the Southwest and West Coast, with peaks typically occurring in opposite months of the year. The seasonal pattern of b soil moisture and d streamflow is more consistent across the US, but the magnitude of the seasonal variation differs by region. Model S1. Description of negative binomial generalized mixed model used to assess the effect of cyclonic storms on hospitalizations. Model S2. Description of negative binomial generalized mixed model used to assess the effect of seasonal hydrometeorology and temperature on hospitalizations.
About this article
Cite this article
Lynch, V.D., Shaman, J. The effect of seasonal and extreme floods on hospitalizations for Legionnaires’ disease in the United States, 2000–2011. BMC Infect Dis 22, 550 (2022). https://doi.org/10.1186/s12879-022-07489-x
- Legionnaires’ disease
- Extreme storms