Skip to main content

Early warning signal for dengue outbreaks and identification of high risk areas for dengue fever in Colombia using climate and non-climate datasets



Dengue has been prevalent in Colombia with high risk of outbreaks in various locations. While the prediction of dengue epidemics will bring significant benefits to the society, accurate forecasts have been a challenge. Given competing health demands in Colombia, it is critical to consider the effective use of the limited healthcare resources by identifying high risk areas for dengue fever.


The Climate Risk Factor (CRF) index was constructed based upon temperature, precipitation, and humidity. Considering the conditions necessary for vector survival and transmission behavior, elevation and population density were taken into account. An Early Warning Signal (EWS) model was developed by estimating the elasticity of the climate risk factor function to detect dengue epidemics. The climate risk factor index was further estimated at the smaller geographical unit (5 km by 5 km resolution) to identify populations at high risk.


From January 2007 to December 2015, the Early Warning Signal model successfully detected 75% of the total number of outbreaks 1 ~ 5 months ahead of time, 12.5% in the same month, and missed 12.5% of all outbreaks. The climate risk factors showed that populations at high risk are concentrated in the Western part of Colombia where more suitable climate conditions for vector mosquitoes and the high population level were observed compared to the East.


This study concludes that it is possible to detect dengue outbreaks ahead of time and identify populations at high risk for various disease prevention activities based upon observed climate and non-climate information. The study outcomes can be used to minimize potential societal losses by prioritizing limited healthcare services and resources, as well as by conducting vector control activities prior to experiencing epidemics.

Peer Review reports


Dengue is complicated. There are four serotypes of the dengue virus, and dengue infection occurs in almost all age groups [1, 2]. Dengue is endemic in many parts of the tropics and subtropics, and dengue endemic countries are also exposed to the risk of periodic outbreaks [1, 3]. In Colombia, dengue has been prevalent over the last 20 years with different degrees of incidence rates and epidemics in various geographical locations [4, 5]. Due to the complexity of the disease, there are still large knowledge gaps regardingthe causes of dengue epidemics [6,7,8,9]. Infection with one serotype provides life-long immunity to that specific serotype. Therefore, subsequent introduction of the same serotype in a community would be less likely to cause the occurrence of a dengue epidemic if there were a small population of dengue-susceptible individuals [7, 8, 10]. However, due to a high degree of antigenic cross-reactivity, sequential infection of two different serotypes can bring favorable or detrimental outcomes depending upon known and unknown factors including timing of infection [2, 11, 12]. For example, a primary infection may help slow the spread of secondary heterologous infection when some degrees of cross-protection are conferred [2, 12,13,14]. On the other hand, many studies have shown that subsequent heterologous infection would likely increase the probability of experiencing severe dengue fever [15,16,17,18]. One of the known mechanisms is the antibody dependent enhancement (ADE) during the second infection mediated by non-protective heterotypic antibodies arising from the primary infection [2, 11, 14, 19]. In dengue endemic countries such as Colombia, the number of dengue cases is periodically reported to the upper- level health management unit (i.e. provincial or Ministry of Health) from various health facilities at the municipality level [4, 20]. In the case of dengue fever, like any other diseases, severe cases are detected more easily than mild symptoms, which in turn, leads to a higher volume of reported caseload [21]. Thus, having more severe cases is also related to the high likelihood of observing dengue epidemics when an epidemic is determined based on official statistics of reported cases.

While it is undeniable that all of these aspects would affect the occurrence of dengue epidemics directly and indirectly, it does not appear to be practical in proving the impacts of these factors on the occurrence of dengue epidemics due to the following reasons: (1) despite various efforts to disentangle the complexity of the disease [11], it is still uncertain to generalize how one serotype reacts with another in terms of cross-protection or ADE for all possible scenarios among four serotypes, as well as the duration of the interactions [22, 23]; (2) even if this uncertainty is going to be uncovered in the near future, it would be very difficult to obtain the details of sero-prevalence history over a long period of time for each cohort in all specific locations. These limitations make it difficult to understand how much of each factor would contribute to the actual probability of a dengue epidemic occurrence [7, 9, 24].

A more practical way is to focus on the basic principle of the occurrence of a dengue epidemic. Simply put, a dengue epidemic occurs when a large number of people become infected within a short period of time [2, 7]. It requires a large number of vector mosquitoes (Aedes aegypti), as well as high transmission probability, and frequent contact between people and the vectors (biting rate) to sustain transmission [2, 3, 7]. In other words, a dengue epidemic would more likely occur when vector mosquitoes increase within a short time period in a location where dengue viruses are currently circulating and population density with no immunity to one of the four serotypes is high during the same period [8, 9, 24]. Further, the importation of infected cases into a community where there is no immunity to that specific serotype would cause an epidemic as well.

Following this principle, the main concept of this study lies in the increase of vector mosquitoes as a primary factor of a dengue epidemic taking into account population density at different elevation levels. As a vector-borne viral disease, there is a wide range of factors that influence the spatial and temporal dynamics of mosquito populations: temperature, rainfall, and humidity, etc. [9, 24, 25]. There have been several efforts to understand the relationship between dengue epidemics and climate change. Juffrie and Focks used sea surface temperature anomalies to identify the occurrence of dengue epidemics in Yogyakarta, Indonesia and Bangkok, Thailand [26]. Lowe et al. developed an epidemic early warning system for Southeast Brazil using several climate and non-climate datasets [27]. More recently, Huang et al. found that El Nino-Southern Oscillation climate cycles and temperature were important factors affecting the weekly occurrence of the four dengue serotypes in Cairns, Australia [23]. Adde et al. also identified summer Equatorial Pacific Ocean sea surface temperatures and Azore high sea-level pressure as significant indicators in predicting dengue epidemics in French Guiana [28]. While some of the climate factors were more commonly used due to the nature of a vector-borne disease, their applications varied and were geographically focused. These findings from previous literature showed that climate factors play a significant role in the occurrence of dengue epidemics.

This study first attempts to predict a dengue epidemic by developing an Early Warning Signal (EWS) model based upon the temporal relationship between the occurrence of dengue epidemics and climate variability which affects mosquito populations in Colombia. Furthermore, using climate data and topographical information, the study identifies population at high risk for dengue fever for efficient disease prevention activities.


Dengue Incidence Proxy (DIP) was created to observe the trend of the dengue incidence in Colombia. The number of dengue fever cases and population data were obtained from SIVIGILA and Departamento Administrativo Nacional de Estadistica (DANE) which are both official governmental programs in Colombia [4, 29]. Dividing the dengue fever cases reported by population can be used as a good proxy to observe the overall trend of dengue fever. SIVIGILA also provides a weekly report on epidemiological events (Boletin Epidemiologico) which discloses the proportions of municipalities that were not responsive for each department [30]. Thus, the number of cases was adjusted by the proportions for underreporting by assuming that a non-responsive municipality would have the average number of cases per responsive municipality of that department: the reported cases by department was divided by the number of the responsive municipalities in that department, applied to non-responsive municipalities, and added to the reported cases by department. DIP was estimated by dividing the adjusted cases by population. While Boletin Epidemiologico was available over the study period, a more consistent pattern of the underreporting system was observed in the reports since 2011 after the large outbreak in 2010. Because a robust case reporting system is critical for determining the relationships between DIP and climate data, some departments out of 31 departments were excluded if over 20% of underreporting based on Boletin Epidemiologico occurred more than twice since 2011. An outbreak was defined as a relative term in this study. In other words, as long as an unusual peak in DIP was observed in a department, it was considered as an outbreak even if the DIP value in that department was relatively low compared to other departments where dengue is more prevalent. An unusual peak was marked by department if the slope of DIP over every six months fell into the highest 10% of the observations.

Table 1 summarized the datasets used in this study. Considering the spatial and temporal dynamics of mosquito populations, three climate datasets and two non-climate datasets were selected as factors which can explain variation in DIP. The climate raster datasets include air temperature, precipitation, and specific humidity [31,32,33]. The monthly climate datasets were obtained from 2006 to 2015, and all the raster files were resampled into 0.008 by 0.008 degree resolution by taking the nearest neighbor assignments. It should be noted that the study presumed that it is critical to consider how long favorable conditions for vector mosquitoes persist [9, 23]. In other words, a current epidemic is a result of the climate conditions consistently observed during the past months, rather than single temporal (monthly or daily) values at present. For example, if warm temperature and high humidity were observed only for a short time period of each year, these conditions would less likely affect the larval development or virus replication to cause an epidemic [25]. Thus, after checking cross-correlograms to define a proper period, the 12-month moving average of the mean values of each climate data was estimated by department (Additional file 1).

Table 1 Data description

In addition to the climate factors, night light data and elevation raster files were included [34, 35]. Night lights data which is available by year was used to understand population density instead of conventional population statistics. The use of the night-lights data provides more flexibility to estimate population density at various levels of geographical units over time than the projected population data [36]. Prior to applying the night-lights data, correlations between night-lights data and population data were tested to ensure that the night-lights data can be used as an appropriate proxy (ρ = 0.94). The most recent night-lights data was for 2013 at the time of the research. As the population level does not change dramatically during a short period of time, the population level in 2013 was assumed to be consistent in 2014 and 2015. High population density would have two opposite effects in terms of transmission intensity depending upon the level of a reproduction number: (1) dilution of infectious individuals by having a large pool of host populations, (2) a large number of susceptible hosts to be infected, leading to the surge of infected cases. For the latter case, while transmission would be more intensive in a place where population density is high, holding other climate factors constant, it does not have to be necessarily true in areas at high elevations [9]. A previous study found that it is difficult for Aedes aegypti mosquitoes to survive at an elevation of 6000–8000 ft or even at lower elevations in temperate latitudes [37]. Because many people in Colombia live at high elevations (i.e. Bogota), the mean value of the night lights was used to estimate population density separately for people living under 1500 m and those living over 1500 m by department [38].

The three climate datasets are partially correlated but also have their own distinctive characteristics. In order to preserve all the information contained in each of the climate datasets, the Climate Risk Factor (CRF) index was created. The advantage of using a composite index is that it prevents multicollinearity when running regressions against independent variables with some level of the correlations among the variables. The three climate variables and population density under 1500 m were used by department. The precipitation variable, which has a negative relationship with DIP, was reversed, so all variables go towards the same underlying concept (the increase in DIP). The variables were first standardized individually by subtracting the mean and dividing by the standard deviation. The standardized values were then averaged across the variables [36, 39]. The final values were converted into a range from zero (low risk) to one (high risk) and multiplied by 100 for an easier interpretation. It should be noted that the temperature and specific humidity data used in this study are measures at the surface level. More precisely, air temperature is at 2 m above the ground surface, and specific humidity is measured near surface at sea level with pressure level 1000 millibars. Thus, it would be desirable to adjust the CRF index by the risk proportion at low and high elevation. The proportion at risk was estimated by dividing the sum of the night lights observed under 1500 m elevation by the sum of the total night lights in each department. The final CRF index was the product of the raw CRF index and the proportion at risk.

There were two dominant patterns observed during past dengue epidemics in Colombia: (1) rapid increase of the CRF index, (2) relatively steady increase of the CRF index at different levels of the CRF and DIP values. In other words, the slope of the CRF index curve at various levels of the CRF index and DIP values appeared to be critical in predicting the occurrence of dengue epidemics. In order to assess this combined relationship, the elasticity of the CRF index curve was estimated. This is defined as the percentage change in DIP in response to a 1 % change in the CRF index [40, 41]. The stationarity of the dataset was tested to ensure that there were no trend and periodic seasonal effects. The augmented Dickey-Fuller (ADF) unit-root test was used to test whether the dataset is stationary by department [42, 43]. DIP is non-negative integer values, and count models were used to fit DIP as a function of the CRF index (Additional file 1: Supplementary 2). The DIP dataset consists of two parts: (1) model dataset, (2) validation dataset. The model was constructed based on monthly DIP and the CRF index by department from January 2007 to December 2015. The validation dataset which was separated from the model dataset was established from January 2016 to April 2016 and used to validate the model performance. Overdispersion—where the variance is greater than the mean—was tested using the Z-score test at the 5% significant level [44,45,46]. In addition, the Akaike Information Criterion (AIC) fit test was used to compare the model fits between Poisson and negative binomial models. Being a non-linear model, the elasticity of the CRF function can be given as [46]:

$$ E lasticity=\frac{\partial E\left({y}_i|{x}_i\right)}{\partial {x}_i}\bullet \frac{x}{y}= \exp \left({x}_i^{\hbox{'}}{\beta}_k\right){\beta}_k\bullet \frac{x}{y} $$

where \( \exp \left({x}_i^{\prime}\beta \right) \) is the expected DIP values, β k is the coefficient of CRF, x is the explanatory, and y is the response.

As shown above, the main interest of the study lay in estimating elasticities, and count models were used as an intermediary step in calculating elasticities. Given the geographical variations of dengue outbreaks, it is critical to estimate the elasticities separately by department with varying coefficient values of CRF. In this context, the current model was preferred to non-linear mixed models with a fixed coefficient and random effects since the use of coefficients and the measure of marginal effects and elasticities were more straightforward, reducing any possibility of potential overspecification (i.e. multiple adjustments) [46, 47]. Because the model was run separately for each department allowing variation in the CRF index by department, there is no concern about creating the effect of spatial autocorrelation. The elasticities were derived for every six months from January 2007 to December 2015. Early Warning Signal (EWS) was modeled such that dengue epidemics in Colombia can likely occur when the elasticity of the CRF index is maximized given the instantaneous slopes of DIP and the CRF index over time are positive minimizing the squared residuals.


$$ Elasticity, E $$

Subject to:

$$ (1)\ {\beta}_{DIP}=\frac{\sum_{i= t}^{t+5}\left({DIP}_i-\overset{-}{DIP}\right)}{\sum_{i= t}^{t+5}\left({T}_i-\overset{-}{T}\right)}>0 $$


$$ (2)\ {\beta}_{CRF}=\frac{\sum_{i= t}^{t+5}\left({CRF}_i-\overset{-}{CRF}\right)}{\sum_{i= t}^{t+5}\left({T}_i-\overset{-}{T}\right)}>0 $$

where \( \overset{-}{DIP}\ and\ \overset{-}{CRF} \) are the means of DIP and CRF, T is time (month). The elasticities were then categorized into three percentiles: low level warning (0–50%), medium level warning (50–75%), and high level warning (75–100%). As expressed by Adde et al., the hit rate (HR) and false alarm rate (FAR) were defined as below [28]:

$$ HR\ \left( or\ sensitivity\right)=\frac{\left( Detections\ |\ Outbreak\right)}{\left( Detections+ Misses\ |\ Outbreak\right)} $$
$$ FAR\ \left( or\ 1- specificity\right)=\frac{\left( False\ signals\ |\ No\ outbreak\right)}{\left( No\ signals+ False\ signals\ |\ No\ outbreak\right)} $$

In addition, a sensitivity analysis was conducted with various moving average scenarios to make sure that the 12-month moving average is the most suitable period for the performance of the EWS model.

Given that the CRF index is statistically significant to explain variance of DIP for the departments where significant underreporting was not observed, the CRF index was further estimated at the smaller geographical level (5 km by 5 km resolution) for the entire country and used to identify high risk areas.


During the period from January 2007 to December 2015, two major outbreaks were observed in many parts of Colombia. Figure 1 presents the overall trends of the three climate factors, as well as the DIP from 2007 to 2015 in Valle del Cauca, one of the departments where dengue fever is highly prevalent (see Additional file 1: Supplementary 3 for other departments). Looking at the bottom right panel in Fig. 1, there were two major outbreaks in 2010 and 2013 in the department. Comparing the trend of DIP with the climate factors, DIP appears to be positively correlated with temperature and humidity, but has a negative relationship with precipitation.

Fig. 1
figure 1

Climate factors and DIP over time in Valle del Cauca*. * See Additional file 1: Supplementary 3 for other departments

13 of 31 departments in Colombia were chosen after checking the robustness of the case reporting system. The ADF test showed that we reject the null hypothesis, which means that the dataset is stationary. As shown in Table 2, the CRF index is highly significant for all departments except Guaviare and Magdalena, thus 11 departments were selected for further analysis.

Table 2 Regression outputs of the CRF index on DIP

The CRF index and DIP were plotted over time to show the overall trend in Fig. 2 (see Additional file 1: Supplementary 4 for other departments). It is clear that the epidemic that occurred in 2010 was picked up by the steep increase of the CRF index. In 2013, another epidemic was observed. While there was no rapid change in terms of the CRF index during a short period in 2013, the CRF index reached its high level following the steady increase of the index since 2012. These provide an important point where an occurrence of future dengue epidemic can be related not only to the rapid increase of the CRF index, but also to the various levels of the CRF index and DIP. These combined relationships can be further explained by the elasticity of the CRF index which was used to develop an Early Warning Signal (EWS) model. In Fig. 3, the EWS based on the elasticity of the function was demonstrated for Valle del Cauca. In the department, the peak DIP was observed in March 2010, and the EWS signaled the high level warning sign two months before the peak (January 2010). Similarly, the second peak occurred in May 2013, and the EWS level went up from low to medium in January 2013 and remained at the same level until the end of the peak. It should be noted that there was no major outbreak observed throughout 2015 despite the continuous increase of the CRF index. Instead, Zika, another viral disease caused by Aedes aegypti emerged in 2015 and continued to increase in 2016. Overall, all 11 departments experienced dengue epidemics in 2010, and nine of them had additional minor outbreaks since 2011. Among the total of 24 observed outbreaks, EWS successfully detected 18 (75%) 1 ~ 5 months ahead of time and, three (12.5%) in the same month, and missed three (12.5%) (Additional file 1: Supplementary 6).

Fig. 2
figure 2

The CRF index and DIP over time in Valle del Cauca***. * DIP was smoothed out to reduce short-term fluctuations and highlight longer term trends for demonstration. **Zika cases were reported in 2015 as well, but zika incidence rates (/100,000) were not clearly shown for year 2015 due to the low number of reported cases. *** See Additional file 1: Supplementary 4 for other departments

Fig. 3
figure 3

Early Warning Signal in Valle del Cauca

The EWS model predictability was examined with the validation data in 2016 which was separated from the model. It is interesting to see that the EWS already signaled the high level warning sign at the end of 2015, which accurately predicted another outbreak in two months (February 2016) that is out of the study period. Figure 4 further demonstrates the EWS model performance with the validation data for all 11 departments. 6 of 11 departments experienced outbreaks between January 2016 and April 2016. The EWS model successfully predicted these outbreaks 1 ~ 5 months ahead of time for all departments except Boyaca (HR = 83.3%). In addition, the EWS model did not send out any false alarms for the other 5 departments where no outbreak occurred during the out-of-sample period (FAR = 0%). In other words, sensitivity (HR), specificity, positive predictive value, and negative predictive value of the validation data were as follows: 83.3%, 100%, 100%, and 83.3%.

Fig. 4
figure 4

EWS accuracy with the validation data in 2016

The sensitivity analysis was performed with different moving average scenarios (12 months, 6 months, current value). As shown in Table 3, the hit rate was the highest with the 12-month moving average scenario, meaning that the current model produced the most accurate prediction compared to the 6-month and no-moving-average scenarios. The false alarm rate increased as the moving average period was shortened. This is mainly because the index becomes too sensitive and changes quickly due to the short duration of the moving averages of the climate datasets. As a result, it does not distinguish between minor fluctuations and major outbreaks (Fig. 5). This sensitive behavior of the CRF index with the shorter term scenarios proves our presumption that a current dengue epidemic is a result of the consistent long-term patterns of the climate conditions.

Table 3 Sensitivity analysis with additional moving average scenarios
Fig. 5
figure 5

The CRF index with different moving average scenarios in Valle del Cauca

Given that the CRF index explains variation in DIP reasonably well, the CRF index was estimated at 5 km by 5 km resolution, and the most recent time of the index (December 2015) was presented in Fig. 6 (see Additional file 1: Supplementary 5 for more details). As expected, populations at high risk are concentrated in the Western part of the country due to more suitable climate conditions for vector mosquitoes and the high population level compared to the East. Using the geo-coordinates of the high risk areas at 5 km by 5 km resolution, it is possible to identify the locations for people at high risk more accurately for efficient disease prevention activities.

Fig. 6
figure 6

Identification of high risk areas in Dec, 2015*. * See Additional file 1: Supplementary 5 for more details


This study confirms that dengue fever transmission is strongly related to climate factors as well as population density at different topographical conditions. One of the advantages of the CRF index is to prevent multicollinearity by combining all relevant climate indicators which may have some degrees of correlations with each other but have distinctive characteristics at the same time. During the study period from January 2007 to December 2015, the nationwide dengue epidemic occurred in 2010 was well explained by the rapid changes of the CRF index. Even if the CRF index increased steadily, the study found that it was still possible to detect an epidemic by adopting the elasticity of the function which takes into account not only the slopes but also the various levels of CRF and DIP.

In 2015, some inconsistent patterns between CRF and DIP were observed for some departments (Additional file 1: Supplementary 7). This inconsistency may be related to the unexpected emergence of Zika, which started being reported in 2015. As shown in Fig. 2, the number of Zika cases has continuously increased since 2015. However, it is still premature to make any firm statements regarding the impact of Zika on dengue fever due to uncertainty of the diseases. Given that reported cases are mainly based on clinical symptoms, there may have been a chance of misdiagnosis between the two diseases. In addition, due to the surge of an unfamiliar disease (Zika) imposing more difficulties on resource allocation at the local health facility level, it would be difficult to keep a consistent pattern in the case-reporting system from municipality-level heath facilities. Excluding 2015, a number of false alarms where EWS sends out the medium or high level signals but DIP remains low were only observed twice in Cauca (April and December 2014) during the study period.

Some areas of uncertainty deserve attention. While the CRF index performed well for 11 of 13 departments, the index was not statistically significant in Magdalena and Guaviare. This may have been caused partly by the inconsistent patterns of reported cases over time. Because the EWS was estimated based upon the most recent observed climate datasets, the EWS in this study is limited to issuing alerts with short-time intervals (1 ~ 5 months ahead). Given that, at present there are 1 ~ 2 month delays until the climate data become available, EWS with the short intervals (i.e. less than two months) may not, for now, be practical in operational modes. However, this limitation can be improved based upon the availability of the climate datasets in real-time in the future, and the 1 ~ 5 month intervals would provide enough room for public health officials to prepare for selected vector control activities and healthcare interventions (i.e. increase the number of beds at high risk areas) in the dengue-endemic setting [9, 26]. It should be noted that the study did not attempt to produce any longer-term predictions due to chaos and uncertainty in climate forecasts in the long run. Considering that long-term climate forecasts could be variable depending upon assumptions (i.e. future CO2 omission level), the method proposed in this study could minimize potential bias which may be caused by uncertainty in input datasets. The climate datasets have coarse resolutions. While the datasets were resampled using the nearest option in this study, the model outcomes can be further improved with finer scale resolutions. It is worth noting that the cycling of El Niño and La Niña, called El Niño Southern Oscillation (ENSO), may have indirect impacts on the occurrence of dengue epidemics in South America by changing the patterns of climate variables such as temperature, precipitation, and humidity [28]. While any unusual changes of the climate variables affected by such events were captured by using the 12-month moving averages, further investigation would be needed to identify accurate impacts of El Niño on climate factors including its timing.

Nonetheless, our model provided accurate forecasts for the validation period for 5 of 6 departments that experienced outbreaks in 2016. In addition, this study identified populations at high risk for dengue at 5 km by 5 km resolution. The study findings can be used to accelerate introduction of dengue prevention activities and prioritize alternative health interventions among competing health demands in Colombia.


The CRF index summarized multiple climate and non-climate risk factors into a single indicator which helps decision makers to understand easily [36]. While some of the climate factors were more commonly used in the existing literature due to the nature of a vector-borne disease, the applications of the climate data in these studies appeared to vary. The proposed EWS model in this study used the concept of elasticity to understand how DIP changes to varying levels of the CRF index and successfully detected dengue outbreaks in Colombia. In addition, the CRF index was further estimated at 5 km by 5 km resolution. The areas where the CRF index values have been continuously high over time can be prioritized for appropriate healthcare interventions. Furthermore, this can guide decision makers to find relevant locations where future surveillance studies can be conducted.



Antibody Dependent Enhancement


Augmented Dickey Fuller


Akaike Information Criterion


Climate Risk Factors


Dengue Incidence Proxy


Early Warning Signal


  1. WHO: Dengue: prevention and control. In: Sixty-eight World Health Assembly. Geneva, Switzerland: World Health Organization; 2015.

  2. Rodriguez-Roche R, Gould EA. Understanding the dengue viruses and progress towards their control. Biomed Res Int. 2013;2013:20.

    Article  Google Scholar 

  3. Transmission of the Dengue Virus [].

  4. Instituto Nacional de Salud (INS): Dengue Report (Informe de evento Dengue INT- PRO-R02.003.0000–004). . In: Informe de evento. Bogotá, Colombia: Instituto Nacional de Salud. Subdirección de Prevencion, Vigilancia y Control en Salud Pública,; 2014: 37.

  5. Villar LA, Rojas DP, Besada-Lombana S, Sarti E. Epidemiological trends of dengue disease in Colombia (2000-2011): a systematic review. PLoS Negl Trop Dis. 2015;9(3):e0003499.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Murray NEA, Quam MB, Wilder-Smith A. Epidemiology of dengue: past, present and future prospects. Clinical epidemiology. 2013;5:299.

    PubMed  PubMed Central  Google Scholar 

  7. Imai N, Dorigatti I, Cauchemez S, Ferguson NM. Estimating dengue transmission intensity from Sero-prevalence surveys in multiple countries. PLoS Negl Trop Dis. 2015;9(4):e0003719.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Barmak DH, Dorso CO, Otero M, Solari HG. Dengue epidemics and human mobility. Phys Rev E Stat Nonlinear Soft Matter Phys. 2011;84

  9. Lowe R, Bailey TC, Stephenson DB, Jupp TE, Graham RJ, Barcellos C, Carvalho MS. The development of an early warning system for climate-sensitive disease risk with a focus on dengue epidemics in Southeast Brazil. Stat Med. 2013;32

  10. Runge-Ranzinger S, Horstick O, Marx M, Kroeger A. What does dengue disease surveillance contribute to predicting and detecting outbreaks and describing trends? Tropical Med Int Health. 2008;13

  11. Bhoomiboonchoo P, Nisalak A, Chansatiporn N, Yoon IK, Kalayanarooj S, Thipayamongkolgul M, Endy T, Rothman AL, Green S, Srikiatkhachorn A, et al. Sequential dengue virus infections detected in active and passive surveillance programs in Thailand, 1994-2010. BMC Public Health. 2015;15:250.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Halsey ES, Marks MA, Gotuzzo E, Fiestas V, Suarez L, Vargas J, Aguayo N, Madrid C, Vimos C, Kochel TJ, et al. Correlation of serotype-specific dengue virus infection with clinical manifestations. PLoS Negl Trop Dis. 2012;6(5):e1638.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Sabin AB. Research on dengue during world war II. Am J Trop Med Hyg. 1952;1(1):30–50.

    Article  CAS  PubMed  Google Scholar 

  14. Reiner RC, Stoddard ST, Forshey BM, King AA, Ellis AM, Lloyd AL, Long KC, Rocha C, Vilcarromero S, Astete H, et al. Time-varying, serotype-specific force of infection of dengue virus. Proc Natl Acad Sci. 2014;111(26):E2694–702.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Halstead SB, Simasthien P. Observations related to the pathogenesis of dengue hemorrhagic fever. II. Antigenic and biologic properties of dengue viruses and their association with disease response in the host. Yale J Biol Med. 1970;42(5):276–92.

    CAS  PubMed  PubMed Central  Google Scholar 

  16. Russell PK, Yuill TM, Nisalak A, Udomsakdi S, Gould DJ, Winter PE. An insular outbreak of dengue hemorrhagic fever. II. Virologic and serologic studies. Am J Trop Med Hyg. 1968;17(4):600–8.

    Article  CAS  PubMed  Google Scholar 

  17. Guzman MG, Kouri G, Valdes L, Bravo J, Alvarez M, Vazques S, Delgado I, Halstead SB. Epidemiologic studies on dengue in Santiago de Cuba, 1997. Am J Epidemiol. 2000;152(9):793–9. discussion 804

    Article  CAS  PubMed  Google Scholar 

  18. Graham RR, Juffrie M, Tan R, Hayes CG, Laksono I, Ma'roef C, Erlin S, Porter KR, Halstead SB. A prospective seroepidemiologic study on dengue in children four to nine years of age in Yogyakarta, Indonesia I. Studies in 1995-1996. Am J Trop Med Hyg. 1999;61(3):412–9.

    Article  CAS  PubMed  Google Scholar 

  19. Lourenco J, Recker M. Natural, persistent oscillations in a spatial multi-strain disease system with application to dengue. PLoS Comput Biol. 2013;9(10):e1003308.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Carabalí JM, Hendrickx D. Dengue and health care access: the role of social determinants of health in dengue surveillance in Colombia. Glob Health Promot. 2012;19(4):45–50.

    Article  PubMed  Google Scholar 

  21. Sarti E, L’Azou M, Mercado M, Kuri P, Siqueira JB Jr, Solis E, Noriega F, Ochiai RL. A comparative study on active and passive epidemiological surveillance for dengue in five countries of Latin America. Int J Infect Dis.

  22. Lourenço J, Recker M. Dengue serotype immune-interactions and their consequences for vaccine impact predictions. Epidemics. 2016;16:40–8.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Huang X, Clements AC, Williams G, Devine G, Tong S, Hu W. El Nino-southern oscillation, local weather and occurrences of dengue virus serotypes. Sci Rep. 2015;5:16806.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Louis VR, Phalkey R, Horstick O, Ratanawong P, Wilder-Smith A, Tozan Y, Dambach P. Modeling tools for dengue risk mapping - a systematic review. Int J Health Geogr. 2014;13(1):1–15.

    Article  Google Scholar 

  25. Hii YL, Zhu H, Ng N, Ng LC, Rocklov J. Forecast of dengue incidence using temperature and rainfall. Plos Neglect Trop Dis. 2012;6

  26. Juffrie M, Focks DAFDA. EarlyWarning System (EWS) for Dengue in Indonesia and Thailand. Journal of the Medical Sciences (Berkala ilmu Kedokteran). 2009:41(03).

  27. Lowe R, Bailey TC, Stephenson DB, Jupp TE, Graham RJ, Barcellos C, Carvalho MS. The development of an early warning system for climate-sensitive disease risk with a focus on dengue epidemics in Southeast Brazil. Stat Med. 2013;32(5):864–83.

    Article  PubMed  Google Scholar 

  28. Adde A, Roucou P, Mangeas M, Ardillon V, Desenclos JC, Rousset D, Girod R, Briolant S, Quenel P, Flamand C. Predicting dengue fever outbreaks in French Guiana using climate indicators. PLoS Negl Trop Dis. 2016;10(4):e0004681.

    Article  PubMed  PubMed Central  Google Scholar 

  29. DANE DNdEdC, . Estimaciones de población 1985–2005 y proyecciones de población 2005–2020 total municipal por área. In: Departamento Nacional de Estadistica de Colombia. DANE.

  30. Sivigila Informe Epidemiológico de Evento [].

  31. GHCN Gridded V2 data products from PSD [].

  32. GPCC Full Data Reanalysis Version 7.0 at 1.0°: Monthly Land-Surface Precipitation from Rain-Gauges built on GTS-based and Historic Data [].

  33. NCEP Reanalysis Derived data [].

  34. Nighttime Lights Time Series [].

  35. The Global Land One-kilometer Base Elevation (GLOBE) Digital Elevation Model [].

  36. Lee J-S, Mogasale VV, Mogasale V, Lee K. Geographical distribution of typhoid risk factors in low and middle income countries. BMC Infect Dis. 2016;16(1):732.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Christophers S: Aedes aegypti (L.) the yellow fever mosquito: its life history, bionomics and structure. Rickard 1960.

  38. The Climate Connection to Dengue Fever [].

  39. Treiman DJ: Quantitative data analysis: doing social research to test ideas: John Wiley & Sons; 2014.

    Google Scholar 

  40. Bratt JH, Weaver MA, Foreit J, De Vargas T, Janowitz B. The impact of price changes on demand for family planning and reproductive health services in Ecuador. Health Policy Plan. 2002;17(3):281–7.

    Article  PubMed  Google Scholar 

  41. Huang KS, Lin B-H: Estimation of food demand and nutrient elasticities from household survey data: US Department of Agriculture, Economic Research Service; 2000.

    Google Scholar 

  42. StataCorp: Stata 14 Base Reference Manual. Stata Press 2015.

  43. Dickey DA, Fuller WA. Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc. 1979;74(366a):427–31.

    Article  Google Scholar 

  44. Piza EL. Using poisson and negative binomial regression models to measure the influence of risk on crime incident counts. Rutgers Center o n Public Security. 2012;

  45. Gardner W, Mulvey EP, Shaw EC. Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models. Psychol Bull. 1995;118(3):392–404.

    Article  CAS  PubMed  Google Scholar 

  46. Hilbe JM. Negative binomial regression. 2nd ed. Cambridge: Cambridge University Press; 2011.

    Book  Google Scholar 

  47. Atkins DC, Baldwin SA, Zheng C, Gallop RJ, Neighbors C. A tutorial on count regression and zero-altered count models for longitudinal substance use data. Psychol Addict Behav. 2013;27(1):166.

    Article  PubMed  Google Scholar 

Download references


Not applicable.


No funding required.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

Author information

Authors and Affiliations



JS Lee conducted the analyses and wrote the manuscript; M Carabali and IY Park carried out data collection and data interpretation; JK Lim, M Herrera, and LA Villa contributed to data interpretation and study design; A Farlow contributed to data interpretation and overall inputs on the study outcomes.

Corresponding author

Correspondence to Jung-Seok Lee.

Ethics declarations

Ethics approval

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Supplementary 1.Cross-correlograms of climate datasets and DIP. Supplementary 2. Model specifications. Supplementary 3. Climate factors and DIP over time by department. Supplementary 4. The CRF index and DIP over time by department. Supplementary 5. Identification of high risk areas for dengue fever. Supplementary 6. EWS for 11 departments during the study period. Supplementary 7. EWS for 11 departments in 2015. (ZIP 2317 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, JS., Carabali, M., Lim, J.K. et al. Early warning signal for dengue outbreaks and identification of high risk areas for dengue fever in Colombia using climate and non-climate datasets. BMC Infect Dis 17, 480 (2017).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: