Skip to main content

Identifying and characterizing hepatitis C virus hotspots in Massachusetts: a spatial epidemiological approach



Hepatitis C virus (HCV) infections have increased during the past decade but little is known about geographic clustering patterns.


We used a unique analytical approach, combining geographic information systems (GIS), spatial epidemiology, and statistical modeling to identify and characterize HCV hotspots, statistically significant clusters of census tracts with elevated HCV counts and rates. We compiled sociodemographic and HCV surveillance data (n = 99,780 cases) for Massachusetts census tracts (n = 1464) from 2002 to 2013. We used a five-step spatial epidemiological approach, calculating incremental spatial autocorrelations and Getis-Ord Gi* statistics to identify clusters. We conducted logistic regression analyses to determine factors associated with the HCV hotspots.


We identified nine HCV clusters, with the largest in Boston, New Bedford/Fall River, Worcester, and Springfield (p < 0.05). In multivariable analyses, we found that HCV hotspots were independently and positively associated with the percent of the population that was Hispanic (adjusted odds ratio [AOR]: 1.07; 95% confidence interval [CI]: 1.04, 1.09) and the percent of households receiving food stamps (AOR: 1.83; 95% CI: 1.22, 2.74). HCV hotspots were independently and negatively associated with the percent of the population that were high school graduates or higher (AOR: 0.91; 95% CI: 0.89, 0.93) and the percent of the population in the “other” race/ethnicity category (AOR: 0.88; 95% CI: 0.85, 0.91).


We identified locations where HCV clusters were a concern, and where enhanced HCV prevention, treatment, and care can help combat the HCV epidemic in Massachusetts. GIS, spatial epidemiological and statistical analyses provided a rigorous approach to identify hotspot clusters of disease, which can inform public health policy and intervention targeting. Further studies that incorporate spatiotemporal cluster analyses, Bayesian spatial and geostatistical models, spatially weighted regression analyses, and assessment of associations between HCV clustering and the built environment are needed to expand upon our combined spatial epidemiological and statistical methods.

Peer Review reports


Approximately 71.1 million people are estimated to be currently infected with the hepatitis C virus (HCV) worldwide [1]. The number of people infected in the U.S. alone is estimated to be more than three million, of whom 75% were born between 1945 and 1965 [2]. Age-adjusted HCV mortality rates have increased steadily in recent decades [3], and notable increases in HCV cases have been reported in many states across the U.S. [4] At least 60% of prevalent HCV infections and nearly all incident HCV infections are attributed to injection drug use (IDU) [58], with an increasing burden among young people who inject drugs (PWID) [36] in non-urban, white communities [4].

Recent HCV surveillance data in Massachusetts (MA) identified a new HCV epidemic pattern characterized by a bimodal age distribution, with the expected peak among 45–65 year olds and an emerging peak among 15–30 year olds [9, 10]. There was a 78% increase in HCV cases among adolescents and young adults (15–24 years) in MA between 2002 and 2009, and an increase of 137% in the 15–29-year-old population between 2002 and 2013 [9, 10]. HCV surveillance is greatly underfunded across the United States [11], and limited spatial epidemiological and geostatistical analysis of surveillance data has been performed to date.

Innovative analytical approaches can foster a better understanding of disease transmission and improved preparation for targeted responses. Studies have demonstrated that GIS-guided approaches to disease screenings have a higher yield than traditional screening methods [1215]. Information on where infections cluster may enhance disease prevention, treatment, and care. For example, in Connecticut, researchers used a mix of statistical and geographic information system (GIS) methods to identify unique spatial distributions and characteristics for HCV and related infections [16]. Along the U.S.-Mexico border, spatial epidemiological methods were used to identify HIV clusters, and to determine that most seroconversions were occurring in a 2.5 block radius in the red light district of Tijuana [17]. In MA, we identified HCV and HIV all-cause mortality hotspots from 2002 to 2012 [18]. However, GIS and spatial analyses to date have been relatively simplistic in their approach and new, more methodologically rigorous spatial cluster analyses are needed to pinpoint locations (i.e., “hotspots”) with highest needs for intervention. The objectives of our study were to: (1) identify HCV clusters in MA using GIS and spatial epidemiology, and (2) to characterize the HCV clusters through statistical modeling.


We defined a hotspot as a location with a statistically significant cluster of census tracts with higher counts and rates of HCV than the average count/rate for all census tracts in MA [18].

Data and measures

We obtained a partially de-identified, limited HCV surveillance dataset from the Massachusetts Department of Public Health (MDPH) that included 99,780 records of people with evidence of past or present infection with HCV reported from 2002 to 2013. All laboratory test results indicative of HCV infection are reportable to MDPH. Most of these reports are received via electronic laboratory reporting into the MA Virtual Epidemiologic Network (MAVEN), MDPH’s secure, web-based, electronic surveillance system [19]. The data are comprised of all newly reported probable and confirmed cases of HCV during the time period and offer the most complete window to prevalent HCV infection patterns available. CDC case definitions were employed. A probable HCV case was a case that did not meet the case definition for acute HCV, was anti-HCV positive (repeat reactive) by enzyme immunoassays (EIA), and had alanine aminotransferase (ALT or SGPT) values above the upper limit of normal, but the anti-HCV EIA result had not been verified by an additional more specific assay or the signal to cut-off ratio was unknown. A confirmed HCV case was defined as a case that was laboratory confirmed and did not meet the case definition for acute HCV. Reported variables include year of report, sex, mode of transmission (IDU vs. non-IDU), race/ethnicity, and address. Address information was missing for 15,541 records. Our final analytical dataset included 84,255 HCV cases.

Demographic and risk variables and rate computation

We obtained population denominators and sociodemographic data on the census tract level from U.S. Census Bureau’s American Community Survey (ACS) [20]. As of the 2010 Census, there were 1478 census tracts in MA. Using GIS and ACS data, we calculated HCV counts and rates of infection per 100,000 persons at the census tract level. Independent variables were selected based on prior research and the scientific literature. Continuous explanatory variables included race/ethnicity, age, education level, households receiving food stamps, and housing status. We used rank order histograms to determine median or tertile cut-offs based on the skewness of the data. We categorized continuous variables using the median as a cut point for non-linear increases or decreases that were monotonic. We categorized continuous variables by tertile for non-linear trends that demonstrated a U-shape or inverted U-shape. Explanatory variables for total population, and household income were dichotomized at the median, and the continuous variables for households living in poverty, households with one worker, and households with two or more workers were categorized by tertiles. Census tracts with a total population of <1 were excluded from all analyses, resulting in omission of 14 tracts (n = 1464).

Other potentially associated geographic features

Addresses for health facilities, including HIV testing sites and syringe exchanges, were obtained from MDPH. Addresses for pharmacies and prisons were obtained from the MA State Board of Pharmacy and MassGIS, respectively. Adult bookstores and gay bars were obtained through publicly available webpages and Internet directories targeting this community. All health resources and other venues that could be associated with hotspots, based on the literature and prior research, were aggregated to the census tract level.

Spatial analyses

We used a combination of GIS, spatial epidemiological, and statistical analyses to: (1) identify the geolocation of statistically significant hotspot clusters of HCV cases, and (2) determine community-level factors (i.e., social determinants) associated with these clusters.

Data cleaning

We standardized address fields in SAS (v9.4, Cary, NC). The process involved parsing the addresses into different segments, unifying acronyms (e.g., changing “Str,” “Street,” “street,” and “st” into “St”), removing superfluous characters (e.g., the “#” sign), and correcting mismatched ZIP codes. The resultant segments were then rearranged into three variables: street address, town, and ZIP code, which were used for geocoding.


Geocoding is the process of obtaining longitude and latitude coordinates for an address or the geocentroid of a polygon (e.g., census tract) [21]. We geocoded the cleaned HCV cases in ArcGIS 10.2.2, using an address locator created from the U.S. Census Bureau’s 2014 Topologically Integrated Geographic Encoding and Referencing (TIGER) Line Shapefiles. We ran addresses that did not initially match through Google Earth’s desktop geocoder, which uses multiple algorithms to improve geographic match rates. Throughout our extensive cleaning processes, 66,023 observations were successfully matched in ArcMap with 16,133 more matching in Google Earth. This resulted in a total of 82,211 matched addresses for an overall match rate of 82.3% (82,211/99,780). For our analytical dataset, which included complete addresses for all HCV cases (n = 84,255), we achieved a match rate of 97.6% (82,211/84,255). These match rates are similar to those in other studies in the literature [22].

Descriptive mapping and cluster analyses

We conducted GIS and spatial epidemiological analyses to determine the burden of disease across MA. First, we created thematic GIS maps to determine the initial spatial distribution of HCV cases and rates. Data were aggregated at the census tract level to protect the confidentiality of people living with HCV while maintaining the optimal spatial resolution. Next, we used Kernel Density Estimation (KDE) to construct a smoothed surface of HCV cases across MA. KDE analyses incorporate the number and proximity of cases within defined geographic areas [23]. The final results from this method can be interpreted as the density of HCV cases per square mile.

We then used tests of spatial autocorrelation and hotspot analyses to identify the location of clusters of HCV case counts and rates per 100,000 population. Using ArcGIS 10.2.2 (ESRI, Redlands, CA), we first used incremental spatial autocorrelation at 30 different distances to determine the distance at which clustering was most intense (i.e., had the highest z-score) for each specified outcome [24]. The spatial scale (i.e., the distance) obtained from these results was then entered as a parameter in the subsequent Getis-Ord Gi* hotspot analyses. These analyses allowed us to identify the location of statistically significant clusters of census tracts with higher (or lower) values for HCV cases and infection rates [25]. Next, we conducted spatiotemporal cluster analysis (SaTScan v9.3) to determine temporal clustering patterns in micro-communities across specific years using a space-time cluster scanning statistic [26]. This model can be run at different scanning window sizes which limit the potential size of a discovered cluster. We ran this model at a 25% scanning window which we selected a priori to represent a balance of granular level cluster detection without exposing the analysis to excessive noise. Finally, we compared results across our four geostatistical analyses to observe consistencies in conclusions. Our spatial analytical methods are described in further detail elsewhere [18, 27].

Statistical analyses

After identifying HCV hotspot clusters, we calculated descriptive statistics (Chi-squared tests and two sample t-tests) for HCV cases with and without addresses and for matched and unmatched addresses in the geocoding process to determine whether there were differences in the HCV cases by geocode status. Next, we conducted logistic regression analyses to determine the factors associated with HCV hotspots.

Statistical analysis was performed in two stages. First, to obtain the unadjusted relationship we regressed hotspot status on each of the independent variables. Variables with an overall p-value <0.25 were retained for the multivariable models. For multiple logistic regression analyses, significant collinearities (variance inflation factor > 6) were observed between “% of owner-occupied housing units” and “% of renter-occupied housing units”; between “% African American” and “% White; and “% Asian” and “% other race/ethnicity”; and between “% male” and “% female”. To alleviate the inflation in variance caused by the aforementioned collinearities, and based on strength of association with the outcome, we removed percentages of female, total population of owner-occupied housing, White, and Asian from the adjusted model. DFBeta and residual analyses revealed no influential points or potential outliers. A Hosmer and Lemeshow Goodness-of-Fit test was conducted with p = 0.46, indicating the model fit the data well. All statistical analyses were performed using SAS v9.4. Statistical significance was determined a priori at p < 0.05.


Spatial analysis results

Through initial descriptive GIS maps, we found that reported HCV infection case counts and rates were prominent across many MA municipalities between 2002 and 2013 (Fig. 1a, b). Through KDE, we identified areas with the highest densities of HCV cases per square mile. The Greater Boston area, New Bedford, Fall River, Lawrence, Lowell, Worcester, Springfield, Holyoke, and Fitchburg had densities of 477 to 1070 HCV cases per square mile (Fig. 1c). We identified nine statistically significant hotspot clusters for HCV case counts, with the largest clusters in Boston, New Bedford/Fall River, Worcester, and Springfield (p < 0.05) (Fig. 1d). We found clusters for HCV rates per 100,000 persons in Boston, New Bedford, and Springfield (Fig. 1e). Through space-time cluster analyses, we detected similar geographic clusters, with a large coldspot in the Metro West region, and hotspots in the Greater Boston area and the South Shore of Massachusetts (Fig. 1f). We ran hotspot cluster analyses separately for HCV cases among 15–30 year olds and 45–65 year olds and no notable differences in spatial clustering patterns were detected. In Table 1, we highlight agreement in results across our four geostatistical analytical methods, and rank areas of highest concern.

Fig. 1

a HCV cases aggregated at the census tract level in Massachusetts, 2002–2013. b HCV rates per 100,000 in Massachusetts census tracts, 2002–2013. c Density of HCV cases per square mile in Massachusetts, 2002–2013. d HCV clusters in Massachusetts based on reported HCV counts aggregated at the census tract level (n = 1464). Census tracts with elevated numbers of HCV cases (red) represent hotspots (p < 0.05); census tracts with low numbers of HCV cases (blue) represent coldspots (p < 0.05); census tracts with average numbers of HCV cases are represented in yellow. e HCV clusters in Massachusetts based on rates per 100,000 population at the census tract level. Census tracts with elevated HCV rates (red) represent hotspots (p < 0.05); census tracts with low HCV rates (blue) represent coldspots (p < 0.05); census tracts with average HCV rates are represented in yellow; f Space-time clusters of HCV in Massachusetts, 2002–2013

Table 1 Comparison of geostatistical findings highlighting locations with high and low reported HCV infections, Massachusetts, 2002–2013

Statistical results

In Table 2, we present descriptive statistics for the characteristics of the HCV cases we analyzed, comparing HCV cases with and without addresses, and HCV cases with addresses that were successfully and unsuccessfully matched through geocoding. We noted significant differences across HCV case characteristics in both comparisons.

Table 2 Descriptive statistics of HCV cases by address availability and geocoding match status

We present descriptive statistics for HCV hotspot and non-hotspot census tracts in Table 3. We noted significant differences, across race/ethnicity, gender, and socioeconomic variables. In bivariate logistic regression analyses (Table 4), total population, population density, percent of population that was Hispanic, percent high school graduates or higher, percent living in poverty, median household income, and median age were associated with HCV hotspots. In multivariable logistic regression analyses, we found that HCV hotspots were independently and positively associated with the percent of the population that was Hispanic (adjusted odds ratio [AOR]: 1.07; 95% confidence interval [CI]: 1.04, 1.09) and the percent of households receiving food stamps (AOR: 1.83; 95% CI: 1.22, 2.74). HCV hotspots were independently and negatively associated with the percent of the population that were high school graduates or higher (AOR: 0.91; 95% CI: 0.89, 0.93) and the percent of the population in the “other” race/ethnicity category (AOR: 0.88; 95% CI: 0.85, 0.91) (Table 4).

Table 3 Descriptive statistics of Massachusetts census tracts, 2002–2013 (n = 1464)
Table 4 Factors associated with HCV hotspots in Massachusetts, 2002–2013


Our combined use of GIS, spatial epidemiological, and statistical modeling approaches allowed us to identify and characterize statistically significant geographic HCV hotspot clusters. We observed agreement across the different geostatistical methods we conducted, which lends support to the idea that these locations in MA present significantly greater burdens of disease. Our HCV hotspots based on HCV counts highlighted nine clusters, the largest of which included Boston, Fall River, New Bedford, Worcester, and Springfield, where the burden of disease is highest. We detected hotspots for HCV rates per 100,000 persons in Boston, New Bedford, and Springfield. Given that these hotspots are based on rates, controlling for population differences across census tracts in MA, they may represent locations with some of the greatest need for enhanced HCV prevention and treatment interventions. It is notable that each of these cities has a population with lower median age than those found across MA, and that authorized syringe exchange programs have not been available in New Bedford and Springfield, despite initiation of such programs elsewhere in MA in the 1990s, and Worcester, Lawrence and a number of additional sites in 2016.

In our multivariable analyses, we found that the percent of population that was Hispanic and the percent of households receiving food stamps were independently and positively associated with HCV hotspots, while the percent of the population that was another race/ethnicity and the percent of the population that had a high school education or higher were negatively associated with HCV hotspots. These associations help us to understand the underlying sociodemographic factors that are associated with HCV clusters. The associations we observed with racial/ethnic groups differ from recent findings that highlight an emerging HCV epidemic among non-urban, white youth and younger populations [4]. However, the associations we observed between high school education, belonging to the “other” racial/ethnic groups, and HCV hotspots echo the significant health disparities that exist among different racial/ethnic minorities, educational attainment, and income levels for HCV [28]. Since the surveillance data upon which our analyses depend cover the timeframe from 2002 to 2013, it may not be possible to discern more recent patterns in disease transmission. Confirmed and probable cases of HCV infection include individuals that have past exposure, but not current infection. Analysis of more recent surveillance data may begin to uncover different patterns, including the emerging epidemic among young, suburban, white communities.

Our findings may have implications for future disease transmission risks and for the future success of HCV treatment among PWID, for which there are many sound recommendations [29]. PWID have experienced a number of barriers along the HCV care cascade, which includes diagnosis of disease, linkage to care, treatment, retention, and cure. Targeting of future treatment or cure as prevention (CasP) approaches [30, 31] could benefit from spatial epidemiological analyses that identify and characterize HCV hotspot clusters. Antiviral treatment uptake among PWID has been limited by barriers at patient, provider, and structural levels [32]. With the advent of direct acting antiviral (DAA) therapy, many payers, including most state Medicaid programs, continue to require abstinence from illicit drug use and often restrict treatment by fibrosis stage [33]. These policies systematically exclude many PWID who have not yet suffered from decades of infection and present barriers to performing real-life CasP interventions in locations and populations with the highest risk of HCV transmission. In MA, as of August 2016, all versions of Medicaid and most private insurers approve DAAs at fibrosis stage F0 or higher and without strict sobriety criteria, facilitating PWID treatment. However, multiple barriers remain for PWID along the cascade, resulting in a very small fraction likely to reach cure [31].

There are several limitations in the current study. First, HCV surveillance systems may not capture all HCV cases. While MDPH’s MAVEN surveillance and electronic laboratory reporting system has led to improved efficiency and timeliness in capturing HCV data [19], disease surveillance data only include information on individuals who have been tested and reported to health authorities. People who are not tested or are not engaged in care are not represented in these data. Race/ethnicity data are missing or incomplete for many HCV cases, making it difficult to discern representation in the HCV surveillance data by race/ethnicity. Second, the MAVEN system captures address at time of report, typically many years after initial HCV infection, and residence rather than place of infection. As a result, spatial results may represent risk for HCV transmission 10–20 years earlier. However, in the absence of large cohort studies that follow PWID over time, starting at a young age and allowing for calculation of HCV incidence rates and a better understanding of the geographic location where transmission occurred, such case reporting systems offer the best option currently available on a statewide level. Third, our dataset includes people who are positive for anti-HCV antibody through EIA. It is possible that some HCV cases tested positive for antibody but negative for HCV ribonucleic acid, which would over-estimate HCV cases of prevalent infection. Spontaneous viral clearance occurs in approximately 25% of people initially diagnosed with HCV [34]. However, it is unlikely that there are differential spatial distributions for reported HCV cases that ultimately clear the virus compared to cases that do not. Fourth, in our statistical models, we assessed associations at the census tract level, rather than the individual level. HCV surveillance (outcomes) and ACS data (explanatory variables) for the entire state are included in our analyses, and represent the best data available on the micro (i.e., neighborhood) level, and have been used in previous small area analyses with similar outcomes [35]. Finally, differences in characteristics of HCV cases with and without addresses and for matched and unmatched addresses following geocoding can help to illustrate which groups and characteristics may be underrepresented and can inform training needs for completion of future HCV case report forms, which can further enhance the precision of HCV surveillance data as well as spatial and statistical analyses.

Future research

There are a number of unique spatial analytical tools and approaches that are available to explore HCV, and other infection clustering patterns, in MA and the US. Limited spatial epidemiological and geostatistical analysis of surveillance data has been performed to date, and the spatial uncertainty of such analyses is unknown. Future research should incorporate spatiotemporal cluster analyses, additional spatial modeling approaches, including Bayesian spatial and geostatistical models, and spatially weighted regression analyses, to enhance spatial precision and build upon our approaches. In addition, nuanced analyses that compare clustering patterns across different population subgroups and the role of the built environment are needed to determine differences by age cohort, risk behaviors, environment, and race/ethnicity over time.


Combined geospatial and statistical analyses can help identify regions of risk and regions of disease, and inform public health policy decisions. Through our analytical approach, we identified and characterized clusters in MA where enhanced HCV screening, prevention, treatment, and care could help to combat the HCV epidemic.



American Community Survey


Adjusted Odds Ratio


Centers for Disease Control and Prevention


Confidence Interval


Enzyme Immunoassay


Geographic Information System


Hepatitis C Virus


Human Immunodeficiency Virus


Kernel Density Estimates




Massachusetts Virtual Epidemiologic Network


Massachusetts Department of Public Health


People who Inject Drugs


Topologically Integrated Geographic Encoding and Referencing


  1. 1.

    POLARIS. Global prevalence and genotype distribution of hepatitis C virus infection in 2015: a modelling study. Lancet Gastro Hep. 2016;2(3):161–76.

    Google Scholar 

  2. 2.

    Denniston MM, Jiles RB, Drobeniuc J, et al. Chronic hepatitis C virus infection in the United States, National Health and nutrition examination survey 2003 to 2010. Ann Intern Med. 2014;160(5):293–300.

    Article  PubMed  PubMed Central  Google Scholar 

  3. 3.

    Ly KN, Xing J, Klevens RM, Jiles RB, Ward JW, Holmberg SD. The increasing burden of mortality from viral hepatitis in the United States between 1999 and 2007. Ann Intern Med. 2012;156(4):271–8.

    Article  PubMed  Google Scholar 

  4. 4.

    Suryaprasad AG, White JZ, Xu F, et al. Emerging epidemic of hepatitis C virus infections among young nonurban persons who inject drugs in the United States, 2006-2012. Clin Infect Dis. 2014;59(10):1411–9.

    CAS  Article  PubMed  Google Scholar 

  5. 5.

    Williams I. Epidemiology of hepatitis C in the United States. Am J Med. 1999;107(6B):2S–9S.

    CAS  Article  PubMed  Google Scholar 

  6. 6.

    Amon J, Garfein R, Ahdieh-Grant L, et al. Prevalence of hepatitis C virus infection among injection drug users in the United States, 1994-2004. Clin Infect Dis. 2008;46(12):1852–8.

    Article  PubMed  Google Scholar 

  7. 7.

    Armstrong G, Wasley A, Simard E, McQuillan G, Kuhnert W, Alter M. The prevalence of hepatitis C virus infection in the United States, 1999 through 2002. Ann Intern Med. 2006;144(10):705–14.

    Article  PubMed  Google Scholar 

  8. 8.

    Garfein RS, Doherty MC, Monterroso ER, Thomas DL, Nelson KE, Vlahov D. Prevalence and incidence of hepatitis C virus infection among young adult injection drug users. J Acquir Immune Defic Syndr Hum Retrovirol. 1998;18(Suppl 1):S11–9.

    Article  PubMed  Google Scholar 

  9. 9.

    Hepatitis C. Virus infection among adolescents and young adults:Massachusetts, 2002-2009. MMWR Morb Mortal Wkly Rep. 2011;60(17):537–41.

    Google Scholar 

  10. 10.

    Bureau of Infectious Diseases, MDPH. Shifting epidemics: HIV and hepatitis C infection among injection drug users in Massachusetts. 2012.

    Google Scholar 

  11. 11.

    Institute of Medicine (US). Committee on the Prevention and Control of Viral Hepatitis Infection. In: Colvin HM, Mitchell AE, editors. Washington (DC): National Academies Press (US); 2010.

  12. 12.

    Moonan PK, Oppong J, Sahbazian B, et al. What is the outcome of targeted tuberculosis screening based on universal genotyping and location? Am J Respir Crit Care Med. 2006;174(5):599–604.

    Article  PubMed  Google Scholar 

  13. 13.

    Geanuracos CG, Cunningham SD, Weiss G, Forte D, Reid LM, Ellen JM. Use of geographic information systems for planning HIV prevention interventions for high-risk youths. Am J Public Health. 2007;97(11):1974–81.

    Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Magnus M, Kuo I, Shelley K, et al. Risk factors driving the emergence of a generalized heterosexual HIV epidemic in Washington, District of Columbia networks at risk. AIDS. 2009;23(10):1277–84.

    Article  PubMed  Google Scholar 

  15. 15.

    Goswami ND, Hecker EJ, Vickery C, et al. Geographic information system-based screening for TB, HIV, and syphilis (GIS-THIS): a cross-sectional study. PLoS One. 2012;7(10):e46029.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Morano JP, Gibson BA, Altice FL. The burgeoning HIV/HCV syndemic in the urban northeast: HCV, HIV, and HIV/HCV coinfection in an urban setting. PLoS One. 2013;8(5):e64321.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Brouwer KC, Rusch ML, Weeks JR, et al. Spatial epidemiology of HIV among injection drug users in Tijuana. Mexico Ann Assoc Am Geogr. 2012;102(5):1190–9.

    Article  PubMed  Google Scholar 

  18. 18.

    Meyers DJ, Hood ME, Stopka TJ. HIV and hepatitis C mortality in Massachusetts, 2002-2011: spatial cluster and trend analysis of HIV and HCV using multiple cause of death. PLoS One. 2014;9(12):e114822.

    Article  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Heisey-Grove DM, Church DR, Haney GA, Demaria A. Enhancing surveillance for hepatitis C through public health informatics. Public Health Rep. 2011;126(1):13–8.

    PubMed  PubMed Central  Google Scholar 

  20. 20.

    United States Census Bureau. American FactFinder. Available at:

  21. 21.

    Drummond WJ. Address matching: GIS technology for mapping human activity patterns. J Am Plan Assoc. 1995;61(2):240–51.

    Article  Google Scholar 

  22. 22.

    Krieger N, Waterman P, Lemieux K, Zierler S, Hogan JW. On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research. Am J Public Health. 2001;91(7):1114–6.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  23. 23.

    Cooper HL, Bossak BH, Tempalski B, Friedman SR, Des Jarlais DC. Temporal trends in spatial access to pharmacies that sell over-the-counter syringes in new York City health districts: relationship to local racial/ethnic composition and need. J Urban Health. 2009;86(6):929–45.

    Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Ord J, Getis A. Local spatial autocorrelation statistics: distributional issues and an application. Geogr Anal. 1995;27:286–306.

    Article  Google Scholar 

  25. 25.

    Bailey SL, Ouellet LJ, Mackesy-Amiti ME, et al. Perceived risk, peer influences, and injection partner type predict receptive syringe sharing among young adult injection drug users in five U.S. cities. Drug Alcohol Depend. 2007;91(Suppl 1):S18–29.

    Article  PubMed  Google Scholar 

  26. 26.

    Kulldorff M, Heffernan R, Hartman J, Assunção R, Mostashari F. A space-time permutation scan statistic for disease outbreak detection. PLoS Med. 2005;2(3):e59.

    Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Stopka TJ, Krawczyk C, Gradziel P, Geraghty EM. Use of spatial epidemiology and hot spot analysis to target women eligible for prenatal women, infants, and children services. Am J Public Health. 2014;104(Suppl 1):S183–9.

    Article  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Tohme RA, Xing J, Liao Y, Holmberg SD. Hepatitis C testing, infection, and linkage to care among racial and ethnic minorities in the United States, 2009-2010. Am J Public Health. 2013;103(1):112–9.

    Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Grebely J, Robaeys G, Bruggmann P, et al. Recommendations for the management of hepatitis C virus infection among people who inject drugs. Int J Drug Policy. 2015;26(10):1028–38.

    Article  PubMed  Google Scholar 

  30. 30.

    Cousien A, Tran VC, Deuffic-Burban S, Jauffret-Roustide M, Dhersin JS, Yazdanpanah Y. Hepatitis C treatment as prevention of viral transmission and liver-related morbidity in persons who inject drugs. Hepatology. 2016;63(4):1090–101.

    Article  PubMed  Google Scholar 

  31. 31.

    Martin NK, Vickerman P, Grebely J, et al. Hepatitis C virus treatment for prevention among people who inject drugs: modeling treatment scale-up in the age of direct-acting antivirals. Hepatology. 2013;58(5):1598–609.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  32. 32.

    Robaeys G, Grebely J, Mauss S, et al. Recommendations for the management of hepatitis C virus infection among people who inject drugs. Clin Infect Dis. 2013;57(Suppl 2):S129–37.

    Article  PubMed  Google Scholar 

  33. 33.

    Barua S, Greenwald R, Grebely J, Dore GJ, Swan T, Taylor LE. Restrictions for Medicaid reimbursement of Sofosbuvir for the treatment of hepatitis C virus infection in the United States. Ann Intern Med. 2015;163(3):215–23.

    Article  PubMed  Google Scholar 

  34. 34.

    Conrad C, Bradley HM, Broz D, et al. Community outbreak of HIV infection linked to injection drug use of Oxymorphone - Indiana, 2015. MMWR Morb Mortal Wkly Rep. 2015;64(16):443–4.

    PubMed  Google Scholar 

  35. 35.

    Krieger N, Chen JT, Waterman PD, Rehkopf DH, Subramanian SV. Race/ethnicity, gender, and monitoring socioeconomic gradients in health: a comparison of area-based socioeconomic measures--the public health disparities geocoding project. Am J Public Health. 2003;93(10):1655–71.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We are grateful to Dr. Monina Klevens, Dr. Al DeMaria, and Mr. Kevin Cranston for their invaluable review of an earlier version of this manuscript.


This research was funded in part by a 2014 developmental and 2015 supplemental grant from the Providence/Boston Center for AIDS Research (PI: Stopka). The project described was supported by Grant Number P30AI042853 from the National Institute of Allergy and Infectious Diseases. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Allergy and Infectious Diseases or the National Institute of Health.

Availability of data and materials

The data that support the findings of this study are available from the Massachusetts Department of Public Health but restrictions apply to the availability of these data, which were used under agreement and IRB approval for the current study, and so are not publicly available. Data may be available from the Massachusetts Department of Public Health:

Authors’ contributions

TS conceptualized the study, oversaw study implementation and all GIS and spatial analyses, and interpreted findings. MG developed statistical models and interpreted the analytical results regarding factors associated with HCV hotpots. DM conducted GIS and spatial analyses and interpreted HCV hotspots results. MH contributed to GIS mapping and spatial analyses, and assisted in interpreting HCV hotspots results. KB configured the MDPH HCV surveillance dataset, facilitated access to data, and assisted with variable definitions and interpretation of findings. SO provided guidance on the HCV surveillance data and informed analytical approaches and interpretation of findings. DC oversaw HCV surveillance data access and informed analytical approaches and interpretation of findings. AD assisted with GIS mapping and spatial analysis of HCV surveillance data. KC oversaw all statistical modeling and analyses and contributed to interpretation of results. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

This study was reviewed and approved by the Tufts University Health Sciences and Massachusetts Department of Public Health Institutional Review Boards.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information



Corresponding author

Correspondence to Thomas J. Stopka.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Stopka, T.J., Goulart, M.A., Meyers, D.J. et al. Identifying and characterizing hepatitis C virus hotspots in Massachusetts: a spatial epidemiological approach. BMC Infect Dis 17, 294 (2017).

Download citation


  • Getis-Ord Gi*
  • Spatial clusters
  • GIS
  • Surveillance
  • Infectious diseases