Skip to main content
  • Research article
  • Open access
  • Published:

Human papillomavirus (HPV) types 16, 18, 31, 45 DNA loads and HPV-16 integration in persistent and transient infections in young women



HPV burden is a predictor for high-grade cervical intraepithelial neoplasia and cancer. The natural history of HPV load in young women being recently exposed to HPV is described in this paper.


A total of 636 female university students were followed for 2 years. Cervical specimens with HPV-16, -18, -31, or -45 DNA by consensus PCR were further evaluated with type-specific and β-globin real-time PCR assays. Proportional hazards regression was used to estimate hazard ratios (HR) of infection clearance. Generalized estimating equations assessed whether HPV loads was predictive of HPV infection at the subsequent visit.


HPV loads were consistently higher among women <25 years old, and those who had multiple sex partners, multiple HPV type infections and smokers. HPV-16 integration was encountered only in one sample. Infection clearance was faster among women at lower tertiles of HPV-16 (HR = 2.8, 95%CI: 1.0-8.1), HPV-18 (HR = 3.5, 95%CI: 1.1-11.2) or combined (HR = 2.4, 95%CI: 1.8-6.2) DNA loads. The relationship between HPV-16 and HPV-18 DNA loads and infection clearance followed a clear dose-response pattern, after adjusting for age and number of sexual partners. GEE Odds Ratios for HPV persistence of the middle and upper tertiles relative to the lower tertile were 2.7 and 3.0 for HPV-16 and 3.8 and 39.1 for HPV-18, respectively. There was no association between HPV-31 or -45 DNA loads and persistence.


The association between HPV load and persistence is not uniform across high-risk genital genotypes. HPV-16 integration was only rarely demonstrated in young women.

Peer Review reports


Sexually active women are at risk for genital infection by human papillomavirus (HPV). Most genital HPV infections regress within two years and only a minority of women will develop persistent HPV infection that could eventually cause cervical intraepithelial neoplasia (CIN). High-grade CIN (grades 2 and 3) are precursors of invasive cervical cancer. Although measuring persistence has prognostic value in understanding the natural history of HPV infection and CIN, there is a need for studying additional virological endpoints to assist in risk prediction.

HPV-16 DNA load seems to be independently associated with high-grade CIN and invasive cancer [16]. Most studies also reported that HPV load was an ancillary marker for persistent HPV infection [3, 711]. HPV-16 or 18 infections are cleared more slowly than infections caused by other high-risk types [12]. Since the biological behavior of HPV types differ, the predictive value for persistence of HPV DNA load may also vary between types [13]. We know little about type-specific viral loads and their relation with clearance of HPV infection. Moreover, most studies on HPV viral load have focused on older women at risk for CIN. The evaluation of HPV viral load in recently-infected younger women remains largely unexplored.

Integration of high oncogenic risk HPV types (HR-HPV) is considered to be a key event in the progression of CIN to invasive cancer [14]. Recent data casts doubt on the notion that viral integration into the host genome is a marker of progression to CIN2/3 [1520]. Indeed, integrated HPV-16 DNA can be detected in women with CIN-1 or normal cervical samples, although these results were not confirmed by others [21]. It is important to establish whether HPV integration occurs early in the course of HPV infection to assess its contribution to carcinogenesis. Overall, the longitudinal assessment of HR-HPV load and integration in the natural history of HPV infection considering various viral outcomes such as clearance and persistence has received little attention up to now. In 1996, we began a prospective cohort study of young women attending college in Montreal, Canada, to investigate the epidemiologic determinants of persistent and transient cervical HPV infections [22, 23].

The focus of the current study was to assess prospectively, in this cohort of young women, the time course and association between HPV-16 integration, HPV-16, 18, 31 and 45 DNA loads, and type-specific viral outcomes.


Study subjects

Female students attending either McGill University or Concordia University Health Clinics were invited to participate if they intended to remain in Montreal for the next 2 years and had not been treated for cervical disease in the last 12 months [22]. A total of 636 female university students were recruited between November 1996 and January 1999, and were followed for 2 years; with clinic visits every 6 months. Detailed information was obtained at enrolment via a self-administrated questionnaire and changes in lifestyle characteristics were obtained at each follow-up visit with an abridged questionnaire, as described previously [22]. Two cervical samples were collected with an Accelon cervical biosampler at every visit. A Papanicolaou smear was prepared with the first sampler. The remaining cells along with those collected with the second sampler were processed for HPV testing. Informed consent was obtained from all study participants. The study was reviewed and approved by the Ethics committees from each participating institution.

HPV DNA testing

HPV DNA testing has been described elsewhere [22]. Briefly, five μl of sample DNA purified with QIAamp columns (Qiagen, CA) were first amplified for β-globin with PC04/GH20 primers to demonstrate the integrity of extracted DNA. β-globin-positive specimens were further tested with the L1 consensus HPV primers MY09/MY11 and HMB01 using AmpliTaq gold (Roche Diagnostic Systems, Laval, Canada) and with the Line blot assay for the detection of 27 genital HPV genotypes [22].

HPV-16, 18, 31 and 45 viral loads

A total of 382 specimens collected from 183 participants were positive for HPV-16, -18, -31 or -45 DNA. Nineteen women were infected concurrently with two or more of these genotypes. Extracted DNA from these samples was first screened for the presence of PCR inhibitors by amplification of internal controls for HPV-16, HPV-18, HPV-31 or HPV-45, and for ί-globin DNA, as described previously [18, 24]. The presence of PCR inhibitors was suspected when 1000 copies of at least one internal control generated a signal corresponding to less than 700 copies, as previously described [25]. All samples were free of inhibitors. Two μl of processed sample were tested in duplicate for quantification of ί-globin DNA to estimate the cell content of samples [18, 24]. HPV-16 E6 and HPV-18 E7 DNA was quantified using a standard protocol [26]. HPV-31 L1 DNA was measured with the assay described by Weissenborn et al. [27]. HPV-45 E6 DNA was amplified in a Light Cycler PCR and detection system (Roche Molecular Systems) in a 20-μl reaction mixture containing 1× DNA Master Hybridization Probe Mix with the Fast Start Taq DNA polymerase (Roche Molecular Biochemicals), 0.3 pmoles of each HPV-45 primer 45 E6-F (nucleotide position 463-486; 5'-TTAAGGACAAACGAAGATTTCACA-3') and 45 E6-R (nucleotide position 670-647; 5'-ACACAACAGGTCAACAGGATCTAA-3'), and 50 nM of fluorogenic 45 E6-TM probe (nucleotide position 491-514; FAM-5'-AGCTGGACAGTACCGAGGGCAGTG-3'-TAMRA). Cycling parameters included an activation step at 95°C for 10 min followed by 50 cycles at 95°C for 15 sec, 60°C for 5 sec and 65°C for 45 sec. For each of the four genotypes analyzed, cycle thresholds obtained for each sample were compared to those of a titration curve obtained by serial ten-fold dilutions of HPV-16, 18, 31 or 45 plasmids in a fixed amount of 75 ng of human genomic DNA (Roche Diagnostics) in 10 mM Tris-HCl [pH 8.2]. Each assay consistently detected 10 HPV DNA copies (data not shown). HPV viral loads were expressed as the number of HPV DNA copies per cell.

HPV-16 integration assays

The presence of integrated HPV-16 was investigated with real-time PCR assays targeting E6 and E2, as previously described [28]. Since disruption of the E2 gene often results in HPV integration, detection of a greater quantity of HPV-16 E6 compared to HPV-16 E2 strongly suggests the presence of integrated HPV-16 DNA [16, 17]. Two μl of each processed sample was tested in duplicate in each HPV-16 E6 and E2 assays. Cycle thresholds were compared to those of serial ten-fold dilutions of an HPV-16 plasmid in a fixed amount of 2,000 copies of human genomic DNA (Roche Diagnostics). We assumed that HPV-16 DNA was integrated into the host genome if the ratio of copy numbers for HPV-16 E6 and E2 (HPV-16 E6/E2) in a specimen was two or higher [18, 28, 29].

HPV-16 integration was confirmed by restriction site PCR (RS-PCR), a sequencing technique that demonstrates the presence of HPV-16 and human DNA junctions in the same amplicon [30]. Briefly, 9 HPV-16-specific primers spanning the entire HPV-16 genome were combined separately with 6 restriction site oligonucleotides designed to anneal on selected restriction sites on the human genome, in a two-step hemi-nested PCR performed in a 9600 Thermal Cycler. Amplicons were migrated in a 2% agarose gel stained with ethidium bromide. When visible bands were obtained, direct double-stranded PCR-sequencing was done by a cycle-sequencing method (BigDye terminator ready reaction kit, Perkin-Elmer) using the internal primers and a sequencing primer on 20 ng of purified amplicon.

Statistical Methods

We used Wilcoxon rank sum tests to compare type-specific viral loads among visits. The strength of the correlations among log-transformed viral load measurements at entry and follow-up visits for all four HPV types was measured using Pearson correlation coefficient (r). Geometric means of viral loads were calculated as a function of selected characteristics reported at the index visit of first positivity for a specific HPV type. The Kaplan-Meier technique was used to estimate the cumulative probability of infection clearance as a function of time since enrolment into the study. Proportional hazards regression was used to estimate the relative rate of clearance of type-specific infection, stratified by tertiles of the respective viral load distributions [31]. Logistic generalized estimating equations (GEE) were used to assess whether viral load measured at a given visit was a predictor of persistent HPV infection at the subsequent visit [32, 33]. Crude and adjusted (for age and sexual partners) odds ratios (OR) between persistent HPV at visit (t) and viral load at visit (t-1) within specified periods of follow-up were calculated to measure the magnitude of the longitudinal associations.


The 636 women enrolled in the McGill-Concordia cohort contributed 2694 completed visits (mean of 4.2 visits/subject) during an average of 21.5 months of follow-up. Nearly all cervical specimens (n = 2,626, 97.4%) were suitable for HPV testing. Among those enrolled in the cohort, 68% completed all 5 visits, contributing a total of 13,353 woman-months of follow-up. The mean age of participants was 23 years (median, 21 years; range, 17-42 years). Four in five women reported that they were Caucasian, and one in four are smokers. Nearly half of participants had more than 4 lifetime sexual partners. The prevalence (any HPV = 29%, high risk HPV = 22%), cumulative incidence rates (cumulative incidence of any HPV = 36% and high risk HPV = 29% in 2 years) and mean duration of HPV infections have been reported elsewhere [22]. High-grade and low-grade squamous intraepithelial lesions (SIL) were found in only 4 and 49 Pap smears, respectively, which precluded the analysis of association between HPV loads and SIL.

Samples positive for HPV-16 (n = 220 from 104 women, mean of 2.1 samples/woman), HPV-18 (n = 80 from 43 women, mean of 1.9 samples/woman), HPV-31 (n = 75 from 36 women, mean of 2.1 samples/woman), or HPV-45 (n = 33 from 19 women, mean of 1.7 samples/woman), were further tested with type-specific quantitative real-time PCR assays. Nineteen samples contained more than one of the studied genotype. Descriptive statistics of log-transformed HPV loads stratified by baseline and follow-up time points are presented in Figure 1. Except for the fluctuation in HPV-45 loads, which could be due to the small number of samples, we observed no significant between-visit variation among type-specific viral loads.

Figure 1
figure 1

HPV loads (HPV DNA copies per cell) at enrolment and at follow-up visits (total 5 visits) among HPV-positive women for the 4 genotypes studied (Y-axis in log scale). The length of each box corresponds to the interquartile range, with the top boundary of the box representing 75th and bottom boundary the 25th percentile. The horizontal line in the box indicates the median value. Outlier values are shown in circles outside the boxes.

To investigate the consistency of type-specific viral load over time we calculated correlation coefficients for measurements taken in all possible pairs of visits (Table 1). HPV loads were significantly correlated only when considering consecutive visits. The strength of between-visit correlations seemed to become weaker as time progressed. Stronger correlations between consecutive visits were found in women with HPV-16 infection. Correlation coefficients of HPV loads were non-significant when visits 12 months apart were compared, except when viral loads for all four HPV types were combined (Table 1 and Figure 2).

Table 1 Between-visit correlation coefficients of HPV type-specific and combined viral load measurements at entry and follow-up visits in the McGill-Concordia cohort.
Figure 2
figure 2

Between-visit correlation graphs of viral load measurements of four combined HPV types (HPV-16, 18, 31, 45) at enrolment and follow-up visits (see material and methods). For example, the upper left graph shows viral load at entry versus viral load at 6 months, the lower left graph shows viral load at entry versus viral load at 24 months.

Table 2 shows geometric mean viral loads and respective 95% confidence intervals according to selected characteristics determined at the first visit in which positivity was detected for each given HPV type. HPV DNA loads were slightly higher among women younger than 25 years of age, with >2 lifetime sex partners, who were regular oral contraceptive users, and with sequential HPV co-infections but not those with concurrent co-infections. However, confidence intervals were largely overlapping for most comparisons. HPV DNA loads did not show a consistent pattern of variation according to smoking categories or condom use.

Table 2 Means of HPV viral loads according to selected characteristics at the first occurrence of positivity for a given HPV type

Once detected, 30% of HPV-16, 44% of HPV-18, 63% of HPV-31 and 55% of HPV-45 infections cleared within 6 months. Table 3 shows the relative rates of clearance of HPV infection according to the HPV DNA load for each genotype classified into tertiles. For HPV-16 and HPV-18, infection clearance was inversely associated with viral load but no clear pattern of association emerged for HPV-31 and HPV-45. Owing to the preponderance of HPV-16 and HPV-18 in the cohort the overall association between combined viral loads and clearance maintained the inverse pattern seen with the latter types. Figure 3 shows the cumulative rates of positivity by tertile of the combined viral load distribution.

Table 3 Relative rates* of type-specific and combined HPV infection clearance according to viral load
Figure 3
figure 3

Cumulative positivity rate for infections with HPV types 16, 18, 31, and 45 according to tertiles of cumulative viral load for these 4 types. Unit of analysis is an individual infection episode, which makes observations not independent as a subject may contribute multiple episodes. Dotted line: upper tertile; dashed line: middle; solid line: lower tertile viral load.

The association between HPV viral load and persistence of infection was investigated for each genotype by calculating crude and adjusted (for age and number of sexual partners) GEE ORs (Table 4). HPV persistence was significantly associated with intermediate and high viral load levels at a preceding visit for HPV-16 and HPV-18. There was no consistent effect of viral load on persistence for HPV-31 and HPV-45 infections.

Table 4 Odds ratios for associations between persistent HPV infection at visit (t) and viral load at visit (t-1) estimated via GEE models

We then investigated if persistent HPV-16 infection and/or high HPV-16 viral loads in young women could result in integration of HPV-16 DNA into the human genome. Forty-eight specimens were excluded from this analysis because they contained <15 copies of HPV-16 DNA per μl of extracted DNA. HPV-16 E6 and E2 DNA were thus quantitated on the remaining 172 HPV-16-positive samples that had sufficiently high viral load to permit reliable measurement of integration. The mean HPV-16 E6 viral load on the selected samples was 57.5 ± 324.6 DNA copies per cell (95% confidence interval, 8.6-106.4; median, 0.92, range 0.0001-4084.7 copies per cell). The mean HPV-16 E6/E2 ratio was 0.97 ± 0.25 (95% confidence interval, 0.93-1.01; median, 0.97; range, 0.5-2.48). Of the 172 specimens analyzed, 169 had a HPV-16 E6/E2 ratio <1.5, two samples had HPV-16 a E6/E2 ratio >1.5 and <2.0 (1.54 and 1.70) and one had a ratio above 2. Samples collected at consecutive visits from the two participants with HPV-16 ratios of 1.54 and 1.70 yielded HPV-16 E6/E2 ratios near or below 1.0 (data not shown). The only sample collected from a participant that generated a HPV-16 E6/E2 above 2 (ratio of 2.48) contained 0.94 copy/cell of HPV-16 E6 DNA and was obtained at the last visit. HPV-16 was detected only at the last of five visits attended by this participant. Normal cytology smears were obtained at the first four visits for this participant while a low-grade SIL smear was obtained at the fifth visit. RS-PCR was performed on the 18 samples that generated a HPV-16 E6/E2 ratio ≥1.2. Despite using several primer combinations, we could not demonstrate the presence of cellular and HPV junctions in any of the samples tested. A minimal amount of 35 μg of cellular DNA per test was analyzed for the only sample with a ratio >2, unsuccessfully, which could have limited our ability to sequence HPV-human DNA junctions.

Since we detected HPV-16 integrated forms only once, we investigated if quantity of cellular DNA introduced in the quantitative assays hampered our ability to measure HPV-16 E6 and E2, as reported by another group [34]. When mixtures of episomal HPV-16 and DNA extracted from SiHa cells were tested, we observed interference with quantitation of HPV-16 E2 and E6 with DNA extracted from 106 and from 104 SiHa cells, respectively (data not shown). One thousand copies of episomal HPV-16 DNA was detected without loss of signal when tested in a mixture of DNA extracted from up to 200,000 cells in the HPV-16 E2 assay and up to 40,000 cells in the HPV-16 E6 test (Table 5). Interference of HPV-16 quantitation by input DNA was not an issue in our study since all samples tested contained ≤39,200 copies of cellular DNA per test.

Table 5 Interference of background human DNA in quantitation of HPV-16 DNA with HPV-16 E6 and HPV-16 E2 real-time PCR assays.


We measured HPV DNA load for four high-risk types (16, 18, 31, 45) with real-time PCR on a set of samples collected prospectively in young women. These four genotypes are amongst the most frequently detected in cervical cancer. Contrary to cross-sectional studies of older women, the four HPV genotypes were detected at similar loads and were not substantially different between women with single and multiple type infections [1, 11, 35].

The quantitative real-time PCR assays we utilized to estimate HPV loads were specific and reproducible [25, 28]. The number of HPV DNA copies was normalized for cell content by quantitation of β-globin DNA. The HPV-16 integration assay was devised considering the genetic diversity of HPV-16 [28, 36]. Using type-specific quantitative assays allowed isolating the effect of HPV type loads in multiple type infections. The cohort design also allowed testing longitudinal correlations between pairs of visits. Consistent measurements for HPV types 16, 18, and 31 were shown for the five visits with evidence of correlation between loads among visits within subjects, whenever statistical precision was sufficiently high.

As reported by others, younger woman (<25 years) harbored higher HPV loads than those >25 years of age [1]. These women were possibly exposed to HPV while they were immunologically naïve to HPV. We also observed a somewhat greater HPV burden among current and former smokers, although not consistently for all four HPV types. This finding could be explained by a possible defective cell-mediated immunity against HPV induced by tobacco [37]. Results from this cohort as well as those of others suggest that tobacco smoking may increase the duration of high-risk HPV infection [23, 38]. This could be explained in part by the increased replication of HPV in women exposed to tobacco.

Although the regular use of condoms protects against most sexually transmitted infections, they are not as efficient against HPV infection [39]. We found a trend with the consistent use of condoms for having higher HPV loads for most genotypes. These results could be biased because use of condoms may be associated with risky sexual behavior or exposure to new sexual partners [23]. Condom use has been associated in one study with regression of CIN and clearance of HPV [40]. Although oral contraceptive use did not modify the duration of high-risk HPV infection in our cohort [23], HPV-18 DNA loads were markedly increased in women on oral contraceptives. Oral contraceptives may also be a proxy for a higher number of sexual partners.

HPV-16 and -18 loads were good predictors of the duration of infection with these types but the same finding was not held for HPV-31 and -45. HPV-16 load has been reported by others to be a stronger predictor for persistence or lesions than HPV-18, 31 or 33 loads [13]. There was a clear dose-response relationship between HPV load and persistence of HPV-16 and HPV-18 infections. We found that clearance rates depended largely on the level of HPV load. Viral-host interactions play an influential role in the clearance of viral infections [41]. HPV has developed several mechanisms to evade the host immune system [42]. Functional differences between HPV-encoded proteins could also explain why some types and variants have a better viral fitness with a greater ability to persist [14, 41, 43]. For HPVs 16 and 18, viral loads were greater with HPV co-infections at different visits (sequential) than concurrent co-infection. In recent studies, two or more oncogenic HPV types diagnosed concurrently did not confer an additional risk of developing lesions [44, 45]. All but one study confirmed that sustained or increased viral loads, especially with HPV-16, were predictive of persistent infection [3, 5, 8, 11]. In a cohort of nearly 6,000 women in France, women with HPV loads above 10 pg/ml were less likely to clear the infection, irrespective of the age of participants [8]. Similarly, another cohort study conducted in the Netherlands reported that women with low HPV-16 loads were five times more likely to clear HPV-16 infection [5]. In a third study conducted in Brazil, there was a dose-response relationship between increasing viral loads and risk of incident abnormal smear over time [3].

HPV-16 integration often results in the disruption of the E2 gene, leading to uncontrolled expression of HPV-16 oncoproteins [14, 17]. HPV-16 integration could thus occur in the course of persistent HPV-16 infection and increase the likelihood of progression to higher grade lesions. Integration of the HPV-16 genome was believed to occur at the CIN-2,3 stage and beyond [14]. Recent studies support that integration can occur even in the absence of CIN [15, 28]. In one study conducted in women age 15 to 19 years old [46], disruption of the E2 gene was demonstrated in up to 25% of incident HPV-16 infections, suggesting that HPV-16 E2 disruption was a common event occurring early during infection. In contrast to this latter study, HPV-16 E2 disruption was rare in our study population, perhaps because unlike Collins et al [46] who studied the entire E2 gene for disruption our assay only focused on the hinge region of E2. While disruption of E2 is thought to occur more frequently in the very early phases of infection in younger women at least one other study of older women followed longitudinally, observed approximately 50% of women with persistent or transient infections to have mixed integrated and episomal forms [11]. However, unlike our study, over 50% of these participants had Pap smear abnormalities. Furthermore, HPV integration was not found to be associated with persistence [11]. Results from other cohorts are needed to assess the rate of integration and to clarify the role of HPV integration on persistence at early stages of infection.

There are some limitations to our study. We cannot exclude the possibility that we may have underestimated HPV-16 integration due to HPV-16 disruption during integration in areas outside of the E2 hinge region. Nevertheless, HPV-16 E2 hinge is the most frequently disrupted site in studies conducted in North America [16].

The analytical sensitivity of real-time PCR assays for quantitation of HPV-16 E6 and E2 DNA utilized in this study has been demonstrated previously and shown to be excellent [18, 28, 26]. These reagents were optimized to avoid any influences of viral polymorphism on the efficiency of HPV-16 DNA amplification [28]. Nevertheless, the clinical sensitivity and specificity of these assays to detect HPV integration have not been thoroughly assessed. Reconstitution experiments have demonstrated that HPV-16 integration is detected with real time PCR assays measuring E6 and E2 when integrated HPV-16 forms are in 100-fold excess of episomal HPV-16 DNA [47]. Theoretically, a HPV-16 E6/E2 ratio above 1.0 could suggest integration. However, previous work on HPV-6, HPV-16 and -33 have demonstrated that HPV E6/E2 ratios below 2.0 results from assay variability rather than true differences between E6 and E2 quantities [2, 18, 29, 34]. Real time PCR tests may thus be falsely negative for HPV-16 integration in the presence of low quantities of integrated forms and high quantities of episomal forms.

We could not confirm the presence of HPV-16 integration with a standard technique identifying the presence of HPV-human DNA junctions in the only sample with a HPV-16 E6/E2 ratio above 2.0. However, the quantity of sample that could be analyzed was limited. A recent report using a similar technique to demonstrate the presence of HPV-human junctions did not find HPV-16 integration in specimens from women with low-grade SIL [21]. Real-time PCR assays may be helpful to detect integrated HPV forms but further studies on greater number of specimens comparing different techniques for detection of integrated HPV-16 need to be conducted.

In our study, few women had abnormal smears, reducing our power to test the associations between HPV load and lesion outcomes. The time interval between visits can influence the assessment of persistence and clearance. The majority of women in our cohort returned within 6 months of each visit and there was only a small proportion of women whose time interval between visits extended beyond one year [22]. The association between higher viral loads and persistence would only be distorted if it had been associated differentially with time between visits. Given that the participants were unaware of their HPV and viral load status, this is an unlikely scenario. It is also unlikely that their behaviour and other risk factors will change in this short span of time. The same associations between HPV loads and persistence were obtained when HPV persistence was defined more stringently by using three consecutive HPV-positive visits for the same type.

Consecutive detection of HPV DNA is due to either ongoing viral replication, reactivation of latent infections or new infections [48, 49]. The design of our study cannot discriminate between these three possibilities. Participants considered as having persistent infection could have indeed, been reinfected with another isolate of the same HPV type. However, this seems unlikely because women with persistent infection were all infected with the same intratypic HPV variant. Both prevalent and incident HPV infections were included in our analysis of persistence, increasing the power of our analyses. Although it may not have a direct implication on HPV clearance, the exact duration of prevalent cases is unknown. By including them we could have introduced a bias because a greater proportion of prevalent HPV infections represent persistent infections compared to incident infections. However, our conclusions did not change when we restricted our analyses to incident infections (data not shown). We also analyzed the entire cohort to utilize the clustered binary outcome of persistence, by using the 'visit number' as panel variable in logistic GEE models. Apart from the gain in precision when using multiple visits from the same women, the findings from these models were in agreement with those from Cox models that analyzed clearance in individual women by tertile of viral load. Therefore, we conclude that the bias incurred due to inclusion of prevalent cases is minimal. We also doubt that misclassification of HPV status may have affected our results since there were very few women with persistent type-specific infections who had an intervening visit with a negative HPV test result and more than 80% of the same type persistent infections occurred during consecutive visits [22].


In conclusion, this study demonstrates a clear association between HPV load and persistence of HPV-16 and 18 infections in young women at the early stages of their sexual life. The association between viral load and infection clearance seems to be type specific and, except for age, it is not substantially affected by sociodemographic characteristics and other risk factors for infection. More longitudinal studies are needed to clarify the onset of HPV integration and its relationship with disease progression.


  1. Flores R, Papenfuss M, Klimecki WT, Giuliano AR: Cross-sectional analysis of oncogenic HPV viral load and cervical intraepithelial neoplasia. Int J Cancer. 2006, 118: 1187-1193. 10.1002/ijc.21477.

    Article  CAS  PubMed  Google Scholar 

  2. Tsai HT, Wu CH, Lai HL, Li RN, Tung YC, Chuang HY, Wu TN, Lin LJ, Ho CK, Liu HW, Wu MT: Association between quantitative high-risk human papillomavirus DNA load and cervical intraepithelial neoplasm risk. Cancer Epidemiol Biomarkers Prev. 2005, 14: 2544-2549. 10.1158/1055-9965.EPI-05-0240.

    Article  CAS  PubMed  Google Scholar 

  3. Schlecht NF, Trevisan A, Duarte-Franco E, Rohan T, Ferenczy A, Villa LL, Franco EL: Viral load as a predictor of the risk of cervical intraepithelial neoplasia. Int J Cancer. 2003, 103: 519-524. 10.1002/ijc.10846.

    Article  CAS  PubMed  Google Scholar 

  4. Ylitalo N, Sorensen P, Josefsson AM, Magnusson PKE, Anderson PK, Ponten J, Adami HO, Gyllensten UB, Melbye M: Consistent high viral load of human papillomavirus 16 and risk of cervical carcinoma in situ: a nested case-control study. Lancet. 2000, 355: 2194-2198. 10.1016/S0140-6736(00)02402-8.

    Article  CAS  PubMed  Google Scholar 

  5. Van Duin M, Snijders PJE, Schrijnemakers HFJ, Voorhorst FJ, Rozendaal L, Nobbenhuis MAE, van den Brule AJC, Verheijen RHM, Helmerhorst TJ, Meijer CJLM: Human papillomavirus 16 load in normal and abnormal cervical scrapes: an indicator of CIN II/III and viral clearance. Int J Cancer. 2002, 98: 590-595. 10.1002/ijc.10232.

    Article  CAS  PubMed  Google Scholar 

  6. Gravitt PE, Kovacic MB, Herrero R, Schiffman M, Bratti C, Hildesheim A, Morales J, Alfaro M, Sherman ME, Wacholder S, Rodriguez AC, Burk RD: High load for most high risk human papillomavirus genotypes is associated with prevalent cervical cancer precursors but only HPV16 load predicts the development of incident disease. Int J Cancer. 2007, 121: 2787-2793. 10.1002/ijc.23012.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Ho GYF, Burk RD, Klein S, Kadish AS, Chang CJ, PalanP BasuJ, Tachezy R, Lewis R, Romney S: Persistant genital human papillomavirus infection as a risk factor for persistent cervical dysplasia. J Natl Cancer Inst. 1995, 87: 1365-1371. 10.1093/jnci/87.18.1365.

    Article  CAS  PubMed  Google Scholar 

  8. Dalstein V, Riethmuller D, Pretet JL, Le Bail C, Sautiere JL, Carbillet JP, Kantelip B, Schaal JP, Mougin C: Persistence and load of high-risk HPV are predictors for development of high-grade cervical lesions: a longitudinal French cohort study. Int J Cancer. 2003, 106: 396-403. 10.1002/ijc.11222.

    Article  CAS  PubMed  Google Scholar 

  9. Ho GY, Bierman R, Beardsley L, Chang CJ, Burk RD: Natural history of cervicovaginal papillomavirus infection in young women. New Engl J Med. 1998, 338: 423-428. 10.1056/NEJM199802123380703.

    Article  CAS  PubMed  Google Scholar 

  10. Fontaine J, Hankins C, Money D, Rachlis A, Pourreaux K, Ferenczy A, Coutlee F: Human papillomavirus type 16 (HPV-16) viral load and persistence of HPV-16 infection in women infected or at risk for HIV. J Clin Virol. 2008, 43: 307-312. 10.1016/j.jcv.2008.07.013.

    Article  CAS  PubMed  Google Scholar 

  11. Kulmala SM, Shabalova IP, Petrovitchev N, Syrjanen KJ, Gyllensten UB, Johansson BC, Syrjanen SM: Type-specific persistence of high-risk human papillomavirus infections in the New Independent States of the former Soviet Union Cohort Study. Cancer Epidemiol Biomarkers Prev. 2007, 16: 17-22. 10.1158/1055-9965.EPI-06-0649.

    Article  CAS  PubMed  Google Scholar 

  12. Trottier H, Mahmud S, Prado J, Sobrinho JS, Costa MC, Rohan TE, Villa LL, Franco EL: Type-specific duration of human papillomavirus infection: implications for human papillomavirus screening and vaccination. J Infect Dis. 2008, 197: 1436-1447. 10.1086/587698.

    Article  PubMed  Google Scholar 

  13. Cuzick J: Viral load as surrogate for persistence in cervical human papillomavirus infection. Development in Cervical Cancer Screening and prevention. Edited by: Franco EL, Monsonego J. 1997, Blackwell, Oxford, 372-378.

    Google Scholar 

  14. Doorbar J: Papillomavirus life cycle organization and biomarker selection. Dis Markers. 2007, 23: 297-313.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Kulmala SMA, Syrjanen SM, Gyllensten UB, Shabalova IP, Petrovichev N, Tosi PT, Syrjanen KJ, Johansson BC: Early integration of high copy HPV-16 detectable in women with normal and low grade cervical cytology and histology. J Clin Pathol. 2007, 59: 513-517. 10.1136/jcp.2004.024570.

    Article  Google Scholar 

  16. Peitsaro P, Johansson B, Syrjanen S: Integrated human papillomavirus type 16 is frequently found in cervical cancer precursors as demonstrated by a novel quantitative real-time PCR technique. J Clin Microbiol. 2002, 40: 886-891. 10.1128/JCM.40.3.886-891.2002.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Thiery F: Transcription regulation of the papillomavirus oncogenes by cellular and viral transcription factors in cervical carcinoma. Virology. 2009, 384: 375-379. 10.1016/j.virol.2008.11.014.

    Article  Google Scholar 

  18. Fontaine J, Hankins C, Mayrand MH, Lefevre J, Money D, Gagnon S, Rachlis A, Pourreaux K, Ferenczy A, Coutlee F: High levels of episomal and integrated HPV-16 DNA are associated with high-grade cervical lesions in women at risk or infected with HIV. AIDS. 2005, 19: 785-794. 10.1097/01.aids.0000168972.65304.6b.

    Article  PubMed  Google Scholar 

  19. Cheung JL, Lo KW, Cheung TH, Tang JW, Chan PK: Viral load, E2 gene disruption status, and lineage of human papillomavirus type 16 infections in cervical neoplasia. J Infect Dis. 2006, 194: 1706-1712. 10.1086/509622.

    Article  CAS  PubMed  Google Scholar 

  20. Briolat J, Dalstein V, Saunier M, Joseph K, Caudroy S, Pretet JL, Birembaut P, Clavel C: HPV prevalence, viral load and physical state of HPV-16 in cervical smears of patients with different grades of CIN. Int J Cancer. 2007, 121: 2198-2204. 10.1002/ijc.22959.

    Article  CAS  PubMed  Google Scholar 

  21. Matovina M, Sabol I, Grubisic G, Gasperov NM, Grce M: Identification of human papillomavirus type 16 integration sites in high-grade precancerous cervical lesions. Gynecol Oncol. 2009, 113: 120-127. 10.1016/j.ygyno.2008.12.004.

    Article  CAS  PubMed  Google Scholar 

  22. Richardson H, Kelsall G, Tellier P, Voyer H, Abrahamowicz M, Coutlée F, Franco EL: The natural history of type-specific HPV infections in female university students. Cancer. Epidemiol Biom Prev. 2003, 12: 485-490.

    Google Scholar 

  23. Richardson H, Abrahamowicz M, Tellier PP, Kelsall G, du-Berger R, Ferenczy A, Coutlee F, Franco EL: Modifiable risk factors associated with clearance of type-specific cervical human papillomavirus infections in a cohort of university students. Cancer Epidemiol Biomarkers Prev. 2005, 14: 1149-1156. 10.1158/1055-9965.EPI-04-0230.

    Article  PubMed  Google Scholar 

  24. Lefevre J, Hankins C, Pourreaux K: The Canadian Women's HIV study Group, and F. Coutlée. Prevalence of selective inhibition of HPV-16 DNA amplification in cervicovaginal lavages. J Med Virol. 2004, 72: 132-137. 10.1002/jmv.10539.

    Article  CAS  PubMed  Google Scholar 

  25. Lefevre J, Hankins C, Pourreaux K, Voyer H: The Canadian Women's HIV study Group, and F. Coutlée. Real -time PCR assays using internal Controls for quantitation of HPV-16 and b-globin DNA in cervicovaginal lavages. J Virol Methods. 2003, 144: 135-144. 10.1016/j.jviromet.2003.09.003.

    Article  Google Scholar 

  26. Gravitt PE, Peyton C, Wheeler C, Apple R, Higuchi R, Shah KV: Reproducibility of HPV 16 and HPV 18 viral load quantitation using TaqMan real-time PCR assays. J. Virol Methods. 2003, 112: 23-33. 10.1016/S0166-0934(03)00186-1.

    Article  CAS  Google Scholar 

  27. Weissenborn SJ, Funke AM, Hellmich M, Mallmann P, Fuschs PG, Pfister HG, Wieland : Oncogenic Human papillomavirus DNA loads in human immunodeficiency virus-positive women with High-grade cervical lesions are strongly elevated. J Clin Microbiol. 2003, 41: 2763-2767. 10.1128/JCM.41.6.2763-2767.2003.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Azizi N, Brazete J, Hankins C, Money J, Fontaine A, Koushik A, Rachlis K, Pourreaux A, Franco Ferenczy E, Coutlee F: Influence of HPV-16 E2 polymorphism on quantitation of HPV-16 episomal and integrated DNA in cervicovaginal lavages from women with cervical intraepithelial neoplasia. J Gen Virol. 2008, 89: 1716-1728. 10.1099/vir.0.83579-0.

    Article  CAS  PubMed  Google Scholar 

  29. Khouadri S, Villa LL, Gagnon S, Koushik A, Richardson H, Ferenczy A, Matlashewski , Roger M, Franco EL, Coutlée F: Viral load of episomal and integrated forms of Human papillomavirus type 33 in High-grade squamous intraepithelial lesions of the uterine cervix. Int J Cancer. 2007, 121: 2674-2681. 10.1002/ijc.23006.

    Article  CAS  PubMed  Google Scholar 

  30. Thorland EC, Myers SL, Persing DH, Sarkar G, McGovern RM, Gostout BS, Smith DI: Human papillomavirus type 16 integrations in cervical tumors frequently occur in common fragile sites. Cancer Res. 2000, 60: 5916-5921.

    CAS  PubMed  Google Scholar 

  31. Cox DR: Regression Models and Life Tables. Journal of the Royal Statistical Society. 1972, 34: 187-220. Series B

    Google Scholar 

  32. Liang KY, Zeger SL: Generalized linear model in longitudinal data analysis. Biometrika. 1986, 73: 13-22. 10.1093/biomet/73.1.13.

    Article  Google Scholar 

  33. Hardin JW, Hilbe JM: Generalized estimating equations. 2003, New York. Chapman and Hall/CRC

    Google Scholar 

  34. Ruutu MP, Kulmala SM, Peitsaro P, Syrjanen SM: The performance of the HPV16 real-time PCR integration assay. Clin Biochem. 2008, 41: 423-428. 10.1016/j.clinbiochem.2007.12.014.

    Article  CAS  PubMed  Google Scholar 

  35. Snijders PJF, Hogewoning CJ, Hesselink HT, Berkhof J, Voohorst FJ, Bleeker MC, Meijer CJLM: Determination of viral load thresholds in cervical scrapes to rule out CIN3 in HPV 16, 18, 31 and 33-positive women with normal cytology. Int J Cancer. 2006, 119: 1102-1107. 10.1002/ijc.21956.

    Article  CAS  PubMed  Google Scholar 

  36. Coutlee F, Mayrand MH, Roger M, Franco EL: Detection and typing of human papillomavirus nucleic acids in biological fluids. Public Health Genomics. 2009, 12: 308-318. 10.1159/000214921.

    Article  PubMed  Google Scholar 

  37. Poppe WA, Ide PS, Drijkoningen MP, Lauweryns JM, Van Assche FA: Tobacco smoking impairs the local immunosurveillance in the uterine cervix. An immunohistochemical study. Gynecol Obstet Invest. 1995, 39: 34-38. 10.1159/000292372.

    Article  CAS  PubMed  Google Scholar 

  38. Giuliano AR, Sedjo RL, Roe DJ, Harri R, Baldwi S, Papenfuss MS, Abrahamsen M, Inserra P: Clearance of oncogenic human papillomavirus (HPV) infection: effect of smoking (United States). Cancer Causes Control. 2002, 13: 839-846. 10.1023/A:1020668232219.

    Article  PubMed  Google Scholar 

  39. Burchell AN, Tellier PP, Hanley J, Coutlee F, Franco EL: HPV infection in young adult couples in new sexual relationships: the importance of partner's HPV status. Sex Transm Dis. 2010, 37 (1): 34-40. 10.1097/OLQ.0b013e3181b35693.

    Article  PubMed  Google Scholar 

  40. Bleeker MC, Hogewoning CJ, Voorhorst FJ, van den Brule AJ, Snijders PJ, Starink TM, Berkhof J, Meijer CJ: Condom use promotes regression of human papillomavirus-associated penile lesions in male sexual partners of women with cervical intraepithelial neoplasia. Int J Cancer. 2003, 107: 804-810. 10.1002/ijc.11473.

    Article  CAS  PubMed  Google Scholar 

  41. Wang SS, Hildesheim A: Viral and host factors in human papillomavirus persistence and progression. J Natl Cancer Inst Monogr. 2003, 35-40.

    Google Scholar 

  42. Lehoux M, D'Abramo CM, Archambault J: Molecular mechanisms of human papillomavirus-induced carcinogenesis. Public Health Genomics. 2009, 12: 268-280. 10.1159/000214918.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Burk RD, Chen Z, Van Doorslaer K: Human papillomaviruses: genetic basis of carcinogenicity. Public Health Genomics. 2009, 12: 281-290. 10.1159/000214919.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Levi JE, Fernandes S, Tateno AF, Motta E, Lima LP, Eluf-Neto J, Pannuti CS: Presence of multiple human papillomavirus types in cervical samples from HIV-infected women. Gyn Oncol. 2004, 92: 225-231. 10.1016/j.ygyno.2003.10.004.

    Article  Google Scholar 

  45. An HJ, Cho NH, Lee SY, Kim IH, Lee C, Kim SJ, Mun M, Kim SH, Jeong JK: Correlation of cervical carcinoma and precancerous lesions with human papillomavirus (HPV) genotypes detected with the HPV DNA chip microarray method. Cancer. 2003, 97: 1672-1680. 10.1002/cncr.11235.

    Article  PubMed  Google Scholar 

  46. Collins SI, Williams CC, Wen K, Young LS, Roberts S, Murray PG, Woodman CB: Disruption of the E2 gene is a common and early event in the natural history of cervical human papillomavirus infection: a longitudinal cohort study. Cancer Res. 2009, 69: 3828-3832. 10.1158/0008-5472.CAN-08-3099.

    Article  CAS  PubMed  Google Scholar 

  47. Arias-Pulido H, Peyton CL, Joste NE, Vargas H, Wheeler CM: Human papillomavirus type 16 integration in cervical carcinoma in situ and in invasive cervical cancer. J Clin Microbiol. 2006, 44: 1755-1762. 10.1128/JCM.44.5.1755-1762.2006.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Cuzick J, Beverley E, Ho L, Terry G, Sapper H, Mielzynska I, Lorincz A, Chan WK, Krausz T, Soutter P: HPV testing in primary screening of older women. Br J Cancer. 1999, 81: 554-8. 10.1038/sj.bjc.6690730.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Cuschieri KS, Cubie HA, Whitley MW, Gilkison G, Arends MJ, Graham C, McGoogan E: Persistent high risk HPV infection associated with development of cervical neoplasia in a prospective population study. J Clin Pathol. 2005, 58: 946-950. 10.1136/jcp.2004.022863.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Pre-publication history

Download references


This study was supported by Canadian Institutes of Health Research operating grants MT-13649 and MOP-53111 and team grant #83320.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Eduardo L Franco.

Additional information

Competing interests

Financial competing interests: None

Non-Financial competing interests: None

The authors declare that they have no competing interests

Authors' contributions

AVR performed the statistical analysis and drafted the first version of the manuscript. OG performed viral load measurements and integration studies, and interpreted their findings. HR participated in the design and coordination of the study and collection of epidemiologic data. PT participated in the design of the study, oversaw recruitment of subjects, and collected samples. AF participated in the design of the study, read Pap smears, and provided expert oversight. FC helped design the study, supervised HPV genotyping, viral load, and integration assays, participated in the analysis of results and helped interpreting the results and drafting of the manuscript. EF was the study's principal investigator, and participated in its design and coordination, supervision of analyses, and drafting. All authors read and approved the final manuscript and subsequent revisions.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Ramanakumar, A.V., Goncalves, O., Richardson, H. et al. Human papillomavirus (HPV) types 16, 18, 31, 45 DNA loads and HPV-16 integration in persistent and transient infections in young women. BMC Infect Dis 10, 326 (2010).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: