Validity of claims-based diagnoses for infectious diseases common among immunocompromised patients in Japan

Abstract

Background

To validate Japanese claims-based disease-identifying algorithms for herpes zoster (HZ), Mycobacterium tuberculosis (MTB), nontuberculous mycobacteria infections (NTM), and Pneumocystis jirovecii pneumonia (PJP).

Methods

VALIDATE-J, a multicenter, cross-sectional, retrospective study, reviewed the administrative claims data and medical records from two Japanese hospitals. Claims-based algorithms were developed by experts to identify HZ, MTB, NTM, and PJP cases among patients treated 2012–2016. Diagnosis was confirmed with three gold standard definitions; positive predictive values (PPVs) were calculated for prevalent (regardless of baseline disease-free period) and incident (preceded by a 12-month disease-free period for the target conditions) cases.

Results

Of patients identified using claims-based algorithms, a random sample of 377 cases was included: HZ (n = 95 [55 incident cases]); MTB (n = 100 [58]); NTM (n = 82 [50]); and PJP (n = 100 [84]). PPVs ranged from 67.4–70.5% (HZ), 67.0–90.0% (MTB), 18.3–63.4% (NTM), and 20.0–45.0% (PJP) for prevalent cases, and 69.1–70.9% (HZ), 58.6–87.9% (MTB), 10.0–56.0% (NTM), and 22.6–51.2% (PJP) for incident cases, across definitions. Adding treatment to the algorithms increased PPVs for HZ, with a small increase observed for prevalent cases of NTM.

Conclusions

VALIDATE-J demonstrated moderate to high PPVs for disease-identifying algorithms for HZ and MTB using Japanese claims data.

Introduction

Infectious diseases represent a major cause of morbidity and mortality worldwide. Despite substantial improvements in healthcare and medical technology in recent years, infectious diseases are directly responsible for approximately 9% of deaths globally [1]. Individuals who are immunocompromised (including those infected with human immunodeficiency virus, transplant recipients, and patients with chronic renal failure, malignancies, or autoimmune/inflammatory disorders) and receiving immunosuppressive treatment [2,3,4], as well as older adults [5], are particularly vulnerable to serious infection with opportunistic pathogens or the reactivation of latent varicella zoster virus leading to herpes zoster (HZ). Additionally, epidemiologic studies have reported that certain geographic populations are at an increased risk of opportunistic infection. The global incidence of HZ is approximately 3–5/1000 person-years; the incidence in the United States (US) is approximately 3.2–5.2/1000 person-years [6], and the incidence in Europe is approximately 2.0–4.6/1000 person-years [7]. However, the incidence of HZ in the Asia-Pacific region (including Australia, Taiwan, South Korea, and Japan) is even higher, at approximately 3–10/1000 person-years [8]. In Japanese patients aged ≥50 years, this increases to 10.9/1000 person-years [9]. Identifying patient cohorts that are highly susceptible to infectious diseases, as well as improving diagnostic accuracy, are essential to improving health outcomes associated with specific infections.

Administrative healthcare claims databases provide longitudinal real-world data of hospitalizations, outpatient visits, major procedures, and medication use in large populations. These data bolster health services and outcomes research, as well as pharmacoepidemiologic research. Claims-based definitions for numerous infectious diseases, including HZ, Mycobacterium tuberculosis (MTB), nontuberculous mycobacteria infections (NTM), and Pneumocystis jirovecii pneumonia (PJP), have been developed in the US [10,11,12,13,14,15] and are validated for the identification of prevalent and/or incident cases. However, claims-based definitions using administrative healthcare data for HZ, MTB, NTM, and PJP have not yet been validated in Japan. Validation of such claims-based definitions against “gold standard” definitions of infectious diseases, based on medical records, is needed to better reflect the unique claims data and clinical practice environment of Japan.

In the Validity of Algorithms in Large Databases: Infectious Diseases, Rheumatoid Arthritis (RA), and Tumor Evaluation in Japan (VALIDATE-J) study, experts developed new or modified claims-based disease-identifying algorithms and validated these against gold standard definitions using hospital claims data from Japan. The overall objectives of the study were to validate claims-based algorithms for HZ, MTB, NTM, PJP, cancer, and RA in the Japanese clinical practice environment. Here, we report the concordance between the claims-based algorithms and the gold standard definitions, expressed as positive predictive values (PPVs), for HZ, MTB, NTM, and PJP. Data for cancer are reported elsewhere [16]; data for RA will be reported separately.

Methods

Study design and patients

The VALIDATE-J study comprised a cross-sectional retrospective review of claims data (including patient demographics and clinical characteristics, details of diagnoses, medical procedures, and medications taken within the same month or ± 1 claim month), medical records, and registry data from two general acute-care hospitals in Japan that routinely diagnose and treat patients with infectious diseases, cancer, and RA. Hospital A was a >900-bed private teaching hospital located in a rural area, and Hospital B was a >700-bed community teaching hospital located in a city; both were within the Chiba prefecture.

Figure 1 summarizes the methods applied to the infectious diseases cohort.

Fig. 1 Study flow chart for the infectious diseases cohort. HZ, herpes zoster; MTB, Mycobacterium tuberculosis infection; NTM, nontuberculous mycobacteria infection; PJP, Pneumocystis jirovecii pneumonia; PPV, positive predictive value. (a) Claims data did not include personal health information. (b) Two infectious disease experts formed an adjudication committee, which was a subcommittee within a steering committee. The primary role of the adjudication committee was to assess whether patients met the gold standard definitions for the respective infectious diseases, based on the abstracted medical records and anonymized claims data.

Prior to study initiation, a feasibility assessment was conducted, during which structured data abstraction forms for each infectious disease were developed, along with operational procedures for abstracting patient data from medical records. As part of this phase, a steering committee of infectious disease and epidemiologic methodology specialists developed claims-based algorithms for HZ, MTB, NTM, and PJP (Table 1) based on combinations of International Classification of Diseases, Tenth Revision (ICD-10) diagnosis codes and claims codes for these diseases, and relevant tests and therapies. ICD-10 diagnosis codes and claims codes used for the algorithms and selected drugs for each infectious disease are shown in Supplemental Tables 1 and 2, respectively. Gold standard definitions for the diagnosis of HZ, MTB, NTM, or PJP using the hospital data are shown in Table 1.

Table 1 Claims-based algorithms and gold standard definitions for HZ, MTB, NTM, and PJP

The data collection period occurred between January 1, 2012 (Hospital A) or March 1, 2012 (Hospital B), and December 31, 2016. Outpatients or inpatients who were treated at either hospital during this time were assessed to determine whether they met the claims-based algorithms for HZ, MTB, NTM, or PJP (Table 1). Of those meeting these criteria, a random sample of 200 patients from each hospital were linked with medical records, and data on disease status were obtained from the associated medical charts using abstraction forms (Fig. 1). Prevalent cases were those identified regardless of baseline disease-free period, and incident cases were those preceded by a 12-month disease-free period, per the VALIDATE-J malignancy study [16]. Using abstracted medical records and associated anonymized claims data, expert adjudicators identified confirmed or probable infectious disease cases according to the gold standard definitions described above and in Table 1.
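The distinction between prevalent and incident cases can be illustrated with a minimal Python sketch. It assumes that claims are recorded by calendar month and that the disease-free period is checked over the 12 claim months before the index month. The function names, the (year, month) representation, and the example dates are hypothetical and are not taken from the study's analysis code.

```python
from typing import Iterable, Tuple

YearMonth = Tuple[int, int]  # (year, month), e.g. (2014, 6); hypothetical layout


def month_index(ym: YearMonth) -> int:
    """Convert (year, month) to a running month count so months can be subtracted."""
    year, month = ym
    return year * 12 + (month - 1)


def is_incident(index_month: YearMonth,
                prior_claim_months: Iterable[YearMonth],
                washout_months: int = 12) -> bool:
    """Return True if no claim for the target condition falls in the
    washout window immediately before the index month.

    Assumption: the window covers the `washout_months` claim months
    preceding the index month (the index month itself is excluded).
    """
    idx = month_index(index_month)
    for ym in prior_claim_months:
        gap = idx - month_index(ym)
        if 0 < gap <= washout_months:
            return False  # an earlier claim breaks the disease-free period
    return True


# Hypothetical patients: earlier claims 17 months and 3 months before the index month
print(is_incident((2014, 6), [(2013, 1)]))  # True
print(is_incident((2014, 6), [(2014, 3)]))  # False
```

Under this reading, every algorithm-positive case counts as prevalent, and the incident subset consists of the cases for which `is_incident` returns `True`.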

A pilot study of five cases at each hospital was conducted prior to main data collection. The abstraction process for each case was carried out independently by two abstracters to resolve any inconsistencies ahead of the main study, and to assess inter-adjudicator variability. Modifications of the gold standard definition, adjudication form, and abstraction process were performed to reduce variability.

Validity measures

Using the anonymized database of claims data, abstracted medical records, and adjudication results, PPVs for the claims-based algorithms were calculated. While treatment was included in the claims-based algorithms for MTB and PJP, including treatment in the claims-based algorithms for HZ and NTM was performed as a sensitivity analysis. PPVs were also calculated for PJP excluding the period prior to August 2012 as an ad hoc analysis.

Ethics

An Independent Ethics Committee and the Institutional Review Board at each participating hospital approved the study protocol. The study was conducted in accordance with accepted practices for pharmacoepidemiology studies issued by the International Society for Pharmacoepidemiology [19] and the Council for International Organizations of Medical Sciences [20]. Patients identified in the claims databases were not required to provide consent and could opt out of participating in the study.

Statistical analysis

Demographic and disease characteristics were summarized using descriptive statistics, with means and standard deviations (SDs) for continuous variables, and percentages and counts for dichotomous variables. It was estimated that a sample size of ≥400 infectious disease cases overall (comprising HZ, MTB, NTM, and PJP cases) would result in a confidence limit of 10%, assuming a PPV of 85.0%. For each claims-based algorithm, PPVs with 95% confidence intervals (CIs) were calculated as the number of cases meeting the claims-based algorithm that were confirmed using the gold standard definitions (i.e., true positives) divided by the total number of cases meeting the claims-based algorithm (i.e., true and false positives) (Supplemental Table 3). The 95% CIs for the PPVs were calculated using the normal approximation of the binomial distribution. Anonymized data were analyzed using Python version 3.6.0 (2016).
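The PPV calculation described above can be sketched in a few lines of Python. This is an illustration rather than the study's code: the function name is hypothetical, the interval is clipped to the range 0 to 1 as an added convenience, and the example counts (90 confirmed cases among 100 algorithm-positive cases) are back-calculated from the reported 90.0% PPV for prevalent MTB rather than taken from Table 3.

```python
import math


def ppv_with_wald_ci(true_positives: int, algorithm_positives: int, z: float = 1.96):
    """PPV = true positives / all algorithm-positive cases, with a
    normal-approximation (Wald) confidence interval.

    z = 1.96 corresponds to a two-sided 95% interval.
    """
    if algorithm_positives <= 0:
        raise ValueError("algorithm_positives must be positive")
    if not 0 <= true_positives <= algorithm_positives:
        raise ValueError("true_positives must lie between 0 and algorithm_positives")

    ppv = true_positives / algorithm_positives
    standard_error = math.sqrt(ppv * (1 - ppv) / algorithm_positives)
    # Clipping to [0, 1] is an assumption of this sketch, not stated in the article
    lower = max(0.0, ppv - z * standard_error)
    upper = min(1.0, ppv + z * standard_error)
    return ppv, lower, upper


# Illustrative counts implied by a 90.0% PPV among 100 sampled cases
ppv, lower, upper = ppv_with_wald_ci(90, 100)
print(f"PPV = {ppv:.1%} (95% CI {lower:.1%} to {upper:.1%})")
# PPV = 90.0% (95% CI 84.1% to 95.9%)
```

The normal approximation is simple but can behave poorly when the PPV is close to 0 or 1 or when the sample is small, which is one reason the interval is clipped in this sketch.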

Results

Patients

Of 4031 patients with infectious diseases identified using the claims-based algorithms during the data collection period (2012–2016), a random sample of 377 infectious disease cases (out of 400 cases initially selected across both hospitals) were used for the final analyses. The sample included cases of HZ (n=95 [including 55 incident cases]), MTB (n=100 [including 58 incident cases]), NTM (n=82 [including 50 incident cases]), and PJP (n=100 [including 84 incident cases]). Out of the randomly selected 400 cases, 23 patients were excluded; 5 HZ cases were excluded following further refinement of the main algorithm to include only patients aged ≥18 years and without facial palsy, and 18 NTM cases were excluded following revision of the main algorithm to include the following procedures: acid fast staining and culture, or polymerase chain reaction (PCR), within a claim month or ±1 claim month. The numbers of cases identified in the individual hospital claims data were 181 at Hospital A (HZ: n=49; MTB: n=50; NTM: n=32; PJP: n=50) and 196 at Hospital B (HZ: n=46; MTB: n=50; NTM: n=50; PJP: n=50).
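The timing rule in the revised NTM algorithm (qualifying procedures in the same claim month as the diagnosis or within ±1 claim month) can be sketched as follows. This is an illustrative reading only: the procedure labels, function names, and data layout are hypothetical, and the grouping "(acid fast staining and culture) or PCR" is an assumption about how the criterion is combined. The actual code lists are those referenced in Table 1 and the supplemental tables.

```python
from typing import Iterable, Tuple

YearMonth = Tuple[int, int]  # (year, month); hypothetical representation of a claim month


def month_index(ym: YearMonth) -> int:
    """Convert (year, month) to a running month count so months can be compared."""
    year, month = ym
    return year * 12 + (month - 1)


def has_qualifying_ntm_procedure(diagnosis_month: YearMonth,
                                 procedures: Iterable[Tuple[str, YearMonth]],
                                 window_months: int = 1) -> bool:
    """Return True if the required procedures were claimed within
    +/- `window_months` of the diagnosis claim month.

    Assumed rule: (acid fast staining AND culture) OR PCR.
    Procedure labels 'afb_stain', 'afb_culture', and 'pcr' are placeholders.
    """
    dx = month_index(diagnosis_month)
    nearby = {
        name
        for name, claim_month in procedures
        if abs(month_index(claim_month) - dx) <= window_months
    }
    stain_and_culture = {"afb_stain", "afb_culture"} <= nearby
    return stain_and_culture or "pcr" in nearby


# Hypothetical example: staining one month before and culture one month after the diagnosis
claims = [("afb_stain", (2014, 4)), ("afb_culture", (2014, 6))]
print(has_qualifying_ntm_procedure((2014, 5), claims))  # True
```

Working with a running month count keeps the ±1 month comparison correct across year boundaries, for example a December procedure paired with a January diagnosis.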

Demographics for patients identified using the claims-based algorithms are shown in Table 2. Approximately half of cases for each infectious disease were in females. The mean ages (SD) of patients with HZ, MTB, NTM, and PJP ranged from 61.5 (20.2) years (HZ) to 69.1 (12.3) years (NTM). Disease characteristics of prevalent cases by infection type are in Supplemental Tables 4–7. Approximately 20% of patients identified were receiving immunosuppressive therapy, except for PJP, for which the proportion was much higher (92.0%). Comorbidities and therapies were as expected for a true diagnosis of each infection.

Table 2 Demographics of prevalent infectious disease cases identified using claims data from two hospitals

Validity of claims-based algorithms

The PPVs for claims-based algorithms were similar for prevalent and incident cases across the four infections, regardless of whether gold standard definition 1 or 2 was used, and they were consistently highest for MTB (range 87.9–90.0%) and lowest for PJP (range 45.0–51.2%; Table 3). For prevalent cases, PPVs for claims-based algorithms using gold standard definition 1 (physician diagnosis) or 2 (overall adjudicator decision; confirmed or probable cases), respectively, were 67.4% and 70.5% for HZ, 90.0% (both definitions) for MTB, 63.4% (both definitions) for NTM, and 45.0% (both definitions) for PJP. For incident cases, PPVs were 69.1% and 70.9% for HZ, 87.9% (both definitions) for MTB, 56.0% and 54.0% for NTM, and 48.8% and 51.2% for PJP.

Table 3 PPVs (95% CI) of claims-based algorithms for infectious diseases from two hospitals

Comparison of claims-based algorithms with gold standard definition 3 (overall adjudicator decision; confirmed cases) resulted in the lowest PPVs across prevalent MTB, NTM, and PJP cases (67.0%, 18.3%, and 20.0%, respectively) and incident MTB, NTM, and PJP cases (58.6%, 10.0%, and 22.6%, respectively).

In sensitivity analyses, the inclusion of treatment in the claims-based algorithms for HZ and NTM resulted in increased PPVs for prevalent and incident cases of HZ regardless of which gold standard definition was used (PPV for prevalent cases: 79.6% and 83.7% for gold standard definition 1 and 2, respectively; PPV for incident cases: 80.0% and 83.3% for gold standard definition 1 and 2, respectively; Table 3). The PPVs for claims-based algorithms of incident cases of NTM decreased with the inclusion of treatment in the algorithm (Table 3). PPVs for cases of PJP slightly increased when claims prior to August 2012 were excluded (PPV for prevalent cases: 49.5% and 48.4% for gold standard definition 1 and 2, respectively; PPV for incident cases: 51.3% and 52.5% for gold standard definition 1 and 2, respectively; Table 3).

The PPVs for prevalent and incident cases identified in the individual hospital data were generally consistent with the overall analysis, although the sample sizes were relatively small for each hospital separately (Supplemental Tables 8 and 9).

Discussion

To our knowledge, VALIDATE-J is one of the first studies conducted in Japan to validate claims-based algorithms for HZ, MTB, NTM, and PJP. The claims-based algorithms, developed with expert input, identified cohorts of patients with demographics and clinical characteristics as expected for these infectious diseases. Other retrospective claims database studies in Japan used algorithms with ICD-10 diagnosis codes only [21], or ICD-10 diagnosis codes plus claims data regarding prescription medication [2, 22]. These studies did not validate the algorithms using clinical information. In contrast, the algorithms in the VALIDATE-J study were validated using clinical information and further included additional criteria of exception (i.e., facial palsy for HZ), laboratory tests (acid fast staining and culture, or PCR for NTM; β-D-glucan test for PJP), and details regarding dosage and duration of prescribed drugs (for HZ and PJP). PPVs were higher across infectious diseases when gold standard definition 1 (physician diagnosis) or 2 (overall adjudicator decision; confirmed or probable cases) were applied (45–90%), compared with gold standard definition 3. For gold standard definition 1 and 2, PPVs were 67–84% for HZ and 88–90% for MTB. The algorithms developed for NTM and PJP generally did not have adequate PPVs across gold standard definitions (NTM: 8–70%; PJP: 20–51%) to support use with Japanese claims data, except for NTM cases in the sensitivity analysis which incorporated treatment in the algorithm, when gold standard definition 2 was applied (70%).

The low PPVs for cases of NTM could be a result of an NTM diagnosis being recorded when mycobacterial tests were ordered, rather than reflecting a true diagnosis. Moreover, NTM diagnoses were often made based on clinical imaging findings or a single positive culture, rather than a confirmed diagnosis based on two positive cultures, which could account for the lowest PPVs using gold standard definition 3 (overall adjudicator decision; confirmed cases). Compared with data reported for the preferred algorithms identified in studies using US data in which PPVs were 70.0–100% [12], the prevalent PPVs reported in the current analysis for NTM when gold standard definition 1 or 2 were applied (63.4%) were slightly lower. This could be explained partly by the use of culture-based case finding algorithms in US studies [12]. In contrast, the claims data used in the current analysis did not include sufficient data from culture; therefore, such algorithms could not be applied.

The low PPVs for cases of PJP may be explained by providers coding PJP diagnoses on the basis of prophylactic antibiotic use for PJP, rather than reflecting a true diagnosis of PJP. For example, the prophylactic dose of atovaquone is the same as the therapeutic dose, and may have been considered as a PJP diagnosis [23]. Trimethoprim-sulfamethoxazole is the most frequently used prophylactic antibiotic for PJP [24]; however, it was not approved for prophylactic use in Japan until August 2012. Thus, patients who received trimethoprim-sulfamethoxazole to prevent PJP were most likely coded as PJP cases for reimbursement purposes prior to August 2012. An ad hoc analysis excluding the period prior to August 2012 showed slightly increased PPV for PJP; however, this coding practice might have continued even after that in some cases. The algorithms were further refined by including criteria regarding dosage and duration of trimethoprim-sulfamethoxazole and the performance of β-D-glucan test to exclude cases with prophylactic treatment, although these were applied only in Hospital B. While a sensitivity analysis was not performed, the slightly higher PPV in Hospital B versus Hospital A was likely due to the exclusion of more cases with prophylactic treatment in Hospital B. Finally, PCR-based diagnosis of PJP has become more commonplace, which may have resulted in an increase in PJP diagnoses in the later years of the study.

The application of gold standard definition 2 resulted in higher PPVs of claims-based algorithms for prevalent HZ cases than the application of gold standard definition 1. As cases of HZ are likely to be less severe than MTB, NTM, and PJP, HZ is typically treated in the outpatient setting, where physicians might be less likely to record the diagnosis in the patient records than in hospitalized cases. Moreover, in some cases, a diagnosis of HZ was not recorded, but an antiviral drug for HZ was prescribed; this would be classified as HZ according to gold standard definition 2, but not necessarily according to gold standard definition 1. These factors may have accounted for the differences observed between gold standard definitions 1 and 2 for HZ.

Gold standard definition 3 yielded the lowest PPVs of claims-based algorithms for prevalent and incident cases of MTB, NTM, and PJP. This gold standard definition identified cases using microbiologic and laboratory tests. The information required for a confirmed diagnosis (see Table 1) was often unavailable in the medical charts, which could partly account for the lower PPVs observed. Most cases of MTB identified were diagnosed based on microbiologic confirmation, which may explain why the PPVs using gold standard definition 3 were higher for MTB than for NTM and PJP.

Including treatment in the claims-based algorithm did not improve the PPVs for incident cases of NTM, and only slightly improved PPVs for prevalent cases. This may reflect that appropriate treatment regimens for NTM are still being established [25]. In addition, it is possible that the treatments were used as prophylactic therapy for other conditions (e.g., opportunistic infections in patients with human immunodeficiency virus, or prevention of pneumocystis pneumonia in immunocompromised patients) rather than for NTM. Finally, the treatments may have been prescribed for suspected NTM and then discontinued if the laboratory tests for NTM came back negative. Including three or more NTM drugs in the criteria would have improved PPV, but also would have increased false negatives unacceptably.

Our study has some limitations that should be acknowledged. First, the sample size for each infectious disease for which we were able to review charts was small, resulting in wider 95% CIs for PPV estimates. However, these cases were randomly sampled, and the point estimates are likely representative of larger case bases. Second, we were not able to estimate negative predictive value, sensitivity, or specificity, which is an inherent limitation of our study design. Third, the revised algorithm for PJP was not applied in Hospital A because the data collection from Hospital A had already been completed. Application of the revised algorithm in both hospitals may have improved the PPV for PJP. Additionally, there were differences in the diagnostic and treatment strategies between the two study sites. However, such variability by site is expected in large databases consisting of multiple hospitals; thus, the generalizability of our results is higher than that of single-center studies. Moreover, the comorbidities associated with each infectious disease are likely to differ across hospitals, which means that the data from the two hospitals included here may not be representative of Japan; further studies using different institutions are warranted. While sampling directly from claims to review medical charts from multiple hospitals is the ideal way of sampling for validation studies, privacy laws in Japan prohibit the identification of patients directly from administrative healthcare databases. Finally, this study focused exclusively on traditional claims data, which are applicable to all hospitals and both inpatients and outpatients in Japan. Examining the validity of Diagnosis Procedure Combination (DPC) data is outside the scope of this study.

In conclusion, the claims-based algorithms developed for MTB may be applied to Japanese claims database studies to identify cases with high accuracy (88–90%), and the algorithms developed for HZ may be applied to identify cases with moderate accuracy (67–84%). The algorithms developed for NTM and PJP did not have adequate PPVs to support their use in research using Japanese claims data. Incorporating treatment into the claims-based algorithm improved PPVs for HZ, but it did not greatly improve PPVs for NTM. Future research should focus on developing improved claims-based algorithms for PJP and NTM and confirming these validation results in other hospital samples as well as further Japanese populations.

Data availability

The datasets supporting the conclusions of this article are included within the article (and additional files).

Abbreviations

AFB: acid fast bacilli

CI: confidence interval

CS: cupric silver (Grocott methenamine silver stain or Diff-Quik)

DPC: Diagnosis Procedure Combination

HZ: herpes zoster

ICD-10: International Classification of Diseases, Tenth Revision

LAMP: loop-mediated isothermal amplification

MTB: Mycobacterium tuberculosis

NTM: nontuberculous mycobacteria infections

PCR: polymerase chain reaction

PJP: Pneumocystis jirovecii pneumonia

PPV: positive predictive value

RA: rheumatoid arthritis

SD: standard deviation

VALIDATE-J: Validity of Algorithms in Large Databases: Infectious Diseases, Rheumatoid Arthritis, and Tumor Evaluation in Japan

References

  1. World Health Organization. Global health estimates: leading causes of death 2000–2019. 2020. https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates/ghe-leading-causes-of-death. Accessed 1 Jul 2022.

  2. Imafuku S, Matsuki T, Mizukami A, Goto Y, de Souza S, Jégou C, et al. Burden of herpes zoster in the Japanese population with immunocompromised/chronic disease conditions: results from a cohort study claims database from 2005–2014. Dermatol Ther (Heidelb). 2019;9:117–33.

  3. Muñoz-Quiles C, López-Lacort M, Díez-Domingo J, Orrico-Sánchez A. Herpes zoster risk and burden of disease in immunocompromised populations: a population-based study using health system integrated databases, 2009–2014. BMC Infect Dis. 2020;20:905.

  4. Sester M, van Leth F, Bruchfeld J, Bumbacea D, Cirillo DM, Dilektasli AG, et al. Risk assessment of tuberculosis in immunocompromised patients. A TBNET study. Am J Respir Crit Care Med. 2014;190:1168–76.

  5. Yoshikawa TT. Epidemiology and unique aspects of aging and infectious diseases. Clin Infect Dis. 2000;30:931–3.

  6. Kawai K, Gebremeskel BG, Acosta CJ. Systematic review of incidence and complications of herpes zoster: towards a global perspective. BMJ Open. 2014;4:e004833.

  7. Pinchinat S, Cebrián-Cuenca AM, Bricout H, Johnson RW. Similar herpes zoster incidence across Europe: results from a systematic literature review. BMC Infect Dis. 2013;13:170.

  8. Chen LK, Arai H, Chen LY, Chou MY, Djauzi S, Dong B, et al. Looking back to move forward: a twenty-year audit of herpes zoster in Asia-Pacific. BMC Infect Dis. 2017;17:213.

  9. Takao Y, Miyazaki Y, Okeda M, Onishi F, Yano S, Gomi Y, et al. Incidences of herpes zoster and postherpetic neuralgia in Japanese adults aged 50 years and older from a community-based prospective cohort study: the SHEZ study. J Epidemiol. 2015;25:617–25.

  10. Calderwood MS, Platt R, Hou X, Malenfant J, Haney G, Kruskal B, et al. Real-time surveillance for tuberculosis using electronic health record data from an ambulatory practice in eastern Massachusetts. Public Health Rep. 2010;125:843–50.

  11. Trepka MJ, Beyer TO, Proctor ME, Davis JP. An evaluation of the completeness of tuberculosis case reporting using hospital billing and laboratory data; Wisconsin, 1995. Ann Epidemiol. 1999;9:419–23.

  12. Winthrop KL, Baxter R, Liu L, McFarland B, Austin D, Varley C, et al. The reliability of diagnostic coding and laboratory data to identify tuberculosis and nontuberculous mycobacterial disease among rheumatoid arthritis patients using anti-tumor necrosis factor therapy. Pharmacoepidemiol Drug Saf. 2011;20:229–35.

  13. Yokoe DS, Coon SW, Dokholyan R, Iannuzzi MC, Jones TF, Meredith S, et al. Pharmacy data for tuberculosis surveillance and assessment of patient management. Emerg Infect Dis. 2004;10:1426–31.

  14. Klompas M, Kulldorff M, Vilk Y, Bialek SR, Harpaz R. Herpes zoster and postherpetic neuralgia surveillance using structured electronic data. Mayo Clin Proc. 2011;86:1146–53.

  15. Long MD, Farraye FA, Okafor PN, Martin C, Sandler RS, Kappelman MD. Increased risk of pneumocystis jiroveci pneumonia among patients with inflammatory bowel disease. Inflamm Bowel Dis. 2013;19:1018–24.

  16. de Luise C, Sugiyama N, Morishima T, Higuchi T, Katayama K, Nakamura S, et al. Validity of claims-based algorithms for selected cancers in Japan: results from the VALIDATE-J study. Pharmacoepidemiol Drug Saf. 2021;30:1153–61.

  17. Griffith DE, Aksamit T, Brown-Elliott BA, Catanzaro A, Daley C, Gordin F, et al. An official ATS/IDSA statement: diagnosis, treatment, and prevention of nontuberculous mycobacterial diseases. Am J Respir Crit Care Med. 2007;175:367–416.

  18. Nakashima K, Aoshima M, Nakashita T, Hara M, Otsuki A, Noma S, et al. Low-dose trimethoprim-sulfamethoxazole treatment for pneumocystis pneumonia in non-human immunodeficiency virus-infected immunocompromised patients: a single-center retrospective observational cohort study. J Microbiol Immunol Infect. 2018;51:810–20.

  19. International Society for Pharmacoepidemiology. Guidelines for good pharmacoepidemiology practices (GPP). Pharmacoepidemiol Drug Saf. 2008;17:200–8.

  20. Council for International Organizations of Medical Sciences. International ethical guidelines for epidemiological studies. 2009. https://cioms.ch/publications/product/international-ethical-guidelines-for-epidemiological-studies/. Accessed 1 Jul 2022.

  21. Uno S, Asakura T, Morimoto K, Yoshimura K, Uwamino Y, Nishimura T, et al. Comorbidities associated with nontuberculous mycobacterial disease in Japanese adults: a claims-data analysis. BMC Pulm Med. 2020;20:262.

  22. Matsuoka K, Togo K, Yoshii N, Hoshi M, Arai S. Incidence rates for hospitalized infections, herpes zoster, and malignancies in patients with ulcerative colitis in Japan: an administrative health claims database analysis. Intest Res. 2023;21:88–99.

  23. Electronic Medicines Compendium. Atovaquone/proguanil hydrochloride 250 mg/100 mg film-coated tablets. 2017. https://www.medicines.org.uk/emc/product/9902/smpc. Accessed 1 Jul 2022.

  24. Ghembaza A, Vautier M, Cacoub P, Pourcher V, Saadoun D. Risk factors and prevention of pneumocystis jirovecii pneumonia in patients with autoimmune and inflammatory diseases. Chest. 2020;158:2323–32.

  25. Haworth CS, Banks J, Capstick T, Fisher AJ, Gorsuch T, Laurenson IF, et al. British Thoracic Society guidelines for the management of non-tuberculous mycobacterial pulmonary disease (NTM-PD). Thorax. 2017;72:ii1–ii64.

Acknowledgments

Medical writing support, under the direction of the authors, was provided by Lauren Hogarth MSc, CMC Connect, a division of IPG Health Medical Communications, and was funded by Pfizer, New York, NY, USA, in accordance with Good Publication Practice (GPP 2022) guidelines (Ann Intern Med 2022;175:1298–304).

Funding

This study was sponsored by Pfizer.

Author information

Contributions

All authors participated in the conception and development of this article. R.H., D.S., T.H., K.K., M.K., S.J., T.M., and Y.T. were involved in data interpretation. H.C., E.N., S.S., and Y.T. were involved in data collection and analysis. R.H. and D.S. were involved in the adjudication process. All authors critically reviewed the manuscript and approved the final draft prior to submission.

Corresponding authors

Correspondence to Naonobu Sugiyama or Soko Setoguchi.

Ethics declarations

Ethical approval and consent to participate

The Independent Ethics Committee and the Institutional Review Boards at Japanese Red Cross Narita Hospital and Kameda Medical Center approved the study protocol. The study was conducted in accordance with accepted practices for pharmacoepidemiology studies issued by the International Society for Pharmacoepidemiology [19] and the Council for International Organizations of Medical Sciences [20]. Patients identified in the claims databases were not required to provide consent and could opt out of participating in the study. As this study was a cross-sectional review using de-identified claims data and medical records, involved no invasive procedures or interventions, and did not use samples from human participants, written or oral informed consent was waived by the Institutional Review Boards of Japanese Red Cross Narita Hospital and Kameda Medical Center. This study used an opt-out approach to guarantee opportunities for study participants to obtain disclosed information and to refuse to participate in this study.

Consent for publication

Not applicable.

Competing interests

R.H., E.N., T.H., M.K., S.J., and T.M. have received consultancy fees from Pfizer Inc in connection with this study. D.S. has received consultancy fees from Pfizer Inc in connection with this study and is currently affiliated with the Department of Infectious Diseases, Anjo Kosei Hospital, Anjo, Aichi, Japan. K.K. has received consultancy fees from Pfizer Inc in connection with this study and is currently affiliated with the Faculty of Informatics, Gunma University, Maebashi, Gunma, Japan. H.C. has received funding from Pfizer Inc in connection with this study through her university and has received funding from the Cystic Fibrosis Foundation, National Institutes of Health, and Pfizer Inc. C.D. and N.S. are employees of Pfizer Inc. Y.T. has received speaker fees and/or honoraria from AbbVie, AstraZeneca, Boehringer Ingelheim, Bristol Myers Squibb, Chugai Pharmaceutical Co., Ltd, Daiichi Sankyo, Eisai, Eli Lilly, Gilead, GlaxoSmithKline, Mitsubishi Tanabe Pharma, and Pfizer Inc; and research grants from AbbVie, Asahi Kasei, Boehringer Ingelheim, Chugai Pharmaceutical Co., Ltd, Daiichi Sankyo, Eisai, and Takeda. S.S. has received consultancy fees from Pfizer Inc in connection with this study and has received funding from Bristol Myers Squibb, the Cystic Fibrosis Foundation, Janssen, National Institutes of Health, PCORI, Pfizer Inc, and Pfizer Japan Inc, and personal consulting fees from Daiichi Sankyo, Janssen, Medtronic, Merck, and Pfizer Inc.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Hase, R., Suzuki, D., de Luise, C. et al. Validity of claims-based diagnoses for infectious diseases common among immunocompromised patients in Japan. BMC Infect Dis 23, 653 (2023). https://doi.org/10.1186/s12879-023-08466-8

  • DOI: https://doi.org/10.1186/s12879-023-08466-8
