Skip to main content

A study to design minimum data set of COVID-19 registry system



From the beginning of the COVID-19 pandemic, the development of infrastructures to record, collect and report COVID-19‏ ‏data has become a fundamental necessity in the world. The disease registry system can help build an infrastructure to collect data systematically. The study aimed to design a minimum data set for the COVID-19 registry system.


A qualitative study to design an MDS for the COVID-19 registry system was performed in five phases at Ahvaz University of Medical Sciences in Khuzestan Province in southwestern Iran, 2020–2021. In the first phase, assessing the information requirements was performed for the COVID-19 registry system. Data elements were identified in the second phase. In the third phase, the MDS was selected, and in the four phases, the COVID-19 registry system was implemented as a pilot study to test the MDS. Finally, based on the experiences gained from the COVID-19 registry system implementation, the MDS were evaluated, and corrections were made.


MDS of the COVID-19 registry system contains eight top groups including administrative (34 data elements), disease exposure (61 data elements), medical history and physical examination (138 data elements), findings of clinical diagnostic tests (101 data elements), disease progress and outcome of treatment (55 data elements), medical diagnosis and cause of death (12 data elements), follow-up (14 data elements), and COVID-19 vaccination (19 data elements) data, respectively.


Creating a standard and comprehensive MDS can help to design any national data dictionary for COVID-19 and improve the quality of COVID-19 data.

Peer Review reports


Although nearly 16 months have passed since the COVID-19 outbreak, and the vaccination is underway in most countries [1], there is still a long way to go to control the pandemic fully [2].

Inequality in vaccine access, insufficient access to vaccines in most poor countries [3,4,5], and COVID-19 virus mutations are some of the biggest challenges to control the COVID-19 pandemic, especially in developing countries [6]. Besides, there are still many questions about the long-term side effects of recovered patients and the impact of immunity from vaccination [7].

From the beginning of the pandemic, the rapid and surprising outbreak of COVID-19, its complex and unknown nature, has created data collection infrastructures for it, has become a fundamental necessity worldwide [8]. The fundamental step in designing a system to record, collect and report data on a particular disease is determining the target data elements [9]. One of the most well-known tools to determine data elements for disease registration and reporting systems is the minimum data set (MDS) [10]. MDS is a standardized set of data elements defined in a health information system to index the minimum amount of essential data for different users in healthcare sectors [11]. By precisely defining data elements, MDS creates a common language among all those involved in recording, collecting, analyzing, reporting, and interpreting data and ensures the collection of essential data [12, 13]. Furthermore, MDS helps improve medical records documentation, data comparability, data dictionary design, electronic data exchange among different healthcare systems and ultimately improve data quality [12, 14].

This feature has made MDS determination in disease registry systems one of the main steps to implement the system. Based on nature and application, the MDS data elements of a disease can be divided into two general categories of administrative and clinical data. The administrative data usually include demographic, socio-economic, address, phone number, patient referral data, and the main characteristic of a healthcare provider or healthcare center [15,16,17,18].

The clinical data vary depending on the type of diseases but generally include diagnosis, past medical history, laboratory findings, medical imaging findings, health interventions, disease course and progress, and outcome [16, 19]. In the case of COVID-19, the data structure is almost the same. Most case report forms, datasets, information systems, and MDSs designed for COVID-19 include patient demographic data, patient’s addresses, and telephone, Profile of healthcare center, how did infected with COVID-19, early signs and symptoms, previous health condition and underlying diseases, COVID-19 testing, CT scan findings, and disease outcome [13, 17, 20,21,22].

In Iran, at the beginning of the COVID-19 epidemic, the Ministry of Health, as the custodian of the health system, set up two information systems for reporting COVID-19 in healthcare centers. Although these two systems collect basic information about patients with COVID-19, there are problems with data quality and data adequacy to support COVID-19 research [2]. Perhaps this is why several universities of medical sciences in Iran have launched a COVID-19 registry system [22,23,24,25,26].

According to the important role of MDS designing in the success of a disease registry system, this study was performed to design a minimum data set for the COVID-19 registry system in Khuzestan province, southwestern Iran.


A qualitative study to design MDS for the COVID-19 registry system was conducted five phases at Ahvaz University of Medical Sciences in Khuzestan Province in southwestern Iran from March 2020 to March 2021.

Phase 1: Study design and information needs assessment

The purpose of this phase was to answer the following critical questions:

  • What is the information that users such as researchers and policymakers of the registry program need for the future?

  • According to those requirements, what information of COVID-19 should be recorded?

  • What details are available about specific information that is recorded?

  • What resources can identify the information?

  • What are the minimum data elements?

  • Given the urgency of setting up the COVID-19 registry system, what methods can be used to determine MDS?

According to the above-mentioned, a project team was formed under the supervision of the Diseases and Health Outcomes Registry Systems Office in Deputy of Research and Technology of Ahvaz Jundishapur University of Medical Sciences (AJUMS).

The project team reviews the titles of the COVID-19 research proposals submitted to AJUMS as well as interviews with some researchers and experts in the field of disease registry. Moreover, the project team interview with senior managers of Deputy of Research and Technology in AJUMS. In addition, a survey was done on the structure and contents of health information systems such as SIB (an abbreviation for the Persian equivalent of “Integrated Health System”) and hospital information system (HIS), Medical Care Monitoring Center ‏system (a system for reporting COVID-19 cases in hospitals) and the Management of Communicative Disease Prevention and Control system (a system for reporting COVID-19 cases from health centers). Also, the protocol and MDS of previous programs of disease registry at the university were studied. Furthermore, a list of the required information of the COVID-19 registry system were prepared.

In total, the following seven general areas were identified as information needed for the COVID-19 registry system in Khuzestan province:

  1. 1.

    Epidemiological study of COVID-19 prevalence and incidence.

  2. 2.

    The pattern of COVID-19 outbreaks and the factors affecting it.

  3. 3.

    The role of lifestyle in COVID-19 risk.

  4. 4.

    Symptoms, signs, and abnormal medical imaging and laboratory findings of COVID-19.

  5. 5.

    The course of the disease, drug treatment, and the patient’s response to care.

  6. 6.

    Outcomes of COVID-19‏ ‏and the effect of underlying health conditions on its outcomes.

  7. 7.

    Follow up of treated patients.

Phase 2: Identification of data elements

In the second phase, the goal was to identify the data elements for each of the information needs specified in the previous phase. To identify the data elements, the project team first compiled a list of resources that could contribute to this goal. The resources included articles related to COVID-19, COVID-19 data collection tools, health information systems in Iran, medical records of patients with COVID-19, and expert opinions (such as specialist physicians, health information management specialists, and …). Table 1 describes the information resources used, the search strategy, and the data acquisition method.

Table 1 The information sources used for data gathering

The collections form was used as an instrument. The form contained five sections: the name of the data element, class, application, value, and additional descriptions. The form was designed based on a literature review on MDSs [15, 27,28,29,30,31,32,33].

After collecting data, a list of data elements was prepared using the content analysis method. Duplicate or overlap items were removed. After that, similar items were categorized into the main class, subclass, data element, and values. In general, 475 data elements were identified in 18 subclasses and eight main classes.

Phase 3: Select MDS

At this phase, the purpose was to select MDS from the data elements identified in the previous phase. The expert panel method was used to select MDS. Firstly, a list of medical specialists who are members of the university hospitals of Ahvaz and involved in the care of patients with COVID-19 was prepared. Next, the project team was selected some specialists from the list based on indicators such as interest to join the registry and have a position in the relevant specialized department. The expert panels included medical specialists and some members of the project team. Table 2 shows the demographic characteristics of participants in the expert panels. In the panels, a list of data elements that scaled with the Likert score (1–5) was distributed among participants. During the meetings, data element and their values and feasibility for collection it such as accessibility, method, and time of gathering was discussed. Finally, the primary data elements for MDS were selected.

Table 2 Demographic characteristics of participants in the expert panel

Furthermore, some new data elements suggested by experts were considered. All data elements with more than 50% of agreement among panelists were elected to apply in the primary MDS.

Phase 4: Field-testing

Based on primary MDS, we developed web-based COVID-19 registry software (available at using the Rapid Application Development (RAD) technique. In addition to registry software, paper-based forms were designed to help collect data from the hospitals. In the paper forms and registry software, data elements were divided into six top groups: administrative, disease exposure, medical history and physical examination, clinical diagnostic test findings, disease progress and outcome of treatment, and follow-up‏ ‏data. A WhatsApp group that had the project team and experts members was created. The designed forms were shared in the group, and the members were asked to comment on them. After reviewing the members’ comments, the approved comments were applied to the forms.

In the next step, to evaluate the following items, the registry program was tested as a pilot in Razi hospital in Ahvaz city (the capital of Khuzestan province). The evaluated items included structure and content of the designed forms, data values, mandatory fields in the registry software, and identify new data needs. Razi is a 200-bed university hospital considered the main hospital for admitting patients with COVID-19 since late February 2020, when the first positive COVID-19 case was diagnosed in Khuzestan province.

An official letter was sent to Razi Hospital and to make an announcement to invite those interested in participating in the COVID-19 registry program. The Eleven staff of the hospital were selected to participate in the registry system. After training the participants, the registry system was implemented as a pilot for 3 weeks. The participants joined a WhatsApp group to share their opinions and suggestions regarding the data elements, the data collection forms, and the registry software. In addition, two persons of participants were selected to be responsible for monitoring the data collection process and data quality control. They documented all weaknesses and problems related to the data collection process and submitted their recommendations daily. At the end of the pilot, the comments were summarized and submitted to the project manager. Based on the results of the pilot study, some changes were made to MDS, forms, and registry software. The changes included deleting and adding some data elements, renaming the subclass title or data element field name, and modifying their values.

Phase 5: Evaluate and modify MDS

At the end of May 2020, the COVID-19 registry program was implemented. Six months later, the MDS was evaluated based on the feedback obtained from the initial data analysis and the experiences gotten of the implementation process.

We modified the MDS, removed useless data elements, and added new data elements to the data collection forms and registry software. Finally, when the COVID-19 vaccination was launched in March 2021 in Ahvaz, a new top group was created for recording the vaccination data.


In the second phase of the study, 375 data elements were categorized into 14 subclasses and eight main classes based on a review of information resources. Then, in the third phase, according to the output of Expert panels, 11 data elements were eliminated, and 25 new data elements were added to the MDS.

Also, according to expert comments, some changes were made in the subclasses, and the number of subclasses increased to 27. For example, the clinical laboratory tests class was divided into five subclasses: hematology test, biochemistry tests, arterial blood gas test, hemostasis test, and other diagnostic tests finding.

After the pilot study, 12 data elements were deleted due to the impossibility of collecting in the routine process of patients with COVID-19 care. Also, 25 new data elements suggested by the participants and approved by the project team were added to the MDS. For example, the ICD10 code and the causes of death recorded in the death certificate were added.

Since to avoid users’ misconceptions and to become data elements transparent, the titles of 13 data elements were renamed.

The values of 26 data elements were also modified in the software. For example, values related to education, job title, and discharge status were changed.

Also, pilot study results showed that the data collection process should be in accordance with the patient care process. Furthermore, to facilitate data collection, the structure of data collection forms and the registry software should be as similar as possible to the medical record forms and HIS.

In the fifth phase, the database was evaluated and modified again. The modifications covered adding a new top group, two main classes, 12 subclasses, and 59 new data elements, as well as removing five subclass and 26 useless data elements. For example, due to the COVID-19 outbreak in Khuzestan province, some data element such as “disease exposure data top group”, and two subclass including “observance of safety measures” and “contact with health care facility” have lost their importance, and finally were removed.

The final MDS for the COVID-19 registry system was divided into eight top groups, 18 main classes, 70 subclasses, and 434 data elements (Table 3). The details of the final MDS are shown in Tables 4, 5, 6, 7, 8, 9, 10 and 11.

Table 3 Overall view MDS for the COVID-19 registry system
Table 4 Administrative data about a probable or confirmed cases of COVID-19
Table 5 Disease exposure data about a probable or confirmed case of COVID-19
Table 6 Medical history and physical examination data about a probable or confirmed case of COVID-19
Table 7 Clinical diagnostic test findings data about a probable or confirmed case of COVID-19
Table 8 Disease progress and outcome of treatment data in a probable or confirmed case of COVID-19
Table 9 Medical diagnosis and cause of death data in a probable or confirmed case of COVID-19
Table 10 Follow-up data in a probable or confirmed case of COVID-19
Table 11 COVID-19 vaccination‏ ‏data

In general, the administrative data group includes two main classes, five subclasses, and 34 data elements (Table 4).

In general, the disease exposure data group includes two main classes, seven subclasses, and 61 data elements (Table 5).

In general, the medical history and physical examination data group embraces four main classes, 27 subclasses, and 138 data elements (Table 6).

In general, the clinical diagnostic test findings data group consists of three main classes, 12 subclasses, and 101 data elements (Table 7).

In general, the disease progress and outcome of treatment data group includes three main classes, 12 subclasses, and 55 data elements (Table 8).

In general, medical diagnosis and cause of death data group consists of three main classes, six subclasses, and 12 data elements (Table 9).

In general, follow-up‏ ‏data group embraces one main class, three subclasses, and 14 data elements (Table 10).

In general, the COVID-19 vaccination‏ ‏data group includes one main class, two subclasses and, and 19 data elements (Table 11).


A study on the coronavirus family showed that creating a data set can help disease control intervention [35]. The lack of MDS is one of the technical barriers to data sharing in public health [36]. Therefore, from the beginning of the COVID-19 outbreak, World Health Organization has recommended a “Revised case reporting form for 2019 Novel‏ ‏Coronavirus of confirmed and probable cases” to report COVID-19 cases and has requested national authorities to design MDS for reporting COVID-19 based on the form [37]. However, that form may not be suitable for the COVID-19 registry system. Since the many aspects of COVID-19 were unknown at the beginning of the pandemic, there were a lot of questions about the disease in the minds of scientists and health policymakers. Therefore, the COVID-19 registry system should provide comprehensive information [2]. This study tried to fulfill that requirement. The approach used to design MDS included identifying the required data elements, the initial design of MDS, the pilot test, and modification of MDS based on the results obtained from the implementation. Stanfill et al., in the study entitled “Health information management best practices for quality health data during COVID-19 global pandemic”, had the same approach as our study. They emphasized identifying the data elements, considered the expected results, and evaluated the output data to justify data elements [38].

In the MDS for the COVID-19 registry system, 396 data elements were identified in eight top groups, 17 main classes, and 67 subclasses. The first top group included administrative data that are commonly used to identify and register patients, identify a healthcare institution, reimburse the costs of healthcare services [39, 40], do medical research, and carry out a survey on the outcome and quality of healthcare [21, 41].

The second top group was disease exposure data. These data can help to determine the method of individual transmission of disease and identify early signs and symptoms. Therefore, in order to show how the patient was exposed to infection, the exposure data was used in the coronavirus family data set. For instance, COVID-19 forms and dataset [37], British government MDS form for MERS disease [42], and MDS for SARS [35].

Another main issue with COVID-19 is the new variants of the virus. One of the principal concerns about the coronavirus variants is whether the mutations could affect treatment and prevention or not [43]. The importance of recording COVID-19 variants led to consider a subclass for infection with mutated strains of COVID-19 in the fifth phase of the study.

The third top group of MDS was dedicated to medical history and physical examination data. It included 22 subclasses for recording signs and symptoms of COVID-19 at the time of admission, as well as documenting the patient’s underlying medical conditions. From the beginning of the COVID-19 outbreak, much attention was paid to identifying the signs and symptoms of the disease and distinguishing it from other infectious diseases. Moreover, since the early studies in China showed that underlying medical conditions impact on COVID-19 outcome [44], in all forms, MDS or dataset, and health information systems related to COVID-19, the data elements of underlying medical conditions were considered. Furthermore, the information needs assessment phase indicated that most titles of COVID-19 research proposals submitted to AJUMS focused on the effect of underlying medical conditions on the outcome of COVID-19. In addition, most of the data elements in the two national COVID-19 recording and reporting systems of the Iranian Ministry of Health (Management of Communicative Disease Prevention and Control and MCMC system) are dedicated to recording the sign and symptoms and underlying diseases. Therefore, in the MDS of the COVID-19 registry system, a classified approach was used to define the symptoms and underlying diseases to record them more accurately.

The fourth top group of MDS was related to clinical diagnostic test findings. The purpose of the three main classes in this group was to record abnormal medical imaging, laboratory, and cardiology diagnostic tests in patients with COVID-19, especially those hospitalized. One of the most important of the data was medical imaging data. The findings of CT scan and chest x-ray play an essential role in the diagnosis, the clinical course of the disease, and the choice of treatment for patients with COVID-19 [45,46,47].

The importance of imaging data has encouraged the Iranian Ministry of Health, in collaboration with the Iranian Society of Radiology to propose a standard form for reporting medical imaging data in patients with COVID-19 [48]. In the same manner, the Radiological Society of North America (RSNA) also has provided a standard classification for reporting CT scan findings on COVID-19 [49]. The study indicated that laboratory diagnostic data and cardiology diagnostic tests were other essential data that medical specialists emphasized on registering them. The reason for this attention may be the importance of the data elements in the treatment of inpatients with COVID-19 [50,51,52,53,54].

Disease progress and outcome of treatment data were the next top group of MDS. The data shows a period of the disease and the outcome of care. If the data combine with other data such as age, sex, underlying medical conditions and comorbidity, and clinical diagnostic test findings, they can facilitate many statistical analysis and artificial intelligence data modeling that can be applied in reliable prediction about the outcome of the disease based on extracting hidden relationships among the clinical and non-clinical data [55,56,57].

Therefore, 52 data elements were considered to record clinical course (such as hospitalization), disease complications, treatment measures, and disease outcome. The disease complications and outcomes were considered essential data in other forms, MDSs, and data sets related to COVID-19. For example, Shanbehzadeh and Kazemi-Arpanahi, in their study entitled “Development of minimal basic data set to report COVID-19”, proposed two subclasses for data elements of disease complications and outcomes [13].

Another questionable issue related to the COVID-19 is the effectiveness of various medications on the treatment outcome. Although the forms and databases for recording and reporting COVID-19 do not usually mention the medications used to treat the patient, we have considered a separate class for medications because the topic is important for medical research.

The data elements related to medications may help to identify the effects of different medications in the treatment of COVID-19.

The top group for medical diagnosis and cause of death data was considered to record the relationship between primary and final diagnosis and the cause of death in patients with COVID-19.

The group included diagnostic (based on WHO’s ICD-10 classification system), causes of death, and medical procedures codes (based on ICD9CM Volume 3).

Clinical coding of medical records can help to manage statistical data about diseases and causes of death at the national or international levels [58]. Therefore, that data could facilitate retrieving information about patients with COVID-19 in the future. Most studies conducted to design MDS of diseases have suggested using diagnostic data elements [16, 27, 29].

Coinciding with the COVID-19 Pandemic, the WHO proposed new ICD codes for the disease to help classify and report it. WHO also published a guideline for recording and coding the causes of death due to COVID-19 [59]. In addition to WHO efforts, some countries have established a process to record COVID-19 coding data. For example, in the Irish COVID-19 data set, “diagnosis concepts” is considered for recording diagnostic data [60]. Also, in the United States, several guidelines were published by the CDC, AHIMA, and AMA regarding the recording and coding of diagnoses and medical procedures for COVID-19 [13, 61, 62]. As the same manner, Iran’s Ministry of Health has already issued three guidelines for coding COVID-19 [48].

The top group was follow-up data. In the group, data related to patient monitoring up to 2 weeks after discharge from the hospital or quarantine at home was recorded.

The patient follow-up is important for two reasons: firstly, to ensure that the signs and symptoms of the disease improve, and secondly, to track people who are in close contact with the patient. Therefore, the patient follow-up is essential protocols in controlling COVID-19 especially for a patient who is in quarantine at home.

Another fundamental way to effectively control the disease is vaccination. COVID-19 vaccination started in March 2021 in Ahvaz. Therefore, at the beginning of vaccination, we considered a top group of MDS to record its data. Recording vaccination information is valuable in several ways. Firstly, the data can be used to evaluate the effectiveness of different vaccines. Secondly, they help to evaluate the vaccination effectiveness on disease control. Finally, the data can aid in assessing the side effects of vaccines.


The MDS designed in the present study has tried as much as possible to cover the essential data for the COVID-19 registry system. Using a comprehensive MDS and a systematic approach to data collection of COVID-19 can provide valuable information for health policymakers, researchers, and clinical specialists.

Given the importance of having accurate, complete, and up-to-date information on COVID-19 disease control, it strongly recommends that different countries should design a comprehensive national MDS for COVID-19. The methodology used in this study can also help other developing countries that do not have a comprehensive infrastructure for disease recording and reporting systems to implement similar disease registry systems.

Limitation of study

At the beginning of the project, there were few related studies to our topic. Also, due to the difficulty in long-term follow-up of patients, data elements have not been considered for long-term COVID-19 complications.

Further study

We recommend that future studies can focus on the effectiveness of the COVID-19 registry system in data quality. Also, we strongly recommend that an international MDS be designed for the long-term complications of COVID-19.

Availability of data and materials

The datasets used during the current study are available from the first author on reasonable request.



American Health Information Management Association


Ahvaz Jundishapur University of Medical Sciences


Acute kidney injury


American Medical Association


Acute respiratory distress syndrome


Body mass index


Blood urea nitrogen


Coronary artery disease


Center for Disease Control


Congestive heart failure


Chronic kidney disease


Chronic obstructive pulmonary disease


C-reactive protein


Cerebra vascular accident


Disseminated intravascular coagulation


Glasgow coma scale/score






Hospital information system


International Committee on Taxonomy of Viruses


Interstitial lung disease


International normalized ratio


Lactic acid dehydrogenase


Length of stay


Mean corpuscular hemoglobin concentration


Medical Care Monitoring Center System


Mean corpuscular volume


Minimum data set


Multiple sclerosis


N-terminal pro brain natriuretic peptide




Pulmonary regurgitation


Prothrombin time


Partial thromboplastin time


Rapid Application Development


Red cell distribution width


Tricuspid regurgitation


White blood cell


World Health Organization


  1. WHO Coronavirus (COVID-19) dashboard.

  2. Zarei J, Dastoorpoor M, Jamshidnezhad A, Cheraghi M, Sheikhtaheri A. Regional COVID-19 registry in Khuzestan, Iran: a study protocol and lessons learned from a pilot implementation. Inform Med Unlocked. 2021;23:100520.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Alaran AJ, Adebisi YA, Badmos A, Khalid-Salako F, Ilesanmi EB, Olaoye DQ, Bamisaiye A, Lucero-Prisno DE III. Uneven power dynamics must be levelled in COVID-19 vaccines access and distribution. Public Health Pract. 2021;2:100096.

    Article  Google Scholar 

  4. Shrestha GS, Lamsal R. Rational use of oxygen in COVID-19 pandemic-are we doing enough? J Nepal Med Assoc. 2021;59(236):429–31.

    Article  Google Scholar 

  5. Acharya KP, Ghimire TR, Subramanya SH. Access to and equitable distribution of COVID-19 vaccine in low-income countries. npj Vaccines. 2021;6(1):1–3.

    Article  CAS  Google Scholar 

  6. Zhao S, Lou J, Chong MK, Cao L, Zheng H, Chen Z, Chan RW, Zee BC, Chan PK, Wang MH. Inferring the association between the risk of COVID-19 case fatality and N501Y substitution in SARS-CoV-2. Viruses. 2021;13(4):638.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Forni G, Mantovani A. COVID-19 vaccines: where we stand and challenges ahead. Cell Death Differ. 2021;28:626–39.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Ahmed K, Bukhari MA, Mlanda T, Kimenyi JP, Wallace P, Lukoya CO, Hamblion EL, Impouma B. Novel approach to support rapid data collection, management, and visualization during the COVID-19 outbreak response in the world health organization African region: development of a data summarization and visualization tool. JMIR Public Health Surveill. 2020;6(4):e20355.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Kodra Y, Weinbach J, Posada-de-la-Paz M, Coi A, Lemonnier SL, van Enckevort D, Roos M, Jacobsen A, Cornet R, Ahmed SF. Recommendations for improving the quality of rare disease registries. Int J Environ Res Public Health. 2018;15(8):1644.

    Article  PubMed Central  Google Scholar 

  10. Nouhjah S, Shahbazian H, Zarei J, Sharifi M. Designing a data set for postpartum recall registry of women with gestational diabetes in recent pregnancy and its implementation in Iranian urban health centers. Diabetes Metab Syndr. 2021;15(1):1–6.

    Article  Google Scholar 

  11. Domensino A-F, Winkens I, van Haastregt JC, van Bennekom CA, van Heugten CM. Defining the content of a minimal dataset for acquired brain injury using a Delphi procedure. Health Qual Life Outcomes. 2020;18(1):1–10.

    Article  Google Scholar 

  12. Abbasi R, Khajouei R, Mirzaee M. Evaluating the demographic and clinical minimum data sets of Iranian National Electronic Health Record. BMC Health Serv Res. 2019;19(1):1–10.

    Article  Google Scholar 

  13. Shanbehzadeh M, Kazemi-Arpanahi H. Development of minimal basic data set to report COVID-19. Med J Islam Repub Iran. 2020;34:111.

    PubMed  PubMed Central  Google Scholar 

  14. Shanbehzadeh M, Ahmadi M. Identification of the necessary data elements to report AIDS: a systematic review. Electron Physician. 2017;9(12):5920.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Ahmadi M, Madani T, Alipour J. Development a national minimum data set (MDS) of the information management system for disability in Iran. Disabil Health J. 2019;12(4):641–8.

    Article  PubMed  Google Scholar 

  16. Ahmadi M, Mohammadi A, Chraghbaigi R, Fathi T, Baghini MS. Developing a minimum data set of the information management system for orthopedic injuries in iran. Iran Red Crescent Med J. 2014;16(7):e17020.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Shanbehzadeh M, Kazemi-Arpanahi H, Mazhab-Jafari K, Haghiri H. Coronavirus disease 2019 (COVID-19) surveillance system: development of COVID-19 minimum data set and interoperable reporting framework. J Educ Health Promot. 2020;9:203.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Bitarafan B, Jahanshahi A, Shahbazian H, Nooshabadi Z. Designing a minimum data set for thyroid cancer in Iran. Int J Cancer Manage. 2020.

  19. Rampisheh Z, Kameli ME, Zarei J, Barzaki AV, Meraji M, Mohammadi A. Developing a national minimum data set for hospital information systems in the Islamic Republic of Iran. East Mediterr Health J. 2020;26(4):400–9.

    Article  PubMed  Google Scholar 


  21. Weingart SN, Iezzoni LI, Davis RB, Palmer RH, Cahalane M, Hamel MB, Mukamal K, Phillips RS, Davies DT Jr, Banks NJ. Use of administrative data to find substandard care: validation of the complications screening program. Med Care. 2000;38:796–806.

    Article  CAS  PubMed  Google Scholar 

  22. Javanmard SH, Nasirian M, Ataei B, Vaseghi G, Vaezi A, Changiz T. Isfahan COvid-19 REgistry (I-CORE): design and methodology. J Res Med Sci. 2020;25:32.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Talebpour M, Hadadi A, Oraii A, Ashraf H. Rationale and design of a registry in a referral and educational medical center in Tehran, Iran: Sina Hospital Covid-19 Registry (SHCo-19R). Front Emerg Med. 2020;4(2s):e53–e53.

    Google Scholar 

  24. Khorrami F, Shahi M, DavariDolatabadi N, Karami NA, HasaniAzad M, Jafariyan F, Sheikhtaheri A. Implementation of regional COVID-19 registry in Hormozgan (RCovidRH), Iran: rationale and study protocol. Med J Islam Repub Iran. 2020;34:96.

    PubMed  PubMed Central  Google Scholar 

  25. Emami A, Akbari A, Javanmardi F, Lotfi M, Moghadami M, Sharifi M, Bakhtiari H, Falahati F. Designing a multicenter registry of COVID-19 and other respiratory infections in Fars, Iran. Disaster Med Public Health Prep. 2021.

  26. Kazemi-Arpanahi H, Moulaei K, Shanbehzadeh M. Design and development of a web-based registry for Coronavirus (COVID-19) disease. Med J Islam Repub Iran. 2020;34:68.

    PubMed  PubMed Central  Google Scholar 

  27. Rampisheh Z, Kameli M, Zarei J, Vahedi Barzaki A, Meraji M, Mohammadi A. Developing a national minimum data set for hospital information systems in the Islamic Republic of Iran. East Mediterr Health J. 2019;25:400–9.

    Google Scholar 

  28. Abbasi M, Ahmadian L, Amirian M, Tabesh H, Eslami S. The development of a minimum data set for an infertility registry. Perspect Health Inf Manage. 2018;15(Winter):1b.

    Google Scholar 

  29. Ahmadi M, Alipour J, Mohammadi A, Khorami F. Development a minimum data set of the information management system for burns. Burns. 2015;41(5):1092–9.

    Article  PubMed  Google Scholar 

  30. Hayrinen K, Saranto K. The core data elements of electronic health record in Finland. Stud Health Technol Inform. 2005;116:131–6.

    PubMed  Google Scholar 

  31. Nsw Health Client Data Stream Data Dictionary.

  32. COVID-19 Dataset Specification. Ver 06. Irish National Release Centre (NRC); 2020.

  33. National Health Board. National Minimum Dataset (Hospital Events) DATA DICTIONARY, vol. 7.9.1. Wellington: Ministry of Health; 2014.

  34. Zhan C, Miller M. Administrative data based patient safety research: a critical review. Qual Saf Health Care. 2003;12(suppl 2):ii58–63.

    PubMed  PubMed Central  Google Scholar 

  35. Scott RD 2nd, Gregg E, Meltzer MI. Collecting data to assess SARS interventions. Emerg Infect Dis. 2004;10(7):1290–2.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Sane J, Edelstein M. Overcoming barriers to data sharing in public health. A global perspective. London: Chatham House; 2015.

    Google Scholar 

  37. World Health Organization. Revised case report form for confirmed novel Coronavirus COVID-19 (report to WHO within 48 hours of case identification). Geneva: WHO; 2020.

    Google Scholar 

  38. Stanfill M, Giannangelo K, Fenton S. Health information management best practices for quality health data during the COVID-19 global pandemic. In: Perspectives in health information management, vol. Winter 2020; 2020.

  39. De Coster C, Quan H, Finlayson A, Gao M, Halfon P, Humphries KH, Johansen H, Lix LM, Luthi J-C, Ma J. Identifying priorities in methodological research using ICD-9-CM and ICD-10 administrative data: report from an international consortium. BMC Health Serv Res. 2006;6(1):77.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Wager KA, Lee FW, Glaser JP. Managing health care information systems: a practical approach for health care executives. Hoboken: Wiley; 2005.

    Google Scholar 

  41. Zhan C, Miller M. Administrative data based patient safety research: a critical review. BMJ Qual Saf. 2003;12(suppl 2):ii58–63.

    Google Scholar 

  42. MERS-CoV: minimum data set form for possible cases.

  43. Le Page M. Threats from new variants. Amsterdam: Elsevier; 2021.

    Book  Google Scholar 

  44. Yang C, Jin Z. An acute respiratory infection runs into the most common noncommunicable epidemic-COVID-19 and cardiovascular diseases. JAMA Cardiol. 2020;5(7):743–4.

    Article  PubMed  Google Scholar 

  45. Bernheim A, Mei X, Huang M, Yang Y, Fayad ZA, Zhang N, Diao K, Lin B, Zhu X, Li K. Chest CT findings in coronavirus disease-19 (COVID-19): relationship to duration of infection. Radiology. 2020;295:200463.

  46. Li Y, Xia L. Coronavirus disease 219 (COVID-19): role of chest CT in diagnosis and management. Am J Roentgenol. 2020;214:1280–6.

  47. Tenda ED, Yulianti M, Asaf MM, Yunus RE, Septiyanti W, Wulani V, Pitoyo CW, Rumende CM, Setiati S. The importance of chest CT scan in COVID-19. Acta Med Indones. 2020;52(1):68–73.

    PubMed  Google Scholar 

  48. The Ministry of Health and Medical Education. Diagnosis recording and coding instructions of COVID-19, based on the 10th revision of the International Statistical Classification of Diseases and Related Health Problems (ICD 10), Version2. Tehran: The Ministry of Health and Medical Education; 2020.

    Google Scholar 

  49. Reporting of COVID-19 chest CT findings.

  50. Fan BE, Chong VCL, Chan SSW, Lim GH, Lim KGE, Tan GB, Mucheli SS, Kuperan P, Ong KH. Hematologic parameters in patients with COVID-19 infection. Am J Hematol. 2020;95:E215.

    CAS  PubMed  PubMed Central  Google Scholar 

  51. Tian S, Liu H, Liao M, Wu Y, Yang C, Cai Y, Peng Z, Xiao S-Y. Analysis of mortality in patients of COVID-19: clinical and laboratory parameters. In: Open forum infectious diseases. Oxford: Oxford University Press.

  52. Inciardi RM, Lupi L, Zaccone G, Italia L, Raffo M, Tomasoni D, Cani DS, Cerini M, Farina D, Gavazzi E. Cardiac involvement in a patient with coronavirus disease 2019 (COVID-19). JAMA Cardiol. 2020;5(7):819–24.

  53. Skulstad H, Cosyns B, Popescu BA, Galderisi M, Salvo GD, Donal E, Petersen S, Gimelli A, Haugaa KH, Muraru D. COVID-19 pandemic and cardiac imaging: EACVI recommendations on precautions, indications, prioritization, and protection for patients and healthcare personnel. Eur Heart J-Cardiovasc Imaging. 2020;21(6):592–8.

    Article  PubMed  Google Scholar 

  54. Liu Y, Du X, Chen J, Jin Y, Peng L, Wang HH, Luo M, Chen L, Zhao Y. Neutrophil-to-lymphocyte ratio as an independent risk factor for mortality in hospitalized patients with COVID-19. J Infect. 2020;81:e6–12.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  55. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, Xiang J, Wang Y, Song B, Gu X. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Petrilli CM, Jones SA, Yang J, Rajagopalan H, O’Donnell LF, Chernyak Y, Tobin K, Cerfolio RJ, Francois F, Horwitz LI. Factors associated with hospitalization and critical illness among 4,103 patients with COVID-19 disease in New York City. MedRxiv. 2020.

  57. Guo T, Fan Y, Chen M, Wu X, Zhang L, He T, Wang H, Wan J, Wang X, Lu Z. Cardiovascular implications of fatal outcomes of patients with coronavirus disease 2019 (COVID-19). JAMA Cardiol. 2020;5:811.

    Article  PubMed  Google Scholar 

  58. Organization WH. ICD-10: international statistical classification of diseases and related health problems: tenth revision. Geneva: WHO; 2004.

    Google Scholar 

  59. Emergency use ICD codes for COVID-19 disease outbreak.

  60. Mounesan L, Eybpoosh S, Haghdoost A, Moradi G, Mostafavi E. Is reporting many cases of COVID-19 in Iran due to strength or weakness of Iran’s health system? Iran J Microbiol. 2020;12(2):73–6.

    PubMed  PubMed Central  Google Scholar 

  61. International Classification of Diseases, tenth revision, clinical modification (ICD-10-CM).

  62. COVID-19 News & Resources.

Download references


Authors are express gratitude and appreciation for cooperating of Dr. Ali Mohammad Hadianfard‬ for editing the manuscript.‬


Deputy of Research in Ahvaz Jundishapur University of Medical Sciences has found this project for implementation during pandemic disease of COVID-19 in the world. But this fund is no effect on the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations



JZ: Study conception and design, drafting the manuscript. MB, MK: consult scientific. MHS, HR: data collection. MD: data analysis. FY: study conception. MC: supervised the research, study conception, critical revision. All authors read and approved the manuscript.

Corresponding author

Correspondence to Maria Cheraghi.

Ethics declarations

Ethics approval and consent to participate

Ethical issues such as plagiarism, data fabrication, data falsification, conflict of interest, misconduct, double publication and/or submission, redundancy, etc. have been completely observed by the authors. Ethics committee of Ahvaz Jondishapur University of Medical Sciences, Ahvaz, Iran approved the study with Ethical approval code: IR.AJUMS.REC.1399.061. Written informed consent was obtained from all participants in this study.

Consent for publication

Not applicable.

Competing interests

The authors state that there is no competing of interest in this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zarei, J., Badavi, M., Karandish, M. et al. A study to design minimum data set of COVID-19 registry system. BMC Infect Dis 21, 773 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Minimum data set
  • Registry system
  • COVID-19