Table 1 Description and properties for each dataset

From: Machine learning pipeline for blood culture outcome prediction using Sysmex XN-2000 blood sample results in Western Australia

| Dataset | Time period | Overview |
| --- | --- | --- |
| Training | Between 1 January 2018 and 31 December 2019 | The training set contains 10965 samples. 10134 of these blood samples were taken with negative BC results (92.42%), and 831 were drawn with positive BC results (7.58%). |
| Internal validation | Between 1 January 2020 and 31 May 2020 | This set contains 318 samples. 292 of these blood samples were drawn with negative BC results (91.82%), and 26 were drawn with positive BC results (8.18%). |
| External validation | Between 1 January 2020 and 31 May 2020 | This set contains 1245 samples. 1138 of these blood samples were drawn with negative BC results (91.41%), and 107 were drawn with positive BC results (8.59%). |
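
The class proportions reported in Table 1 can be rechecked from the raw counts. The following is a minimal sketch, not part of the original pipeline: the counts are copied from the table, while the `counts` dictionary and the `class_balance` helper are hypothetical names introduced here for illustration.

```python
# Counts copied from Table 1: (negative BC, positive BC) per dataset.
# The dictionary layout and helper name are illustrative assumptions.
counts = {
    "Training": (10134, 831),
    "Internal validation": (292, 26),
    "External validation": (1138, 107),
}


def class_balance(negative, positive):
    """Return (total, % negative, % positive), percentages rounded to 2 d.p."""
    total = negative + positive
    pct_negative = round(100 * negative / total, 2)
    pct_positive = round(100 * positive / total, 2)
    return total, pct_negative, pct_positive


for name, (neg, pos) in counts.items():
    total, pct_neg, pct_pos = class_balance(neg, pos)
    print(f"{name}: n={total}, negative={pct_neg}%, positive={pct_pos}%")
```

In all three sets, positive BC results make up roughly 8% of samples, so the class imbalance is similar across training and validation data.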