Chimeric forecasting: combining probabilistic predictions from computational models and human judgment

BMC Infectious Diseases

Table 1 Five procedures were chosen to impute missing forecasts

Imputation technique	G	Summary
Mean	\(I^{-1}\sum _{i}{q_{i,k}}\)	Take the mean of all present quantiles where the set I is an index for present forecasts
Median	\(\min _{x} \left\{ F(x)-1/2 \right\}\)	Take the median of all present quantiles where F is the empirical cdf over all I quantiles
Bayesian Ridge regression	\({\mathbb {E}}(X) \text { where } X\sim {\mathcal {N}}(Q_{-k}\beta ,\sigma ^{2})\) \(\beta \sim {\mathcal {N}}(0,\uplambda ^{-1} I) \;\sigma ^{2} \sim \Gamma (\alpha ,\gamma )\)	The matrix \(Q_{-k}\) has two columns: a column of ones and a second column of quantiles from present forecasts.
Decision Tree regression	–	The missing quantile value is imputed by the mean of quantiles in the same partition.
Extremely Randomized Trees	–	Multiple decision trees \((D_{i})\) are fit to random subsets of quantiles and the missing forecast is imputed as the average over \(D_{i}\).

Mean and median imputation only use information about a single quantile to impute missing forecasts, while the three regression approaches use all the quantiles from all present forecasts to impute missing forecasts

ISSN: 1471-2334