A hybrid model for hand-foot-mouth disease prediction based on ARIMA-EEMD-LSTM

Wan, Yiran; Song, Ping; Liu, Jiangchen; Xu, Ximing; Lei, Xun

doi:10.1186/s12879-023-08864-y

Research
Open access
Published: 15 December 2023

A hybrid model for hand-foot-mouth disease prediction based on ARIMA-EEMD-LSTM

Yiran Wan^1,2,3,4,
Ping Song⁵,
Jiangchen Liu⁶,
Ximing Xu⁵ &
…
Xun Lei^1,2,3,4

BMC Infectious Diseases volume 23, Article number: 879 (2023) Cite this article

972 Accesses
1 Citations
Metrics details

Abstract

Background

Hand, foot, and mouth disease (HFMD) is a common infectious disease that poses a serious threat to children all over the world. However, the current prediction models for HFMD still require improvement in accuracy. In this study, we proposed a hybrid model based on autoregressive integrated moving average (ARIMA), ensemble empirical mode decomposition (EEMD) and long short-term memory (LSTM) to predict the trend of HFMD.

Methods

The data used in this study was sourced from the National Clinical Research Center for Child Health and Disorders, Chongqing, China. The daily reported incidence of HFMD from 1 January 2015 to 27 July 2023 was collected to develop an ARIMA-EEMD-LSTM hybrid model. ARIMA, LSTM, ARIMA-LSTM and EEMD-LSTM models were developed to compare with the proposed hybrid model. Root mean square error (RMSE), mean absolute error (MAE) and coefficient of determination (R²) were adopted to evaluate the performances of the prediction models.

Results

Overall, ARIMA-EEMD-LSTM model achieved the most accurate prediction for HFMD, with RMSE, MAPE and R² of 4.37, 2.94 and 0.996, respectively. Performing EEMD on the residual sequence yields 11 intrinsic mode functions. EEMD-LSTM model is the second best, with RMSE, MAPE and R² of 6.20, 3.98 and 0.996.

Conclusion

Results showed the advantage of ARIMA-EEMD-LSTM model over the ARIMA model, the LSTM model, the ARIMA-LSTM model and the EEMD-LSTM model. For the prevention and control of epidemics, the proposed hybrid model may provide a more powerful help. Compared with other three models, the two integrated with EEMD method showed significant improvement in predictive capability, offering novel insights for modeling of disease time series.

Peer Review reports

Background

Hand, foot, and mouth disease (HFMD) is a common infectious disease caused by a group of enteroviruses, particularly among children under the age of 5 [1, 2]. The main symptoms of HFMD are fever, rashes and ulcers in the hand, foot, or oral mucosa [3]. Although HFMD tends to be mild and self-limiting in the majority of patients, it can lead to neurological complications, pulmonary edema, and even death in severe cases [4].

A high prevalence of HFMD has been reported in Asia [5]. During the past two decades, HFMD outbreaks were reported frequently from Asian countries such as Japan [6], Malaysia [7], Singapore [8] and Vietnam [9]. In China, the number of HFMD confirmed cases has been the first among all the statutory reported infectious diseases since it was included in the management of Category C statutory infectious diseases in 2008 [10].

Accurate prediction and early warning for the trend of HFMD provides valuable insights for the rational allocation of medical resources and assist in the prevention and control of HFMD. Since the outbreak of HFMD, researchers around the world have conducted numerous studies and developed various prediction models. Autoregressive integrated moving average (ARIMA) model, which has been widely used for epidemic prediction during the past decades, has been built based on the historical data from China [11, 12] and Malaysia [7]. However, as a linear model, ARIMA is not good at extracting non-linear features. Deep learning algorithms perform better when deal with non-linear features of time series. Long short-term memory (LSTM) model is one of the most popular deep learning algorithms, which has been increasingly used for epidemic prediction in recent years [11, 13]. To further improve the prediction accuracy, a variety of hybrid models have been proposed. The combination of ARIMA and LSTM combines the advantages of both linear and non-linear models to achieve higher accuracy, which has been applied in various fields such as environmental science [14] and epidemic prediction [15].

Empirical Mode Decomposition (EMD) is a data analysis method that has gained attention in various fields for its ability to decompose complex and nonstationary signals [16]. However, EMD suffers from mode mixing and end effects, which limit its applicability. In order to address these issues, Ensemble Empirical Mode Decomposition (EEMD) was proposed as an improved version of EMD. EEMD overcomes the limitations of EMD by adding a noise-assisted step to the decomposition process [17]. It involves repeatedly adding white noise to the original signal and decomposing the resulting ensemble of signals using EMD. By averaging the obtained intrinsic mode functions (IMFs) across the ensemble, EEMD effectively reduces mode mixing and suppresses the end effects. It provides a valuable tool for analyzing and extracting meaningful information from complex and nonlinear data. By decomposing the data into simpler and more suitable components, EEMD allows for better understanding of the underlying patterns and facilitates the application of subsequent analysis techniques.

In this study, we trained a hybrid ARIMA-EEMD-LSTM model based on the daily incidence of HFMD extracted from the largest pediatric hospital in Chongqing, China from 2015 to 2021, to predict the incidence of HFMD in 2022 and 2023. The proposed hybrid model is expected to provide more accurate predictions and better decision-making support for disease prevention and control.

Materials and methods

Study design and data collection

The data used in this study was obtained from the Clinical Research Data Platform of the Children's Hospital of Chongqing Medical University (the National Clinical Research Center for Child Health and Disorders in China), which contained the clinical information of more than 8,000,000 pediatric outpatients and inpatients as of July 2023. We collected the daily confirmed incidence of HFMD from 1 January 2015 to 27 July 2023. 80% of the data was used as the training set to train the prediction models, and the remaining 20% was used as the test set to evaluate model performance.

Autoregressive integrated moving average (ARIMA)

ARIMA is one of the classical methods of time series prediction analysis, whose main idea is to transform the non-stationary time series into stationary time series by differencing before model development. The process of constructing an ARIMA model is as follows: ADF test is performed on the differenced series to judge its stationarity, determine the number of differences d of the stationary series obtained, and determine the parameters p and q by autocorrelation function (ACF) and partial autocorrelation function (PACF), and then get the ARIMA (p, d, q) model.

Ensemble empirical mode decomposition (EEMD)

Empirical mode decomposition (EMD) is a method proposed by E. Huang for the analysis and processing of non-linear and non-stationary signals, which considers that any signal can be split into several intrinsic mode functions (IMF) and a residual component [16]. The EMD decomposition is given as

$${\mathrm{X}}_{(\mathrm{t})}={\sum }_{\mathrm{i}=1}^{\mathrm{n}}{\mathrm{c}}_{\mathrm{i}}(\mathrm{t})+\mathrm{r}(\mathrm{t})$$

${\mathrm{X}}_{(\mathrm{t})}$ is the original time series, ${\mathrm{c}}_{\mathrm{i}}(\mathrm{t})$ is the i^th IMF, r(t) is the residual.

However, the EMD method has some limitations because the IMFs obtained from its decomposition suffer from mode mixing. In order to solve this problem, Huang E and his team developed an improved EMD method based on noise-assisted analysis, named ensemble empirical mode decomposition (EEMD) [17].

The essence of the EEMD method is a multiple EMD with superimposed Gaussian white noise, which takes advantage of the statistical property of Gaussian white noise with uniform frequency distribution to change the polar characteristics of the signal by adding different white noise of the same amplitude each time, and then the corresponding IMF obtained from the multiple EMD is averaged to cancel the added white noise, thus effectively suppressing the generation of modal aliasing.

Long short-term memory (LSTM)

Long Short-Term Memory, is a special type of recurrent neural network (RNN) model that effectively addresses the issue of long-term dependencies commonly encountered in conventional RNN models. LSTM enhances the performance of the traditional RNN architecture by incorporating a more complex cell structure in the hidden layer. In contrast to the regular RNN, LSTM introduces three gate controllers, namely the input gate, forget gate, and output gate, which are controlled by sigmoid functions and combined with the tanh function. Additionally, a summation operation is applied to reduce the possibility of gradient vanishing or exploding. These modifications enable the LSTM neural network to maintain longer-term memory capacity, as illustrated in Fig. 1.

The previous short-term memory h_t-1 and the current input features x_t undergo the forget gate f_t, input gate (update gate) i_t, and $\widetilde{c}$ (a candidate for the new cell state), followed by the output gate o_t, and then normalized through activation functions. The formulas are as follows:

$$\begin{array}{c}{f}_{t}=\sigma ({W}_{f}\left[{h}_{t-1},{x}_{t}\right]+{b}_{f})\\ {i}_{t}=\sigma ({W}_{i}\left[{h}_{t-1},{x}_{t}\right]+{b}_{i})\\ \begin{array}{c}{\widetilde{C}}_{t}=tanh({W}_{C}\left[{h}_{t-1},{x}_{t}\right]+{b}_{C})\\ {C}_{t}={f}_{t}*{C}_{t-1}+{i}_{t}*{\widetilde{C}}_{t}\end{array}\end{array}$$

C_t is the updated cell state.

$$\begin{array}{c}{o}_{t}=\sigma ({W}_{o}\left[{h}_{t-1},{x}_{t}\right]+{b}_{o})\\ {h}_{t}={o}_{t}*{\mathrm{tanh}(C}_{t})\end{array}$$

h_t is the output vector at the current timestep.

ARIMA-EEMD-LSTM

The distribution of HFMD incidence data along the time dimension exhibits both linear and nonlinear features. The ARIMA model, belonging to linear models, can only capture the linear characteristics in time series data, while the LSTM neural network can compensate for this limitation. Moreover, for complex nonlinear and non-stationary residual sequences, there may exist multiple fluctuation patterns simultaneously, which makes it challenging for the LSTM model to learn their features comprehensively and accurately. The Empirical Mode Decomposition Ensemble (EEMD) method can analyze nonlinear and non-stationary time series by decomposing them into several relatively simpler component sequences, facilitating the LSTM model to learn their patterns effectively.

In this study, we combined the advantages of the ARIMA and LSTM models. The ARIMA model was utilized to capture the linear temporal features in the time series data, while the LSTM model was employed to fit the nonlinear temporal features in the residual sequence. Additionally, the EEMD method was applied to decompose the residual sequence before constructing the LSTM model, thereby improving the prediction performance. The proposed ARIMA-EEMD-LSTM combination model consisted of the following steps: firstly, an ARIMA model was built for the original time series, and the residuals were obtained by subtracting the model's predicted values from the actual values; secondly, the residual sequence was decomposed using the EEMD method into several Intrinsic Mode Function (IMF) components and a trend component; thirdly, individual LSTM models were constructed and used to predict each component sequence, and the predicted values from each model were summed to reconstruct the residual predictions; finally, the ARIMA model's predicted values were adjusted by subtracting the residual predictions, yielding the final predictions of the combination model.

The structure of the ARIMA-EEMD-LSTM combination model is illustrated in Fig. 2.

Evaluation of model performance

To provide a comprehensive evaluation of the model performance, this study adopted root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R²), to assess the model performance from different perspectives. The formulas for calculating these metrics are as follows:

$$\mathrm{RMSE}=\sqrt{\frac{1}{\mathrm{n}}\sum_{\mathrm{i}=1}^{\mathrm{n}}{({\widehat{\mathrm{y}}}_{\mathrm{i}}-{\mathrm{y}}_{\mathrm{i}})}^{2}}$$

$$\mathrm{MAE}=\frac{1}{\mathrm{n}}\sum_{\mathrm{i}=1}^{\mathrm{n}}\left|{\widehat{\mathrm{y}}}_{\mathrm{i}}-{\mathrm{y}}_{\mathrm{i}}\right|$$

$${R}^{2}=1-\frac{\sum_{i=1}^{n}{({\widehat{y}}_{i}-{y}_{i})}^{2}}{\sum_{i=1}^{n}{({\overline{y} }_{i}-{y}_{i})}^{2}}$$

${y}_{i}$ is the actual value at the i-th time point, ${\widehat{y}}_{i}$ is the predicted value at the i-th time point, and ${\overline{y} }_{i}$ is the mean value of the actual values.

Statistical analysis

In this study, we used R for data preprocessing and the developing of ARIMA model, while we employed Python for EEMD decomposition and LSTM model development. For the LSTM model, we set the number of neurons to 100 and the dropout rate to 0.2. We used Mean Squared Error (MSE) as the loss function. Regarding parameters such as batch_size, epoch, and optimizer, we employed a grid search tuning approach to select the best parameter set..

Results

The development of ARIMA-EEMD-LSTM

In this study, the original time series was divided into a training set, covering the period from 1 January 2015, to 7 January 2022 (80% of the data), and a testing set, covering the period from 8 January 2022, to 27 July 2023 (20% of the data). A rolling forecast approach was employed, where 60 days of historical data were used to predict the next 1 day.

To begin, the 'forecast' package in R was utilized. The 'auto.arima' function was employed to identify the optimal model parameters for the training data, resulting in the creation of an ARIMA(5,1,2) model. The ARIMA model was fitted to the training set and used to make predictions on the testing set.

The EEMD method was applied to decompose the residual series of the ARIMA model, and the results are shown in Fig. 3. The original residual series was decomposed into 11 IMF series and 1 trend series. The IMF series with lower indices represent high-frequency signals in the original sequence, while the IMF series with higher indices represent low-frequency signals. From the decomposition results, it can be observed that the original data contains significant high-frequency signals. When these signals are included in the original time series, they are not easily learned by the LSTM model. However, separating these signals facilitates the learning process for LSTM.

These decomposed series were used as inputs to train the LSTM models, and the performances of these models on the testing set is shown in Fig. 4. It can be observed that the predicted values of each component series closely match the true values in terms of numerical values and trend, without significant lag.

The predicted values of the IMF series and the trend series were summed up to obtain the predicted results of the residual series, as shown in Fig. 5. Compared to the actual residual series, the predicted series demonstrates strong consistency in terms of frequency and amplitude of fluctuations, indicating a good predictive effect for the residual series.

Finally, the predicted values of the ARIMA model and the residual series were added up to obtain the final predicted values, which were compared to the true values in Fig. 6. From the figure, it can be observed that the model accurately predicts the changing trend of the original time series and can capture significant fluctuations.

The development of other models

In this study, we developed 4 more models as comparison: the ARIMA model, the LSTM model, the ARIMA-LSTM model and the EEMD-LSTM model. The results of those models are shown in Supplemental Figures 1–4.

Model evaluation and comparison

The evaluation results of the hybrid ARIMA-EEMD-LSTM model, as well as the ARIMA, LSTM, ARIMA-LSTM, and EEMD-LSTM models on the training set and the testing set, are shown in Table 1.

Table 1 Comparison of the prediction performances between ARIMA-EEMD-LSTM and other models

Full size table

The proposed ARIMA-EEMD-LSTM model achieved an RMSE of 4.37, MAE of 2.94, and an R² of 0.996 on the testing set, demonstrating accurate predictions of the incidence of HFMD. In comparison, the ARIMA model had an RMSE of 6.95, MAE of 3.68, and an R² of 0.990, while the LSTM model had an RMSE of 13.93, MAE of 8.07, and an R² of 0.961. The hybrid model outperformed these single models in accuracy and goodness of fit, achieving better predictive performance.

Furthermore, two other hybrid models, ARIMA-LSTM and EEMD-LSTM, were also developed. On the testing set, the ARIMA-LSTM model had an RMSE of 9.85, MAE of 8.11, and an R² of 0.980, while the EEMD-LSTM model had an RMSE of 6.20, MAE of 3.98, and an R² of 0.992. Compared with the LSTM model, EEMD-LSTM showed improvements in RMSE from 13.93 to 6.20, MAE from 8.07 to 3.98, and R² from 0.961 to 0.992. Compared with the ARIMA-LSTM model, ARIMA-EEMD-LSTM showed improvements in RMSE from 9.85 to 4.37, MAE from 8.11 to 2.94, and R² from 0.980 to 0.996. These results indicate that the inclusion of the EEMD method significantly enhances the predictive performance of the models.

Overall, the hybrid ARIMA-EEMD-LSTM model demonstrates superior predictive accuracy and fitness compared with the ARIMA, LSTM, ARIMA-LSTM, and EEMD-LSTM models. The addition of the EEMD method contributes to the improvement of the model's predictive performance.

Discussion

In this study, we proposed a novel hybrid prediction model which combined the strength of linear statistical model, advanced deep learning model and the cutting-edge EEMD technology to achieve accurate prediction for HFMD incidence. The proposed hybrid ARIMA-EEMD-LSTM model outperformed the other four prediction models developed in this study-ARIMA, LSTM, ARIMA-LSTM and EEMD-LSTM according to the evaluation results, which means the ARIMA-EEMD-LSTM model provides more accurate predictions.

ARIMA, as a classical time series prediction model, has been applied widely in disease predictions [18,19,20]. However, since belongs to lineal models, ARIMA can only capture the linear characteristics. Many time series in real world contain a mixture of linear and non-linear features, which poses challenges for the predictions of ARIMA model. But the deep learning algorithm can compensate for this limitation. The combination of ARIMA model and LSTM model,the widely used deep learning model for time series,keeps ARIMA’s advantage in capturing linear trends and dependencies within time series while excels at capturing complex,nonlinear patterns and long-term dependencies.

EEMD is a novel technology for processing non-linear and non-stationary data, and has been successfully applied in various fields [21,22,23]. However, there have been few studies which use EEMD for epidemic predictions. With EEMD method, complex data can be decomposed into relatively simple components that are more suitable for model training. This compensates for the limitation of the LSTM model in dealing with nonstationary time series.

In this study, we compared the hybrid ARIMA-EEMD-LSTM model with two single models-ARIMA and LSTM, and two hybrid models-ARIMA-LSTM and EEMD-LSTM. The evaluation results showed that the ARIMA-EEMD-LSTM model exhibited the best predictive performance with the RMSE, MAPE and R² of 4.37, 2.94 and 0.996, respectively. The predcition performance of the proposed model suggests its potential utility in epidemic prevention and control. And the two models integrated with EEMD method showed significant improvement in predictive capability when compared with other three models. The inclusion of EEMD can have great impact on model performance, offering novel insights for modeling of disease time series.

There are also several limitations in this study. Firstly, the data used in this study were from the National Children's Regional Medical Center (Southwest Region), and more cross-center studies are needed to verify the validity and generalizability of the results. Secondly, models developed in this study only utilized daily cases of HFMD, and more related factors such as temperature and humidity should be considered to furtherly enhance the prediction performance.

Conclusion

In conclusion, this study proposed an innovative hybrid ARIMA-EEMD-LSTM model for predicting the incidence of HFMD. By integrating the strengths of the ARIMA model, LSTM model, and EEMD method, the hybrid model achieved enhanced prediction accuracy and fit, and can serve as a valuable tool for healthcare professionals and policymakers in understanding and managing the spread of HFMD and other epidemics.

Availability of data and materials

The dataset used in the study are available from the corresponding author on reasonable request.

Abbreviations

HFMD:: Hand, foot and mouth disease
ARIMA:: Autoregressive integrated moving average
EMD:: Empirical mode decomposition
EEMD:: Ensemble empirical mode decomposition
LSTM:: Long short-term memory

References

Xing W, Liao Q, Viboud C, Zhang J, Sun J, Wu JT, et al. Hand, foot, and mouth disease in China, 2008–12: an epidemiological study. Lancet Infect Dis. 2014;14(4):308–18.
Article PubMed PubMed Central Google Scholar
Park K, Lee B, Baek K, Cheon D, Yeo S, Park J, et al. Enteroviruses isolated from herpangina and hand-foot-and-mouth disease in Korean children. Virol J. 2012;9:205.
Article PubMed PubMed Central Google Scholar
Alsop J, Flewett TH, Foster JR. “Hand-foot-and-mouth disease” in Birmingham in 1959. Br Med J. 1960;2(5214):1708–11.
Article CAS PubMed PubMed Central Google Scholar
Huang CC, Liu CC, Chang YC, Chen CY, Wang ST, Yeh TF. Neurologic complications in children with enterovirus 71 infection. N Engl J Med. 1999;341(13):936–42.
Article CAS PubMed Google Scholar
Koh WM, Bogich T, Siegel K, Jin J, Chong EY, Tan CY, et al. The epidemiology of hand, foot and mouth disease in asia: a systematic review and analysis. Pediatr Infect Dis J. 2016;35(10):e285-300.
Article PubMed PubMed Central Google Scholar
Gonzalez G, Carr MJ, Kobayashi M, Hanaoka N, Fujimoto T. Enterovirus-associated hand-foot and mouth disease and neurological complications in Japan and the rest of the world. Int J Mol Sci. 2019;20(20):5201.
Article CAS PubMed PubMed Central Google Scholar
Jayaraj VJ, Hoe VCW. Forecasting HFMD cases using weather variables and google search queries in Sabah, Malaysia. Int J Environ Res Public Health. 2022;19(24):16880.
Article PubMed PubMed Central Google Scholar
Kua JA, Pang J. The epidemiological risk factors of hand, foot, mouth disease among children in Singapore: a retrospective case-control study. PLoS ONE. 2020;15(8):e0236711.
Article CAS PubMed PubMed Central Google Scholar
Nhan LNT, Turner HC, Khanh TH, Hung NT, Lien LB, Hong NTT, et al. Economic burden attributed to children presenting to hospitals with hand, foot, and mouth disease in Vietnam. Open Forum Infect Dis. 20191;6(7):284.
Y H, H J, W S, C D, T C, L C, et al. Disease burden in patients with severe hand, foot, and mouth disease in Jiangsu Province: a cross-sectional study. Human vaccines & immunotherapeutics. 2022;18(5). Available from: https://pubmed.ncbi.nlm.nih.gov/35476031/. Cited 13 Aug 2023
Zhang R, Guo Z, Meng Y, Wang S, Li S, Niu R, et al. Comparison of ARIMA and LSTM in Forecasting the Incidence of HFMD Combined and Uncombined with Exogenous Meteorological Variables in Ningbo, China. Int J Environ Res Public Health. 2021;18(11):6174.
Article PubMed PubMed Central Google Scholar
Liu L, Luan RS, Yin F, Zhu XP, Lü Q. Predicting the incidence of hand, foot and mouth disease in Sichuan province, China using the ARIMA model. Epidemiol Infect. 2016;144(1):144–51.
Article CAS PubMed Google Scholar
Borges D, Nascimento MCV. COVID-19 ICU demand forecasting: a two-stage Prophet-LSTM approach. Appl Soft Comput. 2022;125:109181.
Article PubMed PubMed Central Google Scholar
Xu D, Zhang Q, Ding Y, Zhang D. Application of a hybrid ARIMA-LSTM model based on the SPEI for drought forecasting. Environ Sci Pollut Res Int. 2022;29(3):4128–44.
Article PubMed Google Scholar
Yang E, Zhang H, Guo X, Zang Z, Liu Z, Liu Y. A multivariate multi-step LSTM forecasting model for tuberculosis incidence with model explanation in Liaoning Province, China. BMC Infect Dis. 2022;22(1):490.
Article CAS PubMed PubMed Central Google Scholar
Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen N-C, Tung CC, Liu HH. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc London Ser A Math Phys Eng Sci. 1998;454:903–95.
Article Google Scholar
Wu Z, Huang NE. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal. 2009;01(01):1–41.
Article Google Scholar
Wang M, Pan J, Li X, Li M, Liu Z, Zhao Q, et al. ARIMA and ARIMA-ERNN models for prediction of pertussis incidence in mainland China from 2004 to 2021. BMC Public Health. 2022;22(1):1447.
Article PubMed PubMed Central Google Scholar
Alabdulrazzaq H, Alenezi MN, Rawajfih Y, Alghannam BA, Al-Hassan AA, Al-Anzi FS. On the accuracy of ARIMA based prediction of COVID-19 spread. Results Phys. 2021;27:104509.
Article PubMed PubMed Central Google Scholar
Zhang R, Song H, Chen Q, Wang Y, Wang S, Li Y. Comparison of ARIMA and LSTM for prediction of hemorrhagic fever at different time scales in China. PLoS ONE. 2022;17(1):e0262009.
Article CAS PubMed PubMed Central Google Scholar
Shao L, Guo Q, Li C, Li J, Yan H. Short-term load forecasting based on EEMD-WOA-LSTM combination model. Appl Bionics Biomech. 2022;2022:2166082.
Article PubMed PubMed Central Google Scholar
Xie Z, Li Z, Mo C, Wang J. A PCA-EEMD-CNN-Attention-GRU-Encoder-Decoder Accurate Prediction Model for Key Parameters of Seawater Quality in Zhanjiang Bay. Materials (Basel). 2022;15(15):5200.
Article CAS PubMed Google Scholar
Zhao J, Nie G, Wen Y. Monthly precipitation prediction in Luoyang city based on EEMD-LSTM-ARIMA model. Water Sci Technol. 2023;87(1):318–35.
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Children’s Hospital of Chongqing Medical University for providing the data of confirmed hand, foot and mouth disease cases in Chongqing.

Funding

This work was supported by the National Key Research and Development Program of China (No. 2022YFC2704900), the National Natural Science Foundation of China (No. 72174033) and the Program for Youth Innovation in Future Medicine, Chongqing Medical University (No. W0013).

Author information

Authors and Affiliations

School of Public Health, Chongqing Medical University, Chongqing, China
Yiran Wan & Xun Lei
Research Center for Medicine and Social Development, Chongqing, China
Yiran Wan & Xun Lei
Collaborative Innovation Center of Social Risks Governance in Health, Chongqing Medical University, Chongqing, China
Yiran Wan & Xun Lei
Research Center for Public Health Security, Chongqing Medical University, No1 Medical College Rd, Yuzhong District, Chongqing, 400016, People’s Republic of China
Yiran Wan & Xun Lei
Big Data Center for Children’s Medical Care, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, No 136. Zhongshan 2Nd Rd, Yuzhong District, Chongqing, 400014, People’s Republic of China
Ping Song & Ximing Xu
School of Mathematical Science, Chongqing Normal University, Chongqing, China
Jiangchen Liu

Authors

Yiran Wan
View author publications
You can also search for this author in PubMed Google Scholar
Ping Song
View author publications
You can also search for this author in PubMed Google Scholar
Jiangchen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ximing Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xun Lei
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XL and XMX designed the study. YRW collected and analyzed the data and wrote the manuscript. PS and JCL conducted the literature review and managed the project. All authors contributed to research performing, drafting, and revising the article, gave final approval of the version to be published, and agree to be accountable for all aspects of the work. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ximing Xu or Xun Lei.

Ethics declarations

Ethics approval and consent to participate

The ethics committee of Children’s Hospital of Chongqing Medical University approved this study protocol and waived the need for informed consent of the patients.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Wan, Y., Song, P., Liu, J. et al. A hybrid model for hand-foot-mouth disease prediction based on ARIMA-EEMD-LSTM. BMC Infect Dis 23, 879 (2023). https://doi.org/10.1186/s12879-023-08864-y

Download citation

Received: 23 August 2023
Accepted: 04 December 2023
Published: 15 December 2023
DOI: https://doi.org/10.1186/s12879-023-08864-y

A hybrid model for hand-foot-mouth disease prediction based on ARIMA-EEMD-LSTM

Abstract

Background

Methods

Results

Conclusion

Background

Materials and methods

Study design and data collection

Autoregressive integrated moving average (ARIMA)

Ensemble empirical mode decomposition (EEMD)

Long short-term memory (LSTM)

ARIMA-EEMD-LSTM

Evaluation of model performance

Statistical analysis

Results

The development of ARIMA-EEMD-LSTM

The development of other models

Model evaluation and comparison

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Infectious Diseases

Contact us