x
The following code compares dynamic and static models built using a Gibbs sampler and automatic function builder based on BSS-ANOVA GPs to Random Forest and Neural Nets.

The GP models are built using a static derivative dataset in the scripts SIR_GP_Model and CasTanks_GP_5fold (for the SIR data and Cascaded Tanks data respectively). The static cascaded tanks derivative model is then compared to a Random Forest and Residual Neural Net in the scripts CasTanksRF and CasTanksNN.

The static derivative models created by the GP were saved in the files 'SIR GP Models.mat' and 'GP 5fold Models.mat'. These models were loaded and integrated for the initial conditions and forcing functions present in the test data in the scripts SIR_GP_5K4Int CasTank_GP_RK4Int. Bounds for the integrated model's predictions were also produced in SIR_GP_RK4Bounds and CasTank_GP_Bounds. Dynamic sequence based Neural Network models were created for both problems as comparisons. Their models and results were produced in the following scripts: SIR_GRU, SIR_LSTM, CasTanksGRU and CasTanksLSTM.

Select results were saved for the GP and LSTM dynamic cascaded tanks problem in the files 'GP CasTanks Fifth Fold Results.mat' and 'LSTM CasTanks Fifth Fold Results.mat'. These results were then plotted in greater detail in the script CasTanksPlots. Similarly, several results of the GP model for the dynamic SIR problem were saved in the files 'Ext Wavy SIR.mat', 'Less Wavy SIR.mat' and 'Ramp SIR.mat'. These results were also plotted in greater detail in the script SIR_Plots. The corresponding plots for the GRU results in this case come directly from the SIR_GRU script.

A brief overview of the functions used in building the GP models follows.

The main routine is 'emulator' -- notes in the preamble to that function describe the inputs and outputs.

The coefficients ('betas') and interaction matrix ('mtx') may be used directly within the main GP evaluation routine 'bss_eval'.

The multiple-input, single-output GP functions are constructed through an unsupervised learning routine utilizing linear Bayesian inference through Gibbs sampling on a sequence of models. The Bayesian information criterion is used to track model fitness scores; the routine requires a sequential number increases in the BIC or AIC (the BIC optimum is a minimum) as specified by the tolerance before model termination, at which point the optimum model is returned.

Results can be visualized with coverage
