- Statist. Sci.
- Volume 31, Number 1 (2016), 40-60.
Analysis Methods for Computer Experiments: How to Assess and What Counts?
Statistical methods based on a regression model plus a zero-mean Gaussian process (GP) have been widely used for predicting the output of a deterministic computer code. There are many suggestions in the literature for how to choose the regression component and how to model the correlation structure of the GP. This article argues that comprehensive, evidence-based assessment strategies are needed when comparing such modeling options. Otherwise, one is easily misled. Applying the strategies to several computer codes shows that a regression model more complex than a constant mean either has little impact on prediction accuracy or is an impediment. The choice of correlation function has modest effect, but there is little to separate two common choices, the power exponential and the Matérn, if the latter is optimized with respect to its smoothness. The applications presented here also provide no evidence that a composite of GPs provides practical improvement in prediction accuracy. A limited comparison of Bayesian and empirical Bayes methods is similarly inconclusive. In contrast, we find that the effect of experimental design is surprisingly large, even for designs of the same type with the same theoretical properties.
Statist. Sci., Volume 31, Number 1 (2016), 40-60.
First available in Project Euclid: 10 February 2016
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Chen, Hao; Loeppky, Jason L.; Sacks, Jerome; Welch, William J. Analysis Methods for Computer Experiments: How to Assess and What Counts?. Statist. Sci. 31 (2016), no. 1, 40--60. doi:10.1214/15-STS531. https://projecteuclid.org/euclid.ss/1455115913
- Supplement to “Analysis Methods for Computer Experiments: How to Assess and What Counts?”. This report (whatcounts-supp.pdf) contains further description of the test functions and data from running them, further results for root mean squared error, findings for maximum absolute error, further results on uncertainty of prediction, and details of the simulation investigating regression terms. Inputs to the Arctic sea-ice code—ice-x.txt. Outputs from the code—ice-y.txt.