Conditional forecasts of risk measures play an important role in internal risk management of financial institutions as well as in regulatory capital calculations. In order to assess forecasting performance of a risk measurement procedure, risk measure forecasts are compared to the realized financial losses over a period of time and a statistical test of correctness of the procedure is conducted. This process is known as backtesting. Such traditional backtests are concerned with assessing some optimality property of a set of risk measure estimates. However, they are not suited to compare different risk estimation procedures. We investigate the proposal of comparative backtests, which are better suited for method comparisons on the basis of forecasting accuracy, but necessitate an elicitable risk measure. We argue that supplementing traditional backtests with comparative backtests will enhance the existing trading book regulatory framework for banks by providing the correct incentive for accuracy of risk measure forecasts. In addition, the comparative backtesting framework could be used by banks internally as well as by researchers to guide selection of forecasting methods. The discussion focuses on three risk measures, Value at Risk, expected shortfall and expectiles, and is supported by a simulation study and data analysis.
Ann. Appl. Stat.
11(4):
1833-1874
(December 2017).
DOI: 10.1214/17-AOAS1041
Bank for International Settlements (2013). Consultative document: Fundamental review of the trading book: A revised marked risk framework. Available at http://www.bis.org/publ/bcbs265.pdf.Bank for International Settlements (2013). Consultative document: Fundamental review of the trading book: A revised marked risk framework. Available at http://www.bis.org/publ/bcbs265.pdf.
Bank for International Settlements (2014). Consultative document: Fundamental review of the trading book: Outstanding issues. Available at http://www.bis.org/bcbs/publ/d305.pdf.Bank for International Settlements (2014). Consultative document: Fundamental review of the trading book: Outstanding issues. Available at http://www.bis.org/bcbs/publ/d305.pdf.
Bollerslev, T. and Wooldridge, J. M. (1992). Quasi-maximum likelihood estimation and inference in dynamic models with time-varying covariances. Econometric Rev. 11 143–172.Bollerslev, T. and Wooldridge, J. M. (1992). Quasi-maximum likelihood estimation and inference in dynamic models with time-varying covariances. Econometric Rev. 11 143–172.
Christoffersen, P. F. (1998). Evaluating interval forecasts. Internat. Econom. Rev. 39 841–862. MR1661906 10.2307/2527341Christoffersen, P. F. (1998). Evaluating interval forecasts. Internat. Econom. Rev. 39 841–862. MR1661906 10.2307/2527341
Cont, R., Deguest, R. and Scandolo, G. (2010). Robustness and sensitivity analysis of risk measurement procedures. Quant. Finance 10 593–606. MR2676786 10.1080/14697681003685597Cont, R., Deguest, R. and Scandolo, G. (2010). Robustness and sensitivity analysis of risk measurement procedures. Quant. Finance 10 593–606. MR2676786 10.1080/14697681003685597
Costanzino, N. and Curran, M. (2015). Backtesting general spectral risk measures with application to expected shortfall. Journal of Risk Model Validation 9 21–33.Costanzino, N. and Curran, M. (2015). Backtesting general spectral risk measures with application to expected shortfall. Journal of Risk Model Validation 9 21–33.
Diebold, F. X., Gunther, T. A. and Tay, A. S. (1998). Evaluating density forecasts with applications to financial risk management. Internat. Econom. Rev. 39 863–883.Diebold, F. X., Gunther, T. A. and Tay, A. S. (1998). Evaluating density forecasts with applications to financial risk management. Internat. Econom. Rev. 39 863–883.
Diebold, F. X., Schuermann, T. and Stroughair, J. D. (2000). Pitfalls and opportunities in the use of extreme value theory in risk management. J. Risk Finance 1 30–35.Diebold, F. X., Schuermann, T. and Stroughair, J. D. (2000). Pitfalls and opportunities in the use of extreme value theory in risk management. J. Risk Finance 1 30–35.
Efron, B. and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. Monographs on Statistics and Applied Probability 57. Chapman & Hall, New York.Efron, B. and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. Monographs on Statistics and Applied Probability 57. Chapman & Hall, New York.
Ehm, W., Gneiting, T., Jordan, A. and Krüger, F. (2016). Of quantiles and expectiles: Consistent scoring functions, Choquet representations and forecast rankings. J. R. Stat. Soc. Ser. B. Stat. Methodol. 78 505–562.Ehm, W., Gneiting, T., Jordan, A. and Krüger, F. (2016). Of quantiles and expectiles: Consistent scoring functions, Choquet representations and forecast rankings. J. R. Stat. Soc. Ser. B. Stat. Methodol. 78 505–562.
Embrechts, P., Klüppelberg, C. and Mikosch, T. (1997). Modelling Extremal Events for Insurance and Finance. Applications of Mathematics (New York) 33. Springer, Berlin.Embrechts, P., Klüppelberg, C. and Mikosch, T. (1997). Modelling Extremal Events for Insurance and Finance. Applications of Mathematics (New York) 33. Springer, Berlin.
Engle, R. F. and Manganelli, S. (2004). CAViaR: Conditional autoregressive value at risk by regression quantiles. J. Bus. Econom. Statist. 22 367–381.Engle, R. F. and Manganelli, S. (2004). CAViaR: Conditional autoregressive value at risk by regression quantiles. J. Bus. Econom. Statist. 22 367–381.
Fissler, T. and Ziegel, J. F. (2016). Higher order elicitability and Osband’s principle. Ann. Statist. 44 1680–1707. MR3519937 10.1214/16-AOS1439 euclid.aos/1467894712
Fissler, T. and Ziegel, J. F. (2016). Higher order elicitability and Osband’s principle. Ann. Statist. 44 1680–1707. MR3519937 10.1214/16-AOS1439 euclid.aos/1467894712
Fissler, T., Ziegel, J. F. and Gneiting, T. (2016). Expected shortfall is jointly elicitable with value at risk—implications for backtesting. Risk Mag. January 58–61.Fissler, T., Ziegel, J. F. and Gneiting, T. (2016). Expected shortfall is jointly elicitable with value at risk—implications for backtesting. Risk Mag. January 58–61.
Frongillo, R. and Kash, I. (2015). Vector-valued property elicitation. In Proceedings of the 28th Conference on Learning Theory (S. Kale, P. Grünwald and E. Hazan, eds.). JMLR Workshop and Conference Proceedings 40.Frongillo, R. and Kash, I. (2015). Vector-valued property elicitation. In Proceedings of the 28th Conference on Learning Theory (S. Kale, P. Grünwald and E. Hazan, eds.). JMLR Workshop and Conference Proceedings 40.
Gneiting, T. and Ranjan, R. (2011). Comparing density forecasts using threshold- and quantile-weighted scoring rules. J. Bus. Econom. Statist. 29 411–422.Gneiting, T. and Ranjan, R. (2011). Comparing density forecasts using threshold- and quantile-weighted scoring rules. J. Bus. Econom. Statist. 29 411–422.
Kou, S. and Peng, X. (2016). On the measurement of economic tail risk. Oper. Res. 64 1056–1072. MR3558435 10.1287/opre.2016.1539Kou, S. and Peng, X. (2016). On the measurement of economic tail risk. Oper. Res. 64 1056–1072. MR3558435 10.1287/opre.2016.1539
Kuan, C.-M., Yeh, J.-H. and Hsu, Y.-C. (2009). Assessing value at risk with CARE, the conditional autoregressive expectile models. J. Econometrics 150 261–270.Kuan, C.-M., Yeh, J.-H. and Hsu, Y.-C. (2009). Assessing value at risk with CARE, the conditional autoregressive expectile models. J. Econometrics 150 261–270.
Lambert, N. (2013). Elicitation and evaluation of statistical functionals. Preprint. Available at https://web.stanford.edu/~nlambert/papers/elicitation_july2013.pdf.Lambert, N. (2013). Elicitation and evaluation of statistical functionals. Preprint. Available at https://web.stanford.edu/~nlambert/papers/elicitation_july2013.pdf.
Lambert, N., Pennock, D. M. and Shoham, Y. (2008). Eliciting properties of probability distributions. In Proceedings of the 9th ACM Conference on Electronic Commerce 129–138. Chicago, IL. Extended abstract.Lambert, N., Pennock, D. M. and Shoham, Y. (2008). Eliciting properties of probability distributions. In Proceedings of the 9th ACM Conference on Electronic Commerce 129–138. Chicago, IL. Extended abstract.
McNeil, A. J. and Frey, R. (2000). Estimation of tail-related risk measures for heteroscedastic financial time series: An extreme value approach. J. Empir. Finance 7 271–300.McNeil, A. J. and Frey, R. (2000). Estimation of tail-related risk measures for heteroscedastic financial time series: An extreme value approach. J. Empir. Finance 7 271–300.
McNeil, A. J., Frey, R. and Embrechts, P. (2005). Quantitative Risk Management: Concepts, Techniques and Tools. Princeton Series in Finance. Princeton Univ. Press, Princeton, NJ.McNeil, A. J., Frey, R. and Embrechts, P. (2005). Quantitative Risk Management: Concepts, Techniques and Tools. Princeton Series in Finance. Princeton Univ. Press, Princeton, NJ.
Newey, W. K. and Powell, J. L. (1987). Asymmetric least squares estimation and testing. Econometrica 55 819–847. MR906565 10.2307/1911031Newey, W. K. and Powell, J. L. (1987). Asymmetric least squares estimation and testing. Econometrica 55 819–847. MR906565 10.2307/1911031
Nolde, N. and Ziegel, J. F. (2017). Supplement to “Elicitability and backtesting: Perspectives for banking regulation”. DOI:10.1214/17-AOAS1041SUPP.Nolde, N. and Ziegel, J. F. (2017). Supplement to “Elicitability and backtesting: Perspectives for banking regulation”. DOI:10.1214/17-AOAS1041SUPP.
Patton, A. J. (2006). Volatility forecast comparison using imperfect volatility proxies. Research Paper 175, Quantitative Finance Research Centre, Univ. Technology, Sydney.Patton, A. J. (2006). Volatility forecast comparison using imperfect volatility proxies. Research Paper 175, Quantitative Finance Research Centre, Univ. Technology, Sydney.
Patton, A. J. and Sheppard, K. (2009). Evaluating volatility and correlation forecasts. In Handbook of Financial Time Series (T. Mikosch, J.-P. Kreiss, R. A. Davis and T. G. Andersen, eds.) 801–838. Springer, Berlin.Patton, A. J. and Sheppard, K. (2009). Evaluating volatility and correlation forecasts. In Handbook of Financial Time Series (T. Mikosch, J.-P. Kreiss, R. A. Davis and T. G. Andersen, eds.) 801–838. Springer, Berlin.
R Core Team (2015). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Available at http://www.R-project.org/.R Core Team (2015). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Available at http://www.R-project.org/.
Steinwart, I., Pasin, C., Williamson, R. and Zhang, S. (2014). Elicitation and identification of properties. J. Mach. Learn. Res. Workshop Conf. Proc. 35 1–45.Steinwart, I., Pasin, C., Williamson, R. and Zhang, S. (2014). Elicitation and identification of properties. J. Mach. Learn. Res. Workshop Conf. Proc. 35 1–45.
Tsyplakov, A. (2014). Theoretical guidelines for a partially informed forecast examiner. MPRA Paper, 55017. Available at http://mpra.ub.uni-muenchen.de/55017.Tsyplakov, A. (2014). Theoretical guidelines for a partially informed forecast examiner. MPRA Paper, 55017. Available at http://mpra.ub.uni-muenchen.de/55017.