February 2022
Backfitting for large scale crossed random effects regressions
Swarnadip Ghosh, Trevor Hastie, Art B. Owen
Ann. Statist. 50(1): 560-583 (February 2022). DOI: 10.1214/21-AOS2121


Regression models with crossed random effect errors can be very expensive to compute. The cost of both generalized least squares and Gibbs sampling can easily grow as N^{3/2} (or worse) for N observations. Papaspiliopoulos, Roberts and Zanella (Biometrika 107 (2020) 25–40) present a collapsed Gibbs sampler that costs O(N), but under an extremely stringent sampling model. We propose a backfitting algorithm to compute a generalized least squares estimate and prove that it costs O(N). A critical part of the proof is in ensuring that the number of iterations required is O(1), which follows from keeping a certain matrix norm below 1 - δ for some δ > 0. Our conditions are greatly relaxed compared to those for the collapsed Gibbs sampler, though still strict. Empirically, the backfitting algorithm has a norm below 1 - δ under conditions that are less strict than those in our assumptions. We illustrate the new algorithm on a ratings data set from Stitch Fix.
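
To make the cost argument concrete, the following is a minimal sketch of backfitting for the simplest crossed model, y = mu + a[row] + b[col] + error, with no regressors beyond an intercept. It is an illustration under stated assumptions, not the authors' exact algorithm: the function name `backfit_crossed`, the penalty parameters `lam_a` and `lam_b` (standing in for variance ratios that are assumed known), and the simulated data are all hypothetical. Each sweep touches every observation a constant number of times, so one sweep costs O(N); the total cost is O(N) only if the number of sweeps stays bounded, which is the condition the abstract ties to a matrix norm below 1 - δ.

```python
import numpy as np


def backfit_crossed(rows, cols, y, lam_a, lam_b, max_iter=1000, tol=1e-10):
    """Backfit mu, a, b for y ~ mu + a[rows] + b[cols] with ridge penalties.

    lam_a and lam_b are assumed to be the known variance ratios
    (error variance over row-effect and column-effect variance).
    Returns (mu, a, b, number_of_sweeps).
    """
    rows = np.asarray(rows)
    cols = np.asarray(cols)
    y = np.asarray(y, dtype=float)
    n_rows, n_cols = rows.max() + 1, cols.max() + 1

    # Observation counts per level, computed once in O(N).
    cnt_a = np.bincount(rows, minlength=n_rows)
    cnt_b = np.bincount(cols, minlength=n_cols)

    mu, a, b = y.mean(), np.zeros(n_rows), np.zeros(n_cols)
    for sweep in range(1, max_iter + 1):
        mu_old, a_old, b_old = mu, a, b
        # Intercept: unpenalized mean of the partial residuals.
        mu = np.mean(y - a[rows] - b[cols])
        # Row effects: shrunken per-row sums of partial residuals.
        a = np.bincount(rows, weights=y - mu - b[cols],
                        minlength=n_rows) / (cnt_a + lam_a)
        # Column effects: same update, using the fresh row effects.
        b = np.bincount(cols, weights=y - mu - a[rows],
                        minlength=n_cols) / (cnt_b + lam_b)
        change = max(abs(mu - mu_old),
                     np.abs(a - a_old).max(),
                     np.abs(b - b_old).max())
        if change < tol:
            break
    return mu, a, b, sweep


# Hypothetical usage on simulated, sparsely observed crossed data.
rng = np.random.default_rng(0)
rows = rng.integers(0, 50, size=2000)
cols = rng.integers(0, 40, size=2000)
y = (1.0 + rng.normal(size=50)[rows] + rng.normal(size=40)[cols]
     + rng.normal(size=2000))
mu, a, b, sweeps = backfit_crossed(rows, cols, y, lam_a=1.0, lam_b=1.0)
```

The sketch never forms an N-by-N covariance matrix or a dense design matrix; it stores only the observation indices and one coefficient per row and column level, which is what keeps each sweep linear in N.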

Funding Statement

This work was supported by the U.S. National Science Foundation under Grant IIS-1837931.


We are grateful to Brad Klingenberg and Stitch Fix for sharing some test data with us and James Johndrow for some helpful discussions. We thank the reviewers for remarks that have helped us improve the paper.


Citation

Swarnadip Ghosh, Trevor Hastie, Art B. Owen. "Backfitting for large scale crossed random effects regressions." Ann. Statist. 50 (1) 560-583, February 2022. https://doi.org/10.1214/21-AOS2121


Received: 1 August 2020; Revised: 1 July 2021; Published: February 2022
First available in Project Euclid: 16 February 2022

MathSciNet: MR4382028
zbMATH: 1486.62201
Digital Object Identifier: 10.1214/21-AOS2121

Primary: 62J05
Secondary: 65C60

Keywords: collapsed Gibbs sampler, generalized least squares, mixed effect models

Rights: Copyright © 2022 Institute of Mathematical Statistics

Vol. 50 • No. 1 • February 2022