The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 11, Number 3 (2017), 1275-1308.
Maximum likelihood features for generative image models
Most approaches to computer vision can be thought of as lying somewhere on a continuum between generative and discriminative. Although each approach has had its successes, recent advances have favored discriminative methods, most notably the convolutional neural network. Still, there is some doubt about whether this approach will scale to a human-level performance given the numbers of samples that are needed to train state-of-the-art systems. Here, we focus on the generative or Bayesian approach, which is more model based and, in theory, more efficient. Challenges include latent-variable modeling, computationally efficient inference, and data modeling. We restrict ourselves to the problem of data modeling, which is possibly the most daunting, and specifically to the generative modeling of image patches. We formulate a new approach, which can be broadly characterized as an application of “conditional modeling,” designed to sidestep the high-dimensionality and complexity of image data. A series of experiments, learning appearance models for faces and parts of faces, illustrates the flexibility and effectiveness of the approach.
Ann. Appl. Stat. Volume 11, Number 3 (2017), 1275-1308.
Received: July 2016
Revised: January 2017
First available in Project Euclid: 5 October 2017
Permanent link to this document
Digital Object Identifier
Chang, Lo-Bin; Borenstein, Eran; Zhang, Wei; Geman, Stuart. Maximum likelihood features for generative image models. Ann. Appl. Stat. 11 (2017), no. 3, 1275--1308. doi:10.1214/17-AOAS1025. https://projecteuclid.org/euclid.aoas/1507168830