Open Access
November 2010 Learning Active Basis Models by EM-Type Algorithms
Zhangzhang Si, Haifeng Gong, Song-Chun Zhu, Ying Nian Wu
Statist. Sci. 25(4): 458-475 (November 2010). DOI: 10.1214/09-STS281

Abstract

EM algorithm is a convenient tool for maximum likelihood model fitting when the data are incomplete or when there are latent variables or hidden states. In this review article we explain that EM algorithm is a natural computational scheme for learning image templates of object categories where the learning is not fully supervised. We represent an image template by an active basis model, which is a linear composition of a selected set of localized, elongated and oriented wavelet elements that are allowed to slightly perturb their locations and orientations to account for the deformations of object shapes. The model can be easily learned when the objects in the training images are of the same pose, and appear at the same location and scale. This is often called supervised learning. In the situation where the objects may appear at different unknown locations, orientations and scales in the training images, we have to incorporate the unknown locations, orientations and scales as latent variables into the image generation process, and learn the template by EM-type algorithms. The E-step imputes the unknown locations, orientations and scales based on the currently learned template. This step can be considered self-supervision, which involves using the current template to recognize the objects in the training images. The M-step then relearns the template based on the imputed locations, orientations and scales, and this is essentially the same as supervised learning. So the EM learning process iterates between recognition and supervised learning. We illustrate this scheme by several experiments.

Citation

Download Citation

Zhangzhang Si. Haifeng Gong. Song-Chun Zhu. Ying Nian Wu. "Learning Active Basis Models by EM-Type Algorithms." Statist. Sci. 25 (4) 458 - 475, November 2010. https://doi.org/10.1214/09-STS281

Information

Published: November 2010
First available in Project Euclid: 14 March 2011

zbMATH: 1329.62297
MathSciNet: MR2807764
Digital Object Identifier: 10.1214/09-STS281

Keywords: generative models , object recognition , wavelet sparse coding

Rights: Copyright © 2010 Institute of Mathematical Statistics

Vol.25 • No. 4 • November 2010
Back to Top