Bilinear classifiers for visual recognition
We describe an algorithm for learning bilinear SVMs. Bilinear classifiers are a discriminative variant of bilinear models, which capture the dependence of data on multiple factors. Such models are particularly appropriate for visual data that is better represented as a matrix or tensor, rather than a vector. Matrix encodings allow for more natural regularization through rank restriction. For example, a rank-one restriction produces a bilinear classifier that can be interpreted as a separable filter. We also use bilinear classifiers for transfer learning by sharing linear factors between different tasks. Finally, we show that bilinear classifiers can be trained with biconvex programs. Such programs are optimized with coordinate descent, where each step is equivalent to a standard convex problem. This allows us to leverage existing SVM solvers during learning. We demonstrate bilinear SVMs on difficult problems of people detection in video sequences and action classification of video sequences, achieving state-of-the-art results in both.
Text ReferenceHamed Pirsiavash, Deva Ramanan, and Charless Fowlkes. Bilinear classifiers for visual recognition. In Neural Info. Proc. Systems (NIPS). 2009.
author = "Pirsiavash, Hamed and Ramanan, Deva and Fowlkes, Charless",
title = "Bilinear classifiers for visual recognition",
booktitle = "Neural Info. Proc. Systems (NIPS)",
year = "2009",
tag = "object_recognition"