Bayesian Latent Semantic Analysis of Multimedia Databases

ID: TR-2001-15
Authors: Nando de Freitas and Kobus Barnard
Publishing date: October 11, 2001
Length: 35 pages
Abstract
We present a Bayesian mixture model for probabilistic latent semantic analysis of documents with images and text. The Bayesian perspective allows us to perform automatic regularisation to obtain sparser and more coherent clustering models. It also enables us to encode a priori knowledge, such as word and image preferences. The learnt model can be used for browsing digital databases, information retrieval with image and/or text queries, image annotation (adding words to an image) and text illustration (adding images to a text).
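
To illustrate how a prior can produce sparser mixture models, here is a minimal sketch. It is not the report's algorithm. It assumes a symmetric Dirichlet prior with parameter alpha on the mixture weights and applies the standard MAP update, with negative values clipped to zero (a common practical convention when alpha < 1). The function name and the example counts are hypothetical.

```python
def map_mixture_weights(expected_counts, alpha):
    """MAP-style estimate of mixture weights under a symmetric Dirichlet(alpha) prior.

    expected_counts: expected number of documents assigned to each cluster
                     (for example, summed E-step responsibilities).
    alpha:           Dirichlet parameter; alpha = 1 recovers maximum likelihood,
                     alpha < 1 favours sparse weights.
    """
    # Each weight is proportional to (count + alpha - 1); clusters whose
    # adjusted count is negative are clipped to zero, i.e. pruned.
    adjusted = [max(0.0, n + alpha - 1.0) for n in expected_counts]
    total = sum(adjusted)
    if total == 0.0:
        raise ValueError("all clusters were pruned; use a larger alpha")
    return [a / total for a in adjusted]


# Hypothetical expected counts for three clusters.
counts = [40.0, 9.6, 0.4]

ml_weights = map_mixture_weights(counts, alpha=1.0)      # flat prior
sparse_weights = map_mixture_weights(counts, alpha=0.5)  # sparsity-inducing prior

print("alpha = 1.0:", ml_weights)
print("alpha = 0.5:", sparse_weights)
```

With the flat prior every cluster keeps some weight, while with alpha = 0.5 the weakly supported third cluster is removed and the remaining weights are renormalised.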