TY - JOUR
T1 - A Probabilistic Multimedia Retrieval Model and its Evaluation
AU - Westerveld, T.H.W.
AU - de Vries, A.P.
AU - de Vries, A.J.
AU - van Ballegooij, A.
AU - de Jong, Franciska M.G.
AU - Hiemstra, Djoerd
A2 - Sayed, A.H.
N1 - Imported from EWI/DB PMS [db-utwente:arti:0000000017]
PY - 2003
Y1 - 2003
N2 - We present a probabilistic model for the retrieval of multimodal documents. The model is based on Bayesian decision theory and combines models for text-based search with models for visual search. The textual model is based on the language modelling approach to text retrieval, and the visual information is modelled as a mixture of Gaussian densities. Both models have proved successful on various standard retrieval tasks. We evaluate the multimodal model on the search task of TREC′s video track. We found that the disclosure of video material based on visual information only is still too difficult. Even with purely visual information needs, text-based retrieval still outperforms visual approaches. The probabilistic model is useful for text, visual, and multimedia retrieval. Unfortunately, simplifying assumptions that reduce its computational complexity degrade retrieval effectiveness. Regarding the question whether the model can effectively combine information from different modalities, we conclude that whenever both modalities yield reasonable scores, a combined run outperforms the individual runs.
AB - We present a probabilistic model for the retrieval of multimodal documents. The model is based on Bayesian decision theory and combines models for text-based search with models for visual search. The textual model is based on the language modelling approach to text retrieval, and the visual information is modelled as a mixture of Gaussian densities. Both models have proved successful on various standard retrieval tasks. We evaluate the multimodal model on the search task of TREC′s video track. We found that the disclosure of video material based on visual information only is still too difficult. Even with purely visual information needs, text-based retrieval still outperforms visual approaches. The probabilistic model is useful for text, visual, and multimedia retrieval. Unfortunately, simplifying assumptions that reduce its computational complexity degrade retrieval effectiveness. Regarding the question whether the model can effectively combine information from different modalities, we conclude that whenever both modalities yield reasonable scores, a combined run outperforms the individual runs.
KW - DB-MMR: MULTIMEDIA RETRIEVAL
KW - METIS-216472
KW - IR-66373
KW - EWI-6962
U2 - 10.1155/S111086570321101X
DO - 10.1155/S111086570321101X
M3 - Article
SN - 1110-8657
VL - 2
SP - 186
EP - 198
JO - EURASIP journal on applied signal processing
JF - EURASIP journal on applied signal processing
IS - 2
M1 - 10.1155/S111086570321101X
ER -