A generic probabilistic graphical model for region-based scene interpretation

Michael Ying Yang*

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

1 Citation (Scopus)
30 Downloads (Pure)


The task of semantic scene interpretation is to label the regions of an image and their relations into meaningful classes. Such task is a key ingredient to many computer vision applications, including object recognition, 3D reconstruction and robotic perception. The images of man-made scenes exhibit strong contextual dependencies in the form of the spatial and hierarchical structures. Modeling these structures is central for such interpretation task. Graphical models provide a consistent framework for the statistical modeling. Bayesian networks and random fields are two popular types of the graphical models, which are frequently used for capturing such contextual information. Our key contribution is the development of a generic statistical graphical model for scene interpretation, which seamlessly integrates different types of the image features, and the spatial structural information and the hierarchical structural information defined over the multi-scale image segmentation. It unifies the ideas of existing approaches, e. g. conditional random field and Bayesian network, which has a clear statistical interpretation as the MAP estimate of a multi-class labeling problem. We demonstrate experimentally the application of the proposed graphical model on the task of multi-class classification of building facade image regions.

Original languageEnglish
Number of pages6
Publication statusPublished - 1 Jan 2015
Externally publishedYes
Event10th International Conference on Computer Vision Theory and Applications, VISAPP 2015 - Berlin, Germany
Duration: 11 Mar 201514 Mar 2015
Conference number: 10


Conference10th International Conference on Computer Vision Theory and Applications, VISAPP 2015
Abbreviated titleVISAPP 2015
OtherPart of VISIGRAPP, the 10th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications.
Internet address


  • Bayesian network
  • Conditional random field
  • Energy function
  • Scene interpretation


Dive into the research topics of 'A generic probabilistic graphical model for region-based scene interpretation'. Together they form a unique fingerprint.

Cite this