Abstract
The task of semantic scene interpretation is to label the regions of an image and their relations into meaningful classes. Such task is a key ingredient to many computer vision applications, including object recognition, 3D reconstruction and robotic perception. The images of man-made scenes exhibit strong contextual dependencies in the form of the spatial and hierarchical structures. Modeling these structures is central for such interpretation task. Graphical models provide a consistent framework for the statistical modeling. Bayesian networks and random fields are two popular types of the graphical models, which are frequently used for capturing such contextual information. Our key contribution is the development of a generic statistical graphical model for scene interpretation, which seamlessly integrates different types of the image features, and the spatial structural information and the hierarchical structural information defined over the multi-scale image segmentation. It unifies the ideas of existing approaches, e. g. conditional random field and Bayesian network, which has a clear statistical interpretation as the MAP estimate of a multi-class labeling problem. We demonstrate experimentally the application of the proposed graphical model on the task of multi-class classification of building facade image regions.
Original language | English |
---|---|
Pages | 486-491 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 1 Jan 2015 |
Externally published | Yes |
Event | 10th International Conference on Computer Vision Theory and Applications, VISAPP 2015 - Berlin, Germany Duration: 11 Mar 2015 → 14 Mar 2015 Conference number: 10 http://www.visapp.visigrapp.org/?y=2015 |
Conference
Conference | 10th International Conference on Computer Vision Theory and Applications, VISAPP 2015 |
---|---|
Abbreviated title | VISAPP 2015 |
Country/Territory | Germany |
City | Berlin |
Period | 11/03/15 → 14/03/15 |
Other | Part of VISIGRAPP, the 10th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. |
Internet address |
Keywords
- Bayesian network
- Conditional random field
- Energy function
- Scene interpretation